MW

Tag open source - page 2

Recent Posts

html2text rewrite (May 16, 2007)

A few days ago I started a complete rewrite of html2text. It now uses a new HTML parser (also written by me), which should make the whole HTML cleanup process obsolete. The generic XML parser currently in use dies on invalid XHTML; with my parser it should be possible to handle errors and parse HTML 4.01 documents without any regex magic beforehand.
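To illustrate what "handling errors" can mean for a parser, here is a minimal sketch of an error-tolerant tokenizer. It is not the parser from the rewrite: the function name `tolerantTokens`, the token shape, and the recovery rules are all assumptions made for this example. It ignores attributes, comments and HTML 4.01's optional end tags, and only shows the basic idea of repairing unbalanced markup with a stack instead of giving up.

```javascript
// Elements that never take a closing tag in HTML 4.01.
const VOID_TAGS = new Set([
  "area", "base", "br", "col", "hr", "img", "input", "link", "meta", "param",
]);

// Hypothetical sketch: turns possibly broken markup into a balanced list of
// tokens shaped like { type: "open" | "close" | "text", value }.
function tolerantTokens(html) {
  const tokens = [];
  const stack = []; // names of the elements that are currently open
  // Matches a tag, a run of text, or a stray "<" that is not part of a tag.
  const pattern = /<(\/?)([a-zA-Z][a-zA-Z0-9]*)[^>]*>|([^<]+)|</g;
  let match;
  while ((match = pattern.exec(html)) !== null) {
    const [whole, slash, rawName, text] = match;
    if (rawName === undefined) {
      // Plain text, or a lone "<" that is kept as literal text.
      tokens.push({ type: "text", value: text !== undefined ? text : whole });
      continue;
    }
    const name = rawName.toLowerCase();
    if (slash === "") {
      tokens.push({ type: "open", value: name });
      if (!VOID_TAGS.has(name)) stack.push(name);
    } else if (stack.includes(name)) {
      // Close everything that was left open inside this element first.
      let open;
      do {
        open = stack.pop();
        tokens.push({ type: "close", value: open });
      } while (open !== name);
    }
    // A closing tag without a matching open tag is silently dropped.
  }
  // Close whatever is still open at the end of the input.
  while (stack.length > 0) tokens.push({ type: "close", value: stack.pop() });
  return tokens;
}
```

Feeding it something like `<p>a<b>c</p></i>` yields a balanced token stream: the unclosed `<b>` is closed before `</p>`, and the orphaned `</i>` is discarded.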

You’ll hear more about this in about a week, as I’ll be on vacation until the 24th.


updated TypoGridder (April 12, 2007)

Yes, I know it is pretty darn late, but hey, what else are holidays for? Apart from learning and partying and dancing and chilling and… well, yes, I just could not stop until it worked the way I wanted it to.

My TypoGridder now works as a MiniTool for Konqueror and as a bookmarklet in Firefox and Opera. It should have the correct height on any page and should be on top of everything. Please test it and report any bugs.
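As a rough idea of what "correct height" and "on top of everything" involve, here is a minimal sketch of a grid overlay. It is not TypoGridder's code: the function names, the red line colour and the use of a CSS `linear-gradient` background (which assumes a modern browser) are all assumptions for illustration.

```javascript
// Hypothetical helpers; none of these names come from TypoGridder itself.

// The overlay has to span the whole document, not just the visible viewport,
// so take whichever of the two heights is larger.
function overlayHeight(scrollHeight, viewportHeight) {
  return Math.max(scrollHeight, viewportHeight);
}

// Inline styles for an overlay that draws one line every `lineHeight` pixels.
function gridStyles(height, lineHeight) {
  return {
    position: "absolute",
    top: "0",
    left: "0",
    width: "100%",
    height: height + "px",
    zIndex: "2147483647", // a very large value so the grid sits above page content
    pointerEvents: "none", // let clicks pass through to the page underneath
    backgroundImage:
      "linear-gradient(to bottom, rgba(255, 0, 0, 0.4) 1px, transparent 1px)",
    backgroundSize: "100% " + lineHeight + "px",
  };
}

// Touches the DOM, so call this only in a browser (e.g. from a bookmarklet).
function showGrid(lineHeight) {
  const root = document.documentElement;
  const overlay = document.createElement("div");
  const height = overlayHeight(root.scrollHeight, window.innerHeight);
  Object.assign(overlay.style, gridStyles(height, lineHeight));
  document.body.appendChild(overlay);
  return overlay;
}
```

In a browser, `showGrid(18)` would draw a line every 18 pixels; the value should match the page's own line height.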

Visit the TypoGridder project page


TypoGridder (April 11, 2007)

I recently read two articles about using grids in web design. It just makes sense. Read them yourself:

I wrote a little JavaScript that displays a baseline grid on any page (via a bookmarklet). For more information, visit the project website.

Please use this article for any feedback.

PS: As you can see, I still have a lot to do before my own website aligns to any grid. The curse of a layman.



How To Wipe Your HDD (December 30, 2006)

I had the job of erasing the HDDs of our oldest computer (100 MHz, running Win 95). Since a villain could hypothetically recover the data even after a format C:, I searched the web for help. Eventually I came across dban, which is just what I was looking for:

  1. small: dban fits on one floppy
  2. free: it’s open source!
  3. sophisticated wiping mechanisms

How to create a dban boot floppy on Linux

  1. Download dban, pick the *.exe file
  2. Unzip the downloaded file: unzip dban-1.0.7_i386.exe
  3. Get a floppy, optionally format it, and write the image to it. The floppy must not be mounted while dd writes to the raw device: dd if=dban-1.0.7_i386.ima of=/dev/fd0 bs=1024
  4. Done

Now put the floppy in the target PC and boot it up. Make sure (via your BIOS) that the floppy drive has the highest boot priority. You should be welcomed by a screen warning you that dban is dangerous, ’cause it will wipe all your data. That’s just what I’m looking for - thanks!


html2text.php version 1.3 released (December 24, 2006)

Update: Use Markdownify, it’s the successor to html2text.

I just released html2text version 1.3, which sports a ton of bug fixes. Most notably, all features of PHP Markdown Extra are now fully supported, including footnotes and abbreviations.

Also, wrapping should work as intended, and inline links (like <foo@bar.com>) won’t be converted to block links (like [foo@bar.com](mailto:foo@bar.com)).
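The link behaviour can be sketched roughly like this. It is an illustration only, not html2text's code: the function name `convertLink` and the rule "use the short form when the link text equals its target" are assumptions about how such a converter might decide.

```javascript
// Hypothetical sketch, not taken from html2text.
// Converts one HTML link (its href and its visible text) to Markdown.
function convertLink(href, text) {
  // An e-mail link's visible text usually omits the "mailto:" prefix.
  const target = href.replace(/^mailto:/, "");
  if (target === text) {
    // Text and target are the same: the short <...> form is enough.
    return "<" + text + ">";
  }
  // Otherwise the text differs from the target, so spell both out.
  return "[" + text + "](" + href + ")";
}
```

With this rule, a link whose text is its own address stays in the compact `<foo@bar.com>` form, and only links with separate text get the longer `[text](url)` form.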

In the next version I’ll add some more options, in particular one to disable PHP Markdown Extra support. I’ll also clean up the code a bit.

Merry Christmas to you!


html2text.php 1.1 (July 23, 2006)

Update: Use Markdownify, it’s the successor to html2text.

I changed my html2text.php function and it now handles elements that cannot be fully expressed in Markdown better. Previously something like <p class="foobar">...</p> would have resulted in <p>...</p>, losing the attribute. Now such elements (which could be converted to Markdown, but only by dropping their attributes) are left as plain HTML.
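A minimal sketch of that decision, for paragraphs only. This is not the code of html2text.php: the element shape `{ tag, attrs, text }` and both function names are made up for the example.

```javascript
// Hypothetical element shape: { tag, attrs, text }. Not html2text's data model.

// Escapes a value so it can sit inside a double-quoted HTML attribute.
function escapeAttr(value) {
  return String(value).replace(/&/g, "&amp;").replace(/"/g, "&quot;");
}

function convertParagraph(el) {
  const names = Object.keys(el.attrs || {});
  if (names.length === 0) {
    // No attributes: nothing is lost by emitting a plain Markdown paragraph.
    return el.text + "\n\n";
  }
  // Attributes present: keep the element as raw HTML so they survive.
  const attrs = names
    .map((name) => " " + name + '="' + escapeAttr(el.attrs[name]) + '"')
    .join("");
  return "<" + el.tag + attrs + ">" + el.text + "</" + el.tag + ">\n\n";
}
```

So `{ tag: "p", attrs: {}, text: "hi" }` becomes a bare Markdown paragraph, while the same element with `class: "foobar"` is passed through as HTML with its class intact.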

Additionally, I made some changes which should improve performance.

Download

Get it while it’s hot: html2text.php 1.1 (.tar.gz ~ 120.9 KB)

Known Bugs

Yes, there are some, which I’ll try to fix in the next few days (note: to better point out the bugs, I describe what happens when you convert HTML to Markdown and back to HTML):

  • If the parent element (e.g. <table>) gets parsed and a child <tr>, <td> or <th> has attributes, they will be ignored and dropped. Workaround: add an attribute to the parent element (e.g. a class or id).
  • If you give a single <li> element in the middle of a list some attributes, it won’t lose them, but the result will not be well-formed HTML:

    <ul><li>abc</li> <li class="foo">bar</li> </ul>
    
