Bloghtml2text 1.6.1 released Syndicate content

Mon, 04/30/2007 - 12:00

Update: Use Markdownify, it’s the successor to html2text.

I just released html2text 1.6, a minor bugfix release which now handles empty img tags (<img>, i.e. without the trailing slash) . Thanks to Bermi Ferrer for his patch.

Also I missed a debug output in one of the functions which is now removed and I beautified the source code a little bit (I added more whitespaces).

You can download it on the project website:

Update: 1.6.1 now handles empty <br> and <hr>

Comments

nice work Mon, 04/30/2007 - 18:40 — visitor (not verified)

Nice work Milian!

This class is exactly what I’m looking for … Thanks! Another question: I’m looking for a standalone PHP class, which is able to tidy HTML data, without needing to install libtidy on the PHP server. Do you know something?

no, sorry Mon, 04/30/2007 - 23:03 — Milian Wolff

No, I don’t know anything like this but would be interested myself.

Html2text does some very rudimentary cleanup in html2text::load_string but I imagine it could be much more sophisticated and faster as well.

I’m thinking about using/writing a html parser for html2text in the future which would make error handling possible - the xml parser breaks on any error which sucks very much.

Post new comment

The content of this field is kept private and will not be shown publicly.
  • You can use Markdown syntax to format and style the text. Also see Markdown Extra for tables, footnotes, and more.
  • You can enable syntax highlighting of source code with the following tags: <code>, <blockcode>, <pre>.
  • Web page addresses and e-mail addresses turn into links automatically.

More information about formatting options