»
S
I
D
E
B
A
R
«
Converting HTML into Unicode in Python
November 28th, 2009 by admin

To convert HTML into Unicode in Python, Python has the htmlentitydefs module, but this doesn’t include a function to unescape HTML entities.

Python developer Fredrik Lundh (author of elementtree, among other things) has such a function (you can find it here) on his website, which works with decimal, hex and named entities.

Share and Enjoy:
  • Print
  • Digg
  • Sphinn
  • del.icio.us
  • Facebook
  • Mixx
  • Google Bookmarks
  • Blogplay



Similar Posts:
Leave a Reply

Powered by WP Hashcash