>From an email conversation with leonardr: The relevant changes should all be in revno 305. The diff is very large (~800 lines) and I don't know how well it would apply to 4.0.2, but that's where to look for it.
Actually, the changes are extensive enough that if you applied them to 4.0.2 I wouldn't feel comfortable calling the result "4.0.2." The change involves API changes, most notable with the UnicodeDammit class. That's why this release was called 4.3.0 instead of 4.2.2. I don't know how you deal with such things, but I wanted you to know. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/972466 Title: lxml HTML parser mangles documents whose <meta> tags define the charset as other than UTF-8 To manage notifications about this bug go to: https://bugs.launchpad.net/beautifulsoup/+bug/972466/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs