NOTE: The current preferred location for bug reports is the GitHub issue tracker.
Bug 228 - Give more graceful messages on mislabeled UTF-16
Status: NEW
Classification: Unclassified
Component: HTML parser
All All
Importance: P2 normal
Assigned To: Nobody
Depends on:
Reported: 2008-06-03 11:07 CEST by Henri Sivonen
Modified: 2009-11-23 17:16 CET (History)
Description Henri Sivonen 2008-06-03 11:07:01 CEST
20:58 < zcorpan_> hsivonen: could have more useful messages
21:01 < hsivonen> zcorpan_: do you have a suggestion on how to handle it better?
21:01 < hsivonen> (aside: in practice that one isn't much of a concern because 
                  UTF-16 isn't too popular for serialization)
21:01 < Dashiva> hsivonen: You could search for a doctype in any position, and 
                 if you find one, report that as a possible error cause?
21:02 < zcorpan_> hsivonen: sniff for a utf-16 BOM even if http content-type 
                  says something different
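
The two suggestions above (look for a UTF-16 BOM regardless of the declared encoding, and search for a UTF-16-encoded doctype near the start of the stream) could be combined roughly as follows. This is only a minimal sketch in Python, not the parser's actual code: the function names, the 1024-byte search window, and the message wording are all assumptions made for illustration.

```python
import codecs

# All names below are hypothetical; they are not part of any real parser API.

def sniff_utf16_bom(data: bytes):
    """Return 'utf-16-le' or 'utf-16-be' if data starts with a UTF-16 BOM."""
    # The UTF-32LE BOM (FF FE 00 00) begins with the UTF-16LE BOM, so rule it out.
    if data.startswith(codecs.BOM_UTF32_LE):
        return None
    if data.startswith(codecs.BOM_UTF16_LE):
        return "utf-16-le"
    if data.startswith(codecs.BOM_UTF16_BE):
        return "utf-16-be"
    return None

def find_utf16_doctype(data: bytes, limit: int = 1024):
    """Look for '<!doctype' encoded as UTF-16 within the first `limit` bytes."""
    # bytes.lower() only touches ASCII letters, so the zero bytes are preserved.
    head = data[:limit].lower()
    for encoding in ("utf-16-le", "utf-16-be"):
        needle = "<!doctype".encode(encoding)
        pos = head.find(needle)
        while pos != -1:
            # Only accept matches aligned to a 16-bit code unit; an odd offset
            # means we matched the other byte order shifted by one byte.
            if pos % 2 == 0:
                return encoding
            pos = head.find(needle, pos + 1)
    return None

def diagnose_mislabeled_utf16(data: bytes, declared: str):
    """Return a hint string if data looks like UTF-16 but is labeled otherwise."""
    if declared.lower().replace("_", "-").startswith("utf-16"):
        return None  # the label already says UTF-16; nothing to report
    bom = sniff_utf16_bom(data)
    if bom is not None:
        return (f"The document is labeled {declared} but starts with a "
                f"UTF-16 BOM ({bom}); the encoding label is probably wrong.")
    guess = find_utf16_doctype(data)
    if guess is not None:
        return (f"The document is labeled {declared} but contains a doctype "
                f"that looks like {guess}; the encoding label may be wrong.")
    return None
```

For example, `diagnose_mislabeled_utf16(codecs.BOM_UTF16_LE + "<!DOCTYPE html>".encode("utf-16-le"), "ISO-8859-1")` would return the BOM hint, while a plain ASCII document labeled UTF-8 yields `None`. The BOM check runs first because it is cheaper and less ambiguous than the doctype search.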