NOTE: The current preferred location for bug reports is the GitHub issue tracker.
Bug 693 - use IANA Character Sets registry for "charset" and "meta-charset" datatype checking
Status: RESOLVED FIXED
Product: Validator.nu
Classification: Unclassified
Component: Datatype library
Version: HEAD
Hardware: All All
Importance: P2 enhancement
Assigned To: Michael[tm] Smith
Depends on:
Blocks:
Reported: 2009-12-16 13:51 CET by Michael[tm] Smith
Modified: 2010-01-08 08:13 CET (History)
CC List: 0 users

See Also:


Attachments
patch (15.29 KB, patch)
2009-12-31 13:31 CET, Michael[tm] Smith
patch (18.22 KB, patch)
2010-01-08 08:13 CET, Michael[tm] Smith

Description Michael[tm] Smith 2009-12-16 13:51:33 CET
The current HTML5 draft says that a document's character encoding declaration "must be an ASCII case-insensitive match for the preferred MIME name for that encoding" and that "the preferred MIME name of a character encoding is the name or alias labeled as 'preferred MIME name' in the IANA Character Sets registry, if there is one, or the encoding's name, if none of the aliases are so labeled."

http://dev.w3.org/html5/spec/semantics.html#charset
http://dev.w3.org/html5/spec/infrastructure.html#preferred-mime-name

The current v.nu "charset" and "meta-charset" datatype checking does not consult the IANA Character Sets registry at all. Ideally it would work the same way the "language" datatype checking does, which parses and consults the IANA Language Subtag registry.

http://www.iana.org/assignments/character-sets
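
As a rough illustration of the idea (not the attached patch, and in Python rather than v.nu's Java), the lookup could be sketched as below. It assumes the registry's plain-text record layout of "Name:" and "Alias:" lines, with "(preferred MIME name)" marking at most one label per record. The function names, the SAMPLE records, and the three result strings are all hypothetical and exist only for this sketch.

```python
import re
import string

# Marker text assumed to follow the preferred label in the registry file.
PREFERRED = "(preferred MIME name)"
FIELD = re.compile(r"^(Name|Alias):\s*(\S+)(.*)$")

# ASCII-only case folding, since the draft asks for an
# "ASCII case-insensitive match" rather than full Unicode folding.
_ASCII_LOWER = str.maketrans(string.ascii_uppercase, string.ascii_lowercase)


def ascii_lower(value):
    return value.translate(_ASCII_LOWER)


def parse_registry(text):
    """Map every ASCII-lowercased name or alias to its preferred MIME name."""
    records = []
    for line in text.splitlines():
        match = FIELD.match(line)
        if not match:
            continue  # MIBenum:, Source:, blank lines, and so on
        field, value, rest = match.groups()
        if field == "Name":
            records.append({"name": value, "labels": [], "preferred": None})
        elif not records or value == "None":
            continue  # stray alias, or the "Alias: None" placeholder
        record = records[-1]
        record["labels"].append(value)
        if PREFERRED in rest:
            record["preferred"] = value

    table = {}
    for record in records:
        # Fall back to the encoding's name if no label is marked preferred.
        target = record["preferred"] or record["name"]
        for label in record["labels"]:
            table[ascii_lower(label)] = target
    return table


def check_charset(value, table):
    """Return "ok", "not-preferred", or "unknown" for a declared charset."""
    preferred = table.get(ascii_lower(value))
    if preferred is None:
        return "unknown"
    if ascii_lower(preferred) == ascii_lower(value):
        return "ok"
    return "not-preferred"


# Illustrative records modeled on the registry layout, not the full file.
SAMPLE = """\
Name: ANSI_X3.4-1968                                   [RFC1345,KXS2]
MIBenum: 3
Alias: ASCII
Alias: US-ASCII (preferred MIME name)
Alias: csASCII

Name: Shift_JIS  (preferred MIME name)                 [RFC1345,KXS2]
MIBenum: 17
Alias: MS_Kanji
Alias: csShiftJIS

Name: UTF-8                                            [RFC3629]
MIBenum: 106
Alias: None
"""
```

With a table built by parse_registry(SAMPLE), check_charset("us-ascii", table) would count as a match, while "ASCII" is a registered alias but not the preferred MIME name. Whether that second case should be reported as an error or only a warning is left open here.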
Comment 1 Michael[tm] Smith 2009-12-31 13:31:34 CET
Created attachment 141
patch