Ticket #767 (new defect)

Opened 3 years ago

Last modified 19 months ago

Highlighted source code on hackage.haskell.org mangles Unicode

Reported by: andersk Owned by:
Priority: normal Milestone: HackageDB
Component: hackageDB website Version:
Severity: normal Keywords:
Cc: Difficulty: very easy (<1 hour)
GHC Version: Platform:

Description

On hackage.haskell.org, when you browse the source of a module that contains Unicode (e.g.  Data.Eq.Unicode), it is sent with a Content-Type: text/html header with no charset. There is a charset in the XML declaration <?xml version="1.0" encoding="UTF-8"?>, but that is ignored by Firefox because of the non-XML Content-Type. Therefore, the wrong encoding is detected and the Unicode symbols get mangled.

Possible fixes include sending Content-Type: text/html; charset=UTF-8, or sending Content-Type: application/xhtml+xml so that the XML declaration is respected , or both (Content-Type: application/xhtml+xml; charset=utf-8), or adding equivalent <meta http-equiv="Content-Type"> tags.

Change History

Changed 19 months ago by gtirloni

Confirmed problem on Firefox 7.0.1 (Linux/x86_64).

It doesn't seem to occur on Chrome 15 and Konqueror 4.7.2.

It has something to do with the implementation of RFC3023 which says HTTP should have priority over XML encoding. Firefox and Opera seem to implement it while WebKit? (Chrome) doesn't. See more at  http://lists.w3.org/Archives/Public/public-html-bugzilla/2011Jun/0303.html

Note: See TracTickets for help on using tickets.