Child pages
  • Encoding and mis-encoding: latin (iso-8859-1) and utf-8

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Letterutf mis-encoded as-latinmis-encoded latin shown-as-utfExplanationReference

æøåÆØÅ

� = 0xFFFD
2 bytes often à and ...� = 0xEF 0xBF 0xBD

UTF-8 multi-byte string is interpreted
with a single-byte encoding

stackoverflow.com 2

æ    
ø"Ã,"   
å

"Ã¥"

   
Æ    
Ø    
Å