In case I ever need it: Charset Detector :: Summary.
It is emperical (you cannot 100% reliably find out what character set / encoding a file is), but has a good score.
A similar problem is detecting the language. There too you can get a good score.
–jeroen
via:
- User ShreevatsaR – Stack Overflow.
- algorithm – How Can I Best Guess the Encoding when the BOM (Byte Order Mark) is Missing? – Stack Overflow.
- c# – How can I detect the encoding/codepage of a text file – Stack Overflow.
- Shared Development: Character encoding detection.
- A composite approach to language/encoding detection.
- seamonkey mozilla/extensions/universalchardet/src/base/.
Filed under: Delphi, Delphi 2009, Delphi 2010, Delphi XE, Delphi XE2, Delphi XE3, Delphi XE4, Delphi XE5, Development, Software Development