Unicode

HomePage | RecentChanges | EditorIndex | TextEditorFamilies | Preferences

The editor must be able to read, display, edit, and write UTF-8 files with and without BOMs (Byte Order Markers) and UTF-16 files in both big- and little-endian formats. It will be able to distinguish a UTF-8 without BOM from an alternate encoding. Typically, this would be an 8-bit encoding (ISO 8859-1, Win-1252, pure ASCII, or other), but there are other multibyte encodings that could be supported. Ideally, Unicode code points above hex FFFF would be supported.

The editor must accept Unicode text pasted from an alternate source (such as web browser) without losing information; this may mean changing the encoding associated with the buffer.


HomePage | RecentChanges | EditorIndex | TextEditorFamilies | Preferences
Edit text of this page | View other revisions
Last edited October 9, 2014 7:49 pm (diff)
Search: