Available Character Encodings
DeltaWalker supports the same wide range of character encodings as the Java Platform, Standard Edition (Java SE), version 1.6 or newer. A non-exhaustive list of encodings follows.
The first column of both tables gives the name of each encoding, also known as its canonical name, that DeltaWalker uses. If an encoding has an alternative name, it appears in the second column. Although the alternative names are gradually being phased out, they are listed here because many people are familiar with them. The third column briefly describes each encoding.
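The relationship between canonical and alternative names can be checked directly on the Java platform. The sketch below is illustrative only: it uses the standard `java.nio.charset.Charset` API, not any DeltaWalker interface, and the class name `EncodingNames` and its helper method are hypothetical.

```java
import java.nio.charset.Charset;

public class EncodingNames {
    // Resolve any accepted name (canonical or alternative) to the canonical name.
    static String canonicalName(String name) {
        return Charset.forName(name).name();
    }

    public static void main(String[] args) {
        // "ISO8859_1" and "UTF8" are alternative names from the second column.
        System.out.println(canonicalName("ISO8859_1")); // prints ISO-8859-1
        System.out.println(canonicalName("UTF8"));      // prints UTF-8
    }
}
```

Because both kinds of name resolve to the same charset, either one identifies the same encoding.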
Basic Character Encodings
Name | Alternative Name | Description |
---|---|---|
US-ASCII | ASCII | American Standard Code for Information Interchange |
windows-1250 | Cp1250 | Windows Eastern European |
windows-1251 | Cp1251 | Windows Cyrillic |
windows-1252 | Cp1252 | Windows Latin-1 |
windows-1253 | Cp1253 | Windows Greek |
windows-1254 | Cp1254 | Windows Turkish |
windows-1257 | Cp1257 | Windows Baltic |
ISO-8859-1 | ISO8859_1 | ISO 8859-1, Latin Alphabet No. 1 |
ISO-8859-2 | ISO8859_2 | Latin Alphabet No. 2 |
ISO-8859-4 | ISO8859_4 | Latin Alphabet No. 4 |
ISO-8859-5 | ISO8859_5 | Latin/Cyrillic Alphabet |
ISO-8859-7 | ISO8859_7 | Latin/Greek Alphabet |
ISO-8859-9 | ISO8859_9 | Latin Alphabet No. 5 |
ISO-8859-13 | ISO8859_13 | Latin Alphabet No. 7 |
ISO-8859-15 | ISO8859_15 | Latin Alphabet No. 9 |
KOI8-R | KOI8_R | KOI8-R, Russian |
UTF-8 | UTF8 | Eight-bit UCS Transformation Format |
UTF-16 | | Sixteen-bit UCS Transformation Format, byte order identified by an optional byte-order mark |
UTF-16BE | UnicodeBigUnmarked | Sixteen-bit Unicode Transformation Format, big-endian byte order |
UTF-16LE | UnicodeLittleUnmarked | Sixteen-bit Unicode Transformation Format, little-endian byte order |
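The difference between the three UTF-16 variants is easiest to see in the bytes they produce. The following is a minimal sketch of how the Java platform encodes a single character under each name; it says nothing about DeltaWalker's own internals, and the class `Utf16ByteOrder` and its `encodeHex` helper are hypothetical names chosen for illustration.

```java
import java.nio.charset.Charset;

public class Utf16ByteOrder {
    // Encode text with the named charset and return the bytes as spaced hex.
    static String encodeHex(String text, String charsetName) {
        byte[] bytes = text.getBytes(Charset.forName(charsetName));
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            if (sb.length() > 0) sb.append(' ');
            sb.append(String.format("%02X", b));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Java's UTF-16 encoder writes a big-endian byte-order mark (FE FF) first.
        System.out.println(encodeHex("A", "UTF-16"));   // prints FE FF 00 41
        // The BE and LE variants write no byte-order mark.
        System.out.println(encodeHex("A", "UTF-16BE")); // prints 00 41
        System.out.println(encodeHex("A", "UTF-16LE")); // prints 41 00
    }
}
```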
Extended Encodings
Name | Alternative Name | Description |
---|---|---|
windows-1255 | Cp1255 | Windows Hebrew |
windows-1256 | Cp1256 | Windows Arabic |
windows-1258 | Cp1258 | Windows Vietnamese |
ISO-8859-3 | ISO8859_3 | Latin Alphabet No. 3 |
ISO-8859-6 | ISO8859_6 | Latin/Arabic Alphabet |
ISO-8859-8 | ISO8859_8 | Latin/Hebrew Alphabet |
windows-31j | MS932 | Windows Japanese |
EUC-JP | EUC_JP | JIS X 0201, 0208 and 0212, EUC encoding, Japanese |
x-EUC-JP-LINUX | EUC_JP_LINUX | JIS X 0201, 0208, EUC encoding, Japanese |
Shift_JIS | SJIS | Shift-JIS, Japanese |
ISO-2022-JP | ISO2022JP | JIS X 0201, 0208, in ISO 2022 form, Japanese |
x-mswin-936 | MS936 | Windows Simplified Chinese |
GB18030 | | Simplified Chinese, PRC standard |
x-EUC-CN | EUC_CN | GB2312, EUC encoding, Simplified Chinese |
GBK | | GBK, Simplified Chinese |
ISCII91 | ISCII91 | ISCII91 encoding of Indic scripts |
x-windows-949 | MS949 | Windows Korean |
EUC-KR | EUC_KR | KS C 5601, EUC encoding, Korean |
ISO-2022-KR | ISO2022KR | ISO 2022 KR, Korean |
x-windows-950 | MS950 | Windows Traditional Chinese |
x-MS950-HKSCS | MS950_HKSCS | Windows Traditional Chinese with Hong Kong extensions |
x-EUC-TW | EUC_TW | CNS11643 (Plane 1-3), EUC encoding, Traditional Chinese |
Big5 | | Big5, Traditional Chinese |
Big5-HKSCS | Big5_HKSCS | Big5 with Hong Kong extensions, Traditional Chinese |
TIS-620 | TIS620 | TIS620, Thai |
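Whether a particular encoding from the tables above is present in a given Java runtime can be tested with the standard `Charset.isSupported` call. This is a sketch under the assumption that some runtimes may omit some of the extended encodings; the class `EncodingAvailability` and its helper are hypothetical and not part of DeltaWalker.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;

public class EncodingAvailability {
    // Return true if this Java runtime can load the named encoding.
    static boolean isAvailable(String name) {
        try {
            return Charset.isSupported(name);
        } catch (IllegalCharsetNameException e) {
            // The name contains characters that are not legal in a charset name.
            return false;
        }
    }

    public static void main(String[] args) {
        String[] names = {"UTF-8", "Shift_JIS", "Big5-HKSCS", "TIS-620"};
        for (String name : names) {
            System.out.println(name + ": " + isAvailable(name));
        }
    }
}
```

UTF-8 is guaranteed on every Java runtime; results for the other names depend on the installation.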