Unicode
Unicode adalah suatu standar teknis yang dirancang untuk mengizinkan teks dan simbol dari semua sistem tulisan di dunia untuk ditampilkan dan dimanipulasi secara konsisten oleh komputer. Dikembangkan secara tandem dengan standar Universal Character Set dan dipublikasikan dalam bentuk buku The Unicode Standard. Unicode mengandung suatu kumpulan karakter, suatu metodologi pengkodean dan kumpulan standar penyandian karakter, suatu kumpulan bagan kode untuk referensi visual, deskripsi sifat karakter seperti huruf besar dan huruf kecil, suatu kumpulan data referensi berkas komputer, serta aturan normalisasi, dekomposisi, pembandingan (collation), serta penggambaran (rendering).
Alias | Universal Coded Character Set (UCS) |
---|---|
Bahasa | Internasional |
Standar | Unicode Standard |
Status terkini | versi 14.0 |
Format encoding | UTF-8, UTF-16, GB18030 Jarang dipakai: UTF-32, BOCU, SCSU, UTF-7 |
Didahului oleh | ISO 8859, lainnya |
Unicode Consortium, suatu organisasi nirlaba yang mengkoordinasikan pengembangan Unicode memiliki tujuan ambisius untuk dapat, pada akhirnya, menggantikan skema pengkodean karakter yang ada dengan Unicode dan skema Unicode Transformation Format (UTF) nya, karena banyak skema yang ada sekarang memiliki keterbatasan ukuran dan lingkup dan takserasi dengan lingkungan multibahasa. Kesuksesan Unicode menyatukan set karakter telah membawa pada penggunaannya yang luas dan pradominan dalam internasionalisasi dan lokalisasi perangkat lunak komputer. Standar ini telah diterapkan pada teknologi-teknologi terkini, termasuk XML, bahasa pemrograman Java, dan sistem operasi modern.
Aksara Nusantara dalam Unicode
Aksara-aksara Nusantara yang telah memiliki register Unicode adalah:
- Aksara Bugis (Lontara), alamat 1A00-1A1F pada Unicode Versi 4.1
- Aksara Bali, alamat 1B00-1B7F pada Unicode Versi 5.0
- Aksara Sunda Kaganga, alamat 1B80-1BBF pada Unicode Versi 5.1
- Aksara Rejang, alamat A930-A95F pada Unicode Versi 5.1
- Aksara Jawa, alamat A980-A9DF pada Unicode Versi 5.2
- Aksara Batak, alamat 1BC0-1BFF pada Unicode Versi 6.0
- Aksara Makassar, alamat 11EE0-11EFF pada Unicode Versi 11.0
Aksara-aksara Nusantara dalam proses pengesahan untuk memiliki register Unicode adalah:
- Aksara Kawi, sementara merujuk pada alamat register 11DB-11DF
Tabel Unicode pada Basic Multilingual Plane (BMP)
U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0040 | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
0050 | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
0060 | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
0070 | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | DEL |
00C0 | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
00D0 | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß |
00E0 | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
00F0 | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ |
0100 | Ā | ā | Ă | ă | Ą | ą | Ć | ć | Ĉ | ĉ | Ċ | ċ | Č | č | Ď | ď |
0110 | Đ | đ | Ē | ē | Ĕ | ĕ | Ė | ė | Ę | ę | Ě | ě | Ĝ | ĝ | Ğ | ğ |
0120 | Ġ | ġ | Ģ | ģ | Ĥ | ĥ | Ħ | ħ | Ĩ | ĩ | Ī | ī | Ĭ | ĭ | Į | į |
0130 | İ | ı | IJ | ij | Ĵ | ĵ | Ķ | ķ | ĸ | Ĺ | ĺ | Ļ | ļ | Ľ | ľ | Ŀ |
0140 | ŀ | Ł | ł | Ń | ń | Ņ | ņ | Ň | ň | ʼn | Ŋ | ŋ | Ō | ō | Ŏ | ŏ |
0150 | Ő | ő | Œ | œ | Ŕ | ŕ | Ŗ | ŗ | Ř | ř | Ś | ś | Ŝ | ŝ | Ş | ş |
0160 | Š | š | Ţ | ţ | Ť | ť | Ŧ | ŧ | Ũ | ũ | Ū | ū | Ŭ | ŭ | Ů | ů |
0170 | Ű | ű | Ų | ų | Ŵ | ŵ | Ŷ | ŷ | Ÿ | Ź | ź | Ż | ż | Ž | ž | ſ |
0180 | ƀ | Ɓ | Ƃ | ƃ | Ƅ | ƅ | Ɔ | Ƈ | ƈ | Ɖ | Ɗ | Ƌ | ƌ | ƍ | Ǝ | Ə |
0190 | Ɛ | Ƒ | ƒ | Ɠ | Ɣ | ƕ | Ɩ | Ɨ | Ƙ | ƙ | ƚ | ƛ | Ɯ | Ɲ | ƞ | Ɵ |
01A0 | Ơ | ơ | Ƣ | ƣ | Ƥ | ƥ | Ʀ | Ƨ | ƨ | Ʃ | ƪ | ƫ | Ƭ | ƭ | Ʈ | Ư |
01B0 | ư | Ʊ | Ʋ | Ƴ | ƴ | Ƶ | ƶ | Ʒ | Ƹ | ƹ | ƺ | ƻ | Ƽ | ƽ | ƾ | ƿ |
01C0 | ǀ | ǁ | ǂ | ǃ | DŽ | Dž | dž | LJ | Lj | lj | NJ | Nj | nj | Ǎ | ǎ | Ǐ |
01D0 | ǐ | Ǒ | ǒ | Ǔ | ǔ | Ǖ | ǖ | Ǘ | ǘ | Ǚ | ǚ | Ǜ | ǜ | ǝ | Ǟ | ǟ |
01E0 | Ǡ | ǡ | Ǣ | ǣ | Ǥ | ǥ | Ǧ | ǧ | Ǩ | ǩ | Ǫ | ǫ | Ǭ | ǭ | Ǯ | ǯ |
01F0 | ǰ | DZ | Dz | dz | Ǵ | ǵ | Ƕ | Ƿ | Ǹ | ǹ | Ǻ | ǻ | Ǽ | ǽ | Ǿ | ǿ |
0200 | Ȁ | ȁ | Ȃ | ȃ | Ȅ | ȅ | Ȇ | ȇ | Ȉ | ȉ | Ȋ | ȋ | Ȍ | ȍ | Ȏ | ȏ |
0210 | Ȑ | ȑ | Ȓ | ȓ | Ȕ | ȕ | Ȗ | ȗ | Ș | ș | Ț | ț | Ȝ | ȝ | Ȟ | ȟ |
0220 | Ƞ | ȡ | Ȣ | ȣ | Ȥ | ȥ | Ȧ | ȧ | Ȩ | ȩ | Ȫ | ȫ | Ȭ | ȭ | Ȯ | ȯ |
0230 | Ȱ | ȱ | Ȳ | ȳ | ȴ | ȵ | ȶ | ȷ | ȸ | ȹ | Ⱥ | Ȼ | ȼ | Ƚ | Ⱦ | ȿ |
0240 | ɀ | Ɂ | ɂ | Ƀ | Ʉ | Ʌ | Ɇ | ɇ | Ɉ | ɉ | Ɋ | ɋ | Ɍ | ɍ | Ɏ | ɏ |
0250 | ɐ | ɑ | ɒ | ɓ | ɔ | ɕ | ɖ | ɗ | ɘ | ə | ɚ | ɛ | ɜ | ɝ | ɞ | ɟ |
0260 | ɠ | ɡ | ɢ | ɣ | ɤ | ɥ | ɦ | ɧ | ɨ | ɩ | ɪ | ɫ | ɬ | ɭ | ɮ | ɯ |
0270 | ɰ | ɱ | ɲ | ɳ | ɴ | ɵ | ɶ | ɷ | ɸ | ɹ | ɺ | ɻ | ɼ | ɽ | ɾ | ɿ |
0280 | ʀ | ʁ | ʂ | ʃ | ʄ | ʅ | ʆ | ʇ | ʈ | ʉ | ʊ | ʋ | ʌ | ʍ | ʎ | ʏ |
0290 | ʐ | ʑ | ʒ | ʓ | ʔ | ʕ | ʖ | ʗ | ʘ | ʙ | ʚ | ʛ | ʜ | ʝ | ʞ | ʟ |
02A0 | ʠ | ʡ | ʢ | ʣ | ʤ | ʥ | ʦ | ʧ | ʨ | ʩ | ʪ | ʫ | ʬ | ʭ | ʮ | ʯ |
1D00 | ᴀ | ᴁ | ᴂ | ᴃ | ᴄ | ᴅ | ᴆ | ᴇ | ᴈ | ᴉ | ᴊ | ᴋ | ᴌ | ᴍ | ᴎ | ᴏ |
1D10 | ᴐ | ᴑ | ᴒ | ᴓ | ᴔ | ᴕ | ᴖ | ᴗ | ᴘ | ᴙ | ᴚ | ᴛ | ᴜ | ᴝ | ᴞ | ᴟ |
1D20 | ᴠ | ᴡ | ᴢ | ᴣ | ᴤ | ᴥ | ᴦ | ᴧ | ᴨ | ᴩ | ᴪ | ᴫ | ᴬ | ᴭ | ᴮ | ᴯ |
1D30 | ᴰ | ᴱ | ᴲ | ᴳ | ᴴ | ᴵ | ᴶ | ᴷ | ᴸ | ᴹ | ᴺ | ᴻ | ᴼ | ᴽ | ᴾ | ᴿ |
1D40 | ᵀ | ᵁ | ᵂ | ᵃ | ᵄ | ᵅ | ᵆ | ᵇ | ᵈ | ᵉ | ᵊ | ᵋ | ᵌ | ᵍ | ᵎ | ᵏ |
1D50 | ᵐ | ᵑ | ᵒ | ᵓ | ᵔ | ᵕ | ᵖ | ᵗ | ᵘ | ᵙ | ᵚ | ᵛ | ᵜ | ᵝ | ᵞ | ᵟ |
1D60 | ᵠ | ᵡ | ᵢ | ᵣ | ᵤ | ᵥ | ᵦ | ᵧ | ᵨ | ᵩ | ᵪ | ᵫ | ᵬ | ᵭ | ᵮ | ᵯ |
1D70 | ᵰ | ᵱ | ᵲ | ᵳ | ᵴ | ᵵ | ᵶ | ᵷ | ᵸ | ᵹ | ᵺ | ᵻ | ᵼ | ᵽ | ᵾ | ᵿ |
1D80 | ᶀ | ᶁ | ᶂ | ᶃ | ᶄ | ᶅ | ᶆ | ᶇ | ᶈ | ᶉ | ᶊ | ᶋ | ᶌ | ᶍ | ᶎ | ᶏ |
1D90 | ᶐ | ᶑ | ᶒ | ᶓ | ᶔ | ᶕ | ᶖ | ᶗ | ᶘ | ᶙ | ᶚ | ᶛ | ᶜ | ᶝ | ᶞ | ᶟ |
1DA0 | ᶠ | ᶡ | ᶢ | ᶣ | ᶤ | ᶥ | ᶦ | ᶧ | ᶨ | ᶩ | ᶪ | ᶫ | ᶬ | ᶭ | ᶮ | ᶯ |
1DB0 | ᶰ | ᶱ | ᶲ | ᶳ | ᶴ | ᶵ | ᶶ | ᶷ | ᶸ | ᶹ | ᶺ | ᶻ | ᶼ | ᶽ | ᶾ | ᶿ |
1E00 | Ḁ | ḁ | Ḃ | ḃ | Ḅ | ḅ | Ḇ | ḇ | Ḉ | ḉ | Ḋ | ḋ | Ḍ | ḍ | Ḏ | ḏ |
1E10 | Ḑ | ḑ | Ḓ | ḓ | Ḕ | ḕ | Ḗ | ḗ | Ḙ | ḙ | Ḛ | ḛ | Ḝ | ḝ | Ḟ | ḟ |
1E20 | Ḡ | ḡ | Ḣ | ḣ | Ḥ | ḥ | Ḧ | ḧ | Ḩ | ḩ | Ḫ | ḫ | Ḭ | ḭ | Ḯ | ḯ |
1E30 | Ḱ | ḱ | Ḳ | ḳ | Ḵ | ḵ | Ḷ | ḷ | Ḹ | ḹ | Ḻ | ḻ | Ḽ | ḽ | Ḿ | ḿ |
1E40 | Ṁ | ṁ | Ṃ | ṃ | Ṅ | ṅ | Ṇ | ṇ | Ṉ | ṉ | Ṋ | ṋ | Ṍ | ṍ | Ṏ | ṏ |
1E50 | Ṑ | ṑ | Ṓ | ṓ | Ṕ | ṕ | Ṗ | ṗ | Ṙ | ṙ | Ṛ | ṛ | Ṝ | ṝ | Ṟ | ṟ |
1E60 | Ṡ | ṡ | Ṣ | ṣ | Ṥ | ṥ | Ṧ | ṧ | Ṩ | ṩ | Ṫ | ṫ | Ṭ | ṭ | Ṯ | ṯ |
1E70 | Ṱ | ṱ | Ṳ | ṳ | Ṵ | ṵ | Ṷ | ṷ | Ṹ | ṹ | Ṻ | ṻ | Ṽ | ṽ | Ṿ | ṿ |
1E80 | Ẁ | ẁ | Ẃ | ẃ | Ẅ | ẅ | Ẇ | ẇ | Ẉ | ẉ | Ẋ | ẋ | Ẍ | ẍ | Ẏ | ẏ |
1E90 | Ẑ | ẑ | Ẓ | ẓ | Ẕ | ẕ | ẖ | ẗ | ẘ | ẙ | ẚ | ẛ | ẜ | ẝ | ẞ | ẟ |
1EA0 | Ạ | ạ | Ả | ả | Ấ | ấ | Ầ | ầ | Ẩ | ẩ | Ẫ | ẫ | Ậ | ậ | Ắ | ắ |
1EB0 | Ằ | ằ | Ẳ | ẳ | Ẵ | ẵ | Ặ | ặ | Ẹ | ẹ | Ẻ | ẻ | Ẽ | ẽ | Ế | ế |
1EC0 | Ề | ề | Ể | ể | Ễ | ễ | Ệ | ệ | Ỉ | ỉ | Ị | ị | Ọ | ọ | Ỏ | ỏ |
1ED0 | Ố | ố | Ồ | ồ | Ổ | ổ | Ỗ | ỗ | Ộ | ộ | Ớ | ớ | Ờ | ờ | Ở | ở |
1EE0 | Ỡ | ỡ | Ợ | ợ | Ụ | ụ | Ủ | ủ | Ứ | ứ | Ừ | ừ | Ử | ử | Ữ | ữ |
1EF0 | Ự | ự | Ỳ | ỳ | Ỵ | ỵ | Ỷ | ỷ | Ỹ | ỹ | Ỻ | ỻ | Ỽ | ỽ | Ỿ | ỿ |
2100 | ℀ | ℁ | ℂ | ℃ | ℄ | ℅ | ℆ | ℇ | ℈ | ℉ | ℊ | ℋ | ℌ | ℍ | ℎ | ℏ |
2110 | ℐ | ℑ | ℒ | ℓ | ℔ | ℕ | № | ℗ | ℘ | ℙ | ℚ | ℛ | ℜ | ℝ | ℞ | ℟ |
2120 | ℠ | ℡ | ™ | ℣ | ℤ | ℥ | Ω | ℧ | ℨ | ℩ | K | Å | ℬ | ℭ | ℮ | ℯ |
2130 | ℰ | ℱ | Ⅎ | ℳ | ℴ | ℵ | ℶ | ℷ | ℸ | ℹ | ℺ | ℻ | ℼ | ℽ | ℾ | ℿ |
2140 | ⅀ | ⅁ | ⅂ | ⅃ | ⅄ | ⅅ | ⅆ | ⅇ | ⅈ | ⅉ | ⅊ | ⅋ | ⅌ | ⅍ | ⅎ | ⅏ |
2490 | ⒐ | ⒑ | ⒒ | ⒓ | ⒔ | ⒕ | ⒖ | ⒗ | ⒘ | ⒙ | ⒚ | ⒛ | ⒜ | ⒝ | ⒞ | ⒟ |
24A0 | ⒠ | ⒡ | ⒢ | ⒣ | ⒤ | ⒥ | ⒦ | ⒧ | ⒨ | ⒩ | ⒪ | ⒫ | ⒬ | ⒭ | ⒮ | ⒯ |
24B0 | ⒰ | ⒱ | ⒲ | ⒳ | ⒴ | ⒵ | Ⓐ | Ⓑ | Ⓒ | Ⓓ | Ⓔ | Ⓕ | Ⓖ | Ⓗ | Ⓘ | Ⓙ |
24C0 | Ⓚ | Ⓛ | Ⓜ | Ⓝ | Ⓞ | Ⓟ | Ⓠ | Ⓡ | Ⓢ | Ⓣ | Ⓤ | Ⓥ | Ⓦ | Ⓧ | Ⓨ | Ⓩ |
24D0 | ⓐ | ⓑ | ⓒ | ⓓ | ⓔ | ⓕ | ⓖ | ⓗ | ⓘ | ⓙ | ⓚ | ⓛ | ⓜ | ⓝ | ⓞ | ⓟ |
24E0 | ⓠ | ⓡ | ⓢ | ⓣ | ⓤ | ⓥ | ⓦ | ⓧ | ⓨ | ⓩ | ⓪ | ⓫ | ⓬ | ⓭ | ⓮ | ⓯ |
2C60 | Ⱡ | ⱡ | Ɫ | Ᵽ | Ɽ | ⱥ | ⱦ | Ⱨ | ⱨ | Ⱪ | ⱪ | Ⱬ | ⱬ | Ɑ | Ɱ | Ɐ |
2C70 | ⱱ | Ⱳ | ⱳ | ⱴ | Ⱶ | ⱶ | ⱷ | ⱸ | ⱹ | ⱺ | ⱻ | ⱼ | ⱽ | |||
A720 | ꜠ | ꜡ | Ꜣ | ꜣ | Ꜥ | ꜥ | Ꜧ | ꜧ | Ꜩ | ꜩ | Ꜫ | ꜫ | Ꜭ | ꜭ | Ꜯ | ꜯ |
A730 | ꜰ | ꜱ | Ꜳ | ꜳ | Ꜵ | ꜵ | Ꜷ | ꜷ | Ꜹ | ꜹ | Ꜻ | ꜻ | Ꜽ | ꜽ | Ꜿ | ꜿ |
A740 | Ꝁ | ꝁ | Ꝃ | ꝃ | Ꝅ | ꝅ | Ꝇ | ꝇ | Ꝉ | ꝉ | Ꝋ | ꝋ | Ꝍ | ꝍ | Ꝏ | ꝏ |
A750 | Ꝑ | ꝑ | Ꝓ | ꝓ | Ꝕ | ꝕ | Ꝗ | ꝗ | Ꝙ | ꝙ | Ꝛ | ꝛ | Ꝝ | ꝝ | Ꝟ | ꝟ |
A760 | Ꝡ | ꝡ | Ꝣ | ꝣ | Ꝥ | ꝥ | Ꝧ | ꝧ | Ꝩ | ꝩ | Ꝫ | ꝫ | Ꝭ | ꝭ | Ꝯ | ꝯ |
fff | ꝰ | ꝱ | ꝲ | ꝳ | ꝴ | ꝵ | ꝶ | ꝷ | ꝸ | Ꝺ | ꝺ | Ꝼ | ꝼ | Ᵹ | Ꝿ | ꝿ |
fff | Ꞁ | ꞁ | Ꞃ | ꞃ | Ꞅ | ꞅ | Ꞇ | ꞇ | ꞈ | ꞉ | ꞊ | Ꞌ | ꞌ | |||
fff | ꟻ | ꟼ | ꟽ | ꟾ | ꟿ | |||||||||||
U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
fff | ꤀ | ꤁ | ꤂ | ꤃ | ꤄ | ꤅ | ꤆ | ꤇ | ꤈ | ꤉ | ꤊ | ꤋ | ꤌ | ꤍ | ꤎ | ꤏ |
fff | ꤐ | ꤑ | ꤒ | ꤓ | ꤔ | ꤕ | ꤖ | ꤗ | ꤘ | ꤙ | ꤚ | ꤛ | ꤜ | ꤝ | ꤞ | ꤟ |
fff | ꤠ | ꤡ | ꤢ | ꤣ | ꤤ | ꤥ | ꤦ | ꤧ | ꤨ | ꤩ | ꤪ | ꤫ | ꤬ | ꤭ | ꤮ | ꤯ |
fff | ꤰ | ꤱ | ꤲ | ꤳ | ꤴ | ꤵ | ꤶ | ꤷ | ꤸ | ꤹ | ꤺ | ꤻ | ꤼ | ꤽ | ꤾ | ꤿ |
fff | ꥀ | ꥁ | ꥂ | ꥃ | ꥄ | ꥅ | ꥆ | ꥇ | ꥈ | ꥉ | ꥊ | ꥋ | ꥌ | ꥍ | ꥎ | ꥏ |
fff | ꥐ | ꥑ | ꥒ | ꥓ | ꥟ | |||||||||||
fff | ||||||||||||||||
fff | ||||||||||||||||
fff | ||||||||||||||||
fff | ||||||||||||||||
fff | ||||||||||||||||
ffff | ||||||||||||||||
fff | ||||||||||||||||
fff | ||||||||||||||||
fff | ||||||||||||||||
fff | ||||||||||||||||
fff | ff | fi | fl | ffi | ffl | ſt | st | |||||||||
FFff | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
FFff | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
FFff | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
Unicode dan huruf komputer
Fon (huruf komputer) bebas maupun berbayar yang berdasarkan Unicode telah tersedia bebas, sejak fon TrueType dan OpenType mendukung Unicode. Informasi setiap bentuk huruf disimpan dengan menggunakan substitusi karakter universal.
Lihat pula
Pustaka
- The Complete Manual of Typography, James Felici, Adobe Press; 1st edition, 2002
- Unicode Demystified: A Practical Programmer's Guide to the Encoding Standard, Richard Gillam, Addison-Wesley Professional; 1st edition, 2002
- Unicode Explained, Jukka K. Korpela, O'Reilly; 1st edition, 2006
- The Unicode Standard, Version 5.0, Fifth Edition, The Unicode Consortium, Addison-Wesley Professional, Oct. 27, 2006. ISBN 0-321-48091-0
- The Unicode Standard, Version 4.0, The Unicode Consortium, Addison-Wesley Professional, Aug. 27, 2003. ISBN 0-321-18578-1
Pranala luar
- (Inggris) The Unicode Consortium
- (Inggris) decodeunicode
- (Inggris) Unicode Character Search
- (Inggris) Unicode Code Converter v3
- Cuping font