UTF-8: Perbedaan antara revisi

Konten dihapus Konten ditambahkan
k Bot: Perubahan kosmetika
kTidak ada ringkasan suntingan
 
(13 revisi perantara oleh 9 pengguna tidak ditampilkan)
Baris 17:
| date = August 1, 1998
| accessdate = November 8, 2007
| archive-date = 2007-10-26
}}</ref> UTF-8 juga terus meningkat penggunaannya sebagai ''default character encoding'' dalam [[sistem operasi]], [[bahasa pemrograman]], [[application programming interface|API]], dan [[aplikasi perangkat lunak]].
| archive-url = https://web.archive.org/web/20071026103104/http://www.imc.org/mail-i18n.html
| dead-url = yes
}}</ref> UTF-8 juga terus meningkat penggunaannya sebagai ''default character encoding'' dalam [[sistem operasi]], [[bahasa pemrograman]], [[application programming interface|API]], dan [[aplikasi perangkat lunak]].
 
<!--
Baris 39 ⟶ 42:
 
== Deskripsi ==
Desain UTF-8 dapat dilihat di tabel berikut yaitu skema yang asalnya diusulkan oleh Dave Prosser dan selanjutnya dimodifikasi oleh Ken Thompson (<code>x</code> diganti degandengan bit dari ''code point''):
 
{| class="wikitable"
Baris 472 ⟶ 475:
 
An increasingly popular option is to detect errors with a separate API, and for converters to translate the first byte to a replacement and continue parsing with the next byte. These error bytes will always have the high bit set. Popular replacements include:
* The [[replacement character]] "�" (U+FFFD)
* The invalid Unicode code points U+DC80..U+DCFF where the low 8 bits are the byte's value.
* The Unicode code points U+0080..U+00FF with the same value as the byte, thus interpreting the bytes according to [[ISO/IEC 8859-1|ISO-8859-1]].
Baris 543 ⟶ 546:
The .dex format defined by [[Dalvik (software)|Dalvik]] also uses the same modified UTF-8 to represent string values.<ref>{{cite web
| url = http://source.android.com/tech/dalvik/dex-format.html
| title = .dex — Dalvikdex—Dalvik Executable Format
| quote = [T]he .dex format encodes its string data in a de facto standard modified UTF-8 form, hereafter referred to as MUTF-8.
| accessdate = April 9, 2013
Baris 660 ⟶ 663:
* RFC 2044 (1996)
* RFC 2279 (1998)
* ''The Unicode Standard, Version 3.0'', §2.3 (2000) plus Corrigendum #1 : UTF-8 Shortest Form (2000)
* ''Unicode Standard Annex #27: Unicode 3.1'' (2001)
 
Semua sama dalam mekanika umum, dengan perbedaan pokok pada topik-topik misalnya mengizinkan range nilai code point dan safe handling untuk invalid input.
* [http://doc.cat-v.org/plan_9/4th_edition/papers/utf Original UTF-8 paper] ([http://plan9.bell-labs.com/sys/doc/utf.pdf or pdf] {{Webarchive|url=https://web.archive.org/web/20000917055036/http://plan9.bell-labs.com/sys/doc/utf.pdf |date=2000-09-17 }}) for [[Plan 9 from Bell Labs]]
* RFC 5198 defines UTF-8 [[Unicode equivalence|NFC]] for Network Interchange
* UTF-8 test pages by [http://www.user.uni-hannover.de/nhtcapri/multilingual1.html Andreas Prilop] {{Webarchive|url=https://web.archive.org/web/20171130213601/http://www.user.uni-hannover.de/nhtcapri/multilingual1.html |date=2017-11-30 }}, [http://titus.uni-frankfurt.de/indexe.htm?/unicode/unitest.htm Jost Gippert] and the [http://www.w3.org/2001/06/utf-8-test/UTF-8-demo.html World Wide Web Consortium]
* [http://dotancohen.com/howto/email-utf8.html How to configure e-mail clients to send UTF-8 text]
* Unix/Linux: [http://www.cl.cam.ac.uk/~mgk25/unicode.html UTF-8/Unicode FAQ], [http://www.linux.org/docs/ldp/howto/Unicode-HOWTO.html Linux Unicode HOWTO] {{Webarchive|url=https://web.archive.org/web/20050404103611/http://www.linux.org/docs/ldp/howto/Unicode-HOWTO.html |date=2005-04-04 }}, [http://www.gentoo.org/doc/en/utf-8.xml UTF-8 and Gentoo]
* [http://www.utf8-chartable.de/ The Unicode/UTF-8-character table] displays UTF-8 in a variety of formats (with Unicode and HTML encoding information)
* [http://www.alanwood.net/unicode/browsers.html Unicode and Multilingual Web Browsers] from Alan Wood's Unicode Resources describes support and additional configuration of Unicode/UTF-8 in modern browsers
* [http://jspwiki.org/wiki/JSPWikiBrowserCompatibility JSP Wiki Browser Compatibility page] {{Webarchive|url=https://web.archive.org/web/20130524020747/http://www.jspwiki.org/wiki/JSPWikiBrowserCompatibility |date=2013-05-24 }} details specific problems with UTF-8 in older browsers
* [http://tlt.psu.edu/suggestions/international/bylanguage/math.html#browsers Mathematical Symbols in Unicode] {{Webarchive|url=https://web.archive.org/web/20080821174200/http://tlt.psu.edu/suggestions/international/bylanguage/math.html#browsers |date=2008-08-21 }}
* [http://demo.icu-project.org/icu-bin/convexp?conv=UTF-8 Graphical View of UTF-8 in ICU's Converter Explorer]