English Deutsch Français 简体中文 繁體中文
Book123, Download eBooks for Free - Anytime! Submit your article

Categories

Share With Friends



Like Book123?! Give us +1

Archive by Date

Search Tag

Newest

Useful Links


The Unicode Standard, Version 4.0

Posted on 2010-04-10




Name:The Unicode Standard, Version 4.0
ASIN/ISBN:0321185781
Language:English
File size:13.9 Mb
   The Unicode Standard, Version 4.0

Free Download Now     Free register and download UseNet downloader, then you can FREE Download from UseNet.

    Download without Limit " The Unicode Standard, Version 4.0 " from UseNet for FREE!


extensive additions of CJK characters to cover dictionaries and historic usage

many new symbols for mathematical and technical publication

substantially improved specification of conformance requirements, incorporating the character encoding model

encoding of supplementary characters

formalized policies for stability of the standard

clarification of semantics of special characters, including the byte order mark

major expansion of Unicode Character Database properties and of specifications for text boundaries and casing

more minority scripts, including Limbu, Tai Le, Osmanya, and Philippine scripts

more historic scripts, including Linear B, Cypriot, and Ugaritic

tightened definition of encoding terms, including UTF-32

Furthermore, many individual characters were added to meet the requirements of users and implementers alike. The Unicode Standard maintains consistency with the international standard ISO/IEC 10646. Version 4.0 of the Unicode Standard corresponds to ISO/IEC 10646:2003.0.1 About the Unicode StandardThis book, together with the Unicode Standard Annexes described in Appendix B, and the Unicode Character Database, defines Version 4.0 of the Unicode Standard. The book gives the general principles, requirements for conformance, and guidelines for implementers, followed by character code charts and names.Concepts, Architecture, Conformance, and GuidelinesThe first five chapters of Version 4.0 introduce the Unicode Standard and provide the fundamental information needed to produce a conforming implementation. Basic text processing, working with combining marks, and encoding forms are all described. A special chapter on implementation guidelines answers many common questions that arise when implementing Unicode.Chapter 1 introduces the standard& 8217;s basic concepts, design basis, and coverage, and discusses basic text handling requirements.Chapter 2 sets forth the fundamental principles underlying the Unicode Standard and covers specific topics such as text processes, overall character properties, and the use of combining marks.Chapter 3 constitutes the formal statement of conformance. This chapter also presents the normative algorithms for two processes: the canonical ordering of combining marks and the encoding of Korean Hangul syllables by conjoining jamo.Chapter 4 describes character properties in detail, both normative (required) and informative. Tables giving additional character property information appear in the Unicode Character Database.Chapter 5 discusses implementation issues, including compression, strategies for dealing with unknown and unsupported characters, and transcoding tother standards.Character Block DescriptionsChapters 6 through 15 contain the character block descriptions that give basic information about each script or group of symbols and may discuss specific characters or pertinent layout information. Some of this information is required in order to produce conformant implementations of these scripts and other collections of characters.Chapter 6 introduces writing systems and describes the general punctuation characters.Chapter 7 presents the European Alphabetic scripts, including Latin, Greek, Cyrillic, Armenian, Georgian, and associated combining marks.Chapter 8 presents the Middle Eastern, left-to-left scripts: Hebrew, Arabic, Syriac, and Thaana.Chapter 9 covers the South Asian scripts, including Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, Malayalam, Sinhala, Tibetan, and Limbu. Chapter 10 covers the Southeast Asian scripts, including Thai, Lao, Tai Le, Myanmar, Khmer, and Philippine scripts.Chapter 11 presents the East Asian scripts, including Han, Hiragana, Katakana, Hangul, Bopomofo, and Yi.Chapter 12 presents other scripts, including Ethiopic, Mongolian, Osmanya, Cherokee, Canadian Aboriginal Syllabics, Deseret, and Shavian.Chapter 13 describes archaic scripts, including Ogham, Old Italic, Runic, Gothic, Ugaritic, Linear B, and Cypriot.Chapter 14 presents symbols, including currency, letterlike and technical symbols, mathematical operators, and musical symbols.Chapter 15 describes other topics such as private-use characters, surrogate code points, and special characters.Charts and Han Radical-Stroke IndexThe next two chapters document the Unicode Standard& 8217;s character code assignments, their names and important descriptive information, and provide a Han radical-stroke index that aids in locating specific ideographs encoded in Unicode.Chapter 16 gives the code charts and the Character Names List. The code charts contain the normative character encoding assignments, and the names list contains normative information as well as useful cross references and informational notes.Chapter 17 provides a radical-stroke index to East Asian ideographs.AppendicesThe appendices contain detailed background information on important topics regarding the history of the Unicode Standard and its relationship to ISO/IEC 10646.Appendix A describes the history of Han Unification in the Unicode Standard.Appendix B provides abstracts of Unicode Technical Reports and lists other important Unicode resources.Appendix C details the relationship between the Unicode Standard and ISO/IEC 10646.Appendix D lists the changes to the Unicode Standard since Version 3.0.The appendices are followed by a glossary of terms, a bibliography, and two indices: an index to Unicode characters and an index to the text of the book.0.2 The Unicode Character Database and Technical ReportsThe Unicode Character Database is a collection of data files that contain character code points, character names and character property data.See Appendix B for a summary overview of important Unicode Technical Standards, Unicode Technical Reports and Unicode Standard Annexes.On the CD-ROMThe CD-ROM also contains additional information, such as sample code, which is maintained on the Unicode ftp site at: ftp.unicode.For the complete contents of the CD-ROM see its ReadMe.txt file.0.3 Notational ConventionsThroughout this book, certain typographic conventions are used.Code PointsIn running text, an individual Unicode code point can be expressed as U n, where n is from four to six hexadecimal digits, using the digits 0-9 and uppercase letters A-F (for 10 through 15, respectively). There should be no leading zeros, unless the code point would have fewer than four hexadecimal digits; for example, U 0001, U 0012, U 0123, U 1234, U 12345, U 102345. .

The book can be accessed chapterwise here: .

Buy Book at Lowest Price on Amazon

.

Related Articles


Technical The Unicode Standard, Version 3.0

Technical The Unicode Standard, Version 3.0

Author: The Unicode Consortium, Joan Aliprand, Julie Allen, Rick McGowan, Joe Becker, MiPublisher: Addison-Wesley Pub CoPublish Date: 16 February, 2000ISBN: 0201616335

Technical The Unicode Standard, Version 5.0

Technical The Unicode Standard, Version 5.0

The Unicode Standard, Version 5.0 Publisher: Addison-Wesley ProfessionalNumber Of Pages: 1472Publication Date: 2006-11-09Sales Rank: 252571ISBN / ASIN: 0321480910EAN: 9780321480910Binding: HardcoverManufacturer: Addison-Wes ...

Programming Unicode Demystified. A Practical Programmer's Guide to the Encoding Standard

Programming Unicode Demystified. A Practical Programmer's Guide to the Encoding Standard

Technical Unicode Standard, Version 5.0 (5th Edition)

Technical Unicode Standard, Version 5.0 (5th Edition)

Publisher: Addison-Wesley ProfessionalLanguage: EnglishISBN: 0321480910Paperback:?1472 pagesData: Nov 2006Format: PDFDescription: “Hard copy versions of the Unicode Standard have been among the most crucial and most heavily used refer ...

Technical Unicode Demystified: A Practical Programmer's Guide to the Encoding Standard

Technical Unicode Demystified: A Practical Programmer's Guide to the Encoding Standard

Technical The Unicode Standard 5.0

Technical The Unicode Standard 5.0

Unicode Standard, Version 5.0, The (5th Edition) .by The Unicode Consortium (Author) . .

Share this page with your friends now!
Text link
Forum (BBCode)
Website (HTML)
Tags:
Unicode   Standard   Version  
 

DISCLAIMER:

This site does not store The Unicode Standard, Version 4.0 on its server. We only index and link to The Unicode Standard, Version 4.0 provided by other sites. Please contact the content providers to delete The Unicode Standard, Version 4.0 if any and email us, we'll remove relevant links or contents immediately.

Comments (0) All

Verify: Verify

    Sign In   Not yet a member?