This is the document about the Big5 character set
found at http://www.cns11643.gov.tw/web/big5/.
I made it readable by putting it in an HTML page that uses the Big5 character set.
(I hope this helps and does not violate any copyright.)

BIG-5碼介紹

1.簡介

 BIG-5碼,係由資策會於1984年策劃制定,宗旨原是儘量不使用到控制碼範圍,並配合國人自制的五大(BIG-5)套裝軟體。由於委託民間設計,導致初期的BIG-5碼並不能使用五大套裝軟體。雖然如此,市面上絕大多數的套裝軟體都是在BIG-5內碼系統發展出來的,因此目前市面上有2-3BIG-5碼版本,對使用者來說很難明白其中差異,所以在2003年由財團法人中文數位化技術推廣基金會接受經濟部標準檢驗局委託,召集國內業者代表、專家和學者,就BIG-5編碼字元表原始版本和各主要業界版本予以重整之最新版本,其排列規則說明如下:

 

2.BIG-5碼的字集

 BIG-5碼系統為兩位元組之內碼系統,共可定義19782個字碼,其高、低位元組的範圍如下:

translation
Introduction to the BIG-5 code

1. Overview

The BIG-5 code was planned and drawn up by the Institute for Information Industry (資策會) in 1984. Its original aims were to stay out of the control-code range as far as possible and to work with the five major (BIG-5) domestically developed software packages. Because the design was contracted out to the private sector, the early BIG-5 code could not actually be used with those five packages. Even so, the great majority of software packages on the market were developed on the BIG-5 internal code system, so two or three versions of the BIG-5 code are now in circulation and users find it hard to tell them apart. In 2003, therefore, the Chinese Foundation for Digitization Technology (財團法人中文數位化技術推廣基金會), commissioned by the Bureau of Standards, Metrology and Inspection of the Ministry of Economic Affairs, convened representatives of domestic vendors, experts and scholars to consolidate the original BIG-5 coded character table and the main industry versions into a latest version. Its layout rules are described below:

2. The BIG-5 character set

The BIG-5 code system is a two-byte internal code system that can define 19,782 code points in total. The ranges of its high and low bytes are as follows:

高位元組 ── A1H ∼ FEH (*126)

        8EH ∼ A0H

        81H ∼ 8DH

低位元組 ── 40H ∼ 7EH (*157)

        A1H ∼ FEH

translation
High byte: A1H ~ FEH (*126)

           8EH ~ A0H

           81H ~ 8DH

Low byte:  40H ~ 7EH (*157)

           A1H ~ FEH
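
The byte ranges above can be checked with a few lines of Python. This is only a sketch: the names `HIGH_RANGES`, `LOW_RANGES`, `is_big5_pair` and `count_values` are illustrative and do not come from the source document. It tests whether a byte pair falls inside the listed ranges and confirms that 126 high-byte values times 157 low-byte values gives the 19,782 code points mentioned in the text.

```python
# Byte ranges copied from the listing above; helper names are illustrative.
HIGH_RANGES = [(0x81, 0x8D), (0x8E, 0xA0), (0xA1, 0xFE)]
LOW_RANGES = [(0x40, 0x7E), (0xA1, 0xFE)]


def _in_ranges(value, ranges):
    """Return True if value lies inside any (first, last) inclusive range."""
    return any(first <= value <= last for first, last in ranges)


def is_big5_pair(high, low):
    """Return True if (high, low) lies inside the two-byte ranges listed above."""
    return _in_ranges(high, HIGH_RANGES) and _in_ranges(low, LOW_RANGES)


def count_values(ranges):
    """Count the byte values covered by a list of inclusive ranges."""
    return sum(last - first + 1 for first, last in ranges)


high_count = count_values(HIGH_RANGES)  # 13 + 19 + 94 = 126
low_count = count_values(LOW_RANGES)    # 63 + 94 = 157
total_codes = high_count * low_count    # 126 * 157 = 19782

print(high_count, low_count, total_codes)
print(is_big5_pair(0xA4, 0x40))  # inside both ranges
print(is_big5_pair(0xA4, 0x7F))  # 7FH is not a valid low byte
```

Note that the range check only says a byte pair is structurally possible; whether a character is actually assigned there depends on the BIG-5 version.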

 在本系統中,我們在上述的範圍內,規劃出標準字、特殊符號及使用者造字的區域,分別說明如下:

☆標準字(STDFONT)

translation
In this system, the ranges above are divided into areas for standard characters, special symbols and user-defined characters, each described below:

☆ Standard characters (STDFONT)

  使用範圍 字數 保留範圍 字數
常用字 A440∼C67E 5401 C6A1∼C8FE 408
次常用字 C940∼F9D5 7652 F9D6∼F9FE 41
合 計

13053

449

translation
                                 Used range    Count   Reserved range   Count
Frequently used characters       A440 ~ C67E    5401   C6A1 ~ C8FE        408
Less frequently used characters  C940 ~ F9D5    7652   F9D6 ~ F9FE         41
Total                                          13053                      449
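
The character counts in this table follow from the byte ranges: every high byte carries 157 valid low bytes (63 in 40H~7EH plus 94 in A1H~FEH). The sketch below reproduces the four counts under that assumption; `low_index`, `linear_index` and `count_codes` are hypothetical helper names, not part of the source.

```python
def low_index(low):
    """Position of a low byte among the 157 valid low-byte values (0..156)."""
    if 0x40 <= low <= 0x7E:
        return low - 0x40
    if 0xA1 <= low <= 0xFE:
        return low - 0xA1 + 63  # the 63 values 40H~7EH come first
    raise ValueError(f"invalid Big5 low byte: {low:#04x}")


def linear_index(code):
    """Map a two-byte code such as 0xA440 to a gap-free running index."""
    high, low = code >> 8, code & 0xFF
    return high * 157 + low_index(low)


def count_codes(first, last):
    """Number of valid code points from first to last, both inclusive."""
    return linear_index(last) - linear_index(first) + 1


print(count_codes(0xA440, 0xC67E))  # frequently used characters
print(count_codes(0xC6A1, 0xC8FE))  # reserved range after them
print(count_codes(0xC940, 0xF9D5))  # less frequently used characters
print(count_codes(0xF9D6, 0xF9FE))  # reserved range after them
```

Working with a running index rather than subtracting the raw code values avoids counting the invalid low bytes (00H~3FH, 7FH~A0H, FFH) that sit between consecutive rows.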

※標準字中:兀(A461、C94A[刪除])與 嗀(DCD1、DDFC[刪除]) 兩個字重碼
※BIG5-ETen 與CP950中的倚天字使用次常用字保留範圍共41字

☆特殊符號(SPCFONT、SPCFSUPP)

 1.各種符號區(SPCFONT)

translation
※ Among the standard characters, two characters are encoded twice: 兀 (A461; C94A [deleted]) and 嗀 (DCD1; DDFC [deleted]).
※ The ETen characters in BIG5-ETen and CP950 use the reserved range of the less frequently used characters, 41 characters in total.

☆ Special symbols (SPCFONT, SPCFSUPP)

1. General symbols area (SPCFONT)

  使用範圍 字數 保留範圍 字數
標準字 A140∼A3BF 408 --------- ---
控制碼 A3C0∼A3E0 33 A3E1∼A3FE 30
合 計

411

30

translation
                     Used range    Count   Reserved range   Count
Standard characters  A140 ~ A3BF     408   ---------          ---
Control codes        A3C0 ~ A3E0      33   A3E1 ~ A3FE         30
Total                                411                       30

※CP950的歐元符號(€)使用控制碼保留範圍A3E1位置

 2.罕用符號區(SPCFSUPP)

translation
※ The euro sign (€) in CP950 uses position A3E1 in the reserved range of the control codes.

2. Rarely used symbols area (SPCFSUPP)

  使用範圍 字數 保留範圍 字數
標準字 C6A1∼C8FE 408 --------- ---
合 計

408

translation
                     Used range    Count   Reserved range   Count
Standard characters  C6A1 ~ C8FE     408   ---------          ---
Total                                408                        0

※BIG5-ETen中的倚天擴充字使用罕用符號區C6A1~C8D3範圍,內容有日文假名、俄文等特殊符號
※BIG5-2003中取消 〃(C6DE)、仝(C6DF)以及BIG5-ETen中C7F3~C8D3範圍所定義的俄文與特殊符號

使用者造字(USRFONT)

translation
※ The ETen extension characters in BIG5-ETen use the range C6A1~C8D3 of the rarely used symbols area; they include special symbols such as Japanese kana and Russian letters.
※ BIG5-2003 drops 〃 (C6DE), 仝 (C6DF), and the Russian letters and special symbols that BIG5-ETen defines in the range C7F3~C8D3.

☆ User-defined characters (USRFONT)

  使用範圍 字數 保留範圍 字數
第一段 FA40∼FEFE 785 --------- ---
第二段 8E40∼A0FE 2983 --------- ---
第三段 8140∼8DFE 2041 --------- ---
合 計

5809

translation
                 Used range    Count   Reserved range   Count
First segment    FA40 ~ FEFE     785   ---------          ---
Second segment   8E40 ~ A0FE    2983   ---------          ---
Third segment    8140 ~ 8DFE    2041   ---------          ---
Total                           5809                        0
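
Taken together, the range tables in this section can be turned into a small lookup. The following is a minimal sketch, assuming only the ranges printed in those tables; the `AREAS` list, the area labels and the `classify` function are illustrative names, not an official API, and the result says nothing about whether a particular BIG-5 version really assigns a character at that position.

```python
# (first, last, label) triples taken from the range tables in this section.
AREAS = [
    (0x8140, 0x8DFE, "user-defined, third segment"),
    (0x8E40, 0xA0FE, "user-defined, second segment"),
    (0xA140, 0xA3BF, "special symbols (SPCFONT)"),
    (0xA3C0, 0xA3E0, "control codes"),
    (0xA3E1, 0xA3FE, "reserved (control codes)"),
    (0xA440, 0xC67E, "frequently used characters"),
    (0xC6A1, 0xC8FE, "rarely used symbols (SPCFSUPP)"),
    (0xC940, 0xF9D5, "less frequently used characters"),
    (0xF9D6, 0xF9FE, "reserved (less frequently used characters)"),
    (0xFA40, 0xFEFE, "user-defined, first segment"),
]


def classify(code):
    """Return the area label for a two-byte code, or None if it fits no area."""
    low = code & 0xFF
    # Reject low bytes outside 40H~7EH and A1H~FEH before the range lookup.
    if not (0x40 <= low <= 0x7E or 0xA1 <= low <= 0xFE):
        return None
    for first, last, label in AREAS:
        if first <= code <= last:
            return label
    return None


for code in (0xA440, 0xA3E1, 0xC6A1, 0xF9D6, 0x8E40, 0xA47F):
    print(f"{code:04X}: {classify(code)}")
```

The low-byte check comes first because a plain numeric comparison would otherwise accept values such as A47F, whose low byte lies in the gap between 7EH and A1H.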

 

3.各種BIG5碼的比較

 台灣地區所使用的BIG5碼主要版本:

translation
3. Comparison of the BIG5 code versions

The main versions of the BIG5 code used in Taiwan:

版本

說明

BIG5-1984 最早由資策會所定的版本
BIG5-ETen 倚天版本
CP950 微軟所使用的版本
BIG5-2003 2003年由財團法人中文數位化技術推廣基金會接受經濟部標準檢驗局委託,召集國內業者代表、專家和學者,就BIG-5編碼字元表原始版本和各主要業界版本予以重整之最新版本
BIG5-IBM IBM所使用的版本
translation
Version    Description

BIG5-1984  The earliest version, defined by the Institute for Information Industry
BIG5-ETen  The ETen version
CP950      The version used by Microsoft
BIG5-2003  The latest version, produced in 2003 when the Chinese Foundation for Digitization Technology, commissioned by the Bureau of Standards, Metrology and Inspection of the Ministry of Economic Affairs, convened representatives of domestic vendors, experts and scholars to consolidate the original BIG-5 coded character table and the main industry versions
BIG5-IBM   The version used by IBM
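
Differences between versions can be probed from Python, whose standard library ships codecs named `big5` and `cp950`. This is a rough sketch only: it is an assumption that these codecs approximate any of the versions listed here, and the helper `try_decode` is a hypothetical name. The script decodes a few positions with each codec and prints what comes back, so the output should be read as a description of Python's tables rather than of the versions above.

```python
def try_decode(code, codec):
    """Decode a two-byte code with the given codec; return None if unmapped."""
    try:
        return code.to_bytes(2, "big").decode(codec)
    except UnicodeDecodeError:
        return None


# A440: first frequently used character; A3E1: euro position in CP950;
# F9D6: start of the range used by the ETen extension characters.
for codec in ("big5", "cp950"):
    for code in (0xA440, 0xA3E1, 0xF9D6):
        print(f"{codec:6} {code:04X} -> {try_decode(code, codec)!r}")
```

Returning `None` instead of raising keeps the comparison loop simple when a position is unassigned in one codec but assigned in another.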

  

BIG5-2003與各版本BIG5碼比較表:

translation
Comparison of BIG5-2003 with the other BIG5 versions:
版本 BIG5-2003 BIG5-1984 BIG5-ETen Microsoft-CP950 BIG5-IBM
使用者造字區
(8140 - A0FE)
符號區
(A140 - A2CE)
全形英文字母
(A2CF - A343)
全形希臘字母
(A344 - A373)
注音符號
(A374 - A3BF)
控制符號
(A3C0 - A3E0)
歐元符號
(A3E1)
保留
(A3E2 - A3FE)
常用字
(A440 - C67E)
數字符號
(C6A1 - C6BE)
部首
(C6BF - C6D7)
罕用符號
(C6D8 - C6E6)
日文平假名
(C6E7 - C77A)
日文片假名
(C77B - C7F2)
保留
(C7F3 - C8FE)

(C7F3-C8D3)

有*
次常用字
(C940 - F9D5)
七個倚天外字集的擴充字
(F9D6 - F9DC)
表格符號
(F9DD - F9FE)
使用者造字區和新常用字
(FA40 - FEFE)
translation
Area                                                  Range          BIG5-2003  BIG5-1984  BIG5-ETen        Microsoft-CP950  BIG5-IBM
User-defined character area                           8140 - A0FE    yes        no         yes              yes              no
Symbols                                               A140 - A2CE    yes        yes        yes              yes              yes
Full-width Latin letters                              A2CF - A343    yes        yes        yes              yes              yes
Full-width Greek letters                              A344 - A373    yes        yes        yes              yes              yes
Zhuyin phonetic symbols                               A374 - A3BF    yes        yes        yes              yes              yes
Control symbols                                       A3C0 - A3E0    yes        yes        no               no               yes
Euro sign                                             A3E1           yes        yes        no               yes              no
Reserved                                              A3E2 - A3FE    yes        yes        no               no               no
Frequently used characters                            A440 - C67E    yes        yes        yes              yes              yes
Number symbols                                        C6A1 - C6BE    yes        no         yes              yes              yes
Radicals                                              C6BF - C6D7    yes        no         yes              yes              yes
Rarely used symbols                                   C6D8 - C6E6    yes        no         yes              yes              yes
Japanese hiragana                                     C6E7 - C77A    yes        no         yes              yes              yes
Japanese katakana                                     C77B - C7F2    yes        no         yes              yes              yes
Reserved                                              C7F3 - C8FE    yes        no         yes (C7F3-C8D3)  yes              yes *
Less frequently used characters                       C940 - F9D5    yes        no         yes              yes              yes
Seven ETen external-set extension characters          F9D6 - F9DC    yes        no         yes              yes              yes
Table symbols                                         F9DD - F9FE    yes        no         yes              yes              yes
User-defined area and new frequently used characters  FA40 - FEFE    yes        no         yes              yes              no

*:僅編碼(C7F3 - C878)、(C8CD-C8D3)。

translation
*: Only (C7F3 - C878) and (C8CD - C8D3) are encoded.

 

 

 

BIG-5碼使用範圍表

translation
Table of BIG-5 code ranges in use
