#
# Name: BIG5 to Unicode table (complete)
# Unicode version: 1.1
# Table version: 0.0d3
# Table format: Format A
# Date: 11 February 1994
#
# Copyright (c) 1991-1994 Unicode, Inc. All Rights reserved.
#
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
#
# General notes:
#
#
# This table contains one set of mappings from BIG5 into Unicode.
# Note that these data are *possible* mappings only and may not be the
# same as those used by actual products, nor may they be the best suited
# for all uses. For more information on the mappings between various code
# pages incorporating the repertoire of BIG5 and Unicode, consult the
# VENDORS mapping data. Normative information on the mapping between
# BIG5 and Unicode may be found in the Unihan.txt file in the
# latest Unicode Character Database.
#
# If you have carefully considered the fact that the mappings in
# this table are only one possible set of mappings between BIG5 and
# Unicode and have no normative status, but still feel that you
# have located an error in the table that requires fixing, you may
# report any such error to errata@unicode.org.
#
# WARNING! It is currently impossible to provide round-trip compatibility
# between BIG5 and Unicode.
#
# A number of characters are not currently mapped because
# of conflicts with other mappings. They are as follows:
#
# BIG5 Description Comments
#
# 0xA15A SPACING UNDERSCORE duplicates A1C4
# 0xA1C3 SPACING HEAVY OVERSCORE not in Unicode
# 0xA1C5 SPACING HEAVY UNDERSCORE not in Unicode
# 0xA1FE LT DIAG UP RIGHT TO LOW LEFT duplicates A2AC
# 0xA240 LT DIAG UP LEFT TO LOW RIGHT duplicates A2AD
# 0xA2CC HANGZHOU NUMERAL TEN conflicts with A451 mapping
# 0xA2CE HANGZHOU NUMERAL THIRTY conflicts with A4CA mapping
#
# We currently map all of these characters to U+FFFD REPLACEMENT CHARACTER.
# It is also possible to map these characters to their duplicates, or to
# the user zone.
#
# Notes:
#
# 1. In addition to the above, there is some uncertainty about the
# mappings in the range C6A1 - C8FE, and F9DD - F9FE. The ETEN
# version of BIG5 organizes the former range differently, and adds
# additional characters in the latter range. The correct mappings
# these ranges need to be determined.
#
# 2. There is an uncertainty in the mapping of the Big Five character
# 0xA3BC. This character occurs within the Big Five block of tone marks
# for bopomofo and is intended to be the tone mark for the first tone in
# Mandarin Chinese. We have selected the mapping U+02C9 MODIFIER LETTER
# MACRON (Mandarin Chinese first tone) to reflect this semantic.
# However, because bopomofo uses the absense of a tone mark to indicate
# the first Mandarin tone, most implementations of Big Five represent
# this character with a blank space, and so a mapping such as U+2003 EM
# SPACE might be preferred.
#
# Format: Three tab-separated columns
# Column #1 is the BIG5 code (in hex as 0xXXXX)
# Column #2 is the Unicode (in hex as 0xXXXX)
# Column #3 is the Unicode name (follows a comment sign, '#')
# The official names for Unicode characters U+4E00
# to U+9FA5, inclusive, is "CJK UNIFIED IDEOGRAPH-XXXX",
# where XXXX is the code point. Including all these
# names in this file increases its size substantially
# and needlessly. The token "<CJK>" is used for the
# name of these characters. If necessary, it can be
# expanded algorithmically by a parser or editor.
#
# The entries are in BIG5 order
#
#
0xA140 0x3000 # IDEOGRAPHIC SPACE
0xA141 0xFF0C # FULLWIDTH COMMA
0xA142 0x3001 # IDEOGRAPHIC COMMA
0xA143 0x3002 # IDEOGRAPHIC FULL STOP
0xA144 0xFF0E # FULLWIDTH FULL STOP
0xA145 0x2022 # BULLET
0xA146 0xFF1B # FULLWIDTH SEMICOLON
0xA147 0xFF1A # FULLWIDTH COLON
0xA148 0xFF1F # FULLWIDTH QUESTION MARK
0xA149 0xFF01 # FULLWIDTH EXCLAMATION MARK
0xA14A 0xFE30 # PRESENTATION FORM FOR VERTICAL TWO DOT LEADER
0xA14B 0x2026 # HORIZONTAL ELLIPSIS
0xA14C 0x2025 # TWO DOT LEADER
0xA14D 0xFE50 # SMALL COMMA
0xA14E 0xFF64 # HALFWIDTH IDEOGRAPHIC COMMA
0xA14F 0xFE52 # SMALL FULL STOP
0xA150 0x00B7 # MIDDLE DOT
0xA151 0xFE54 # SMALL SEMICOLON
0xA152 0xFE55 # SMALL COLON
0xA153 0xFE56 # SMALL QUESTION MARK
0xA154 0xFE57 # SMALL EXCLAMATION MARK
0xA155 0xFF5C # FULLWIDTH VERTICAL BAR
0xA156 0x2013 # EN DASH
0xA157 0xFE31 # PRESENTATION FORM FOR VERTICAL EM DASH
0xA158 0x2014 # EM DASH
0xA159 0xFE33 # PRESENTATION FORM FOR VERTICAL LOW LINE
0xA15A 0xFFFD # *** NO MAPPING ***
0xA15B 0xFE34 # PRESENTATION FORM FOR VERTICAL WAVY LOW LINE
0xA15C 0xFE4F # WAVY LOW LINE
0xA15D 0xFF08 # FULLWIDTH LEFT PARENTHESIS
0xA15E 0xFF09 # FULLWIDTH RIGHT PARENTHESIS
0xA15F 0xFE35 # PRESENTATION FORM FOR VERTICAL LEFT PARENTHESIS
0xA160 0xFE36 # PRESENTATION FORM FOR VERTICAL RIGHT PARENTHESIS
0xA161 0xFF5B # FULLWIDTH LEFT CURLY BRACKET
0xA162 0xFF5D # FULLWIDTH RIGHT CURLY BRACKET
0xA163 0xFE37 # PRESENTATION FORM FOR VERTICAL LEFT CURLY BRACKET
0xA164 0xFE38 # PRESENTATION FORM FOR VERTICAL RIGHT CURLY BRACKET
0xA165 0x3014 # LEFT TORTOISE SHELL BRACKET
0xA166 0x3015 # RIGHT TORTOISE SHELL BRACKET
0xA167 0xFE39 # PRESENTATION FORM FOR VERTICAL LEFT TORTOISE SHELL BRACKET
0xA168 0xFE3A # PRESENTATION FORM FOR VERTICAL RIGHT TORTOISE SHELL BRACKET
0xA169 0x3010 # LEFT BLACK LENTICULAR BRACKET
0xA16A 0x3011 # RIGHT BLACK LENTICULAR BRACKET
0xA16B 0xFE3B # PRESENTATION FORM FOR VERTICAL LEFT BLACK LENTICULAR BRACKET
0xA16C 0xFE3C # PRESENTATION FORM FOR VERTICAL RIGHT BLACK LENTICULAR BRACKET
0xA16D 0x300A # LEFT DOUBLE ANGLE BRACKET
0xA16E 0x300B # RIGHT DOUBLE ANGLE BRACKET
0xA16F 0xFE3D # PRESENTATION FORM FOR VERTICAL LEFT DOUBLE ANGLE BRACKET
0xA170 0xFE3E # PRESENTATION FORM FOR VERTICAL RIGHT DOUBLE ANGLE BRACKET
0xA171 0x3008 # LEFT ANGLE BRACKET
0xA172 0x3009 # RIGHT ANGLE BRACKET
0xA173 0xFE3F # PRESENTATION FORM FOR VERTICAL LEFT ANGLE BRACKET
0xA174 0xFE40 # PRESENTATION FORM FOR VERTICAL RIGHT ANGLE BRACKET
0xA175 0x300C # LEFT CORNER BRACKET
0xA176 0x300D # RIGHT CORNER BRACKET
0xA177 0xFE41 # PRESENTATION FORM FOR VERTICAL LEFT CORNER BRACKET
0xA178 0xFE42 # PRESENTATION FORM FOR VERTICAL RIGHT CORNER BRACKET
0xA179 0x300E # LEFT WHITE CORNER BRACKET
0xA17A 0x300F # RIGHT WHITE CORNER BRACKET
0xA17B 0xFE43 # PRESENTATION FORM FOR VERTICAL LEFT WHITE CORNER BRACKET
0xA17C 0xFE44 # PRESENTATION FORM FOR VERTICAL RIGHT WHITE CORNER BRACKET
0xA17D 0xFE59 # SMALL LEFT PARENTHESIS
0xA17E 0xFE5A # SMALL RIGHT PARENTHESIS
0xA1A1 0xFE5B # SMALL LEFT CURLY BRACKET
0xA1A2 0xFE5C # SMALL RIGHT CURLY BRACKET
0xA1A3 0xFE5D # SMALL LEFT TORTOISE SHELL BRACKET
0xA1A4 0xFE5E # SMALL RIGHT TORTOISE SHELL BRACKET
0xA1A5 0x2018 # LEFT SINGLE QUOTATION MARK
0xA1A6 0x2019 # RIGHT SINGLE QUOTATION MARK
0xA1A7 0x201C # LEFT DOUBLE QUOTATION MARK
0xA1A8 0x201D # RIGHT DOUBLE QUOTATION MARK
0xA1A9 0x301D # REVERSED DOUBLE PRIME QUOTATION MARK
0xA1AA 0x301E # DOUBLE PRIME QUOTATION MARK
0xA1AB 0x2035 # REVERSED PRIME
0xA1AC 0x2032 # PRIME
0xA1AD 0xFF03 # FULLWIDTH NUMBER SIGN
0xA1AE 0xFF06 # FULLWIDTH AMPERSAND
0xA1AF 0xFF0A # FULLWIDTH ASTERISK
0xA1B0 0x203B # REFERENCE MARK
0xA1B1 0x00A7 # SECTION SIGN