The Unicode Standard, Version 16.0
Archived Code Charts
This file contains the complete set of character code tables and list of character names for The Unicode Standard, Version 16.0.
This file will not be updated with errata, or when additional characters are assigned to the Unicode Standard.
See https://www.unicode.org/errata/ for an up-to-date list of errata.
See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts.
See https://www.unicode.org/charts/PDF/Unicode-16.0/ for charts showing only the characters added in Unicode 16.0.
See https://www.unicode.org/Public/16.0.0/charts/ for a complete archived file of character code charts for Unicode 16.0.
See https://www.unicode.org/charts/About.html#Conventions for conventions used in these code charts, and other general information.
Disclaimer
These charts are provided as the online reference to the character contents of the Unicode Standard, Version 16.0 but do
not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete
understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode
Standard, Version 16.0, online at https://www.unicode.org/versions/Unicode16.0.0/, as well as the Unicode Standard
Annexes, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available
online.
See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/
A thorough understanding of the information contained in these additional sources is required for a successful
implementation.
Fonts
The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be
expected in actual fonts.
See https://www.unicode.org/charts/fonts.html for a list.
Terms of Use
© 1991–2024 Unicode, Inc. This publication is protected by copyright, and permission must be obtained from Unicode,
Inc. prior to any reproduction, modification, or other use not permitted by the Terms of Use
(https://www.unicode.org/copyright.html). Specifically, you may make copies of this publication and may annotate and
translate it solely for personal or internal business purposes and not for public distribution, provided that any such
permitted copies and modifications fully reproduce all copyright and other legal notices contained in the original. You
may not make copies of or modifications to this publication for public distribution, or incorporate it in whole or in part
into any product or publication without the express written permission of Unicode.
The Unicode Consortium specifically grants ISO a license to produce such code charts with their associated character
names list to show the repertoire of characters for that standard, as a normatively referenced, integral part of that
standard.
Unicode uses most fonts under restricted license from the original font owner. You may not extract, copy, modify, or
distribute fonts or font data from any Unicode Products, including this publication, without license from the font owner.
Use of all Unicode Products, including this publication, is governed by the Unicode Terms of Use
(https://www.unicode.org/copyright.html). The authors, contributors, and publishers have taken care in the preparation of
this publication, but make no express or implied representation or warranty of any kind and assume no responsibility or
liability for errors or omissions or for consequential or incidental damages that may arise therefrom. This publication is
provided “AS-IS” without charge as a convenience to users.
Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries.
The Unicode Standard, Version 16.0, Copyright © 1991-2024 Unicode, Inc. All rights reserved.
C0 Controls and Basic Latin
Range: 0000–007F
[Code chart: grid of columns 000–007 by rows 0–F covering code points 0000–007F. Columns 000–001 hold the C0 controls, columns 002–007 hold the ASCII graphic characters from 0020 SPACE through 007E ~, and 007F is a control.]
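In the chart grid, the column label gives the leading hexadecimal digits of a code point and the row label gives its final digit. A minimal Python sketch of that lookup follows; `chart_cell` is a hypothetical helper name used only for illustration and is not defined by the standard.

```python
def chart_cell(column: str, row: str) -> str:
    """Return the character found at one code chart cell.

    `column` is a column label such as "004"; `row` is a row label,
    a single hex digit "0".."F". Both parameter names are illustrative.
    """
    # The column label supplies the high digits, the row label the last digit.
    code_point = int(column, 16) * 16 + int(row, 16)
    return chr(code_point)


# Column 004, row 1 is code point 0041, LATIN CAPITAL LETTER A.
print(f"{ord(chart_cell('004', '1')):04X}", chart_cell("004", "1"))
```

The same arithmetic applies to any chart in this layout, since each column spans sixteen consecutive code points.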
C0 controls
Alias names are those for ISO/IEC 6429:1992. Commonly used alternative aliases are also shown.
0000 <control>
= NULL
0001 <control>
= START OF HEADING
0002 <control>
= START OF TEXT
0003 <control>
= END OF TEXT
0004 <control>
= END OF TRANSMISSION
0005 <control>
= ENQUIRY
0006 <control>
= ACKNOWLEDGE
0007 <control>
= BELL
0008 <control>
= BACKSPACE
0009 <control>
= CHARACTER TABULATION
= horizontal tabulation (HT)
= tab
000A <control>
= LINE FEED (LF)
= new line (NL)
= end of line (EOL)
000B <control>
= LINE TABULATION
= vertical tabulation (VT)
000C <control>
= FORM FEED (FF)
000D <control>
= CARRIAGE RETURN (CR)
000E <control>
= SHIFT OUT
• known as LOCKING-SHIFT ONE in 8-bit environments
000F <control>
= SHIFT IN
• known as LOCKING-SHIFT ZERO in 8-bit environments
0010 <control>
= DATA LINK ESCAPE
0011 <control>
= DEVICE CONTROL ONE
0012 <control>
= DEVICE CONTROL TWO
0013 <control>
= DEVICE CONTROL THREE
0014 <control>
= DEVICE CONTROL FOUR
0015 <control>
= NEGATIVE ACKNOWLEDGE
0016 <control>
= SYNCHRONOUS IDLE
0017 <control>
= END OF TRANSMISSION BLOCK
0018 <control>
= CANCEL
0019 <control>
= END OF MEDIUM
001A <control>
= SUBSTITUTE
→ FFFD replacement character
001B <control>
= ESCAPE
001C <control>
= INFORMATION SEPARATOR FOUR
= file separator (FS)
001D <control>
= INFORMATION SEPARATOR THREE
= group separator (GS)
001E <control>
= INFORMATION SEPARATOR TWO
= record separator (RS)
001F <control>
= INFORMATION SEPARATOR ONE
= unit separator (US)

ASCII punctuation and symbols
Based on ISO/IEC 646.
0020 SPACE
• sometimes considered a control code
• other space characters: 2000–200A
→ 00A0 no-break space
→ 200B zero width space
→ 202F narrow no-break space
→ 2060 word joiner
→ 2420 ␠ symbol for space
→ 2422 ␢ blank symbol
→ 2423 ␣ open box
→ 3000 ideographic space
→ FEFF zero width no-break space
0021 ! EXCLAMATION MARK
= factorial
= bang
→ 00A1 ¡ inverted exclamation mark
→ 01C3 ǃ latin letter retroflex click
→ 203C ‼ double exclamation mark
→ 203D ‽ interrobang
→ 26A0 ⚠ warning sign
→ 2757 ❗ heavy exclamation mark symbol
→ 2762 ❢ heavy exclamation mark ornament
→ 2E53 ⹓ medieval exclamation mark
→ A71D ꜝ modifier letter raised exclamation mark
0022 " QUOTATION MARK
= double quote
• neutral (vertical), used as opening or closing quotation mark
• preferred characters in English for paired quotation marks are 201C “ & 201D ”
• 05F4 ״ is preferred for gershayim when writing Hebrew
→ 02BA ʺ modifier letter double prime
→ 02DD ˝ double acute accent
→ 02EE ˮ modifier letter double apostrophe
→ 030B ◌̋ combining double acute accent
→ 030E ◌̎ combining double vertical line above
→ 05F4 ״ hebrew punctuation gershayim
→ 201C “ left double quotation mark
→ 201D ” right double quotation mark
→ 2033 ″ double prime
→ 3003 〃 ditto mark
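The C0 control entries (0000–001F) are all listed with the label <control> and are identified only by their aliases. A minimal Python sketch, assuming the standard library's `unicodedata` module (whose data may track a different Unicode version than 16.0), shows how software can tell these entries apart from a named character such as 0020 SPACE. The helper `describe` is a hypothetical name introduced for this illustration.

```python
import unicodedata


def describe(cp: int) -> str:
    """Format one code point as 'XXXX category name' (illustrative helper)."""
    ch = chr(cp)
    # Control characters have no formal character name, so fall back to
    # the label that the names list prints for them.
    name = unicodedata.name(ch, "<control>")
    return f"{cp:04X} {unicodedata.category(ch)} {name}"


for cp in (0x0009, 0x000A, 0x001B, 0x0020):
    print(describe(cp))
```

The general category `Cc` marks the control entries, while SPACE reports `Zs` together with its formal name.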
0023 # NUMBER SIGN
= pound sign (weight)
= hashtag, hash
= crosshatch, octothorpe
• for denoting musical sharp 266F ♯ is preferred
→ 2114 l b bar symbol
→ 2116 № numero sign
→ 2317 ⌗ viewdata square
→ 266F ♯ music sharp sign
→ 29E3 ⧣ equals sign and slanted parallel
0024 $ DOLLAR SIGN
= milréis, escudo
• used for many peso currencies in Latin America and elsewhere
• glyph may have one or two vertical bars
• other currency symbol characters start at 20A0 ₠
→ 00A2 ¢ cent sign
→ 00A4 ¤ currency sign
→ 20B1 peso sign
→ 1F4B2 💲 heavy dollar sign
0025 % PERCENT SIGN
→ 066A ٪ arabic percent sign
→ 2030 ‰ per mille sign
→ 2031 ‱ per ten thousand sign
→ 2052 commercial minus sign
0026 & AMPERSAND
= and
• originally derived from a ligature of ‘e’ and ‘t’
→ 204A ⁊ tironian sign et
→ 214B ⅋ turned ampersand
→ 1F674 heavy ampersand ornament
0027 ' APOSTROPHE
= apostrophe-quote (1.0)
= single quote
= APL quote
• neutral (vertical) glyph with mixed usage
• 2019 ’ is preferred for apostrophe
• preferred characters in English for paired quotation marks are 2018 ‘ & 2019 ’
• 05F3 ׳ is preferred for geresh when writing Hebrew
→ 02B9 ʹ modifier letter prime
→ 02BC ʼ modifier letter apostrophe
→ 02C8 ˈ modifier letter vertical line
→ 0301 ◌́ combining acute accent
→ 030D ◌̍ combining vertical line above
→ 05F3 ׳ hebrew punctuation geresh
→ 2018 ‘ left single quotation mark
→ 2019 ’ right single quotation mark
→ 2032 ′ prime
→ A78C ꞌ latin small letter saltillo
0028 ( LEFT PARENTHESIS
= opening parenthesis (1.0)
0029 ) RIGHT PARENTHESIS
= closing parenthesis (1.0)
• see discussion on semantics of paired bracketing characters
002A * ASTERISK
= star
• can have five or six spokes
→ 066D ٭ arabic five pointed star
→ 203B ※ reference mark
→ 2042 ⁂ asterism
→ 204E low asterisk
→ 2051 two asterisks aligned vertically
→ 20F0 ◌⃰ combining asterisk above
→ 2217 ∗ asterisk operator
→ 26B9 ⚹ sextile
→ 2731 ✱ heavy asterisk
→ A673 ꙳ slavonic asterisk
→ 1F7B6 medium six spoked asterisk

ASCII math operator
002B + PLUS SIGN
→ 02D6 ˖ modifier letter plus sign
→ 2212 − minus sign
→ 2795 ➕ heavy plus sign
→ FB29 hebrew letter alternative plus sign
→ 1F7A2 light greek cross

ASCII punctuation
002C , COMMA
= the use as decimal or thousands separator is locale dependent
→ 060C ، arabic comma
→ 066B ٫ arabic decimal separator
→ 201A ‚ single low-9 quotation mark
→ 2E12 ⸒ hypodiastole
→ 2E41 ⹁ reversed comma
→ 2E4C ⹌ medieval comma
→ 3001 、 ideographic comma
002D - HYPHEN-MINUS
= hyphen, dash
= minus sign
• used generically for hyphen, minus sign or en dash, all of which have dedicated alternatives
→ 00AD soft hyphen
→ 02D7 ˗ modifier letter minus sign
→ 2010 ‐ hyphen
→ 2011 non-breaking hyphen
→ 2012 ‒ figure dash
→ 2013 – en dash
→ 2027 ‧ hyphenation point
→ 2043 ⁃ hyphen bullet
→ 2212 − minus sign
→ 10191 𐆑 roman uncia sign
002E . FULL STOP
= period, dot, decimal point
• the use as decimal point is locale dependent
• may be rendered as a raised decimal point in old style numbers
→ 00B7 · middle dot
→ 06D4 ۔ arabic full stop
→ 2024 ․ one dot leader
→ 2026 … horizontal ellipsis
→ 2E33 ⸳ raised dot
→ 2E3C ⸼ stenographic full stop
→ 3002 。 ideographic full stop
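The 002D HYPHEN-MINUS entry notes that the character is used generically for hyphen, minus sign, or en dash, each of which has a dedicated alternative among its cross references. A minimal Python sketch, assuming the standard library's `unicodedata` module, lists the formal names of a few of those alternatives. The constant `ALTERNATIVES` and the helper `names` are hypothetical, introduced only for this illustration.

```python
import unicodedata

# A few of the dedicated alternatives cross-referenced from 002D.
ALTERNATIVES = [0x2010, 0x2011, 0x2012, 0x2013, 0x2212]


def names(code_points):
    """Map each code point, as four hex digits, to its formal name."""
    return {f"{cp:04X}": unicodedata.name(chr(cp)) for cp in code_points}


for code, name in names([0x002D] + ALTERNATIVES).items():
    print(code, name)
```

Text processing that needs to tell a minus sign from a hyphen can compare against these code points rather than relying on the generic ASCII character.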
002F / SOLIDUS
= slash, forward slash, virgule
→ 0338 ◌̸ combining long solidus overlay
→ 2044 ⁄ fraction slash
→ 2215 ∕ division slash
→ 27CB ⟋ mathematical rising diagonal
→ 29F8 ⧸ big solidus
→ 2E4A ⹊ dotted solidus

ASCII digits
0030 0 DIGIT ZERO
⁓ 0030 FE00 0︀ short diagonal stroke form
0031 1 DIGIT ONE
0032 2 DIGIT TWO
→ 01BB ƻ latin letter two with stroke
→ 218A ↊ turned digit two
0033 3 DIGIT THREE
→ 218B ↋ turned digit three
→ A7AB Ɜ latin capital letter reversed open e
0034 4 DIGIT FOUR
→ A72C Ꜭ latin capital letter cuatrillo
0035 5 DIGIT FIVE
→ 01BC Ƽ latin capital letter tone five
0036 6 DIGIT SIX
0037 7 DIGIT SEVEN
0038 8 DIGIT EIGHT
0039 9 DIGIT NINE

ASCII punctuation
003A : COLON
• also used to denote division or scale; for that mathematical use 2236 ∶ is preferred
• in Finnish and Swedish, also used as intra-word punctuation (abbreviation mark)
→ 02D0 ː modifier letter triangular colon
→ 02F8 ˸ modifier letter raised colon
→ 0589 ։ armenian full stop
→ 05C3 ׃ hebrew punctuation sof pasuq
→ 1361 ethiopic wordspace
→ 1365 ፥ ethiopic colon
→ 205A ⁚ two dot punctuation
→ 205D ⁝ tricolon
→ 2236 ∶ ratio
→ A789 ꞉ modifier letter colon
→ FE30 ︰ presentation form for vertical two dot leader
003B ; SEMICOLON
• this, and not 037E ;, is the preferred character for ‘Greek question mark’
→ 037E ; greek question mark
→ 061B ؛ arabic semicolon
→ 204F reversed semicolon
→ 2E35 ⸵ turned semicolon

ASCII mathematical operators
Other mathematical operators start at 2200.
003C < LESS-THAN SIGN
• paired with 003E > for ASCII-based angle bracket markup conventions
→ 02C2 ˂ modifier letter left arrowhead
→ 2039 ‹ single left-pointing angle quotation mark
→ 2329 〈 left-pointing angle bracket
→ 27E8 ⟨ mathematical left angle bracket
→ 3008 〈 left angle bracket
003D = EQUALS SIGN
• other related characters: 2241 ≁–2263 ≣
→ 1400 ᐀ canadian syllabics hyphen
→ 2248 ≈ almost equal to
→ 2260 ≠ not equal to
→ 2261 ≡ identical to
→ 2E40 ⹀ double hyphen
→ 30A0 ゠ katakana-hiragana double hyphen
→ A78A ꞊ modifier letter short equals sign
→ FE66 ﹦ small equals sign
→ 10190 𐆐 roman sextans sign
→ 1F7F0 🟰 heavy equals sign
003E > GREATER-THAN SIGN
→ 02C3 ˃ modifier letter right arrowhead
→ 203A › single right-pointing angle quotation mark
→ 232A 〉 right-pointing angle bracket
→ 27E9 ⟩ mathematical right angle bracket
→ 3009 〉 right angle bracket

ASCII punctuation
003F ? QUESTION MARK
→ 00BF ¿ inverted question mark
→ 037E ; greek question mark
→ 061F ؟ arabic question mark
→ 203D ‽ interrobang
→ 2047 double question mark
→ 2753 ❓ black question mark ornament
→ 2BD1 uncertainty sign
→ 2E2E ⸮ reversed question mark
→ 2E54 ⹔ medieval question mark
→ FFFD replacement character
0040 @ COMMERCIAL AT
= at sign
= arroba (old Spanish unit of weight)
→ 24D0 ⓐ circled latin small letter a

Uppercase Latin alphabet
0041 A LATIN CAPITAL LETTER A
0042 B LATIN CAPITAL LETTER B
→ 212C ℬ script capital b
0043 C LATIN CAPITAL LETTER C
→ 03F9 Ϲ greek capital lunate sigma symbol
→ 2102 ℂ double-struck capital c
→ 2103 ℃ degree celsius
→ 212D ℭ black-letter capital c
→ 216D Ⅽ roman numeral one hundred
0044 D LATIN CAPITAL LETTER D
→ 216E Ⅾ roman numeral five hundred
0045 E LATIN CAPITAL LETTER E
→ 0190 Ɛ latin capital letter open e
→ 2107 ℇ euler constant
→ 2130 ℰ script capital e
0046 F LATIN CAPITAL LETTER F
→ 2109 ℉ degree fahrenheit
→ 2131 ℱ script capital f
→ 2132 Ⅎ turned capital f
0047 G LATIN CAPITAL LETTER G
0048 H LATIN CAPITAL LETTER H
→ 210B ℋ script capital h
→ 210C ℌ black-letter capital h
→ 210D ℍ double-struck capital h
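The 003B SEMICOLON entry states that 003B, and not 037E, is the preferred character for the Greek question mark. One practical consequence is that 037E is canonically equivalent to 003B, so normalized text contains only the semicolon. A minimal Python sketch, assuming the standard library's `unicodedata` module; the helper name `nfc` is hypothetical.

```python
import unicodedata

GREEK_QUESTION_MARK = "\u037E"
SEMICOLON = "\u003B"


def nfc(text: str) -> str:
    """Return `text` in Normalization Form C (illustrative wrapper)."""
    return unicodedata.normalize("NFC", text)


# The raw code points differ, but they compare equal after normalization.
print(GREEK_QUESTION_MARK == SEMICOLON)
print(nfc(GREEK_QUESTION_MARK) == SEMICOLON)
```

Normalizing both sides before comparing is a common way to treat the two characters as the same punctuation mark in search and matching.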