A Detailed Explanation of Character Encoding: ASCII, Unicode, and UTF-8
At noon today, I suddenly wanted to understand the relationship between Unicode and UTF-8, so I started looking it up online.
The problem turned out to be more complicated than I expected. I kept reading from lunchtime until 9 o'clock.
Here are my notes, written mainly to sort out my own thoughts. I have tried to keep them straightforward, in the hope that they will be useful to others. After all, character encoding is a cornerstone of computer technology, and you need to know a little about encoding if you want to be proficient with a computer.
1. ASCII code
Inside a computer, all information is ultimately represented as binary values. Each binary digit (bit) has two states, 0 and 1, so eight bits can be combined into 256 states. A group of eight bits is called a byte. In other words, one byte can represent 256 different states, each corresponding to one symbol, for a total of 256 symbols, from 00000000 to 11111111.
In the 1960s, the United States developed a character encoding that standardized the relationship between English characters and binary digits. It is called ASCII and is still in use today.
ASCII specifies a total of 128 characters. For example, the space character (SPACE) is 32 (binary 00100000), and the uppercase letter A is 65 (binary 01000001). These 128 symbols, which include 32 control symbols that cannot be printed, occupy only the lower 7 bits of a byte; the leading bit is always 0.
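As a quick check of the arithmetic above, here is a minimal Python sketch. The variable names are illustrative and not part of the original notes. It counts the states of an 8-bit byte and prints the lowest and highest bit patterns.

```python
# Number of bits in one byte, as described in the text.
BITS_PER_BYTE = 8

# Each bit has 2 states, so 8 bits give 2 ** 8 combinations.
states = 2 ** BITS_PER_BYTE

# Format the smallest and largest byte values as zero-padded 8-digit binary.
lowest = format(0, "08b")
highest = format(states - 1, "08b")

print(states)   # 256
print(lowest)   # 00000000
print(highest)  # 11111111
```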
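The ASCII values mentioned above can be checked with Python's built-in `ord` and `format`. This is only an illustrative sketch; the helper name `to_bits` is an assumption and does not come from the original notes.

```python
def to_bits(ch: str) -> str:
    """Return the code of a single character as a zero-padded 8-digit binary string."""
    return format(ord(ch), "08b")

# Decimal codes of the two characters named in the text.
space_code = ord(" ")
a_code = ord("A")

# Their 8-bit patterns; the leading bit is 0 in both.
space_bits = to_bits(" ")
a_bits = to_bits("A")

# All 128 ASCII codes (0 to 127) leave the highest bit of the byte unset.
top_bit_always_zero = all((code & 0x80) == 0 for code in range(128))

print(space_code, space_bits)  # 32 00100000
print(a_code, a_bits)          # 65 01000001
print(top_bit_always_zero)     # True
```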
2. Non-ASCII encoding
128 symbols are enough to encode English, but not enough to represent other languages. French, for example, places accent marks above some letters, which ASCII cannot represent. So some European countries decided to use the idle highest bit of the byte to encode new symbols. For example, the French é is encoded as 130 (binary 10000010). As a result, the co