-
GB code is the standard formulated by the National Standards Committee, and the general GB code refers to GB2312, which is the standard formulated by the state in 1980 and the earliest national character set standard. Later, the GBK (National Standard Extension) code was designated for the expansion of processing power, and the range of Chinese characters has increased considerably.
The latest (probably last year) is the latest standard, and GBK is double-byte encoded. GB18030 is a hybrid of single-byte (0x00-0x7f), double-byte (equivalent to GBK), and four-byte encoding (of course, these are very obscure Chinese characters).
-
The national standard code is the standard code for the exchange of Chinese character information, but because the highest bit of the byte before and after it is 0, it conflicts with the ASCII code, such as the word "Bao", the national standard code is 31h and 23h, and the scii of the Western characters "1" and "" is also 31h and 23h, and now if there are two bytes in the memory for 31h and 23h,; Is this one Chinese character or two Western characters "1"; and " "?".So there is ambiguity, obviously, the national standard code is not possible to be directly adopted inside the computer, so,; The in-machine code of Chinese characters adopts the deformed national standard code, and its transformation method is: add 128 to each byte of the national standard code, that is, the highest bit of the two bytes is changed from 0 to 1, and the remaining 7 bits remain unchanged, such as:
From the above we know that the national standard code of the word "Bao" is 3123H, the first byte is 00110001B, the last byte is 00100011B, and the high position is changed to 10110001B and 10100011B is B1A3H, therefore, the machine code of the word is B1A3H;.
-
National standard code: Yu Zhi refers to the "information exchange Chinese character coding character set" published by China in 1980, codenamed "GB2312-80". Consists of two consecutive bytes.
Machine-code: In the computer to represent the Chinese character ** is the Chinese character machine code, the Chinese character machine code evolved from the national standard code, the two bytes representing the national standard code (each byte can be converted into 8-bit binary number, less than 8 digits in front of the zero) The highest digit is added to "1", which becomes the Chinese character machine code.
Trick algorithm: 1) The area code is converted into a hexadecimal number representation first.
2) (hexadecimal representation of the area code) 2020h national standard code;
3) GB code 8080H internal code; Actually 8080h (1000000010000000) 2
Example 1: The national standard code of the known Chinese character "big" is 3473h, and its internal code.
Answer: (34)h= (00110100) 2, add "1" to the top (far left) and become (10110100) 2=(b4)h; Serving the reeds.
73)h=(01110011) 2, which becomes (11110011) 2=(f3)h;
Answer: b4f3h.
or 3473H 8080H B4F3H, and the internal code is B4F3H.
Example 2: The location code 2083 seeks the national standard to discuss the elimination code.
Answer: 1. The area code is 20 and the tag code is 83.
2. Convert the area code 2083 to hexadecimal representation as 1453h; (20)^10=(14)h;(83)^10=(53)h;
h 2020h 3473h, get the national standard code 3473h.
-
GB code. Refers to national standards**.
The national standard is strong and the standard is crowned with "GB". The recommended standard is "GB T". Compared with many ISO international standards, many national standards are equivalent to adopting (IDT, identicalto other standards), modifying (modifiedinrelationto, other standards; Prior to 2000, it was referred to as "equivalent adoption, EQV, equivalentto other criteria) or non-equivalence adoption (NEQ, notequivalentto other criteria).
The official use of GB18030 is now mandatory in the People's Republic of China, but older computers still use GB2312. "GB" often means GB2312-80 or GB18030-2005 in the computer collar scattered field. Both are the standard for the Chinese coding system and are used in Chinese Simplified in China and Singapore.
-
An outer code, also known as an input code, is a set of keyboard symbols used to input Chinese characters into a computer. Commonly used input codes include pinyin code, Wubi font code, natural code, phenoform code, cognitive code, area code and telegram code.
A good coding should have simple coding rules, easy to learn and remember, easy to operate, and repeat the code rate.
Low and fast input speed, everyone can choose according to their own needs.
A is defined in ASCII as 01000001, which is the decimal system 65, and with this standard, when we enter A, the computer can pass the ASCII code.
Know the binary of the characters you enter.
The encoding is 01000001.
And without such a standard, we have to find a way to tell the computer that we have entered an a; There is no such standard, and we need to recode it on other machines to tell the computer that we want to type A.
The ASCII code does not refer to the decimal system, but to binary. It's just a little more customary to use decimal representation, for example, in ASCII code, the binary encoding of a is 01000001, and if it is expressed in decimal is 65, use hexadecimal.
It means 41h.
-
Legal analysis: the national standard code is the national standard code of Chinese characters. The national standard code of Chinese characters was created in 1980, and the national standard for Chinese character coding was promulgated in order to make each Chinese character have a national unified **.
Each Chinese character has a binary code, which is called the Chinese character national standard code. In China's Chinese character ** standard GB2312-80, there are 6763 commonly used Chinese characters that stipulate binary encoding. Each Chinese character has a binary code, which is called the Chinese character national standard code.
Legal basis: Company Law of the People's Republic of China
Article 6 To establish a company, an application for establishment registration shall be made to the company registration authority in accordance with the law. If the establishment conditions stipulated in this Law are met, they shall be registered as a limited liability company or a stock company by the company registration authority; If it does not meet the establishment conditions stipulated in this Law, it shall not be registered as a limited liability company or a share****.
Where laws and administrative regulations stipulate that the establishment of a company must be submitted for approval, the approval formalities shall be completed in accordance with the law before the company is registered. The public may apply to the company registration authority for inquiries into the company's registration matters, and the company registration authority shall provide inquiry services.
Article 7 A business license shall be issued by the company registration authority for a company established in accordance with the law. The date of issuance of the company's business license is the date of incorporation of the company. The company's business license shall indicate the company's name, domicile, registered capital, business scope, name of legal representative, and other matters.
If there is a change in the items recorded in the company's business license, the company shall go through the change registration in accordance with the law, and the company registration authority shall renew the business license.
-
Exocode refers to the "external" form of encoding that can be directly understood after being learned(e.g. text or phonetic symbols)'".
Chinese input method.
The encoding of Chinese characters is an outer code. Common Chinese codes include Cangjie code, row and column code, Dayi code, shrimp code, Zhuyin code, and Pinyin code.
-
We often speak of Chinese characters"Internal code"with"Outer code"。
The internal code is the information encoding used for the storage, processing and transmission of Chinese characters in the computer. It must be compatible with the ASCII code but not conflicting.
So put the highest position of the two bytes of the national standard code'1', to distinguish it from the Western, this is the inner code. The input code of the kanji is called"Outer code"。The input code is the code we use when entering Chinese characters.
Common outer codes are divided into numeric codes (such as area codes), pinyin codes and glyph codes (such as Wubi).
Let's talk about the area code,"Ugh"The area code is 1601, and the hexadecimal is 0x10, 0x01. This conflicts with ASCII encoding, which is widely used by computers. In order to be compatible with the ASCII encoding of 00-7F, we add A0 to the high and low bytes of the area code.
That's it"Ugh"The code becomes b0a1. We refer to the addition of two A0 encodings as GB2312 encodings, although the original GB2312 text does not mention this at all.
The internal code refers to the character encoding inside the operating system. The internal code of early operating systems was language-dependent. Nowadays, Windows uses Unicode internally, and then adapts to various languages with ** pages"Internal code"The concept is a bit more vague.
We generally refer to the encoding specified on the default page as an internal code. There is no official definition of the term encoding. **page is also just a customary name for Microsoft.
As programmers, we just need to know what they are, and we don't need to look too much into these terms.
The so-called ** page (code page) is a character encoding for a language. For example, the code page of the GBK is CP936, the code page of BIG5 is CP950, and the code page of GB2312 is CP20936.
In Windows, there is the concept of a default page, i.e., what encoding is used to interpret characters by default. For example, Windows Notepad opens a text file, and the contents are byte streams: ba, ba, d7, how should I interpret it?
Is it explained according to Unicode encoding, GBK, BIG5, or ISO8859-1? If you explain it by GBK, you will get it"Kanji"Two words. According to other encoding explanations, the corresponding character may not be found, or the wrong character may be found.
So-called"Wrong"It refers to the fact that it does not match the original intention of the author of the text, and then there is garbled characters.
The answer is that Windows interprets the byte stream in the text file according to the current default page. The default page can be set through the regional options of the Control Panel. There is an ANSI in the save behavior of Notepad, which is actually saved according to the encoding method of the default ** page.
-
The national standard code of Chinese characters GB2312-80 stipulates that according to the frequency of use of Chinese characters, Chinese characters are divided into commonly used Chinese characters (Level 1) and non-essential Chinese characters (Level 2) Chinese characters.
The GB2312 standard includes a total of 6,763 Chinese characters, including 3,755 first-level Chinese characters and 3,008 second-level Chinese characters.
Chinese characters are by far the longest continuously used script, and they are the only ones that have been passed down to the present day among the major writing systems of the Ridge Period of Antiquity, and Chinese characters have been the main official script in all Chinese dynasties.
In ancient times, Chinese characters also served as the only international communication script in East Asia, and until the 20th century, it was still the official written standard text of Japan, the Korean Peninsula, Vietnam, the Ryukyus and other countries.
The national standard code is the standard code for the exchange of information between Chinese characters.
Wrong. National standard code: according to the international standard of the development of Chinese characters and symbols of the exchange code, the national standard GB2312-80 contains 7445 Chinese characters and characters, according to the characteristics and position of each character, all characters are divided into 94 areas, 94 bits. >>>More
PUK code. It is a security password for the mobile phone card, and the SIM card is used to protect your safety. >>>More
Make an appointment in advance every Tuesday at 12 o'clock. >>>More
Hello, P0507 Fault Code Definition: The engine speed of the idle air control system is higher than expected. >>>More