
Dataset Description

Each GB code in the database consists of six digits. The first two digits represent the province, the middle two the prefecture, and the last two the county. Province, prefecture, and county codings are contained in the linked document, Province codings. Because GB codes do not exist for every administrative unit that existed during the period covered by the database, it was necessary to create codes in certain circumstances. The construction of the database and the assignment of values are described in the linked document, Procedures used in creating the GB database. The dataset consists of twelve fields for each record (see variable codes for a description of the coding scheme):

FIELD      DESCRIPTION                                             FORMAT
--------------------------------------------------------------------------------------------
C-gbcode   GB code                                                 integer
C-source   Source of code                                          text
N-pinyin   Romanized name in Pinyin                                text
N-local    Romanized name using some non-Chinese pronunciations    text
N-hanzi    Name in Chinese characters                              text
H          Hierarchical position of unit (county-level)            integer
A          Administrative status of unit                           integer
change     Configuration of changes                                text
P          Hierarchical position of unit (prefecture-level)        integer
fromdate   First day the coded configuration was in effect         8 digits in yyyymmdd
todate     Last day the coded configuration was in effect          8 digits in yyyymmdd
NOTES      Details regarding special circumstances                 text (255 characters)
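
The six-digit code layout and the yyyymmdd date fields described above can be illustrated with a short Python sketch. The function names (split_gbcode, parse_yyyymmdd, in_effect) and the sample values are hypothetical and are not part of the database or its interface. The sketch assumes that fromdate and todate always hold valid calendar dates; if the database uses special values for open-ended periods, those would need separate handling.

```python
from datetime import date, datetime
from typing import NamedTuple


class GBParts(NamedTuple):
    province: str    # digits 1-2
    prefecture: str  # digits 3-4
    county: str      # digits 5-6


def split_gbcode(code) -> GBParts:
    """Split a six-digit GB code into province, prefecture, and county parts."""
    text = str(code).strip()
    if len(text) != 6 or not text.isdigit():
        raise ValueError(f"expected a six-digit GB code, got {code!r}")
    return GBParts(text[0:2], text[2:4], text[4:6])


def parse_yyyymmdd(value) -> date:
    """Convert an 8-digit yyyymmdd value (int or str) to a date."""
    text = str(value).strip()
    if len(text) != 8 or not text.isdigit():
        raise ValueError(f"expected 8 digits in yyyymmdd form, got {value!r}")
    return datetime.strptime(text, "%Y%m%d").date()


def in_effect(fromdate, todate, on: date) -> bool:
    """True if `on` falls within the inclusive fromdate..todate range."""
    return parse_yyyymmdd(fromdate) <= on <= parse_yyyymmdd(todate)


# Illustrative values only; 123456 is not taken from the database.
print(split_gbcode(123456))
print(in_effect(19820101, 19941231, date(1990, 6, 1)))
```

Both dates are treated as inclusive because the field descriptions give the first and the last day on which a configuration was in effect.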

Files and formats

FILE            FORMAT
-----------------------------------------------------------------------------------
gbcodes1.mdb    Microsoft Access Version 2.0 database with a user interface for
                searching on specific names or codes and for automatically
                creating provincial and/or temporal subsets of the full
                1982-1994 GB Codes database. The current version requires
                Microsoft Access 2.0 or higher to run.

gbcodes1.txt    Plain ASCII, comma delimited with field names.
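
As a rough sketch of how the comma-delimited file might be read, the Python example below uses the standard csv module. It assumes that the first line of gbcodes1.txt holds the field names and that they match the field list above; neither has been checked against the actual file. The helper names and the two sample rows are hypothetical and do not come from the database.

```python
import csv
import io


def read_gb_records(handle):
    """Return each row as a dict keyed by the field names in the first line."""
    return list(csv.DictReader(handle))


def records_for_province(records, province, code_field="C-gbcode"):
    """Keep records whose GB code starts with the two-digit province code."""
    return [r for r in records if r[code_field].strip().startswith(province)]


# Hypothetical two-row sample with a subset of the fields (not real data).
SAMPLE = (
    "C-gbcode,N-pinyin,fromdate,todate\n"
    "123456,Example Xian,19820101,19941231\n"
    "654321,Other Xian,19820101,19941231\n"
)

records = read_gb_records(io.StringIO(SAMPLE))
subset = records_for_province(records, "12")
print(len(records), len(subset))
```

For the real file, the in-memory sample would be replaced by something like open("gbcodes1.txt", newline="", encoding="ascii"), again assuming the file is plain ASCII as stated.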


Last modified on 12 May, 1997