Updating Population, GDP, Literacy

Updated 2021-02-10 by Yoshito

These instructions assume the Chrome browser.

Load the World DataBank

The World DataBank is at (http://databank.worldbank.org/data/views/variableselection/selectvariables.aspx?source=world-development-indicators). Unfortunately, the link changes often. If the page has moved, try to reach it through the following steps. Each link below works at the time of writing, but may change again.

  1. Go to http://worldbank.org
  2. Click “View More Data” in the Data section (http://data.worldbank.org/)
  3. Click “Data Catalog” (http://datacatalog.worldbank.org/)
  4. Search “World Development Indicators” (http://data.worldbank.org/data-catalog/world-development-indicators)
  5. In the “Data & Resources” tab, click the blue “Databank” link. It should open a new window: https://databank.worldbank.org/reports.aspx?source=world-development-indicators
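
Because the links move so often, it can help to check which of the documented URLs still responds before starting. The following is a minimal sketch, not part of the CLDR tooling: the function names `is_reachable` and `first_reachable` are made up for illustration, and a successful response only shows that the server answered, not that the page still has the expected content.

```python
import urllib.error
import urllib.request

# URLs copied from the instructions above, newest first.
CANDIDATES = [
    "https://databank.worldbank.org/reports.aspx?source=world-development-indicators",
    "http://databank.worldbank.org/data/views/variableselection/selectvariables.aspx?source=world-development-indicators",
]


def is_reachable(url, timeout=10):
    """Return True if the URL answers with a 2xx or 3xx status."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return 200 <= response.status < 400
    except (urllib.error.URLError, OSError, ValueError):
        return False


def first_reachable(urls, probe=is_reachable):
    """Return the first URL for which probe(url) is true, or None."""
    for url in urls:
        if probe(url):
            return url
    return None


if __name__ == "__main__":
    print(first_reachable(CANDIDATES) or "No documented link works; follow the manual steps.")
```

The `probe` parameter exists only so the selection logic can be exercised without network access.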

Once you are there, generate a file using the following steps. There are three collapsible sections: “Country”, “Series”, and “Time”.

Load UN Literacy Data

  1. Go to http://data.un.org/Data.aspx?d=POP&f=tableCode:31

    If this link is broken, go to http://unstats.un.org/unsd/demographic/products/socind/ and look for “Population by literacy, age, sex and urban/rural residence”

  2. Expand all filters. Choose the latest year and Both Sexes. Leave the other filters alone. Click Apply Filter.
  3. Click Download -> XML
  4. Save this as tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/external/un_literacy.xml
  5. Run the UnLiteracyParser tool to get a detailed validation of this data. As of this writing, there are 12 unexplained errors where the data does not add up.
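
To illustrate what "does not add up" can mean, here is a rough Python sketch of a sum check. It is not the logic of UnLiteracyParser. It assumes the downloaded XML consists of `record` elements containing `field` elements with a `name` attribute, and that a `Literacy` field holds either `Total` or a category such as `Literate` or `Illiterate`. Those names, and the sample data, are assumptions made for this example and should be checked against the real file.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Made-up sample in the assumed layout; not real UN data.
SAMPLE = """<ROOT><data>
<record><field name="Country or Area">Exampleland</field><field name="Literacy">Total</field><field name="Value">1000</field></record>
<record><field name="Country or Area">Exampleland</field><field name="Literacy">Literate</field><field name="Value">900</field></record>
<record><field name="Country or Area">Exampleland</field><field name="Literacy">Illiterate</field><field name="Value">100</field></record>
<record><field name="Country or Area">Otherland</field><field name="Literacy">Total</field><field name="Value">500</field></record>
<record><field name="Country or Area">Otherland</field><field name="Literacy">Literate</field><field name="Value">400</field></record>
<record><field name="Country or Area">Otherland</field><field name="Literacy">Illiterate</field><field name="Value">90</field></record>
</data></ROOT>"""


def find_mismatches(xml_text):
    """Return {group_key: (total, sum_of_parts)} for groups whose parts differ from the total.

    A group is every record sharing all fields except Literacy and Value.
    """
    root = ET.fromstring(xml_text)
    totals = {}
    parts = defaultdict(float)
    for record in root.iter("record"):
        fields = {f.get("name"): (f.text or "").strip() for f in record.iter("field")}
        key = tuple(sorted((k, v) for k, v in fields.items() if k not in ("Literacy", "Value")))
        value = float(fields["Value"])
        if fields["Literacy"] == "Total":
            totals[key] = value
        else:
            parts[key] += value
    return {k: (t, parts[k]) for k, t in totals.items() if parts[k] != t}


if __name__ == "__main__":
    for key, (total, summed) in find_mismatches(SAMPLE).items():
        print(dict(key), "total:", total, "sum of parts:", summed)
```

Grouping on every field other than `Literacy` and `Value` keeps the check independent of which filters (age, area, year) were left in the download.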

Load CIA Factbook

Note: The CIA World Factbook has been sunset; see CLDR-19239.

Convert the data

  1. If you saw any different country names above, you’ll need to edit external/alternate_country_names.txt to add them. For example, we needed to add Czechia in 2016.
  2. If two-letter non-countries are added, then you’ll need to adjust StandardCodes.isCountry.
  3. To find out whether either case applies, run “AddPopulationData -DADD_POP=true” and look for errors.
  4. java -jar -DADD_POP=true -DCLDR_DIR=${HOME}/src/cldr cldr.jar org.unicode.cldr.tool.AddPopulationData
  5. Once everything looks ok, check everything in to git.
  6. Once done, run the ConvertLanguageData tool as described in Update Language Script Info.
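
As a complement to step 1, the following Python sketch shows one way to spot country names that are not yet listed as alternates. It is illustrative only. The line layout it parses (`CODE ; Alternate Name`, with `#` comments) is an assumption and may not match the real external/alternate_country_names.txt, and the helper names are hypothetical.

```python
def parse_alternate_names(text):
    """Parse lines in the ASSUMED layout 'CODE ; Alternate Name'.

    Blank lines, '#' comments, and lines without ';' are skipped.
    Returns {casefolded name: code}.
    """
    names = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()
        if not line or ";" not in line:
            continue
        code, name = (part.strip() for part in line.split(";", 1))
        names[name.casefold()] = code
    return names


def unknown_names(seen, known):
    """Return the sorted names from `seen` that are missing from `known`."""
    return sorted(n for n in set(seen) if n.casefold() not in known)


if __name__ == "__main__":
    # Made-up sample content, not the real file.
    sample_file = "CZ ; Czech Republic\nBS ; The Bahamas  # example comment\n"
    known = parse_alternate_names(sample_file)
    print(unknown_names(["Czechia", "The Bahamas", "Czech Republic"], known))
```

Names reported by `unknown_names` are only candidates to review by hand; running AddPopulationData and reading its errors remains the actual check.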