Revision History

This page contains notes on data and documentation issues, fixes, and revisions. Please inform the IPUMS staff (at of any problems with the database, so we can make corrections.

March 2018: Merged NAPP datasets into IPUMS International

For the purpose of longterm preservation and sustainability, all NAPP datasets will henceforth be accessed through the IPUMS International dissemination system.

NAPP data users should note that many NAPP variables are available from IPUMS International by different names. For a complete list of NAPP variables that have been renamed in IPUMS-International, refer to the crosswalk. Note that all original detail in the census datasets is retained in the source variables accessible through IPUMS International.

August 2017: Added new and revised datasets

Released new full-count datasets for Great Britain 1851, 1861, 1871 (Scotland only), 1891, and 1901.

Released revised full-count data for Great Britain 1881.

Released a new full-count dataset for Sweden 1910.

Released revised full-count datasets for Sweden 1890 and 1900. The revision includes the following changes that improve comparability across Sweden datasets:

November 2016: Added new dataset

Released the full-count dataset for United States 1850.

November 2015: Added new and revised datasets

Released new full-count datasets for Great Britain 1911, Denmark 1787 and 1801, Iceland 1703 and 1910, and Sweden 1880.

Released a new dataset for Iceland 1729 which contains full-count data for three counties: Rangárvallasýsla, Árnessýsla, Hnappadalssýsla.

Released a new 5% sample for Canada 1911.

Released a revised full-count dataset for United States 1880. The revision includes the following additions and improvements:

Released a revised full-count dataset for Iceland 1901. The revision includes improved household breaks as well as refined versions of parish and farm ID variables. The dataset also includes a new parish of birth variable.

June 2012: Added new datasets and linked samples

Released a new full-count dataset for Noway 1910.

Released a new full-count dataset for Sweden 1890 and a slightly revised full-count dataset for Sweden 1900.

Released samples of linked males, females and couples across the 1851 and 1881 Great Britain datasets.

July 2011: Added new datasets

Released new full-count datasets for Iceland 1801 and 1901.

Released a new full-count dataset for Norway 1801.

Released two new samples for Canada. The 1852 sample is a systematic 1-in-5 sample of the national population. The 1891 sample combines three slightly overlapping subsamples of 5, 10 and 100% into one national sample.

February 2011: Improved web interface

Introduced a new version of the web user interface for browsing variables and creating data extracts. The new system is explicitly designed around the concept of a "data cart" to which one adds variables and samples while browsing, and from which one "checks out" to generate a data extract.

July 2010: Added new datasets, released expanded dataset, and added linked data samples

Released new datasets from the United States from 1850 to 1870 and 1900 to 1910.

Added an expanded version of the 1880 United States dataset, with additional education and disability variables and a 1-in-5 oversample of the minority population.

Released a new sample from Mecklenburg-Schwerin 1819, which includes full count data for the city of Rostock.

Linked datasets across samples in the United States and Norway. The datasets for Norway include linked males and couples across all three census years from 1865 to 1900. The datasets for the United States include 7 linked pairs of census years involving the 880 complete count data. The linked years include: 1850-1880, 1860-1880, 1870-1880, 1880-1900, 1880-1910, 1880-1920, and 1880-1930. We have created three independent linked samples for each paired year: linked men, linked women, and linked married couples. For more information on the linked samples, refer to the linked samples page.

October 2008: Released new complete count dataset, revised existing datasets, and revised the data extraction system

Released a complete count dataset of Sweden 1900.

Revised existing complete count datasets (Canada 1881, Norway 1865, Norway 1900, United States 1880, England and Wales 1881, and Scotland 1881) and sample datasets (Canada 1871, Canada 1901, and Norway 1875).

Significantly altered our web interface to accommodate the growing number of samples and variables and to give users greater control while browsing variables or defining a data extract. The NAPP data extract system now replicates the IPUMS-International data extract system.

October 2006: Released four new datasets, revised existing datasets, and implemented a new extract system

Released four new datasets (Canada 1871 and 1901, Norway 1865, Scotland 1881).

Revised all existing datasets (Canada 1881, England and Wales 1881, Norway 1900, United States 1880). New and revised datasets contain a substantial number of newly constructed variables:

Family interrelationship variables: Added variables on number of couples, mothers, and fathers in household. Added grandparent pointers (analogous to the existing MOMLOC, POPLOC, and SPLOC pointer variables, but for grandparents). Number of sons or daughters married or unmaried is now available for all datasets with relationship information; previously, variables were only available for England and Wales 1881. Added new variable for number of children under age 10, analogous to the existing NCHLT5 variable. Relationship to household head codes are now available in IPUMS-International format.

Geographic variables: For Canada and the United States, urban residence, (IPUMS compatible) city codes, and city populations are now available. Enumeration and supervisors districts are available for the United States.

Work and employment variables: Labor force participation is now available for all samples. Harmonized occupational codes (adapted from the HISCO coding scheme) are available for the United States, Canada, and Norway. PRODUCT codes for sales workers are now available for both the United States and Norway. Standardized occupational strings (OCCLABEL) are available for the United States; this variable corrects spelling mistakes, expands abbreviations, and standardizes common phrases, to allow researchers searching for very specific occupations a better chance of finding these individuals.

Ethnicity and migration variables: Simplified country of birth codes identifying individuals as being born in a specific NAPP country, or in any other country (NAPPSTER), are now available for all samples. SPANNAME is now available for the United States.

Other variables: AGEMONTH now available for the United States.

Moved data to a completely new extract system that is consistent with the IPUMS-USA and IPUMS-International extract systems.

2005: Added data, harmonized additional occupational data, added constructed family relationship variables

Completed additions to the 2004 data release, including adding Scotland to the Great Britain sample, harmonizing occupational data for Great Britain, and adding imputed relationships for Canada 1881.

December 2004: Posted revisions to the Canadian and United States datasets

Corrected missing values in Canada 1881.

Corrected missing values and improved the code for constructing household inter-relationship pointers for the United States 1880 data. Also added the following new variables:

LABFORCE: Added labor force participation variable based on the gainful occupation definition. This variable is consistent with the IPUMS LABFORCE variable for all pre-1940 censuses.

SEIUS: This variable for the Duncan Socioeconomic Index is consistent with the IPUMS variable SEI.

OCSCORUS: This occupational income score variable reports median total income in 1950 for the occupation (OCC50US). The unit of this variable is hundreds of 1950 dollars. Thus, an occupational income score of 70 means that the median total income of all people with the same occupation in 1950 was $7000. This variable is consistent with the IPUMS variable OCCSCORE.

NAPPSTER: This recode of country of birth identifies the five NAPP countries, assigns all other birthplaces to one code, and retains a code for unknown birthplace. This variable allows users to easily select all people born in any NAPP country.

SEAUS: This variable for state economic area (a grouping of contiguous counties that had close economic ties at the 1940 and 1950 censuses) is consistent with the IPUMS variable SEA, except in the Dakota Territory.

YEAR: Reports the year the census was conducted. Note that this is a four digit variable, while the IPUMS YEAR variable uses two digits.

RELEASED: Reports the date this version of the data was released.

November 2004: Released two new census datasets

All variables that are common to the censuses of Great Britain, Canada, Norway, and the United States are coded in harmonized coding schemes.

Norway 1900: users should be aware of the following data characteristics.

England and Wales 1881: Users should be aware of the following characteristics

December 2003: Released Canada 1881 data

Released a new dataset for Canada 1881. All variables that are common to the censuses of the United States and Canada are coded in harmonized coding schemes. Users should note the following issues:

August 2003: Updated the United States 1880 data

We have posted some corrections to the previous dataset.

July 2003: Released preliminary United States 1880 data

Released preliminary data from the U.S. Census of 1880. These preliminary data are not complete. Users should be aware of several issues.