- Questionnaire Text
- UnHarmonized Variables
160 sets of 9-digit replicate weights are included in extracts where this selection is made. The variables will be named REPWTP1 through REPWTP160. Selecting replicate weights will dramatically increase the size and processing time of extracts; users should request them only if they plan to use them.
REPWTP1-REPWTP160 are 9-digit numeric variables with four implied decimal places.
REPWTP provides 160 separate person-level weights that allow users to generate empirically derived standard errors. Household-level replicate weights are available in REPWT.
More information about replicate weight creation and use is available on the IPUMS CPS replicate weights FAQ page and in the Census Bureau's "Estimating ASEC Variances with Replicate Weights" document.
Calculating the standard error of an estimate enables the construction of a confidence interval around the sample estimate of interest and may also be used in hypothesis testing. In theory, the standard error of an estimate measures the variation of a statistic across multiple samples of a given population. Researchers can use replicate weights to mirror this theoretical approach when only sample data is available, and the resulting standard errors have a higher degree of precision than standard asymptotic standard errors.
The 2005-onward CPS samples contain 160 replicate weights at the person level (variables named REPWTP1 through REPWTP160). For self-representing strata (i.e., areas with populations large enough to be represented with certainty in the sample), the replicate weights were produced using what is known as the successive difference replication (SDR) method. For non-self-representing strata (areas that are not represented with certainty in the sample), the replicate weights were produced using the modified half-sample technique. Both methods involve repeated implementations of the initial (full-sample) weighting algorithm, such that full information about the CPS sample is available in the replicate weights. Nevertheless, users should use these replicate weights only for generating variance estimates, not for obtaining unique parameter estimates.
Estimates on the entire population are prepared by projecting forward the resident population from the last available census. These projections are derived by updating the demographic census data from a number of other data sources that account for death, births and net migration. About 3 years after every census (i.e. 2003 for the 2000 Census and 2013 for the 2010 Census), the Census Bureau updates its independent population control and provides a new weight for the relevant years.
Two important points should be noted here. First, the lag between when the Census is conducted and when the CPS weights are updated is about 3 years. While the Census data are being processed, the CPS files are made available using the weighting scheme from the US Census prior to the latest Census. Second, once the files are updated, the old weights become obsolete and are replaced in the IPUMS data extract system. Published estimates from the lag years that use the old weights are not always updated. For example, 2010 poverty estimates were released in March using the 2000 population controls. Once the 2010 population controls were made available, IPUMS CPS replaced the March 2010, 2011, and 2012 weights that are based on the 2000 population control with weights that are based on the 2010 population controls. IPUMS CPS replaced March 2020 and 2021 weights that are based on the 2010 population control with weights that are based on the 2020 population controls.
IPUMS CPS makes available only the most up-to-date weights. The old values are available here: Old SPM and Weights Values.
User Note: The successive difference replication approach (SDR) is different from other methods for creating replicate weights such as balanced repeated replication (BRR) and jackknife estimation, and standard statistical software packages have no built-in method to handle them. However, Stata's jackknife standard error program can be adapted to calculate replicate standard errors for CPS data; see the IPUMS CPS replicate weights FAQ for details.
User Caution: For analyses that include the 2014 ASEC sample, please see the comparability tab. For analyses that include ASEC samples from before and after 2020, please see the comparability tab.
Additionally, it is possible for replicate weights to take negative values for certain cases; again, users should use these weights only for variance estimation purposes and not to obtain independent estimates.
Replicate weights are only available for CPS from 2005 and on.
In the 2014 ASEC sample, the Census Bureau divided the sample randomly into a 3/8 group and a 5/8 group. The smaller portion of the sample was given the redesigned income questions. The replicate weights were assigned to records so that both the 3/8ths file and the 5/8ths file independently sum to be representative of the entire US population. The flag variable HFLAG differentiates the respondent records. Please see our User Note on the 3/8 file redesign.
The most recent decennial Census has typically been used as the base population estimate, however, the 2020 Census was not suitable for this purpose due to the imposition of differential privacy meant that variables required for estimates and linking records to administrative data were unavailable and delays in both enumeration and data processing brought about by the COVID-19 pandemic. To deal with these challenges, a "Blended Base" population was established, combining information from multiple resources to arrive at the base population estimate historically provided by the decennial Census. For more information on the construction of the Blended Base for vintage 2021 estimates, see Methodology for the United States Population Estimates: Vintage 2021. The Census Bureau recommends using the 2020 Census-based weights when comparing the 2020 or 2021 ASEC to the 2022 and later ASEC data; these data are incorporated into IPUMS CPS REPWTP. The 2010 Census-based weights should be used when analyzing the 2020 or 2021 ASEC data with earlier years; these data can be downloaded from this page. See the Census Bureau's Guidance on Using Replicate Weights for more information.
Comparability with IPUMS-USA
The person-level replicate weights are, broadly speaking, comparable to the person-level replicate weights in the ACS and PRCS samples 2005 and later. However, the details of the replicate weight calculations differ for non-self-representing CPS strata. Additionally, ACS/PRCS replicate weights are integers, while CPS replicate weights are not.
- All persons.
|2005 – 2022||-||-||X||-||-||-||-||-||-||-||-||-|