The use of administrative data based official statistics has extraordinarily increased in the last few decades. As administrative data exist as a product of systems designed for other primary purposes, it is particularly important for statisticians to ensure that the resulting statistics meet the quality users’ needs. These include evaluating the coverage error that is one of the main sources of the bias when producing administrative based statistics. The coverage errors are traditionally addressed through a coverage survey called the Post-enumeration Survey (PES). The PES is an independent survey of enumeration areas, which measures the accuracy of the population count. In this paper we address the problem by estimating the under-coverage through a spatial sampling carried out on the enumeration areas. We computed the relative Root Mean Squared Errors (RRMSE) of the estimates obtained by using some Spatially Balanced Designs and we used as a benchmark those obtained with a classical sampling design: the Simple Random Sampling (SRS). The results show that the gain obtained by using these sampling designs is not trivial. The same experiment was conducted both with constant first order inclusion probabilities and with probabilities proportional to the size of the enumeration areas (PPS).

Integrating census and administrative data sources within a spatial sampling framework

BENEDETTI, ROBERTO;PIERSIMONI, FEDERICA
2015-01-01

Abstract

The use of administrative data based official statistics has extraordinarily increased in the last few decades. As administrative data exist as a product of systems designed for other primary purposes, it is particularly important for statisticians to ensure that the resulting statistics meet the quality users’ needs. These include evaluating the coverage error that is one of the main sources of the bias when producing administrative based statistics. The coverage errors are traditionally addressed through a coverage survey called the Post-enumeration Survey (PES). The PES is an independent survey of enumeration areas, which measures the accuracy of the population count. In this paper we address the problem by estimating the under-coverage through a spatial sampling carried out on the enumeration areas. We computed the relative Root Mean Squared Errors (RRMSE) of the estimates obtained by using some Spatially Balanced Designs and we used as a benchmark those obtained with a classical sampling design: the Simple Random Sampling (SRS). The results show that the gain obtained by using these sampling designs is not trivial. The same experiment was conducted both with constant first order inclusion probabilities and with probabilities proportional to the size of the enumeration areas (PPS).
2015
978-90-73592-35-3
File in questo prodotto:
File Dimensione Formato  
ISI (2015)-IntData.pdf

Solo gestori archivio

Tipologia: PDF editoriale
Dimensione 1.92 MB
Formato Adobe PDF
1.92 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11564/671157
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact