SciELO - Scientific Electronic Library Online

 
vol.11 número2El problema de la motivación y desarrollo del capital humano en las pymes de Rosario, Argentina: Una propuesta basada en neurocienciasEstrategias de tic en empresas de Córdoba, Argentina: un modelo estructural índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

  • No hay articulos citadosCitado por SciELO

Links relacionados

  • No hay articulos similaresSimilares en SciELO

Compartir


SaberEs

versión impresa ISSN 1852-4418versión On-line ISSN 1852-4222

Resumen

RIANO, María Eugenia. Missing data imputation using spatial statistics techniques applied to Uruguay census of population and housing. SaberEs [online]. 2019, vol.11, n.2, pp.153-169. ISSN 1852-4418.

Uruguay National Census was quality and coverage positively evaluated in general, attaining international standard requirements. However, the data collecting process had some difficulties. The omission are concentrated in segments socioeconomically vulnerable. This could have an impact over the algorithm performed by the government to select the beneficiary population of cash-transfer programs. The heterogeneous spatial pattern of the target population and of the omission itself makes necessary define regions for the imputation of the missing data. Regions are obtained by means of spatial oblique decision trees. Spatial Autorregresive models are adjusted for each region. The models are assessed using cross-validation methods. Results are compared with the performance of a global model for the whole map. Except by one region, models that minimize cross-validation's errors show a similar lag in each region. The cross-validation error for the global model is quite similar. Nevertheless, spatial autocorrelation is detected according to the Moran test for residuals. Hence, the data imputation is performed by regions, with local SAR models, selecting the lag according to the cross-validation error. Results show that target population is underestimated approximately by a 5% over the total obtained with census data.

Palabras clave : Classification and Regression trees; Cross-validation; SAR models.

        · resumen en Español     · texto en Español

 

Creative Commons License Todo el contenido de esta revista, excepto dónde está identificado, está bajo una Licencia Creative Commons