SciELO - Scientific Electronic Library Online

 
vol.16 número3Una mirada cuantitativa sobre las primeras cuatro décadas de la zooarqueología de los Andes Centro-SurPensamiento Integrado: Agregación de conjuntos de datos arqueológicos a escala internacional índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

  • No hay articulos citadosCitado por SciELO

Links relacionados

  • No hay articulos similaresSimilares en SciELO

Compartir


Revista del Museo de Antropología

versión impresa ISSN 1852-060Xversión On-line ISSN 1852-4826

Resumen

AGUILAR, Humberto. Scraping Archaeology: A Methodological Approach from the Web Scraping and Text Mining. Rev. Mus. Antropol. [online]. 2023, vol.16, n.3, pp.439-450.  Epub 28-Dic-2023. ISSN 1852-060X.  http://dx.doi.org/10.31048/1852.4826.v16.n2.41094.

As the amount of information available on the web increases, so does the task of locating and analysing it, and performing this task manually can be costly in terms of time and effort. Although search engines and database engines can help to find the required information, in large digital infrastructures where search results are in the thousands - or more - new tools are needed to effectively retrieve the searched content. This paper proposes the application of Web Scraping and Text Mining as methodological inputs to be able to compile and process large volumes of data in digital infrastructures in a more automated way. The automation of both processes provides a great advantage in analysing textual corpora of thousands of records, which significantly simplifies the collection of different types of data, facilitating the work considerably. It is hoped that this contribution will expand the possibilities of the archaeological community in terms of a novel methodology for the collection and handling of structured and unstructured data that can be integrated into the research of the wider archaeological community.

Palabras clave : R; Web scraping; Text mining; Data analytics; Digital Archaeology.

        · resumen en Español     · texto en Español     · Español ( pdf )