The representativeness threshold for the CETA subcorpus of the Coruña Corpus

Authors

Keywords:

representativeness, ReCor, specialised corpus, Zipf's Law, N-gram, Coruña Corpus, CETA, astronomy

Abstract

Representativeness is the main characteristic that distinguishes specialised corpora from other sets of texts. The Coruña Corpus of English Scientific Writing currently comprises four published subcorpora (astronomy, life sciences, history and philosophy) and three others under compilation (physics, chemistry and linguistics). In this paper we use the ReCor tool to assess, a posteriori, the lexical density of the text samples in CETA, the Corpus of English Texts on Astronomy. The study is motivated by the following question: does a quantitative representativeness analysis with ReCor provide, as a cross-check, further validation of previous research on the representativeness of CETA? Previous work (Crespo and Moskowich, 2010) indicated that CETA is well designed and valid for the purposes for which it was intended. Here we propose metrics to quantify these findings. The main contribution of this study is a set of quantitative results obtained with the ReCor tool, which allows data triangulation and consequently ensures overall data quality. The results show that the ReCor analysis supports previous findings, and we are thus able to verify that CETA is indeed representative of the language of its time and register.
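As an illustration only, the kind of a posteriori check described above can be sketched as a cumulative type/token ratio computed while text samples are added to a corpus. This is not ReCor's implementation, which the abstract does not describe: the function name, the tokenisation rule and the toy samples are all assumptions made for the example, and lexical density is assumed here to mean types divided by tokens.

```python
import re


def cumulative_type_token_ratios(documents):
    """Return the type/token ratio of the growing corpus after each document.

    Hypothetical helper for illustration; tokens are lower-cased runs of
    letters and apostrophes, which is an assumed, simplistic tokenisation.
    """
    seen_types = set()
    total_tokens = 0
    ratios = []
    for text in documents:
        tokens = re.findall(r"[a-z']+", text.lower())
        total_tokens += len(tokens)
        seen_types.update(tokens)
        # Guard against an empty corpus so far (no tokens seen yet).
        ratios.append(len(seen_types) / total_tokens if total_tokens else 0.0)
    return ratios


# Toy samples (invented, not taken from CETA).
samples = [
    "the moon moves round the earth",
    "the earth moves round the sun",
    "the sun and the moon",
]
ratios = cumulative_type_token_ratios(samples)
```

In a sketch like this, the ratio falls as samples are added because fewer new word types appear; the point at which the curve flattens is one informal way of reading a representativeness threshold.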

 


Author biographies

  • Elena Alfaya-Lamas, Universidade da Coruña

    Elena Alfaya-Lamas obtained an MA in Germanic Philology from the University of Santiago de Compostela in 1994 and a PhD in English Historical Linguistics in 2002. From 1998 to 2000 she was a postgraduate worker and scholarship holder in the Department of Linguistics of the University of Edinburgh. In November 2001 she became an Associate Lecturer at CESUGA-University College Dublin. In October 2003 she obtained a position as an “Isidro Parga Pondal” researcher at the University of A Coruña, and in October 2004 she became a Lecturer and Researcher in the area of Information and Documentation Science at the same university.

    Her main research interests are historical linguistics, cognitive linguistics, discourse analysis, gender studies and mind-consciousness studies. She studied for years with the Mindfulness Association and the Kagyu lineage, developing competence in the range of skills necessary to teach Mindfulness and meeting the Mindfulness-Based Interventions: Teaching Assessment Criteria (MBI:TAC) of the Universities of Bangor, Exeter and Oxford. She currently co-heads the Mindfulness Association in Spain.

    She teaches Informational Behaviour, Historical Archives and Records Management, Scientific Research Techniques and Digital and Information Management.

     

  • Menchu Garrote Espantoso, Universidade da Coruña

    Menchu Garrote graduated in Information and Documentation from the Faculty of Humanities and Documentation of the Universidade da Coruña in 2019. She was awarded the Premio Extraordinario Fin de Estudios (Universidade da Coruña) and the Premio Excelencia Académica de Galicia (Xunta de Galicia). She is currently studying for the Master's Degree in Historical Heritage: Research and Management (Máster Universitario en Patrimonio Histórico: Investigación y Gestión) at the Toledo Campus of the Universidad de Castilla-La Mancha. Her first contact with research came during the 2018/19 academic year, when she obtained a collaboration grant for complementary training in the university departments of the UDC's own centres. Supervised by Dr Alfaya-Lamas, she began researching the Coruña Corpus, designed by the MUSTE research group of the UDC.


Published

2021-12-08

How to cite

The representativeness threshold for the CETA subcorpus of the Coruña Corpus. (2021). Revista De Lenguas Para Fines Específicos, 27(2), 125-139. https://ojsspdc.ulpgc.es/ojs/index.php/LFE/article/view/1402
