Diseño y construcción de un corpus oral multidialectal. El corpus amaresco

  1. Carcelén Guerrero, Andrea 1
  2. Uclés Ramada, Gloria
  1. 1 Universitat de València
    info

    Universitat de València

    Valencia, España

    ROR https://ror.org/043nxc105

Journal:
Normas: revista de estudios lingüísticos hispánicos

ISSN: 2174-7245

Year of publication: 2019

Volume: 9

Issue: 1

Pages: 17-36

Type: Article

DOI: 10.7203/NORMAS.V9I1.16007 DIALNET GOOGLE SCHOLAR lock_openDialnet editor

More publications in: Normas: revista de estudios lingüísticos hispánicos

Abstract

This paper describes the protocol used to build the Ameresco corpus (America Colloquial Spanish). Collecting a corpus containing more than one dialect poses a series of challenges. On the one hand, managing a large number of external teams requires that the methodology used is sound. On the other hand, the methodology should be in line with the goals that the project aims to reach and with essential corpus design features such as issues when recording, the transcription and labelling system and the anonymisation of sensitive data. All these aspects should be thoughtfully chosen so that the quality standards set by the scientific community are reached.