Diseño y construcción de un corpus oral multidialectal. El corpus Ameresco

  1. Carcelén Guerrero, Andrea 1
  2. Uclés Ramada, Gloria
  1. 1 Universitat de València, Valencia, Spain
     ROR https://ror.org/043nxc105

Journal:
Normas: revista de estudios lingüísticos hispánicos

ISSN: 2174-7245

Year of publication: 2019

Volume: 9

Issue: 1

Pages: 17-36

Type: Article

DOI: 10.7203/NORMAS.V9I1.16007

Abstract

This paper describes the protocol used to build the Ameresco corpus (America Colloquial Spanish). Collecting a corpus that covers more than one dialect poses a series of challenges. On the one hand, managing a large number of external teams requires a sound methodology. On the other hand, the methodology should be in line with the goals of the project and with essential corpus design features, such as the handling of recording issues, the transcription and labelling system, and the anonymisation of sensitive data. All of these aspects should be chosen carefully so that the corpus meets the quality standards set by the scientific community.