Publicaciones (5) Publicaciones de LEOPOLDO PLA SEMPERE

2023

  1. MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages

    Proceedings of the 24th Annual Conference of the European Association for Machine Translation, EAMT 2023

2022

  1. Building Domain-specific Corpora from the Web: the Case of European Digital Service Infrastructures

    Proceedings of the International Conference on Language Resources and Evaluation, LREC 2022 - 15th Workshop on Building and Using Comparable Corpora, BUCC 2022

  2. MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages

    EAMT 2022 - Proceedings of the 23rd Annual Conference of the European Association for Machine Translation

2020

  1. ParaCrawl: Web-Scale Acquisition of Parallel Corpora

    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020)

  2. ParaCrawl: Web-scale acquisition of parallel corpora

    Proceedings of the Annual Meeting of the Association for Computational Linguistics