Publications (437). Publications in which at least one researcher has participated.
2024
-
A Universal Dependencies Treebank for Highland Puebla Nahuatl
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024
-
Becoming a High-Resource Language in Speech: The Catalan Case in the Common Voice Corpus
2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings
-
Building a Data Infrastructure for a Mid-Resource Language: The Case of Catalan
2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings
-
CLARIAH-ES: Strategic Network for the Integration in the European Research Infrastructures in Social Sciences and Humanities
CEUR Workshop Proceedings
-
Curated Datasets and Neural Models for Machine Translation of Informal Registers between Mayan and Spanish Vernaculars
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024
-
Developing a Benchmark for Pronunciation Feedback: Creation of a Phonemically Annotated Speech Corpus of isiZulu Language Learner Speech
2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings
-
Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages
2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings
-
FastSpell: the LangId Magic Spell
2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings
-
Non-Fluent Synthetic Target-Language Data Improve Neural Machine Translation
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 46, No. 2, pp. 837-850
-
Producing a Parallel Universal Dependencies Treebank of Ancient Hebrew and Ancient Greek via Cross-Lingual Projection
2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings
-
Universal Dependencies for Saraiki
Joint Workshop on Multiword Expressions and Universal Dependencies, MWE-UD 2024 at LREC-COLING 2024 - Workshop Proceedings
2023
-
A benchmark of Spanish language datasets for computationally driven research
Journal of Information Science, Vol. 49, No. 6, pp. 1451-1461
-
A finite-state morphological analyser for Highland Puebla Nahuatl
Proceedings of the Annual Meeting of the Association for Computational Linguistics
-
Codex to corpus: Exploring annotation and processing for an open and extensible machine-readable edition of the Florentine Codex
Proceedings of the Annual Meeting of the Association for Computational Linguistics
-
Comparing methods of orthographic conversion for Bàsàá, a language of Cameroon
4th Workshop on Resources for African Indigenous Languages, RAIL 2023 - Proceedings of the Workshop
-
Developing finite-state language technology for Maya
Proceedings of the Annual Meeting of the Association for Computational Linguistics
-
Exploiting large pre-trained models for low-resource neural machine translation
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, EAMT 2023
-
Identifying Student Profiles Within Online Judge Systems Using Explainable Artificial Intelligence
IEEE Transactions on Learning Technologies, Vol. 16, No. 6, pp. 955-969
-
Introduction
TLT 2023 - 21st International Workshop on Treebanks and Linguistic Theories (TLT, GURT/SyntaxFest 2023), Proceedings of the Conference
-
Introduction
CxGsNLP 2023 - 1st International Workshop on Construction Grammars and NLP (CxGs+NLP, GURT/SyntaxFest 2023), Proceedings of the Conference