Publicaciones (141) Publicaciones de JORGE CALVO ZARAGOZA

2024

  1. A TRANSFORMER APPROACH FOR POLYPHONIC AUDIO-TO-SCORE TRANSCRIPTION

    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

2023

  1. A Holistic Approach for Aligned Music and Lyrics Transcription

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  2. A Weakly-Supervised Approach for Layout Analysis in Music Score Images

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  3. Addressing Class Imbalance in Multilabel Prototype Generation for k-Nearest Neighbor Classification

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  4. An experimental study on marine debris location and recognition using object detection

    Pattern Recognition Letters, Vol. 168, pp. 154-161

  5. Automatic Detection of Comic Characters: An Analysis of Model Robustness Across Domains

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  6. End-to-End page-Level assessment of handwritten text recognition

    Pattern Recognition, Vol. 142

  7. End-to-end optical music recognition for pianoform sheet music

    International Journal on Document Analysis and Recognition

  8. Evaluating Domain Generalization in Kitchen Utensils Classification

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  9. Few-Shot Music Symbol Classification via Self-Supervised Learning and Nearest Neighbor

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  10. Few-shot symbol classification via self-supervised learning and nearest neighbor

    Pattern Recognition Letters, Vol. 167, pp. 1-8

  11. Insights into end-to-end audio-to-score transcription with real recordings: A case study with saxophone works

    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

  12. Kurcuma: a kitchen utensil recognition collection for unsupervised domain adaptation

    Pattern Analysis and Applications, Vol. 26, Núm. 4, pp. 1557-1569

  13. Late multimodal fusion for image and audio music transcription

    Expert Systems with Applications, Vol. 216

  14. Lifelong Learning for Document Image Binarization: An Experimental Study

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  15. Multimodal Strategies for Image and Audio Music Transcription: A Comparative Study

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  16. Multimodal recognition of frustration during game-play with deep neural networks

    Multimedia Tools and Applications, Vol. 82, Núm. 9, pp. 13617-13636

  17. Optical Music Recognition: Recent Advances, Current Challenges, and Future Directions

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  18. Optical music recognition for homophonic scores with neural networks and synthetic music generation

    International Journal of Multimedia Information Retrieval, Vol. 12, Núm. 1

  19. Test-Time Augmentation for Document Image Binarization

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)