Skip to main content

Showing 1–50 of 61 results for author: Gomez, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.13565  [pdf, other

    cs.CY cs.AI cs.HC

    Aligning Trustworthy AI with Democracy: A Dual Taxonomy of Opportunities and Risks

    Authors: Oier Mentxaka, Natalia Díaz-Rodríguez, Mark Coeckelbergh, Marcos López de Prado, Emilia Gómez, David Fernández Llorca, Enrique Herrera-Viedma, Francisco Herrera

    Abstract: Artificial Intelligence (AI) poses both significant risks and valuable opportunities for democratic governance. This paper introduces a dual taxonomy to evaluate AI's complex relationship with democracy: the AI Risks to Democracy (AIRD) taxonomy, which identifies how AI can undermine core democratic principles such as autonomy, fairness, and trust; and the AI's Positive Contributions to Democracy… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 26 pages, 5 figures

  2. arXiv:2502.06559  [pdf, other

    cs.AI

    Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation

    Authors: Maria Eriksson, Erasmo Purificato, Arman Noroozian, Joao Vinagre, Guillaume Chaslot, Emilia Gomez, David Fernandez-Llorca

    Abstract: Quantitative Artificial Intelligence (AI) Benchmarks have emerged as fundamental tools for evaluating the performance, capability, and safety of AI models and systems. Currently, they shape the direction of AI development and are playing an increasingly prominent role in regulatory frameworks. As their influence grows, however, so too does concerns about how and with what effects they evaluate hig… ▽ More

    Submitted 25 May, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: Under review as conference paper

    ACM Class: I.2.0; A.1

  3. arXiv:2501.06137  [pdf, ps, other

    cs.AI cs.CY cs.SI

    Supervision policies can shape long-term risk management in general-purpose AI models

    Authors: Manuel Cebrian, Emilia Gomez, David Fernandez Llorca

    Abstract: The rapid proliferation and deployment of General-Purpose AI (GPAI) models, including large language models (LLMs), present unprecedented challenges for AI supervisory entities. We hypothesize that these entities will need to navigate an emergent ecosystem of risk and incident reporting, likely to exceed their supervision capacity. To investigate this, we develop a simulation framework parameteriz… ▽ More

    Submitted 10 June, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

    Comments: 24 pages, 14 figures

  4. arXiv:2411.03474  [pdf, other

    cs.CE

    GRATEV2.0: Computational Tools for Real-time Analysis of High-throughput High-resolution TEM (HRTEM) Images of Conjugated Polymers

    Authors: Dhruv Gamdha, Ryan Fair, Adarsh Krishnamurthy, Enrique Gomez, Baskar Ganapathysubramanian

    Abstract: Automated analysis of high-resolution transmission electron microscopy (HRTEM) images is increasingly essential for advancing research in organic electronics, where precise characterization of nanoscale crystal structures is crucial for optimizing material properties. This paper introduces an open-source computational framework called GRATEV2.0 (GRaph-based Analysis of TEM), designed for real-time… ▽ More

    Submitted 24 December, 2024; v1 submitted 5 November, 2024; originally announced November 2024.

    Comments: 27 pages, 13 figures, 5 tables

  5. arXiv:2407.14364  [pdf, other

    cs.SD cs.AI cs.MM eess.AS

    Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio

    Authors: Roser Batlle-Roca, Wei-Hisang Liao, Xavier Serra, Yuki Mitsufuji, Emilia Gómez

    Abstract: Recent advancements in music generation are raising multiple concerns about the implications of AI in creative music processes, current business models and impacts related to intellectual property management. A relevant discussion and related technical challenge is the potential replication and plagiarism of the training set in AI-generated music, which could lead to misuse of data and intellectua… ▽ More

    Submitted 1 August, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: Accepted at ISMIR 2024

  6. arXiv:2403.14641  [pdf, other

    cs.CY cs.AI cs.LG

    Testing autonomous vehicles and AI: perspectives and challenges from cybersecurity, transparency, robustness and fairness

    Authors: David Fernández Llorca, Ronan Hamon, Henrik Junklewitz, Kathrin Grosse, Lars Kunze, Patrick Seiniger, Robert Swaim, Nick Reed, Alexandre Alahi, Emilia Gómez, Ignacio Sánchez, Akos Kriston

    Abstract: This study explores the complexities of integrating Artificial Intelligence (AI) into Autonomous Vehicles (AVs), examining the challenges introduced by AI components and the impact on testing procedures, focusing on some of the essential requirements for trustworthy AI. Topics addressed include the role of AI at various operational layers of AVs, the implications of the EU's AI Act on AVs, and the… ▽ More

    Submitted 21 February, 2024; originally announced March 2024.

    Comments: 44 pages, 8 figures, submitted to a peer-review journal

  7. Face Recognition: to Deploy or not to Deploy? A Framework for Assessing the Proportional Use of Face Recognition Systems in Real-World Scenarios

    Authors: Pablo Negri, Isabelle Hupont, Emilia Gomez

    Abstract: Face recognition (FR) has reached a high technical maturity. However, its use needs to be carefully assessed from an ethical perspective, especially in sensitive scenarios. This is precisely the focus of this paper: the use of FR for the identification of specific subjects in moderately to densely crowded spaces (e.g. public spaces, sports stadiums, train stations) and law enforcement scenarios. I… ▽ More

    Submitted 3 September, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Published a shorter version in IEEE International Conference on Automatic Face and Gesture Recognition 2024

  8. arXiv:2401.12593  [pdf, other

    cs.IR cs.AI

    MOReGIn: Multi-Objective Recommendation at the Global and Individual Levels

    Authors: Elizabeth Gómez, David Contreras, Ludovico Boratto, Maria Salamó

    Abstract: Multi-Objective Recommender Systems (MORSs) emerged as a paradigm to guarantee multiple (often conflicting) goals. Besides accuracy, a MORS can operate at the global level, where additional beyond-accuracy goals are met for the system as a whole, or at the individual level, meaning that the recommendations are tailored to the needs of each user. The state-of-the-art MORSs either operate at the glo… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  9. arXiv:2312.06306  [pdf, other

    cs.CV cs.AI

    Attribute Annotation and Bias Evaluation in Visual Datasets for Autonomous Driving

    Authors: David Fernández Llorca, Pedro Frau, Ignacio Parra, Rubén Izquierdo, Emilia Gómez

    Abstract: This paper addresses the often overlooked issue of fairness in the autonomous driving domain, particularly in vision-based perception and prediction systems, which play a pivotal role in the overall functioning of Autonomous Vehicles (AVs). We focus our analysis on biases present in some of the most commonly used visual datasets for training person and vehicle detection systems. We introduce an an… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Paper submitted to the IEEE TIV journal; 17 pages, 16 figures, 10 tables

  10. arXiv:2310.12786  [pdf, other

    cs.DC cs.PF

    SYNPA: SMT Performance Analysis and Allocation of Threads to Cores in ARM Processors

    Authors: Marta Navarro, Josué Feliu, Salvador Petit, María E. Gómez, Julio Sahuquillo

    Abstract: Simultaneous multithreading processors improve throughput over single-threaded processors thanks to sharing internal core resources among instructions from distinct threads. However, resource sharing introduces inter-thread interference within the core, which has a negative impact on individual application performance and can significantly increase the turnaround time of multi-program workloads. T… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 11 pages, 9 figures

  11. arXiv:2310.00091  [pdf, other

    cs.HC cs.SE

    Towards Automated Accessibility Report Generation for Mobile Apps

    Authors: Amanda Swearngin, Jason Wu, Xiaoyi Zhang, Esteban Gomez, Jen Coughenour, Rachel Stukenborg, Bhavya Garg, Greg Hughes, Adriana Hilliard, Jeffrey P. Bigham, Jeffrey Nichols

    Abstract: Many apps have basic accessibility issues, like missing labels or low contrast. Automated tools can help app developers catch basic issues, but can be laborious or require writing dedicated tests. We propose a system, motivated by a collaborative process with accessibility stakeholders at a large technology company, to generate whole app accessibility reports by combining varied data collection me… ▽ More

    Submitted 16 October, 2023; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: 24 pages, 8 figures

  12. arXiv:2309.03512  [pdf, other

    cs.IR cs.CY

    Behind Recommender Systems: the Geography of the ACM RecSys Community

    Authors: Lorenzo Porcaro, João Vinagre, Pedro Frau, Isabelle Hupont, Emilia Gómez

    Abstract: The amount and dissemination rate of media content accessible online is nowadays overwhelming. Recommender Systems filter this information into manageable streams or feeds, adapted to our personal needs or preferences. It is of utter importance that algorithms employed to filter information do not distort or cut out important elements from our perspectives of the world. Under this principle, it is… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: Presented at the 6th FAccTRec Workshop: Responsible Recommendation (FAccTRec '23), September 18, 2023, Singapore

  13. arXiv:2306.13701  [pdf, other

    cs.CY cs.AI

    Use case cards: a use case reporting framework inspired by the European AI Act

    Authors: Isabelle Hupont, David Fernández-Llorca, Sandra Baldassarri, Emilia Gómez

    Abstract: Despite recent efforts by the Artificial Intelligence (AI) community to move towards standardised procedures for documenting models, methods, systems or datasets, there is currently no methodology focused on use cases aligned with the risk-based approach of the European AI Act (AI Act). In this paper, we propose a new framework for the documentation of use cases, that we call "use case cards", bas… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 33 pages, 9 figures

  14. arXiv:2305.16809  [pdf

    cs.CL cs.AI cs.HC

    GenQ: Automated Question Generation to Support Caregivers While Reading Stories with Children

    Authors: Arun Balajiee Lekshmi Narayanan, Ligia E. Gomez, Martha Michelle Soto Fernandez, Tri Nguyen, Chris Blais, M. Adelaida Restrepo, Art Glenberg

    Abstract: When caregivers ask open--ended questions to motivate dialogue with children, it facilitates the child's reading comprehension skills.Although there is scope for use of technological tools, referred here as "intelligent tutoring systems", to scaffold this process, it is currently unclear whether existing intelligent systems that generate human--language like questions is beneficial. Additionally,… ▽ More

    Submitted 25 September, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  15. arXiv:2305.16109  [pdf, other

    cs.CE

    CACTUS: A Computational Framework for Generating Realistic White Matter Microstructure Substrates

    Authors: Juan Luis Villarreal-Haro, Remy Gardier, Erick J Canales-Rodriguez, Elda Fischi Gomez, Gabriel Girard, Jean-Philippe Thiran, Jonathan Rafael-Patino

    Abstract: Monte-Carlo diffusion simulations are a powerful tool for validating tissue microstructure models by generating synthetic diffusion-weighted magnetic resonance images (DW-MRI) in controlled environments. This is fundamental for understanding the link between micrometre-scale tissue properties and DW-MRI signals measured at the millimetre-scale, optimising acquisition protocols to target microstruc… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 21 pages, 7 figures

  16. arXiv:2305.09319  [pdf, other

    cs.IR

    Fairness and Diversity in Information Access Systems

    Authors: Lorenzo Porcaro, Carlos Castillo, Emilia Gómez, João Vinagre

    Abstract: Among the seven key requirements to achieve trustworthy AI proposed by the High-Level Expert Group on Artificial Intelligence (AI-HLEG) established by the European Commission (EC), the fifth requirement ("Diversity, non-discrimination and fairness") declares: "In order to achieve Trustworthy AI, we must enable inclusion and diversity throughout the entire AI system's life cycle. [...] This require… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: Presented at the European Workshop on Algorithmic Fairness (EWAF'23) Winterthur, Switzerland, June 7-9, 2023

  17. arXiv:2212.00592  [pdf, other

    cs.HC cs.IR

    Assessing the Impact of Music Recommendation Diversity on Listeners: A Longitudinal Study

    Authors: Lorenzo Porcaro, Emilia Gómez, Carlos Castillo

    Abstract: We present the results of a 12-week longitudinal user study wherein the participants, 110 subjects from Southern Europe, received on a daily basis Electronic Music (EM) diversified recommendations. By analyzing their explicit and implicit feedback, we show that exposure to specific levels of music recommendation diversity may be responsible for long-term impacts on listeners' attitudes. In particu… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  18. arXiv:2211.01817  [pdf, other

    cs.AI cs.CY cs.LG

    Liability regimes in the age of AI: a use-case driven analysis of the burden of proof

    Authors: David Fernández Llorca, Vicky Charisi, Ronan Hamon, Ignacio Sánchez, Emilia Gómez

    Abstract: New emerging technologies powered by Artificial Intelligence (AI) have the potential to disruptively transform our societies for the better. In particular, data-driven learning approaches (i.e., Machine Learning (ML)) have been a true revolution in the advancement of multiple technologies in various application domains. But at the same time there is growing concern about certain intrinsic characte… ▽ More

    Submitted 17 March, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: Paper published at the Journal of Artificial Intelligence Research

    Journal ref: Journal of Artificial Intelligence Research, Vol. 76 (2023), pp. 613-644

  19. arXiv:2211.00596  [pdf, other

    cs.DC cs.DM

    Algebra of N-event synchronization

    Authors: Ernesto Gomez, Keith E. Schubert, Khalil Dajani

    Abstract: We have previously defined synchronization (Gomez, E. and K. Schubert 2011) as a relation between the times at which a pair of events can happen, and introduced an algebra that covers all possible relations for such pairs. In this work we introduce the synchronization matrix, to make it easier to calculate the properties and results of $N$ event synchronizations, such as are commonly encountered i… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 9 pages, 2 figures

    ACM Class: B.4.3; D.3.1; D.3.2; D.3.3

  20. arXiv:2209.09666  [pdf, other

    cs.SE cs.AI

    Documenting use cases in the affective computing domain using Unified Modeling Language

    Authors: Isabelle Hupont, Emilia Gomez

    Abstract: The study of the ethical impact of AI and the design of trustworthy systems needs the analysis of the scenarios where AI systems are used, which is related to the software engineering concept of "use case" and the "intended purpose" legal term. However, there is no standard methodology for use case documentation covering the context of use, scope, functional requirements and risks of an AI system.… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 8 pages, 5 figures, 2 tables

  21. arXiv:2209.02403  [pdf

    cs.HC

    Guidelines to Develop Trustworthy Conversational Agents for Children

    Authors: Marina Escobar-Planas, Emilia Gómez, Carlos-D Martínez-Hinarejos

    Abstract: Conversational agents (CAs) embodied in speakers or chatbots are becoming very popular in some countries, and despite their adult-centred design, they have become part of children's lives, generating a need for children-centric trustworthy systems. This paper presents a literature review to identify the main opportunities, challenges and risks brought by CAs when used by children. We then consider… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: 19 pages

  22. Federated Data Analytics: A Study on Linear Models

    Authors: Xubo Yue, Raed Al Kontar, Ana María Estrada Gómez

    Abstract: As edge devices become increasingly powerful, data analytics are gradually moving from a centralized to a decentralized regime where edge compute resources are exploited to process more of the data locally. This regime of analytics is coined as federated data analytics (FDA). In spite of the recent success stories of FDA, most literature focuses exclusively on deep neural networks. In this work, w… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Journal ref: IISE Transactions, 2023

  23. arXiv:2203.01657  [pdf, other

    cs.AI

    Monitoring Diversity of AI Conferences: Lessons Learnt and Future Challenges in the DivinAI Project

    Authors: Isabelle Hupont, Emilia Gomez, Songul Tolan, Lorenzo Porcaro, Ana Freire

    Abstract: DivinAI is an open and collaborative initiative promoted by the European Commission's Joint Research Centre to measure and monitor diversity indicators related to AI conferences, with special focus on gender balance, geographical representation, and presence of academia vs companies. This paper summarizes the main achievements and lessons learnt during the first year of life of the DivinAI project… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: 5 pages, 3 figures

  24. arXiv:2201.10249  [pdf, ps, other

    cs.HC cs.IR

    Diversity in the Music Listening Experience: Insights from Focus Group Interviews

    Authors: Lorenzo Porcaro, Emilia Gómez, Carlos Castillo

    Abstract: Music listening in today's digital spaces is highly characterized by the availability of huge music catalogues, accessible by people all over the world. In this scenario, recommender systems are designed to guide listeners in finding tracks and artists that best fit their requests, having therefore the power to influence the diversity of the music they listen to. Albeit several works have proposed… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

  25. arXiv:2112.04975  [pdf, ps, other

    cs.SD cs.HC eess.AS

    Personalized musically induced emotions of not-so-popular Colombian music

    Authors: Juan Sebastián Gómez-Cañón, Perfecto Herrera, Estefanía Cano, Emilia Gómez

    Abstract: This work presents an initial proof of concept of how Music Emotion Recognition (MER) systems could be intentionally biased with respect to annotations of musically induced emotions in a political context. In specific, we analyze traditional Colombian music containing politically charged lyrics of two types: (1) vallenatos and social songs from the "left-wing" guerrilla Fuerzas Armadas Revoluciona… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Journal ref: HCAI Human Centered AI Workshop at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  26. arXiv:2110.09239  [pdf, ps, other

    cs.SD eess.AS q-bio.QM

    EIHW-MTG: Second DiCOVA Challenge System Report

    Authors: Adria Mallol-Ragolta, Helena Cuesta, Emilia Gómez, Björn W. Schuller

    Abstract: This work presents an outer product-based approach to fuse the embedded representations generated from the spectrograms of cough, breath, and speech samples for the automatic detection of COVID-19. To extract deep learnt representations from the spectrograms, we compare the performance of a CNN trained from scratch and a ResNet18 architecture fine-tuned for the task at hand. Furthermore, we invest… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  27. arXiv:2110.06543  [pdf, ps, other

    cs.SD cs.LG eess.AS

    EIHW-MTG DiCOVA 2021 Challenge System Report

    Authors: Adria Mallol-Ragolta, Helena Cuesta, Emilia Gómez, Björn W. Schuller

    Abstract: This paper aims to automatically detect COVID-19 patients by analysing the acoustic information embedded in coughs. COVID-19 affects the respiratory system, and, consequently, respiratory-related signals have the potential to contain salient information for the task at hand. We focus on analysing the spectrogram representations of coughing samples with the aim to investigate whether COVID-19 alter… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

  28. arXiv:2109.15188  [pdf, other

    cs.SD cs.IR eess.AS

    Assessing Algorithmic Biases for Musical Version Identification

    Authors: Furkan Yesiler, Marius Miron, Joan Serrà, Emilia Gómez

    Abstract: Version identification (VI) systems now offer accurate and scalable solutions for detecting different renditions of a musical composition, allowing the use of these systems in industrial applications and throughout the wider music ecosystem. Such use can have an important impact on various stakeholders regarding recognition and financial benefits, including how royalties are circulated for digital… ▽ More

    Submitted 30 September, 2021; originally announced September 2021.

  29. arXiv:2109.07907  [pdf, other

    cs.CY

    How diverse is the ACII community? Analysing gender, geographical and business diversity of Affective Computing research

    Authors: Isabelle Hupont, Songül Tolan, Ana Freire, Lorenzo Porcaro, Sara Estevez, Emilia Gómez

    Abstract: ACII is the premier international forum for presenting the latest research on affective computing. In this work, we monitor, quantify and reflect on the diversity in ACII conference across time by computing a set of indexes. We measure diversity in terms of gender, geographic location and academia vs research centres vs industry, and consider three different actors: authors, keynote speakers and o… ▽ More

    Submitted 12 September, 2021; originally announced September 2021.

    Comments: 8 pages, 7 figures, 4 tables

  30. arXiv:2105.10371  [pdf, other

    cs.SD cs.LG eess.AS

    LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical Parameters

    Authors: Pritish Chandna, António Ramires, Xavier Serra, Emilia Gómez

    Abstract: Loops, seamlessly repeatable musical segments, are a cornerstone of modern music production. Contemporary artists often mix and match various sampled or pre-recorded loops based on musical criteria such as rhythm, harmony and timbral texture to create compositions. Taking such criteria into account, we present LoopNet, a feed-forward generative model for creating loops conditioned on intuitive par… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

  31. Perceptions of Diversity in Electronic Music: the Impact of Listener, Artist, and Track Characteristics

    Authors: Lorenzo Porcaro, Emilia Gómez, Carlos Castillo

    Abstract: Shared practices to assess the diversity of retrieval system results are still debated in the Information Retrieval community, partly because of the challenges of determining what diversity means in specific scenarios, and of understanding how diversity is perceived by end-users. The field of Music Information Retrieval is not exempt from this issue. Even if fields such as Musicology or Sociology… ▽ More

    Submitted 26 November, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

  32. arXiv:2101.02098  [pdf, other

    cs.SD cs.IR eess.AS

    Investigating the efficacy of music version retrieval systems for setlist identification

    Authors: Furkan Yesiler, Emilio Molina, Joan Serrà, Emilia Gómez

    Abstract: The setlist identification (SLI) task addresses a music recognition use case where the goal is to retrieve the metadata and timestamps for all the tracks played in live music events. Due to various musical and non-musical changes in live performances, developing automatic SLI systems is still a challenging task that, despite its industrial relevance, has been under-explored in the academic literat… ▽ More

    Submitted 6 January, 2021; originally announced January 2021.

  33. arXiv:2010.05031  [pdf, other

    cs.DC

    Understanding Cloud Workloads Performance in a Production like Environment

    Authors: Lucia Pons, Josué Feliu, José Puche, Chaoyi Huang, Salvador Petit, Julio Pons, María E. Gómez, Julio Sahuquillo

    Abstract: Understanding inter-VM interference is of paramount importance to provide a sound knowledge and understand where performance degradation comes from in the current public cloud. With this aim, this paper devises a workload taxonomy that classifies applications according to how the major system resources affect their performance (e.g., tail latency) as a function of the level of load (e.g., QPS). Af… ▽ More

    Submitted 10 October, 2020; originally announced October 2020.

    Comments: 16 pages, 17 figures. Submitted to Journal of Parallel and Distributed Computing

  34. arXiv:2010.03284  [pdf, other

    cs.SD cs.LG eess.AS

    Less is more: Faster and better music version identification with embedding distillation

    Authors: Furkan Yesiler, Joan Serrà, Emilia Gómez

    Abstract: Version identification systems aim to detect different renditions of the same underlying musical composition (loosely called cover songs). By learning to encode entire recordings into plain vector embeddings, recent systems have made significant progress in bridging the gap between accuracy and scalability, which has been a key challenge for nearly two decades. In this work, we propose to further… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Accepted to the 21st International Society for Music Information Retrieval Conference (ISMIR 2020)

  35. arXiv:2009.09875  [pdf, other

    eess.AS cs.LG

    A Deep Learning Based Analysis-Synthesis Framework For Unison Singing

    Authors: Pritish Chandna, Helena Cuesta, Emilia Gómez

    Abstract: Unison singing is the name given to an ensemble of singers simultaneously singing the same melody and lyrics. While each individual singer in a unison sings the same principle melody, there are slight timing and pitch deviations between the singers, which, along with the ensemble of timbres, give the listener a perceived sense of "unison". In this paper, we present a study of unison singing in the… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

  36. arXiv:2009.04172  [pdf, other

    eess.AS cs.LG cs.SD

    Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks

    Authors: Helena Cuesta, Brian McFee, Emilia Gómez

    Abstract: This paper addresses the extraction of multiple F0 values from polyphonic and a cappella vocal performances using convolutional neural networks (CNNs). We address the major challenges of ensemble singing, i.e., all melodic sources are vocals and singers sing in harmony. We build upon an existing architecture to produce a pitch salience function of the input signal, where the harmonic constant-Q tr… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

    Comments: Accepted to the 21st International Society for Music Information Retrieval (ISMIR) Conference (2020)

  37. arXiv:2009.01715  [pdf, other

    cs.IR

    Exploring Artist Gender Bias in Music Recommendation

    Authors: Dougal Shakespeare, Lorenzo Porcaro, Emilia Gómez, Carlos Castillo

    Abstract: Music Recommender Systems (mRS) are designed to give personalised and meaningful recommendations of items (i.e. songs, playlists or artists) to a user base, thereby reflecting and further complementing individual users' specific music preferences. Whilst accuracy metrics have been widely applied to evaluate recommendations in mRS literature, evaluating a user's item utility from other impact-orien… ▽ More

    Submitted 6 October, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

    Comments: Presented at the 2nd Workshop on the Impact of Recommender Systems (ImpactRS), at the 14th ACM Conference on Recommender Systems (RecSys 2020)

  38. arXiv:2008.07645  [pdf, other

    eess.AS cs.LG cs.SD

    Deep Learning Based Source Separation Applied To Choir Ensembles

    Authors: Darius Petermann, Pritish Chandna, Helena Cuesta, Jordi Bonada, Emilia Gomez

    Abstract: Choral singing is a widely practiced form of ensemble singing wherein a group of people sing simultaneously in polyphonic harmony. The most commonly practiced setting for choir ensembles consists of four parts; Soprano, Alto, Tenor and Bass (SATB), each with its own range of fundamental frequencies (F$0$s). The task of source separation for this choral setting entails separating the SATB mixture i… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: To appear at the 21st International Society for Music Information Retrieval Conference, Montréal, Canada, 2020, audio examples available at: "https://darius522.github.io/satb-source-separation-results/"

  39. Conditioned Source Separation for Music Instrument Performances

    Authors: Olga Slizovskaia, Gloria Haro, Emilia Gómez

    Abstract: In music source separation, the number of sources may vary for each piece and some of the sources may belong to the same family of instruments, thus sharing timbral characteristics and making the sources more correlated. This leads to additional challenges in the source separation problem. This paper proposes a source separation method for multiple musical instruments sounding simultaneously and e… ▽ More

    Submitted 7 July, 2021; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: 14 pages, 5 figures, under review

  40. arXiv:2004.02541  [pdf, other

    eess.AS cs.CV cs.LG

    Vocoder-Based Speech Synthesis from Silent Videos

    Authors: Daniel Michelsanti, Olga Slizovskaia, Gloria Haro, Emilia Gómez, Zheng-Hua Tan, Jesper Jensen

    Abstract: Both acoustic and visual information influence human perception of speech. For this reason, the lack of audio in a video sequence determines an extremely low speech intelligibility for untrained lip readers. In this paper, we present a way to synthesise speech from the silent video of a talker using deep learning. The system learns a mapping function from raw video frames to acoustic features and… ▽ More

    Submitted 15 August, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: Accepted to Interspeech 2020

  41. arXiv:2003.10414  [pdf, other

    cs.SD cs.IR cs.LG cs.MM

    Multi-channel U-Net for Music Source Separation

    Authors: Venkatesh S. Kadandale, Juan F. Montesinos, Gloria Haro, Emilia Gómez

    Abstract: A fairly straightforward approach for music source separation is to train independent models, wherein each model is dedicated for estimating only a specific source. Training a single model to estimate multiple sources generally does not perform as well as the independent dedicated models. However, Conditioned U-Net (C-U-Net) uses a control mechanism to train a single model for multi-source separat… ▽ More

    Submitted 4 September, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: The paper has been accepted at IEEE MMSP2020. Project Page: https://vskadandale.github.io/multi-channel-unet

  42. arXiv:2003.04794  [pdf, other

    cs.LG cs.CY stat.ML

    Addressing multiple metrics of group fairness in data-driven decision making

    Authors: Marius Miron, Songül Tolan, Emilia Gómez, Carlos Castillo

    Abstract: The Fairness, Accountability, and Transparency in Machine Learning (FAT-ML) literature proposes a varied set of group fairness metrics to measure discrimination against socio-demographic groups that are characterized by a protected feature, such as gender or race.Such a system can be deemed as either fair or unfair depending on the choice of the metric. Several metrics have been proposed, some of… ▽ More

    Submitted 10 March, 2020; originally announced March 2020.

  43. arXiv:2002.04933  [pdf, other

    eess.AS cs.LG cs.SD

    Content Based Singing Voice Extraction From a Musical Mixture

    Authors: Pritish Chandna, Merlijn Blaauw, Jordi Bonada, Emilia Gomez

    Abstract: We present a deep learning based methodology for extracting the singing voice signal from a musical mixture based on the underlying linguistic content. Our model follows an encoder decoder architecture and takes as input the magnitude component of the spectrogram of a musical mixture with vocals. The encoder part of the model is trained via knowledge distillation using a teacher network to learn a… ▽ More

    Submitted 17 February, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: To be published in ICASSP 2020

    Journal ref: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain

  44. arXiv:2001.09778  [pdf

    cs.CY cs.AI

    Artificial intelligence in medicine and healthcare: a review and classification of current and near-future applications and their ethical and social Impact

    Authors: Emilio Gómez-González, Emilia Gomez, Javier Márquez-Rivas, Manuel Guerrero-Claro, Isabel Fernández-Lizaranzu, María Isabel Relimpio-López, Manuel E. Dorado, María José Mayorga-Buiza, Guillermo Izquierdo-Ayuso, Luis Capitán-Morales

    Abstract: This paper provides an overview of the current and near-future applications of Artificial Intelligence (AI) in Medicine and Health Care and presents a classification according to their ethical and societal aspects, potential benefits and pitfalls, and issues that can be considered controversial and are not deeply discussed in the literature. This work is based on an analysis of the state of the… ▽ More

    Submitted 6 February, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

  45. arXiv:2001.07038  [pdf, other

    cs.DL cs.AI cs.CY

    Measuring Diversity of Artificial Intelligence Conferences

    Authors: Ana Freire, Lorenzo Porcaro, Emilia Gómez

    Abstract: The lack of diversity of the Artificial Intelligence (AI) field is nowadays a concern, and several initiatives such as funding schemes and mentoring programs have been designed to overcome it. However, there is no indication on how these initiatives actually impact AI diversity in the short and long term. This work studies the concept of diversity in this particular context and proposes a small se… ▽ More

    Submitted 22 March, 2021; v1 submitted 20 January, 2020; originally announced January 2020.

  46. arXiv:1911.11853  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Neural Percussive Synthesis Parameterised by High-Level Timbral Features

    Authors: António Ramires, Pritish Chandna, Xavier Favory, Emilia Gómez, Xavier Serra

    Abstract: We present a deep neural network-based methodology for synthesising percussive sounds with control over high-level timbral characteristics of the sounds. This approach allows for intuitive control of a synthesizer, enabling the user to shape sounds without extensive knowledge of signal processing. We use a feedforward convolutional neural network-based architecture, which is able to map input para… ▽ More

    Submitted 3 April, 2020; v1 submitted 25 November, 2019; originally announced November 2019.

  47. arXiv:1910.12551  [pdf, other

    cs.SD cs.LG eess.AS

    Accurate and Scalable Version Identification Using Musically-Motivated Embeddings

    Authors: Furkan Yesiler, Joan Serrà, Emilia Gómez

    Abstract: The version identification (VI) task deals with the automatic detection of recordings that correspond to the same underlying musical piece. Despite many efforts, VI is still an open problem, with much room for improvement, specially with regard to combining accuracy and scalability. In this paper, we present MOVE, a musically-motivated method for accurate and scalable version identification. MOVE… ▽ More

    Submitted 13 April, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

  48. arXiv:1909.05882  [pdf, other

    cs.SD cs.CL eess.AS

    The emotions that we perceive in music: the influence of language and lyrics comprehension on agreement

    Authors: Juan Sebastián Gómez Cañón, Perfecto Herrera, Emilia Gómez, Estefanía Cano

    Abstract: In the present study, we address the relationship between the emotions perceived in pop and rock music (mainly in Euro-American styles with English lyrics) and the language spoken by the listener. Our goal is to understand the influence of lyrics comprehension on the perception of emotions and use this information to improve Music Emotion Recognition (MER) models. Two main research questions are a… ▽ More

    Submitted 25 October, 2019; v1 submitted 12 September, 2019; originally announced September 2019.

  49. arXiv:1907.01813  [pdf, other

    cs.SD cs.LG eess.AS

    A Case Study of Deep-Learned Activations via Hand-Crafted Audio Features

    Authors: Olga Slizovskaia, Emilia Gómez, Gloria Haro

    Abstract: The explainability of Convolutional Neural Networks (CNNs) is a particularly challenging task in all areas of application, and it is notably under-researched in music and audio domain. In this paper, we approach explainability by exploiting the knowledge we have on hand-crafted audio features. Our study focuses on a well-defined MIR task, the recognition of musical instruments from user-generated… ▽ More

    Submitted 3 July, 2019; originally announced July 2019.

    Comments: The 2018 Joint Workshop on Machine Learning for Music, The Federated Artificial Intelligence Meeting (FAIM), Joint workshop program of ICML, IJCAI/ECAI, and AAMAS, Stockholm, Sweden, Saturday, July 14th, 2018

  50. arXiv:1904.05086  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    A Framework for Multi-f0 Modeling in SATB Choir Recordings

    Authors: Helena Cuesta, Emilia Gómez, Pritish Chandna

    Abstract: Fundamental frequency (f0) modeling is an important but relatively unexplored aspect of choir singing. Performance evaluation as well as auditory analysis of singing, whether individually or in a choir, often depend on extracting f0 contours for the singing voice. However, due to the large number of singers, singing at a similar frequency range, extracting the exact individual pitch contours from… ▽ More

    Submitted 10 April, 2019; originally announced April 2019.