Showing 1–50 of 52 results for author: Sánchez, E

Searching in archive cs.
  1. arXiv:2506.04990  [pdf, ps, other]

    cs.CV

    Multi-scale Image Super Resolution with a Single Auto-Regressive Model

    Authors: Enrique Sanchez, Isma Hadji, Adrian Bulat, Christos Tzelepis, Brais Martinez, Georgios Tzimiropoulos

    Abstract: In this paper we tackle Image Super Resolution (ISR), using recent advances in Visual Auto-Regressive (VAR) modeling. VAR iteratively estimates the residual in latent space between gradually increasing image scales, a process referred to as next-scale prediction. Thus, the strong priors learned during pre-training align well with the downstream task (ISR). To our knowledge, only VARSR has exploite…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Enrique Sanchez and Isma Hadji equally contributed to this work. Project site https://github.com/saic-fi/ms_sr_var

  2. arXiv:2506.00474  [pdf]

    eess.IV cs.CV

    A European Multi-Center Breast Cancer MRI Dataset

    Authors: Gustav Müller-Franzes, Lorena Escudero Sánchez, Nicholas Payne, Alexandra Athanasiou, Michael Kalogeropoulos, Aitor Lopez, Alfredo Miguel Soro Busto, Julia Camps Herrero, Nika Rasoolzadeh, Tianyu Zhang, Ritse Mann, Debora Jutz, Maike Bode, Christiane Kuhl, Wouter Veldhuis, Oliver Lester Saldanha, JieFu Zhu, Jakob Nikolas Kather, Daniel Truhn, Fiona J. Gilbert

    Abstract: Detecting breast cancer early is of the utmost importance to effectively treat the millions of women afflicted by breast cancer worldwide every year. Although mammography is the primary imaging modality for screening breast cancer, there is an increasing interest in adding magnetic resonance imaging (MRI) to screening programmes, particularly for women at high risk. Recent guidelines by the Europe…

    Submitted 31 May, 2025; originally announced June 2025.

  3. arXiv:2502.04314  [pdf, other]

    cs.CL

    BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation

    Authors: The Omnilingual MT Team, Pierre Andrews, Mikel Artetxe, Mariano Coria Meglioli, Marta R. Costa-jussà, Joe Chuang, David Dale, Cynthia Gao, Jean Maillard, Alex Mourachko, Christophe Ropers, Safiyyah Saleem, Eduardo Sánchez, Ioannis Tsiamas, Arina Turkatenko, Albert Ventayol-Boada, Shireen Yates

    Abstract: This paper presents BOUQuET, a multicentric and multi-register/domain dataset and benchmark, and its broader collaborative extension initiative. This dataset is handcrafted in non-English languages first, each of these source languages being represented among the 23 languages commonly used by half of the world's population and therefore having the potential to serve as pivot languages that will en…

    Submitted 6 February, 2025; originally announced February 2025.

    ACM Class: I.2.7

  4. arXiv:2412.08821  [pdf, other]

    cs.CL

    Large Concept Models: Language Modeling in a Sentence Representation Space

    Authors: LCM team, Loïc Barrault, Paul-Ambroise Duquenne, Maha Elbayad, Artyom Kozhevnikov, Belen Alastruey, Pierre Andrews, Mariano Coria, Guillaume Couairon, Marta R. Costa-jussà, David Dale, Hady Elsahar, Kevin Heffernan, João Maria Janeiro, Tuan Tran, Christophe Ropers, Eduardo Sánchez, Robin San Roman, Alexandre Mourachko, Safiyyah Saleem, Holger Schwenk

    Abstract: LLMs have revolutionized the field of artificial intelligence and have emerged as the de-facto tool for many tasks. The current established technology of LLMs is to process input and generate output at the token level. This is in sharp contrast to humans who operate at multiple levels of abstraction, well beyond single words, to analyze information and to generate creative content. In this paper,…

    Submitted 15 December, 2024; v1 submitted 11 December, 2024; originally announced December 2024.

    Comments: 49 pages

  5. arXiv:2412.08279  [pdf, other]

    cs.CL

    Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation

    Authors: Marta R. Costa-jussà, Joy Chen, Ifeoluwanimi Adebara, Joe Chuang, Christophe Ropers, Eduardo Sánchez

    Abstract: The purpose of this work is to share an English-Yorùbá evaluation dataset for open-book reading comprehension and text generation to assess the performance of models both in a high- and a low- resource language. The dataset contains 358 questions and answers on 338 English documents and 208 Yorùbá documents. The average document length is ~ 10k words for English and 430 words for Yorùbá. Experimen…

    Submitted 11 December, 2024; originally announced December 2024.

    ACM Class: I.2.7

  6. arXiv:2412.08268  [pdf, other]

    cs.CL

    LCFO: Long Context and Long Form Output Dataset and Benchmarking

    Authors: Marta R. Costa-jussà, Pierre Andrews, Mariano Coria Meglioli, Joy Chen, Joe Chuang, David Dale, Christophe Ropers, Alexandre Mourachko, Eduardo Sánchez, Holger Schwenk, Tuan Tran, Arina Turkatenko, Carleigh Wood

    Abstract: This paper presents the Long Context and Form Output (LCFO) benchmark, a novel evaluation framework for assessing gradual summarization and summary expansion capabilities across diverse domains. LCFO consists of long input documents (5k words average length), each of which comes with three summaries of different lengths (20%, 10%, and 5% of the input text), as well as approximately 15 questions an…

    Submitted 12 December, 2024; v1 submitted 11 December, 2024; originally announced December 2024.

    ACM Class: I.2.7

  7. arXiv:2411.08135  [pdf, other]

    cs.CL cs.AI cs.LG cs.SD eess.AS

    On the Role of Speech Data in Reducing Toxicity Detection Bias

    Authors: Samuel J. Bell, Mariano Coria Meglioli, Megan Richards, Eduardo Sánchez, Christophe Ropers, Skyler Wang, Adina Williams, Levent Sagun, Marta R. Costa-jussà

    Abstract: Text toxicity detection systems exhibit significant biases, producing disproportionate rates of false positives on samples mentioning demographic groups. But what about toxicity detection in speech? To investigate the extent to which text-based biases are mitigated by speech-based systems, we produce a set of high-quality group annotations for the multilingual MuTox dataset, and then leverage thes…

    Submitted 16 May, 2025; v1 submitted 12 November, 2024; originally announced November 2024.

    Comments: Accepted at NAACL 2025

    Journal ref: In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (Volume 1), pages 1454-1468

  8. arXiv:2411.02479  [pdf, other]

    cs.RO cs.AI cs.LG

    Digitizing Touch with an Artificial Multimodal Fingertip

    Authors: Mike Lambeta, Tingfan Wu, Ali Sengul, Victoria Rose Most, Nolan Black, Kevin Sawyer, Romeo Mercado, Haozhi Qi, Alexander Sohn, Byron Taylor, Norb Tydingco, Gregg Kammerer, Dave Stroud, Jake Khatha, Kurt Jenkins, Kyle Most, Neal Stein, Ricardo Chavira, Thomas Craven-Bartle, Eric Sanchez, Yitian Ding, Jitendra Malik, Roberto Calandra

    Abstract: Touch is a crucial sensing modality that provides rich information about object properties and interactions with the physical environment. Humans and robots both benefit from using touch to perceive and interact with the surrounding environment (Johansson and Flanagan, 2009; Li et al., 2020; Calandra et al., 2017). However, no existing systems provide rich, multi-modal digital touch-sensing capabi…

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: 28 pages

    ACM Class: I.2.0; I.2.9

  9. arXiv:2410.18710  [pdf, other]

    q-bio.QM cs.AI

    Uncovering the Genetic Basis of Glioblastoma Heterogeneity through Multimodal Analysis of Whole Slide Images and RNA Sequencing Data

    Authors: Ahmad Berjaoui, Louis Roussel, Eduardo Hugo Sanchez, Elizabeth Cohen-Jonathan Moyal

    Abstract: Glioblastoma is a highly aggressive form of brain cancer characterized by rapid progression and poor prognosis. Despite advances in treatment, the underlying genetic mechanisms driving this aggressiveness remain poorly understood. In this study, we employed multimodal deep learning approaches to investigate glioblastoma heterogeneity using joint image/RNA-seq analysis. Our results reveal novel gen…

    Submitted 19 May, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

  10. arXiv:2409.17370  [pdf, other]

    cs.CV cs.AI

    The Overfocusing Bias of Convolutional Neural Networks: A Saliency-Guided Regularization Approach

    Authors: David Bertoin, Eduardo Hugo Sanchez, Mehdi Zouitine, Emmanuel Rachelson

    Abstract: Despite transformers being considered as the new standard in computer vision, convolutional neural networks (CNNs) still outperform them in low-data regimes. Nonetheless, CNNs often make decisions based on narrow, specific regions of input images, especially when training data is limited. This behavior can severely compromise the model's generalization capabilities, making it disproportionately de…

    Submitted 25 September, 2024; originally announced September 2024.

  11. arXiv:2409.12126  [pdf, other]

    cs.CL

    Linguini: A benchmark for language-agnostic linguistic reasoning

    Authors: Eduardo Sánchez, Belen Alastruey, Christophe Ropers, Pontus Stenetorp, Mikel Artetxe, Marta R. Costa-jussà

    Abstract: We propose a new benchmark to measure a language model's linguistic reasoning skills without relying on pre-existing language-specific knowledge. The test covers 894 questions grouped in 160 problems across 75 (mostly) extremely low-resource languages, extracted from the International Linguistic Olympiad corpus. To attain high accuracy on this benchmark, models don't need previous knowledge of the…

    Submitted 18 September, 2024; originally announced September 2024.

  12. arXiv:2409.03911  [pdf, other]

    cs.CV cs.AI

    The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives

    Authors: Èric Śanchez, Adrià Molina, Oriol Ramos Terrades

    Abstract: The use of image analysis in automated photography management is an increasing trend in heritage institutions. Such tools alleviate the human cost associated with the manual and expensive annotation of new data sources while facilitating fast access to the citizenship through online indexes and search engines. However, available tagging and description tools are usually designed around modern phot…

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: Accepted at ECCV workshop AI4DH

  13. arXiv:2408.05609  [pdf, other]

    eess.SY cs.AI cs.LG cs.MA cs.RO

    Mitigating Metropolitan Carbon Emissions with Dynamic Eco-driving at Scale

    Authors: Vindula Jayawardana, Baptiste Freydt, Ao Qu, Cameron Hickert, Edgar Sanchez, Catherine Tang, Mark Taylor, Blaine Leonard, Cathy Wu

    Abstract: The sheer scale and diversity of transportation make it a formidable sector to decarbonize. Here, we consider an emerging opportunity to reduce carbon emissions: the growing adoption of semi-autonomous vehicles, which can be programmed to mitigate stop-and-go traffic through intelligent speed commands and, thus, reduce emissions. But would such dynamic eco-driving move the needle on climate change…

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: In review

  14. arXiv:2407.16470  [pdf, other]

    cs.CL cs.AI

    Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models

    Authors: Kenza Benkirane, Laura Gongas, Shahar Pelles, Naomi Fuchs, Joshua Darmon, Pontus Stenetorp, David Ifeoluwa Adelani, Eduardo Sánchez

    Abstract: Recent advancements in massively multilingual machine translation systems have significantly enhanced translation accuracy; however, even the best performing systems still generate hallucinations, severely impacting user trust. Detecting hallucinations in Machine Translation (MT) remains a critical challenge, particularly since existing methods excel with High-Resource Languages (HRLs) but exhibit…

    Submitted 20 October, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: Authors Kenza Benkirane and Laura Gongas contributed equally to this work

    ACM Class: I.2.7

  15. arXiv:2406.07191  [pdf, other]

    cs.CV

    MeMSVD: Long-Range Temporal Structure Capturing Using Incremental SVD

    Authors: Ioanna Ntinou, Enrique Sanchez, Georgios Tzimiropoulos

    Abstract: This paper is on long-term video understanding where the goal is to recognise human actions over long temporal windows (up to minutes long). In prior work, long temporal context is captured by constructing a long-term memory bank consisting of past and future video features which are then integrated into standard (short-term) video recognition backbones through the use of attention mechanisms. Two…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to ICIP 2024

  16. arXiv:2401.05450  [pdf]

    cs.HC

    Reorienting Learning Game Design in Design-Based Research: a Case Study

    Authors: Nadine Mandran, Estelle Prior, Eric Sanchez, Mathieu Vermeulen

    Abstract: One of the main difficulties remains the collaboration between the various experts involved in designing the Learning Games (LG). Our literature review focuses on the pitfalls and principles that have been identified by various authors in learning games design. Based on this review, a prototype was designed to support the LG design process and to study more precisely the collaboration between acto…

    Submitted 9 January, 2024; originally announced January 2024.

  17. arXiv:2312.17686  [pdf, other]

    cs.CV

    Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization

    Authors: Ioanna Ntinou, Enrique Sanchez, Georgios Tzimiropoulos

    Abstract: Action Localization is a challenging problem that combines detection and recognition tasks, which are often addressed separately. State-of-the-art methods rely on off-the-shelf bounding box detections pre-computed at high resolution, and propose transformer models that focus on the classification task alone. Such two-stage solutions are prohibitive for real-time deployment. On the other hand, sing…

    Submitted 23 May, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: CVPR 2024

  18. arXiv:2312.03186  [pdf, other]

    cs.LG cs.AI cs.CY

    Data-Driven Traffic Reconstruction and Kernel Methods for Identifying Stop-and-Go Congestion

    Authors: Edgar Ramirez Sanchez, Shreyaa Raghavan, Cathy Wu

    Abstract: Identifying stop-and-go events (SAGs) in traffic flow presents an important avenue for advancing data-driven research for climate change mitigation and sustainability, owing to their substantial impact on carbon emissions, travel time, fuel consumption, and roadway safety. In fact, SAGs are estimated to account for 33-50% of highway driving externalities. However, insufficient attention has been p…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Presented at NeurIPS 2023 workshops: Tackling Climate Change with Machine Learning & Computational Sustainability

  19. ViKi-HyCo: A Hybrid-Control approach for complex car-like maneuvers

    Authors: Edison P. Velasco Sánchez, Miguel Ángel Muñoz-Bañón, Francisco A. Candelas, Santiago T. Puente, Fernando Torres

    Abstract: While Visual Servoing is deeply studied to perform simple maneuvers, the literature does not commonly address complex cases where the target is far out of the camera's field of view (FOV) during the maneuver. For this reason, in this paper, we present ViKi-HyCo (Visual Servoing and Kinematic Hybrid-Controller). This approach generates the necessary maneuvers for the complex positioning of a non-ho…

    Submitted 16 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: This paper is published at the journal "IEEE Access"

    Journal ref: In IEEE Access, vol. 12, pp. 65428-65443, May. 2024

  20. arXiv:2309.03175  [pdf, other]

    cs.CL

    Gender-specific Machine Translation with Large Language Models

    Authors: Eduardo Sánchez, Pierre Andrews, Pontus Stenetorp, Mikel Artetxe, Marta R. Costa-jussà

    Abstract: While machine translation (MT) systems have seen significant improvements, it is still common for translations to reflect societal biases, such as gender bias. Decoder-only Large Language Models (LLMs) have demonstrated potential in MT, albeit with performance slightly lagging behind traditional encoder-decoder Neural Machine Translation (NMT) systems. However, LLMs offer a unique advantage: the a…

    Submitted 16 April, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

  21. arXiv:2307.01753  [pdf, other]

    astro-ph.CO cs.LG physics.comp-ph physics.data-an

    Local primordial non-Gaussianity from the large-scale clustering of photometric DESI luminous red galaxies

    Authors: Mehdi Rezaie, Ashley J. Ross, Hee-Jong Seo, Hui Kong, Anna Porredon, Lado Samushia, Edmond Chaussidon, Alex Krolewski, Arnaud de Mattia, Florian Beutler, Jessica Nicole Aguilar, Steven Ahlen, Shadab Alam, Santiago Avila, Benedict Bahr-Kalus, Jose Bermejo-Climent, David Brooks, Todd Claybaugh, Shaun Cole, Kyle Dawson, Axel de la Macorra, Peter Doel, Andreu Font-Ribera, Jaime E. Forero-Romero, Satya Gontcho A Gontcho, et al. (24 additional authors not shown)

    Abstract: We use angular clustering of luminous red galaxies from the Dark Energy Spectroscopic Instrument (DESI) imaging surveys to constrain the local primordial non-Gaussianity parameter $f_{\rm NL}$. Our sample comprises over 12 million targets, covering 14,000 square degrees of the sky, with redshifts in the range $0.2 < z < 1.35$. We identify Galactic extinction, survey depth, and astronomical seeing as the…

    Submitted 25 June, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 21 pages, 17 figures, 7 tables (Appendix excluded). Published in MNRAS

  22. arXiv:2306.06149  [pdf, other]

    cs.CV cs.AI

    Read, look and detect: Bounding box annotation from image-caption pairs

    Authors: Eduardo Hugo Sanchez

    Abstract: Various methods have been proposed to detect objects while reducing the cost of data annotation. For instance, weakly supervised object detection (WSOD) methods rely only on image-level annotations during training. Unfortunately, data annotation remains expensive since annotators must provide the categories describing the content of each image and labeling is restricted to a fixed set of categorie…

    Submitted 9 June, 2023; originally announced June 2023.

  23. arXiv:2306.04645  [pdf, other]

    cs.LG cs.AR cs.DC

    Special Session: Approximation and Fault Resiliency of DNN Accelerators

    Authors: Mohammad Hasan Ahmadilivani, Mario Barbareschi, Salvatore Barone, Alberto Bosio, Masoud Daneshtalab, Salvatore Della Torca, Gabriele Gavarini, Maksim Jenihhin, Jaan Raik, Annachiara Ruospo, Ernesto Sanchez, Mahdi Taheri

    Abstract: Deep Learning, and in particular, Deep Neural Network (DNN) is nowadays widely used in many scenarios, including safety-critical applications such as autonomous driving. In this context, besides energy efficiency and performance, reliability plays a crucial role since a system failure can jeopardize human life. As with any other device, the reliability of hardware architectures running DNNs has to…

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: 10 pages, 6 tables, 9 figures

  24. PSP Framework: A novel risk assessment method in compliance with ISO/SAE-21434

    Authors: Franco Oberti, Ernesto Sanchez, Alessandro Savino, Filippo Parisi, Stefano Di Carlo

    Abstract: As more cars connect to the internet and other devices, the automotive market has become a lucrative target for cyberattacks. This has made the industry more vulnerable to security threats. As a result, car manufacturers and governments are working together to reduce risks and prevent cyberattacks in the automotive sector. However, existing attack feasibility models derived from the information te…

    Submitted 9 May, 2023; originally announced May 2023.

    Journal ref: 2023 53rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W)

  25. arXiv:2212.02854  [pdf, other]

    eess.IV cs.CV

    Automated Segmentation of Computed Tomography Images with Submanifold Sparse Convolutional Networks

    Authors: Saúl Alonso-Monsalve, Leigh H. Whitehead, Adam Aurisano, Lorena Escudero Sanchez

    Abstract: Quantitative cancer image analysis relies on the accurate delineation of tumours, a very specialised and time-consuming task. For this reason, methods for automated segmentation of tumours in medical imaging have been extensively developed in recent years, being Computed Tomography one of the most popular imaging modalities explored. However, the large amount of 3D voxels in a typical scan is proh…

    Submitted 6 December, 2022; originally announced December 2022.

  26. arXiv:2210.02390  [pdf, other]

    cs.CV cs.AI cs.LG

    Bayesian Prompt Learning for Image-Language Model Generalization

    Authors: Mohammad Mahdi Derakhshani, Enrique Sanchez, Adrian Bulat, Victor Guilherme Turrisi da Costa, Cees G. M. Snoek, Georgios Tzimiropoulos, Brais Martinez

    Abstract: Foundational image-language models have generated considerable interest due to their efficient adaptation to downstream tasks by prompt learning. Prompt learning treats part of the language model input as trainable while freezing the rest, and optimizes an Empirical Risk Minimization objective. However, Empirical Risk Minimization is known to suffer from distributional shifts which hurt generaliza…

    Submitted 20 August, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted at ICCV 2023

  27. arXiv:2209.15000  [pdf, other]

    cs.CV cs.AI cs.LG

    REST: REtrieve & Self-Train for generative action recognition

    Authors: Adrian Bulat, Enrique Sanchez, Brais Martinez, Georgios Tzimiropoulos

    Abstract: This work is on training a generative action/video recognition model whose output is a free-form action-specific caption describing the video (rather than an action class label). A generative approach has practical advantages like producing more fine-grained and human-readable output, and being naturally open-world. To this end, we propose to adapt a pre-trained generative Vision & Language (V&L)…

    Submitted 29 September, 2022; originally announced September 2022.

  28. arXiv:2209.09563  [pdf, other]

    cs.LG cs.CV eess.IV

    Calibrating Ensembles for Scalable Uncertainty Quantification in Deep Learning-based Medical Segmentation

    Authors: Thomas Buddenkotte, Lorena Escudero Sanchez, Mireia Crispin-Ortuzar, Ramona Woitek, Cathal McCague, James D. Brenton, Ozan Öktem, Evis Sala, Leonardo Rundo

    Abstract: Uncertainty quantification in automated image analysis is highly desired in many applications. Typically, machine learning models in classification or segmentation are only developed to provide binary answers; however, quantifying the uncertainty of the models can play a critical role for example in active learning or machine human interaction. Uncertainty quantification is especially difficult wh…

    Submitted 20 September, 2022; originally announced September 2022.

  29. Lumen Shape Reconstruction using a Soft Robotic Balloon Catheter and Electrical Impedance Tomography

    Authors: James Avery, Mark Runciman, Cristina Fiani, Elena Monfort Sanchez, Saina Akhond, Zhuang Liu, Kirill Aristovich, George Mylonas

    Abstract: Incorrectly sized balloon catheters can lead to increased post-surgical complications, yet even with preoperative imaging, correct selection remains a challenge. With limited feedback during surgery, it is difficult to verify correct deployment. We propose the use of integrated impedance measurements and Electrical Impedance Tomography (EIT) imaging to assess the deformation of the balloon and det…

    Submitted 23 August, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Published version in IROS 2022 The IEEE/RSJ International Conference on Intelligent Robots and Systems. Improved Figure 3, discussion and more concise methods section

  30. arXiv:2206.14706  [pdf]

    q-bio.BM cs.IT

    Molecular information theory meets protein folding

    Authors: Ignacio E. Sánchez, Ezequiel A. Galpern, Martín M. Garibaldi, Diego U. Ferreiro

    Abstract: We propose an application of molecular information theory to analyze the folding of single domain proteins. We analyze results from various areas of protein science, such as sequence-based potentials, reduced amino acid alphabets, backbone configurational entropy, secondary structure content, residue burial layers, and mutational studies of protein stability changes. We found that the average info…

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 33 pages, 2 figures, plus supporting information

  31. CAN-MM: Multiplexed Message Authentication Code for Controller Area Network message authentication in road vehicles

    Authors: Franco Oberti, Ernesto Sanchez, Alessandro Savino, Filippo Parisi, Stefano Di Carlo

    Abstract: The automotive market is increasingly profitable for cyberattacks with the constant shift toward fully interconnected vehicles. Electronic Control Units (ECUs) installed on cars often operate in a critical and hostile environment. Hence, both carmakers and governments have decided to support a series of initiatives to mitigate risks and threats belonging to the automotive domain. The Controller Ar…

    Submitted 22 May, 2024; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: in IEEE Transactions on Vehicular Technology

  32. LIN-MM: Multiplexed Message Authentication Code for Local Interconnect Network message authentication in road vehicles

    Authors: Franco Oberti, Ernesto Sanchez, Alessandro Savino, Filippo Parisi, Mirco Brero, Stefano Di Carlo

    Abstract: The automotive market is profitable for cyberattacks with the constant shift toward interconnected vehicles. Electronic Control Units (ECUs) installed on cars often operate in a critical and hostile environment. Hence, both carmakers and governments have supported initiatives to mitigate risks and threats belonging to the automotive domain. The Local Interconnect Network (LIN) is one of the most u…

    Submitted 6 June, 2022; originally announced June 2022.

    Journal ref: 2022 IEEE 28th International Symposium on On-Line Testing and Robust System Design (IOLTS)

  33. arXiv:2205.15895  [pdf, other]

    cs.CV

    From Keypoints to Object Landmarks via Self-Training Correspondence: A novel approach to Unsupervised Landmark Discovery

    Authors: Dimitrios Mallis, Enrique Sanchez, Matt Bell, Georgios Tzimiropoulos

    Abstract: This paper proposes a novel paradigm for the unsupervised learning of object landmark detectors. Contrary to existing methods that build on auxiliary tasks such as image generation or equivariance, we propose a self-training approach where, departing from generic keypoints, a landmark detector and descriptor is trained to improve itself, tuning the keypoints into distinctive landmarks. To this end…

    Submitted 25 February, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

  34. arXiv:2204.13666  [pdf, other]

    cs.LG cs.AR

    Schrödinger's FP: Dynamic Adaptation of Floating-Point Containers for Deep Learning Training

    Authors: Miloš Nikolić, Enrique Torres Sanchez, Jiahui Wang, Ali Hadi Zadeh, Mostafa Mahmoud, Ameer Abdelhadi, Kareem Ibrahim, Andreas Moshovos

    Abstract: The transfer of tensors from/to memory during neural network training dominates time and energy. To improve energy efficiency and performance, research has been exploring ways to use narrower data representations. So far, these attempts relied on user-directed trial-and-error to achieve convergence. We present methods that relieve users from this responsibility. Our methods dynamically adjust the…

    Submitted 16 May, 2024; v1 submitted 28 April, 2022; originally announced April 2022.

  35. EXT-TAURUM P2T: an Extended Secure CAN-FD Architecture for Road Vehicles

    Authors: Franco Oberti, Alessandro Savino, Ernesto Sanchez, Filippo Parisi, Stefano Di Carlo

    Abstract: The automobile industry is no longer relying on pure mechanical systems; instead, it benefits from advanced Electronic Control Units (ECUs) in order to provide new and complex functionalities in the effort to move toward fully connected cars. However, connected cars provide a dangerous playground for hackers. Vehicles are becoming increasingly vulnerable to cyber attacks as they come equipped with…

    Submitted 7 March, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

  36. Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence

    Authors: Xiang Bai, Hanchen Wang, Liya Ma, Yongchao Xu, Jiefeng Gan, Ziwei Fan, Fan Yang, Ke Ma, Jiehua Yang, Song Bai, Chang Shu, Xinyu Zou, Renhao Huang, Changzheng Zhang, Xiaowu Liu, Dandan Tu, Chuou Xu, Wenqing Zhang, Xi Wang, Anguo Chen, Yu Zeng, Dehua Yang, Ming-Wei Wang, Nagaraj Holalkere, Neil J. Halin, et al. (21 additional authors not shown)

    Abstract: Artificial intelligence (AI) provides a promising substitution for streamlining COVID-19 diagnoses. However, concerns surrounding security and trustworthiness impede the collection of large-scale representative medical data, posing a considerable challenge for training a well-generalised model in clinical practices. To address this, we launch the Unified CT-COVID AI Diagnostic Initiative (UCADI),…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Nature Machine Intelligence

  37. arXiv:2111.02360  [pdf, other]

    cs.CV

    Subpixel Heatmap Regression for Facial Landmark Localization

    Authors: Adrian Bulat, Enrique Sanchez, Georgios Tzimiropoulos

    Abstract: Deep Learning models based on heatmap regression have revolutionized the task of facial landmark localization with existing models working robustly under large poses, non-uniform illumination and shadows, occlusions and self-occlusions, low resolution and blur. However, despite their wide adoption, heatmap regression approaches suffer from discretization-induced errors related to both the heatmap…

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: Accepted at BMVC 2021

  38. arXiv:2103.16554  [pdf, other]

    cs.CV cs.LG

    Pre-training strategies and datasets for facial representation learning

    Authors: Adrian Bulat, Shiyang Cheng, Jing Yang, Andrew Garbett, Enrique Sanchez, Georgios Tzimiropoulos

    Abstract: What is the best way to learn a universal face representation? Recent work on Deep Learning in the area of face analysis has focused on supervised learning for specific tasks of interest (e.g. face recognition, facial landmark localization etc.) but has overlooked the overarching question of how to find a facial representation that can be readily adapted to several facial analysis tasks and datase…

    Submitted 20 July, 2022; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted at ECCV 2022

  39. arXiv:2103.13372  [pdf, other]

    cs.CV cs.LG

    Affective Processes: stochastic modelling of temporal context for emotion and facial expression recognition

    Authors: Enrique Sanchez, Mani Kumar Tellamekala, Michel Valstar, Georgios Tzimiropoulos

    Abstract: Temporal context is key to the recognition of expressions of emotion. Existing methods, that rely on recurrent or self-attention models to enforce temporal consistency, work on the feature level, ignoring the task-specific temporal dependencies, and fail to model context uncertainty. To alleviate these issues, we build upon the framework of Neural Processes to propose a method for apparent emotion…

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Accepted at CVPR 2021

  40. arXiv:2012.05928  [pdf, other]

    astro-ph.GA astro-ph.CO astro-ph.IM cs.LG

    A machine learning approach to galaxy properties: joint redshift-stellar mass probability distributions with Random Forest

    Authors: S. Mucesh, W. G. Hartley, A. Palmese, O. Lahav, L. Whiteway, A. F. L. Bluck, A. Alarcon, A. Amon, K. Bechtol, G. M. Bernstein, A. Carnero Rosell, M. Carrasco Kind, A. Choi, K. Eckert, S. Everett, D. Gruen, R. A. Gruendl, I. Harrison, E. M. Huff, N. Kuropatkin, I. Sevilla-Noarbe, E. Sheldon, B. Yanny, M. Aguena, S. Allam, et al. (50 additional authors not shown)

    Abstract: We demonstrate that highly accurate joint redshift-stellar mass probability distribution functions (PDFs) can be obtained using the Random Forest (RF) machine learning (ML) algorithm, even with few photometric bands available. As an example, we use the Dark Energy Survey (DES), combined with the COSMOS2015 catalogue for redshifts and stellar masses. We build two ML models: one containing deep phot…

    Submitted 19 February, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: 18 pages, 8 figures, Accepted by MNRAS

    Report number: FERMILAB-PUB-20-653-AE, DES-2020-0542

    Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 502, Issue 2, April 2021, Pages 2770-2786

  41. arXiv:2011.01864  [pdf, other]

    cs.CV

    Semi-supervised Facial Action Unit Intensity Estimation with Contrastive Learning

    Authors: Enrique Sanchez, Adrian Bulat, Anestis Zaganidis, Georgios Tzimiropoulos

    Abstract: This paper tackles the challenging problem of estimating the intensity of Facial Action Units with few labeled images. Contrary to previous works, our method does not require to manually select key frames, and produces state-of-the-art results with as little as 2% of annotated frames, which are randomly chosen. To this end, we propose a semi-supervised learning approach where a spatio-…

    Submitted 4 November, 2020; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: ACCV 2020

  42. arXiv:2009.12856  [pdf, other]

    astro-ph.EP astro-ph.IM cs.LG

    Machine Learning for Searching the Dark Energy Survey for Trans-Neptunian Objects

    Authors: B. Henghes, O. Lahav, D. W. Gerdes, E. Lin, R. Morgan, T. M. C. Abbott, M. Aguena, S. Allam, J. Annis, S. Avila, E. Bertin, D. Brooks, D. L. Burke, A. Carnero Rosell, M. Carrasco Kind, J. Carretero, C. Conselice, M. Costanzi, L. N. da Costa, J. De Vicente, S. Desai, H. T. Diehl, P. Doel, S. Everett, I. Ferrero, et al. (34 additional authors not shown)

    Abstract: In this paper we investigate how implementing machine learning could improve the efficiency of the search for Trans-Neptunian Objects (TNOs) within Dark Energy Survey (DES) data when used alongside orbit fitting. The discovery of multiple TNOs that appear to show a similarity in their orbital parameters has led to the suggestion that one or more undetected planets, an as yet undiscovered "Planet 9…

    Submitted 10 December, 2020; v1 submitted 27 September, 2020; originally announced September 2020.

    Comments: Published in PASP, 16 pages, 6 figures

    Journal ref: PASP 133 014501 (2021)

  43. arXiv:2007.00017  [pdf, other]

    quant-ph cs.CE q-fin.ST

    Dynamic Portfolio Optimization with Real Datasets Using Quantum Processors and Quantum-Inspired Tensor Networks

    Authors: Samuel Mugel, Carlos Kuchkovsky, Escolastico Sanchez, Samuel Fernandez-Lorenzo, Jorge Luis-Hita, Enrique Lizaso, Roman Orus

    Abstract: In this paper we tackle the problem of dynamic portfolio optimization, i.e., determining the optimal trading trajectory for an investment portfolio of assets over a period of time, taking into account transaction costs and other possible constraints. This problem is central to quantitative finance. After a detailed introduction to the problem, we implement a number of quantum and quantum-inspired…

    Submitted 6 December, 2021; v1 submitted 30 June, 2020; originally announced July 2020.

    Comments: 13 pages, 5 figures, 5 tables, revised version, to appear in Physical Review Research

    Journal ref: Phys. Rev. Research 4, 013006 (2022)

  44. arXiv:2004.07165  [pdf, other]

    cs.CV

    A recurrent cycle consistency loss for progressive face-to-face synthesis

    Authors: Enrique Sanchez, Michel Valstar

    Abstract: This paper addresses a major flaw of the cycle consistency loss when used to preserve the input appearance in the face-to-face synthesis domain. In particular, we show that the images generated by a network trained using this loss conceal a noise that hinders their use for further tasks. To overcome this limitation, we propose a "recurrent cycle consistency loss" which for different sequences of…

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: Accepted to FG 2020 (Oral). arXiv admin note: substantial text overlap with arXiv:1811.03492

  45. arXiv:2004.06657  [pdf, other]

    cs.CV

    A Transfer Learning approach to Heatmap Regression for Action Unit intensity estimation

    Authors: Ioanna Ntinou, Enrique Sanchez, Adrian Bulat, Michel Valstar, Georgios Tzimiropoulos

    Abstract: Action Units (AUs) are geometrically-based atomic facial muscle movements known to produce appearance changes at specific facial locations. Motivated by this observation we propose a novel AU modelling problem that consists of jointly estimating their localisation and intensity. To this end, we propose a simple yet efficient approach based on Heatmap Regression that merges both problems into a sin…

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: Submitted for review to IEEE Trans. on Affective Computing

  46. arXiv:1912.03915  [pdf, other]

    stat.ML cs.LG

    Learning Disentangled Representations via Mutual Information Estimation

    Authors: Eduardo Hugo Sanchez, Mathieu Serrurier, Mathias Ortner

    Abstract: In this paper, we investigate the problem of learning disentangled representations. Given a pair of images sharing some attributes, we aim to create a low-dimensional representation which is split into two parts: a shared representation that captures the common information between the images and an exclusive representation that contains the specific information of each image. To address this issue…

    Submitted 9 December, 2019; originally announced December 2019.

  47. arXiv:1910.09469  [pdf, other]

    cs.CV cs.LG eess.IV

    Object landmark discovery through unsupervised adaptation

    Authors: Enrique Sanchez, Georgios Tzimiropoulos

    Abstract: This paper proposes a method to ease the unsupervised learning of object landmark detectors. Similarly to previous methods, our approach is fully unsupervised in a sense that it does not require or make any use of annotated landmarks for the target object category. Contrary to previous works, we do however assume that a landmark detector, which has already learned a structured representation for a…

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019. Code is available https://github.com/ESanchezLozano/SAIC-Unsupervised-landmark-detection-NeurIPS2019

  48. arXiv:1907.07776  [pdf]

    cs.AR cs.LG

    CADS: Core-Aware Dynamic Scheduler for Multicore Memory Controllers

    Authors: Eduardo Olmedo Sanchez, Xian-He Sun

    Abstract: Memory controller scheduling is crucial in multicore processors, where DRAM bandwidth is shared. Since increased number of requests from multiple cores of processors becomes a source of bottleneck, scheduling the requests efficiently is necessary to utilize all the computing power these processors offer. However, current multicore processors are using traditional memory controllers, which are desi…

    Submitted 17 July, 2019; originally announced July 2019.

  49. arXiv:1903.08863  [pdf, other]

    cs.CV cs.LG

    Learning Disentangled Representations of Satellite Image Time Series

    Authors: Eduardo Sanchez, Mathieu Serrurier, Mathias Ortner

    Abstract: In this paper, we investigate how to learn a suitable representation of satellite image time series in an unsupervised manner by leveraging large amounts of unlabeled data. Additionally, we aim to disentangle the representation of time series into two representations: a shared representation that captures the common information between the images of a time series and an exclusive representation t…

    Submitted 21 March, 2019; originally announced March 2019.

  50. arXiv:1811.03492  [pdf, other]

    cs.CV

    Triple consistency loss for pairing distributions in GAN-based face synthesis

    Authors: Enrique Sanchez, Michel Valstar

    Abstract: Generative Adversarial Networks have shown impressive results for the task of object translation, including face-to-face translation. A key component behind the success of recent approaches is the self-consistency loss, which encourages a network to recover the original input image when the output generated for a desired attribute is itself passed through the same network, but with the target attr…

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: Project site https://github.com/ESanchezLozano/GANnotation , https://youtu.be/-8r7zexg4yg