
Showing 1–50 of 106 results for author: Araújo, A

Searching in archive cs.
  1. arXiv:2504.20196  [pdf, other]

    cs.SE cs.AI cs.HC

    Prompting LLMs for Code Editing: Struggles and Remedies

    Authors: Daye Nam, Ahmed Omran, Ambar Murillo, Saksham Thakur, Abner Araujo, Marcel Blistein, Alexander Frömmgen, Vincent Hellendoorn, Satish Chandra

    Abstract: Large Language Models (LLMs) are rapidly transforming software engineering, with coding assistants embedded in an IDE becoming increasingly prevalent. While research has focused on improving the tools and understanding developer perceptions, a critical gap exists in understanding how developers actually use these tools in their daily workflows, and, crucially, where they struggle. This paper addre…

    Submitted 28 April, 2025; originally announced April 2025.

  2. arXiv:2503.21581  [pdf, other]

    cs.CV cs.AI

    AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion

    Authors: Liuyue Xie, Jiancong Guo, Ozan Cakmakci, Andre Araujo, Laszlo A. Jeni, Zhiheng Jia

    Abstract: Accurate camera calibration is a fundamental task for 3D perception, especially when dealing with real-world, in-the-wild environments where complex optical distortions are common. Existing methods often rely on pre-rectified images or calibration patterns, which limits their applicability and flexibility. In this work, we introduce a novel framework that addresses these challenges by jointly mode…

    Submitted 27 March, 2025; originally announced March 2025.

  3. Charting 5G Energy Efficiency: Flexible Energy Modeling for Sustainable Networks

    Authors: Anderson L de Araujo, Luc Deneire, Guillaume Urvoy-Keller, André L F de Almeida

    Abstract: Despite the rapid advancements in 5G technology, accurately assessing the energy consumption of its Radio Access Networks (RANs) remains a challenge due to the diverse range of applicable technologies and implementation solutions. Designing a versatile power model for estimating the 5G RAN-specific power consumption requires extensive data collection and experimental studies to capture the diverse…

    Submitted 12 March, 2025; originally announced March 2025.

    Journal ref: 20th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob 2024), Oct 2024, Paris, France. pp.721-726

  4. arXiv:2502.07950  [pdf, other]

    cs.SE

    Embracing Experiential Learning: Hackathons as an Educational Strategy for Shaping Soft Skills in Software Engineering

    Authors: Allysson Allex Araújo, Marcos Kalinowski, Maria Teresa Baldassarre

    Abstract: In recent years, Software Engineering (SE) scholars and practitioners have emphasized the importance of integrating soft skills into SE education. However, teaching and learning soft skills are complex, as they cannot be acquired passively through raw knowledge acquisition. On the other hand, hackathons have attracted increasing attention due to their experiential, collaborative, and intensive nat…

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: To appear in Proceedings of the 2025 IEEE Conference on Software Engineering Education and Training, CSEE&T 2025

  5. arXiv:2502.05108  [pdf, other]

    cs.SE

    Towards Emotionally Intelligent Software Engineers: Understanding Students' Self-Perceptions After a Cooperative Learning Experience

    Authors: Allysson Allex Araújo, Marcos Kalinowski, Matheus Paixao, Daniel Graziotin

    Abstract: [Background] Emotional Intelligence (EI) can impact Software Engineering (SE) outcomes through improved team communication, conflict resolution, and stress management. SE workers face increasing pressure to develop both technical and interpersonal skills, as modern software development emphasizes collaborative work and complex team interactions. Despite EI's documented importance in professional p…

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: 12 pages, 4 figures. To appear in Proceedings of the 2025 IEEE/ACM 17th International Conference on Cooperative and Human Aspects of Software Engineering, CHASE 2025

  6. arXiv:2501.11431  [pdf, other]

    cs.SE

    Blockchain Developer Experience: A Multivocal Literature Review

    Authors: P. Soares, A. A. Araujo, G. Destefanis, R. Neykova, R. Saraiva, J. Souza

    Abstract: The rise of smart contracts has expanded blockchain's capabilities, enabling the development of innovative decentralized applications (dApps). However, this advancement brings its own challenges, including the management of distributed architectures and immutable data. Addressing these complexities requires a specialized approach to software engineering, with blockchain-oriented practices emerging…

    Submitted 20 January, 2025; originally announced January 2025.

    Comments: 12 pages, 5 figures, 18th Conference on Cooperative and Human Aspects of Software Engineering (CHASE)

  7. arXiv:2411.17753  [pdf, other]

    cs.DC

    Observability in Fog Computing

    Authors: Aleteia Araujo, Breno Costa, Joao Bachiega Jr, Leonardo R. Carvalho, Rajkumar Buyya

    Abstract: Fog Computing provides computational resources close to the end user, supporting low-latency and high-bandwidth communications. It supports IoT applications, enabling real-time data processing, analytics, and decision-making at the edge of the network. However, the high distribution of its constituent nodes and resource-restricted devices interconnected by heterogeneous and unreliable networks mak…

    Submitted 25 November, 2024; originally announced November 2024.

  8. arXiv:2410.19439  [pdf, other]

    cs.NE

    Non-Dominated Sorting Bidirectional Differential Coevolution

    Authors: Cicero S. R. Mendes, Aluizio F. R. Araújo, Lucas R. C. Farias

    Abstract: Constrained multiobjective optimization problems (CMOPs) are commonly found in real-world applications. CMOP is a complex problem that needs to satisfy a set of equality or inequality constraints. This paper proposes a variant of the bidirectional coevolution algorithm (BiCo) with differential evolution (DE). The novelties in the model include the DE differential mutation and crossover operators a…

    Submitted 25 October, 2024; originally announced October 2024.

    Comments: 6 pages

  9. arXiv:2410.19203  [pdf, other]

    cs.NE cs.AI cs.LG

    An Inverse Modeling Constrained Multi-Objective Evolutionary Algorithm Based on Decomposition

    Authors: Lucas R. C. Farias, Aluizio F. R. Araújo

    Abstract: This paper introduces the inverse modeling constrained multi-objective evolutionary algorithm based on decomposition (IM-C-MOEA/D) for addressing constrained real-world optimization problems. Our research builds upon the advancements made in evolutionary computing-based inverse modeling, and it strategically bridges the gaps in applying inverse models based on decomposition to problem domains with…

    Submitted 24 October, 2024; originally announced October 2024.

    Comments: 6 pages, 1 figure, 1 algorithm, and 2 tables

  10. arXiv:2410.16512  [pdf, other]

    cs.CV

    TIPS: Text-Image Pretraining with Spatial awareness

    Authors: Kevis-Kokitsi Maninis, Kaifeng Chen, Soham Ghosh, Arjun Karpur, Koert Chen, Ye Xia, Bingyi Cao, Daniel Salz, Guangxing Han, Jan Dlabal, Dan Gnanapragasam, Mojtaba Seyedhosseini, Howard Zhou, Andre Araujo

    Abstract: While image-text representation learning has become very popular in recent years, existing models tend to lack spatial awareness and have limited direct applicability for dense understanding tasks. For this reason, self-supervised image-only pretraining is still the go-to method for many dense vision applications (e.g. depth estimation, semantic segmentation), despite the lack of explicit supervis…

    Submitted 7 March, 2025; v1 submitted 21 October, 2024; originally announced October 2024.

    Comments: ICLR2025 camera-ready + appendix

  11. arXiv:2410.02080  [pdf, ps, other]

    cs.CV cs.CL cs.LG

    EMMA: Efficient Visual Alignment in Multi-Modal LLMs

    Authors: Sara Ghazanfari, Alexandre Araujo, Prashanth Krishnamurthy, Siddharth Garg, Farshad Khorrami

    Abstract: Multi-modal Large Language Models (MLLMs) have recently exhibited impressive general-purpose capabilities by leveraging vision foundation models to encode the core concepts of images into representations. These are then combined with instructions and processed by the language model to generate high-quality responses. Despite significant progress in enhancing the language component, challenges pers…

    Submitted 10 June, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

  12. arXiv:2409.07926  [pdf, ps, other]

    cs.CR cs.SE

    Mobile App Security Trends and Topics: An Examination of Questions From Stack Overflow

    Authors: Timothy Huo, Ana Catarina Araújo, Jake Imanaka, Anthony Peruma, Rick Kazman

    Abstract: The widespread use of smartphones and tablets has made society heavily reliant on mobile applications (apps) for accessing various resources and services. These apps often handle sensitive personal, financial, and health data, making app security a critical concern for developers. While there is extensive research on software security topics like malware and vulnerabilities, less is known about th…

    Submitted 14 September, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

    Comments: This paper was accepted for publication at the 58th Hawaii International Conference on System Sciences (HICSS) - Software Technology Track

  13. Micro and macro facial expressions by driven animations in realistic Virtual Humans

    Authors: Rubens Halbig Montanha, Giovana Nascimento Raupp, Ana Carolina Policarpo Schmitt, Victor Flávio de Andrade Araujo, Soraia Raupp Musse

    Abstract: Computer Graphics (CG) advancements have allowed the creation of more realistic Virtual Humans (VH) through modern techniques for animating the VH body and face, thereby affecting perception. From traditional methods, including blend shapes, to driven animations using facial and body tracking, these advancements can potentially enhance the perception of comfort and realism in relation to VHs. Prev…

    Submitted 28 August, 2024; originally announced August 2024.

    Journal ref: Entertainment Computing, Volume 52, January 2025

  14. arXiv:2408.09032  [pdf, other]

    cs.CR cs.SE

    A Developer-Centric Study Exploring Mobile Application Security Practices and Challenges

    Authors: Anthony Peruma, Timothy Huo, Ana Catarina Araújo, Jake Imanaka, Rick Kazman

    Abstract: Mobile applications (apps) have become an essential part of everyday life, offering convenient access to services such as banking, healthcare, and shopping. With these apps handling sensitive personal and financial data, ensuring their security is paramount. While previous research has explored mobile app developer practices, there is limited knowledge about the common practices and challenges tha…

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: Accepted: International Conference on Software Maintenance and Evolution (ICSME 2024); Industry Track

  15. arXiv:2407.21127  [pdf]

    cs.SE

    Teaching Survey Research in Software Engineering

    Authors: Marcos Kalinowski, Allysson Allex Araújo, Daniel Mendez

    Abstract: In this chapter, we provide advice on how to effectively teach survey research based on lessons learned from several international teaching experiences on the topic and from conducting large-scale surveys published at various scientific conferences and journals. First, we provide teachers with a potential syllabus for teaching survey research, including learning objectives, lectures, and examples…

    Submitted 30 July, 2024; originally announced July 2024.

  16. arXiv:2407.21121  [pdf, other]

    cs.LG cs.CV

    Tuning the Frequencies: Robust Training for Sinusoidal Neural Networks

    Authors: Tiago Novello, Diana Aldana, Andre Araujo, Luiz Velho

    Abstract: Sinusoidal neural networks have been shown effective as implicit neural representations (INRs) of low-dimensional signals, due to their smoothness and high representation capacity. However, initializing and training them remain empirical tasks which lack on deeper understanding to guide the learning process. To fill this gap, our work introduces a theoretical framework that explains the capacity p…

    Submitted 3 April, 2025; v1 submitted 30 July, 2024; originally announced July 2024.

    Comments: CVPR2025 camera-ready + supplementary material

  17. arXiv:2407.15982  [pdf]

    cs.SE

    Agile Minds, Innovative Solutions, and Industry-Academia Collaboration: Lean R&D Meets Problem-Based Learning in Software Engineering Education

    Authors: Lucas Romao, Marcos Kalinowski, Clarissa Barbosa, Allysson Allex Araújo, Simone D. J. Barbosa, Helio Lopes

    Abstract: [Context] Software Engineering (SE) education constantly seeks to bridge the gap between academic knowledge and industry demands, with active learning methods like Problem-Based Learning (PBL) gaining prominence. Despite these efforts, recent graduates struggle to align skills with industry needs. Recognizing the relevance of Industry-Academia Collaboration (IAC), Lean R&D has emerged as a success…

    Submitted 22 July, 2024; originally announced July 2024.

  18. arXiv:2407.15829  [pdf]

    cs.SE

    Investigating Benefits and Limitations of Migrating to a Micro-Frontends Architecture

    Authors: Fabio Antunes, Maria Julia Dias Lima, Marco Antônio Pereira Araújo, Davide Taibi, Marcos Kalinowski

    Abstract: [Context] The adoption of micro-frontends architectures has gained traction as a promising approach to enhance modularity, scalability, and maintainability of web applications. [Goal] The primary aim of this research is to investigate the benefits and limitations of migrating a real-world application to a micro-frontends architecture from the perspective of the developers. [Method] Based on the ac…

    Submitted 22 July, 2024; originally announced July 2024.

  19. arXiv:2407.15821  [pdf]

    cs.SE

    Towards Effective Collaboration between Software Engineers and Data Scientists developing Machine Learning-Enabled Systems

    Authors: Gabriel Busquim, Allysson Allex Araújo, Maria Julia Lima, Marcos Kalinowski

    Abstract: Incorporating Machine Learning (ML) into existing systems is a demand that has grown among several organizations. However, the development of ML-enabled systems encompasses several social and technical challenges, which must be addressed by actors with different fields of expertise working together. This paper has the objective of understanding how to enhance the collaboration between two key acto…

    Submitted 22 July, 2024; originally announced July 2024.

  20. Achieving Observability on Fog Computing with the use of open-source tools

    Authors: Breno Costa, Abhik Banerjee, Prem Prakash Jayaraman, Leonardo R. Carvalho, João Bachiega Jr., Aleteia Araujo

    Abstract: Fog computing can provide computational resources and low-latency communication at the network edge. But with it comes uncertainties that must be managed in order to guarantee Service Level Agreements. Service observability can help the environment better deal with uncertainties, delivering relevant and up-to-date information in a timely manner to support decision making. Observability is consider…

    Submitted 25 May, 2024; originally announced July 2024.

    Comments: Paper presented at Mobiquitous 2023

  21. arXiv:2406.08332  [pdf, other]

    cs.CV

    UDON: Universal Dynamic Online distillatioN for generic image representations

    Authors: Nikolaos-Antonios Ypsilantis, Kaifeng Chen, André Araujo, Ondřej Chum

    Abstract: Universal image representations are critical in enabling real-world fine-grained and instance-level recognition applications, where objects and entities from any domain must be identified at large scale. Despite recent advances, existing methods fail to capture important domain-specific knowledge, while also ignoring differences in data distribution across different domains. This leads to a large…

    Submitted 9 December, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: NeurIPS 2024 accepted

  22. Understanding and measuring software engineer behavior: What can we learn from the behavioral sciences?

    Authors: Allysson Allex Araújo, Marcos Kalinowski, Daniel Graziotin

    Abstract: This paper explores the intricate challenge of understanding and measuring software engineer behavior. More specifically, we revolve around a central question: How can we enhance our understanding of software engineer behavior? Grounded in the nuanced complexities addressed within Behavioral Software Engineering (BSE), we advocate for holistic methods that integrate quantitative measures, such as…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 6

  23. arXiv:2406.02487  [pdf]

    cs.SE

    Investigating the Online Recruitment and Selection Journey of Novice Software Engineers: Anti-patterns and Recommendations

    Authors: Miguel Setúbal, Tayana Conte, Marcos Kalinowski, Allysson Allex Araújo

    Abstract: [Context] The growing software development market has increased the demand for qualified professionals in Software Engineering (SE). To this end, companies must enhance their Recruitment and Selection (R&S) processes to maintain high quality teams, including opening opportunities for beginners, such as trainees and interns. However, given the various judgments and sociotechnical factors involved,…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 33 pages

  24. arXiv:2405.12979  [pdf, other]

    cs.CV

    OmniGlue: Generalizable Feature Matching with Foundation Model Guidance

    Authors: Hanwen Jiang, Arjun Karpur, Bingyi Cao, Qixing Huang, Andre Araujo

    Abstract: The image matching field has been witnessing a continuous emergence of novel learnable feature matching techniques, with ever-improving performance on conventional benchmarks. However, our investigation shows that despite these gains, their potential for real-world applications is restricted by their limited generalization capabilities to novel image domains. In this paper, we introduce OmniGlue,…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: CVPR 2024

  25. arXiv:2404.19174  [pdf, other]

    cs.CV

    XFeat: Accelerated Features for Lightweight Image Matching

    Authors: Guilherme Potje, Felipe Cadar, Andre Araujo, Renato Martins, Erickson R. Nascimento

    Abstract: We introduce a lightweight and accurate architecture for resource-efficient visual correspondence. Our method, dubbed XFeat (Accelerated Features), revisits fundamental design choices in convolutional neural networks for detecting, extracting, and matching local features. Our new model satisfies a critical need for fast and robust algorithms suitable to resource-limited devices. In particular, acc…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: CVPR 2024; Source code available at www.verlab.dcc.ufmg.br/descriptors/xfeat_cvpr24

  26. arXiv:2404.05465  [pdf, other]

    cs.CV cs.LG

    HAMMR: HierArchical MultiModal React agents for generic VQA

    Authors: Lluis Castrejon, Thomas Mensink, Howard Zhou, Vittorio Ferrari, Andre Araujo, Jasper Uijlings

    Abstract: Combining Large Language Models (LLMs) with external specialized tools (LLMs+tools) is a recent paradigm to solve multimodal tasks such as Visual Question Answering (VQA). While this approach was demonstrated to work well when optimized and evaluated for each individual benchmark, in practice it is crucial for the next generation of real-world AI systems to handle a broad range of multimodal probl…

    Submitted 14 October, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  27. arXiv:2404.04809  [pdf, other]

    cs.CL

    Low-Resource Machine Translation through Retrieval-Augmented LLM Prompting: A Study on the Mambai Language

    Authors: Raphaël Merx, Aso Mahmudi, Katrina Langford, Leo Alberto de Araujo, Ekaterina Vylomova

    Abstract: This study explores the use of large language models (LLMs) for translating English into Mambai, a low-resource Austronesian language spoken in Timor-Leste, with approximately 200,000 native speakers. Leveraging a novel corpus derived from a Mambai language manual and additional sentences translated by a native speaker, we examine the efficacy of few-shot LLM prompting for machine translation (MT)…

    Submitted 7 April, 2024; originally announced April 2024.

    Report number: https://aclanthology.org/2024.eurali-1.1/

  28. arXiv:2402.09674  [pdf, other]

    cs.CL cs.AI cs.CR cs.LG

    PAL: Proxy-Guided Black-Box Attack on Large Language Models

    Authors: Chawin Sitawarin, Norman Mu, David Wagner, Alexandre Araujo

    Abstract: Large Language Models (LLMs) have surged in popularity in recent months, but they have demonstrated concerning capabilities to generate harmful content when manipulated. While techniques like safety fine-tuning aim to minimize harmful use, recent works have shown that LLMs remain vulnerable to attacks that elicit toxic responses. In this work, we introduce the Proxy-Guided Attack on LLMs (PAL), th…

    Submitted 14 February, 2024; originally announced February 2024.

  29. arXiv:2402.05339  [pdf]

    cs.SE

    Can participation in a hackathon impact the motivation of software engineering students? A preliminary case study analysis

    Authors: Allysson Allex Araújo, Marcos Kalinowski, Maria Teresa Baldassarre

    Abstract: [Background] Hackathons are increasingly gaining prominence in Software Engineering (SE) education, lauded for their ability to elevate students' skill sets. [Objective] This paper investigates whether hackathons can impact the motivation of SE students. [Method] We conducted an evaluative case study assessing students' motivations before and after a hackathon, combining quantitative analysis usin…

    Submitted 7 February, 2024; originally announced February 2024.

  30. arXiv:2402.03337  [pdf, other]

    cs.RO cs.AI cs.LG

    Reinforcement-learning robotic sailboats: simulator and preliminary results

    Authors: Eduardo Charles Vasconcellos, Ronald M Sampaio, André P D Araújo, Esteban Walter Gonzales Clua, Philippe Preux, Raphael Guerra, Luiz M G Gonçalves, Luis Martí, Hernan Lira, Nayat Sanchez-Pi

    Abstract: This work focuses on the main challenges and problems in developing a virtual oceanic environment reproducing real experiments using Unmanned Surface Vehicles (USV) digital twins. We introduce the key features for building virtual worlds, considering using Reinforcement Learning (RL) agents for autonomous navigation and control. With this in mind, the main problems concern the definition of the si…

    Submitted 16 January, 2024; originally announced February 2024.

    Journal ref: NeurIPS 2023 Workshop on Robot Learning Workshop: Pretraining, Fine-Tuning, and Generalization with Large Scale Models, Dec 2023, New Orleans, United States

  31. arXiv:2401.14033  [pdf, ps, other]

    cs.LG

    Novel Quadratic Constraints for Extending LipSDP beyond Slope-Restricted Activations

    Authors: Patricia Pauli, Aaron Havens, Alexandre Araujo, Siddharth Garg, Farshad Khorrami, Frank Allgöwer, Bin Hu

    Abstract: Recently, semidefinite programming (SDP) techniques have shown great promise in providing accurate Lipschitz bounds for neural networks. Specifically, the LipSDP approach (Fazlyab et al., 2019) has received much attention and provides the least conservative Lipschitz upper bounds that can be computed with polynomial time guarantees. However, one main restriction of LipSDP is that its formulation r…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: accepted as a conference paper at ICLR 2024

  32. arXiv:2311.17846  [pdf, other]

    cs.CV

    Towards Real-World Focus Stacking with Deep Learning

    Authors: Alexandre Araujo, Jean Ponce, Julien Mairal

    Abstract: Focus stacking is widely used in micro, macro, and landscape photography to reconstruct all-in-focus images from multiple frames obtained with focus bracketing, that is, with shallow depth of field and different focus planes. Existing deep learning approaches to the underlying multi-focus image fusion problem have limited applicability to real-world imagery since they are designed for very short i…

    Submitted 29 November, 2023; originally announced November 2023.

  33. arXiv:2311.05452  [pdf, other]

    eess.IV cs.CV cs.LG

    Transformer-based Model for Oral Epithelial Dysplasia Segmentation

    Authors: Adam J Shephard, Hanya Mahmood, Shan E Ahmed Raza, Anna Luiza Damaceno Araujo, Alan Roger Santos-Silva, Marcio Ajudarte Lopes, Pablo Agustin Vargas, Kris McCombe, Stephanie Craig, Jacqueline James, Jill Brooks, Paul Nankivell, Hisham Mehanna, Syed Ali Khurram, Nasir M Rajpoot

    Abstract: Oral epithelial dysplasia (OED) is a premalignant histopathological diagnosis given to lesions of the oral cavity. OED grading is subject to large inter/intra-rater variability, resulting in the under/over-treatment of patients. We developed a new Transformer-based pipeline to improve detection and segmentation of OED in haematoxylin and eosin (H&E) stained whole slide images (WSIs). Our model was…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 5 pages, 2 figures, 4 tables

  34. arXiv:2310.18274  [pdf, other]

    cs.CV cs.LG

    LipSim: A Provably Robust Perceptual Similarity Metric

    Authors: Sara Ghazanfari, Alexandre Araujo, Prashanth Krishnamurthy, Farshad Khorrami, Siddharth Garg

    Abstract: Recent years have seen growing interest in developing and applying perceptual similarity metrics. Research has shown the superiority of perceptual metrics over pixel-wise metrics in aligning with human perception and serving as a proxy for the human visual system. On the other hand, as perceptual metrics rely on neural networks, there is a growing concern regarding their resilience, given the esta…

    Submitted 29 March, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  35. arXiv:2310.03664  [pdf, other]

    eess.IV cs.CV

    Certification of Deep Learning Models for Medical Image Segmentation

    Authors: Othmane Laousy, Alexandre Araujo, Guillaume Chassagnon, Nikos Paragios, Marie-Pierre Revel, Maria Vakalopoulou

    Abstract: In medical imaging, segmentation models have known a significant improvement in the past decade and are now used daily in clinical practice. However, similar to classification models, segmentation models are affected by adversarial attacks. In a safety-critical field like healthcare, certifying model predictions is of the utmost importance. Randomized smoothing has been introduced lately and provi…

    Submitted 5 October, 2023; originally announced October 2023.

  36. arXiv:2309.16883  [pdf, other]

    cs.LG stat.ML

    The Lipschitz-Variance-Margin Tradeoff for Enhanced Randomized Smoothing

    Authors: Blaise Delattre, Alexandre Araujo, Quentin Barthélemy, Alexandre Allauzen

    Abstract: Real-life applications of deep neural networks are hindered by their unsteady predictions when faced with noisy inputs and adversarial attacks. The certified radius in this context is a crucial indicator of the robustness of models. However how to design an efficient classifier with an associated certified radius? Randomized smoothing provides a promising framework by relying on noise injection in…

    Submitted 18 March, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

  37. arXiv:2309.08250  [pdf, other]

    cs.CV

    Optimization of Rank Losses for Image Retrieval

    Authors: Elias Ramzi, Nicolas Audebert, Clément Rambour, André Araujo, Xavier Bitot, Nicolas Thome

    Abstract: In image retrieval, standard evaluation metrics rely on score ranking, e.g., average precision (AP), recall at k (R@k), normalized discounted cumulative gain (NDCG). In this work we introduce a general framework for robust and decomposable rank losses optimization. It addresses two major challenges for end-to-end training of deep neural networks with rank losses: non-differentiability and non-decomp…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2207.04873

  38. arXiv:2309.01858  [pdf, other]

    cs.CV

    Towards Universal Image Embeddings: A Large-Scale Dataset and Challenge for Generic Image Representations

    Authors: Nikolaos-Antonios Ypsilantis, Kaifeng Chen, Bingyi Cao, Mário Lipovský, Pelin Dogan-Schönberger, Grzegorz Makosa, Boris Bluntschli, Mojtaba Seyedhosseini, Ondřej Chum, André Araujo

    Abstract: Fine-grained and instance-level recognition methods are commonly trained and evaluated on specific domains, in a model per domain scenario. Such an approach, however, is impractical in real large-scale applications. In this work, we address the problem of universal image embedding, where a single universal model is trained and used in multiple domains. First, we leverage existing domain-specific d…

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: ICCV 2023 Accepted

  39. arXiv:2308.13363  [pdf, other]

    cs.CV

    CS-Mixer: A Cross-Scale Vision MLP Model with Spatial-Channel Mixing

    Authors: Jonathan Cui, David A. Araujo, Suman Saha, Md. Faisal Kabir

    Abstract: Despite their simpler information fusion designs compared with Vision Transformers and Convolutional Neural Networks, Vision MLP architectures have demonstrated strong performance and high data efficiency in recent research. However, existing works such as CycleMLP and Vision Permutator typically model spatial information in equal-size spatial regions and do not consider cross-scale spatial intera…

    Submitted 14 January, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 figures, developed under Penn State University's Multi-Campus Research Experience for Undergraduates Symposium, 2023. This work has been submitted to the IEEE for possible publication

  40. arXiv:2308.06954  [pdf, other]

    cs.CV

    Global Features are All You Need for Image Retrieval and Reranking

    Authors: Shihao Shao, Kaifeng Chen, Arjun Karpur, Qinghua Cui, Andre Araujo, Bingyi Cao

    Abstract: Image retrieval systems conventionally use a two-stage paradigm, leveraging global features for initial retrieval and local features for reranking. However, the scalability of this method is often limited due to the significant storage and computation cost incurred by local feature matching in the reranking stage. In this paper, we present SuperGlobal, a novel approach that exclusively employs glo…

    Submitted 19 August, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: ICCV23 camera-ready + appendix

  41. arXiv:2308.05810  [pdf, other]

    cs.CV

    Spintronics for image recognition: performance benchmarking via ultrafast data-driven simulations

    Authors: Anatole Moureaux, Chloé Chopin, Simon de Wergifosse, Laurent Jacques, Flavio Abreu Araujo

    Abstract: We present a demonstration of image classification using an echo-state network (ESN) relying on a single simulated spintronic nanostructure known as the vortex-based spin-torque oscillator (STVO) delayed in time. We employ an ultrafast data-driven simulation framework called the data-driven Thiele equation approach (DD-TEA) to simulate the STVO dynamics. This allows us to avoid the challenges asso…

    Submitted 7 February, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: 6 pages, 4 figures

  42. arXiv:2307.15157  [pdf, other]

    cs.CV cs.LG eess.IV

    R-LPIPS: An Adversarially Robust Perceptual Similarity Metric

    Authors: Sara Ghazanfari, Siddharth Garg, Prashanth Krishnamurthy, Farshad Khorrami, Alexandre Araujo

    Abstract: Similarity metrics have played a significant role in computer vision to capture the underlying semantics of images. In recent years, advanced similarity metrics, such as the Learned Perceptual Image Patch Similarity (LPIPS), have emerged. These metrics leverage deep features extracted from trained neural networks and have demonstrated a remarkable ability to closely align with human perception whe…

    Submitted 31 July, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

  43. arXiv:2307.09262  [pdf, other]

    cs.CV

    Neuromorphic spintronics simulated using an unconventional data-driven Thiele equation approach

    Authors: Anatole Moureaux, Simon de Wergifosse, Chloé Chopin, Flavio Abreu Araujo

    Abstract: In this study, we developed a quantitative description of the dynamics of spin-torque vortex nano-oscillators (STVOs) through an unconventional model based on the combination of the Thiele equation approach (TEA) and data from micromagnetic simulations (MMS). Solving the STVO dynamics with our analytical model allows to accelerate the simulations by 9 orders of magnitude compared to MMS while reac…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: Presented in ISCS2023

    Report number: ISCS23-46

  44. arXiv:2306.09949  [pdf, other]

    cs.CV

    Towards Better Certified Segmentation via Diffusion Models

    Authors: Othmane Laousy, Alexandre Araujo, Guillaume Chassagnon, Marie-Pierre Revel, Siddharth Garg, Farshad Khorrami, Maria Vakalopoulou

    Abstract: The robustness of image segmentation has been an important research topic in the past few years as segmentation models have reached production-level accuracy. However, like classification models, segmentation models can be vulnerable to adversarial perturbations, which hinders their use in critical-decision systems like healthcare or autonomous driving. Recently, randomized smoothing has been prop…

    Submitted 16 June, 2023; originally announced June 2023.

  45. arXiv:2306.09224  [pdf, other]

    cs.CV

    Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories

    Authors: Thomas Mensink, Jasper Uijlings, Lluis Castrejon, Arushi Goel, Felipe Cadar, Howard Zhou, Fei Sha, André Araujo, Vittorio Ferrari

    Abstract: We propose Encyclopedic-VQA, a large scale visual question answering (VQA) dataset featuring visual questions about detailed properties of fine-grained categories and instances. It contains 221k unique question+answer pairs each matched with (up to) 5 images, resulting in a total of 1M VQA samples. Moreover, our dataset comes with a controlled knowledge base derived from Wikipedia, marking the evi…

    Submitted 24 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: ICCV'23

  46. arXiv:2306.09109  [pdf, other]

    cs.CV

    NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

    Authors: Varun Jampani, Kevis-Kokitsi Maninis, Andreas Engelhardt, Arjun Karpur, Karen Truong, Kyle Sargent, Stefan Popov, André Araujo, Ricardo Martin-Brualla, Kaushal Patel, Daniel Vlasic, Vittorio Ferrari, Ameesh Makadia, Ce Liu, Yuanzhen Li, Howard Zhou

    Abstract: Recent advances in neural reconstruction enable high-quality 3D object reconstruction from casually captured image collections. Current techniques mostly analyze their progress on relatively simple image collections where Structure-from-Motion (SfM) techniques can provide ground-truth (GT) camera poses. We note that SfM techniques tend to fail on in-the-wild image collections such as image search…

    Submitted 13 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera ready. Project page: https://navidataset.github.io

  47. arXiv:2306.09012  [pdf, other]

    cs.CV

    Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization

    Authors: Dror Aiger, André Araujo, Simon Lynen

    Abstract: Large-scale visual localization systems continue to rely on 3D point clouds built from image collections using structure-from-motion. While the 3D points in these models are represented using local image features, directly matching a query image's local features against the point cloud is challenging due to the scale of the nearest-neighbor search problem. Many recent approaches to visual localiza…

    Submitted 29 December, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: ICCV23 camera-ready + appendix

  48. arXiv:2305.16494  [pdf, other]

    cs.CV cs.AI

    Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability

    Authors: Haotian Xue, Alexandre Araujo, Bin Hu, Yongxin Chen

    Abstract: Neural networks are known to be susceptible to adversarial samples: small variations of natural examples crafted to deliberately mislead the models. While they can be easily generated using gradient-based techniques in digital and physical scenarios, they often differ greatly from the actual data distribution of natural images, resulting in a trade-off between strength and stealthiness. In this pa…

    Submitted 17 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted as a conference paper in NeurIPS'2023. Code repo: https://github.com/xavihart/Diff-PGD

  49. arXiv:2305.16173  [pdf, other]

    cs.LG cs.AI

    Efficient Bound of Lipschitz Constant for Convolutional Layers by Gram Iteration

    Authors: Blaise Delattre, Quentin Barthélemy, Alexandre Araujo, Alexandre Allauzen

    Abstract: Since the control of the Lipschitz constant has a great impact on the training stability, generalization, and robustness of neural networks, the estimation of this value is nowadays a real scientific challenge. In this paper we introduce a precise, fast, and differentiable upper bound for the spectral norm of convolutional layers using circulant matrix theory and a new alternative to the Power ite…

    Submitted 19 June, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  50. arXiv:2304.00583  [pdf, other]

    cs.CV

    Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints

    Authors: Guilherme Potje, Felipe Cadar, Andre Araujo, Renato Martins, Erickson R. Nascimento

    Abstract: Local feature extraction is a standard approach in computer vision for tackling important tasks such as image matching and retrieval. The core assumption of most methods is that images undergo affine transformations, disregarding more complicated effects such as non-rigid deformations. Furthermore, incipient works tailored for non-rigid correspondence still rely on keypoint detectors designed for…

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: CVPR 2023; Source code available at https://verlab.dcc.ufmg.br/descriptors/dalf_cvpr23