Skip to main content

Showing 1–50 of 61 results for author: Pinto, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.19786  [pdf, other

    cs.CL cs.AI

    Gemma 3 Technical Report

    Authors: Gemma Team, Aishwarya Kamath, Johan Ferret, Shreya Pathak, Nino Vieillard, Ramona Merhej, Sarah Perrin, Tatiana Matejovicova, Alexandre Ramé, Morgane Rivière, Louis Rouillard, Thomas Mesnard, Geoffrey Cideron, Jean-bastien Grill, Sabela Ramos, Edouard Yvinec, Michelle Casbon, Etienne Pot, Ivo Penchev, Gaël Liu, Francesco Visin, Kathleen Kenealy, Lucas Beyer, Xiaohai Zhai, Anton Tsitsulin , et al. (191 additional authors not shown)

    Abstract: We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context - at least 128K tokens. We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achie… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  2. arXiv:2502.02437  [pdf, other

    cs.DC eess.SY

    H-MBR: Hypervisor-level Memory Bandwidth Reservation for Mixed Criticality Systems

    Authors: Afonso Oliveira, Diogo Costa, Gonçalo Moreira, José Martins, Sandro Pinto

    Abstract: Recent advancements in fields such as automotive and aerospace have driven a growing demand for robust computational resources. Applications that were once designed for basic MCUs are now deployed on highly heterogeneous SoC platforms. While these platforms deliver the necessary computational performance, they also present challenges related to resource sharing and predictability. These challenges… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

  3. arXiv:2501.16245  [pdf, other

    cs.DC cs.PF eess.SY

    SP-IMPact: A Framework for Static Partitioning Interference Mitigation and Performance Analysis

    Authors: Diogo Costa, Gonçalo Moreira, Afonso Oliveira, José Martins, Sandro Pinto

    Abstract: Modern embedded systems are evolving toward complex, heterogeneous architectures to accommodate increasingly demanding applications. Driven by SWAP-C constraints, this shift has led to consolidating multiple systems onto single hardware platforms. Static Partitioning Hypervisors offer a promising solution to partition hardware resources and provide spatial isolation between critical workloads. How… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  4. arXiv:2412.15129  [pdf, other

    cs.CV cs.AI cs.LG

    Jet: A Modern Transformer-Based Normalizing Flow

    Authors: Alexander Kolesnikov, André Susano Pinto, Michael Tschannen

    Abstract: In the past, normalizing generative flows have emerged as a promising class of generative models for natural images. This type of model has many modeling advantages: the ability to efficiently compute log-likelihood of the input data, fast generation and simple overall structure. Normalizing flows remained a topic of active research but later fell out of favor, as visual quality of the samples was… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

  5. arXiv:2412.03555  [pdf, other

    cs.CV

    PaliGemma 2: A Family of Versatile VLMs for Transfer

    Authors: Andreas Steiner, André Susano Pinto, Michael Tschannen, Daniel Keysers, Xiao Wang, Yonatan Bitton, Alexey Gritsenko, Matthias Minderer, Anthony Sherbondy, Shangbang Long, Siyang Qin, Reeve Ingle, Emanuele Bugliarello, Sahar Kazemzadeh, Thomas Mesnard, Ibrahim Alabdulmohsin, Lucas Beyer, Xiaohua Zhai

    Abstract: PaliGemma 2 is an upgrade of the PaliGemma open Vision-Language Model (VLM) based on the Gemma 2 family of language models. We combine the SigLIP-So400m vision encoder that was also used by PaliGemma with the whole range of Gemma 2 models, from the 2B one all the way up to the 27B model. We train these models at three resolutions (224px, 448px, and 896px) in multiple stages to equip them with broa… ▽ More

    Submitted 4 December, 2024; originally announced December 2024.

  6. arXiv:2411.19722  [pdf, other

    cs.LG cs.AI cs.CV

    JetFormer: An Autoregressive Generative Model of Raw Images and Text

    Authors: Michael Tschannen, André Susano Pinto, Alexander Kolesnikov

    Abstract: Removing modeling constraints and unifying architectures across domains has been a key driver of the recent progress in training large multimodal models. However, most of these models still rely on many separately trained components such as modality-specific encoders and decoders. In this work, we further streamline joint generative modeling of images and text. We propose an autoregressive decoder… ▽ More

    Submitted 19 May, 2025; v1 submitted 29 November, 2024; originally announced November 2024.

    Comments: ICLR 2025. Code available at https://github.com/google-research/big_vision

  7. arXiv:2410.09839  [pdf, other

    cs.CR cs.AR

    RISC-V Needs Secure 'Wheels': the MCU Initiator-Side Perspective

    Authors: Sandro Pinto, Jose Martins, Manuel Rodriguez, Luis Cunha, Georg Schmalz, Uwe Moslehner, Kai Dieffenbach, Thomas Roecker

    Abstract: The automotive industry is experiencing a massive paradigm shift. Cars are becoming increasingly autonomous, connected, and computerized. Modern electrical/electronic (E/E) architectures are pushing for an unforeseen functionality integration density, resulting in physically separate Electronic Control Units (ECUs) becoming virtualized and mapped to logical partitions within a single physical micr… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

  8. arXiv:2408.11209  [pdf, other

    cs.SE

    Assisting Novice Developers Learning in Flutter Through Cognitive-Driven Development

    Authors: Ronivaldo Ferreira, Victor H. S. Pinto, Cleidson R. B. de Souza, Gustavo Pinto

    Abstract: Cognitive-Driven Development (CDD) is a coding design technique that helps developers focus on designing code within cognitive limits. The imposed limit tends to enhance code readability and maintainability. While early works on CDD focused mostly on Java, its applicability extends beyond specific programming languages. In this study, we explored the use of CDD in two new dimensions: focusing on F… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 10 pages

    Report number: SBES Education Track 2024

  9. arXiv:2407.07726  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    PaliGemma: A versatile 3B VLM for transfer

    Authors: Lucas Beyer, Andreas Steiner, André Susano Pinto, Alexander Kolesnikov, Xiao Wang, Daniel Salz, Maxim Neumann, Ibrahim Alabdulmohsin, Michael Tschannen, Emanuele Bugliarello, Thomas Unterthiner, Daniel Keysers, Skanda Koppula, Fangyu Liu, Adam Grycner, Alexey Gritsenko, Neil Houlsby, Manoj Kumar, Keran Rong, Julian Eisenschlos, Rishabh Kabra, Matthias Bauer, Matko Bošnjak, Xi Chen, Matthias Minderer , et al. (10 additional authors not shown)

    Abstract: PaliGemma is an open Vision-Language Model (VLM) that is based on the SigLIP-So400m vision encoder and the Gemma-2B language model. It is trained to be a versatile and broadly knowledgeable base model that is effective to transfer. It achieves strong performance on a wide variety of open-world tasks. We evaluate PaliGemma on almost 40 diverse tasks including standard VLM benchmarks, but also more… ▽ More

    Submitted 10 October, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: v2 adds Appendix H and I and a few citations

  10. arXiv:2406.03401  [pdf

    cs.CR

    CROSSCON: Cross-platform Open Security Stack for Connected Devices

    Authors: Bruno Crispo, Marco Roveri, Sandro Pinto, Tiago Gomes, Aljosa Pasic, Akos Milankovich, David Puron, Ainara Garcia, Ziga Putrle, Peter Ten, Malvina Catalano

    Abstract: The proliferation of Internet of Things (IoT) embedded devices is expected to reach 30 billion by 2030, creating a dynamic landscape where diverse devices must coexist. This presents challenges due to the rapid expansion of different architectures and platforms. Addressing these challenges requires a unifi ed solution capable of accommodating various devices while offering a broad range of service… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  11. arXiv:2404.05688  [pdf, other

    cs.LG cs.AI cs.CR

    David and Goliath: An Empirical Evaluation of Attacks and Defenses for QNNs at the Deep Edge

    Authors: Miguel Costa, Sandro Pinto

    Abstract: ML is shifting from the cloud to the edge. Edge computing reduces the surface exposing private data and enables reliable throughput guarantees in real-time applications. Of the panoply of devices deployed at the edge, resource-constrained MCUs, e.g., Arm Cortex-M, are more prevalent, orders of magnitude cheaper, and less power-hungry than application processors or GPUs. Thus, enabling intelligence… ▽ More

    Submitted 2 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    ACM Class: I.2.0

  12. arXiv:2403.19596  [pdf, other

    cs.CV

    LocCa: Visual Pretraining with Location-aware Captioners

    Authors: Bo Wan, Michael Tschannen, Yongqin Xian, Filip Pavetic, Ibrahim Alabdulmohsin, Xiao Wang, André Susano Pinto, Andreas Steiner, Lucas Beyer, Xiaohua Zhai

    Abstract: Image captioning has been shown as an effective pretraining method similar to contrastive pretraining. However, the incorporation of location-aware information into visual pretraining remains an area with limited research. In this paper, we propose a simple visual pretraining method with location-aware captioners (LocCa). LocCa uses a simple image captioner task interface, to teach a model to read… ▽ More

    Submitted 11 November, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  13. arXiv:2401.15289  [pdf, other

    cs.CR cs.AR

    SoK: Where's the "up"?! A Comprehensive (bottom-up) Study on the Security of Arm Cortex-M Systems

    Authors: Xi Tan, Zheyuan Ma, Sandro Pinto, Le Guan, Ning Zhang, Jun Xu, Zhiqiang Lin, Hongxin Hu, Ziming Zhao

    Abstract: Arm Cortex-M processors are the most widely used 32-bit microcontrollers among embedded and Internet-of-Things devices. Despite the widespread usage, there has been little effort in summarizing their hardware security features, characterizing the limitations and vulnerabilities of their hardware and software stack, and systematizing the research on securing these systems. The goals and contributio… ▽ More

    Submitted 13 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: To Appear in the 18th USENIX WOOT Conference on Offensive Technologies, August 12-13, 2024

    ACM Class: C.0; K.6.5

  14. arXiv:2401.06790  [pdf, other

    cs.CL cs.AI

    Using Zero-shot Prompting in the Automatic Creation and Expansion of Topic Taxonomies for Tagging Retail Banking Transactions

    Authors: Daniel de S. Moraes, Pedro T. C. Santos, Polyana B. da Costa, Matheus A. S. Pinto, Ivan de J. P. Pinto, Álvaro M. G. da Veiga, Sergio Colcher, Antonio J. G. Busson, Rafael H. Rocha, Rennan Gaio, Rafael Miceli, Gabriela Tourinho, Marcos Rabaioli, Leandro Santos, Fellipe Marques, David Favaro

    Abstract: This work presents an unsupervised method for automatically constructing and expanding topic taxonomies using instruction-based fine-tuned LLMs (Large Language Models). We apply topic modeling and keyword extraction techniques to create initial topic taxonomies and LLMs to post-process the resulting terms and create a hierarchy. To expand an existing taxonomy with new terms, we use zero-shot promp… ▽ More

    Submitted 11 February, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  15. A Heterogeneous RISC-V based SoC for Secure Nano-UAV Navigation

    Authors: Luca Valente, Alessandro Nadalini, Asif Veeran, Mattia Sinigaglia, Bruno Sa, Nils Wistoff, Yvan Tortorella, Simone Benatti, Rafail Psiakis, Ari Kulmala, Baker Mohammad, Sandro Pinto, Daniele Palossi, Luca Benini, Davide Rossi

    Abstract: The rapid advancement of energy-efficient parallel ultra-low-power (ULP) ucontrollers units (MCUs) is enabling the development of autonomous nano-sized unmanned aerial vehicles (nano-UAVs). These sub-10cm drones represent the next generation of unobtrusive robotic helpers and ubiquitous smart sensors. However, nano-UAVs face significant power and payload constraints while requiring advanced comput… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  16. arXiv:2312.06033  [pdf, ps, other

    cs.IT eess.SP

    Study of Multiuser Multiple-Antenna Wireless Communications Systems Based on Super-Resolution Arrays

    Authors: S. Pinto, R. C. de Lamare

    Abstract: This work studies multiple-antenna wireless communication systems based on super-resolution arrays (SRAs). We consider the uplink of a multiple-antenna system in which users communicate with a multiple-antenna base station equipped with SRAs. In particular, we develop linear minimum mean-square error (MMSE) receive filters along with linear and successive interference cancellation receivers for pr… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 3 figures, 7 pages

  17. arXiv:2311.10837  [pdf, other

    cs.SI physics.soc-ph

    Evaluating the Relationship Between News Source Sharing and Political Beliefs

    Authors: Sofía M del Pozo, Sebastián Pinto, Matteo Serafino, Federico Moss, Tomás Cicchini, Hernán A Makse, Pablo Balenzuela

    Abstract: In an era marked by an abundance of news sources, access to information significantly influences public opinion. Notably, the bias of news sources often serves as an indicator of individuals' political leanings. This study explores this hypothesis by examining the news sharing behavior of politically active social media users, whose political ideologies were identified in a previous study. Using c… ▽ More

    Submitted 15 October, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

  18. arXiv:2310.08701  [pdf, other

    cs.SI physics.soc-ph

    Analyzing User Ideologies and Shared News During the 2019 Argentinian Elections

    Authors: Sofía M del Pozo, Sebastián Pinto, Matteo Serafino, Lucio Garcia, Hernán A Makse, Pablo Balenzuela

    Abstract: The extensive data generated on social media platforms allow us to gain insights over trending topics and public opinions. Additionally, it offers a window into user behavior, including their content engagement and news sharing habits. In this study, we analyze the relationship between users' political ideologies and the news they share during Argentina's 2019 election period. Our findings reveal… ▽ More

    Submitted 25 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  19. MCU-Wide Timing Side Channels and Their Detection

    Authors: Johannes Müller, Anna Lena Duque Antón, Lucas Deutschmann, Dino Mehmedagić, Cristiano Rodrigues, Daniel Oliveira, Keerthikumara Devarajegowda, Mohammad Rahmani Fadiheh, Sandro Pinto, Dominik Stoffel, Wolfgang Kunz

    Abstract: Microarchitectural timing side channels have been thoroughly investigated as a security threat in hardware designs featuring shared buffers (e.g., caches) or parallelism between attacker and victim task execution. However, contradicting common intuitions, recent activities demonstrate that this threat is real even in microcontroller SoCs without such features. In this paper, we describe SoC-wide t… ▽ More

    Submitted 18 July, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: This version extends the work of the previous version and was accepted and presented at DAC'24

  20. arXiv:2308.00997  [pdf, other

    cs.DC cs.PF eess.SY

    IRQ Coloring and the Subtle Art of Mitigating Interrupt-generated Interference

    Authors: Diogo Costa, Luca Cuomo, Daniel Oliveira, Ida Maria Savino, Bruno Morelli, José Martins, Alessandro Biasci, Sandro Pinto

    Abstract: Integrating workloads with differing criticality levels presents a formidable challenge in achieving the stringent spatial and temporal isolation requirements imposed by safety-critical standards such as ISO26262. The shift towards high-performance multicore platforms has been posing increasing issues to the so-called mixed-criticality systems (MCS) due to the reciprocal interference created by co… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 10 pages, 9 figures, 2 tables

  21. arXiv:2303.17376  [pdf, other

    cs.CV cs.AI cs.LG

    A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision

    Authors: Lucas Beyer, Bo Wan, Gagan Madan, Filip Pavetic, Andreas Steiner, Alexander Kolesnikov, André Susano Pinto, Emanuele Bugliarello, Xiao Wang, Qihang Yu, Liang-Chieh Chen, Xiaohua Zhai

    Abstract: There has been a recent explosion of computer vision models which perform many tasks and are composed of an image encoder (usually a ViT) and an autoregressive decoder (usually a Transformer). However, most of this work simply presents one system and its results, leaving many questions regarding design decisions and trade-offs of such systems unanswered. In this work, we aim to provide such answer… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

  22. arXiv:2303.11186  [pdf, other

    cs.OS

    Shedding Light on Static Partitioning Hypervisors for Arm-based Mixed-Criticality Systems

    Authors: José Martins, Sandro Pinto

    Abstract: In this paper, we aim to understand the properties and guarantees of static partitioning hypervisors (SPH) for Arm-based mixed-criticality systems (MCS). To this end, we performed a comprehensive empirical evaluation of popular open-source SPH, i.e., Jailhouse, Xen (Dom0-less), Bao, and seL4 CAmkES VMM, focusing on two key requirements of modern MCS: real-time and safety. The goal of this study is… ▽ More

    Submitted 23 March, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  23. arXiv:2302.08242  [pdf, other

    cs.CV

    Tuning computer vision models with task rewards

    Authors: André Susano Pinto, Alexander Kolesnikov, Yuge Shi, Lucas Beyer, Xiaohua Zhai

    Abstract: Misalignment between model predictions and intended usage can be detrimental for the deployment of computer vision models. The issue is exacerbated when the task involves complex structured outputs, as it becomes harder to design procedures which address this misalignment. In natural language processing, this is often addressed using reinforcement learning techniques that align models with a task… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 11 pages

  24. arXiv:2302.02969  [pdf, other

    cs.AR

    CVA6 RISC-V Virtualization: Architecture, Microarchitecture, and Design Space Exploration

    Authors: Bruno Sá, Luca Valente, José Martins, Davide Rossi, Luca Benini, Sandro Pinto

    Abstract: Virtualization is a key technology used in a wide range of applications, from cloud computing to embedded systems. Over the last few years, mainstream computer architectures were extended with hardware virtualization support, giving rise to a set of virtualization technologies (e.g., Intel VT, Arm VE) that are now proliferating in modern processors and SoCs. In this article, we describe our work o… ▽ More

    Submitted 4 August, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  25. arXiv:2209.05572  [pdf, other

    cs.CR

    Bao-Enclave: Virtualization-based Enclaves for Arm

    Authors: Samuel Pereira, Joao Sousa, Sandro Pinto, José Martins, David Cerdeira

    Abstract: General-purpose operating systems (GPOS), such as Linux, encompass several million lines of code. Statistically, a larger code base inevitably leads to a higher number of potential vulnerabilities and inherently a more vulnerable system. To minimize the impact of vulnerabilities in GPOS, it has become common to implement security-sensitive programs outside the domain of the GPOS, i.e., in a Truste… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 6 pages, 5 figures, WF-IoT 2022

    ACM Class: D.4.6

  26. arXiv:2205.10337  [pdf, other

    cs.CV

    UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes

    Authors: Alexander Kolesnikov, André Susano Pinto, Lucas Beyer, Xiaohua Zhai, Jeremiah Harmsen, Neil Houlsby

    Abstract: We introduce UViM, a unified approach capable of modeling a wide range of computer vision tasks. In contrast to previous models, UViM has the same functional form for all tasks; it requires no task-specific modifications which require extensive human expertise. The approach involves two components: (I) a base model (feed-forward) which is trained to directly predict raw vision outputs, guided by a… ▽ More

    Submitted 14 October, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: 22 pages. Accepted at NeurIPS 2022

  27. arXiv:2203.01025  [pdf, other

    cs.CR

    ReZone: Disarming TrustZone with TEE Privilege Reduction

    Authors: David Cerdeira, José Martins, Nuno Santos, Sandro Pinto

    Abstract: In TrustZone-assisted TEEs, the trusted OS has unrestricted access to both secure and normal world memory. Unfortunately, this architectural limitation has opened an aisle of exploration for attackers, which have demonstrated how to leverage a chain of exploits to hijack the trusted OS and gain full control of the system, targeting (i) the rich execution environment (REE), (ii) all trusted applica… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

  28. arXiv:2202.12015  [pdf, other

    cs.CV cs.LG

    Learning to Merge Tokens in Vision Transformers

    Authors: Cedric Renggli, André Susano Pinto, Neil Houlsby, Basil Mustafa, Joan Puigcerver, Carlos Riquelme

    Abstract: Transformers are widely applied to solve natural language understanding and computer vision tasks. While scaling up these architectures leads to improved performance, it often comes at the expense of much higher computational costs. In order for large-scale models to remain practical in real-world systems, there is a need for reducing their computational overhead. In this work, we present the Patc… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: 11 pages, 9 figures

  29. arXiv:2112.11644  [pdf, other

    physics.soc-ph cs.SI

    Reconstructing social sensitivity from evolution of content volume in Twitter

    Authors: Sebastián Pinto, Marcos Trevisan, Pablo Balenzuela

    Abstract: We set up a simple mathematical model for the dynamics of public interest in terms of media coverage and social interactions. We test the model on a series of events related to violence in the US during 2020, using the volume of tweets and retweets as a proxy of public interest, and the volume of news as a proxy of media coverage. The model succesfully fits the data and allows inferring a measure… ▽ More

    Submitted 3 October, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

  30. arXiv:2112.07086  [pdf, ps, other

    cs.IT

    Study of Linear Precoding and Power Allocation for Large Multiple-Antenna Systems with Coarsely Quantized Signals

    Authors: S. F. Pinto, R. C. de Lamare

    Abstract: This work studies coarse quantization-aware BD (${\scriptstyle\mathrm{CQA-BD}}$) and coarse quantization-aware RBD (${\scriptstyle\mathrm{CQA-RBD}}$) precoding algorithms for large-scale MU-MIMO systems with coarsely quantized signals and proposes the coarse-quantization most advantageous allocation strategy (${\scriptstyle\mathrm{CQA-MAAS}}$) power allocation algorithm for linearly-precoded MU-MI… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 7 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2107.03969

  31. arXiv:2111.08084  [pdf, ps, other

    cs.IT

    Finding the Minimum Norm and Center Density of Cyclic Lattices via Nonlinear Systems

    Authors: William Lima da Silva Pinto, Carina Alves

    Abstract: Lattices with a circulant generator matrix represent a subclass of cyclic lattices. This subclass can be described by a basis containing a vector and its circular shifts. In this paper, we present certain conditions under which the norm expression of an arbitrary vector of this type of lattice is substantially simplified, and then investigate some of the lattices obtained under these conditions. W… ▽ More

    Submitted 5 July, 2023; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: preprint, 28 pages, 1 figure

    MSC Class: 11H31; 52C17; 15A15; 15A03; 90C30

  32. arXiv:2110.02911  [pdf, other

    cs.LG cs.CV

    Shifting Capsule Networks from the Cloud to the Deep Edge

    Authors: Miguel Costa, Diogo Costa, Tiago Gomes, Sandro Pinto

    Abstract: Capsule networks (CapsNets) are an emerging trend in image processing. In contrast to a convolutional neural network, CapsNets are not vulnerable to object deformation, as the relative spatial information of the objects is preserved across the network. However, their complexity is mainly related to the capsule structure and the dynamic routing mechanism, which makes it almost unreasonable to deplo… ▽ More

    Submitted 15 June, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    ACM Class: I.2.5

    Journal ref: ACM Trans. Intell. Syst. Technol. 13, 6, Article 105 (December 2022), 25 pages

  33. arXiv:2107.03969  [pdf, ps, other

    cs.IT

    Study of Block Diagonalization Precoding and Power Allocation for Multiple-Antenna Systems with Coarsely Quantized Signals

    Authors: S. Pinto, R. de Lamare

    Abstract: In this work, we present block diagonalization and power allocation algorithms for large-scale multiple-antenna systems with coarsely quantized signals. In particular, we develop Coarse Quantization-Aware Block Diagonalization ${\scriptstyle\mathrm{\left(CQA-BD\right)}}$ and Coarse Quantization-Aware Regularized Block Diagonalization ${\scriptstyle\mathrm{\left(CQA-RBD\right)}}$ precoding algorith… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: 9 figures, 33 pages

  34. arXiv:2107.03781  [pdf

    cs.CR cs.AR

    Towards a Trusted Execution Environment via Reconfigurable FPGA

    Authors: Sérgio Pereira, David Cerdeira, Cristiano Rodrigues, Sandro Pinto

    Abstract: Trusted Execution Environments (TEEs) are used to protect sensitive data and run secure execution for security-critical applications, by providing an environment isolated from the rest of the system. However, over the last few years, TEEs have been proven weak, as either TEEs built upon security-oriented hardware extensions (e.g., Arm TrustZone) or resorting to dedicated secure elements were explo… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  35. arXiv:2106.05974  [pdf, other

    cs.CV cs.LG stat.ML

    Scaling Vision with Sparse Mixture of Experts

    Authors: Carlos Riquelme, Joan Puigcerver, Basil Mustafa, Maxim Neumann, Rodolphe Jenatton, André Susano Pinto, Daniel Keysers, Neil Houlsby

    Abstract: Sparsely-gated Mixture of Experts networks (MoEs) have demonstrated excellent scalability in Natural Language Processing. In Computer Vision, however, almost all performant networks are "dense", that is, every input is processed by every parameter. We present a Vision MoE (V-MoE), a sparse version of the Vision Transformer, that is scalable and competitive with the largest dense networks. When app… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: 44 pages, 38 figures

  36. arXiv:2103.14951  [pdf, other

    cs.AR cs.OS

    A First Look at RISC-V Virtualization from an Embedded Systems Perspective

    Authors: Bruno Sá, José Martins, Sandro Pinto

    Abstract: This article describes the first public implementation and evaluation of the latest version of the RISC-V hypervisor extension (H-extension v0.6.1) specification in a Rocket chip core. To perform a meaningful evaluation for modern multi-core embedded and mixedcriticality systems, we have ported Bao, an open-source static partitioning hypervisor, to RISC-V. We have also extended the RISC-V platform… ▽ More

    Submitted 16 August, 2021; v1 submitted 27 March, 2021; originally announced March 2021.

  37. arXiv:2102.03625  [pdf, other

    cs.CR

    uTango: an open-source TEE for IoT devices

    Authors: Daniel Oliveira, Tiago Gomes, Sandro Pinto

    Abstract: Security is one of the main challenges of the Internet of Things (IoT). IoT devices are mainly powered by low-cost microcontrollers (MCUs) that typically lack basic hardware security mechanisms to separate security-critical applications from less critical components. Recently, Arm has started to release Cortex-M MCUs enhanced with TrustZone technology (i.e., TrustZone-M), a system-wide security so… ▽ More

    Submitted 16 February, 2022; v1 submitted 6 February, 2021; originally announced February 2021.

  38. arXiv:2011.01637  [pdf, other

    cs.SD cs.IR

    Shift If You Can: Counting and Visualising Correction Operations for Beat Tracking Evaluation

    Authors: A. Sá Pinto, I. Domingues, M. E. P. Davies

    Abstract: In this late-breaking abstract we propose a modified approach for beat tracking evaluation which poses the problem in terms of the effort required to transform a sequence of beat detections such that they maximise the well-known F-measure calculation when compared to a sequence of ground truth annotations. Central to our approach is the inclusion of a shifting operation conducted over an additiona… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: ISMIR 2020 Late Breaking/Demo

  39. arXiv:2010.06866  [pdf, other

    cs.LG cs.CV stat.ML

    Deep Ensembles for Low-Data Transfer Learning

    Authors: Basil Mustafa, Carlos Riquelme, Joan Puigcerver, André Susano Pinto, Daniel Keysers, Neil Houlsby

    Abstract: In the low-data regime, it is difficult to train good supervised models from scratch. Instead practitioners turn to pre-trained models, leveraging transfer learning. Ensembling is an empirically and theoretically appealing way to construct powerful predictive models, but the predominant approach of training multiple deep networks with different random initialisations collides with the need for tra… ▽ More

    Submitted 19 October, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

  40. arXiv:2010.06402  [pdf, other

    cs.LG cs.CV

    Which Model to Transfer? Finding the Needle in the Growing Haystack

    Authors: Cedric Renggli, André Susano Pinto, Luka Rimanic, Joan Puigcerver, Carlos Riquelme, Ce Zhang, Mario Lucic

    Abstract: Transfer learning has been recently popularized as a data-efficient alternative to training models from scratch, in particular for computer vision tasks where it provides a remarkably solid baseline. The emergence of rich model repositories, such as TensorFlow Hub, enables the practitioners and researchers to unleash the potential of these models across a wide range of downstream tasks. As these r… ▽ More

    Submitted 25 March, 2022; v1 submitted 13 October, 2020; originally announced October 2020.

  41. arXiv:2010.00332  [pdf, other

    cs.CV cs.LG

    Training general representations for remote sensing using in-domain knowledge

    Authors: Maxim Neumann, André Susano Pinto, Xiaohua Zhai, Neil Houlsby

    Abstract: Automatically finding good and general remote sensing representations allows to perform transfer learning on a wide range of applications - improving the accuracy and reducing the required number of training samples. This paper investigates development of generic remote sensing representations, and explores which characteristics are important for a dataset to be a good source for representation le… ▽ More

    Submitted 30 September, 2020; originally announced October 2020.

    Comments: Accepted at the IEEE International Geoscience and Remote Sensing Symposium (IGARSS) 2020. arXiv admin note: substantial text overlap with arXiv:1911.06721

  42. arXiv:2009.13239  [pdf, other

    cs.LG cs.CV stat.ML

    Scalable Transfer Learning with Expert Models

    Authors: Joan Puigcerver, Carlos Riquelme, Basil Mustafa, Cedric Renggli, André Susano Pinto, Sylvain Gelly, Daniel Keysers, Neil Houlsby

    Abstract: Transfer of pre-trained representations can improve sample efficiency and reduce computational requirements for new tasks. However, representations used for transfer are usually generic, and are not tailored to a particular distribution of downstream tasks. We explore the use of expert representations for transfer with a simple, yet effective, strategy. We train a diverse set of experts by exploit… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

  43. arXiv:2002.10916  [pdf, ps, other

    cs.IT eess.SP

    Study of Coarse Quantization-Aware Block Diagonalization Algorithms for MIMO Systems with Low Resolution

    Authors: S. B. Pinto, R. C. de Lamare

    Abstract: It is known that the estimated energy consumption of digital-to analog converters (DACs) is around 30\% of the energy consumed by analog-to-digital converters (ADCs) keeping fixed the sampling rate and bit resolution. Assuming that similarly to ADC, DAC dissipation doubles with every extra bit of resolution, a decrease in two resolution bits, for instance from 4 to 2 bits, represents a 75$\% $ low… ▽ More

    Submitted 22 February, 2020; originally announced February 2020.

    Comments: 3 figures, 9 pages. arXiv admin note: text overlap with arXiv:1707.00953

  44. arXiv:1911.06721  [pdf, other

    cs.CV

    In-domain representation learning for remote sensing

    Authors: Maxim Neumann, Andre Susano Pinto, Xiaohua Zhai, Neil Houlsby

    Abstract: Given the importance of remote sensing, surprisingly little attention has been paid to it by the representation learning community. To address it and to establish baselines and a common evaluation protocol in this domain, we provide simplified access to 5 diverse remote sensing datasets in a standardized form. Specifically, we investigate in-domain representation learning to develop generic remote… ▽ More

    Submitted 15 November, 2019; originally announced November 2019.

  45. arXiv:1910.04867  [pdf, other

    cs.CV cs.LG stat.ML

    A Large-scale Study of Representation Learning with the Visual Task Adaptation Benchmark

    Authors: Xiaohua Zhai, Joan Puigcerver, Alexander Kolesnikov, Pierre Ruyssen, Carlos Riquelme, Mario Lucic, Josip Djolonga, Andre Susano Pinto, Maxim Neumann, Alexey Dosovitskiy, Lucas Beyer, Olivier Bachem, Michael Tschannen, Marcin Michalski, Olivier Bousquet, Sylvain Gelly, Neil Houlsby

    Abstract: Representation learning promises to unlock deep learning for the long tail of vision tasks without expensive labelled datasets. Yet, the absence of a unified evaluation for general visual representations hinders progress. Popular protocols are often too constrained (linear classification), limited in diversity (ImageNet, CIFAR, Pascal-VOC), or only weakly related to representation quality (ELBO, r… ▽ More

    Submitted 21 February, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

  46. arXiv:1909.08095  [pdf, other

    cs.SI physics.soc-ph

    Analyzing Mass Media influence using natural language processing and time series analysis

    Authors: Federico Albanese, Sebastián Pinto, Viktoriya Semeshenko, Pablo Balenzuela

    Abstract: A key question of collective social behavior is related to the influence of Mass Media on public opinion. Different approaches have been developed to address quantitatively this issue, ranging from field experiments to mathematical models. In this work we propose a combination of tools involving natural language processing and time series analysis. We compare selected features of mass media news a… ▽ More

    Submitted 12 June, 2020; v1 submitted 6 September, 2019; originally announced September 2019.

  47. arXiv:1907.13070  [pdf, other

    cs.LG stat.ML

    Predicting assisted ventilation in Amyotrophic Lateral Sclerosis using a mixture of experts and conformal predictors

    Authors: Telma Pereira, Sofia Pires, Marta Gromicho, Susana Pinto, Mamede de Carvalho, Sara C. Madeira

    Abstract: Amyotrophic Lateral Sclerosis (ALS) is a neurodegenerative disease characterized by a rapid motor decline, leading to respiratory failure and subsequently to death. In this context, researchers have sought for models to automatically predict disease progression to assisted ventilation in ALS patients. However, the clinical translation of such models is limited by the lack of insight 1) on the risk… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Journal ref: KDD 2019 Workshop on Applied Data Science for Healthcare

  48. arXiv:1904.03510  [pdf, ps, other

    cs.IT

    Well-Rounded Lattices via Polynomials

    Authors: Carina Alves, William Lima da Silva Pinto, Antonio Aparecido de Andrade

    Abstract: Well-rounded lattices have been a topic of recent studies with applications in wiretap channels and in cryptography. A lattice of full rank in Euclidean space is called well-rounded if its set of minimal vectors spans the whole space. In this paper, we investigate when lattices coming from polynomials with integer coefficients are well-rounded.

    Submitted 6 April, 2019; originally announced April 2019.

    MSC Class: 15A03; 15A06; 15A15; 11C08; 11C20; 11H31

  49. arXiv:1812.07505  [pdf, ps, other

    eess.SP cs.IT cs.LG cs.SD eess.AS math.OC stat.ML

    Direction Finding Based on Multi-Step Knowledge-Aided Iterative Conjugate Gradient Algorithms

    Authors: S. Pinto, R. C. de Lamare

    Abstract: In this work, we present direction-of-arrival (DoA) estimation algorithms based on the Krylov subspace that effectively exploit prior knowledge of the signals that impinge on a sensor array. The proposed multi-step knowledge-aided iterative conjugate gradient (CG) (MS-KAI-CG) algorithms perform subtraction of the unwanted terms found in the estimated covariance matrix of the sensor data. Furthermo… ▽ More

    Submitted 15 December, 2018; originally announced December 2018.

    Comments: 7 figures, 11 pages

  50. arXiv:1811.08306  [pdf, ps, other

    eess.SP cs.IT

    Study of Multi-Step Knowledge-Aided Iterative Nested MUSIC for Direction Finding

    Authors: S. Pinto, R. C. de Lamare

    Abstract: In this work, we propose a subspace-based algorithm for direction-of-arrival (DOA) estimation applied to the signals impinging on a two-level nested array, referred to as multi-step knowledge-aided iterative nested MUSIC method (MS-KAI-Nested-MUSIC), which significantly improves the accuracy of the original Nested-MUSIC. Differently from existing knowledge-aided methods applied to uniform linear a… ▽ More

    Submitted 18 November, 2018; originally announced November 2018.

    Comments: 9 pages, 5 figures. arXiv admin note: text overlap with arXiv:1707.00953, arXiv:1805.00169, arXiv:1703.10523