Showing 1–50 of 80 results for author: Martins, P

Searching in archive cs.
  1. arXiv:2509.17495  [pdf, ps, other]

    cs.LG cs.NI

    BiLCNet : BiLSTM-Conformer Network for Encrypted Traffic Classification with 5G SA Physical Channel Records

    Authors: Ke Ma, Jialiang Lu, Philippe Martins

    Abstract: Accurate and efficient traffic classification is vital for wireless network management, especially under encrypted payloads and dynamic application behavior, where traditional methods such as port-based identification and deep packet inspection (DPI) are increasingly inadequate. This work explores the feasibility of using physical channel data collected from the air interface of 5G Standalone (SA)…

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: 6 pages, 5 figures

  2. arXiv:2507.08861  [pdf, ps, other]

    cs.LG stat.ML

    On the under-reaching phenomenon in message-passing neural PDE solvers: revisiting the CFL condition

    Authors: Lucas Tesan, Mikel M. Iparraguirre, David Gonzalez, Pedro Martins, Elias Cueto

    Abstract: This paper proposes sharp lower bounds for the number of message passing iterations required in graph neural networks (GNNs) when solving partial differential equations (PDEs). This significantly reduces the need for exhaustive hyperparameter tuning. Bounds are derived for the three fundamental classes of PDEs (hyperbolic, parabolic and elliptic) by relating the physical characteristics of the prob…

    Submitted 9 July, 2025; originally announced July 2025.

  3. arXiv:2506.04079  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    EuroLLM-9B: Technical Report

    Authors: Pedro Henrique Martins, João Alves, Patrick Fernandes, Nuno M. Guerreiro, Ricardo Rei, Amin Farajian, Mateusz Klimaszewski, Duarte M. Alves, José Pombal, Nicolas Boizard, Manuel Faysse, Pierre Colombo, François Yvon, Barry Haddow, José G. C. de Souza, Alexandra Birch, André F. T. Martins

    Abstract: This report presents EuroLLM-9B, a large language model trained from scratch to support the needs of European citizens by covering all 24 official European Union languages and 11 additional languages. EuroLLM addresses the issue of European languages being underrepresented and underserved in existing open large language models. We provide a comprehensive overview of EuroLLM-9B's development, inclu…

    Submitted 16 June, 2025; v1 submitted 4 June, 2025; originally announced June 2025.

    Comments: 56 pages

  4. arXiv:2503.18768  [pdf, other]

    cs.DB

    Transformer-based Ranking Approaches for Keyword Queries over Relational Databases

    Authors: Paulo Martins, Altigran da Silva, Johny Moreira, Edleno de Moura

    Abstract: Relational Keyword Search (R-KwS) systems enable naive/informal users to explore and retrieve information from relational databases without requiring schema knowledge or query-language proficiency. Although numerous R-KwS methods have been proposed, most still focus on queries referring only to attribute values or primarily address performance enhancements, providing limited support for queries re…

    Submitted 24 March, 2025; originally announced March 2025.

  5. arXiv:2503.15321  [pdf, ps, other]

    astro-ph.GA cs.CV

    Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images

    Authors: Euclid Collaboration, G. Stevens, S. Fotopoulou, M. N. Bremer, T. Matamoro Zatarain, K. Jahnke, B. Margalef-Bentabol, M. Huertas-Company, M. J. Smith, M. Walmsley, M. Salvato, M. Mezcua, A. Paulino-Afonso, M. Siudek, M. Talia, F. Ricci, W. Roster, N. Aghanim, B. Altieri, S. Andreon, H. Aussel, C. Baccigalupi, M. Baldi, S. Bardelli, P. Battaglia, et al. (249 additional authors not shown)

    Abstract: Light emission from galaxies exhibits diverse brightness profiles, influenced by factors such as galaxy type, structural features and interactions with other galaxies. Elliptical galaxies feature more uniform light distributions, while spiral and irregular galaxies have complex, varied light profiles due to their structural heterogeneity and star-forming activity. In addition, galaxies with an acti…

    Submitted 12 August, 2025; v1 submitted 19 March, 2025; originally announced March 2025.

    Comments: Paper submitted as part of the A&A Special Issue 'Euclid Quick Data Release (Q1)', 34 pages, 26 figures

  6. arXiv:2503.03876  [pdf]

    math.NA cs.IT

    Approximate Evaluation Method for the Probability of the Union of Independent Events

    Authors: Edson Luiz Ursini, Paulo S. Martins

    Abstract: The evaluation of the probability of union of a large number of independent events requires several combinations involving the factorial and the use of high performance computers with several hours of processing. Bounds and simplifications on the probability of the union are useful in the analysis of stochastic problems across various areas including (but not limited to) systems reliability, biolo…

    Submitted 5 March, 2025; originally announced March 2025.

    Journal ref: Revista Sodebras - ISSN 1809-3957, 2017

  7. arXiv:2502.01193  [pdf, other]

    cs.NI

    SigN: SIMBox Activity Detection Through Latency Anomalies at the Cellular Edge

    Authors: Anne Josiane Kouam, Aline Carneiro Viana, Philippe Martins, Cedric Adjih, Alain Tchana

    Abstract: Despite their widespread adoption, cellular networks face growing vulnerabilities due to their inherent complexity and the integration of advanced technologies. One of the major threats in this landscape is Voice over IP (VoIP) to GSM gateways, known as SIMBox devices. These devices use multiple SIM cards to route VoIP traffic through cellular networks, enabling international bypass fraud with los…

    Submitted 3 February, 2025; originally announced February 2025.

  8. arXiv:2412.12034  [pdf, other]

    cs.LG

    Thermodynamics-informed graph neural networks for real-time simulation of digital human twins

    Authors: Lucas Tesán, David González, Pedro Martins, Elías Cueto

    Abstract: The growing importance of real-time simulation in the medical field has exposed the limitations and bottlenecks inherent in the digital representation of complex biological systems. This paper presents a novel methodology aimed at advancing current lines of research in soft tissue simulation. The proposed approach introduces a hybrid model that integrates the geometric bias of graph neural network…

    Submitted 16 December, 2024; originally announced December 2024.

  9. arXiv:2411.08730  [pdf]

    cs.MM cs.HC

    3D Modelling to Address Pandemic Challenges: A Project-Based Learning Methodology

    Authors: Tânia Rocha, Ana Ribeiro, Joana Oliveira, Ricardo Nunes, Diana Carvalho, Hugo Paredes, Paulo Martins

    Abstract: The use of 3D modelling in medical education is a revolutionary tool during the learning process. In fact, this type of technology enables a more interactive teaching approach, making information retention more effective and enhancing students' understanding. 3D modelling allows for the creation of precise representations of the human body, as well as interaction with three-dimensional models, giv…

    Submitted 13 November, 2024; originally announced November 2024.

  10. arXiv:2409.16235  [pdf, other]

    cs.CL

    EuroLLM: Multilingual Language Models for Europe

    Authors: Pedro Henrique Martins, Patrick Fernandes, João Alves, Nuno M. Guerreiro, Ricardo Rei, Duarte M. Alves, José Pombal, Amin Farajian, Manuel Faysse, Mateusz Klimaszewski, Pierre Colombo, Barry Haddow, José G. C. de Souza, Alexandra Birch, André F. T. Martins

    Abstract: The quality of open-weight LLMs has seen significant improvement, yet they remain predominantly focused on English. In this paper, we introduce the EuroLLM project, aimed at developing a suite of open-weight multilingual LLMs capable of understanding and generating text in all official European Union languages, as well as several additional relevant languages. We outline the progress made to date,…

    Submitted 24 September, 2024; originally announced September 2024.

  11. arXiv:2408.02461  [pdf, other]

    cs.NI math.PR

    Performance analysis of a RIS-assisted communications

    Authors: Hamza Adrat, Laurent Decreusefond, Philippe Martins

    Abstract: Reconfigurable Intelligent Surfaces (RIS) are currently considered for adoption in future 6G standards. ETSI and 3GPP have started feasibility and performance investigations of such a technology. This work proposes an analytical model to analyze RIS performance. It relies on a simple street model where obstacles and mobile units are all aligned. RIS is positioned onto a building parallel to the ro…

    Submitted 5 August, 2024; originally announced August 2024.

  12. arXiv:2405.15569  [pdf, other]

    cs.AI cs.NE

    Randomized heuristic repair for large-scale multidimensional knapsack problem

    Authors: Jean P. Martins

    Abstract: The multidimensional knapsack problem (MKP) is an NP-hard combinatorial optimization problem whose solution is determining a subset of maximum total profit items that do not violate capacity constraints. Due to its hardness, large-scale MKP instances are usually a target for metaheuristics, a context in which effective feasibility maintenance strategies are crucial. In 1998, Chu and Beasley propos…

    Submitted 24 May, 2024; originally announced May 2024.

  13. arXiv:2405.01435  [pdf, other]

    cs.NI cs.LG

    Closed-form congestion control via deep symbolic regression

    Authors: Jean Martins, Igor Almeida, Ricardo Souza, Silvia Lins

    Abstract: As mobile networks embrace the 5G era, the interest in adopting Reinforcement Learning (RL) algorithms to handle challenges in ultra-low-latency and high throughput scenarios increases. Simultaneously, the advent of packetized fronthaul networks imposes demanding requirements that traditional congestion control mechanisms cannot accomplish, highlighting the potential of RL-based congestion control…

    Submitted 28 March, 2024; originally announced May 2024.

  14. Solving the Multiobjective Quasi-Clique Problem

    Authors: Daniela Scherer dos Santos, Kathrin Klamroth, Pedro Martins, Luís Paquete

    Abstract: Given a simple undirected graph $G$, a quasi-clique is a subgraph of $G$ whose density is at least $\gamma$ $(0 < \gamma \leq 1)$. Finding a maximum quasi-clique has been addressed from two different perspectives: $i)$ maximizing vertex cardinality for a given edge density; and $ii)$ maximizing edge density for a given vertex cardinality. However, when no a priori preference information about cardinality and…

    Submitted 16 March, 2024; originally announced March 2024.

    Journal ref: European Journal of Operational Research, 2025

  15. arXiv:2403.08534  [pdf, ps, other]

    cs.DM

    Ensuring connectedness for the Maximum Quasi-clique and Densest $k$-subgraph problems

    Authors: Daniela Scherer dos Santos, Kathrin Klamroth, Pedro Martins, Luís Paquete

    Abstract: Given an undirected graph $G$, a quasi-clique is a subgraph of $G$ whose density is at least $\gamma$ $(0 < \gamma \leq 1)$. Two optimization problems can be defined for quasi-cliques: the Maximum Quasi-Clique (MQC) Problem, which finds a quasi-clique with maximum vertex cardinality, and the Densest $k$-Subgraph (DKS) Problem, which finds the densest subgraph given a fixed cardinality constraint. Most existi…

    Submitted 13 March, 2024; originally announced March 2024.

  16. arXiv:2402.17733  [pdf, other]

    cs.CL

    Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

    Authors: Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Agrawal, Pierre Colombo, José G. C. de Souza, André F. T. Martins

    Abstract: While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and pa…

    Submitted 27 February, 2024; originally announced February 2024.

  17. arXiv:2402.00786  [pdf, other]

    cs.CL cs.LG

    CroissantLLM: A Truly Bilingual French-English Language Model

    Authors: Manuel Faysse, Patrick Fernandes, Nuno M. Guerreiro, António Loison, Duarte M. Alves, Caio Corro, Nicolas Boizard, João Alves, Ricardo Rei, Pedro H. Martins, Antoni Bigata Casademunt, François Yvon, André F. T. Martins, Gautier Viaud, Céline Hudelot, Pierre Colombo

    Abstract: We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware. To that end, we pioneer the approach of training an intrinsically bilingual model with a 1:1 English-to-French pretraining data ratio, a cust…

    Submitted 9 April, 2025; v1 submitted 1 February, 2024; originally announced February 2024.

  18. Boosting Mixed-Initiative Co-Creativity in Game Design: A Tutorial

    Authors: Solange Margarido, Licínio Roque, Penousal Machado, Pedro Martins

    Abstract: In recent years, there has been a growing application of mixed-initiative co-creative approaches in the creation of video games. The rapid advances in the capabilities of artificial intelligence (AI) systems further propel creative collaboration between humans and computational agents. In this tutorial, we present guidelines for researchers and practitioners to develop game design tools with a hig…

    Submitted 14 August, 2025; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 37 pages, 1 table, 19 figures; expanded introduction to section 3, subsection 4.1, and closing discussion in section 5; restructured subsection 4.2 for greater clarity

  19. arXiv:2308.08365  [pdf, other]

    eess.IV cs.CV q-bio.TO

    DeepContrast: Deep Tissue Contrast Enhancement using Synthetic Data Degradations and OOD Model Predictions

    Authors: Nuno Pimpão Martins, Yannis Kalaidzidis, Marino Zerial, Florian Jug

    Abstract: Microscopy images are crucial for life science research, allowing detailed inspection and characterization of cellular and tissue-level structures and functions. However, microscopy data are unavoidably affected by image degradations, such as noise, blur, or others. Many such degradations also contribute to a loss of image contrast, which becomes especially pronounced in deeper regions of thick sa…

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 8 pages, 7 figures, 1 table

  20. arXiv:2307.07756  [pdf, other]

    cs.LG cs.CR cs.SI

    Real-time Traffic Classification for 5G NSA Encrypted Data Flows With Physical Channel Records

    Authors: Xiao Fei, Philippe Martins, Jialiang Lu

    Abstract: The classification of fifth-generation New-Radio (5G-NR) mobile network traffic is an emerging topic in the field of telecommunications. It can be utilized for quality of service (QoS) management and dynamic resource allocation. However, traditional approaches such as Deep Packet Inspection (DPI) cannot be directly applied to encrypted data flows. Therefore, new real-time encrypted traffic classi…

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: 6 pages, 10 figures

  21. arXiv:2305.11876  [pdf, other]

    cs.HC cs.AI cs.CY

    Challenges and Trends in User Trust Discourse in AI

    Authors: Sonia Sousa, Jose Cravino, Paulo Martins

    Abstract: The Internet revolution in 1990, followed by the data-driven and information revolution, has transformed the world as we know it. Nowadays, what seemed to be, 10 to 20 years ago, a science fiction idea (i.e., machines dominating the world) is seen as possible. This revolution also brought a need for new regulatory practices where user trust and Artificial Intelligence (AI) discourse has a central rol…

    Submitted 23 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    MSC Class: 68T01

    ACM Class: H.5.2; I.2.1

    Journal ref: Multimodal Technologies and Interaction (MDPI) 2023

  22. arXiv:2305.03306  [pdf, other]

    cs.HC cs.AI cs.CY

    Human-centered trust framework: An HCI perspective

    Authors: Sonia Sousa, Jose Cravino, Paulo Martins, David Lamas

    Abstract: The rationale of this work is based on the current user trust discourse of Artificial Intelligence (AI). We aim to produce novel HCI approaches that use trust as a facilitator for the uptake (or appropriation) of current technologies. We propose a framework (HCTFrame) to guide non-experts to unlock the full potential of user trust in AI design. Results derived from a data triangulation of findings…

    Submitted 15 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Report number: 2305.03306

    MSC Class: 68T01

    ACM Class: H.5.2; I.2.1

  23. arXiv:2305.00955  [pdf, other]

    cs.CL cs.AI cs.LG

    Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation

    Authors: Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins

    Abstract: Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving mod…

    Submitted 31 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Work in Progress

  24. arXiv:2301.04225  [pdf, other]

    q-bio.MN cs.LG q-bio.CB

    Inferring Gene Regulatory Neural Networks for Bacterial Decision Making in Biofilms

    Authors: Samitha Somathilaka, Daniel P. Martins, Xu Li, Yusong Li, Sasitharan Balasubramaniam

    Abstract: Bacterial cells are sensitive to a range of external signals used to learn the environment. These incoming external signals are then processed using a Gene Regulatory Network (GRN), exhibiting similarities to modern computing algorithms. An in-depth analysis of gene expression dynamics suggests an inherited Gene Regulatory Neural Network (GRNN) behavior within the GRN that enables the cellular dec…

    Submitted 10 January, 2023; originally announced January 2023.

  25. arXiv:2209.03338  [pdf, other]

    cs.MM cs.HC cs.IR cs.SD eess.AS

    ESSYS* Sharing #UC: An Emotion-driven Audiovisual Installation

    Authors: Sérgio M. Rebelo, Mariana Seiça, Pedro Martins, João Bicker, Penousal Machado

    Abstract: We present ESSYS* Sharing #UC, an audiovisual installation artwork that reflects upon the emotional context related to the university and the city of Coimbra, based on the data shared about them on Twitter. The installation was presented in an urban art gallery of Círculo de Artes Plásticas de Coimbra during the summer and autumn of 2021. In the installation space, one may see a collection of typo…

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: Paper to be published in 2022 IEEE VIS Arts Program (VISAP 2022). For the associated supplementary materials, see https://cdv.dei.uc.pt/essys_sharing_uc/

    ACM Class: H.4.m; H.5.1; H.5.5

    Journal ref: 2022 IEEE VIS Arts Program (VISAP 2022)

  26. arXiv:2209.00099  [pdf, other]

    cs.CL

    Efficient Methods for Natural Language Processing: A Survey

    Authors: Marcos Treviso, Ji-Ung Lee, Tianchu Ji, Betty van Aken, Qingqing Cao, Manuel R. Ciosici, Michael Hassid, Kenneth Heafield, Sara Hooker, Colin Raffel, Pedro H. Martins, André F. T. Martins, Jessica Zosa Forde, Peter Milder, Edwin Simpson, Noam Slonim, Jesse Dodge, Emma Strubell, Niranjan Balasubramanian, Leon Derczynski, Iryna Gurevych, Roy Schwartz

    Abstract: Recent work in natural language processing (NLP) has yielded appealing results from scaling model parameters and training data; however, using only scale to improve performance means that resource consumption also grows. Such resources include data, time, storage, or energy, all of which are naturally limited and unevenly distributed. This motivates research into efficient methods that require few…

    Submitted 24 March, 2023; v1 submitted 31 August, 2022; originally announced September 2022.

    Comments: Accepted at TACL, pre publication version

  27. arXiv:2208.03740  [pdf, other]

    cs.LG cs.AI

    Multi-agent reinforcement learning for intent-based service assurance in cellular networks

    Authors: Satheesh K. Perepu, Jean P. Martins, Ricardo Souza S, Kaushik Dey

    Abstract: Recently, intent-based management has received considerable attention in telecom networks owing to stringent performance requirements for many of the use cases. Several approaches in the literature employ traditional closed-loop driven methods to fulfill the intents on the KPIs. However, these methods consider every closed loop independently of the others, which degrades the combined performance. Also, such…

    Submitted 26 August, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: Accepted at Globecom 2022 conference

  28. arXiv:2205.14500  [pdf, other]

    cs.LG

    Optimal Decision Diagrams for Classification

    Authors: Alexandre M. Florio, Pedro Martins, Maximilian Schiffer, Thiago Serra, Thibaut Vidal

    Abstract: Decision diagrams for classification have some notable advantages over decision trees, as their internal connections can be determined at training time and their width is not bound to grow exponentially with their depth. Accordingly, decision diagrams are usually less prone to data fragmentation in internal nodes. However, the inherent complexity of training these classifiers acted as a long-stand…

    Submitted 28 May, 2022; originally announced May 2022.

    MSC Class: 68T99

    ACM Class: I.2.6

  29. arXiv:2205.12230  [pdf, other]

    cs.CL

    Chunk-based Nearest Neighbor Machine Translation

    Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

    Abstract: Semi-parametric models, which augment generation with retrieval, have led to impressive results in language modeling and machine translation, due to their ability to retrieve fine-grained information from a datastore of examples. One of the most prominent approaches, $k$NN-MT, exhibits strong domain adaptation capabilities by retrieving tokens from domain-specific datastores \citep{khandelwal2020n…

    Submitted 7 November, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  30. arXiv:2204.12608  [pdf, other]

    cs.CL

    Efficient Machine Translation Domain Adaptation

    Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

    Abstract: Machine translation models struggle when translating out-of-domain text, which makes domain adaptation a topic of critical importance. However, most domain adaptation methods focus on fine-tuning or training the entire or part of the model on every new domain, which can be costly. On the other hand, semi-parametric models have been shown to successfully perform domain adaptation by retrieving exam…

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: Workshop Semiparametric Methods in NLP: Decoupling Logic from Knowledge

  31. arXiv:2203.05921  [pdf, other]

    cs.DB cs.IR

    Supporting Schema References in Keyword Queries over Relational Databases

    Authors: Paulo Martins, Altigran da Silva, João Cavalcanti, Edleno de Moura

    Abstract: Relational Keyword Search (R-KwS) systems enable naive/informal users to explore and retrieve information from relational databases without knowing schema details or query languages. These systems take the keywords from the input query, locate the elements of the target database that correspond to these keywords, and look for ways to "connect" these elements using information on referential integr…

    Submitted 11 March, 2022; originally announced March 2022.

    ACM Class: H.2; H.3.3

  32. arXiv:2109.00301  [pdf, other]

    cs.CL

    $\infty$-former: Infinite Memory Transformer

    Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

    Abstract: Transformers are unable to model long-term memories effectively, since the amount of computation they need to perform grows with the context length. While variations of efficient transformers have been proposed, they all have a finite memory capacity and are forced to drop old information. In this paper, we propose the $\infty$-former, which extends the vanilla transformer with an unbounded long-t…

    Submitted 25 March, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: ACL 2022

  33. arXiv:2108.07834  [pdf, other]

    eess.SP cs.ET physics.ins-det

    Applying Intelligent Reflector Surfaces for Detecting Violent Expiratory Aerosol Cloud using Terahertz Signals

    Authors: Harun Šiljak, Michael Taynnan Barros, Nathan D'Arcy, Daniel Perez Martins, Nicola Marchetti, Sasitharan Balasubramaniam

    Abstract: The recent COVID-19 pandemic has driven researchers from different fields to develop novel solutions that can improve detection and understanding of the SARS-CoV-2 virus. In this article we propose the use of Intelligent Reflector Surface (IRS) emitting terahertz signals to detect airborne respiratory aerosol clouds that are secreted from people. Our proposed approach makes use of future IRS infrastr…

    Submitted 29 July, 2022; v1 submitted 17 August, 2021; originally announced August 2021.

    Comments: 7 pages, 6 figures. This work has been submitted to the IEEE for possible publication

  34. arXiv:2107.07862  [pdf, other]

    q-bio.MN cs.ET

    A Graph-based Molecular Communications Model Analysis of the Human Gut Bacteriome

    Authors: Samitha Somathilaka, Daniel P. Martins, Wiley Barton, Orla O'Sullivan, Paul D. Cotter, Sasitharan Balasubramaniam

    Abstract: Alterations in the human gut bacteriome can be associated with human health issues, such as type-2 diabetes and cardiovascular disease. Both external and internal factors can drive changes in the composition and in the interactions of the human gut bacteriome, impacting negatively on the host cells. In this paper, we focus on the human gut bacteriome metabolism and we propose a two-layer network s…

    Submitted 16 July, 2021; originally announced July 2021.

  35. arXiv:2104.14944  [pdf, other]

    eess.SY cs.NI

    A Review on Bio-Cyber Interfaces for Intrabody Molecular Communications Systems

    Authors: Yevgeni Koucheryavy, Anastasia Yastrebova, Daniel P. Martins, Sasitharan Balasubramaniam

    Abstract: The recent advancements in bio-engineering and wireless communications systems have motivated researchers to propose novel applications for telemedicine, therapeutics and human health monitoring. For instance, through wireless medical telemetry a healthcare worker can remotely measure biological signals and control certain processes in the organism required for the maintenance of the patient's hea…

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: 16 pages, 2 tables and 2 figures

  36. arXiv:2104.07341  [pdf, other]

    cs.ET eess.SP q-bio.BM

    Microfluidic-based Bacterial Molecular Computing on a Chip

    Authors: Daniel P. Martins, Michael Taynnan Barros, Benjamin O'Sullivan, Ian Seymour, Alan O'Riordan, Lee Coffey, Joseph Sweeney, Sasitharan Balasubramaniam

    Abstract: Biocomputing systems based on engineered bacteria can lead to novel tools for environmental monitoring and detection of metabolic diseases. In this paper, we propose a Bacterial Molecular Computing on a Chip (BMCoC) using microfluidic and electrochemical sensing technologies. The computing can be flexibly integrated into the chip, but we focus on engineered bacterial AND Boolean logic gate and ON-…

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: 11 pages, 6 figures

  37. arXiv:2102.01672  [pdf, other]

    cs.CL cs.AI cs.LG

    The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

    Authors: Sebastian Gehrmann, Tosin Adewumi, Karmanya Aggarwal, Pawan Sasanka Ammanamanchi, Aremu Anuoluwapo, Antoine Bosselut, Khyathi Raghavi Chandu, Miruna Clinciu, Dipanjan Das, Kaustubh D. Dhole, Wanyu Du, Esin Durmus, Ondřej Dušek, Chris Emezue, Varun Gangal, Cristina Garbacea, Tatsunori Hashimoto, Yufang Hou, Yacine Jernite, Harsh Jhamtani, Yangfeng Ji, Shailza Jolly, Mihir Kale, Dhruv Kumar, Faisal Ladhak, et al. (31 additional authors not shown)

    Abstract: We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it…

    Submitted 1 April, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

  38. arXiv:2101.11337  [pdf, other]

    cs.SI

    Launchers and Targets in Social Networks

    Authors: Pedro Martins, Filipa Alarcão Martins

    Abstract: Influence propagation in social networks is a subject of growing interest. A relevant issue in those networks involves the identification of key influencers. These players have an important role on viral marketing strategies and message propagation, including political propaganda and fake news. In effect, an important way to fight malicious usage on social networks is to understand their propertie…

    Submitted 4 February, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: 30 pages, 6 figures

    MSC Class: 68R10; 90B18

  39. arXiv:2009.02224  [pdf, other]

    eess.SP cs.IT

    Evolving Intelligent Reflector Surface towards 6G for Public Health: Application in Airborne Virus Detection

    Authors: Harun Šiljak, Nouman Ashraf, Michael Taynnan Barros, Daniel Perez Martins, Bernard Butler, Arman Farhang, Nicola Marchetti, Sasitharan Balasubramaniam

    Abstract: While metasurface based intelligent reflecting surfaces (IRS) are an important emerging technology for future generations of wireless connectivity in its own right, the plans for the mass deployment of these surfaces motivate the question of their integration with other new and emerging technologies that would require mass proliferation. This question of integration and the vision of future commun…

    Submitted 4 September, 2020; originally announced September 2020.

    Comments: This work has been submitted to the IEEE for possible publication

  40. arXiv:2004.02644  [pdf, other]

    cs.CL

    Sparse Text Generation

    Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

    Abstract: Current state-of-the-art text generators build on powerful language models such as GPT-2, achieving impressive performance. However, to avoid degenerate text, they require sampling from a modified softmax, via temperature parameters or ad-hoc truncation techniques, as in top-$k$ or nucleus sampling. This creates a mismatch between training and testing conditions. In this paper, we use the recently…

    Submitted 5 October, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

  41. arXiv:2002.05556  [pdf, other

    cs.CL cs.CV

    Sparse and Structured Visual Attention

    Authors: Pedro Henrique Martins, Vlad Niculae, Zita Marinho, André Martins

    Abstract: Visual attention mechanisms are widely used in multimodal tasks, as visual question answering (VQA). One drawback of softmax-based attention mechanisms is that they assign some probability mass to all image regions, regardless of their adjacency structure and of their relevance to the text. In this paper, to better link the image structure with the text, we replace the traditional softmax attentio…

    Submitted 8 July, 2021; v1 submitted 13 February, 2020; originally announced February 2020.

  42. arXiv:2002.05162  [pdf, other

    cs.NI cs.CL

    A Combined Stochastic and Physical Framework for Modeling Indoor 5G Millimeter Wave Propagation

    Authors: Georges Nassif, Catherine Gloaguen, Philippe Martins

    Abstract: Indoor coverage is a major challenge for 5G millimeter waves (mmWaves). In this paper, we address this problem through a novel theoretical framework that combines stochastic indoor environment modeling with advanced physical propagation simulation. This approach is particularly adapted to investigate indoor-to-indoor 5G mmWave propagation. Its system implementation, so-called iGeoStat, generates p…

    Submitted 17 February, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: 30 pages, 18 figures, and 7 tables

  43. arXiv:1909.04425  [pdf

    cs.SD cs.LG eess.AS

    Automatic detection of estuarine dolphin whistles in spectrogram images

    Authors: O. M. Serra, F. P. R. Martins, L. R. Padovese

    Abstract: An algorithm for detecting tonal vocalizations from estuarine dolphin (Sotalia guianensis) specimens without interference of a human operator is developed. The raw audio data collected from a passive monitoring sensor in the Cananéia underwater soundscape is converted to spectrogram images, containing the desired acoustic event (whistle) as a linear pattern in the images. Detection is a four-step…

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: 10 pages; 18 figures

  44. arXiv:1907.08243  [pdf, other

    cs.CL

    Joint Learning of Named Entity Recognition and Entity Linking

    Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

    Abstract: Named entity recognition (NER) and entity linking (EL) are two fundamentally related tasks, since in order to perform EL, first the mentions to entities have to be detected. However, most entity linking approaches disregard the mention detection part, assuming that the correct mentions have been previously detected. In this paper, we perform joint learning of NER and EL to leverage their relatedne…

    Submitted 18 July, 2019; originally announced July 2019.

  45. arXiv:1901.00375  [pdf, other

    cs.NI cs.PF

    Computing the $k$-coverage of a wireless network

    Authors: Anaïs Vergne, Laurent Decreusefond, Philippe Martins

    Abstract: Coverage is one of the main quality of service of a wireless network. $k$-coverage, that is to be covered simultaneously by $k$ network nodes, is synonym of reliability and numerous applications such as multiple site MIMO features, or handovers. We introduce here a new algorithm for computing the $k$-coverage of a wireless network. Our method is based on the observation that $k$-coverage can be interpr…

    Submitted 29 December, 2018; originally announced January 2019.

    Comments: Valuetools 2019, Mar 2019, Palma de Mallorca, Spain. 2019. arXiv admin note: text overlap with arXiv:1802.08442

  46. Towards Automating Precision Studies of Clone Detectors

    Authors: Vaibhav Saini, Farima Farmahinifarahani, Yadong Lu, Di Yang, Pedro Martins, Hitesh Sajnani, Pierre Baldi, Cristina Lopes

    Abstract: Current research in clone detection suffers from poor ecosystems for evaluating precision of clone detection tools. Corpora of labeled clones are scarce and incomplete, making evaluation labor intensive and idiosyncratic, and limiting inter tool comparison. Precision-assessment tools are simply lacking. We present a semi-automated approach to facilitate precision studies of clone detection tools.…

    Submitted 13 December, 2018; v1 submitted 12 December, 2018; originally announced December 2018.

    Comments: Accepted to be published in the 41st ACM/IEEE International Conference on Software Engineering

    ACM Class: D.2.13

    Journal ref: Proceeding 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE)

  47. arXiv:1811.11316  [pdf

    cs.HC

    Using Computer Vision Techniques for Moving Poster Design

    Authors: Sérgio Rebelo, Pedro Martins, João Bicker, Penousal Machado

    Abstract: Graphic Design encompasses a wide range of activities from the design of traditional print media (e.g., books and posters) to site-specific (e.g., signage systems) and electronic media (e.g., interfaces). Its practice always explores the new possibilities of information and communication technologies. Therefore, interactivity and participation have become key features in the design process. Even i…

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: This paper will be published in the sixth International Conference Ergotrip Design 29-30 November 2017, Aveiro, Portugal

    Journal ref: REBELO, Sérgio et al. - Using Computer Vision Techniques for Moving Poster Design. In Proceedings of sixth Ergotrip Design (ETD 17). Aveiro, Portugal : Universidade de Aveiro, 2017

  48. arXiv:1807.03053  [pdf, other

    cs.CL cs.RO

    A deep learning approach for understanding natural language commands for mobile service robots

    Authors: Pedro Henrique Martins, Luís Custódio, Rodrigo Ventura

    Abstract: Using natural language to give instructions to robots is challenging, since natural language understanding is still largely an open problem. In this paper we address this problem by restricting our attention to commands modeled as one action, plus arguments (also known as slots). For action detection (also called intent detection) and slot filling various architectures of Recurrent Neural Networks…

    Submitted 9 July, 2018; originally announced July 2018.

  49. arXiv:1806.11431  [pdf, other

    cs.OS

    Integrating Proactive Mode Changes in Mixed Criticality Systems

    Authors: Flavio R Massaro Jr., Paulo S. Martins, Edson L. Ursini

    Abstract: In this work, we propose to integrate prediction algorithms to the scheduling of mode changes under the Earliest-Deadline-First and Fixed-priority scheduling in mixed-criticality real-time systems. The method proactively schedules a mode change in the system based on state variables such as laxity, to the percentage difference in the temporal distance between the completion time of the instance of…

    Submitted 29 June, 2018; originally announced June 2018.

  50. arXiv:1804.04621  [pdf, other

    cs.SE

    The Java Build Framework: Large Scale Compilation

    Authors: Pedro Martins, Rohan Achar, Cristina V. Lopes

    Abstract: Large repositories of source code for research tend to limit their utility to static analysis of the code, as they give no guarantees on whether the projects are compilable, much less runnable in any way. The immediate consequence of the lack of large compilable and runnable datasets is that research that requires such properties does not generalize beyond small benchmarks. We present the Java Bui…

    Submitted 12 April, 2018; originally announced April 2018.