
Showing 1–50 of 70 results for author: Rocha, L

  1. arXiv:2507.05331  [pdf, ps, other]

    cs.RO

    A Careful Examination of Large Behavior Models for Multitask Dexterous Manipulation

    Authors: TRI LBM Team, Jose Barreiros, Andrew Beaulieu, Aditya Bhat, Rick Cory, Eric Cousineau, Hongkai Dai, Ching-Hsin Fang, Kunimatsu Hashimoto, Muhammad Zubair Irshad, Masha Itkina, Naveen Kuppuswamy, Kuan-Hui Lee, Katherine Liu, Dale McConachie, Ian McMahon, Haruki Nishimura, Calder Phillips-Grafflin, Charles Richter, Paarth Shah, Krishnan Srinivasan, Blake Wulfe, Chen Xu, Mengchao Zhang, Alex Alspach, et al. (57 additional authors not shown)

    Abstract: Robot manipulation has seen tremendous progress in recent years, with imitation learning policies enabling successful performance of dexterous and hard-to-model tasks. Concurrently, scaling data and model size has led to the development of capable language and vision foundation models, motivating large-scale efforts to create general-purpose robot foundation models. While these models have garnere…

    Submitted 7 July, 2025; originally announced July 2025.

  2. arXiv:2507.03761  [pdf, ps, other]

    cs.IR

    Ranking-based Fusion Algorithms for Extreme Multi-label Text Classification (XMTC)

    Authors: Celso França, Gestefane Rabbi, Thiago Salles, Washington Cunha, Leonardo Rocha, Marcos André Gonçalves

    Abstract: In the context of Extreme Multi-label Text Classification (XMTC), where labels are assigned to text instances from a large label space, the long-tail distribution of labels presents a significant challenge. Labels can be broadly categorized into frequent, high-coverage head labels and infrequent, low-coverage tail labels, complicating the task of balancing effectiveness across al…

    Submitted 4 July, 2025; originally announced July 2025.

  3. arXiv:2506.14297  [pdf, ps, other]

    cs.SE

    Quality Assessment of Python Tests Generated by Large Language Models

    Authors: Victor Alves, Carla Bezerra, Ivan Machado, Larissa Rocha, Tássio Virgínio, Publio Silva

    Abstract: The manual generation of test scripts is a time-intensive, costly, and error-prone process, indicating the value of automated solutions. Large Language Models (LLMs) have shown great promise in this domain, leveraging their extensive knowledge to produce test code more efficiently. This study investigates the quality of Python test code generated by three LLMs: GPT-4o, Amazon Q, and LLama 3.3. We…

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: International Conference on Evaluation and Assessment in Software Engineering (EASE), 2025 edition

  4. CTDGSI: A comprehensive exploitation of instance selection methods for automatic text classification. VII Concurso de Teses, Dissertações e Trabalhos de Graduação em SI -- XXI Simpósio Brasileiro de Sistemas de Informação

    Authors: Washington Cunha, Leonardo Rocha, Marcos André Gonçalves

    Abstract: Progress in Natural Language Processing (NLP) has been dictated by the rule of more: more data, more computing power and more complexity, best exemplified by the Large Language Models. However, training (or fine-tuning) large dense models for specific applications usually requires significant amounts of computing resources. This Ph.D. dissertation focuses on an under-investigated NLP da…

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: 16 pages, 5 figures, 2 tables

  5. arXiv:2505.20185  [pdf, ps, other]

    cs.SI physics.soc-ph

    Sentiment spreads, but topics do not, in COVID-19 discussions within the Belgian Reddit community

    Authors: Tim Van Wesemael, Luis E. C. Rocha, Tijs W. Alleman, Jan M. Baetens

    Abstract: This study investigates how topics and sentiments on COVID-19 mitigation measures -- specifically lockdowns, mask mandates, and vaccinations -- spread through the Belgian Reddit community. We explore 655,642 posts created between 1 January 2020 and 30 June 2022. In line with previous studies for other countries and platforms, we find that the volume of posts on these topics can be tied to importan…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 25 pages; 9 figures; 5 tables

    MSC Class: 91C99

  6. arXiv:2505.02857  [pdf, ps, other]

    cs.CY cs.SE

    Overcoming Obstacles: Challenges of Gender Inequality in Undergraduate ICT Programs

    Authors: Angelica Pereira Souza, Anderson Uchôa, Edna Dias Canedo, Juliana Alves Pereira, Claudia Pinto Pereira, Larissa Rocha

    Abstract: Context: Gender inequality is a widely discussed issue across various sectors, including Information Technology and Communication (ICT). In Brazil, women represent less than 18% of ICT students in higher education. Prior studies highlight gender-related barriers that discourage women from staying in ICT. However, they provide limited insights into their perceptions as undergraduate students and th…

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: 8 pages. To be published in the Proceedings of the 6th ACM/IEEE Workshop on Gender Equality, Diversity, and Inclusion in Software Engineering (GE), 2025 in Ottawa, Ontario, Canada

  7. arXiv:2504.01930  [pdf, other]

    cs.CL cs.AI

    A thorough benchmark of automatic text classification: From traditional approaches to large language models

    Authors: Washington Cunha, Leonardo Rocha, Marcos André Gonçalves

    Abstract: Automatic text classification (ATC) has experienced remarkable advancements in the past decade, best exemplified by recent small and large language models (SLMs and LLMs), leveraged by Transformer architectures. Despite recent effectiveness improvements, a comprehensive cost-benefit analysis investigating whether the effectiveness gains of these recent approaches compensate their much higher costs…

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 7 pages, 2 figures, 3 tables

  8. arXiv:2502.21051  [pdf, other]

    cs.LG cs.CE

    Detection of anomalies in cow activity using wavelet transform based features

    Authors: Valentin Guien, Violaine Antoine, Romain Lardy, Isabelle Veissier, Luis E C Rocha

    Abstract: In Precision Livestock Farming, detecting deviations from optimal or baseline values - i.e. anomalies in time series - is essential to allow undertaking corrective actions rapidly. Here we aim at detecting anomalies in 24h time series of cow activity, with a view to detect cases of disease or oestrus. Deviations must be distinguished from noise which can be very high in case of biological data. It…

    Submitted 28 February, 2025; originally announced February 2025.

    Comments: 17 pages, 8 figures, 4 tables, 1 algorithm

  9. arXiv:2502.02368  [pdf]

    cs.SE cs.AI

    Evaluating the Effectiveness of LLMs in Fixing Maintainability Issues in Real-World Projects

    Authors: Henrique Nunes, Eduardo Figueiredo, Larissa Rocha, Sarah Nadi, Fischer Ferreira, Geanderson Esteves

    Abstract: Large Language Models (LLMs) have gained attention for addressing coding problems, but their effectiveness in fixing code maintainability remains unclear. This study evaluates LLMs' capability to resolve 127 maintainability issues from 10 GitHub repositories. We use zero-shot prompting for Copilot Chat and Llama 3.1, and few-shot prompting with Llama only. The LLM-generated solutions are assessed f…

    Submitted 4 February, 2025; originally announced February 2025.

  10. arXiv:2409.15142  [pdf, other]

    physics.soc-ph cs.SI

    Critical Node Detection in Temporal Social Networks, Based on Global and Semi-local Centrality Measures

    Authors: Zahra Farahi, Ali Kamandi, Rooholah Abedian, Luis Enrique Correa Rocha

    Abstract: Nodes that play strategic roles in networks are called critical or influential nodes. For example, in an epidemic, we can control the infection spread by isolating critical nodes; in marketing, we can use certain nodes as the initial spreaders aiming to reach the largest part of the network, or they can be selected for removal in targeted attacks to maximise the fragmentation of the network. In th…

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: Comments and criticisms are welcomed

  11. arXiv:2408.09629  [pdf, other]

    cs.CL

    A Strategy to Combine 1stGen Transformers and Open LLMs for Automatic Text Classification

    Authors: Claudio M. V. de Andrade, Washington Cunha, Davi Reis, Adriana Silvina Pagano, Leonardo Rocha, Marcos André Gonçalves

    Abstract: Transformer models have achieved state-of-the-art results, with Large Language Models (LLMs), an evolution of first-generation transformers (1stTR), being considered the cutting edge in several NLP tasks. However, the literature has yet to conclusively demonstrate that LLMs consistently outperform 1stTRs across all NLP tasks. This study compares three 1stTRs (BERT, RoBERTa, and BART) with two open…

    Submitted 18 August, 2024; originally announced August 2024.

    Comments: 13 pages, 3 figures, 8 tables

  12. arXiv:2408.08905  [pdf, other]

    cs.DL cs.IR cs.LG

    PATopics: An automatic framework to extract useful information from pharmaceutical patents documents

    Authors: Pablo Cecilio, Antônio Perreira, Juliana Santos Rosa Viegas, Washington Cunha, Felipe Viegas, Elisa Tuler, Fabiana Testa Moura de Carvalho Vicentini, Leonardo Rocha

    Abstract: Pharmaceutical patents play an important role by protecting the innovation from copies but also drive researchers to innovate, create new products, and promote disruptive innovations focusing on collective health. The study of patent management usually refers to an exhaustive manual search. This happens, because patent documents are complex with a lot of details regarding the claims and methodolog…

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 17 pages, 5 figures, 5 tables

  13. arXiv:2407.17284  [pdf, other]

    cs.LG cs.DB cs.IR

    A Novel Two-Step Fine-Tuning Pipeline for Cold-Start Active Learning in Text Classification Tasks

    Authors: Fabiano Belém, Washington Cunha, Celso França, Claudio Andrade, Leonardo Rocha, Marcos André Gonçalves

    Abstract: This is the first work to investigate the effectiveness of BERT-based contextual embeddings in active learning (AL) tasks on cold-start scenarios, where traditional fine-tuning is infeasible due to the absence of labeled data. Our primary contribution is the proposal of a more robust fine-tuning pipeline - DoTCAL - that diminishes the reliance on labeled data in AL using two steps: (1) fully lever…

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: 11 pages, 4 figures, 2 Tables, and 1 algorithm

  14. arXiv:2405.08784  [pdf, other]

    cs.CL cs.SI

    Refinement of an Epilepsy Dictionary through Human Annotation of Health-related posts on Instagram

    Authors: Aehong Min, Xuan Wang, Rion Brattig Correia, Jordan Rozum, Wendy R. Miller, Luis M. Rocha

    Abstract: We used a dictionary built from biomedical terminology extracted from various sources such as DrugBank, MedDRA, MedlinePlus, TCMGeneDIT, to tag more than 8 million Instagram posts by users who have mentioned an epilepsy-relevant drug at least once, between 2010 and early 2016. A random sample of 1,771 posts with 2,947 term matches was evaluated by human annotators to identify false-positives. Open…

    Submitted 14 May, 2024; originally announced May 2024.

  15. arXiv:2405.07072  [pdf, other]

    cs.SI

    Focused digital cohort selection from social media using the metric backbone of biomedical knowledge graphs

    Authors: Ziqi Guo, Jack Felag, Jordan C. Rozum, Rion Brattig Correia, Xuan Wang, Luis M. Rocha

    Abstract: Social media data allows researchers to construct large digital cohorts to study the interplay between human behavior and medical treatment. Identifying the users most relevant to a specific health problem is, however, a challenge in that social media sites vary in the generality of their discourse. To filter relevant users on any social media, we have developed a general method and tested it on ep…

    Submitted 26 May, 2025; v1 submitted 11 May, 2024; originally announced May 2024.

  16. arXiv:2405.05229  [pdf, other]

    cs.IR cs.DL

    myAURA: Personalized health library for epilepsy management via knowledge graph sparsification and visualization

    Authors: Rion Brattig Correia, Jordan C. Rozum, Leonard Cross, Jack Felag, Michael Gallant, Ziqi Guo, Bruce W. Herr II, Aehong Min, Deborah Stungis Rocha, Xuan Wang, Katy Börner, Wendy Miller, Luis M. Rocha

    Abstract: Objective: We report the development of the patient-centered myAURA application and suite of methods designed to aid epilepsy patients, caregivers, and researchers in making decisions about care and self-management. Materials and Methods: myAURA rests on the federation of an unprecedented collection of heterogeneous data resources relevant to epilepsy, such as biomedical databases, social media,…

    Submitted 10 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  17. arXiv:2403.12705  [pdf, other]

    cs.DM

    The ultrametric backbone is the union of all minimum spanning forests

    Authors: Jordan C Rozum, Luis M Rocha

    Abstract: Minimum spanning trees and forests are powerful sparsification techniques that remove cycles from weighted graphs to minimize total edge weight while preserving node connectivity. They have applications in computer science, network science, and graph theory. Despite their utility and ubiquity, they have several limitations, including that they are only defined for undirected networks, they signifi…

    Submitted 22 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 10 pages, 1 figure. Revision corrects typo in abstract

  18. arXiv:2402.06297  [pdf, other]

    cs.RO

    Dynamic Q-planning for Online UAV Path Planning in Unknown and Complex Environments

    Authors: Lidia Gianne Souza da Rocha, Kenny Anderson Queiroz Caldas, Marco Henrique Terra, Fabio Ramos, Kelen Cristiane Teixeira Vivaldini

    Abstract: Unmanned Aerial Vehicles need an online path planning capability to move in high-risk missions in unknown and complex environments to complete them safely. However, many algorithms reported in the literature may not return reliable trajectories to solve online problems in these scenarios. The Q-Learning algorithm, a Reinforcement Learning Technique, can generate trajectories in real-time and has d…

    Submitted 9 February, 2024; originally announced February 2024.

  19. arXiv:2311.14817  [pdf, ps, other]

    physics.soc-ph cs.SI

    Quantifying edge relevance for epidemic spreading via the semi-metric topology of complex networks

    Authors: David Soriano Paños, Felipe Xavier Costa, Luis M. Rocha

    Abstract: Sparsification aims at extracting a reduced core of associations that best preserves both the dynamics and topology of networks while reducing the computational cost of simulations. We show that the semi-metric topology of complex networks yields a natural and algebraically-principled sparsification that outperforms existing methods on those goals. Weighted graphs whose edges represent distances b…

    Submitted 4 June, 2025; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: 13 pages, 4 figures. Supplementary Text: 12 pages, 1 table, 9 figures

  20. arXiv:2310.14379  [pdf, other]

    cs.IR

    Can Offline Metrics Measure Explanation Goals? A Comparative Survey Analysis of Offline Explanation Metrics in Recommender Systems

    Authors: André Levi Zanon, Marcelo Garcia Manzato, Leonardo Rocha

    Abstract: Explanations in a Recommender System (RS) provide reasons for recommendations to users and can enhance transparency, persuasiveness, engagement, and trust, known as explanation goals. Evaluating the effectiveness of explanation algorithms offline remains challenging due to subjectivity. Initially, we conducted a literature review on current offline metrics, revealing that algorithms are often asses…

    Submitted 14 April, 2025; v1 submitted 22 October, 2023; originally announced October 2023.

  21. arXiv:2310.03491  [pdf, other]

    cs.IR cs.LG cs.SE

    TPDR: A Novel Two-Step Transformer-based Product and Class Description Match and Retrieval Method

    Authors: Washington Cunha, Celso França, Leonardo Rocha, Marcos André Gonçalves

    Abstract: There is a niche of companies responsible for intermediating the purchase of large batches of varied products for other companies, for which the main challenge is to perform product description standardization, i.e., matching an item described by a client with a product described in a catalog. The problem is complex since the client's product description may be: (1) potentially noisy; (2) short an…

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 10 pages, 8 figures, 5 tables

  22. arXiv:2307.15614  [pdf, other]

    physics.soc-ph cs.MA econ.GN physics.pop-ph

    Fast but multi-partisan: Bursts of communication increase opinion diversity in the temporal Deffuant model

    Authors: Fatemeh Zarei, Yerali Gandica, Luis Enrique Correa Rocha

    Abstract: Human interactions create social networks forming the backbone of societies. Individuals adjust their opinions by exchanging information through social interactions. Two recurrent questions are whether social structures promote opinion polarisation or consensus in societies and whether polarisation can be avoided, particularly on social media. In this paper, we hypothesise that not only network st…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: 9 pages, 6 figures. Comments (e.g. missing references, suggestions, ...) are welcomed

  23. arXiv:2303.16361  [pdf, other]

    q-bio.MN cs.CE

    Dynamical Modularity in Automata Models of Biochemical Networks

    Authors: Thomas Parmer, Luis M. Rocha

    Abstract: Given the large size and complexity of most biochemical regulation and signaling networks, there is a non-trivial relationship between the micro-level logic of component interactions and the observed macro-dynamics. Here we address this issue by formalizing the existing concept of pathway modules, which are sequences of state updates that are guaranteed to occur (barring outside interference) in t…

    Submitted 17 April, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: 42 pages, 7 figures; updated author information

  24. arXiv:2303.16098  [pdf, other]

    cs.CL cs.AI

    Carolina: a General Corpus of Contemporary Brazilian Portuguese with Provenance, Typology and Versioning Information

    Authors: Maria Clara Ramos Morales Crespo, Maria Lina de Souza Jeannine Rocha, Mariana Lourenço Sturzeneker, Felipe Ribas Serras, Guilherme Lamartine de Mello, Aline Silva Costa, Mayara Feliciano Palma, Renata Morais Mesquita, Raquel de Paula Guets, Mariana Marques da Silva, Marcelo Finger, Maria Clara Paixão de Sousa, Cristiane Namiuti, Vanessa Martins do Monte

    Abstract: This paper presents the first publicly available version of the Carolina Corpus and discusses its future directions. Carolina is a large open corpus of Brazilian Portuguese texts under construction using web-as-corpus methodology enhanced with provenance, typology, versioning, and text integrality. The corpus aims at being used both as a reliable source for research in Linguistics and as an import…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 14 pages, 3 figures, 1 appendix

    MSC Class: 68T50 ACM Class: I.2.7

  25. arXiv:2209.01181  [pdf, other]

    cs.SI cs.DS physics.soc-ph

    The distance backbone of directed networks

    Authors: Felipe Xavier Costa, Rion Brattig Correia, Luis M. Rocha

    Abstract: In weighted graphs the shortest path between two nodes is often reached through an indirect path, out of all possible connections, leading to structural redundancies which play key roles in the dynamics and evolution of complex networks. We have previously developed a parameter-free, algebraically-principled methodology to uncover such redundancy and reveal the distance backbone of weighted graphs…

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted at the 11th International Conference on Complex Networks and their Applications

  26. LargeNetVis: Visual Exploration of Large Temporal Networks Based on Community Taxonomies

    Authors: Claudio D. G. Linhares, Jean R. Ponciano, Diogenes S. Pedro, Luis E. C. Rocha, Agma J. M. Traina, Jorge Poco

    Abstract: Temporal (or time-evolving) networks are commonly used to model complex systems and the evolution of their components throughout time. Although these networks can be analyzed by different means, visual analytics stands out as an effective way for a pre-analysis before doing quantitative/statistical analyses to identify patterns, anomalies, and other behaviors in the data, thus leading to new insig…

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: 11 pages, 9 figures

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, 2022

  27. arXiv:2207.10924  [pdf, other]

    physics.soc-ph cs.SI

    Evolution of the public opinion on COVID-19 vaccination in Japan

    Authors: Yuri Nakayama, Yuka Takedomi, Towa Suda, Takeaki Uno, Takako Hashimoto, Masashi Toyoda, Naoki Yoshinaga, Masaru Kitsuregawa, Luis E. C. Rocha, Ryota Kobayashi

    Abstract: Vaccines are promising tools to control the spread of COVID-19. An effective vaccination campaign requires government policies and community engagement, sharing experiences for social support, and voicing concerns to vaccine safety and efficiency. The increasing use of online social platforms allows us to trace large-scale communication and infer public opinion in real-time. We collected more than…

    Submitted 22 July, 2022; originally announced July 2022.

  28. arXiv:2207.10794  [pdf, other]

    q-bio.QM cs.CV cs.LG

    Neuroimaging Feature Extraction using a Neural Network Classifier for Imaging Genetics

    Authors: Cédric Beaulac, Sidi Wu, Erin Gibson, Michelle F. Miranda, Jiguo Cao, Leno Rocha, Mirza Faisal Beg, Farouk S. Nathoo

    Abstract: A major issue in the association of genes to neuroimaging phenotypes is the high dimension of both genetic data and neuroimaging data. In this article, we tackle the latter problem with an eye toward developing solutions that are relevant for disease prediction. Supported by a vast literature on the predictive power of neural networks, our proposed solution uses neural networks to extract from neu…

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: Under review

    Journal ref: BMC Bioinformatics 24, 271 (2023)

  29. arXiv:2201.07552  [pdf, other]

    q-bio.QM cs.CY cs.SI stat.CO

    Small Cohort of Epilepsy Patients Showed Increased Activity on Facebook before Sudden Unexpected Death

    Authors: Ian B. Wood, Rion Brattig Correia, Wendy R. Miller, Luis M. Rocha

    Abstract: Sudden Unexpected Death in Epilepsy (SUDEP) remains a leading cause of death in people with epilepsy. Despite the constant risk for patients and bereavement to family members, to date the physiological mechanisms of SUDEP remain unknown. Here we explore the potential to identify putative predictive signals of SUDEP from online digital behavioral data using text and sentiment analysis. Specifically…

    Submitted 19 January, 2022; originally announced January 2022.

    Comments: Submitted to Epilepsy & Behavior

    MSC Class: 62P10 (Primary) 92D50; 68U15; 92D30 (Secondary) ACM Class: J.3; I.5.4

  30. arXiv:2107.13902  [pdf, other]

    cs.SE

    Developers perception on the severity of test smells: an empirical study

    Authors: Denivan Campos, Larissa Rocha, Ivan Machado

    Abstract: Unit testing is an essential component of the software development life-cycle. A developer could easily and quickly catch and fix software faults introduced in the source code by creating and running unit tests. Despite their importance, unit tests are subject to bad design or implementation decisions, the so-called test smells. These might decrease software systems quality from various aspects, m…

    Submitted 29 July, 2021; originally announced July 2021.

    Comments: 14 pages

  31. Autonomous Navigation System for a Delivery Drone

    Authors: Victor R. F. Miranda, Adriano M. C. Rezende, Thiago L. Rocha, Héctor Azpúrua, Luciano C. A. Pimenta, Gustavo M. Freitas

    Abstract: The use of delivery services is an increasing trend worldwide, further enhanced by the COVID pandemic. In this context, drone delivery systems are of great interest as they may allow for faster and cheaper deliveries. This paper presents a navigation system that makes feasible the delivery of parcels with autonomous drones. The system generates a path between a start and a final point and controls…

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: 12 pages, 15 figures, extended version of a paper published at the XXIII Brazilian Congress of Automatica, entitled "Desenvolvimento de um drone autônomo para tarefas de entrega de carga"

  32. arXiv:2106.06422  [pdf, other]

    cs.SE

    From Blackboard to the Office: A Look Into How Practitioners Perceive Software Testing Education

    Authors: Luana Martins, Vinicius Brito, Daniela Feitosa, Larissa Rocha, Heitor Costa, Ivan Machado

    Abstract: The teaching-learning process may require specific pedagogical approaches to establish a relationship with industry practices. Recently, some studies investigated the educators' perspectives and the undergraduate courses curriculum to identify potential weaknesses and solutions for the software testing teaching process. However, it is still unclear how the practitioners evaluate the acquisition of…

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: Preprint of the manuscript accepted for publication at EASE 2021

  33. arXiv:2105.00500  [pdf, other]

    cs.SE

    Assessing Exception Handling Testing Practices in Open-Source Libraries

    Authors: Luan P. Lima, Lincoln S. Rocha, Carla I. M. Bezerra, Matheus Paixao

    Abstract: Modern programming languages (e.g., Java and C#) provide features to separate error-handling code from regular code, seeking to enhance software comprehensibility and maintainability. Nevertheless, the way exception handling (EH) code is structured in such languages may lead to multiple, different, and complex control flows, which may affect the software testability. Previous studies have reported…

    Submitted 2 May, 2021; originally announced May 2021.

    Comments: Submitted to Empirical Software Engineering Journal

  34. arXiv:2103.04668  [pdf, other]

    cs.SI cs.DS cs.IR q-bio.QM

    The distance backbone of complex networks

    Authors: Tiago Simas, Rion Brattig Correia, Luis M. Rocha

    Abstract: Redundancy needs more precise characterization as it is a major factor in the evolution and robustness of networks of multivariate interactions. We investigate the complexity of such interactions by inferring a connection transitivity that includes all possible measures of path length for weighted graphs. The result, without breaking the graph into smaller components, is a distance backbone subgra…

    Submitted 11 May, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: To appear in the Journal of Complex Networks

    MSC Class: 05C12 (Primary) 05C22; 05C82; 91D30 (Secondary) ACM Class: G.2.2; F.2.2; I.2.4; I.2.1; J.3

    Journal ref: Journal of Complex Networks, Volume 9, Issue 6, December 2021, cnab021

  35. arXiv:2003.05613  [pdf, other]

    cs.SE

    A survey on test practitioners' awareness of test smells

    Authors: Nildo Silva Junior, Larissa Rocha, Luana Almeida Martins, Ivan Machado

    Abstract: Developing test code may be a time-consuming task that usually requires much effort and cost, especially when it is done manually. Besides, during this process, developers and testers are likely to adopt bad design choices, which may lead to the introduction of the so-called test smells in test code. Test smells are bad solutions to either implement or design test code. As the test code with test…

    Submitted 12 March, 2020; originally announced March 2020.

    Comments: 14 pages, 2 figures and 3 tables

  36. Mining social media data for biomedical signals and health-related behavior

    Authors: Rion Brattig Correia, Ian B. Wood, Johan Bollen, Luis M. Rocha

    Abstract: Social media data has been increasingly used to study biomedical and health-related phenomena. From cohort level discussions of a condition to planetary level analyses of sentiment, social media has provided scientists with unprecedented amounts of data to study human behavior and response associated with a variety of health conditions and medical treatments. Here we review recent work in mining s…

    Submitted 28 January, 2020; originally announced January 2020.

    Comments: To appear in the Annual Review of Biomedical Data Science

    ACM Class: K.4; I.7

    Journal ref: Annual Review of Biomedical Data Science, 3:1 (2020)

  37. arXiv:1811.03341  [pdf, other]

    physics.soc-ph cs.SI

    Modelling Opinion Dynamics in the Age of Algorithmic Personalisation

    Authors: Nicola Perra, Luis E C Rocha

    Abstract: Modern technology has drastically changed the way we interact and consume information. For example, online social platforms allow for seamless communication exchanges at an unprecedented scale. However, we are still bounded by cognitive and temporal constraints. Our attention is limited and extremely valuable. Algorithmic personalisation has become a standard approach to tackle the information ove…

    Submitted 8 November, 2018; originally announced November 2018.

  38. Visual-Quality-Driven Learning for Underwater Vision Enhancement

    Authors: Walysson Vital Barbosa, Henrique Grandinetti Barbosa Amaral, Thiago Lages Rocha, Erickson Rangel Nascimento

    Abstract: The image processing community has witnessed remarkable advances in enhancing and restoring images. Nevertheless, restoring the visual quality of underwater images remains a great challenge. End-to-end frameworks might fail to enhance the visual quality of underwater images since in several scenarios it is not feasible to provide the ground truth of the scene radiance. In this work, we propose a C…

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: Accepted for publication and presented in 2018 IEEE International Conference on Image Processing (ICIP)

  39. arXiv:1803.04774  [pdf, other]

    cs.OH cs.CE cs.DM eess.SY q-bio.MN q-bio.QM

    CANA: A python package for quantifying control and canalization in Boolean Networks

    Authors: Rion Brattig Correia, Alexander J. Gates, Xuan Wang, Luis M. Rocha

    Abstract: Logical models offer a simple but powerful means to understand the complex dynamics of biochemical regulation, without the need to estimate kinetic parameters. However, even simple automata components can lead to collective dynamics that are computationally intractable when aggregated into networks. In previous work we demonstrated that automata network models of biochemical regulation are highly…

    Submitted 9 May, 2018; v1 submitted 9 March, 2018; originally announced March 2018.

    Comments: Submitted to the Systems Biology section of Frontiers in Physiology

    MSC Class: 94C (Primary) 93; 92C42 (Secondary) ACM Class: G.4; I.1; J.3

    Journal ref: Frontiers in Physiology, 9:1046, 2018

  40. arXiv:1803.03571  [pdf, other]

    cs.SI cs.CY cs.IR q-bio.QM stat.ML

    City-wide Analysis of Electronic Health Records Reveals Gender and Age Biases in the Administration of Known Drug-Drug Interactions

    Authors: Rion Brattig Correia, Luciana P. de Araújo, Mauro M. Mattos, Luis M. Rocha

    Abstract: The occurrence of drug-drug-interactions (DDI) from multiple drug dispensations is a serious problem, both for individuals and health-care systems, since patients with complications due to DDI are likely to reenter the system at a costlier level. We present a large-scale longitudinal study (18 months) of the DDI phenomenon at the primary- and secondary-care level using electronic health records (E…

    Submitted 2 January, 2020; v1 submitted 9 March, 2018; originally announced March 2018.

    MSC Class: J.3; G.3 ACM Class: J.3; G.3

    Journal ref: npj Digit. Med. 2, 74 (2019)

  41. arXiv:1708.06877  [pdf, ps, other]

    cs.IT cs.AI

    The Reachability of Computer Programs

    Authors: Reginaldo I. Silva Filho, Ricardo L. Azevedo da Rocha, Camila Leite Silva, Ricardo H. Gracini Guiraldelli

    Abstract: Would it be possible to explain the emergence of new computational ideas using the computation itself? Would it be feasible to describe the discovery process of new algorithmic solutions using only mathematics? This study is the first effort to analyze the nature of such inquiry from the viewpoint of effort to find a new algorithmic solution to a given problem. We define program reachability as a…

    Submitted 22 August, 2017; originally announced August 2017.

    ACM Class: E.4

  42. arXiv:1707.03959  [pdf]

    cs.SI cs.CY q-bio.PE

    Human Sexual Cycles are Driven by Culture and Match Collective Moods

    Authors: Ian B. Wood, Pedro Leal Varela, Johan Bollen, Luis M. Rocha, Joana Gonçalves-Sá

    Abstract: It is a long-standing question whether human sexual and reproductive cycles are affected predominantly by biology or culture. The literature is mixed with respect to whether biological or cultural factors best explain the reproduction cycle phenomenon, with biological explanations dominating the argument. The biological hypothesis proposes that human reproductive cycles are an adaptation to the se…

    Submitted 27 October, 2017; v1 submitted 12 July, 2017; originally announced July 2017.

    Comments: Main Paper: 21 pages, 4 figures Supplementary Material: 66 pages, 15 figures, 13 tables

  43. arXiv:1707.02108  [pdf, other]

    physics.soc-ph cs.SI physics.data-an

    Sampling of Temporal Networks: Methods and Biases

    Authors: Luis E C Rocha, Naoki Masuda, Petter Holme

    Abstract: Temporal networks have been increasingly used to model a diversity of systems that evolve in time; for example human contact structures over which dynamic processes such as epidemics take place. A fundamental aspect of real-life networks is that they are sampled within temporal and spatial frames. Furthermore, one might wish to subsample networks to reduce their size for better visualization or to…

    Submitted 7 July, 2017; originally announced July 2017.

    Comments: 10 pages, 8 figures, comments welcome

    Journal ref: Phys. Rev. E 96, 052302 (2017)

  44. arXiv:1603.04222  [pdf, other]

    stat.ME cs.SI physics.soc-ph

    Multiple seed structure and disconnected networks in respondent-driven sampling

    Authors: Jens Malmros, Luis E. C. Rocha

    Abstract: Respondent-driven sampling (RDS) is a link-tracing sampling method that is especially suitable for sampling hidden populations. RDS combines an efficient snowball-type sampling scheme with inferential procedures that yield unbiased population estimates under some assumptions about the sampling procedure and population structure. Several seed individuals are typically used to initiate RDS recruitme…

    Submitted 14 March, 2016; originally announced March 2016.

  45. arXiv:1510.01006  [pdf, other]

    cs.SI cs.CY cs.IR q-bio.QM stat.ML

    Monitoring Potential Drug Interactions and Reactions via Network Analysis of Instagram User Timelines

    Authors: Rion Brattig Correia, Lang Li, Luis M. Rocha

    Abstract: Much recent research aims to identify evidence for Drug-Drug Interactions (DDI) and Adverse Drug reactions (ADR) from the biomedical scientific literature. In addition to this "Bibliome", the universe of social media provides a very promising source of large-scale data that can help identify DDI and ADR in ways that have not been hitherto possible. Given the large number of users, analysis of soci…

    Submitted 14 January, 2016; v1 submitted 4 October, 2015; originally announced October 2015.

    Comments: Pacific Symposium on Biocomputing. 21:492-503

  46. arXiv:1510.00217  [pdf, ps, other]

    physics.soc-ph cs.SI nlin.AO

    Temporal and structural heterogeneities emerging in adaptive temporal networks

    Authors: Takaaki Aoki, Luis E. C. Rocha, Thilo Gross

    Abstract: We introduce a model of adaptive temporal networks whose evolution is regulated by an interplay between node activity and dynamic exchange of information through links. We study the model by using a master equation approach. Starting from a homogeneous initial configuration, we show that temporal and structural heterogeneities, characteristic of real-world networks, spontaneously emerge. This theo…

    Submitted 4 April, 2016; v1 submitted 1 October, 2015; originally announced October 2015.

    Journal ref: Physical Review E 93, 040301(R) (2016)

  47. arXiv:1509.04386  [pdf, other]

    physics.soc-ph cs.SI nlin.AO physics.data-an

    Modularity and the spread of perturbations in complex dynamical systems

    Authors: Artemy Kolchinsky, Alexander J. Gates, Luis M. Rocha

    Abstract: We propose a method to decompose dynamical systems based on the idea that modules constrain the spread of perturbations. We find partitions of system variables that maximize 'perturbation modularity', defined as the autocovariance of coarse-grained perturbed trajectories. The measure effectively separates the fast intramodular from the slow intermodular dynamics of perturbation spreading (in this…

    Submitted 23 December, 2015; v1 submitted 14 September, 2015; originally announced September 2015.

    Journal ref: Physical Review E, 2015

  48. arXiv:1503.05826  [pdf, other]

    stat.AP cs.SI physics.data-an physics.soc-ph

    Respondent-driven sampling bias induced by clustering and community structure in social networks

    Authors: Luis Enrique Correa Rocha, Anna Ekeus Thorson, Renaud Lambiotte, Fredrik Liljeros

    Abstract: Sampling hidden populations is particularly challenging using standard sampling methods mainly because of the lack of a sampling frame. Respondent-driven sampling (RDS) is an alternative methodology that exploits the social contacts between peers to reach and weight individuals in these hard-to-reach populations. It is a snowball sampling procedure where the weight of the respondents is adjusted f…

    Submitted 19 March, 2015; originally announced March 2015.

    Comments: 14 pages, 11 figures

    Journal ref: J. R. Stat. Soc. A, 180: 99 (2017)

  49. arXiv:1501.03471  [pdf, other]

    cs.CY cs.SI physics.soc-ph

    Computational fact checking from knowledge networks

    Authors: Giovanni Luca Ciampaglia, Prashant Shiralkar, Luis M. Rocha, Johan Bollen, Filippo Menczer, Alessandro Flammini

    Abstract: Traditional fact checking by expert journalists cannot keep up with the enormous volume of information that is now generated online. Computational fact checking may significantly enhance our ability to evaluate the veracity of dubious information. Here we show that the complexities of human fact checking can be approximated quite well by finding the shortest path between concept nodes under proper…

    Submitted 14 January, 2015; originally announced January 2015.

  50. arXiv:1412.0744  [pdf, other]

    stat.ML cs.IR q-bio.QM

    Extraction of Pharmacokinetic Evidence of Drug-drug Interactions from the Literature

    Authors: Artemy Kolchinsky, Anália Lourenço, Heng-Yi Wu, Lang Li, Luis M. Rocha

    Abstract: Drug-drug interaction (DDI) is a major cause of morbidity and mortality and a subject of intense scientific interest. Biomedical literature mining can aid DDI research by extracting evidence for large numbers of potential interactions from published literature and clinical databases. Though DDI is investigated in domains ranging in scale from intracellular biochemistry to human populations, litera…

    Submitted 18 May, 2015; v1 submitted 1 December, 2014; originally announced December 2014.

    Comments: PLOS One (2015)

    ACM Class: H.2.8; H.3.1; J.3