
Showing 1–50 of 50 results for author: Lopes, C

  1. arXiv:2506.21512  [pdf, ps, other]

    cs.NE

    Assessing an evolutionary search engine for small language models, prompts, and evaluation metrics

    Authors: Cláudio Lúcio do Val Lopes, Lucca Machado

    Abstract: The concurrent optimization of language models and instructional prompts presents a significant challenge for deploying efficient and effective AI systems, particularly when balancing performance against computational costs like token usage. This paper introduces and assesses a bi-objective evolutionary search engine designed to navigate this complex space, focusing specifically on Small Language…

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 14 pages, 1 figure, 1 table

  2. arXiv:2506.14409  [pdf, ps, other]

    cs.SE

    Defining the Game Producer: A Mapping of Key Characteristics and Differentiators of the Professional Behind Digital Game Production

    Authors: Rafael C. Lopes, Danilo M. Ribeiro

    Abstract: Introduction: As digital games grow in complexity, the role of the Game Producer becomes increasingly relevant for aligning creative, technical, and business dimensions. Objective: This study aimed to identify and map the main characteristics, skills, and competencies that define the Digital Game Producer profile. Methodology: A qualitative investigation was conducted with 11 semi-structured inter…

    Submitted 17 June, 2025; originally announced June 2025.

  3. arXiv:2504.12549  [pdf, other]

    cs.CL cs.AI cs.LG

    Memorization: A Close Look at Books

    Authors: Iris Ma, Ian Domingo, Alberto Krone-Martins, Pierre Baldi, Cristina V. Lopes

    Abstract: To what extent can entire books be extracted from LLMs? Using the Llama 3 70B family of models, and the "prefix-prompting" extraction technique, we were able to auto-regressively reconstruct, with a very high level of similarity, one entire book (Alice's Adventures in Wonderland) from just the first 500 tokens. We were also able to obtain high extraction rates on several other books, piece-wise. H…

    Submitted 16 April, 2025; originally announced April 2025.

  4. arXiv:2407.15718  [pdf, other]

    cs.CY cs.AI cs.HC cs.IR cs.SE

    Integrating AI Tutors in a Programming Course

    Authors: Iris Ma, Alberto Krone Martins, Cristina Videira Lopes

    Abstract: RAGMan is an LLM-powered tutoring system that can support a variety of course-specific and homework-specific AI tutors. RAGMan leverages Retrieval Augmented Generation (RAG), as well as strict instructions, to ensure the alignment of the AI tutors' responses. By using RAGMan's AI tutors, students receive assistance with their specific homework assignments without directly obtaining solutions, whil…

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: Accepted at SIGCSE Virtual 2024

  5. Single color digital H&E staining with In-and-Out Net

    Authors: Mengkun Chen, Yen-Tung Liu, Fadeel Sher Khan, Matthew C. Fox, Jason S. Reichenberg, Fabiana C. P. S. Lopes, Katherine R. Sebastian, Mia K. Markey, James W. Tunnell

    Abstract: Virtual staining streamlines traditional staining procedures by digitally generating stained images from unstained or differently stained images. While conventional staining methods involve time-consuming chemical processes, virtual staining offers an efficient and low infrastructure alternative. Leveraging microscopy-based techniques, such as confocal microscopy, researchers can expedite tissue a…

    Submitted 22 November, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Journal ref: Computerized Medical Imaging and Graphics, vol. 118, 102468 (2024), ISSN 0895-6111

  6. arXiv:2402.00247  [pdf, other]

    cs.SE cs.PL

    Towards AI-Assisted Synthesis of Verified Dafny Methods

    Authors: Md Rakib Hossain Misu, Cristina V. Lopes, Iris Ma, James Noble

    Abstract: Large language models show great promise in many domains, including programming. A promise is easy to make but hard to keep, and language models often fail to keep their promises, generating erroneous code. A promising avenue to keep models honest is to incorporate formal verification: generating programs' specifications as well as code so that the code can be proved correct with respect to the sp…

    Submitted 10 June, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

    Comments: This is an author provided preprint. The final version will be published at Proc. ACM Softw. Eng; FSE 2024, in July 2024

  7. arXiv:2401.17622  [pdf, other]

    cs.SE

    Commit Messages in the Age of Large Language Models

    Authors: Cristina V. Lopes, Vanessa I. Klotzman, Iris Ma, Iftekar Ahmed

    Abstract: Commit messages are explanations of changes made to a codebase that are stored in version control systems. They help developers understand the codebase as it evolves. However, writing commit messages can be tedious and inconsistent among developers. To address this issue, researchers have tried using different methods to automatically generate commit messages, including rule-based, retrieval-based…

    Submitted 1 February, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: Submitted to FSE 23 on Feb 6 2023

  8. arXiv:2311.15740  [pdf, other]

    cs.CV cs.DL

    Optimization of Image Processing Algorithms for Character Recognition in Cultural Typewritten Documents

    Authors: Mariana Dias, Carla Teixeira Lopes

    Abstract: Linked Data is used in various fields as a new way of structuring and connecting data. Cultural heritage institutions have been using linked data to improve archival descriptions and facilitate the discovery of information. Most archival records have digital representations of physical artifacts in the form of scanned images that are non-machine-readable. Optical Character Recognition (OCR) recogn…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 25 pages, 4 figures

    Journal ref: J. Comput. Cult. Herit. 16, 4, Article 77 (December 2023), 25 pages

  9. Moving from ISAD(G) to a CIDOC CRM-based Linked Data Model in the Portuguese Archives

    Authors: Inês Koch, Carla Teixeira Lopes, Cristina Ribeiro

    Abstract: Archives are facing numerous challenges. On the one hand, archival assets are evolving to encompass digitized documents and increasing quantities of born-digital information in diverse formats. On the other hand, the audience is changing along with how it wishes to access archival material. Moreover, the interoperability requirements of cultural heritage repositories are growing. In this context,…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Inês Koch, Carla Teixeira Lopes, and Cristina Ribeiro. 2023. Moving from ISAD(G) to a CIDOC CRM-based Linked Data Model in the Portuguese Archives. J. Comput. Cult. Herit. Just Accepted (September 2023). https://doi.org/10.1145/3605910

  10. arXiv:2310.02235  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG cs.SI

    Automatic Quality Assessment of Wikipedia Articles -- A Systematic Literature Review

    Authors: Pedro Miguel Moás, Carla Teixeira Lopes

    Abstract: Wikipedia is the world's largest online encyclopedia, but maintaining article quality through collaboration is challenging. Wikipedia designed a quality scale, but with such a manual assessment process, many articles remain unassessed. We review existing methods for automatically measuring the quality of Wikipedia articles, identifying and comparing machine learning algorithms, article features, q…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 37 pages, 10 figures, just accepted in ACM Computing Surveys (September 2023). This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ACM Computing Surveys, https://dx.doi.org/10.1145/3625286

  11. arXiv:2306.16259  [pdf]

    cs.AR

    Exploration and Analysis of Combinations of Hamming Codes in 32-bit Memories

    Authors: David Freitas, David Mota, Clailton Lopes, Daniel Simões, Jarbas Silveira, João Mota, César Marcon

    Abstract: Reducing the threshold voltage of electronic devices increases their sensitivity to electromagnetic radiation dramatically, increasing the probability of changing the memory cells' content. Designers mitigate failures using techniques such as Error Correction Codes (ECCs) to maintain information integrity. Although there are several studies of ECC usage in spatial application memories, there is st…

    Submitted 13 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

  12. Using Large Language Models to Generate JUnit Tests: An Empirical Study

    Authors: Mohammed Latif Siddiq, Joanna C. S. Santos, Ridwanul Hasan Tanvir, Noshin Ulfat, Fahmid Al Rifat, Vinicius Carvalho Lopes

    Abstract: A code generation model generates code by taking a prompt from a code comment, existing code, or a combination of both. Although code generation models (e.g., GitHub Copilot) are increasingly being adopted in practice, it is unclear whether they can successfully be used for unit test generation without fine-tuning for a strongly typed language like Java. To fill this gap, we investigated how well…

    Submitted 8 March, 2024; v1 submitted 30 April, 2023; originally announced May 2023.

    Comments: Accepted in Research Track of The 28th International Conference on Evaluation and Assessment in Software Engineering (EASE 2024)

    Journal ref: The 28th International Conference on Evaluation and Assessment in Software Engineering (EASE), 2024, 313-322

  13. Improving the Quality of Commit Messages in Students' Projects

    Authors: Iris Ma, Cristina V. Lopes

    Abstract: Commit messages play a crucial role in collaborative software development. They provide a clear and concise description of the changes made to the source code. However, many commit messages among students' projects lack useful information. This is a concern, as low-quality commit messages can negatively impact communication of software development and future maintenance. To address this issue, thi…

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted at ICSE SEENG Workshop 2023

  14. Black Boxes, White Noise: Similarity Detection for Neural Functions

    Authors: Farima Farmahinifarahani, Cristina V. Lopes

    Abstract: Similarity, or clone, detection has important applications in copyright violation, software theft, code search, and the detection of malicious components. There is now a good number of open source and proprietary clone detectors for programs written in traditional programming languages. However, the increasing adoption of deep learning models in software poses a challenge to these tools: these mod…

    Submitted 20 February, 2023; originally announced February 2023.

    Journal ref: The Art, Science, and Engineering of Programming, 2023, Vol. 7, Issue 3, Article 12

  15. The Evolution of Web Search User Interfaces -- An Archaeological Analysis of Google Search Engine Result Pages

    Authors: B. Oliveira, C. T. Lopes

    Abstract: Web search engines have marked everyone's life by transforming how one searches and accesses information. Search engines give special attention to the user interface, especially search engine result pages (SERP). The well-known "10 blue links" list has evolved into richer interfaces, often personalized to the search query, the user, and other aspects. More than 20 years later, the literature has…

    Submitted 20 January, 2023; originally announced January 2023.

    Comments: 10 pages, Full Paper of CHIIR 2023

    MSC Class: 68P20

  16. From 10 Blue Links Pages to Feature-Full Search Engine Results Pages -- Analysis of the Temporal Evolution of SERP Features

    Authors: B. Oliveira, C. T. Lopes

    Abstract: Web Search Engine Results Pages (SERP) are one of the most well-known and used web pages. These pages have started as simple "10 blue links" pages, but the information in SERP currently goes way beyond these links. Several features have been included in these pages to complement organic and sponsored results and attempt to provide answers to the query instead of just pointing to websites that mi…

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: 8 pages, CHIIR 2023 Conference Short Paper

    MSC Class: 68P20

  17. arXiv:2205.09083  [pdf, ps, other]

    cs.IR

    Health Information Retrieval -- State of the art report

    Authors: Carla Teixeira Lopes

    Abstract: This report provides an overview of the field of Information Retrieval (IR) in healthcare. It does not aim to introduce general concepts and theories of IR but to present and describe specific aspects of Health Information Retrieval (HIR). After a brief introduction to the more broader field of IR, the significance of HIR at current times is discussed. Specific characteristics of Health Informatio…

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: 38 pages, 0 figures

    ACM Class: H.3.3

  18. arXiv:2106.08452  [pdf, other]

    cs.AI

    Deep Neural Networks for Approximating Stream Reasoning with C-SPARQL

    Authors: Ricardo Ferreira, Carolina Lopes, Ricardo Gonçalves, Matthias Knorr, Ludwig Krippahl, João Leite

    Abstract: The amount of information produced, whether by newspapers, blogs and social networks, or by monitoring systems, is increasing rapidly. Processing all this data in real-time, while taking into consideration advanced knowledge about the problem domain, is challenging, but required in scenarios where assessing potential risks in a timely fashion is critical. C-SPARQL, a language for continuous querie…

    Submitted 16 July, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted at the 20th EPIA Conference on Artificial Intelligence, EPIA 2021; update on previous version - data on optimizer and loss added for CNNs in the appendix

  19. arXiv:2103.14100  [pdf, other]

    cs.SE

    Expanding Frontiers: Settling an Understanding of Systems-of-Information Systems

    Authors: Valdemar Vicente Graciano Neto, Bruno Gabriel Araújo Lebtag, Paulo Gabriel Teixeira, Priscilla Batista, Vinícius Carvalho Lopes, Jamal El-Hachem, Jérémy Buisson, Flavio Oquendo, Juliana Fernandes, Francisco Ferreira, Rodrigo Peireira dos Santos, Davi Viana, Everton Cavalcante, Mohamad Kassab, Ahmad Mohsin, Roberto Oliveira, Vânia Neves, Maria Istela Cagnin, Elisa Yumi Nakagawa

    Abstract: System-of-Systems (SoS) has consolidated itself as a special type of software-intensive systems. As such, subtypes of SoS have also emerged, such as Cyber-Physical SoS (CPSoS) that are formed essentially of cyber-physical constituent systems and Systems-of-Information Systems (SoIS) that contain information systems as their constituents. In contrast to CPSoS that have been investigated and covered…

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: 6 pages, 2 figures, 28 references

  20. Analyzing Dominance Move (MIP-DoM) Indicator for Multi- and Many-objective Optimization

    Authors: Claudio Lucio do Val Lopes, Flávio Vinícius Cruzeiro Martins, Elizabeth Fialho Wanner, Kalyanmoy Deb

    Abstract: Dominance move (DoM) is a binary quality indicator that can be used in multi-objective and many-objective optimization to compare two solution sets obtained from different algorithms. The DoM indicator can differentiate the sets for certain important features, such as convergence, spread, uniformity, and cardinality. DoM does not use any reference, and it has an intuitive and physical meaning, sim…

    Submitted 5 February, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: 15 Pages. Submitted for consideration for publication in the IEEE Transactions on Evolutionary Computation

    Journal ref: IEEE Transactions on Evolutionary Computation 2021

  21. arXiv:2003.05377  [pdf, other]

    cs.CL cs.IR cs.LG stat.ML

    Brazilian Lyrics-Based Music Genre Classification Using a BLSTM Network

    Authors: Raul de Araújo Lima, Rômulo César Costa de Sousa, Simone Diniz Junqueira Barbosa, Hélio Cortês Vieira Lopes

    Abstract: Organize songs, albums, and artists in groups with shared similarity could be done with the help of genre labels. In this paper, we present a novel approach for automatic classifying musical genre in Brazilian music using only the song lyrics. This kind of classification remains a challenge in the field of Natural Language Processing. We construct a dataset of 138,368 Brazilian song lyrics distrib…

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: 7 pages, 4 figures, 3 tables

    MSC Class: 68T50 (Primary); 68T05 (Secondary)

    ACM Class: I.2.7; I.2.6

  22. arXiv:2002.10842  [pdf, ps, other]

    cs.NE

    An Assignment Problem Formulation for Dominance Move Indicator

    Authors: Claudio Lucio do Val Lopes, Flávio Vinícius Cruzeiro Martins, Elizabeth F. Wanner

    Abstract: Dominance move (DoM) is a binary quality indicator to compare solution sets in multiobjective optimization. The indicator allows a more natural and intuitive relation when comparing solution sets. It is Pareto compliant and does not demand any parameters or reference sets. In spite of its advantages, the combinatorial calculation nature is a limitation. The original formulation presents an efficie…

    Submitted 14 May, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: arXiv admin note: text overlap with arXiv:2001.03657

  23. arXiv:2001.03657  [pdf, ps, other]

    math.OC cs.NE

    Dominance Move calculation using a MIP approach for comparison of multi and many-objective optimization solution sets

    Authors: Claudio Lucio do Val Lopes, Flávio Vinícius Cruzeiro Martins, Elizabeth Fialho Wanner

    Abstract: Dominance move (DoM) is a binary quality indicator that can be used in multiobjective optimization. It can compare solution sets while representing some important features such as convergence, spread, uniformity, and cardinality. DoM has an intuitive concept and considers the minimum move of one set needed to weakly Pareto dominate the other set. Despite the aforementioned properties, DoM is hard…

    Submitted 10 January, 2020; originally announced January 2020.

    Comments: 23 pages, 5 figures

  24. arXiv:1912.04451  [pdf, other]

    cs.MA

    ColosseumRL: A Framework for Multiagent Reinforcement Learning in $N$-Player Games

    Authors: Alexander Shmakov, John Lanier, Stephen McAleer, Rohan Achar, Cristina Lopes, Pierre Baldi

    Abstract: Much of recent success in multiagent reinforcement learning has been in two-player zero-sum games. In these games, algorithms such as fictitious self-play and minimax tree search can converge to an approximate Nash equilibrium. While playing a Nash equilibrium strategy in a two-player zero-sum game is optimal, in an $n$-player general sum game, it becomes a much less informative solution concept.…

    Submitted 9 December, 2019; originally announced December 2019.

    Comments: Accepted for the 2020 AAAI Spring Symposium, Challenges and Opportunities for Multi-Agent Reinforcement Learning. Source code available at https://github.com/colosseumrl/colosseumrl/

  25. arXiv:1909.03167  [pdf, other]

    cs.DC cs.SE

    GoTcha: An Interactive Debugger for GoT-Based Distributed Systems

    Authors: Rohan Achar, Pritha Dawn, Cristina V. Lopes

    Abstract: Debugging distributed systems is hard. Most of the techniques that have been developed for debugging such systems use either extensive model checking, or postmortem analysis of logs and traces. Interactive debugging is typically a tool that is only effective in single threaded and single process applications, and is rarely applied to distributed systems. While the live observation of state changes…

    Submitted 6 September, 2019; originally announced September 2019.

  26. arXiv:1905.12111  [pdf, other]

    cs.SE

    Analyzing and Supporting Adaptation of Online Code Examples

    Authors: Tianyi Zhang, Di Yang, Cristina Videira Lopes, Miryung Kim

    Abstract: Developers often resort to online Q&A forums such as Stack Overflow (SO) for filling their programming needs. Although code examples on those forums are good starting points, they are often incomplete and inadequate for developers' local program contexts; adaptation of those examples is necessary to integrate them to production code. As a consequence, the process of adapting online code examples i…

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: This paper will appear at ICSE 2019

  27. arXiv:1904.06584  [pdf, other]

    cs.PL cs.DC

    Got: Git, but for Objects

    Authors: Rohan Achar, Cristina V. Lopes

    Abstract: We look at one important category of distributed applications characterized by the existence of multiple collaborating, and competing, components sharing mutable, long-lived, replicated objects. The problem addressed by our work is that of object state synchronization among the components. As an organizing principle for replicated objects, we formally specify the Global Object Tracker (GoT) model,…

    Submitted 13 April, 2019; originally announced April 2019.

  28. Towards Automating Precision Studies of Clone Detectors

    Authors: Vaibhav Saini, Farima Farmahinifarahani, Yadong Lu, Di Yang, Pedro Martins, Hitesh Sajnani, Pierre Baldi, Cristina Lopes

    Abstract: Current research in clone detection suffers from poor ecosystems for evaluating precision of clone detection tools. Corpora of labeled clones are scarce and incomplete, making evaluation labor intensive and idiosyncratic, and limiting inter tool comparison. Precision-assessment tools are simply lacking. We present a semi-automated approach to facilitate precision studies of clone detection tools.…

    Submitted 13 December, 2018; v1 submitted 12 December, 2018; originally announced December 2018.

    Comments: Accepted to be published in the 41st ACM/IEEE International Conference on Software Engineering

    ACM Class: D.2.13

    Journal ref: Proceeding 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE)

  29. arXiv:1811.05624  [pdf, other]

    cs.SI cs.MA

    Multi-Winner Contests for Strategic Diffusion in Social Networks

    Authors: Wen Shen, Yang Feng, Cristina V. Lopes

    Abstract: Strategic diffusion encourages participants to take active roles in promoting stakeholders' agendas by rewarding successful referrals. As social media continues to transform the way people communicate, strategic diffusion has become a powerful tool for stakeholders to influence people's decisions or behaviors for desired objectives. Existing reward mechanisms for strategic diffusion are usually ei…

    Submitted 10 March, 2019; v1 submitted 13 November, 2018; originally announced November 2018.

    Comments: 9 pages, 6 figures, In Proceedings of The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19)

  30. Oreo: Detection of Clones in the Twilight Zone

    Authors: Vaibhav Saini, Farima Farmahinifarahani, Yadong Lu, Pierre Baldi, Cristina Lopes

    Abstract: Source code clones are categorized into four types of increasing difficulty of detection, ranging from purely textual (Type-1) to purely semantic (Type-4). Most clone detectors reported in the literature work well up to Type-3, which accounts for syntactic differences. In between Type-3 and Type-4, however, there lies a spectrum of clones that, although still exhibiting some syntactic similarities…

    Submitted 15 June, 2018; originally announced June 2018.

    ACM Class: D.2.13

    Journal ref: Proceedings of the 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (2018) 354-365

  31. arXiv:1804.09548  [pdf]

    cs.CV

    Applying Faster R-CNN for Object Detection on Malaria Images

    Authors: Jane Hung, Deepali Ravel, Stefanie C. P. Lopes, Gabriel Rangel, Odailton Amaral Nery, Benoit Malleret, Francois Nosten, Marcus V. G. Lacerda, Marcelo U. Ferreira, Laurent Rénia, Manoj T. Duraisingh, Fabio T. M. Costa, Matthias Marti, Anne E. Carpenter

    Abstract: Deep learning based models have had great success in object detection, but the state of the art models have not yet been widely applied to biological image data. We apply for the first time an object detection model previously used on natural images to identify cells and recognize their stages in brightfield microscopy images of malaria-infected blood. Many micro-organisms like malaria parasites a…

    Submitted 11 March, 2019; v1 submitted 25 April, 2018; originally announced April 2018.

    Comments: CVPR 2017: computer vision for microscopy image analysis (CVMI) Workshop

  32. arXiv:1804.04621  [pdf, other]

    cs.SE

    The Java Build Framework: Large Scale Compilation

    Authors: Pedro Martins, Rohan Achar, Cristina V. Lopes

    Abstract: Large repositories of source code for research tend to limit their utility to static analysis of the code, as they give no guarantees on whether the projects are compilable, much less runnable in any way. The immediate consequence of the lack of large compilable and runnable datasets is that research that requires such properties does not generalize beyond small benchmarks. We present the Java Bui…

    Submitted 12 April, 2018; originally announced April 2018.

  33. arXiv:1803.06464  [pdf, other]

    cs.CY cs.MA eess.SY

    Toward Understanding the Impact of User Participation in Autonomous Ridesharing Systems

    Authors: Wen Shen, Rohan Achar, Cristina V. Lopes

    Abstract: Autonomous ridesharing systems (ARS) promise many societal and environmental benefits, including decreased accident rates, reduced energy consumption and pollutant emissions, and diminished land use for parking. To unleash ARS' potential, stakeholders must understand how the degree of passenger participation influences the ridesharing systems' efficiency. To date, however, a careful study that qua…

    Submitted 28 March, 2018; v1 submitted 17 March, 2018; originally announced March 2018.

    Comments: 17 pages, 11 figures

    Journal ref: Proceedings of the 2018 Winter Simulation Conference

  34. arXiv:1709.04049  [pdf, other]

    cs.AI cs.CY cs.MA

    Information Design in Crowdfunding under Thresholding Policies

    Authors: Wen Shen, Jacob W. Crandall, Ke Yan, Cristina V. Lopes

    Abstract: Crowdfunding has emerged as a prominent way for entrepreneurs to secure funding without sophisticated intermediation. In crowdfunding, an entrepreneur often has to decide how to disclose the campaign status in order to collect as many contributions as possible. Such decisions are difficult to make primarily due to incomplete information. We propose information design as a tool to help the entrepre…

    Submitted 28 March, 2018; v1 submitted 12 September, 2017; originally announced September 2017.

    Comments: 9 pages, 2 figures, In Proceedings of the 17th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2018)

  35. arXiv:1705.01198  [pdf, other]

    cs.SE

    Stack Overflow in Github: Any Snippets There?

    Authors: Di Yang, Pedro Martins, Vaibhav Saini, Cristina Lopes

    Abstract: When programmers look for how to achieve certain programming tasks, Stack Overflow is a popular destination in search engine results. Over the years, Stack Overflow has accumulated an impressive knowledge base of snippets of code that are amply documented. We are interested in studying how programmers use these snippets of code in their projects. Can we find Stack Overflow snippets in real project…

    Submitted 2 May, 2017; originally announced May 2017.

    Comments: 14th International Conference on Mining Software Repositories, 11 pages

  36. arXiv:1610.07520  [pdf, other]

    eess.SY cs.LG

    Nonlinear Adaptive Algorithms on Rank-One Tensor Models

    Authors: Felipe C. Pinheiro, Cassio G. Lopes

    Abstract: This work proposes a low complexity nonlinearity model and develops adaptive algorithms over it. The model is based on the decomposable (or rank-one, in tensor language) Volterra kernels. It may also be described as a product of FIR filters, which explains its low-complexity. The rank-one model is also interesting because it comes from a well-posed problem in approximation theory. The paper uses…

    Submitted 24 October, 2016; originally announced October 2016.

  37. arXiv:1608.08736  [pdf, other]

    cs.SE

    Collective Intelligence for Smarter API Recommendations in Python

    Authors: Andrea Renika D'Souza, Di Yang, Cristina V. Lopes

    Abstract: Software developers use Application Programming Interfaces (APIs) of libraries and frameworks extensively while writing programs. In this context, the recommendations provided in code completion pop-ups help developers choose the desired methods. The candidate lists recommended by these tools, however, tend to be large, ordered alphabetically and sometimes even incomplete. A fair amount of work ha…

    Submitted 31 August, 2016; originally announced August 2016.

    Comments: 10 pages, SCAM 2016

  38. arXiv:1608.03248  [pdf, other]

    math.OC cs.IT cs.LG

    Combination of LMS Adaptive Filters with Coefficients Feedback

    Authors: Luiz F. O. Chamon, Cassio G. Lopes

    Abstract: Parallel combinations of adaptive filters have been effectively used to improve the performance of adaptive algorithms and address well-known trade-offs, such as convergence rate vs. steady-state error. Nevertheless, typical combinations suffer from a convergence stagnation issue due to the fact that the component filters run independently. Solutions to this issue usually involve conditional trans…

    Submitted 19 November, 2017; v1 submitted 10 August, 2016; originally announced August 2016.

    Comments: 13 pages, 12 figures

  39. From Query to Usable Code: An Analysis of Stack Overflow Code Snippets

    Authors: Di Yang, Aftab Hussain, Cristina Lopes

    Abstract: Enriched by natural language texts, Stack Overflow code snippets are an invaluable code-centric knowledge base of small units of source code. Besides being useful for software developers, these annotated snippets can potentially serve as the basis for automated tools that provide working code solutions to specific natural language queries. With the goal of developing automated tools with the Sta…

    Submitted 14 May, 2016; originally announced May 2016.

    Comments: 13th IEEE/ACM International Conference on Mining Software Repositories, 11 pages

  40. arXiv:1603.02208  [pdf, other]

    cs.AI cs.GT

    An Online Mechanism for Ridesharing in Autonomous Mobility-on-Demand Systems

    Authors: Wen Shen, Cristina V. Lopes, Jacob W. Crandall

    Abstract: With proper management, Autonomous Mobility-on-Demand (AMoD) systems have great potential to satisfy the transport demands of urban populations by providing safe, convenient, and affordable ridesharing services. Meanwhile, such systems can substantially decrease private car ownership and use, and thus significantly reduce traffic congestion, energy consumption, and carbon emissions. To achieve thi…

    Submitted 1 March, 2017; v1 submitted 7 March, 2016; originally announced March 2016.

    Journal ref: Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI 2016) pp. 475-481

  41. SourcererCC and SourcererCC-I: Tools to Detect Clones in Batch mode and During Software Development

    Authors: Vaibhav Saini, Hitesh Sajnani, Jaewoo Kim, Cristina Lopes

    Abstract: Given the availability of large source-code repositories, there has been a large number of applications for large-scale clone detection. Unfortunately, despite a decade of active research, there is a marked lack in clone detectors that scale to big software systems or large repositories, specifically for detecting near-miss (Type 3) clones where significant editing activities may take place in the…

    Submitted 4 March, 2016; originally announced March 2016.

    Comments: ICSE 2016 pre-print. arXiv admin note: substantial text overlap with arXiv:1512.06448

    Journal ref: Proceedings of the 38th International Conference on Software Engineering Companion Pages 597-600, 2016

  42. arXiv:1603.00427  [pdf, other]

    eess.SY cs.LG

    A Nonlinear Adaptive Filter Based on the Model of Simple Multilinear Functionals

    Authors: Felipe C. Pinheiro, Cássio G. Lopes

    Abstract: Nonlinear adaptive filtering allows for modeling of some additional aspects of a general system and usually relies on highly complex algorithms, such as those based on the Volterra series. Through the use of the Kronecker product and some basic facts of tensor algebra, we propose a simple model of nonlinearity, one that can be interpreted as a product of the outputs of K FIR linear filters, and co…

    Submitted 1 March, 2016; originally announced March 2016.

    Comments: 5 pages, one of references, plus extra page attached

  43. Geometric-Algebra LMS Adaptive Filter and its Application to Rotation Estimation

    Authors: Wilder B. Lopes, Anas Al-Nuaimi, Cassio G. Lopes

    Abstract: This paper exploits Geometric (Clifford) Algebra (GA) theory in order to devise and introduce a new adaptive filtering strategy. From a least-squares cost function, the gradient is calculated following results from Geometric Calculus (GC), the extension of GA to handle differential and integral calculus. The novel GA least-mean-squares (GA-LMS) adaptive filter, which inherits properties from stand…

    Submitted 22 January, 2016; originally announced January 2016.

    Comments: 4 pages of content plus 1 of references; 4 figures. Supplementary material (codes and datasets) available at www.lps.usp.br/wilder

  44. SourcererCC: Scaling Code Clone Detection to Big Code

    Authors: Hitesh Sajnani, Vaibhav Saini, Jeffrey Svajlenko, Chanchal K. Roy, Cristina V. Lopes

    Abstract: Despite a decade of active research, there is a marked lack in clone detectors that scale to very large repositories of source code, in particular for detecting near-miss clones where significant editing activities may take place in the cloned code. We present SourcererCC, a token-based clone detector that targets three clone types, and exploits an index to achieve scalability to large inter-proje…

    Submitted 20 December, 2015; originally announced December 2015.

    Comments: Accepted for publication at ICSE'16 (preprint, unrevised)

  45. arXiv:1508.04465  [pdf, other]

    cs.SE cs.DC cs.PF

    On Designing and Testing Distributed Virtual Environments

    Authors: Arthur Valadares, Eugenia Gabrielova, Cristina V. Lopes

    Abstract: Distributed Real-Time (DRT) systems are among the most complex software systems to design, test, maintain and evolve. The existence of components distributed over a network often conflicts with real-time requirements, leading to design strategies that depend on domain- and even application-specific knowledge. Distributed Virtual Environment (DVE) systems are DRT systems that connect multiple users…

    Submitted 18 August, 2015; originally announced August 2015.

    Comments: Wiley Journal on Concurrency and Computation: Practice and Experience, to appear (preprint)

  46. How Scale Affects Structure in Java Programs

    Authors: Cristina V. Lopes, Joel Ossher

    Abstract: Many internal software metrics and external quality attributes of Java programs correlate strongly with program size. This knowledge has been used pervasively in quantitative studies of software through practices such as normalization on size metrics. This paper reports size-related super- and sublinear effects that have not been known before. Findings obtained on a very large collection of Java p…

    Submitted 3 August, 2015; originally announced August 2015.

    Comments: ACM Conference on Object-Oriented Programming, Systems, Languages and Applications (OOPSLA), October 2015. (Preprint)

  47. Managing Autonomous Mobility on Demand Systems for Better Passenger Experience

    Authors: Wen Shen, Cristina Lopes

    Abstract: Autonomous mobility on demand systems, though still in their infancy, have very promising prospects in providing urban population with sustainable and safe personal mobility in the near future. While much research has been conducted on both autonomous vehicles and mobility on demand systems, to the best of our knowledge, this is the first work that shows how to manage autonomous mobility on demand…

    Submitted 9 July, 2015; originally announced July 2015.

    Journal ref: Proceedings of the 18th International Conference on Principles and Practice of Multi-Agent Systems (PRIMA 2015). pp 20-35. Lecture Notes in Computer Science, vol 9387. Springer

  48. arXiv:1204.5446  [pdf, other]

    cs.CR

    Verifying Search Results Over Web Collections

    Authors: Michael T. Goodrich, Duy Nguyen, Olga Ohrimenko, Charalampos Papamanthou, Roberto Tamassia, Nikos Triandopoulos, Cristina Videira Lopes

    Abstract: Searching accounts for one of the most frequently performed computations over the Internet as well as one of the most important applications of outsourced computing, producing results that critically affect users' decision-making behaviors. As such, verifying the integrity of Internet-based searches over vast amounts of web contents is essential. We provide the first solution to this general sec…

    Submitted 17 December, 2012; v1 submitted 24 April, 2012; originally announced April 2012.

  49. arXiv:1010.3951  [pdf]

    cs.MM cs.SD

    Alternatives to speech in low bit rate communication systems

    Authors: Cristina Videira Lopes, Pedro M. Q. Aguiar

    Abstract: This paper describes a framework and a method with which speech communication can be analyzed. The framework consists of a set of low bit rate, short-range acoustic communication systems, such as speech, but that are quite different from speech. The method is to systematically compare these systems according to different objective functions such as data rate, computational overhead, psychoacoustic…

    Submitted 19 October, 2010; originally announced October 2010.

    Comments: 4 pages, 2 figures

  50. arXiv:0801.2618  [pdf, ps, other]

    cs.SE cs.IR cs.NI

    Survey of Technologies for Web Application Development

    Authors: Barry Doyle, Cristina Videira Lopes

    Abstract: Web-based application developers face a dizzying array of platforms, languages, frameworks and technical artifacts to choose from. We survey, classify, and compare technologies supporting Web application development. The classification is based on (1) foundational technologies; (2) integration with other information sources; and (3) dynamic content generation. We further survey and classify softw…

    Submitted 17 January, 2008; originally announced January 2008.

    Comments: 43 pages

    ACM Class: A.1; D.1.0; D.1.1; D.2.11; H.3.5; H.5.4