Skip to main content

Showing 1–7 of 7 results for author: Wallace, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.09021  [pdf, ps, other

    cs.SE cs.AI cs.PL

    AI-Mediated Code Comment Improvement

    Authors: Maria Dhakal, Chia-Yi Su, Robert Wallace, Chris Fakhimi, Aakash Bansal, Toby Li, Yu Huang, Collin McMillan

    Abstract: This paper describes an approach to improve code comments along different quality axes by rewriting those comments with customized Artificial Intelligence (AI)-based tools. We conduct an empirical study followed by grounded theory qualitative analysis to determine the quality axes to improve. Then we propose a procedure using a Large Language Model (LLM) to rewrite existing code comments along the… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  2. arXiv:2503.13352  [pdf, other

    physics.chem-ph cs.LG

    Strain Problems got you in a Twist? Try StrainRelief: A Quantum-Accurate Tool for Ligand Strain Calculations

    Authors: Ewan R. S. Wallace, Nathan C. Frey, Joshua A. Rackers

    Abstract: Ligand strain energy, the energy difference between the bound and unbound conformations of a ligand, is an important component of structure-based small molecule drug design. A large majority of observed ligands in protein-small molecule co-crystal structures bind in low-strain conformations, making strain energy a useful filter for structure-based drug design. In this work we present a tool for ca… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  3. arXiv:2412.14486  [pdf, other

    cs.HC cs.IR

    Moving Beyond LDA: A Comparison of Unsupervised Topic Modelling Techniques for Qualitative Data Analysis of Online Communities

    Authors: Amandeep Kaur, James R. Wallace

    Abstract: Social media constitutes a rich and influential source of information for qualitative researchers. Although computational techniques like topic modelling assist with managing the volume and diversity of social media content, qualitative researcher's lack of programming expertise creates a significant barrier to their adoption. In this paper we explore how BERTopic, an advanced Large Language Model… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

  4. arXiv:2412.14481  [pdf, other

    cs.HC

    The Shape of Agency: Designing for Personal Agency in Qualitative Data Analysis

    Authors: Luka Ugaya Mazza, Plinio Morita, James R. Wallace

    Abstract: Computational thematic analysis is rapidly emerging as a method of using large text corpora to understand the lived experience of people across the continuum of health care: patients, practitioners, and everyone in between. However, many qualitative researchers do not have the necessary programming skills to write machine learning code on their own, but also seek to maintain ownership, intimacy, a… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

  5. Programmer Visual Attention During Context-Aware Code Summarization

    Authors: Robert Wallace, Aakash Bansal, Zachary Karas, Ningzhi Tang, Yu Huang, Toby Jia-Jun Li, Collin McMillan

    Abstract: Abridged: Programmer attention represents the visual focus of programmers on parts of the source code in pursuit of programming tasks. We conducted an in-depth human study with 10 Java programmers, where each programmer generated summaries for 40 methods from five large Java projects over five one-hour sessions. We used eye-tracking equipment to map the visual attention of programmers while they w… ▽ More

    Submitted 25 March, 2025; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 13 pages, 4 figures, 5 tables. Published in IEEE Transactions on Software Engineering

  6. arXiv:1903.00040  [pdf, other

    cs.SE

    EyeDoc: Documentation Navigation with Eye Tracking

    Authors: Robert Wallace, Collin McMillan

    Abstract: We demonstrate EyeDoc, a tool for navigating software documentation with the use of the eyes. When programming, developers often have many windows open such as an IDE, consoles and GUIs for third-party utilities, the application under development, and a web browser for navigating documentation. Several studies have shown that the navigation among these different tasks imposes a small mental load w… ▽ More

    Submitted 28 February, 2019; originally announced March 2019.

  7. arXiv:1701.08339  [pdf, other

    cs.CL

    Using English as Pivot to Extract Persian-Italian Parallel Sentences from Non-Parallel Corpora

    Authors: Ebrahim Ansari, M. H. Sadreddini, Mostafa Sheikhalishahi, Richard Wallace, Fatemeh Alimardani

    Abstract: The effectiveness of a statistical machine translation system (SMT) is very dependent upon the amount of parallel corpus used in the training phase. For low-resource language pairs there are not enough parallel corpora to build an accurate SMT. In this paper, a novel approach is presented to extract bilingual Persian-Italian parallel sentences from a non-parallel (comparable) corpus. In this study… ▽ More

    Submitted 28 January, 2017; originally announced January 2017.

    Comments: 30 pages, Accepted to be published in "Applications of Comparable Corpora", Berlin: Language Science Press