Skip to main content

Showing 1–18 of 18 results for author: Damevski, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.05992  [pdf, other

    cs.SE cs.CL cs.CY

    Psycholinguistic Analyses in Software Engineering Text: A Systematic Literature Review

    Authors: Amirali Sajadi, Kostadin Damevski, Preetha Chatterjee

    Abstract: Context: A deeper understanding of human factors in software engineering (SE) is essential for improving team collaboration, decision-making, and productivity. Communication channels like code reviews and chats provide insights into developers' psychological and emotional states. While large language models excel at text analysis, they often lack transparency and precision. Psycholinguistic tools… ▽ More

    Submitted 17 April, 2025; v1 submitted 7 March, 2025; originally announced March 2025.

  2. arXiv:2503.02191  [pdf, other

    cs.SE

    Understanding and Predicting Derailment in Toxic Conversations on GitHub

    Authors: Mia Mohammad Imran, Robert Zita, Rebekah Copeland, Preetha Chatterjee, Rahat Rizvi Rahman, Kostadin Damevski

    Abstract: Software projects thrive on the involvement and contributions of individuals from different backgrounds. However, toxic language and negative interactions can hinder the participation and retention of contributors and alienate newcomers. Proactive moderation strategies aim to prevent toxicity from occurring by addressing conversations that have derailed from their intended purpose. This study aims… ▽ More

    Submitted 19 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

  3. arXiv:2502.14202  [pdf, other

    cs.SE cs.AI cs.CL cs.LG

    Do LLMs Consider Security? An Empirical Study on Responses to Programming Questions

    Authors: Amirali Sajadi, Binh Le, Anh Nguyen, Kostadin Damevski, Preetha Chatterjee

    Abstract: The widespread adoption of conversational LLMs for software development has raised new security concerns regarding the safety of LLM-generated content. Our motivational study outlines ChatGPT's potential in volunteering context-specific information to the developers, promoting safe coding practices. Motivated by this finding, we conduct a study to evaluate the degree of security awareness exhibite… ▽ More

    Submitted 3 April, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: Accepted to EMSE

  4. arXiv:2411.11659  [pdf, ps, other

    cs.SE

    Improving Data Curation of Software Vulnerability Patches through Uncertainty Quantification

    Authors: Hui Chen, Yunhua Zhao, Kostadin Damevski

    Abstract: The changesets (or patches) that fix open source software vulnerabilities form critical datasets for various machine learning security-enhancing applications, such as automated vulnerability patching and silent fix detection. These patch datasets are derived from extensive collections of historical vulnerability fixes, maintained in databases like the Common Vulnerabilities and Exposures list and… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  5. arXiv:2402.04183  [pdf, other

    cs.SE

    Incivility in Open Source Projects: A Comprehensive Annotated Dataset of Locked GitHub Issue Threads

    Authors: Ramtin Ehsani, Mia Mohammad Imran, Robert Zita, Kostadin Damevski, Preetha Chatterjee

    Abstract: In the dynamic landscape of open source software (OSS) development, understanding and addressing incivility within issue discussions is crucial for fostering healthy and productive collaborations. This paper presents a curated dataset of 404 locked GitHub issue discussion threads and 5961 individual comments, collected from 213 OSS projects. We annotated the comments with various categories of inc… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  6. arXiv:2312.10297  [pdf, other

    cs.SE

    Shedding Light on Software Engineering-specific Metaphors and Idioms

    Authors: Mia Mohammad Imran, Preetha Chatterjee, Kostadin Damevski

    Abstract: Use of figurative language, such as metaphors and idioms, is common in our daily-life communications, and it can also be found in Software Engineering (SE) channels, such as comments on GitHub. Automatically interpreting figurative language is a challenging task, even with modern Large Language Models (LLMs), as it often involves subtle nuances. This is particularly true in the SE domain, where fi… ▽ More

    Submitted 23 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

  7. arXiv:2312.09731  [pdf, other

    cs.SE

    Uncovering the Causes of Emotions in Software Developer Communication Using Zero-shot LLMs

    Authors: Mia Mohammad Imran, Preetha Chatterjee, Kostadin Damevski

    Abstract: Understanding and identifying the causes behind developers' emotions (e.g., Frustration caused by `delays in merging pull requests') can be crucial towards finding solutions to problems and fostering collaboration in open-source communities. Effectively identifying such information in the high volume of communications across the different project channels, such as chats, emails, and issue comments… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  8. Interpersonal Trust in OSS: Exploring Dimensions of Trust in GitHub Pull Requests

    Authors: Amirali Sajadi, Kostadin Damevski, Preetha Chatterjee

    Abstract: Interpersonal trust plays a crucial role in facilitating collaborative tasks, such as software development. While previous research recognizes the significance of trust in an organizational setting, there is a lack of understanding in how trust is exhibited in OSS distributed teams, where there is an absence of direct, in-person communications. To foster trust and collaboration in OSS teams, we ne… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Journal ref: 2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

  9. arXiv:2311.04755  [pdf, other

    cs.SE

    Towards Understanding Emotions in Informal Developer Interactions: A Gitter Chat Study

    Authors: Amirali Sajadi, Kostadin Damevski, Preetha Chatterjee

    Abstract: Emotions play a significant role in teamwork and collaborative activities like software development. While researchers have analyzed developer emotions in various software artifacts (e.g., issues, pull requests), few studies have focused on understanding the broad spectrum of emotions expressed in chats. As one of the most widely used means of communication, chats contain valuable information in t… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  10. arXiv:2305.16430  [pdf, other

    cs.SE

    Too Few Bug Reports? Exploring Data Augmentation for Improved Changeset-based Bug Localization

    Authors: Agnieszka Ciborowska, Kostadin Damevski

    Abstract: Modern Deep Learning (DL) architectures based on transformers (e.g., BERT, RoBERTa) are exhibiting performance improvements across a number of natural language tasks. While such DL models have shown tremendous potential for use in software engineering applications, they are often hampered by insufficient training data. Particularly constrained are applications that require project-specific data, s… ▽ More

    Submitted 1 June, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  11. arXiv:2208.05573  [pdf, other

    cs.SE

    Data Augmentation for Improving Emotion Recognition in Software Engineering Communication

    Authors: Mia Mohammad Imran, Yashasvi Jain, Preetha Chatterjee, Kostadin Damevski

    Abstract: Emotions (e.g., Joy, Anger) are prevalent in daily software engineering (SE) activities, and are known to be significant indicators of work productivity (e.g., bug fixing efficiency). Recent studies have shown that directly applying general purpose emotion classification tools to SE corpora is not effective. Even within the SE domain, tool performance degrades significantly when trained on one com… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

  12. Using clarification questions to improve software developers' Web search

    Authors: Mia Mohammad Imran, Kostadin Damevski

    Abstract: Context: Recent research indicates that Web queries written by software developers are not very successful in retrieving relevant results, performing measurably worse compared to general purpose Web queries. Most approaches up to this point have addressed this problem with software engineering-specific automated query reformulation techniques, which work without developer involvement but are limit… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  13. arXiv:2203.03544  [pdf, other

    cs.SE

    Online Adaptable Bug Localization for Rapidly Evolving Software

    Authors: Agnieszka Ciborowska, Michael J. Decker, Kostadin Damevski

    Abstract: Bug localization aims to reduce debugging time by recommending program elements that are relevant for a specific bug report. To date, researchers have primarily addressed this problem by applying different information retrieval techniques that leverage similarities between a given bug report and source code. However, with modern software development trending towards increased speed of software cha… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  14. arXiv:2112.14169  [pdf, other

    cs.SE

    Fast Changeset-based Bug Localization with BERT

    Authors: Agnieszka Ciborowska, Kostadin Damevski

    Abstract: Automatically localizing software bugs to the changesets that induced them has the potential to improve software developer efficiency and to positively affect software quality. To facilitate this automation, a bug report has to be effectively matched with source code changes, even when a significant lexical gap exists between natural language used to describe the bug and identifier naming practice… ▽ More

    Submitted 11 April, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

  15. arXiv:1904.07072  [pdf, other

    cs.SE

    Modeling Hierarchical Usage Context for Software Exceptions based on Interaction Data

    Authors: Hui Chen, Kostadin Damevski, David Shepherd, Nicholas A. Kraft

    Abstract: Traces of user interactions with a software system, captured in production, are commonly used as an input source for user experience testing. In this paper, we present an alternative use, introducing a novel approach of modeling user interaction traces enriched with another type of data gathered in production - software fault reports consisting of software exceptions and stack traces. The model de… ▽ More

    Submitted 23 July, 2019; v1 submitted 15 April, 2019; originally announced April 2019.

    Comments: 24 pages, 7 figures

  16. arXiv:1612.09183  [pdf, other

    cs.OH cs.SE

    Detecting Plagiarism based on the Creation Process

    Authors: Johannes Schneider, Avi Bernstein, Jan Vom Brocke, Kostadin Damevski, David C. Shepherd

    Abstract: All methodologies for detecting plagiarism to date have focused on the final digital "outcome", such as a document or source code. Our novel approach takes the creation process into account using logged events collected by special software or by the macro recorders found in most office applications. We look at an author's interaction logs with the software used to create the work. Detection relies… ▽ More

    Submitted 19 July, 2017; v1 submitted 10 December, 2016; originally announced December 2016.

  17. arXiv:1508.04044  [pdf, ps, other

    cs.SE

    Supporting Developers in Porting Software via Combined Textual and Structural Analysis of Software Artifacts

    Authors: Kostadin Damevski, David Shepherd, Nicholas Kraft, Lori Pollock

    Abstract: This is position paper accepted to the Computational Science & Engineering Software Sustainability and Productivity Challenges (CSESSP Challenges) Workshop, sponsored by the Networking and Information Technology Research and Development (NITRD) Software Design and Productivity (SDP) Coordinating Group, held October 15th-16th 2015 in Washington DC, USA. It discusses the role recommendation systems,… ▽ More

    Submitted 17 August, 2015; originally announced August 2015.

  18. arXiv:1401.6931  [pdf, other

    cs.SE cs.IR

    How the Sando Search Tool Recommends Queries

    Authors: Xi Ge, David Shepherd, Kostadin Damevski, Emerson Murphy-Hill

    Abstract: Developers spend a significant amount of time searching their local codebase. To help them search efficiently, researchers have proposed novel tools that apply state-of-the-art information retrieval algorithms to retrieve relevant code snippets from the local codebase. However, these tools still rely on the developer to craft an effective query, which requires that the developer is familiar with t… ▽ More

    Submitted 27 January, 2014; originally announced January 2014.