Skip to main content

Showing 1–50 of 83 results for author: Arora, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.18948  [pdf, other

    cs.AI cs.CV

    Use of Metric Learning for the Recognition of Handwritten Digits, and its Application to Increase the Outreach of Voice-based Communication Platforms

    Authors: Devesh Pant, Dibyendu Talukder, Deepak Kumar, Rachit Pandey, Aaditeshwar Seth, Chetan Arora

    Abstract: Initiation, monitoring, and evaluation of development programmes can involve field-based data collection about project activities. This data collection through digital devices may not always be feasible though, for reasons such as unaffordability of smartphones and tablets by field-based cadre, or shortfalls in their training and capacity building. Paper-based data collection has been argued to be… ▽ More

    Submitted 26 April, 2025; originally announced April 2025.

    Comments: 10 Pages, 7 Figures, ACM COMPASS 2022

    Journal ref: COMPASS 2022: Proceedings of the 5th ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies COMPASS '22: Proceedings of the 5th ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies, Pages 364 - 374

  2. arXiv:2502.18694  [pdf, other

    cs.SE

    Requirements-Driven Automated Software Testing: A Systematic Review

    Authors: Fanyu Wang, Chetan Arora, Chakkrit Tantithamthavorn, Kaicheng Huang, Aldeida Aleti

    Abstract: Automated software testing has the potential to enhance efficiency and reliability in software development, yet its adoption remains hindered by challenges in aligning test generation with software requirements. REquirements-Driven Automated Software Testing (REDAST) aims to bridge this gap by leveraging requirements as the foundation for automated test artifact generation. This systematic literat… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

    Comments: Under reviewing in TOSEM

  3. arXiv:2502.18077  [pdf, other

    cs.CV cs.CR cs.LG

    Examining the Threat Landscape: Foundation Models and Model Stealing

    Authors: Ankita Raj, Deepankar Varma, Chetan Arora

    Abstract: Foundation models (FMs) for computer vision learn rich and robust representations, enabling their adaptation to task/domain-specific deployments with little to no fine-tuning. However, we posit that the very same strength can make applications based on FMs vulnerable to model stealing attacks. Through empirical analysis, we reveal that models fine-tuned from FMs harbor heightened susceptibility to… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

    Comments: Accepted to BMVC 2024

    Journal ref: 35th British Machine Vision Conference 2024, Glasgow, UK, November 25-28, 2024

  4. arXiv:2502.16471  [pdf, other

    cs.LG cs.CV

    Feature Space Perturbation: A Panacea to Enhanced Transferability Estimation

    Authors: Prafful Kumar Khoba, Zijian Wang, Chetan Arora, Mahsa Baktashmotlagh

    Abstract: Leveraging a transferability estimation metric facilitates the non-trivial challenge of selecting the optimal model for the downstream task from a pool of pre-trained models. Most existing metrics primarily focus on identifying the statistical relationship between feature embeddings and the corresponding labels within the target dataset, but overlook crucial aspect of model robustness. This oversi… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

    Journal ref: Winter Conference on Applications of Computer Vision (WACV) 2025

  5. arXiv:2502.14930  [pdf

    cs.SE

    RAGVA: Engineering Retrieval Augmented Generation-based Virtual Assistants in Practice

    Authors: Rui Yang, Michael Fu, Chakkrit Tantithamthavorn, Chetan Arora, Lisa Vandenhurk, Joey Chua

    Abstract: Retrieval-augmented generation (RAG)-based applications are gaining prominence due to their ability to leverage large language models (LLMs). These systems excel at combining retrieval mechanisms with generative capabilities, resulting in more accurate, contextually relevant responses that enhance user experience. In particular, Transurban, a road operation company, is replacing its rule-based vir… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: Under Review at the Journal of Systems and Software (JSS)

  6. arXiv:2502.04916  [pdf, other

    cs.SE

    Classification or Prompting: A Case Study on Legal Requirements Traceability

    Authors: Romina Etezadi, Sallam Abualhaija, Chetan Arora, Lionel Briand

    Abstract: New regulations are continuously introduced to ensure that software development complies with the ethical concerns and prioritizes public safety. A prerequisite for demonstrating compliance involves tracing software requirements to legal provisions. Requirements traceability is a fundamental task where requirements engineers are supposed to analyze technical requirements against target artifacts,… ▽ More

    Submitted 11 February, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

  7. arXiv:2502.00015  [pdf, other

    cs.CY cs.AI

    Ethical Concerns of Generative AI and Mitigation Strategies: A Systematic Mapping Study

    Authors: Yutan Huang, Chetan Arora, Wen Cheng Houng, Tanjila Kanij, Anuradha Madulgalla, John Grundy

    Abstract: [Context] Generative AI technologies, particularly Large Language Models (LLMs), have transformed numerous domains by enhancing convenience and efficiency in information retrieval, content generation, and decision-making processes. However, deploying LLMs also presents diverse ethical challenges, and their mitigation strategies remain complex and domain-dependent. [Objective] This paper aims to id… ▽ More

    Submitted 8 January, 2025; originally announced February 2025.

  8. arXiv:2501.16857  [pdf, other

    cs.SE

    Comparing Human and LLM Generated Code: The Jury is Still Out!

    Authors: Sherlock A. Licorish, Ansh Bajpai, Chetan Arora, Fanyu Wang, Kla Tantithamthavorn

    Abstract: Much is promised in relation to AI-supported software development. However, there has been limited evaluation effort in the research domain aimed at validating the true utility of such techniques, especially when compared to human coding outputs. We bridge this gap, where a benchmark dataset comprising 72 distinct software engineering tasks is used to compare the effectiveness of large language mo… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

    Comments: 10 pages, 6 figures

    ACM Class: D.2.4; D.2.5; D.2.8

  9. arXiv:2501.04810  [pdf, other

    cs.SE

    On the Impact of Requirements Smells in Prompts: The Case of Automated Traceability

    Authors: Andreas Vogelsang, Alexander Korn, Giovanna Broccia, Alessio Ferrari, Jannik Fischbach, Chetan Arora

    Abstract: Large language models (LLMs) are increasingly used to generate software artifacts, such as source code, tests, and trace links. Requirements play a central role in shaping the input prompts that guide LLMs, as they are often used as part of the prompts to synthesize the artifacts. However, the impact of requirements formulation on LLM performance remains unclear. In this paper, we investigate the… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: Accepted at 2025 IEEE/ACM 47th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

  10. arXiv:2412.00374  [pdf, other

    cs.CV

    LQ-Adapter: ViT-Adapter with Learnable Queries for Gallbladder Cancer Detection from Ultrasound Image

    Authors: Chetan Madan, Mayuna Gupta, Soumen Basu, Pankaj Gupta, Chetan Arora

    Abstract: We focus on the problem of Gallbladder Cancer (GBC) detection from Ultrasound (US) images. The problem presents unique challenges to modern Deep Neural Network (DNN) techniques due to low image quality arising from noise, textures, and viewpoint variations. Tackling such challenges would necessitate precise localization performance by the DNN to identify the discerning features for the downstream… ▽ More

    Submitted 30 November, 2024; originally announced December 2024.

    Comments: Accepted at WACV 2025

  11. arXiv:2411.13302  [pdf, other

    cs.CV

    Can Reasons Help Improve Pedestrian Intent Estimation? A Cross-Modal Approach

    Authors: Vaishnavi Khindkar, Vineeth Balasubramanian, Chetan Arora, Anbumani Subramanian, C. V. Jawahar

    Abstract: With the increased importance of autonomous navigation systems has come an increasing need to protect the safety of Vulnerable Road Users (VRUs) such as pedestrians. Predicting pedestrian intent is one such challenging task, where prior work predicts the binary cross/no-cross intention with a fusion of visual and motion features. However, there has been no effort so far to hedge such predictions w… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

  12. arXiv:2409.12002  [pdf, other

    cs.RO cs.CV

    Towards Global Localization using Multi-Modal Object-Instance Re-Identification

    Authors: Aneesh Chavan, Vaibhav Agrawal, Vineeth Bhat, Sarthak Chittawar, Siddharth Srivastava, Chetan Arora, K Madhava Krishna

    Abstract: Re-identification (ReID) is a critical challenge in computer vision, predominantly studied in the context of pedestrians and vehicles. However, robust object-instance ReID, which has significant implications for tasks such as autonomous exploration, long-term perception, and scene understanding, remains underexplored. In this work, we address this gap by proposing a novel dual-path object-instance… ▽ More

    Submitted 1 May, 2025; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: 8 pages, 5 figures, 3 tables. Accepted at Advances in Robotics, AIR 2025 (Oral)

    MSC Class: 68T40 ACM Class: I.2.9; I.2.10

  13. arXiv:2408.14698  [pdf, other

    cs.IR cs.AI cs.CL cs.CV

    Smart Multi-Modal Search: Contextual Sparse and Dense Embedding Integration in Adobe Express

    Authors: Cherag Aroraa, Tracy Holloway King, Jayant Kumar, Yi Lu, Sanat Sharma, Arvind Srikantan, David Uvalle, Josep Valls-Vargas, Harsha Vardhan

    Abstract: As user content and queries become increasingly multi-modal, the need for effective multi-modal search systems has grown. Traditional search systems often rely on textual and metadata annotations for indexed images, while multi-modal embeddings like CLIP enable direct search using text and image embeddings. However, embedding-based approaches face challenges in integrating contextual features such… ▽ More

    Submitted 29 August, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

    Comments: CIKM 2024 (International Conference on Information and Knowledge Management), Multimodal Search and Recommendations Workshop

  14. arXiv:2408.10577  [pdf, other

    cs.SE

    Optimizing Large Language Model Hyperparameters for Code Generation

    Authors: Chetan Arora, Ahnaf Ibn Sayeed, Sherlock Licorish, Fanyu Wang, Christoph Treude

    Abstract: Large Language Models (LLMs), such as GPT models, are increasingly used in software engineering for various tasks, such as code generation, requirements management, and debugging. While automating these tasks has garnered significant attention, a systematic study on the impact of varying hyperparameters on code generation outcomes remains unexplored. This study aims to assess LLMs' code generation… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  15. arXiv:2408.01621  [pdf, other

    cs.SE

    Managing Human-Centric Software Defects: Insights from GitHub and Practitioners' Perspectives

    Authors: Vedant Chauhan, Chetan Arora, Hourieh Khalajzadeh, John Grundy

    Abstract: Context: Human-centric defects (HCDs) are nuanced and subjective defects that often occur due to end-user perceptions or differences, such as their genders, ages, cultures, languages, disabilities, socioeconomic status, and educational backgrounds. Development teams have a limited understanding of these issues, which leads to the neglect of these defects. Defect reporting tools do not adequately h… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  16. arXiv:2407.06585  [pdf, other

    cs.CV

    D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms

    Authors: Tajamul Ashraf, Krithika Rangarajan, Mohit Gambhir, Richa Gabha, Chetan Arora

    Abstract: We focus on the problem of Unsupervised Domain Adaptation (\uda) for breast cancer detection from mammograms (BCDM) problem. Recent advancements have shown that masked image modeling serves as a robust pretext task for UDA. However, when applied to cross-domain BCDM, these techniques struggle with breast abnormalities such as masses, asymmetries, and micro-calcifications, in part due to the typica… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  17. Web3 and the State: Indian state's redescription of blockchain

    Authors: Debarun Sarkar, Cheshta Arora

    Abstract: The article closely reads a discussion paper by the National Institution for Transforming India (NITI) Aayog and a strategy paper by the Ministry of Electronics and Information Technology (MeitY) advocating non-financial use cases of blockchain in India. By noting the discursive shift from transparency to trust to adjustably transparent enacted in these two documents, and consequently the Indian s… ▽ More

    Submitted 2 October, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Journal ref: First Monday 29 (10)

  18. arXiv:2404.16831  [pdf, other

    cs.CV

    The Third Monocular Depth Estimation Challenge

    Authors: Jaime Spencer, Fabio Tosi, Matteo Poggi, Ripudaman Singh Arora, Chris Russell, Simon Hadfield, Richard Bowden, GuangYuan Zhou, ZhengXin Li, Qiang Rao, YiPing Bao, Xiao Liu, Dohyeong Kim, Jinseong Kim, Myunghyun Kim, Mykola Lavreniuk, Rui Li, Qing Mao, Jiang Wu, Yu Zhu, Jinqiu Sun, Yanning Zhang, Suraj Patni, Aradhye Agarwal, Chetan Arora , et al. (16 additional authors not shown)

    Abstract: This paper discusses the results of the third edition of the Monocular Depth Estimation Challenge (MDEC). The challenge focuses on zero-shot generalization to the challenging SYNS-Patches dataset, featuring complex scenes in natural and indoor settings. As with the previous edition, methods can use any form of supervision, i.e. supervised or self-supervised. The challenge received a total of 19 su… ▽ More

    Submitted 27 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: To appear in CVPRW2024

  19. Generating Test Scenarios from NL Requirements using Retrieval-Augmented LLMs: An Industrial Study

    Authors: Chetan Arora, Tomas Herda, Verena Homm

    Abstract: Test scenarios are specific instances of test cases that describe actions to validate a particular software functionality. By outlining the conditions under which the software operates and the expected outcomes, test scenarios ensure that the software functionality is tested in an integrated manner. Test scenarios are crucial for systematically testing an application under various conditions, incl… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  20. arXiv:2404.06371  [pdf, other

    cs.SE cs.CL cs.LG

    Model Generation with LLMs: From Requirements to UML Sequence Diagrams

    Authors: Alessio Ferrari, Sallam Abualhaija, Chetan Arora

    Abstract: Complementing natural language (NL) requirements with graphical models can improve stakeholders' communication and provide directions for system design. However, creating models from requirements involves manual effort. The advent of generative large language models (LLMs), ChatGPT being a notable example, offers promising avenues for automated assistance in model generation. This paper investigat… ▽ More

    Submitted 1 July, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    ACM Class: D.2; K.6.3; D.2.1; D.3.1; D.2.2; D.2.10; D.2.2; I.2; I.2.7

  21. arXiv:2404.05442  [pdf

    cs.HC cs.SE

    Unlocking Adaptive User Experience with Generative AI

    Authors: Yutan Huang, Tanjila Kanij, Anuradha Madugalla, Shruti Mahajan, Chetan Arora, John Grundy

    Abstract: Developing user-centred applications that address diverse user needs requires rigorous user research. This is time, effort and cost-consuming. With the recent rise of generative AI techniques based on Large Language Models (LLMs), there is a possibility that these powerful tools can be used to develop adaptive interfaces. This paper presents a novel approach to develop user personas and adaptive i… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  22. arXiv:2404.05425  [pdf, other

    cs.SE

    Requirements Elicitation in Government Projects: A Preliminary Empirical Study

    Authors: Anqi Ren, Lin Liu, Yi Wang, Xiao Liu, Hailong Wang, Kaijia Xu, Xishuo Zhang, Chetan Arora

    Abstract: Government development projects vary significantly from private sector initiatives in scope, stakeholder complexity, and regulatory requirements. There is a lack of empirical studies focusing on requirements engineering (RE) activities specifically for government projects. We addressed this gap by conducting a series of semi-structured interviews with 12 professional software practitioners working… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  23. Towards Standards-Compliant Assistive Technology Product Specifications via LLMs

    Authors: Chetan Arora, John Grundy, Louise Puli, Natasha Layton

    Abstract: In the rapidly evolving field of assistive technology (AT), ensuring that products meet national and international standards is essential for user safety, efficacy, and accessibility. In this vision paper, we introduce CompliAT, a pioneering framework designed to streamline the compliance process of AT product specifications with these standards through the innovative use of Large Language Models… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  24. arXiv:2403.18807  [pdf, other

    cs.CV cs.AI cs.LG

    ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation

    Authors: Suraj Patni, Aradhye Agarwal, Chetan Arora

    Abstract: In the absence of parallax cues, a learning-based single image depth estimation (SIDE) model relies heavily on shading and contextual cues in the image. While this simplicity is attractive, it is necessary to train such models on large and varied datasets, which are difficult to capture. It has been shown that using embeddings from pre-trained foundational models, such as CLIP, improves zero shot… ▽ More

    Submitted 17 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  25. arXiv:2403.15917  [pdf, other

    cs.SE

    Who Uses Personas in Requirements Engineering: The Practitioners' Perspective

    Authors: Yi Wang, Chetan Arora, Xiao Liu, Thuong Hoang, Vasudha Malhotra, Ben Cheng, John Grundy

    Abstract: Personas are commonly used in software projects to gain a better understanding of end-users' needs. However, there is a limited understanding of their usage and effectiveness in practice. This paper presents the results of a two-step investigation, comprising interviews with 26 software developers, UI/UX designers, business analysts and product managers and a survey of 203 practitioners, aimed at… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  26. arXiv:2403.08848  [pdf, other

    eess.IV cs.CV

    FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders

    Authors: Soumen Basu, Mayuna Gupta, Chetan Madan, Pankaj Gupta, Chetan Arora

    Abstract: In recent years, automated Gallbladder Cancer (GBC) detection has gained the attention of researchers. Current state-of-the-art (SOTA) methodologies relying on ultrasound sonography (US) images exhibit limited generalization, emphasizing the need for transformative approaches. We observe that individual US frames may lack sufficient information to capture disease manifestation. This study advocate… ▽ More

    Submitted 29 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: To Appear at CVPR 2024

  27. arXiv:2402.11910  [pdf, other

    cs.SE

    Enhancing Large Language Models for Text-to-Testcase Generation

    Authors: Saranya Alagarsamy, Chakkrit Tantithamthavorn, Wannita Takerngsaksiri, Chetan Arora, Aldeida Aleti

    Abstract: Context: Test-driven development (TDD) is a widely employed software development practice that involves developing test cases based on requirements prior to writing the code. Although various methods for automated test case generation have been proposed, they are not specifically tailored for TDD, where requirements instead of code serve as input. Objective: In this paper, we introduce a text-to-t… ▽ More

    Submitted 1 April, 2025; v1 submitted 19 February, 2024; originally announced February 2024.

  28. arXiv:2402.02726  [pdf, other

    cs.SE

    How do software practitioners perceive human-centric defects?

    Authors: Vedant Chauhan, Chetan Arora, Hourieh Khalajzadeh, John Grundy

    Abstract: Context: Human-centric software design and development focuses on how users want to carry out their tasks rather than making users accommodate their software. Software users can have different genders, ages, cultures, languages, disabilities, socioeconomic statuses, and educational backgrounds, among many other differences. Due to the inherently varied nature of these differences and their impact… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  29. arXiv:2401.08097  [pdf, other

    cs.SE cs.AI cs.CY

    Fairness Concerns in App Reviews: A Study on AI-based Mobile Apps

    Authors: Ali Rezaei Nasab, Maedeh Dashti, Mojtaba Shahin, Mansooreh Zahedi, Hourieh Khalajzadeh, Chetan Arora, Peng Liang

    Abstract: Fairness is one of the socio-technical concerns that must be addressed in software systems. Considering the popularity of mobile software applications (apps) among a wide range of individuals worldwide, mobile apps with unfair behaviors and outcomes can affect a significant proportion of the global population, potentially more than any other type of software system. Users express a wide range of s… ▽ More

    Submitted 31 July, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: Preprint accepted for publication in ACM Transactions on Software Engineering and Methodology (TOSEM), 2024

  30. arXiv:2401.01508  [pdf, other

    cs.SE

    Practical Guidelines for the Selection and Evaluation of Natural Language Processing Techniques in Requirements Engineering

    Authors: Mehrdad Sabetzadeh, Chetan Arora

    Abstract: Natural Language Processing (NLP) is now a cornerstone of requirements automation. One compelling factor behind the growing adoption of NLP in Requirements Engineering (RE) is the prevalent use of natural language (NL) for specifying requirements in industry. NLP techniques are commonly used for automatically classifying requirements, extracting important information, e.g., domain models and gloss… ▽ More

    Submitted 16 July, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: This article will appear as Chapter 15 in a book titled "Handbook of Natural Language Processing for Requirements Engineering", to be published by Springer

  31. arXiv:2311.09086  [pdf, other

    cs.CL cs.AI cs.SI

    The Uli Dataset: An Exercise in Experience Led Annotation of oGBV

    Authors: Arnav Arora, Maha Jinadoss, Cheshta Arora, Denny George, Brindaalakshmi, Haseena Dawood Khan, Kirti Rawat, Div, Ritash, Seema Mathur, Shivani Yadav, Shehla Rashid Shora, Rie Raut, Sumit Pawar, Apurva Paithane, Sonia, Vivek, Dharini Priscilla, Khairunnisha, Grace Banu, Ambika Tandon, Rishav Thakker, Rahul Dev Korra, Aatman Vaidya, Tarunima Prabhakar

    Abstract: Online gender based violence has grown concomitantly with adoption of the internet and social media. Its effects are worse in the Global majority where many users use social media in languages other than English. The scale and volume of conversations on the internet has necessitated the need for automated detection of hate speech, and more specifically gendered abuse. There is, however, a lack of… ▽ More

    Submitted 24 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  32. arXiv:2311.04588  [pdf, other

    cs.LG cs.AI cs.CR cs.CV

    Army of Thieves: Enhancing Black-Box Model Extraction via Ensemble based sample selection

    Authors: Akshit Jindal, Vikram Goyal, Saket Anand, Chetan Arora

    Abstract: Machine Learning (ML) models become vulnerable to Model Stealing Attacks (MSA) when they are deployed as a service. In such attacks, the deployed model is queried repeatedly to build a labelled dataset. This dataset allows the attacker to train a thief model that mimics the original model. To maximize query efficiency, the attacker has to select the most informative subset of data points from the… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures, paper accepted to WACV 2024

  33. arXiv:2311.03550  [pdf, other

    cs.CV cs.AI

    United We Stand, Divided We Fall: UnityGraph for Unsupervised Procedure Learning from Videos

    Authors: Siddhant Bansal, Chetan Arora, C. V. Jawahar

    Abstract: Given multiple videos of the same task, procedure learning addresses identifying the key-steps and determining their order to perform the task. For this purpose, existing approaches use the signal generated from a pair of videos. This makes key-steps discovery challenging as the algorithms lack inter-videos perspective. Instead, we propose an unsupervised Graph-based Procedure Learning (GPL) frame… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 13 pages, 6 figures, Accepted in Winter Conference on Applications of Computer Vision (WACV), 2024

  34. arXiv:2311.00284  [pdf, other

    cs.SE cs.LG

    Model-driven Engineering for Machine Learning Components: A Systematic Literature Review

    Authors: Hira Naveed, Chetan Arora, Hourieh Khalajzadeh, John Grundy, Omar Haggag

    Abstract: Context: Machine Learning (ML) has become widely adopted as a component in many modern software applications. Due to the large volumes of data available, organizations want to increasingly leverage their data to extract meaningful insights and enhance business profitability. ML components enable predictive capabilities, anomaly detection, recommendation, accurate image and text processing, and inf… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  35. arXiv:2310.18648  [pdf, other

    cs.SE

    Generative Artificial Intelligence for Software Engineering -- A Research Agenda

    Authors: Anh Nguyen-Duc, Beatriz Cabrero-Daniel, Adam Przybylek, Chetan Arora, Dron Khanna, Tomas Herda, Usman Rafiq, Jorge Melegati, Eduardo Guerra, Kai-Kristian Kemell, Mika Saari, Zheying Zhang, Huy Le, Tho Quan, Pekka Abrahamsson

    Abstract: Generative Artificial Intelligence (GenAI) tools have become increasingly prevalent in software development, offering assistance to various managerial and technical project activities. Notable examples of these tools include OpenAIs ChatGPT, GitHub Copilot, and Amazon CodeWhisperer. Although many recent publications have explored and evaluated the application of GenAI, a comprehensive understandin… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

  36. arXiv:2310.13976  [pdf, other

    cs.SE

    Advancing Requirements Engineering through Generative AI: Assessing the Role of LLMs

    Authors: Chetan Arora, John Grundy, Mohamed Abdelrazek

    Abstract: Requirements Engineering (RE) is a critical phase in software development including the elicitation, analysis, specification, and validation of software requirements. Despite the importance of RE, it remains a challenging process due to the complexities of communication, uncertainty in the early stages and inadequate automation support. In recent years, large-language models (LLMs) have shown sign… ▽ More

    Submitted 1 November, 2023; v1 submitted 21 October, 2023; originally announced October 2023.

  37. arXiv:2309.06227  [pdf

    cs.HC cs.AI

    On the Injunction of XAIxArt

    Authors: Cheshta Arora, Debarun Sarkar

    Abstract: The position paper highlights the range of concerns that are engulfed in the injunction of explainable artificial intelligence in art (XAIxArt). Through a series of quick sub-questions, it points towards the ambiguities concerning 'explanation' and the postpositivist tradition of 'relevant explanation'. Rejecting both 'explanation' and 'relevant explanation', the paper takes a stance that XAIxArt… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  38. arXiv:2309.05261  [pdf, other

    cs.CV

    Gall Bladder Cancer Detection from US Images with Only Image Level Labels

    Authors: Soumen Basu, Ashish Papanai, Mayank Gupta, Pankaj Gupta, Chetan Arora

    Abstract: Automated detection of Gallbladder Cancer (GBC) from Ultrasound (US) images is an important problem, which has drawn increased interest from researchers. However, most of these works use difficult-to-acquire information such as bounding box annotations or additional US videos. In this paper, we focus on GBC detection using only image-level labels. Such annotation is usually available based on the… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: Accepted at MICCAI 2023

  39. arXiv:2307.00390  [pdf, other

    cs.SE

    PersonaGen: A Tool for Generating Personas from User Feedback

    Authors: Xishuo Zhang, Lin Liu, Yi Wang, Xiao Liu, Hailong Wang, Anqi Ren, Chetan Arora

    Abstract: Personas are crucial in software development processes, particularly in agile settings. However, no effective tools are available for generating personas from user feedback in agile software development processes. To fill this gap, we propose a novel tool that uses the GPT-4 model and knowledge graph to generate persona templates from well-processed user feedback, facilitating requirement analysis… ▽ More

    Submitted 6 July, 2023; v1 submitted 1 July, 2023; originally announced July 2023.

  40. UTRNet: High-Resolution Urdu Text Recognition In Printed Documents

    Authors: Abdur Rahman, Arjun Ghosh, Chetan Arora

    Abstract: In this paper, we propose a novel approach to address the challenges of printed Urdu text recognition using high-resolution, multi-scale semantic feature extraction. Our proposed UTRNet architecture, a hybrid CNN-RNN model, demonstrates state-of-the-art performance on benchmark datasets. To address the limitations of previous works, which struggle to generalize to the intricacies of the Urdu scrip… ▽ More

    Submitted 23 August, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted at The 17th International Conference on Document Analysis and Recognition (ICDAR 2023)

    Journal ref: Document Analysis and Recognition - ICDAR 2023 (2023) 305-324

  41. arXiv:2306.01492  [pdf, other

    cs.SE

    Multi-Modal Emotion Recognition for Enhanced Requirements Engineering: A Novel Approach

    Authors: Ben Cheng, Chetan Arora, Xiao Liu, Thuong Hoang, Yi Wang, John Grundy

    Abstract: Requirements engineering (RE) plays a crucial role in developing software systems by bridging the gap between stakeholders' needs and system specifications. However, effective communication and elicitation of stakeholder requirements can be challenging, as traditional RE methods often overlook emotional cues. This paper introduces a multi-modal emotion recognition platform (MEmoRE) to enhance the… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

  42. arXiv:2305.01082  [pdf, other

    cs.CL cs.IR cs.LG

    Contextual Multilingual Spellchecker for User Queries

    Authors: Sanat Sharma, Josep Valls-Vargas, Tracy Holloway King, Francois Guerin, Chirag Arora

    Abstract: Spellchecking is one of the most fundamental and widely used search features. Correcting incorrectly spelled user queries not only enhances the user experience but is expected by the user. However, most widely available spellchecking solutions are either lower accuracy than state-of-the-art solutions or too slow to be used for search use cases where latency is a key requirement. Furthermore, most… ▽ More

    Submitted 14 June, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: 5 pages, In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '23)

  43. arXiv:2304.01074  [pdf, other

    cs.RO

    FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation

    Authors: Sudarshan S Harithas, Gurkirat Singh, Aneesh Chavan, Sarthak Sharma, Suraj Patni, Chetan Arora, K. Madhava Krishna

    Abstract: We focus on the problem of LiDAR point cloud based loop detection (or Finding) and closure (LDC) in a multi-agent setting. State-of-the-art (SOTA) techniques directly generate learned embeddings of a given point cloud, require large data transfers, and are not robust to wide variations in 6 Degrees-of-Freedom (DOF) viewpoint. Moreover, absence of strong priors in an unstructured point cloud leads… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  44. arXiv:2303.10439  [pdf, other

    cs.SE cs.CL

    Stop Words for Processing Software Engineering Documents: Do they Matter?

    Authors: Yaohou Fan, Chetan Arora, Christoph Treude

    Abstract: Stop words, which are considered non-predictive, are often eliminated in natural language processing tasks. However, the definition of uninformative vocabulary is vague, so most algorithms use general knowledge-based stop lists to remove stop words. There is an ongoing debate among academics about the usefulness of stop word elimination, especially in domain-specific settings. In this work, we inv… ▽ More

    Submitted 12 June, 2023; v1 submitted 18 March, 2023; originally announced March 2023.

    Comments: Accepted for publication at the 2nd Intl. Workshop on NL-based Software Engineering (NLBSE 2023)

  45. arXiv:2303.02920  [pdf, other

    cs.SE cs.AI

    Requirements Engineering Framework for Human-centered Artificial Intelligence Software Systems

    Authors: Khlood Ahmad, Mohamed Abdelrazek, Chetan Arora, Arbind Agrahari Baniya, Muneera Bano, John Grundy

    Abstract: [Context] Artificial intelligence (AI) components used in building software solutions have substantially increased in recent years. However, many of these solutions focus on technical aspects and ignore critical human-centered aspects. [Objective] Including human-centered aspects during requirements engineering (RE) when building AI-based software can help achieve more responsible, unbiased, and i… ▽ More

    Submitted 18 May, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  46. arXiv:2302.06034  [pdf, other

    cs.SE

    Requirements Elicitation and Modelling of Artificial Intelligence Systems: An Empirical Study

    Authors: Khlood Ahmad, Mohamed Abdelrazek, Chetan Arora, John Grundy, Muneera Bano

    Abstract: Artificial Intelligence (AI) systems have gained significant traction in the recent past, creating new challenges in requirements engineering (RE) when building AI software systems. RE for AI practices have not been studied much and have scarce empirical studies. Additionally, many AI software solutions tend to focus on the technical aspects and ignore human-centered values. In this paper, we repo… ▽ More

    Submitted 12 February, 2023; originally announced February 2023.

  47. arXiv:2302.05618  [pdf, other

    cs.SE

    Persona-based Assessment of Software Engineering Student Research Projects: An Experience Report

    Authors: Chetan Arora, Laura Tubino, Andrew Cain, Kevin Lee, Vasudha Malhotra

    Abstract: Students enrolled in software engineering degrees are generally required to undertake a research project in their final year through which they demonstrate the ability to conduct research, communicate outcomes, and build in-depth expertise in an area. Assessment in these projects typically involves evaluating the product of their research via a thesis or a similar artifact. However, this misses a… ▽ More

    Submitted 11 February, 2023; originally announced February 2023.

  48. arXiv:2302.05617  [pdf, other

    cs.SE

    Towards Human-Centred Crowd Computing: Software for Better Use of Computational Resources

    Authors: Niroshinie Fernando, Chetan Arora, Seng W. Loke, Lubna Alam, Stephen La Macchia, Helen Graesser

    Abstract: Internet-connected smart devices are increasing at an exponential rate. These powerful devices have created a yet-untapped pool of idle resources that can be utilised, among others, for processing data in resource-depleted environments. The idea of bringing together a pool of smart devices for ``crowd computing'' (CC) has been studied in the recent past from an infrastructural feasibility perspect… ▽ More

    Submitted 11 February, 2023; originally announced February 2023.

  49. arXiv:2302.04793  [pdf, other

    cs.SE

    AI-based Question Answering Assistance for Analyzing Natural-language Requirements

    Authors: Saad Ezzini, Sallam Abualhaija, Chetan Arora, Mehrdad Sabetzadeh

    Abstract: By virtue of being prevalently written in natural language (NL), requirements are prone to various defects, e.g., inconsistency and incompleteness. As such, requirements are frequently subject to quality assurance processes. These processes, when carried out entirely manually, are tedious and may further overlook important quality issues due to time and budget pressures. In this paper, we propose… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: This paper has been accepted at the 45th International Conference on Software Engineering (ICSE 2023)

  50. arXiv:2301.10404  [pdf, other

    cs.SE

    Requirements Practices and Gaps When Engineering Human-Centered Artificial Intelligence Systems

    Authors: Khlood Ahmad, Mohamed Abdelrazek, Chetan Arora, Muneera Bano, John Grundy

    Abstract: [Context] Engineering Artificial Intelligence (AI) software is a relatively new area with many challenges, unknowns, and limited proven best practices. Big companies such as Google, Microsoft, and Apple have provided a suite of recent guidelines to assist engineering teams in building human-centered AI systems. [Objective] The practices currently adopted by practitioners for developing such system… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.