Skip to main content

Showing 1–12 of 12 results for author: Thudumu, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.04571  [pdf, ps, other

    cs.AI

    OpenAg: Democratizing Agricultural Intelligence

    Authors: Srikanth Thudumu, Jason Fisher

    Abstract: Agriculture is undergoing a major transformation driven by artificial intelligence (AI), machine learning, and knowledge representation technologies. However, current agricultural intelligence systems often lack contextual understanding, explainability, and adaptability, especially for smallholder farmers with limited resources. General-purpose large language models (LLMs), while powerful, typical… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 10 pages, 1 figure

  2. arXiv:2505.24765  [pdf, ps, other

    quant-ph cs.AI

    Supervised Quantum Machine Learning: A Future Outlook from Qubits to Enterprise Applications

    Authors: Srikanth Thudumu, Jason Fisher, Hung Du

    Abstract: Supervised Quantum Machine Learning (QML) represents an intersection of quantum computing and classical machine learning, aiming to use quantum resources to support model training and inference. This paper reviews recent developments in supervised QML, focusing on methods such as variational quantum circuits, quantum neural networks, and quantum kernel methods, along with hybrid quantum-classical… ▽ More

    Submitted 17 June, 2025; v1 submitted 30 May, 2025; originally announced May 2025.

    Comments: Future outlook and roadmap of QML with 7 pages and 1 figure

  3. arXiv:2501.17361  [pdf, other

    cs.LG cs.AI

    The M-factor: A Novel Metric for Evaluating Neural Architecture Search in Resource-Constrained Environments

    Authors: Srikanth Thudumu, Hy Nguyen, Hung Du, Nhat Duong, Zafaryab Rasool, Rena Logothetis, Scott Barnett, Rajesh Vasa, Kon Mouzakis

    Abstract: Neural Architecture Search (NAS) aims to automate the design of deep neural networks. However, existing NAS techniques often focus on maximising accuracy, neglecting model efficiency. This limitation restricts their use in resource-constrained environments like mobile devices and edge computing systems. Moreover, current evaluation metrics prioritise performance over efficiency, lacking a balanced… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

  4. arXiv:2501.16753  [pdf, other

    cs.CV cs.AI

    Overcoming Semantic Dilution in Transformer-Based Next Frame Prediction

    Authors: Hy Nguyen, Srikanth Thudumu, Hung Du, Rajesh Vasa, Kon Mouzakis

    Abstract: Next-frame prediction in videos is crucial for applications such as autonomous driving, object tracking, and motion prediction. The primary challenge in next-frame prediction lies in effectively capturing and processing both spatial and temporal information from previous video sequences. The transformer architecture, known for its prowess in handling sequence data, has made remarkable progress in… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

  5. arXiv:2501.15695  [pdf, other

    cs.MA cs.AI

    Contextual Knowledge Sharing in Multi-Agent Reinforcement Learning with Decentralized Communication and Coordination

    Authors: Hung Du, Srikanth Thudumu, Hy Nguyen, Rajesh Vasa, Kon Mouzakis

    Abstract: Decentralized Multi-Agent Reinforcement Learning (Dec-MARL) has emerged as a pivotal approach for addressing complex tasks in dynamic environments. Existing Multi-Agent Reinforcement Learning (MARL) methodologies typically assume a shared objective among agents and rely on centralized control. However, many real-world scenarios feature agents with individual goals and limited observability of othe… ▽ More

    Submitted 26 January, 2025; originally announced January 2025.

  6. arXiv:2501.14000  [pdf, other

    cs.LG cs.AI

    Local Control Networks (LCNs): Optimizing Flexibility in Neural Network Data Pattern Capture

    Authors: Hy Nguyen, Duy Khoa Pham, Srikanth Thudumu, Hung Du, Rajesh Vasa, Kon Mouzakis

    Abstract: The widespread use of Multi-layer perceptrons (MLPs) often relies on a fixed activation function (e.g., ReLU, Sigmoid, Tanh) for all nodes within the hidden layers. While effective in many scenarios, this uniformity may limit the networks ability to capture complex data patterns. We argue that employing the same activation function at every node is suboptimal and propose leveraging different activ… ▽ More

    Submitted 25 April, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

  7. arXiv:2501.13994  [pdf, other

    cs.CV cs.AI cs.RO

    CSAOT: Cooperative Multi-Agent System for Active Object Tracking

    Authors: Hy Nguyen, Bao Pham, Hung Du, Srikanth Thudumu, Rajesh Vasa, Kon Mouzakis

    Abstract: Object Tracking is essential for many computer vision applications, such as autonomous navigation, surveillance, and robotics. Unlike Passive Object Tracking (POT), which relies on static camera viewpoints to detect and track objects across consecutive frames, Active Object Tracking (AOT) requires a controller agent to actively adjust its viewpoint to maintain visual contact with a moving target i… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

  8. arXiv:2501.13992  [pdf, other

    cs.LG cs.AI

    Dual-Branch HNSW Approach with Skip Bridges and LID-Driven Optimization

    Authors: Hy Nguyen, Nguyen Hung Nguyen, Nguyen Linh Bao Nguyen, Srikanth Thudumu, Hung Du, Rajesh Vasa, Kon Mouzakis

    Abstract: The Hierarchical Navigable Small World (HNSW) algorithm is widely used for approximate nearest neighbor (ANN) search, leveraging the principles of navigable small-world graphs. However, it faces some limitations. The first is the local optima problem, which arises from the algorithm's greedy search strategy, selecting neighbors based solely on proximity at each step. This often leads to cluster di… ▽ More

    Submitted 25 April, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

  9. arXiv:2406.06835  [pdf, other

    cs.SE

    Large language models for generating rules, yay or nay?

    Authors: Shangeetha Sivasothy, Scott Barnett, Rena Logothetis, Mohamed Abdelrazek, Zafaryab Rasool, Srikanth Thudumu, Zac Brannelly

    Abstract: Engineering safety-critical systems such as medical devices and digital health intervention systems is complex, where long-term engagement with subject-matter experts (SMEs) is needed to capture the systems' expected behaviour. In this paper, we present a novel approach that leverages Large Language Models (LLMs), such as GPT-3.5 and GPT-4, as a potential world model to accelerate the engineering… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages, 1 figure

  10. arXiv:2402.01968  [pdf, other

    cs.MA cs.AI cs.LG

    A Survey on Context-Aware Multi-Agent Systems: Techniques, Challenges and Future Directions

    Authors: Hung Du, Srikanth Thudumu, Rajesh Vasa, Kon Mouzakis

    Abstract: Research interest in autonomous agents is on the rise as an emerging topic. The notable achievements of Large Language Models (LLMs) have demonstrated the considerable potential to attain human-like intelligence in autonomous agents. However, the challenge lies in enabling these agents to learn, reason, and navigate uncertainties in dynamic environments. Context awareness emerges as a pivotal elem… ▽ More

    Submitted 29 January, 2025; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 11 pages, 1 figure

  11. arXiv:2401.08138  [pdf, other

    cs.SE cs.AI

    LLMs for Test Input Generation for Semantic Caches

    Authors: Zafaryab Rasool, Scott Barnett, David Willie, Stefanus Kurniawan, Sherwin Balugo, Srikanth Thudumu, Mohamed Abdelrazek

    Abstract: Large language models (LLMs) enable state-of-the-art semantic capabilities to be added to software systems such as semantic search of unstructured documents and text generation. However, these models are computationally expensive. At scale, the cost of serving thousands of users increases massively affecting also user experience. To address this problem, semantic caches are used to check for answe… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted in International Conference on AI Engineering Software Engineering (CAIN 2024)

  12. arXiv:2401.05856  [pdf, other

    cs.SE cs.AI

    Seven Failure Points When Engineering a Retrieval Augmented Generation System

    Authors: Scott Barnett, Stefanus Kurniawan, Srikanth Thudumu, Zach Brannelly, Mohamed Abdelrazek

    Abstract: Software engineers are increasingly adding semantic search capabilities to applications using a strategy known as Retrieval Augmented Generation (RAG). A RAG system involves finding documents that semantically match a query and then passing the documents to a large language model (LLM) such as ChatGPT to extract the right answer using an LLM. RAG systems aim to: a) reduce the problem of hallucinat… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.