Showing 1–50 of 112 results for author: Majumdar, S

Searching in archive cs.
  1. arXiv:2507.05538  [pdf, ps, other]

    cs.AI cs.CR cs.CY

    Red Teaming AI Red Teaming

    Authors: Subhabrata Majumdar, Brian Pendleton, Abhishek Gupta

    Abstract: Red teaming has evolved from its origins in military applications to become a widely adopted methodology in cybersecurity and AI. In this paper, we take a critical look at the practice of AI red teaming. We argue that despite its current popularity in AI governance, there exists a significant gap between red teaming's original intent as a critical thinking exercise and its narrow focus on discover…

    Submitted 7 July, 2025; originally announced July 2025.

  2. arXiv:2506.20323  [pdf]

    cs.LG cs.AI

    Comparative Analysis of Deep Learning Models for Crop Disease Detection: A Transfer Learning Approach

    Authors: Saundarya Subramaniam, Shalini Majumdar, Shantanu Nadar, Kaustubh Kulkarni

    Abstract: This research presents the development of an Artificial Intelligence (AI)-driven crop disease detection system designed to assist farmers in rural areas with limited resources. We aim to compare different deep learning models for a comparative analysis, focusing on their efficacy in transfer learning. By leveraging deep learning models, including EfficientNet, ResNet101, MobileNetV2, and our cus…

    Submitted 25 June, 2025; originally announced June 2025.

  3. arXiv:2506.16380  [pdf, ps, other]

    cs.LG

    Classification of Cattle Behavior and Detection of Heat (Estrus) using Sensor Data

    Authors: Druva Dhakshinamoorthy, Avikshit Jha, Sabyasachi Majumdar, Devdulal Ghosh, Ranjita Chakraborty, Hena Ray

    Abstract: This paper presents a novel system for monitoring cattle behavior and detecting estrus (heat) periods using sensor data and machine learning. We designed and deployed a low-cost Bluetooth-based neck collar equipped with accelerometer and gyroscope sensors to capture real-time behavioral data from real cows, which was synced to the cloud. A labeled dataset was created using synchronized CCTV footag…

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: 6 pages, 5 figures. Druva Dhakshinamoorthy and Avikshit Jha contributed equally as co-first authors. Work conducted during a summer internship at CDAC Kolkata by students of BITS Pilani

    ACM Class: I.5.1; I.5.4; I.2.10; I.2.6; C.3; J.2; H.4.2

  4. arXiv:2506.02706  [pdf, other]

    cs.SI

    Collective Intelligence Outperforms Individual Talent: A Case Study in League of Legends

    Authors: Angelo Josey Caldeira, Sajan Maharjan, Srijoni Majumdar, Evangelos Pournaras

    Abstract: Gaming environments are popular testbeds for studying human interactions and behaviors in complex artificial intelligence systems. Particularly, in multiplayer online battle arena (MOBA) games, individuals collaborate in virtual environments of high realism that involves real-time strategic decision-making and trade-offs on resource management, information collection and sharing, team synergy and…

    Submitted 3 June, 2025; originally announced June 2025.

  5. arXiv:2505.18789  [pdf, ps, other]

    cs.SE cs.CL

    From Output to Evaluation: Does Raw Instruction-Tuned Code LLMs Output Suffice for Fill-in-the-Middle Code Generation?

    Authors: Wasi Uddin Ahmad, Somshubra Majumdar, Boris Ginsburg

    Abstract: Post-processing is crucial for the automatic evaluation of LLMs in fill-in-the-middle (FIM) code generation due to the frequent presence of extraneous code in raw outputs. This extraneous generation suggests a lack of awareness regarding output boundaries, requiring truncation for effective evaluation. The determination of an optimal truncation strategy, however, often proves intricate, particular…

    Submitted 9 June, 2025; v1 submitted 24 May, 2025; originally announced May 2025.

    Comments: Work in progress

  6. arXiv:2505.14349  [pdf, ps, other]

    cs.CY cs.AI cs.ET cs.HC cs.MA

    Upgrading Democracies with Fairer Voting Methods

    Authors: Evangelos Pournaras, Srijoni Majumdar, Thomas Wellings, Joshua C. Yang, Fatemeh B. Heravan, Regula Hänggli Fricker, Dirk Helbing

    Abstract: Voting methods are an instrumental design element of democracies. Citizens use them to express and aggregate their preferences to reach a collective decision. However, voting outcomes can be as sensitive to voting rules as they are to people's voting choices. Despite the significance and inter-disciplinary scientific progress on voting methods, several democracies keep relying on outdated voting meth…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Includes Supplementary Information

  7. arXiv:2505.00949  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Llama-Nemotron: Efficient Reasoning Models

    Authors: Akhiad Bercovich, Itay Levy, Izik Golan, Mohammad Dabbah, Ran El-Yaniv, Omri Puny, Ido Galil, Zach Moshe, Tomer Ronen, Najeeb Nabwani, Ido Shahaf, Oren Tropp, Ehud Karpas, Ran Zilberstein, Jiaqi Zeng, Soumye Singhal, Alexander Bukharin, Yian Zhang, Tugrul Konuk, Gerald Shen, Ameya Sunil Mahabaleshwarkar, Bilal Kartal, Yoshi Suhara, Olivier Delalleau, Zijia Chen, et al. (111 additional authors not shown)

    Abstract: We introduce the Llama-Nemotron series of models, an open family of heterogeneous reasoning models that deliver exceptional reasoning capabilities, inference efficiency, and an open license for enterprise use. The family comes in three sizes -- Nano (8B), Super (49B), and Ultra (253B) -- and performs competitively with state-of-the-art reasoning models such as DeepSeek-R1 while offering superior i…

    Submitted 30 June, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

  8. arXiv:2505.00268  [pdf, ps, other]

    cs.CL cs.AI

    Consistency in Language Models: Current Landscape, Challenges, and Future Directions

    Authors: Jekaterina Novikova, Carol Anderson, Borhane Blili-Hamelin, Subhabrata Majumdar

    Abstract: The hallmark of effective language use lies in consistency -- expressing similar meanings in similar contexts and avoiding contradictions. While human communication naturally demonstrates this principle, state-of-the-art language models struggle to maintain reliable consistency across different scenarios. This paper examines the landscape of consistency research in AI language systems, exploring b…

    Submitted 30 April, 2025; originally announced May 2025.

  9. arXiv:2504.08719  [pdf, other]

    cs.CL

    SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling

    Authors: Krishna C. Puvvada, Faisal Ladhak, Santiago Akle Serrano, Cheng-Ping Hsieh, Shantanu Acharya, Somshubra Majumdar, Fei Jia, Samuel Kriman, Simeng Sun, Dima Rekesh, Boris Ginsburg

    Abstract: We present a decoder-only Transformer architecture that robustly generalizes to sequence lengths substantially longer than those seen during training. Our model, SWAN-GPT, interleaves layers without positional encodings (NoPE) and sliding-window attention layers equipped with rotary positional encodings (SWA-RoPE). Experiments demonstrate strong performance on sequence lengths significantly longer…

    Submitted 11 April, 2025; originally announced April 2025.

  10. arXiv:2504.04030  [pdf, other]

    cs.SE cs.CL

    OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs

    Authors: Wasi Uddin Ahmad, Aleksander Ficek, Mehrzad Samadi, Jocelyn Huang, Vahid Noroozi, Somshubra Majumdar, Boris Ginsburg

    Abstract: Large Language Models (LLMs) have transformed software development by enabling code generation, automated debugging, and complex reasoning. However, their continued advancement is constrained by the scarcity of high-quality, publicly available supervised fine-tuning (SFT) datasets tailored for coding tasks. To bridge this gap, we introduce OpenCodeInstruct, the largest open-access instruction tuni…

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: Work in progress

  11. arXiv:2504.03624  [pdf, other]

    cs.CL cs.AI cs.LG

    Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

    Authors: NVIDIA: Aaron Blakeman, Aarti Basant, Abhinav Khattar, Adithya Renduchintala, Akhiad Bercovich, Aleksander Ficek, Alexis Bjorlin, Ali Taghibakhshi, Amala Sanjay Deshmukh, Ameya Sunil Mahabaleshwarkar, Andrew Tao, Anna Shors, Ashwath Aithal, Ashwin Poojary, Ayush Dattagupta, Balaram Buddharaju, Bobby Chen, Boris Ginsburg, Boxin Wang, Brandon Norick, Brian Butterfield, Bryan Catanzaro, Carlo del Mundo, et al. (176 additional authors not shown)

    Abstract: As inference-time scaling becomes critical for enhanced reasoning capabilities, it is increasingly becoming important to build models that are efficient to infer. We introduce Nemotron-H, a family of 8B and 56B/47B hybrid Mamba-Transformer models designed to reduce inference cost for a given accuracy level. To achieve this goal, we replace the majority of self-attention layers in the common Transf…

    Submitted 15 April, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

  12. arXiv:2504.01943  [pdf, other]

    cs.CL

    OpenCodeReasoning: Advancing Data Distillation for Competitive Coding

    Authors: Wasi Uddin Ahmad, Sean Narenthiran, Somshubra Majumdar, Aleksander Ficek, Siddhartha Jain, Jocelyn Huang, Vahid Noroozi, Boris Ginsburg

    Abstract: Since the advent of reasoning-based large language models, many have found great success from distilling reasoning capabilities into student models. Such techniques have significantly bridged the gap between reasoning and standard LLMs on coding tasks. Despite this, much of the progress on distilling reasoning models remains locked behind proprietary datasets or lacks details on data curation, fil…

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: Work in progress

  13. arXiv:2502.15924  [pdf, other]

    cs.CL

    Improving Consistency in Large Language Models through Chain of Guidance

    Authors: Harsh Raj, Vipul Gupta, Domenic Rosati, Subhabrata Majumdar

    Abstract: Consistency is a fundamental dimension of trustworthiness in Large Language Models (LLMs). For humans to be able to trust LLM-based applications, their outputs should be consistent when prompted with inputs that carry the same meaning or intent. Despite this need, there is no known mechanism to control and guide LLMs to be more consistent at inference time. In this paper, we introduce a novel alig…

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: Accepted at Transactions on Machine Learning Research (TMLR) 2025

    ACM Class: I.2.6; I.5.1

  14. arXiv:2502.13820  [pdf, other]

    cs.AI cs.CL cs.LG cs.SE

    Scoring Verifiers: Evaluating Synthetic Verification for Code and Reasoning

    Authors: Aleksander Ficek, Somshubra Majumdar, Vahid Noroozi, Boris Ginsburg

    Abstract: Synthetic verification techniques such as generating test cases and reward modelling are common ways to enhance the coding capabilities of large language models (LLM) beyond predefined tests. Additionally, code verification has recently found great success as a critical component in improving reasoning capability of LLMs via reinforcement learning. In this paper, we propose an approach which can…

    Submitted 1 April, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

  15. arXiv:2501.15396  [pdf]

    q-bio.QM cs.CV cs.LG eess.IV stat.AP

    Foundations of a Knee Joint Digital Twin from qMRI Biomarkers for Osteoarthritis and Knee Replacement

    Authors: Gabrielle Hoyer, Kenneth T Gao, Felix G Gassert, Johanna Luitjens, Fei Jiang, Sharmila Majumdar, Valentina Pedoia

    Abstract: This study forms the basis of a digital twin system of the knee joint, using advanced quantitative MRI (qMRI) and machine learning to advance precision health in osteoarthritis (OA) management and knee replacement (KR) prediction. We combined deep learning-based segmentation of knee joint structures with dimensionality reduction to create an embedded feature space of imaging biomarkers. Through cr…

    Submitted 25 January, 2025; originally announced January 2025.

    Comments: This manuscript builds on an earlier preprint version available on Research Square: https://doi.org/10.21203/rs.3.rs-4317958/v1

    Journal ref: npj Digit. Med. 8, 118 (2025)

  16. arXiv:2501.13376  [pdf]

    eess.IV cs.CV

    Scalable Evaluation Framework for Foundation Models in Musculoskeletal MRI Bridging Computational Innovation with Clinical Utility

    Authors: Gabrielle Hoyer, Michelle W Tong, Rupsa Bhattacharjee, Valentina Pedoia, Sharmila Majumdar

    Abstract: Foundation models hold transformative potential for medical imaging, but their clinical utility requires rigorous evaluation to address their strengths and limitations. This study introduces an evaluation framework for assessing the clinical impact and translatability of SAM, MedSAM, and SAM2, using musculoskeletal MRI as a case study. We tested these models across zero-shot and finetuned paradigm…

    Submitted 22 January, 2025; originally announced January 2025.

  17. arXiv:2412.20875  [pdf, other]

    cs.CV

    Attention Is All You Need For Mixture-of-Depths Routing

    Authors: Advait Gadhikar, Souptik Kumar Majumdar, Niclas Popp, Piyapat Saranrittichai, Martin Rapp, Lukas Schott

    Abstract: Advancements in deep learning are driven by training models with increasingly larger numbers of parameters, which in turn heightens the computational demands. To address this issue, Mixture-of-Depths (MoD) models have been proposed to dynamically assign computations only to the most relevant parts of the inputs, thereby enabling the deployment of large-parameter models with high efficiency during…

    Submitted 30 December, 2024; originally announced December 2024.

    Comments: 22 pages, 19 figures

  18. arXiv:2410.22284  [pdf, other]

    cs.CR cs.LG

    Embedding-based classifiers can detect prompt injection attacks

    Authors: Md. Ahsan Ayub, Subhabrata Majumdar

    Abstract: Large Language Models (LLMs) are seeing significant adoption in every type of organization due to their exceptional generative capabilities. However, LLMs are found to be vulnerable to various adversarial attacks, particularly prompt injection attacks, which trick them into producing harmful or inappropriate content. Adversaries execute such attacks by crafting malicious prompts to deceive the LLM…

    Submitted 29 October, 2024; originally announced October 2024.

  19. arXiv:2410.16700  [pdf]

    cs.AI cs.CY q-bio.GN

    AskBeacon -- Performing genomic data exchange and analytics with natural language

    Authors: Anuradha Wickramarachchi, Shakila Tonni, Sonali Majumdar, Sarvnaz Karimi, Sulev Kõks, Brendan Hosking, Jordi Rambla, Natalie A. Twine, Yatish Jain, Denis C. Bauer

    Abstract: Enabling clinicians and researchers to directly interact with global genomic data resources by removing technological barriers is vital for medical genomics. AskBeacon enables Large Language Models to be applied to securely shared cohorts via the GA4GH Beacon protocol. By simply "asking" Beacon, actionable insights can be gained, analyzed and made publication-ready.

    Submitted 22 October, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

  20. arXiv:2409.12914  [pdf, other]

    cs.LG cs.CL

    Evaluating Defences against Unsafe Feedback in RLHF

    Authors: Domenic Rosati, Giles Edkins, Harsh Raj, David Atanasov, Subhabrata Majumdar, Janarthanan Rajendran, Frank Rudzicz, Hassan Sajjad

    Abstract: While there has been progress towards aligning Large Language Models (LLMs) with human values and ensuring safe behaviour at inference time, safety guards can easily be removed when fine tuned on unsafe and harmful datasets. While this setting has been treated extensively, another popular training paradigm, learning from unsafe feedback with reinforcement learning, has previously been unexplored.…

    Submitted 25 February, 2025; v1 submitted 19 September, 2024; originally announced September 2024.

  21. arXiv:2409.01438  [pdf, other]

    eess.AS cs.SD

    Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR

    Authors: Weiqing Wang, Kunal Dhawan, Taejin Park, Krishna C. Puvvada, Ivan Medennikov, Somshubra Majumdar, He Huang, Jagadeesh Balam, Boris Ginsburg

    Abstract: Speech foundation models have achieved state-of-the-art (SoTA) performance across various tasks, such as automatic speech recognition (ASR) in hundreds of languages. However, multi-speaker ASR remains a challenging task for these models due to data scarcity and sparsity. In this paper, we present approaches to enable speech foundation models to process and understand multi-speaker speech with limi…

    Submitted 2 December, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

    Comments: Accepted by SLT 2024

  22. arXiv:2407.21077  [pdf, other]

    cs.CL cs.LG cs.NE

    Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models

    Authors: Somshubra Majumdar, Vahid Noroozi, Mehrzad Samadi, Sean Narenthiran, Aleksander Ficek, Wasi Uddin Ahmad, Jocelyn Huang, Jagadeesh Balam, Boris Ginsburg

    Abstract: Large Language Models (LLMs) require high quality instruction data for effective alignment, particularly in code generation tasks where expert curated datasets are expensive to produce. We present Genetic-Instruct, a scalable algorithm for synthesizing large-scale, high quality coding instructions using evolutionary principles. Starting from a small set of seed instructions, Genetic-Instruct gener…

    Submitted 22 May, 2025; v1 submitted 29 July, 2024; originally announced July 2024.

    Comments: Accepted to be presented in ACL 2025

  23. arXiv:2407.12184  [pdf]

    eess.IV cs.CV

    The object detection method aids in image reconstruction evaluation and clinical interpretation of meniscal abnormalities

    Authors: Natalia Konovalova, Aniket Tolpadi, Felix Liu, Zehra Akkaya, Felix Gassert, Paula Giesler, Johanna Luitjens, Misung Han, Emma Bahroos, Sharmila Majumdar, Valentina Pedoia

    Abstract: This study investigates the relationship between deep learning (DL) image reconstruction quality and anomaly detection performance, and evaluates the efficacy of an artificial intelligence (AI) assistant in enhancing radiologists' interpretation of meniscal anomalies on reconstructed images. A retrospective study was conducted using an in-house reconstruction and anomaly detection pipeline to asse…

    Submitted 16 July, 2024; originally announced July 2024.

  24. arXiv:2406.19674  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Less is More: Accurate Speech Recognition & Translation without Web-Scale Data

    Authors: Krishna C. Puvvada, Piotr Żelasko, He Huang, Oleksii Hrinchuk, Nithin Rao Koluguri, Kunal Dhawan, Somshubra Majumdar, Elena Rastorgueva, Zhehuai Chen, Vitaly Lavrukhin, Jagadeesh Balam, Boris Ginsburg

    Abstract: Recent advances in speech recognition and translation rely on hundreds of thousands of hours of Internet speech data. We argue that state-of-the-art accuracy can be reached without relying on web-scale data. Canary - multilingual ASR and speech translation model, outperforms current state-of-the-art models - Whisper, OWSM, and Seamless-M4T on English, French, Spanish, and German languages, while b…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech-2024

  25. arXiv:2406.12946  [pdf]

    eess.AS cs.AI cs.CL cs.LG

    Instruction Data Generation and Unsupervised Adaptation for Speech Language Models

    Authors: Vahid Noroozi, Zhehuai Chen, Somshubra Majumdar, Steve Huang, Jagadeesh Balam, Boris Ginsburg

    Abstract: In this paper, we propose three methods for generating synthetic samples to train and evaluate multimodal large language models capable of processing both text and speech inputs. Addressing the scarcity of samples containing both modalities, synthetic data generation emerges as a crucial strategy to enhance the performance of such systems and facilitate the modeling of cross-modal relationships be…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted for Interspeech 2024

  26. arXiv:2406.11871  [pdf, other]

    cs.AI

    Generative AI Voting: Fair Collective Choice is Resilient to LLM Biases and Inconsistencies

    Authors: Srijoni Majumdar, Edith Elkind, Evangelos Pournaras

    Abstract: Scaling up deliberative and voting participation is a longstanding endeavor -- a cornerstone for direct democracy and legitimate collective choice. Recent breakthroughs in generative artificial intelligence (AI) and large language models (LLMs) unravel new capabilities for AI personal assistants to overcome cognitive bandwidth limitations of humans, providing decision support or even direct repres…

    Submitted 8 April, 2025; v1 submitted 30 May, 2024; originally announced June 2024.

    Comments: 23 pages, 5 figures

  27. arXiv:2406.11704  [pdf, other]

    cs.CL cs.AI cs.LG

    Nemotron-4 340B Technical Report

    Authors: Nvidia: Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek, et al. (58 additional authors not shown)

    Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation be…

    Submitted 6 August, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  28. arXiv:2406.11036  [pdf, other]

    cs.CL cs.CR

    garak: A Framework for Security Probing Large Language Models

    Authors: Leon Derczynski, Erick Galinkin, Jeffrey Martin, Subho Majumdar, Nanna Inie

    Abstract: As Large Language Models (LLMs) are deployed and integrated into thousands of applications, the need for scalable evaluation of how models respond to adversarial attacks grows rapidly. However, LLM security is a moving target: models produce unpredictable output, are constantly updated, and the potential adversary is highly diverse: anyone with access to the internet and a decent command of natura…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: https://garak.ai

  29. arXiv:2405.14577  [pdf, other]

    cs.CL cs.LG

    Representation Noising: A Defence Mechanism Against Harmful Finetuning

    Authors: Domenic Rosati, Jan Wehner, Kai Williams, Łukasz Bartoszcze, David Atanasov, Robie Gonzales, Subhabrata Majumdar, Carsten Maple, Hassan Sajjad, Frank Rudzicz

    Abstract: Releasing open-source large language models (LLMs) presents a dual-use risk since bad actors can easily fine-tune these models for harmful purposes. Even without the open release of weights, weight stealing and fine-tuning APIs make closed models vulnerable to harmful fine-tuning attacks (HFAs). While safety measures like preventing jailbreaks and improving safety guardrails are important, such me…

    Submitted 30 October, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Published in NeurIPS 2024

  30. arXiv:2405.05495  [pdf, other]

    cs.OH

    PARSAC: Fast, Human-quality Floorplanning for Modern SoCs with Complex Design Constraints

    Authors: Hesham Mostafa, Uday Mallappa, Mikhail Galkin, Mariano Phielipp, Somdeb Majumdar

    Abstract: The floorplanning of Systems-on-a-Chip (SoCs) and of chip sub-systems is a crucial step in the physical design flow as it determines the optimal shapes and locations of the blocks that make up the system. Simulated Annealing (SA) has been the method of choice for tackling classical floorplanning problems where the objective is to minimize wire-length and the total placement area. The goal in indus…

    Submitted 1 August, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 9 pages, 7 figures

  31. arXiv:2405.05480  [pdf, other]

    cs.AR cs.AI cs.LG

    FloorSet -- a VLSI Floorplanning Dataset with Design Constraints of Real-World SoCs

    Authors: Uday Mallappa, Hesham Mostafa, Mikhail Galkin, Mariano Phielipp, Somdeb Majumdar

    Abstract: Floorplanning for systems-on-a-chip (SoCs) and its sub-systems is a crucial and non-trivial step of the physical design flow. It represents a difficult combinatorial optimization problem. A typical large scale SoC with 120 partitions generates a search-space of nearly 10E250. As novel machine learning (ML) approaches emerge to tackle such problems, there is a growing need for a modern benchmark th…

    Submitted 1 August, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 10 pages, 11 figures

  32. arXiv:2405.05085  [pdf, other]

    cs.MA

    Fair Voting Outcomes with Impact and Novelty Compromises? Unraveling Biases in Electing Participatory Budgeting Winners

    Authors: Sajan Maharjan, Srijoni Majumdar, Evangelos Pournaras

    Abstract: Participatory budgeting, as a paradigm for democratic innovations, engages citizens in the distribution of a public budget to projects, which they propose and vote for implementation. So far, voting algorithms have been proposed and studied in social choice literature to elect projects that are popular, while others prioritize on a proportional representation of voters' preferences, for instance,…

    Submitted 29 October, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 41 pages, 19 figures

  33. WaveCatBoost for Probabilistic Forecasting of Regional Air Quality Data

    Authors: Jintu Borah, Tanujit Chakraborty, Md. Shahrul Md. Nadzir, Mylene G. Cayetano, Shubhankar Majumdar

    Abstract: Accurate and reliable air quality forecasting is essential for protecting public health, sustainable development, pollution control, and enhanced urban planning. This letter presents a novel WaveCatBoost architecture designed to forecast the real-time concentrations of air pollutants by combining the maximal overlapping discrete wavelet transform (MODWT) with the CatBoost model. This hybrid approa…

    Submitted 8 April, 2024; originally announced April 2024.

    Journal ref: IEEE Sensors Letters, 2024, Volume: 9, Issue: 1

  34. arXiv:2401.05947  [pdf, other]

    cs.CR

    Send Message to the Future? Blockchain-based Time Machines for Decentralized Reveal of Locked Information

    Authors: Zhuolun Li, Srijoni Majumdar, Evangelos Pournaras

    Abstract: Conditional information reveal systems automate the release of information upon meeting specific predefined conditions, such as time or location. This paper introduces a breakthrough in the understanding, design, and application of conditional information reveal systems that are highly secure and decentralized. By designing a new practical timed-release cryptography system and a secret sharing sch…

    Submitted 25 January, 2025; v1 submitted 11 January, 2024; originally announced January 2024.

  35. arXiv:2312.17279  [pdf, other]

    cs.CL eess.AS

    Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition

    Authors: Vahid Noroozi, Somshubra Majumdar, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg

    Abstract: In this paper, we propose an efficient and accurate streaming speech recognition model based on the FastConformer architecture. We adapted the FastConformer architecture for streaming applications through: (1) constraining both the look-ahead and past contexts in the encoder, and (2) introducing an activation caching mechanism to enable the non-autoregressive encoder to operate autoregressively du…

    Submitted 2 May, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

    Comments: Shorter version accepted to ICASSP 2024

  36. arXiv:2311.03374  [pdf, other]

    cs.SE cs.AI cs.IR

    Generative AI for Software Metadata: Overview of the Information Retrieval in Software Engineering Track at FIRE 2023

    Authors: Srijoni Majumdar, Soumen Paul, Debjyoti Paul, Ayan Bandyopadhyay, Samiran Chattopadhyay, Partha Pratim Das, Paul D Clough, Prasenjit Majumder

    Abstract: The Information Retrieval in Software Engineering (IRSE) track aims to develop solutions for automated evaluation of code comments in a machine learning framework based on human and large language model generated labels. In this track, there is a binary classification task to classify comments as useful and not useful. The dataset consists of 9048 code comments and surrounding code snippet pairs e…

    Submitted 27 October, 2023; originally announced November 2023.

    Comments: Overview Paper of the Information Retrieval of Software Engineering Track at the Forum for Information Retrieval, 2023

  37. arXiv:2310.17152  [pdf]

    cs.CV cs.AI cs.LG q-bio.QM

    Technical Note: Feasibility of translating 3.0T-trained Deep-Learning Segmentation Models Out-of-the-Box on Low-Field MRI 0.55T Knee-MRI of Healthy Controls

    Authors: Rupsa Bhattacharjee, Zehra Akkaya, Johanna Luitjens, Pan Su, Yang Yang, Valentina Pedoia, Sharmila Majumdar

    Abstract: In the current study, our purpose is to evaluate the feasibility of applying deep learning (DL) enabled algorithms to quantify bilateral knee biomarkers in healthy controls scanned at 0.55T and compared with 3.0T. The current study assesses the performance of standard in-practice bone, and cartilage segmentation algorithms at 0.55T, both qualitatively and quantitatively, in terms of comparing segm…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 11 Pages, 3 Figures, 2 Tables

  38. arXiv:2309.09950  [pdf, other]

    eess.AS cs.SD

    Investigating End-to-End ASR Architectures for Long Form Audio Transcription

    Authors: Nithin Rao Koluguri, Samuel Kriman, Georgy Zelenfroind, Somshubra Majumdar, Dima Rekesh, Vahid Noroozi, Jagadeesh Balam, Boris Ginsburg

    Abstract: This paper presents an overview and evaluation of some of the end-to-end ASR models on long-form audios. We study three categories of Automatic Speech Recognition (ASR) models based on their core architecture: (1) convolutional, (2) convolutional with squeeze-and-excitation and (3) convolutional models with attention. We selected one ASR model from each category and evaluated Word Error Rate, maxim…

    Submitted 20 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: PrePrint. Submitted to ICASSP 2024

  39. arXiv:2308.09138  [pdf, other]

    cs.CL cs.AI cs.CY

    Semantic Consistency for Assuring Reliability of Large Language Models

    Authors: Harsh Raj, Vipul Gupta, Domenic Rosati, Subhabrata Majumdar

    Abstract: Large Language Models (LLMs) exhibit remarkable fluency and competence across various natural language tasks. However, recent research has highlighted their sensitivity to variations in input prompts. To deploy LLMs in a safe and reliable manner, it is crucial for their outputs to be consistent when prompted with expressions that carry the same meaning or intent. While some existing work has explo…

    Submitted 28 April, 2025; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: An updated version of this preprint is available at arXiv:2502.15924, and has been accepted at the Transactions on Machine Learning Research

  40. arXiv:2308.06653  [pdf, other]

    cs.SE cs.AI

    Smart Knowledge Transfer using Google-like Search

    Authors: Srijoni Majumdar, Partha Pratim Das

    Abstract: To address the issue of rising software maintenance cost due to program comprehension challenges, we propose SMARTKT (Smart Knowledge Transfer), a search framework, which extracts and integrates knowledge related to various aspects of an application in the form of a semantic graph. This graph supports syntax and semantic queries and converts the process of program comprehension into a google-like…

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: 3 pages, 2 figures, accepted in the NDLI-UNESCO International Symposium on Knowledge Engineering for Digital Library Design 2019 (KEDL) as an extended abstract and poster

  41. arXiv:2307.12915  [pdf, other]

    cs.MA cs.AI

    Consensus-based Participatory Budgeting for Legitimacy: Decision Support via Multi-agent Reinforcement Learning

    Authors: Srijoni Majumdar, Evangelos Pournaras

    Abstract: The legitimacy of bottom-up democratic processes for the distribution of public funds by policy-makers is challenging and complex. Participatory budgeting is such a process, where voting outcomes may not always be fair or inclusive. Deliberation on which project ideas to put to a vote and choose for implementation lacks systematization and does not scale. This paper addresses these grand challenges…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 13 Pages, 8 Figures, 3 Tables, Accepted in International Conference on Machine Learning, Optimization, and Data Science, 2023

    Journal ref: International Conference on Machine Learning, Optimization, and Data Science, 2023

  42. arXiv:2307.08412  [pdf, other]

    cs.CR cs.DC

    A Privacy-Preserving Blockchain-based E-voting System

    Authors: Arnab Mukherjee, Souvik Majumdar, Anup Kumar Kolya, Saborni Nandi

    Abstract: Within a modern democratic nation, elections play a significant role in the nation's functioning. However, with the existing infrastructure for conducting elections using Electronic Voting Systems (EVMs), many loopholes exist, which illegitimate entities might leverage to cast false votes or even tamper with the EVMs after the voting session is complete. The need of the hour is to introduce a robu…

    Submitted 17 July, 2023; originally announced July 2023.

  43. Improving City Life via Legitimate and Participatory Policy-making: A Data-driven Approach in Switzerland

    Authors: Thomas Wellings, Srijoni Majumdar, Regula Hänggli Fricker, Evangelos Pournaras

    Abstract: This paper introduces a novel data-driven approach to address challenges faced by city policymakers concerning the distribution of public funds. Providing budgeting processes for improving quality of life based on objective (data-driven) evidence has been so far a missing element in policy-making. This paper focuses on a case study of 1,204 citizens in the city of Aarau, Switzerland, and analyzes…

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 18 pages, 15 figures

    Journal ref: 24th Annual International Conference on Digital Government Research (dg.o 2023)

  44. arXiv:2306.06283  [pdf, other]

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    14 Examples of How LLMs Can Transform Materials Science and Chemistry: A Reflection on a Large Language Model Hackathon

    Authors: Kevin Maik Jablonka, Qianxiang Ai, Alexander Al-Feghali, Shruti Badhwar, Joshua D. Bocarsly, Andres M Bran, Stefan Bringuier, L. Catherine Brinson, Kamal Choudhary, Defne Circi, Sam Cox, Wibe A. de Jong, Matthew L. Evans, Nicolas Gastellu, Jerome Genzling, María Victoria Gil, Ankur K. Gupta, Zhi Hong, Alishba Imran, Sabine Kruschwitz, Anne Labarre, Jakub Lála, Tao Liu, Steven Ma, Sauradeep Majumdar, et al. (28 additional authors not shown)

    Abstract: Large-language models (LLMs) such as GPT-4 caught the interest of many scientists. Recent studies suggested that these models could be useful in chemistry and materials science. To explore these possibilities, we organized a hackathon. This article chronicles the projects built as part of this hackathon. Participants employed LLMs for various applications, including predicting properties of mole…

    Submitted 14 July, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  45. arXiv:2305.16993  [pdf, ps, other]

    cs.DC cs.MA

    Discrete-choice Multi-agent Optimization: Decentralized Hard Constraint Satisfaction for Smart Cities

    Authors: Srijoni Majumdar, Chuhao Qin, Evangelos Pournaras

    Abstract: Making Smart Cities more sustainable, resilient and democratic is emerging as an endeavor of satisfying hard constraints, for instance meeting net-zero targets. Decentralized multi-agent methods for socio-technical optimization of large-scale complex infrastructures such as energy and transport networks are scalable and more privacy-preserving by design. However, they mainly focus on satisfying so…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 8 pages, 7 figures, Accepted for MSDM@AAMAS 2023

  46. arXiv:2305.05084  [pdf, other]

    eess.AS cs.SD

    Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition

    Authors: Dima Rekesh, Nithin Rao Koluguri, Samuel Kriman, Somshubra Majumdar, Vahid Noroozi, He Huang, Oleksii Hrinchuk, Krishna Puvvada, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg

    Abstract: Conformer-based models have become the dominant end-to-end architecture for speech processing tasks. With the objective of enhancing the conformer architecture for efficient training and inference, we carefully redesigned Conformer with a novel downsampling schema. The proposed model, named Fast Conformer (FC), is 2.8x faster than the original Conformer, supports scaling to billion parameters witho…

    Submitted 30 September, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted at ASRU 2023

  47. Very high resolution canopy height maps from RGB imagery using self-supervised vision transformer and convolutional decoder trained on Aerial Lidar

    Authors: Jamie Tolan, Hung-I Yang, Ben Nosarzewski, Guillaume Couairon, Huy Vo, John Brandt, Justine Spore, Sayantan Majumdar, Daniel Haziza, Janaki Vamaraju, Theo Moutakanni, Piotr Bojanowski, Tracy Johns, Brian White, Tobias Tiecke, Camille Couprie

    Abstract: Vegetation structure mapping is critical for understanding the global carbon cycle and monitoring nature-based approaches to climate adaptation and mitigation. Repeated measurements of these data allow for the observation of deforestation or degradation of existing forests, natural forest regeneration, and the implementation of sustainable agricultural practices like agroforestry. Assessments of t…

    Submitted 15 December, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Journal ref: Remote Sensing of Environment 300, 113888, 2024

  48. arXiv:2304.06795  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Efficient Sequence Transduction by Jointly Predicting Tokens and Durations

    Authors: Hainan Xu, Fei Jia, Somshubra Majumdar, He Huang, Shinji Watanabe, Boris Ginsburg

    Abstract: This paper introduces a novel Token-and-Duration Transducer (TDT) architecture for sequence-to-sequence tasks. TDT extends conventional RNN-Transducer architectures by jointly predicting both a token and its duration, i.e. the number of input frames covered by the emitted token. This is achieved by using a joint network with two outputs which are independently normalized to generate distributions…

    Submitted 29 May, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

  49. arXiv:2303.08535  [pdf, other]

    cond-mat.stat-mech cs.LG physics.data-an

    Singular relaxation of a random walk in a box with a Metropolis Monte Carlo dynamics

    Authors: Alexei D. Chepelianskii, Satya N. Majumdar, Hendrik Schawe, Emmanuel Trizac

    Abstract: We study analytically the relaxation eigenmodes of a simple Monte Carlo algorithm, corresponding to a particle in a box which moves by uniform random jumps. Moves outside of the box are rejected. At long times, the system approaches the equilibrium probability density, which is uniform inside the box. We show that the relaxation towards this equilibrium is unusual: for a jump length comparable to…

    Submitted 15 March, 2023; originally announced March 2023.

  50. arXiv:2211.13419  [pdf, other]

    cs.CR cs.LG stat.AP

    Network Security Modelling with Distributional Data

    Authors: Subhabrata Majumdar, Ganesh Subramaniam

    Abstract: We investigate the detection of botnet command and control (C2) hosts in massive IP traffic using machine learning methods. To this end, we use NetFlow data -- the industry standard for monitoring of IP traffic -- and ML models using two sets of features: conventional NetFlow variables and distributional features based on NetFlow variables. In addition to using static summaries of NetFlow features…

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted and presented in CAMLIS 2022, https://www.camlis.org/2022-conference. arXiv admin note: text overlap with arXiv:2108.08924