Skip to main content

Showing 51–100 of 907 results for author: Prateek

.
  1. arXiv:2502.06786  [pdf, other

    cs.LG cs.AI

    Matryoshka Quantization

    Authors: Pranav Nair, Puranjay Datta, Jeff Dean, Prateek Jain, Aditya Kusupati

    Abstract: Quantizing model weights is critical for reducing the communication and inference costs of large models. However, quantizing models -- especially to low precisions like int4 or int2 -- requires a trade-off in model quality; int2, in particular, is known to severely degrade model quality. Consequently, practitioners are often forced to maintain multiple models with different quantization levels or… ▽ More

    Submitted 3 March, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

  2. arXiv:2502.04248  [pdf, other

    cs.LG

    Adapting to Evolving Adversaries with Regularized Continual Robust Training

    Authors: Sihui Dai, Christian Cianfarani, Arjun Bhagoji, Vikash Sehwag, Prateek Mittal

    Abstract: Robust training methods typically defend against specific attack types, such as Lp attacks with fixed budgets, and rarely account for the fact that defenders may encounter new attacks over time. A natural solution is to adapt the defended model to new adversaries as they arise via fine-tuning, a method which we call continual robust training (CRT). However, when implemented naively, fine-tuning on… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  3. arXiv:2502.00706  [pdf, other

    cs.CR cs.CL cs.LG

    Model Provenance Testing for Large Language Models

    Authors: Ivica Nikolic, Teodora Baluta, Prateek Saxena

    Abstract: Large language models are increasingly customized through fine-tuning and other adaptations, creating challenges in enforcing licensing terms and managing downstream impacts. Tracking model origins is crucial both for protecting intellectual property and for identifying derived models when biases or vulnerabilities are discovered in foundation models. We address this challenge by developing a fram… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

  4. arXiv:2502.00382  [pdf, other

    cs.CV cs.AI cs.LG

    Masked Generative Nested Transformers with Decode Time Scaling

    Authors: Sahil Goyal, Debapriya Tula, Gagan Jain, Pradeep Shenoy, Prateek Jain, Sujoy Paul

    Abstract: Recent advances in visual generation have made significant strides in producing content of exceptional quality. However, most methods suffer from a fundamental problem - a bottleneck of inference computational efficiency. Most of these algorithms involve multiple passes over a transformer model to generate tokens or denoise inputs. However, the model size is kept consistent throughout all iteratio… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

  5. arXiv:2501.13114  [pdf, ps, other

    math.LO

    Continuous Algebra: Algebraic Semantics for Continuous Propositional Logic

    Authors: Purbita Jana, Prateek

    Abstract: We have introduced continuous algebra as the algebraic semantics for Continuous Propositional Logic (CPL). A Continuous algebra is an MV-algebra together with an unary operator $κ$, analogous to the unary connective $\dfrac{1}{2}$ in CPL. We establish structural results, including the subdirect representation theorem. We also introduce $\ell u^*$-groups, which are lattice ordered groups with stron… ▽ More

    Submitted 29 May, 2025; v1 submitted 18 January, 2025; originally announced January 2025.

  6. arXiv:2501.10489  [pdf

    physics.soc-ph

    Monte Carlo Simulations of Infection Spread in Indoor Environment

    Authors: Rahul Sheshanarayana, Prateek K. Jha

    Abstract: The dynamics of infection spread in populations has received popular attention since the outbreak of Covid-19 and many statistical models have been developed. One of the interesting areas of research is short-time dynamics in confined, indoor environments. We have modeled this using a simple Monte Carlo scheme. Our model is generally applicable for the peer-to-peer transmission case, when the infe… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

  7. arXiv:2501.09826  [pdf, other

    cs.CV

    PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery

    Authors: Shristi Das Biswas, Matthew Shreve, Xuelu Li, Prateek Singhal, Kaushik Roy

    Abstract: Recent advancements in language-guided diffusion models for image editing are often bottle-necked by cumbersome prompt engineering to precisely articulate desired changes. An intuitive alternative calls on guidance from in-the-wild image exemplars to help users bring their imagined edits to life. Contemporary exemplar-based editing methods shy away from leveraging the rich latent space learnt by p… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

  8. arXiv:2501.09672  [pdf, other

    cs.CV cs.AI

    Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark

    Authors: Alexis Roger, Prateek Humane, Daniel Z. Kaplan, Kshitij Gupta, Qi Sun, George Adamopoulos, Jonathan Siu Chi Lim, Quentin Anthony, Edwin Fennell, Irina Rish

    Abstract: The proliferation of Vision-Language Models (VLMs) in the past several years calls for rigorous and comprehensive evaluation methods and benchmarks. This work analyzes existing VLM evaluation techniques, including automated metrics, AI-based assessments, and human evaluations across diverse tasks. We first introduce Robin - a novel suite of VLMs that we built by combining Large Language Models (LL… ▽ More

    Submitted 20 January, 2025; v1 submitted 16 January, 2025; originally announced January 2025.

  9. arXiv:2501.08642  [pdf, other

    physics.flu-dyn

    Effects of pressure gradient histories on skin friction and mean flow of high Reynolds number turbulent boundary layers over smooth and rough walls

    Authors: Thomas Preskett, Marco Virgilio, Prateek Jaiswal, Bharathram Ganapathisubramani

    Abstract: Experiments are conducted over smooth and rough walls to explore the influence of pressure gradient histories on skin friction and mean flow of turbulent boundary layers. Different pressure gradient histories are imposed on the boundary layer through an aerofoil mounted in the freestream. Hot-wire measurements are taken at different freestream velocities downstream of the aerofoil where the flow h… ▽ More

    Submitted 15 January, 2025; originally announced January 2025.

  10. arXiv:2501.03876  [pdf, other

    astro-ph.IM

    Computational Astrophysics, Data Science & AI/ML in Astronomy: A Perspective from Indian Community

    Authors: Prateek Sharma, Bhargav Vaidya, Yogesh Wadadekar, Jasjeet Bagla, Piyali Chatterjee, Shravan Hanasoge, Prayush Kumar, Dipanjan Mukherjee, Ninan Sajeeth Philip, Nishant Singh

    Abstract: In contemporary astronomy and astrophysics (A&A), the integration of high-performance computing (HPC), big data analytics, and artificial intelligence/machine learning (AI/ML) has become essential for advancing research across a wide range of scientific domains. These tools are playing an increasingly pivotal role in accelerating discoveries, simulating complex astrophysical phenomena, and analyzi… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

    Comments: Accepted for publication in The Journal of Astrophysics and Astronomy. This is an expanded version of one of the chapters in the recently released Vision Document of the Astronomical Society of India

  11. arXiv:2412.16926  [pdf, other

    cs.CL cs.AI cs.LG

    Revisiting In-Context Learning with Long Context Language Models

    Authors: Jinheon Baek, Sun Jae Lee, Prakhar Gupta, Geunseob Oh, Siddharth Dalmia, Prateek Kolhar

    Abstract: In-Context Learning (ICL) is a technique by which language models make predictions based on examples provided in their input context. Previously, their context window size imposed a limit on the number of examples that can be shown, making example selection techniques crucial for identifying the maximally effective set of examples. However, the recent advent of Long Context Language Models (LCLMs)… ▽ More

    Submitted 28 May, 2025; v1 submitted 22 December, 2024; originally announced December 2024.

    Comments: ACL Findings 2025

  12. arXiv:2412.16429  [pdf, other

    cs.CY cs.AI cs.LG

    LearnLM: Improving Gemini for Learning

    Authors: LearnLM Team, Abhinit Modi, Aditya Srikanth Veerubhotla, Aliya Rysbek, Andrea Huber, Brett Wiltshire, Brian Veprek, Daniel Gillick, Daniel Kasenberg, Derek Ahmed, Irina Jurenka, James Cohan, Jennifer She, Julia Wilkowski, Kaiz Alarakyia, Kevin R. McKee, Lisa Wang, Markus Kunesch, Mike Schaekermann, Miruna Pîslar, Nikhil Joshi, Parsa Mahmoudieh, Paul Jhun, Sara Wiltberger, Shakir Mohamed , et al. (21 additional authors not shown)

    Abstract: Today's generative AI systems are tuned to present information by default rather than engage users in service of learning as a human tutor would. To address the wide range of potential education use cases for these systems, we reframe the challenge of injecting pedagogical behavior as one of \textit{pedagogical instruction following}, where training and evaluation examples include system-level ins… ▽ More

    Submitted 25 December, 2024; v1 submitted 20 December, 2024; originally announced December 2024.

  13. arXiv:2412.14327  [pdf, other

    cs.CV

    Personalized Generative Low-light Image Denoising and Enhancement

    Authors: Xijun Wang, Prateek Chennuri, Yu Yuan, Bole Ma, Xingguang Zhang, Stanley Chan

    Abstract: While smartphone cameras today can produce astonishingly good photos, their performance in low light is still not completely satisfactory because of the fundamental limits in photon shot noise and sensor read noise. Generative image restoration methods have demonstrated promising results compared to traditional methods, but they suffer from hallucinatory content generation when the signal-to-noise… ▽ More

    Submitted 10 March, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

  14. arXiv:2412.11449  [pdf, other

    cs.SD cs.AI cs.CL cs.LG eess.AS

    Whisper-GPT: A Hybrid Representation Audio Large Language Model

    Authors: Prateek Verma

    Abstract: We propose WHISPER-GPT: A generative large language model (LLM) for speech and music that allows us to work with continuous audio representations and discrete tokens simultaneously as part of a single architecture. There has been a huge surge in generative audio, speech, and music models that utilize discrete audio tokens derived from neural compression algorithms, e.g. ENCODEC. However, one of th… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: 6 pages, 3 figures. 50th International Conference on Acoustics, Speech and Signal Processing, Hyderabad, India

  15. arXiv:2412.09988  [pdf

    cs.CY cs.AI

    AI and the Future of Digital Public Squares

    Authors: Beth Goldberg, Diana Acosta-Navas, Michiel Bakker, Ian Beacock, Matt Botvinick, Prateek Buch, Renée DiResta, Nandika Donthi, Nathanael Fast, Ravi Iyer, Zaria Jalan, Andrew Konya, Grace Kwak Danciu, Hélène Landemore, Alice Marwick, Carl Miller, Aviv Ovadya, Emily Saltz, Lisa Schirch, Dalit Shalom, Divya Siddarth, Felix Sieker, Christopher Small, Jonathan Stray, Audrey Tang , et al. (2 additional authors not shown)

    Abstract: Two substantial technological advances have reshaped the public square in recent decades: first with the advent of the internet and second with the recent introduction of large language models (LLMs). LLMs offer opportunities for a paradigm shift towards more decentralized, participatory online spaces that can be used to facilitate deliberative dialogues at scale, but also create risks of exacerba… ▽ More

    Submitted 13 December, 2024; originally announced December 2024.

    Comments: 40 pages, 5 figures

  16. arXiv:2412.09538  [pdf, other

    cs.LG stat.ML

    Capturing the Temporal Dependence of Training Data Influence

    Authors: Jiachen T. Wang, Dawn Song, James Zou, Prateek Mittal, Ruoxi Jia

    Abstract: Traditional data influence estimation methods, like influence function, assume that learning algorithms are permutation-invariant with respect to training data. However, modern training paradigms, especially for foundation models using stochastic algorithms and multi-stage curricula, are sensitive to data ordering, thus violating this assumption. This mismatch renders influence functions inadequat… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

    Comments: Correspondence to Jiachen T. Wang and Ruoxi Jia

  17. arXiv:2412.08840  [pdf, ps, other

    stat.AP

    The Causal Effect of the Two-For-One Strategy in the National Basketball Association

    Authors: Prateek Sasan, Daryl Swartzentruber

    Abstract: This study evaluates the effectiveness of the two-for-one strategy in basketball by applying a causal inference framework to play-by-play data from the 2018-19 and 2021-22 National Basketball Association regular seasons. Incorporating factors such as player lineup, betting odds, and player ratings, we compute the average treatment effect and find that the two-for-one strategy has a positive impact… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

  18. arXiv:2412.08102  [pdf, other

    cs.RO

    Verification and Validation of a Vision-Based Landing System for Autonomous VTOL Air Taxis

    Authors: Ayoosh Bansal, Duo Wang, Mikael Yeghiazaryan, Yangge Li, Chuyuan Tao, Hyung-Jin Yoon, Prateek Arora, Christos Papachristos, Petros Voulgaris, Sayan Mitra, Lui Sha, Naira Hovakimyan

    Abstract: Autonomous air taxis are poised to revolutionize urban mass transportation, however, ensuring their safety and reliability remains an open challenge. Validating autonomy solutions on air taxis in the real world presents complexities, risks, and costs that further convolute this challenge. Verification and Validation (V&V) frameworks play a crucial role in the design and development of highly relia… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: To be published in AIAA SciTech 2025 Forum

    ACM Class: I.2.9

  19. arXiv:2412.07097  [pdf, other

    cs.CR cs.AI

    On Evaluating the Durability of Safeguards for Open-Weight LLMs

    Authors: Xiangyu Qi, Boyi Wei, Nicholas Carlini, Yangsibo Huang, Tinghao Xie, Luxi He, Matthew Jagielski, Milad Nasr, Prateek Mittal, Peter Henderson

    Abstract: Stakeholders -- from model developers to policymakers -- seek to minimize the dual-use risks of large language models (LLMs). An open challenge to this goal is whether technical safeguards can impede the misuse of LLMs, even when models are customizable via fine-tuning or when model weights are fully open. In response, several recent studies have proposed methods to produce durable LLM safeguards… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

  20. arXiv:2412.06011  [pdf, other

    eess.IV cs.CV

    TopoCellGen: Generating Histopathology Cell Topology with a Diffusion Model

    Authors: Meilong Xu, Saumya Gupta, Xiaoling Hu, Chen Li, Shahira Abousamra, Dimitris Samaras, Prateek Prasanna, Chao Chen

    Abstract: Accurately modeling multi-class cell topology is crucial in digital pathology, as it provides critical insights into tissue structure and pathology. The synthetic generation of cell topology enables realistic simulations of complex tissue environments, enhances downstream tasks by augmenting training data, aligns more closely with pathologists' domain knowledge, and offers new opportunities for co… ▽ More

    Submitted 24 March, 2025; v1 submitted 8 December, 2024; originally announced December 2024.

    Comments: Accepted by CVPR 2025. 15 pages, 8 figures

  21. arXiv:2412.03235  [pdf, other

    cs.CL cs.AI

    Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?

    Authors: Sravanti Addepalli, Yerram Varun, Arun Suggala, Karthikeyan Shanmugam, Prateek Jain

    Abstract: Large Language Models (LLMs) are known to be susceptible to crafted adversarial attacks or jailbreaks that lead to the generation of objectionable content despite being aligned to human preferences using safety fine-tuning methods. While the large dimensionality of input token space makes it inevitable to find adversarial prompts that can jailbreak these models, we aim to evaluate whether safety f… ▽ More

    Submitted 25 March, 2025; v1 submitted 4 December, 2024; originally announced December 2024.

    Comments: Accepted in ICLR 2025

    Journal ref: Addepalli, S., Varun, Y., Suggala, A., Shanmugam, K., & Jain, P. (2025). Does safety training of LLMs generalize to semantically related natural prompts? In The Thirteenth International Conference on Learning Representations 2025

  22. arXiv:2412.02626  [pdf, other

    cs.CL cs.AI

    Time-Reversal Provides Unsupervised Feedback to LLMs

    Authors: Yerram Varun, Rahul Madhavan, Sravanti Addepalli, Arun Suggala, Karthikeyan Shanmugam, Prateek Jain

    Abstract: Large Language Models (LLMs) are typically trained to predict in the forward direction of time. However, recent works have shown that prompting these models to look back and critique their own generations can produce useful feedback. Motivated by this, we explore the question of whether LLMs can be empowered to think (predict and score) backwards to provide unsupervised feedback that complements f… ▽ More

    Submitted 2 February, 2025; v1 submitted 3 December, 2024; originally announced December 2024.

    Comments: Accepted as a spotlight in NeurIPS 2024

    Journal ref: The Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024

  23. arXiv:2412.02168  [pdf, other

    cs.CV

    Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis

    Authors: Yu Yuan, Xijun Wang, Yichen Sheng, Prateek Chennuri, Xingguang Zhang, Stanley Chan

    Abstract: Image generation today can produce somewhat realistic images from text prompts. However, if one asks the generator to synthesize a specific camera setting such as creating different fields of view using a 24mm lens versus a 70mm lens, the generator will not be able to interpret and generate scene-consistent images. This limitation not only hinders the adoption of generative tools in professional p… ▽ More

    Submitted 24 March, 2025; v1 submitted 2 December, 2024; originally announced December 2024.

    Comments: Accepted by CVPR 2025. Project page: https://generative-photography.github.io/project/

  24. arXiv:2412.01672  [pdf, other

    cs.CV

    Gen-SIS: Generative Self-augmentation Improves Self-supervised Learning

    Authors: Varun Belagali, Srikar Yellapragada, Alexandros Graikos, Saarthak Kapse, Zilinghan Li, Tarak Nath Nandi, Ravi K Madduri, Prateek Prasanna, Joel Saltz, Dimitris Samaras

    Abstract: Self-supervised learning (SSL) methods have emerged as strong visual representation learners by training an image encoder to maximize similarity between features of different views of the same image. To perform this view-invariance task, current SSL algorithms rely on hand-crafted augmentations such as random cropping and color jittering to create multiple views of an image. Recently, generative d… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: Webpage: https://histodiffusion.github.io/docs/publications/gensis

  25. arXiv:2411.17173  [pdf, other

    astro-ph.GA

    Misty, patchy, and turbulent: constraining the cold circumgalactic medium with mCC

    Authors: Mukesh Singh Bisht, Prateek Sharma, Alankar Dutta, Biman B. Nath

    Abstract: The circumgalactic medium (CGM) is the largest baryon reservoir around galaxies, but its extent, mass, and temperature distribution remain uncertain. We propose that cold gas in the CGM resides primarily in $\sim 100 \hbox{--} 10^4$ cloud complexes (CCs), each containing a mist of tiny cold cloudlets dispersed in a warm/hot medium ($\sim 10^5 \hbox{--} 10^6$~K). Modeling CCs as uniform and misty s… ▽ More

    Submitted 28 November, 2024; v1 submitted 26 November, 2024; originally announced November 2024.

    Comments: submitted to MNRAS. 22 pages, 2 tables, 18 figures including Appendix

  26. arXiv:2411.16969  [pdf, other

    cs.CV

    ZoomLDM: Latent Diffusion Model for multi-scale image generation

    Authors: Srikar Yellapragada, Alexandros Graikos, Kostas Triaridis, Prateek Prasanna, Rajarsi R. Gupta, Joel Saltz, Dimitris Samaras

    Abstract: Diffusion models have revolutionized image generation, yet several challenges restrict their application to large-image domains, such as digital pathology and satellite imagery. Given that it is infeasible to directly train a model on 'whole' images from domains with potential gigapixel sizes, diffusion-based generative methods have focused on synthesizing small, fixed-size patches extracted from… ▽ More

    Submitted 24 March, 2025; v1 submitted 25 November, 2024; originally announced November 2024.

  27. arXiv:2411.15076  [pdf, other

    eess.IV cs.CV q-bio.QM

    RankByGene: Gene-Guided Histopathology Representation Learning Through Cross-Modal Ranking Consistency

    Authors: Wentao Huang, Meilong Xu, Xiaoling Hu, Shahira Abousamra, Aniruddha Ganguly, Saarthak Kapse, Alisa Yurovsky, Prateek Prasanna, Tahsin Kurc, Joel Saltz, Michael L. Miller, Chao Chen

    Abstract: Spatial transcriptomics (ST) provides essential spatial context by mapping gene expression within tissue, enabling detailed study of cellular heterogeneity and tissue organization. However, aligning ST data with histology images poses challenges due to inherent spatial distortions and modality-specific variations. Existing methods largely rely on direct alignment, which often fails to capture comp… ▽ More

    Submitted 22 March, 2025; v1 submitted 22 November, 2024; originally announced November 2024.

    Comments: 18 pages, 9 figures

  28. Translating C To Rust: Lessons from a User Study

    Authors: Ruishi Li, Bo Wang, Tianyu Li, Prateek Saxena, Ashish Kundu

    Abstract: Rust aims to offer full memory safety for programs, a guarantee that untamed C programs do not enjoy. How difficult is it to translate existing C code to Rust? To get a complementary view from that of automatic C to Rust translators, we report on a user study asking humans to translate real-world C programs to Rust. Our participants are able to produce safe Rust translations, whereas state-of-the-… ▽ More

    Submitted 5 December, 2024; v1 submitted 21 November, 2024; originally announced November 2024.

    Comments: Accepted by NDSS Symposium 2025. Please cite the conference version of this paper, e.g., "Ruishi Li, Bo Wang, Tianyu Li, Prateek Saxena, Ashish Kundu. Translating C To Rust: Lessons from a User Study. In 32nd Annual Network and Distributed System Security Symposium (NDSS 2025)."

  29. arXiv:2411.13459  [pdf, other

    cs.CR cs.AI cs.LG

    SoK: A Systems Perspective on Compound AI Threats and Countermeasures

    Authors: Sarbartha Banerjee, Prateek Sahu, Mulong Luo, Anjo Vahldiek-Oberwagner, Neeraja J. Yadwadkar, Mohit Tiwari

    Abstract: Large language models (LLMs) used across enterprises often use proprietary models and operate on sensitive inputs and data. The wide range of attack vectors identified in prior research - targeting various software and hardware components used in training and inference - makes it extremely challenging to enforce confidentiality and integrity policies. As we advance towards constructing compound… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

    Comments: 13 pages, 4 figures, 2 tables

  30. arXiv:2411.11581  [pdf, other

    cs.CL

    OASIS: Open Agent Social Interaction Simulations with One Million Agents

    Authors: Ziyi Yang, Zaibin Zhang, Zirui Zheng, Yuxian Jiang, Ziyue Gan, Zhiyu Wang, Zijian Ling, Jinsong Chen, Martz Ma, Bowen Dong, Prateek Gupta, Shuyue Hu, Zhenfei Yin, Guohao Li, Xu Jia, Lijun Wang, Bernard Ghanem, Huchuan Lu, Chaochao Lu, Wanli Ouyang, Yu Qiao, Philip Torr, Jing Shao

    Abstract: There has been a growing interest in enhancing rule-based agent-based models (ABMs) for social media platforms (i.e., X, Reddit) with more realistic large language model (LLM) agents, thereby allowing for a more nuanced study of complex systems. As a result, several LLM-based ABMs have been proposed in the past year. While they hold promise, each simulator is specifically designed to study a parti… ▽ More

    Submitted 23 March, 2025; v1 submitted 18 November, 2024; originally announced November 2024.

  31. arXiv:2411.11540  [pdf, other

    cs.DC

    The Jevons Paradox In Cloud Computing: A Thermodynamics Perspective

    Authors: Prateek Sharma

    Abstract: How do we explain the simultaneous growth in energy efficiency of cloud computing and its energy consumption? The Jevons paradox provides one perspective of this phenomenon. However, it is not clear or obvious \emph{why} the Jevons paradox exists, and \emph{when} is it applicable. To answer these questions, we seek inspiration from thermodynamics, and model the cloud as a thermodynamic system. We… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  32. arXiv:2411.11434  [pdf, other

    cs.CR

    CLUE-MARK: Watermarking Diffusion Models using CLWE

    Authors: Kareem Shehata, Aashish Kolluri, Prateek Saxena

    Abstract: As AI-generated images become widespread, reliable watermarking is essential for content verification, copyright enforcement, and combating disinformation. Existing techniques rely on heuristic approaches and lack formal guarantees of undetectability, making them vulnerable to steganographic attacks that can expose or erase the watermark. Additionally, these techniques often degrade output quality… ▽ More

    Submitted 12 December, 2024; v1 submitted 18 November, 2024; originally announced November 2024.

  33. arXiv:2411.08612  [pdf, other

    physics.space-ph

    Simulating the Arrival of Multiple Coronal Mass Ejections that Triggered the Gannon Superstorm on May 10, 2024

    Authors: Smitha V. Thampi, Ankush Bhaskar, Prateek Mayank, Bhargav Vaidya, Indu Venugopal

    Abstract: The May 10, 2024 space weather event stands out as the most powerful storm recorded during the current solar cycle. This study employs a numerical framework utilizing a semi-empirical coronal model, along with HUXt (Heliospheric Upwind eXtrapolation with time-dependence) and cone-CME models for the inner heliosphere, to forecast solar wind velocity and the arrival of CMEs associated with this even… ▽ More

    Submitted 13 November, 2024; originally announced November 2024.

    Comments: 18 pages, 10 figures

  34. arXiv:2411.07405  [pdf, other

    cs.RO eess.SY

    Quality of Control based Resource Dimensioning for Collaborative Edge Robotics

    Authors: Neelabhro Roy, Mani H. Dhullipalla, Gourav Prateek Sharma, Dimos V. Dimarogonas, James Gross

    Abstract: With the increasing focus on flexible automation, which emphasizes systems capable of adapting to varied tasks and conditions, exploring future deployments of cloud and edge-based network infrastructures in robotic systems becomes crucial. This work, examines how wireless solutions could support the shift from rigid, wired setups toward more adaptive, flexible automation in industrial environments… ▽ More

    Submitted 11 November, 2024; originally announced November 2024.

    Comments: Accepted in IEEE CCNC 2025

  35. arXiv:2411.05045  [pdf, other

    cs.CL

    Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale

    Authors: Flavio Di Palo, Prateek Singhi, Bilal Fadlallah

    Abstract: Large Language Models (LLMs) face significant challenges at inference time due to their high computational demands. To address this, we present Performance-Guided Knowledge Distillation (PGKD), a cost-effective and high-throughput solution for production text classification applications. PGKD utilizes teacher-student Knowledge Distillation to distill the knowledge of LLMs into smaller, task-specif… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

    Comments: Published in EMNLP 2024

  36. arXiv:2411.04838  [pdf, other

    cond-mat.stat-mech cs.AI cs.LG hep-th

    Machine learning and optimization-based approaches to duality in statistical physics

    Authors: Andrea E. V. Ferrari, Prateek Gupta, Nabil Iqbal

    Abstract: The notion of duality -- that a given physical system can have two different mathematical descriptions -- is a key idea in modern theoretical physics. Establishing a duality in lattice statistical mechanics models requires the construction of a dual Hamiltonian and a map from the original to the dual observables. By using simple neural networks to parameterize these maps and introducing a loss fun… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: 27 pages + appendices, lots of plots

  37. TopoTxR: A topology-guided deep convolutional network for breast parenchyma learning on DCE-MRIs

    Authors: Fan Wang, Zhilin Zou, Nicole Sakla, Luke Partyka, Nil Rawal, Gagandeep Singh, Wei Zhao, Haibin Ling, Chuan Huang, Prateek Prasanna, Chao Chen

    Abstract: Characterization of breast parenchyma in dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) is a challenging task owing to the complexity of underlying tissue structures. Existing quantitative approaches, like radiomics and deep learning models, lack explicit quantification of intricate and subtle parenchymal structures, including fibroglandular tissue. To address this, we propose a no… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: 22 pages, 8 figures, 8 tables, accepted by Medical Image Analysis ( https://www.sciencedirect.com/science/article/abs/pii/S1361841524002986 )

    Journal ref: Volume 99, 2025, 103373

  38. arXiv:2410.20135  [pdf, ps, other

    stat.ML cs.LG

    Near-Optimal Streaming Heavy-Tailed Statistical Estimation with Clipped SGD

    Authors: Aniket Das, Dheeraj Nagaraj, Soumyabrata Pal, Arun Suggala, Prateek Varshney

    Abstract: We consider the problem of high-dimensional heavy-tailed statistical estimation in the streaming setting, which is much harder than the traditional batch setting due to memory constraints. We cast this problem as stochastic convex optimization with heavy tailed stochastic gradients, and prove that the widely used Clipped-SGD algorithm attains near-optimal sub-Gaussian statistical rates whenever th… ▽ More

    Submitted 26 October, 2024; originally announced October 2024.

    Comments: Accepted at NeurIPS 2024

  39. Quanta Video Restoration

    Authors: Prateek Chennuri, Yiheng Chi, Enze Jiang, G. M. Dilshan Godaliyadda, Abhiram Gnanasambandam, Hamid R. Sheikh, Istvan Gyongy, Stanley H. Chan

    Abstract: The proliferation of single-photon image sensors has opened the door to a plethora of high-speed and low-light imaging applications. However, data collected by these sensors are often 1-bit or few-bit, and corrupted by noise and strong motion. Conventional video restoration methods are not designed to handle this situation, while specialized quanta burst algorithms have limited performance when th… ▽ More

    Submitted 14 November, 2024; v1 submitted 19 October, 2024; originally announced October 2024.

    Comments: Accepted at European Conference on Computer Vision (ECCV) 2024, Milano, Italy, Sept 29 - Oct 4, 2024, Part XL, LNCS 15098

    Journal ref: European Conference on Computer Vision (ECCV) 2024

  40. arXiv:2410.12367  [pdf, ps, other

    math.ST cs.LG stat.ME

    Adaptive and Stratified Subsampling Techniques for High Dimensional Non-Standard Data Environments

    Authors: Prateek Mittal, Jai Dalmotra, Joohi Chauhan

    Abstract: This paper addresses the challenge of estimating high-dimensional parameters in non-standard data environments, where traditional methods often falter due to issues such as heavy-tailed distributions, data contamination, and dependent observations. We propose robust subsampling techniques, specifically Adaptive Importance Sampling (AIS) and Stratified Subsampling, designed to enhance the reliabili… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  41. arXiv:2410.09102  [pdf, other

    cs.LG cs.AI cs.CL cs.CR

    Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy

    Authors: Tong Wu, Shujian Zhang, Kaiqiang Song, Silei Xu, Sanqiang Zhao, Ravi Agrawal, Sathish Reddy Indurthi, Chong Xiang, Prateek Mittal, Wenxuan Zhou

    Abstract: Large Language Models (LLMs) are susceptible to security and safety threats, such as prompt injection, prompt extraction, and harmful requests. One major cause of these vulnerabilities is the lack of an instruction hierarchy. Modern LLM architectures treat all inputs equally, failing to distinguish between and prioritize various types of instructions, such as system messages, user prompts, and dat… ▽ More

    Submitted 1 March, 2025; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: Preprint

    Journal ref: ICLR 2025

  42. arXiv:2410.07172  [pdf, other

    cs.LG cs.CL

    Glider: Global and Local Instruction-Driven Expert Router

    Authors: Pingzhi Li, Prateek Yadav, Jaehong Yoon, Jie Peng, Yi-Lin Sung, Mohit Bansal, Tianlong Chen

    Abstract: The availability of performant pre-trained models has led to a proliferation of fine-tuned expert models that are specialized to particular domains. This has enabled the creation of powerful and adaptive routing-based "Model MoErging" methods with the goal of using expert modules to create an aggregate system with improved performance or generalization. However, existing MoErging methods often pri… ▽ More

    Submitted 11 April, 2025; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: Our code is available at https://github.com/UNITES-Lab/glider

  43. arXiv:2410.06567  [pdf, other

    cs.LG

    Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization

    Authors: Prateek Varshney, Mert Pilanci

    Abstract: Deploying large and complex deep neural networks on resource-constrained edge devices poses significant challenges due to their computational demands and the complexities of non-convex optimization. Traditional compression methods such as distillation and pruning often retain non-convexity that complicates fine-tuning in real-time on such devices. Moreover, these methods often necessitate extensiv… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 10 Pages, 7 figures, 2 tables

  44. arXiv:2410.03820  [pdf, other

    hep-ph hep-th

    Axion Couplings in Heterotic String Theory

    Authors: Prateek Agrawal, Michael Nee, Mario Reig

    Abstract: We study the coupling of axions to gauge bosons in heterotic string theory. The axion-gauge boson couplings in the low energy 4d theory are derived by matching mixed anomalies between higher-form global symmetries and the zero-form gauge symmetry in the 10d theory. When the standard model gauge group is embedded in a single simple group in the 10d theory -- as is the case for almost all heterotic… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: 22 pages, 1 figure

  45. arXiv:2410.03617  [pdf, other

    cs.LG cs.AI cs.CL

    What Matters for Model Merging at Scale?

    Authors: Prateek Yadav, Tu Vu, Jonathan Lai, Alexandra Chronopoulou, Manaal Faruqui, Mohit Bansal, Tsendsuren Munkhdalai

    Abstract: Model merging aims to combine multiple expert models into a more capable single model, offering benefits such as reduced storage and serving costs, improved generalization, and support for decentralized model development. Despite its promise, previous studies have primarily focused on merging a few small models. This leaves many unanswered questions about the effect of scaling model size and how i… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: 20 Pages, 7 Figures, 4 Tables

  46. arXiv:2410.02212  [pdf, other

    cs.CV

    Hard Negative Sample Mining for Whole Slide Image Classification

    Authors: Wentao Huang, Xiaoling Hu, Shahira Abousamra, Prateek Prasanna, Chao Chen

    Abstract: Weakly supervised whole slide image (WSI) classification is challenging due to the lack of patch-level labels and high computational costs. State-of-the-art methods use self-supervised patch-wise feature representations for multiple instance learning (MIL). Recently, methods have been proposed to fine-tune the feature representation on the downstream task using pseudo labeling, but mostly focusing… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: 13 pages, 4 figures, accepted by MICCAI 2024

  47. arXiv:2410.02012  [pdf, other

    eess.IV cs.CV

    Semi-Supervised Contrastive VAE for Disentanglement of Digital Pathology Images

    Authors: Mahmudul Hasan, Xiaoling Hu, Shahira Abousamra, Prateek Prasanna, Joel Saltz, Chao Chen

    Abstract: Despite the strong prediction power of deep learning models, their interpretability remains an important concern. Disentanglement models increase interpretability by decomposing the latent space into interpretable subspaces. In this paper, we propose the first disentanglement method for pathology images. We focus on the task of detecting tumor-infiltrating lymphocytes (TIL). We propose different i… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  48. arXiv:2410.00307  [pdf, other

    cs.CV

    RadGazeGen: Radiomics and Gaze-guided Medical Image Generation using Diffusion Models

    Authors: Moinak Bhattacharya, Gagandeep Singh, Shubham Jain, Prateek Prasanna

    Abstract: In this work, we present RadGazeGen, a novel framework for integrating experts' eye gaze patterns and radiomic feature maps as controls to text-to-image diffusion models for high fidelity medical image generation. Despite the recent success of text-to-image diffusion models, text descriptions are often found to be inadequate and fail to convey detailed disease-specific information to these models… ▽ More

    Submitted 30 September, 2024; originally announced October 2024.

  49. arXiv:2409.19943  [pdf, other

    astro-ph.SR physics.space-ph

    Study of Evolution and Geo-effectiveness of CME-CME Interactions using MHD Simulations with SWASTi framework

    Authors: Prateek Mayank, Stefan Lotz, Bhargav Vaidya, Wageesh Mishra, D. Chakrabarty

    Abstract: The geo-effectiveness of Coronal Mass Ejections (CMEs) is a critical area of study in space weather, particularly in the lesser-explored domain of CME-CME interactions and their geomagnetic consequences. This study leverages the SWASTi framework to perform 3D MHD simulation of a range of CME-CME interaction scenarios within realistic solar wind conditions. The focus is on the dynamics of the initi… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: Accepted for publication in The Astrophysical Journal

  50. arXiv:2409.14848  [pdf, other

    math.OC cs.CE cs.DM

    A Bi-criterion Steiner Traveling Salesperson Problem with Time Windows for Last-Mile Electric Vehicle Logistics

    Authors: Prateek Agarwal, Debojjal Bagchi, Tarun Rambha, Venktesh Pandey

    Abstract: This paper addresses the problem of energy-efficient and safe routing of last-mile electric freight vehicles. With the rising environmental footprint of the transportation sector and the growing popularity of E-Commerce, freight companies are likely to benefit from optimal time-window-feasible tours that minimize energy usage while reducing traffic conflicts at intersections and thereby improving… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.