Skip to main content

Showing 1–50 of 107 results for author: Kulkarni, K

.
  1. arXiv:2510.07441  [pdf, ps, other

    cs.CV

    DynamicEval: Rethinking Evaluation for Dynamic Text-to-Video Synthesis

    Authors: Nithin C. Babu, Aniruddha Mahapatra, Harsh Rangwani, Rajiv Soundararajan, Kuldeep Kulkarni

    Abstract: Existing text-to-video (T2V) evaluation benchmarks, such as VBench and EvalCrafter, suffer from two limitations. (i) While the emphasis is on subject-centric prompts or static camera scenes, camera motion essential for producing cinematic shots and existing metrics under dynamic motion are largely unexplored. (ii) These benchmarks typically aggregate video-level scores into a single model-level sc… ▽ More

    Submitted 8 October, 2025; originally announced October 2025.

    Comments: Preprint. Under review. 26 pages, 11 figures, 11 tables. Access the project page in https://nithincbabu7.github.io/DynamicEval

  2. arXiv:2510.07061  [pdf, ps, other

    cs.CL

    Revisiting Metric Reliability for Fine-grained Evaluation of Machine Translation and Summarization in Indian Languages

    Authors: Amir Hossein Yari, Kalmit Kulkarni, Ahmad Raza Khan, Fajri Koto

    Abstract: While automatic metrics drive progress in Machine Translation (MT) and Text Summarization (TS), existing metrics have been developed and validated almost exclusively for English and other high-resource languages. This narrow focus leaves Indian languages, spoken by over 1.5 billion people, largely overlooked, casting doubt on the universality of current evaluation practices. To address this gap, w… ▽ More

    Submitted 8 October, 2025; originally announced October 2025.

    Comments: 18 pages, 14 figures

  3. arXiv:2509.15236  [pdf, ps, other

    cs.GR cs.AI

    ChannelFlow-Tools: A Standardized Dataset Creation Pipeline for 3D Obstructed Channel Flows

    Authors: Shubham Kavane, Kajol Kulkarni, Harald Koestler

    Abstract: We present ChannelFlow-Tools, a configuration-driven framework that standardizes the end-to-end path from programmatic CAD solid generation to ML-ready inputs and targets for 3D obstructed channel flows. The toolchain integrates geometry synthesis with feasibility checks, signed distance field (SDF) voxelization, automated solver orchestration on HPC (waLBerla LBM), and Cartesian resampling to co-… ▽ More

    Submitted 17 September, 2025; originally announced September 2025.

  4. arXiv:2509.14366  [pdf, ps, other

    physics.optics physics.app-ph

    Realization of a Chiral Photonic-Crystal Cavity with Broken Time-Reversal Symmetry

    Authors: Kiran M. Kulkarni, Hongjing Xu, Fuyang Tay, Gustavo M. Rodriguez-Barrios, Dasom Kim, Alessandro Alabastri, Vasil Rokaj, Ceren B. Dag, Andrey Baydin, Junichiro Kono

    Abstract: Light-matter interactions in chiral cavities offer a compelling route to manipulate material properties by breaking fundamental symmetries such as time-reversal symmetry. However, only a limited number of chiral cavity implementations exhibiting broken time-reversal symmetry have been demonstrated to date. These typically rely on either the application of strong magnetic fields, circularly polariz… ▽ More

    Submitted 17 September, 2025; originally announced September 2025.

  5. arXiv:2507.15743  [pdf, ps, other

    cs.AI cs.CL cs.HC cs.LG

    Towards physician-centered oversight of conversational diagnostic AI

    Authors: Elahe Vedadi, David Barrett, Natalie Harris, Ellery Wulczyn, Shashir Reddy, Roma Ruparel, Mike Schaekermann, Tim Strother, Ryutaro Tanno, Yash Sharma, Jihyeon Lee, Cían Hughes, Dylan Slack, Anil Palepu, Jan Freyberg, Khaled Saab, Valentin Liévin, Wei-Hung Weng, Tao Tu, Yun Liu, Nenad Tomasev, Kavita Kulkarni, S. Sara Mahdavi, Kelvin Guu, Joëlle Barral , et al. (10 additional authors not shown)

    Abstract: Recent work has demonstrated the promise of conversational AI systems for diagnostic dialogue. However, real-world assurance of patient safety means that providing individual diagnoses and treatment plans is considered a regulated activity by licensed professionals. Furthermore, physicians commonly oversee other team members in such activities, including nurse practitioners (NPs) or physician assi… ▽ More

    Submitted 21 July, 2025; originally announced July 2025.

  6. arXiv:2506.20323  [pdf

    cs.LG cs.AI

    Comparative Analysis of Deep Learning Models for Crop Disease Detection: A Transfer Learning Approach

    Authors: Saundarya Subramaniam, Shalini Majumdar, Shantanu Nadar, Kaustubh Kulkarni

    Abstract: This research presents the development of an Artificial Intelligence (AI) - driven crop disease detection system designed to assist farmers in rural areas with limited resources. We aim to compare different deep learning models for a comparative analysis, focusing on their efficacy in transfer learning. By leveraging deep learning models, including EfficientNet, ResNet101, MobileNetV2, and our cus… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

  7. arXiv:2506.17471  [pdf, ps, other

    cs.DC cs.MS cs.PF math.NA

    Code Generation for Near-Roofline Finite Element Actions on GPUs from Symbolic Variational Forms

    Authors: Kaushik Kulkarni, Andreas Klöckner

    Abstract: We present a novel parallelization strategy for evaluating Finite Element Method (FEM) variational forms on GPUs, focusing on those that are expressible through the Unified Form Language (UFL) on simplex meshes. We base our approach on code transformations, wherein we construct a space of scheduling candidates and rank them via a heuristic cost model to effectively handle the large diversity of co… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    MSC Class: 65Y05

  8. arXiv:2506.12103  [pdf, other

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  9. arXiv:2504.14809  [pdf, other

    cs.CR cs.CY

    vApps: Verifiable Applications at Internet Scale

    Authors: Isaac Zhang, Kshitij Kulkarni, Tan Li, Daniel Wong, Thomas Kim, John Guibas, Uma Roy, Bryan Pellegrino, Ryan Zarick

    Abstract: Blockchain technology promises a decentralized, trustless, and interoperable infrastructure. However, widespread adoption remains hindered by issues such as limited scalability, high transaction costs, and the complexity of maintaining coherent verification logic across different blockchain layers. This paper introduces Verifiable Applications (vApps), a novel development framework designed to str… ▽ More

    Submitted 29 April, 2025; v1 submitted 20 April, 2025; originally announced April 2025.

    Comments: 12 pages, 11 figures

  10. arXiv:2503.06074  [pdf, other

    cs.CL cs.AI cs.LG

    Towards Conversational AI for Disease Management

    Authors: Anil Palepu, Valentin Liévin, Wei-Hung Weng, Khaled Saab, David Stutz, Yong Cheng, Kavita Kulkarni, S. Sara Mahdavi, Joëlle Barral, Dale R. Webster, Katherine Chou, Avinatan Hassidim, Yossi Matias, James Manyika, Ryutaro Tanno, Vivek Natarajan, Adam Rodman, Tao Tu, Alan Karthikesalingam, Mike Schaekermann

    Abstract: While large language models (LLMs) have shown promise in diagnostic dialogue, their capabilities for effective management reasoning - including disease progression, therapeutic response, and safe medication prescription - remain under-explored. We advance the previously demonstrated diagnostic capabilities of the Articulate Medical Intelligence Explorer (AMIE) through a new LLM-based agentic syste… ▽ More

    Submitted 8 March, 2025; originally announced March 2025.

    Comments: 62 pages, 7 figures in main text, 36 figures in appendix

  11. arXiv:2502.18864  [pdf, other

    cs.AI cs.CL cs.HC cs.LG physics.soc-ph q-bio.OT

    Towards an AI co-scientist

    Authors: Juraj Gottweis, Wei-Hung Weng, Alexander Daryin, Tao Tu, Anil Palepu, Petar Sirkovic, Artiom Myaskovsky, Felix Weissenberger, Keran Rong, Ryutaro Tanno, Khaled Saab, Dan Popovici, Jacob Blum, Fan Zhang, Katherine Chou, Avinatan Hassidim, Burak Gokturk, Amin Vahdat, Pushmeet Kohli, Yossi Matias, Andrew Carroll, Kavita Kulkarni, Nenad Tomasev, Yuan Guan, Vikram Dhillon , et al. (9 additional authors not shown)

    Abstract: Scientific discovery relies on scientists generating novel hypotheses that undergo rigorous experimental validation. To augment this process, we introduce an AI co-scientist, a multi-agent system built on Gemini 2.0. The AI co-scientist is intended to help uncover new, original knowledge and to formulate demonstrably novel research hypotheses and proposals, building upon prior evidence and aligned… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: 81 pages in total (main 38 pages, appendix 43 pages), 13 main figures, 40 appendix figures, 1 main table, 2 appendix tables, 143 main references, 7 appendix references

  12. arXiv:2502.18786  [pdf, other

    cs.NE cs.AI q-bio.NC

    NeuroTree: Hierarchical Functional Brain Pathway Decoding for Mental Health Disorders

    Authors: Jun-En Ding, Dongsheng Luo, Anna Zilverstand, Kaustubh Kulkarni, Feng Liu

    Abstract: Mental disorders are among the most widespread diseases globally. Analyzing functional brain networks through functional magnetic resonance imaging (fMRI) is crucial for understanding mental disorder behaviors. Although existing fMRI-based graph neural networks (GNNs) have demonstrated significant potential in brain network feature extraction, they often fail to characterize complex relationships… ▽ More

    Submitted 23 May, 2025; v1 submitted 25 February, 2025; originally announced February 2025.

  13. arXiv:2502.02059  [pdf, other

    cs.GT

    Optimal Routing in the Presence of Hooks: Three Case Studies

    Authors: Tarun Chitra, Kshitij Kulkarni, Karthik Srinivasan

    Abstract: We consider the problem of optimally executing a user trade over networks of constant function market makers (CFMMs) in the presence of hooks. Hooks, introduced in an upcoming version of Uniswap, are auxiliary smart contracts that allow for extra information to be added to liquidity pools. This allows liquidity providers to enable constraints on trades, allowing CFMMs to read external data, such a… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

    Comments: 25 pages, 14 figures

  14. arXiv:2501.02840  [pdf

    cs.CV cs.AI cs.LG

    Enhanced Rooftop Solar Panel Detection by Efficiently Aggregating Local Features

    Authors: Kuldeep Kurte, Kedar Kulkarni

    Abstract: In this paper, we present an enhanced Convolutional Neural Network (CNN)-based rooftop solar photovoltaic (PV) panel detection approach using satellite images. We propose to use pre-trained CNN-based model to extract the local convolutional features of rooftops. These local features are then combined using the Vectors of Locally Aggregated Descriptors (VLAD) technique to obtain rooftop-level globa… ▽ More

    Submitted 6 January, 2025; originally announced January 2025.

    Comments: Accepted at CODS-COMAD 2024, December, 2024, Jodhpur, India (https://cods-comad.in/accepted-papers.php)

  15. arXiv:2411.15028  [pdf, other

    cs.CV

    FloAt: Flow Warping of Self-Attention for Clothing Animation Generation

    Authors: Swasti Shreya Mishra, Kuldeep Kulkarni, Duygu Ceylan, Balaji Vasan Srinivasan

    Abstract: We propose a diffusion model-based approach, FloAtControlNet to generate cinemagraphs composed of animations of human clothing. We focus on human clothing like dresses, skirts and pants. The input to our model is a text prompt depicting the type of clothing and the texture of clothing like leopard, striped, or plain, and a sequence of normal maps that capture the underlying animation that we desir… ▽ More

    Submitted 22 November, 2024; originally announced November 2024.

  16. arXiv:2411.03395  [pdf, other

    cs.HC cs.CL

    Exploring Large Language Models for Specialist-level Oncology Care

    Authors: Anil Palepu, Vikram Dhillon, Polly Niravath, Wei-Hung Weng, Preethi Prasad, Khaled Saab, Ryutaro Tanno, Yong Cheng, Hanh Mai, Ethan Burns, Zainub Ajmal, Kavita Kulkarni, Philip Mansfield, Dale Webster, Joelle Barral, Juraj Gottweis, Mike Schaekermann, S. Sara Mahdavi, Vivek Natarajan, Alan Karthikesalingam, Tao Tu

    Abstract: Large language models (LLMs) have shown remarkable progress in encoding clinical knowledge and responding to complex medical queries with appropriate clinical reasoning. However, their applicability in subspecialist or complex medical settings remains underexplored. In this work, we probe the performance of AMIE, a research conversational diagnostic AI system, in the subspecialist domain of breast… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

  17. arXiv:2410.03741  [pdf, other

    cs.HC cs.AI

    Towards Democratization of Subspeciality Medical Expertise

    Authors: Jack W. O'Sullivan, Anil Palepu, Khaled Saab, Wei-Hung Weng, Yong Cheng, Emily Chu, Yaanik Desai, Aly Elezaby, Daniel Seung Kim, Roy Lan, Wilson Tang, Natalie Tapaskar, Victoria Parikh, Sneha S. Jain, Kavita Kulkarni, Philip Mansfield, Dale Webster, Juraj Gottweis, Joelle Barral, Mike Schaekermann, Ryutaro Tanno, S. Sara Mahdavi, Vivek Natarajan, Alan Karthikesalingam, Euan Ashley , et al. (1 additional authors not shown)

    Abstract: The scarcity of subspecialist medical expertise, particularly in rare, complex and life-threatening diseases, poses a significant challenge for healthcare delivery. This issue is particularly acute in cardiology where timely, accurate management determines outcomes. We explored the potential of AMIE (Articulate Medical Intelligence Explorer), a large language model (LLM)-based experimental AI syst… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  18. arXiv:2409.15650  [pdf, other

    cs.CV

    ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models

    Authors: Divya Kothandaraman, Kuldeep Kulkarni, Sumit Shekhar, Balaji Vasan Srinivasan, Dinesh Manocha

    Abstract: We present ImPoster, a novel algorithm for generating a target image of a 'source' subject performing a 'driving' action. The inputs to our algorithm are a single pair of a source image with the subject that we wish to edit and a driving image with a subject of an arbitrary class performing the driving action, along with the text descriptions of the two images. Our approach is completely unsupervi… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

    Journal ref: COLING 2025

  19. arXiv:2406.10197  [pdf, ps, other

    cs.CV cs.AI cs.LG

    Composing Parts for Expressive Object Generation

    Authors: Harsh Rangwani, Aishwarya Agarwal, Kuldeep Kulkarni, R. Venkatesh Babu, Srikrishna Karanam

    Abstract: Image composition and generation are processes where the artists need control over various parts of the generated images. However, the current state-of-the-art generation models, like Stable Diffusion, cannot handle fine-grained part-level attributes in the text prompts. Specifically, when additional attribute details are added to the base text prompt, these text-to-image models either generate an… ▽ More

    Submitted 29 June, 2025; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: Project Page Will Be Here: https://rangwani-harsh.github.io/PartCraft

  20. arXiv:2405.16716  [pdf, other

    cs.GT cs.MA eess.SY math.DS

    Adaptive Incentive Design with Learning Agents

    Authors: Chinmay Maheshwari, Kshitij Kulkarni, Manxi Wu, Shankar Sastry

    Abstract: We propose an adaptive incentive mechanism that learns the optimal incentives in environments where players continuously update their strategies. Our mechanism updates incentives based on each player's externality, defined as the difference between the player's marginal cost and the operator's marginal cost at each time step. The proposed mechanism updates the incentives on a slower timescale comp… ▽ More

    Submitted 1 March, 2025; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: 40 pages

  21. arXiv:2404.18416  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    Capabilities of Gemini Models in Medicine

    Authors: Khaled Saab, Tao Tu, Wei-Hung Weng, Ryutaro Tanno, David Stutz, Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves, Szu-Yeu Hu, Mike Schaekermann, Aishwarya Kamath, Yong Cheng, David G. T. Barrett, Cathy Cheung, Basil Mustafa, Anil Palepu, Daniel McDuff, Le Hou, Tomer Golany, Luyang Liu, Jean-baptiste Alayrac, Neil Houlsby , et al. (42 additional authors not shown)

    Abstract: Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-G… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  22. arXiv:2404.03919  [pdf, other

    math.OC cs.GT cs.MA eess.SY

    Understanding the Impact of Coalitions between EV Charging Stations

    Authors: Sukanya Kudva, Kshitij Kulkarni, Chinmay Maheshwari, Anil Aswani, Shankar Sastry

    Abstract: The rapid growth of electric vehicles (EVs) is driving the expansion of charging infrastructure globally. As charging stations become ubiquitous, their substantial electricity consumption can influence grid operation and electricity pricing. Naturally, \textit{some} groups of charging stations, which could be jointly operated by a company, may coordinate to decide their charging profile. While coo… ▽ More

    Submitted 3 October, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: 20 pages, 5 figures

    MSC Class: 91A10; 91A80; 91B52; 91B54; 91B74; 93A16; 93A15

  23. arXiv:2403.02525  [pdf, other

    cs.GT

    An Analysis of Intent-Based Markets

    Authors: Tarun Chitra, Kshitij Kulkarni, Mallesh Pai, Theo Diamandis

    Abstract: Mechanisms for decentralized finance on blockchains suffer from various problems, including suboptimal price execution for users, latency, and a worse user experience compared to their centralized counterparts. Recently, off-chain marketplaces, colloquially called `intent markets,' have been proposed as a solution to these problems. In these markets, agents called \emph{solvers} compete to satisfy… ▽ More

    Submitted 6 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 27 pages, 2 figures

  24. arXiv:2403.00033  [pdf, other

    q-bio.NC cs.LG eess.SP

    Spatial Craving Patterns in Marijuana Users: Insights from fMRI Brain Connectivity Analysis with High-Order Graph Attention Neural Networks

    Authors: Jun-En Ding, Shihao Yang, Anna Zilverstand, Kaustubh R. Kulkarni, Xiaosi Gu, Feng Liu

    Abstract: The excessive consumption of marijuana can induce substantial psychological and social consequences. In this investigation, we propose an elucidative framework termed high-order graph attention neural networks (HOGANN) for the classification of Marijuana addiction, coupled with an analysis of localized brain network communities exhibiting abnormal activities among chronic marijuana users. HOGANN i… ▽ More

    Submitted 8 September, 2024; v1 submitted 28 February, 2024; originally announced March 2024.

  25. arXiv:2401.16844  [pdf, other

    cs.GT cs.CY cs.MA econ.EM eess.SY

    Congestion Pricing for Efficiency and Equity: Theory and Applications to the San Francisco Bay Area

    Authors: Chinmay Maheshwari, Kshitij Kulkarni, Druv Pai, Jiarui Yang, Manxi Wu, Shankar Sastry

    Abstract: Congestion pricing, while adopted by many cities to alleviate traffic congestion, raises concerns about widening socioeconomic disparities due to its disproportionate impact on low-income travelers. We address this concern by proposing a new class of congestion pricing schemes that not only minimize total travel time, but also incorporate an equity objective, reducing disparities in the relative c… ▽ More

    Submitted 20 September, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: 44 pages, 12 figures

    MSC Class: 91A07; 91A14; 91A68; 91A90

  26. arXiv:2401.05654  [pdf, other

    cs.AI cs.CL cs.LG

    Towards Conversational Diagnostic AI

    Authors: Tao Tu, Anil Palepu, Mike Schaekermann, Khaled Saab, Jan Freyberg, Ryutaro Tanno, Amy Wang, Brenna Li, Mohamed Amin, Nenad Tomasev, Shekoofeh Azizi, Karan Singhal, Yong Cheng, Le Hou, Albert Webson, Kavita Kulkarni, S Sara Mahdavi, Christopher Semturs, Juraj Gottweis, Joelle Barral, Katherine Chou, Greg S Corrado, Yossi Matias, Alan Karthikesalingam, Vivek Natarajan

    Abstract: At the heart of medicine lies the physician-patient dialogue, where skillful history-taking paves the way for accurate diagnosis, effective management, and enduring trust. Artificial Intelligence (AI) systems capable of diagnostic dialogue could increase accessibility, consistency, and quality of care. However, approximating clinicians' expertise is an outstanding grand challenge. Here, we introdu… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 46 pages, 5 figures in main text, 19 figures in appendix

  27. arXiv:2312.00164  [pdf, other

    cs.CY cs.AI

    Towards Accurate Differential Diagnosis with Large Language Models

    Authors: Daniel McDuff, Mike Schaekermann, Tao Tu, Anil Palepu, Amy Wang, Jake Garrison, Karan Singhal, Yash Sharma, Shekoofeh Azizi, Kavita Kulkarni, Le Hou, Yong Cheng, Yun Liu, S Sara Mahdavi, Sushant Prakash, Anupam Pathak, Christopher Semturs, Shwetak Patel, Dale R Webster, Ewa Dominowska, Juraj Gottweis, Joelle Barral, Katherine Chou, Greg S Corrado, Yossi Matias , et al. (3 additional authors not shown)

    Abstract: An accurate differential diagnosis (DDx) is a cornerstone of medical care, often reached through an iterative process of interpretation that combines clinical history, physical examination, investigations and procedures. Interactive interfaces powered by Large Language Models (LLMs) present new opportunities to both assist and automate aspects of this process. In this study, we introduce an LLM op… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  28. arXiv:2310.07865  [pdf, ps, other

    math.OC cs.GT math.CO q-fin.CP

    The Specter (and Spectra) of Miner Extractable Value

    Authors: Guillermo Angeris, Tarun Chitra, Theo Diamandis, Kshitij Kulkarni

    Abstract: Miner extractable value (MEV) refers to any excess value that a transaction validator can realize by manipulating the ordering of transactions. In this work, we introduce a simple theoretical definition of the 'cost of MEV', prove some basic properties, and show that the definition is useful via a number of examples. In a variety of settings, this definition is related to the 'smoothness' of a fun… ▽ More

    Submitted 12 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  29. arXiv:2308.08066  [pdf, ps, other

    math.OC q-fin.MF q-fin.TR

    The Geometry of Constant Function Market Makers

    Authors: Guillermo Angeris, Tarun Chitra, Theo Diamandis, Alex Evans, Kshitij Kulkarni

    Abstract: Constant function market makers (CFMMs) are the most popular type of decentralized trading venue for cryptocurrency tokens. In this paper, we give a very general geometric framework (or 'axioms') which encompass and generalize many of the known results for CFMMs in the literature, without requiring strong conditions such as differentiability or homogeneity. One particular consequence of this frame… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  30. arXiv:2307.13139  [pdf, ps, other

    cs.CR

    Attacks on Dynamic DeFi Interest Rate Curves

    Authors: Tarun Chitra, Peteris Erins, Kshitij Kulkarni

    Abstract: As decentralized money market protocols continue to grow in value locked, there have been a number of optimizations proposed for improving capital efficiency. One set of proposals from Euler Finance and Mars Protocol is to have an interest rate curve that is a proportional-integral-derivative (PID) controller. In this paper, we demonstrate attacks on proportional and proportional-integral controll… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

  31. arXiv:2303.08639  [pdf, other

    cs.CV

    Blowing in the Wind: CycleNet for Human Cinemagraphs from Still Images

    Authors: Hugo Bertiche, Niloy J. Mitra, Kuldeep Kulkarni, Chun-Hao Paul Huang, Tuanfeng Y. Wang, Meysam Madadi, Sergio Escalera, Duygu Ceylan

    Abstract: Cinemagraphs are short looping videos created by adding subtle motions to a static image. This kind of media is popular and engaging. However, automatic generation of cinemagraphs is an underexplored area and current solutions require tedious low-level manual authoring by artists. In this paper, we present an automatic method that allows generating human cinemagraphs from single RGB images. We inv… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  32. arXiv:2303.00830  [pdf, other

    eess.AS cs.SD eess.SP

    DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments

    Authors: Shikha Baghel, Shreyas Ramoji, Sidharth, Ranjana H, Prachi Singh, Somil Jain, Pratik Roy Chowdhuri, Kaustubh Kulkarni, Swapnil Padhi, Deepu Vijayasenan, Sriram Ganapathy

    Abstract: In multilingual societies, social conversations often involve code-mixed speech. The current speech technology may not be well equipped to extract information from multi-lingual multi-speaker conversations. The DISPLACE challenge entails a first-of-kind task to benchmark speaker and language diarization on the same data, as the data contains multi-speaker conversations in multilingual code-mixed s… ▽ More

    Submitted 5 June, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

  33. arXiv:2302.09657  [pdf

    cs.CV cs.AI cs.LG

    Table Tennis Stroke Detection and Recognition Using Ball Trajectory Data

    Authors: Kaustubh Milind Kulkarni, Rohan S Jamadagni, Jeffrey Aaron Paul, Sucheth Shenoy

    Abstract: In this work, the novel task of detecting and classifying table tennis strokes solely using the ball trajectory has been explored. A single camera setup positioned in the umpire's view has been employed to procure a dataset consisting of six stroke classes executed by four professional table tennis players. Ball tracking using YOLOv4, a traditional object detection model, and TrackNetv2, a tempora… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

    Comments: 9 pages, 5 figures, 6 tables

  34. arXiv:2302.02249  [pdf, other

    cs.CV cs.AI

    Self-supervised Multi-view Disentanglement for Expansion of Visual Collections

    Authors: Nihal Jain, Praneetha Vaddamanu, Paridhi Maheshwari, Vishwa Vinay, Kuldeep Kulkarni

    Abstract: Image search engines enable the retrieval of images relevant to a query image. In this work, we consider the setting where a query for similar images is derived from a collection of images. For visual search, the similarity measurements may be made along multiple axes, or views, such as style and color. We assume access to a set of feature extractors, each of which computes representations for a s… ▽ More

    Submitted 4 February, 2023; originally announced February 2023.

    Comments: A version of this paper has been accepted at WSDM 2023

  35. arXiv:2301.12532  [pdf, ps, other

    cs.GT cs.CR

    Credible, Optimal Auctions via Public Broadcast

    Authors: Tarun Chitra, Matheus V. X. Ferreira, Kshitij Kulkarni

    Abstract: We study auction design in a setting where agents can communicate over a censorship-resistant broadcast channel like the ones we can implement over a public blockchain. We seek to design credible, strategyproof auctions in a model that differs from the traditional mechanism design framework because communication is not centralized via the auctioneer. We prove this allows us to design a larger clas… ▽ More

    Submitted 3 September, 2024; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: 19 pages

    Journal ref: AFT 2024

  36. arXiv:2212.03313  [pdf, other

    astro-ph.HE astro-ph.SR

    The prevalence and influence of circumstellar material around hydrogen-rich supernova progenitors

    Authors: Rachel J. Bruch, Avishay Gal-Yam, Ofer Yaron, Ping Chen, Nora L. Strotjohann, Ido Irani, Erez Zimmerman, Steve Schulze, Yi Yang, Young-Lo Kim, Mattia Bulla, Jesper Sollerman, Mickael Rigault, Eran Ofek, Maayane Soumagnac, Frank J. Masci, Christoffer Fremling, Daniel Perley, Jakob Nordin, S. Bradley Cenko, Anna Y. Q. Ho, S. Adams, Igor Adreoni, Eric C. Bellm, Nadia Blagorodnova , et al. (22 additional authors not shown)

    Abstract: Narrow transient emission lines (flash-ionization features) in early supernova (SN) spectra trace the presence of circumstellar material (CSM) around the massive progenitor stars of core-collapse SNe. The lines disappear within days after the SN explosion, suggesting that this material is spatially confined, and originates from enhanced mass loss shortly (months to a few years) prior to explosion.… ▽ More

    Submitted 13 December, 2022; v1 submitted 6 December, 2022; originally announced December 2022.

  37. arXiv:2211.16596  [pdf, other

    stat.ML cs.LG eess.SY

    Towards Dynamic Causal Discovery with Rare Events: A Nonparametric Conditional Independence Test

    Authors: Chih-Yuan Chiu, Kshitij Kulkarni, Shankar Sastry

    Abstract: Causal phenomena associated with rare events occur across a wide range of engineering problems, such as risk-sensitive safety analysis, accident analysis and prevention, and extreme value theory. However, current methods for causal discovery are often unable to uncover causal links, between random variables in a dynamic setting, that manifest only when the variables first experience low-probabilit… ▽ More

    Submitted 17 July, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

  38. arXiv:2210.00708  [pdf

    cs.CV cs.LG eess.IV

    EraseNet: A Recurrent Residual Network for Supervised Document Cleaning

    Authors: Yashowardhan Shinde, Kishore Kulkarni, Sachin Kuberkar

    Abstract: Document denoising is considered one of the most challenging tasks in computer vision. There exist millions of documents that are still to be digitized, but problems like document degradation due to natural and man-made factors make this task very difficult. This paper introduces a supervised approach for cleaning dirty documents using a new fully convolutional auto-encoder architecture. This pape… ▽ More

    Submitted 4 July, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 10 pages, 5 figures, attempting for publication in International Journal on Document Analysis and Recognition (IJDAR)

  39. arXiv:2207.11835  [pdf, other

    cs.GT cs.CR q-fin.CP

    Towards a Theory of Maximal Extractable Value I: Constant Function Market Makers

    Authors: Kshitij Kulkarni, Theo Diamandis, Tarun Chitra

    Abstract: Maximal Extractable Value (MEV) refers to excess value captured by miners (or validators) from users in a cryptocurrency network. This excess value often comes from reordering users' transactions to maximize fees or from inserting new transactions that front-run users' transactions. One of the most common types of MEV involves a `sandwich attack' against a user trading on a constant function marke… ▽ More

    Submitted 30 April, 2023; v1 submitted 24 July, 2022; originally announced July 2022.

  40. arXiv:2207.03729  [pdf, other

    cs.CV

    GEMS: Scene Expansion using Generative Models of Graphs

    Authors: Rishi Agarwal, Tirupati Saketh Chandra, Vaidehi Patil, Aniruddha Mahapatra, Kuldeep Kulkarni, Vishwa Vinay

    Abstract: Applications based on image retrieval require editing and associating in intermediate spaces that are representative of the high-level concepts like objects and their relationships rather than dense, pixel-level representations like RGB images or semantic-label maps. We focus on one such representation, scene graphs, and propose a novel scene expansion task where we enrich an input seed graph by a… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  41. arXiv:2204.05507  [pdf, ps, other

    cs.GT econ.GN econ.TH eess.SY

    Inducing Social Optimality in Games via Adaptive Incentive Design

    Authors: Chinmay Maheshwari, Kshitij Kulkarni, Manxi Wu, Shankar Sastry

    Abstract: How can a social planner adaptively incentivize selfish agents who are learning in a strategic environment to induce a socially optimal outcome in the long run? We propose a two-timescale learning dynamics to answer this question in both atomic and non-atomic games. In our learning dynamics, players adopt a class of learning rules to update their strategies at a faster timescale, while a social pl… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 20 pages

  42. arXiv:2112.03051  [pdf, other

    cs.CV

    Controllable Animation of Fluid Elements in Still Images

    Authors: Aniruddha Mahapatra, Kuldeep Kulkarni

    Abstract: We propose a method to interactively control the animation of fluid elements in still images to generate cinemagraphs. Specifically, we focus on the animation of fluid elements like water, smoke, fire, which have the properties of repeating textures and continuous fluid motion. Taking inspiration from prior works, we represent the motion of such fluid elements in the image in the form of a constan… ▽ More

    Submitted 25 September, 2023; v1 submitted 6 December, 2021; originally announced December 2021.

  43. arXiv:2111.15582  [pdf, ps, other

    math.NT

    Hilbert's Irreducibility Theorem and Ideal Class Groups of Quadratic Fields

    Authors: Kaivalya Kulkarni, Aaron Levin

    Abstract: We prove a version of Hilbert's Irreducibility Theorem in the quadratic case, giving a quantitative improvement to a result of Bilu-Gillibert in this restricted setting. As an application, we give improvements to several quantitative results counting quadratic fields with certain types of ideal class groups. The proof of the main theorem is based on a result of Stewart and Top on values of binary… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

    MSC Class: 11R29 (Primary) 11G30 14H40 (Secondary)

  44. arXiv:2110.08879  [pdf, other

    eess.SY cs.GT

    Dynamic Tolling for Inducing Socially Optimal Traffic Loads

    Authors: Chinmay Maheshwari, Kshitij Kulkarni, Manxi Wu, Shankar Sastry

    Abstract: How to design tolls that induce socially optimal traffic loads with dynamically arriving travelers who make selfish routing decisions? We propose a two-timescale discrete-time stochastic dynamics that adaptively adjusts the toll prices on a parallel link network while accounting for the updates of traffic loads induced by the incoming and outgoing travelers and their route choices. The updates of… ▽ More

    Submitted 17 October, 2021; originally announced October 2021.

    Comments: 18 pages, 3 figures

  45. arXiv:2108.13702  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    SemIE: Semantically-aware Image Extrapolation

    Authors: Bholeshwar Khurana, Soumya Ranjan Dash, Abhishek Bhatia, Aniruddha Mahapatra, Hrituraj Singh, Kuldeep Kulkarni

    Abstract: We propose a semantically-aware novel paradigm to perform image extrapolation that enables the addition of new object instances. All previous methods are limited in their capability of extrapolation to merely extending the already existing objects in the image. However, our proposed approach focuses not only on (i) extending the already present objects but also on (ii) adding new objects in the ex… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: To appear in International Conference on Computer Vision (ICCV) 2021. Project URL: https://semie-iccv.github.io

  46. arXiv:2104.09907  [pdf

    cs.CV cs.LG

    Table Tennis Stroke Recognition Using Two-Dimensional Human Pose Estimation

    Authors: Kaustubh Milind Kulkarni, Sucheth Shenoy

    Abstract: We introduce a novel method for collecting table tennis video data and perform stroke detection and classification. A diverse dataset containing video data of 11 basic strokes obtained from 14 professional table tennis players, summing up to a total of 22111 videos has been collected using the proposed setup. The temporal convolutional neural network model developed using 2D pose estimation perfor… ▽ More

    Submitted 31 May, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: Accepted at CVPR Sports Workshop 2021 (7th International Workshop on Computer Vision in Sports) (CVSports)

  47. arXiv:2011.02544  [pdf, ps, other

    cs.MA cs.AI

    Social Choice with Changing Preferences: Representation Theorems and Long-Run Policies

    Authors: Kshitij Kulkarni, Sven Neth

    Abstract: We study group decision making with changing preferences as a Markov Decision Process. We are motivated by the increasing prevalence of automated decision-making systems when making choices for groups of people over time. Our main contribution is to show how classic representation theorems from social choice theory can be adapted to characterize optimal policies in this dynamic setting. We provide… ▽ More

    Submitted 4 November, 2020; originally announced November 2020.

    Comments: Accepted to the Workshop on Consequential Decision Making in Dynamic Environments, NeurIPS 2020

  48. arXiv:2010.12769  [pdf

    eess.IV

    Diverse R-PPG: Camera-Based Heart Rate Estimation for Diverse Subject Skin-Tones and Scenes

    Authors: Pradyumna Chari, Krish Kabra, Doruk Karinca, Soumyarup Lahiri, Diplav Srivastava, Kimaya Kulkarni, Tianyuan Chen, Maxime Cannesson, Laleh Jalilian, Achuta Kadambi

    Abstract: Heart rate (HR) is an essential clinical measure for the assessment of cardiorespiratory instability. Since communities of color are disproportionately affected by both COVID-19 and cardiovascular disease, there is a pressing need to deploy contactless HR sensing solutions for high-quality telemedicine evaluations. Existing computer vision methods that estimate HR from facial videos exhibit biased… ▽ More

    Submitted 9 December, 2020; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: 49 pages, 6 figures, 3 tables, Supplement with 7 figures

  49. arXiv:2007.13414  [pdf, other

    cs.LG cs.AI stat.ML

    Hyper-local sustainable assortment planning

    Authors: Nupur Aggarwal, Abhishek Bansal, Kushagra Manglik, Kedar Kulkarni, Vikas Raykar

    Abstract: Assortment planning, an important seasonal activity for any retailer, involves choosing the right subset of products to stock in each store.While existing approaches only maximize the expected revenue, we propose including the environmental impact too, through the Higg Material Sustainability Index. The trade-off between revenue and environmental impact is balanced through a multi-objective optimi… ▽ More

    Submitted 27 July, 2020; originally announced July 2020.

  50. arXiv:2004.08614  [pdf, other

    cs.CV cs.LG

    Halluci-Net: Scene Completion by Exploiting Object Co-occurrence Relationships

    Authors: Kuldeep Kulkarni, Tejas Gokhale, Rajhans Singh, Pavan Turaga, Aswin Sankaranarayanan

    Abstract: Recently, there has been substantial progress in image synthesis from semantic labelmaps. However, methods used for this task assume the availability of complete and unambiguous labelmaps, with instance boundaries of objects, and class labels for each pixel. This reliance on heavily annotated inputs restricts the application of image synthesis techniques to real-world applications, especially unde… ▽ More

    Submitted 20 May, 2021; v1 submitted 18 April, 2020; originally announced April 2020.

    Comments: Accepted to AI for Content Creation Workshop @CVPR 2021