
Showing 1–18 of 18 results for author: Madhavan, V

Searching in archive cs.
  1. arXiv:2410.03051  [pdf, other]

    cs.CV

    AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

    Authors: Wenhao Chai, Enxin Song, Yilun Du, Chenlin Meng, Vashisht Madhavan, Omer Bar-Tal, Jenq-Neng Hwang, Saining Xie, Christopher D. Manning

    Abstract: Video detailed captioning is a key task which aims to generate comprehensive and coherent textual descriptions of video content, benefiting both video understanding and generation. In this paper, we propose AuroraCap, a video captioner based on a large multimodal model. We follow the simplest architecture design without additional parameters for temporal modeling. To address the overhead caused by…

    Submitted 9 April, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: Accepted to ICLR 2025. Code, docs, weight, benchmark and training data are all available at https://rese1f.github.io/aurora-web/

  2. arXiv:2407.06209  [pdf, other]

    cs.LG

    Self-supervised Pretraining for Partial Differential Equations

    Authors: Varun Madhavan, Amal S Sebastian, Bharath Ramsundar, Venkatasubramanian Viswanathan

    Abstract: In this work, we describe a novel approach to building a neural PDE solver leveraging recent advances in transformer based neural network architectures. Our model can provide solutions for different values of PDE parameters without any need for retraining the network. The training is carried out in a self-supervised manner, similar to pretraining approaches applied in language and vision tasks. We…

    Submitted 3 July, 2024; originally announced July 2024.

  3. arXiv:2311.11694  [pdf, other]

    cs.LG stat.ML

    Unveiling the Power of Self-Attention for Shipping Cost Prediction: The Rate Card Transformer

    Authors: P Aditya Sreekar, Sahil Verma, Varun Madhavan, Abhishek Persad

    Abstract: Amazon ships billions of packages to its customers annually within the United States. Shipping costs of these packages are used on the day of shipping (day 0) to estimate profitability of sales. Downstream systems utilize these day 0 profitability estimates to make financial decisions, such as pricing strategies and delisting loss-making products. However, obtaining accurate shipping cost estimate…

    Submitted 20 November, 2023; originally announced November 2023.

  4. arXiv:2302.02595  [pdf]

    cs.LG

    Clarifying Trust of Materials Property Predictions using Neural Networks with Distribution-Specific Uncertainty Quantification

    Authors: Cameron Gruich, Varun Madhavan, Yixin Wang, Bryan Goldsmith

    Abstract: It is critical that machine learning (ML) model predictions be trustworthy for high-throughput catalyst discovery approaches. Uncertainty quantification (UQ) methods allow estimation of the trustworthiness of an ML model, but these methods have not been well explored in the field of heterogeneous catalysis. Herein, we investigate different UQ methods applied to a crystal graph convolutional neural…

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 28 pages, 16 figures (8 main text, 8 SI), submitted to Machine Learning: Science & Technology journal (MLST, IOP)

  5. arXiv:2205.14396  [pdf, other]

    cs.MA cs.CY cs.LG

    Deep Learning-based Spatially Explicit Emulation of an Agent-Based Simulator for Pandemic in a City

    Authors: Varun Madhavan, Adway Mitra, Partha Pratim Chakrabarti

    Abstract: Agent-Based Models are very useful for simulation of physical or social processes, such as the spreading of a pandemic in a city. Such models proceed by specifying the behavior of individuals (agents) and their interactions, and parameterizing the process of infection based on such interactions based on the geography and demography of the city. However, such models are computationally very expensi…

    Submitted 29 January, 2023; v1 submitted 28 May, 2022; originally announced May 2022.

  6. arXiv:2203.12610  [pdf, other]

    cs.LG astro-ph.EP nlin.SI physics.class-ph physics.flu-dyn

    AI Poincaré 2.0: Machine Learning Conservation Laws from Differential Equations

    Authors: Ziming Liu, Varun Madhavan, Max Tegmark

    Abstract: We present a machine learning algorithm that discovers conservation laws from differential equations, both numerically (parametrized as neural networks) and symbolically, ensuring their functional independence (a non-linear generalization of linear independence). Our independence module can be viewed as a nonlinear generalization of singular value decomposition. Our method can readily handle induc…

    Submitted 30 October, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: 15 pages, 12 figures

    Journal ref: Phys. Rev. E 106, 045307, 2022

  7. arXiv:2110.12370  [pdf, other]

    cs.CL

    Team Enigma at ArgMining-EMNLP 2021: Leveraging Pre-trained Language Models for Key Point Matching

    Authors: Manav Nitin Kapadnis, Sohan Patnaik, Siba Smarak Panigrahi, Varun Madhavan, Abhilash Nandy

    Abstract: We present the system description for our submission towards the Key Point Analysis Shared Task at ArgMining 2021. Track 1 of the shared task requires participants to develop methods to predict the match score between each pair of arguments and keypoints, provided they belong to the same topic under the same stance. We leveraged existing state of the art pre-trained language models along with inco…

    Submitted 24 October, 2021; originally announced October 2021.

  8. arXiv:2110.04475  [pdf, other]

    cs.CL

    Leveraging recent advances in Pre-Trained Language Models for Eye-Tracking Prediction

    Authors: Varun Madhavan, Aditya Girish Pawate, Shraman Pal, Abhranil Chandra

    Abstract: Cognitively inspired Natural Language Processing uses human-derived behavioral data like eye-tracking data, which reflect the semantic representations of language in the human brain to augment the neural nets to solve a range of tasks spanning syntax and semantics with the aim of teaching machines about language processing mechanisms. In this paper, we use the ZuCo 1.0 and ZuCo 2.0 dataset containi…

    Submitted 9 October, 2021; originally announced October 2021.

  9. arXiv:2106.06091  [pdf, other]

    cs.LG cs.AI

    DECORE: Deep Compression with Reinforcement Learning

    Authors: Manoj Alwani, Yang Wang, Vashisht Madhavan

    Abstract: Deep learning has become an increasingly popular and powerful methodology for modern pattern recognition systems. However, many deep neural networks have millions or billions of parameters, making them untenable for real-world applications due to constraints on memory size or latency requirements. As a result, efficient network compression techniques are often required for the widespread adoption…

    Submitted 7 February, 2022; v1 submitted 10 June, 2021; originally announced June 2021.

  10. arXiv:2104.01650  [pdf, other]

    cs.MA stat.AP

    City-scale Simulation of Covid-19 Pandemic and Intervention Policies using Agent-based Modelling

    Authors: Gaurav Suryawanshi, Varun Madhavan, Adway Mitra, Partha Pratim Chakrabarti

    Abstract: During the Covid-19 pandemic, most governments across the world imposed policies like lock-down of public spaces and restrictions on people's movements to minimize the spread of the virus through physical contact. However, such policies have grave social and economic costs, and so it is important to pre-assess their impacts. In this work we aim to visualize the dynamics of the pandemic in a city u…

    Submitted 9 September, 2021; v1 submitted 4 April, 2021; originally announced April 2021.

  11. arXiv:2003.01825  [pdf, other]

    cs.NE cs.AI cs.LG

    Scaling MAP-Elites to Deep Neuroevolution

    Authors: Cédric Colas, Joost Huizinga, Vashisht Madhavan, Jeff Clune

    Abstract: Quality-Diversity (QD) algorithms, and MAP-Elites (ME) in particular, have proven very useful for a broad range of applications including enabling real robots to recover quickly from joint damage, solving strongly deceptive maze tasks or evolving robot morphologies to discover new gaits. However, present implementations of MAP-Elites and other QD algorithms seem to be limited to low-dimensional co…

    Submitted 5 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: Accepted to GECCO 2020

  12. arXiv:1812.07069  [pdf, other]

    cs.NE

    An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents

    Authors: Felipe Petroski Such, Vashisht Madhavan, Rosanne Liu, Rui Wang, Pablo Samuel Castro, Yulun Li, Jiale Zhi, Ludwig Schubert, Marc G. Bellemare, Jeff Clune, Joel Lehman

    Abstract: Much human and computational effort has aimed to improve how deep reinforcement learning algorithms perform on benchmarks such as the Atari Learning Environment. Comparatively less effort has focused on understanding what has been learned by such methods, and investigating and comparing the representations learned by different families of reinforcement learning (RL) algorithms. Sources of friction…

    Submitted 29 May, 2019; v1 submitted 17 December, 2018; originally announced December 2018.

  13. arXiv:1805.04687  [pdf, other]

    cs.CV

    BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning

    Authors: Fisher Yu, Haofeng Chen, Xin Wang, Wenqi Xian, Yingying Chen, Fangchen Liu, Vashisht Madhavan, Trevor Darrell

    Abstract: Datasets drive vision progress, yet existing driving datasets are impoverished in terms of visual content and supported tasks to study multitask learning for autonomous driving. Researchers are usually constrained to study a small set of problems on one dataset, while real-world computer vision applications require performing tasks of various complexities. We construct BDD100K, the largest driving…

    Submitted 8 April, 2020; v1 submitted 12 May, 2018; originally announced May 2018.

    Comments: Published at IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020

  14. arXiv:1712.06567  [pdf, other]

    cs.NE cs.LG

    Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning

    Authors: Felipe Petroski Such, Vashisht Madhavan, Edoardo Conti, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: Deep artificial neural networks (DNNs) are typically trained via gradient-based learning algorithms, namely backpropagation. Evolution strategies (ES) can rival backprop-based algorithms such as Q-learning and policy gradients on challenging deep reinforcement learning (RL) problems. However, ES can be considered a gradient-based algorithm because it performs stochastic gradient descent via an ope…

    Submitted 20 April, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  15. arXiv:1712.06560  [pdf, other]

    cs.AI

    Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents

    Authors: Edoardo Conti, Vashisht Madhavan, Felipe Petroski Such, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: Evolution strategies (ES) are a family of black-box optimization algorithms able to train deep neural networks roughly as well as Q-learning and policy gradient methods on challenging deep reinforcement learning (RL) problems, but are much faster (e.g. hours vs. days) because they parallelize better. However, many RL problems require directed exploration because they have reward functions that are…

    Submitted 29 October, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  16. arXiv:1207.6600  [pdf, other]

    cs.IR cs.AI cs.SI

    Diversity in Ranking using Negative Reinforcement

    Authors: Rama Badrinath, C. E. Veni Madhavan

    Abstract: In this paper, we consider the problem of diversity in ranking of the nodes in a graph. The task is to pick the top-k nodes in the graph which are both 'central' and 'diverse'. Many graph-based models of NLP like text summarization, opinion summarization involve the concept of diversity in generating the summaries. We develop a novel method which works in an iterative fashion based on random walks…

    Submitted 27 July, 2012; originally announced July 2012.

  17. arXiv:1111.4898  [pdf, other]

    cs.SI physics.soc-ph

    A Navigation Algorithm Inspired by Human Navigation

    Authors: Vijesh M., Sudarshan Iyengar, Vijay Mahantesh, Amitash Ramesh, Veni Madhavan

    Abstract: Human navigation has been a topic of interest in spatial cognition from the past few decades. It has been experimentally observed that humans accomplish the task of way-finding a destination in an unknown environment by recognizing landmarks. Investigations using network analytic techniques reveal that humans, when asked to way-find their destination, learn the top ranked nodes of a network. In th…

    Submitted 21 November, 2011; originally announced November 2011.

    Comments: Human Navigation, Path Concatenation, Hotspots, Center Strategic Paths, Approximation Algorithm

  18. arXiv:0901.0529  [pdf, ps, other]

    cs.OH cs.CR

    Measures for classification and detection in steganalysis

    Authors: Sujit Gujar, C E Veni Madhavan

    Abstract: Still and multi-media images are subject to transformations for compression, steganographic embedding and digital watermarking. In a major program of activities we are engaged in the modeling, design and analysis of digital content. Statistical and pattern classification techniques should be combined with understanding of run length, transform coding techniques, and also encryption techniques.

    Submitted 5 January, 2009; originally announced January 2009.

    Comments: 15 pages, 8 figures