Skip to main content

Showing 1–26 of 26 results for author: Agrawal, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2412.08824  [pdf, other

    cs.LG stat.ML

    Disentangling impact of capacity, objective, batchsize, estimators, and step-size on flow VI

    Authors: Abhinav Agrawal, Justin Domke

    Abstract: Normalizing flow-based variational inference (flow VI) is a promising approximate inference approach, but its performance remains inconsistent across studies. Numerous algorithmic choices influence flow VI's performance. We conduct a step-by-step analysis to disentangle the impact of some of the key factors: capacity, objectives, gradient estimators, number of gradient estimates (batchsize), and s… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

  2. arXiv:2405.19747  [pdf, other

    cs.LG stat.ML

    Understanding and mitigating difficulties in posterior predictive evaluation

    Authors: Abhinav Agrawal, Justin Domke

    Abstract: Predictive posterior densities (PPDs) are of interest in approximate Bayesian inference. Typically, these are estimated by simple Monte Carlo (MC) averages using samples from the approximate posterior. We observe that the signal-to-noise ratio (SNR) of such estimators can be extremely low. An analysis for exact inference reveals SNR decays exponentially as there is an increase in (a) the mismatch… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2312.03607  [pdf, other

    math.OC physics.comp-ph stat.ML

    From concrete mixture to structural design -- a holistic optimization procedure in the presence of uncertainties

    Authors: Atul Agrawal, Erik Tamsen, Phaedon-Stelios Koutsourelakis, Joerg F. Unger

    Abstract: Designing civil structures such as bridges, dams or buildings is a complex task requiring many synergies from several experts. Each is responsible for different parts of the process. This is often done in a sequential manner, e.g. the structural engineer makes a design under the assumption of certain material properties (e.g. the strength class of the concrete), and then the material engineer opti… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  4. arXiv:2311.15137  [pdf, other

    math.OC cs.LG stat.ML

    Multi-fidelity Constrained Optimization for Stochastic Black Box Simulators

    Authors: Atul Agrawal, Kislaya Ravi, Phaedon-Stelios Koutsourelakis, Hans-Joachim Bungartz

    Abstract: Constrained optimization of the parameters of a simulator plays a crucial role in a design process. These problems become challenging when the simulator is stochastic, computationally expensive, and the parameter space is high-dimensional. One can efficiently perform optimization only by utilizing the gradient with respect to the parameters, but these gradients are unavailable in many legacy, blac… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  5. arXiv:2307.02432  [pdf, other

    physics.flu-dyn cs.LG physics.comp-ph stat.ML

    A probabilistic, data-driven closure model for RANS simulations with aleatoric, model uncertainty

    Authors: Atul Agrawal, Phaedon-Stelios Koutsourelakis

    Abstract: We propose a data-driven, closure model for Reynolds-averaged Navier-Stokes (RANS) simulations that incorporates aleatoric, model uncertainty. The proposed closure consists of two parts. A parametric one, which utilizes previously proposed, neural-network-based tensor basis functions dependent on the rate of strain and rotation tensor invariants. This is complemented by latent, random variables wh… ▽ More

    Submitted 15 April, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: 31 pages, 10 figures

  6. arXiv:2305.02573  [pdf, other

    stat.ML cs.LG math.OC

    Joint Graph Learning and Model Fitting in Laplacian Regularized Stratified Models

    Authors: Ziheng Cheng, Junzi Zhang, Akshay Agrawal, Stephen Boyd

    Abstract: Laplacian regularized stratified models (LRSM) are models that utilize the explicit or implicit network structure of the sub-problems as defined by the categorical features called strata (e.g., age, region, time, forecast horizon, etc.), and draw upon data from neighboring strata to enhance the parameter learning of each sub-problem. They have been widely applied in machine learning and signal pro… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 32 pages, 10 figures

  7. A Model for Censored Reliability Data with Two Dependent Failure Modes and Prediction of Future Failures

    Authors: Aakash Agrawal, Debanjan Mitra, Ayon Ganguly

    Abstract: Quite often, we observe reliability data with two failure modes that may influence each other, resulting in a setting of dependent failure modes. Here, we discuss modelling of censored reliability data with two dependent failure modes by using a bivariate Weibull model with distinct shape parameters which we construct as an extension of the well-known Marshall-Olkin bivariate exponential model in… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

  8. arXiv:2206.09138  [pdf, other

    stat.ME stat.AP

    Application of a General Family of Bivariate Distributions in Modelling Dependent Competing Risks Data with Associated Model Selection

    Authors: Aakash Agrawal, Ayon Ganguly, Debanjan Mitra

    Abstract: In this article, a general family of bivariate distributions is used to model competing risks data with dependent factors. The general structure of competing risks data considered here includes ties. A comprehensive inferential framework for the proposed model is presented: maximum likelihood estimation, confidence interval construction, and model selection within the bivariate family of distribut… ▽ More

    Submitted 18 June, 2022; originally announced June 2022.

  9. arXiv:2111.03144  [pdf, other

    cs.LG stat.ML

    Amortized Variational Inference for Simple Hierarchical Models

    Authors: Abhinav Agrawal, Justin Domke

    Abstract: It is difficult to use subsampling with variational inference in hierarchical models since the number of local latent variables scales with the dataset. Thus, inference in hierarchical models remains a challenge at large scale. It is helpful to use a variational family with structure matching the posterior, but optimization is still slow due to the huge number of local distributions. Instead, this… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: Neural Information Processing Systems (NeurIPS) 2021

  10. arXiv:2103.03565  [pdf, other

    cs.LG physics.comp-ph physics.flu-dyn stat.ML

    Physics-aware deep neural networks for surrogate modeling of turbulent natural convection

    Authors: Didier Lucor, Atul Agrawal, Anne Sergent

    Abstract: Recent works have explored the potential of machine learning as data-driven turbulence closures for RANS and LES techniques. Beyond these advances, the high expressivity and agility of physics-informed neural networks (PINNs) make them promising candidates for full fluid flow PDE modeling. An important question is whether this new paradigm, exempt from the traditional notion of discretization of t… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

  11. arXiv:2103.02559  [pdf, other

    cs.LG math.OC stat.ML

    Minimum-Distortion Embedding

    Authors: Akshay Agrawal, Alnur Ali, Stephen Boyd

    Abstract: We consider the vector embedding problem. We are given a finite set of items, with the goal of assigning a representative vector to each one, possibly under some constraints (such as the collection of vectors being standardized, i.e., having zero mean and unit covariance). We are given data indicating that some pairs of items are similar, and optionally, some other pairs are dissimilar. For pairs… ▽ More

    Submitted 24 August, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

  12. arXiv:2006.13070  [pdf, other

    stat.ML cs.LG

    Normalizing Flows Across Dimensions

    Authors: Edmond Cunningham, Renos Zabounidis, Abhinav Agrawal, Madalina Fiterau, Daniel Sheldon

    Abstract: Real-world data with underlying structure, such as pictures of faces, are hypothesized to lie on a low-dimensional manifold. This manifold hypothesis has motivated state-of-the-art generative algorithms that learn low-dimensional data representations. Unfortunately, a popular generative model, normalizing flows, cannot take advantage of this. Normalizing flows are based on successive variable tran… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

  13. arXiv:2006.10343  [pdf, other

    cs.LG stat.ML

    Advances in Black-Box VI: Normalizing Flows, Importance Weighting, and Optimization

    Authors: Abhinav Agrawal, Daniel Sheldon, Justin Domke

    Abstract: Recent research has seen several advances relevant to black-box VI, but the current state of automatic posterior inference is unclear. One such advance is the use of normalizing flows to define flexible posterior densities for deep latent variable models. Another direction is the integration of Monte-Carlo methods to serve two purposes; first, to obtain tighter variational objectives for optimizat… ▽ More

    Submitted 23 October, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Neural Information Processing Systems (NeurIPS) 2020

  14. arXiv:2006.07909  [pdf, ps, other

    cs.LG cs.CL cs.CV stat.ML

    Leveraging Multimodal Behavioral Analytics for Automated Job Interview Performance Assessment and Feedback

    Authors: Anumeha Agrawal, Rosa Anil George, Selvan Sunitha Ravi, Sowmya Kamath S, Anand Kumar M

    Abstract: Behavioral cues play a significant part in human communication and cognitive perception. In most professional domains, employee recruitment policies are framed such that both professional skills and personality traits are adequately assessed. Hiring interviews are structured to evaluate expansively a potential employee's suitability for the position - their professional qualifications, interperson… ▽ More

    Submitted 16 June, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: 9 pages, ACL 2020

  15. arXiv:2006.04248  [pdf, other

    cs.LG stat.ML

    Learning Convex Optimization Models

    Authors: Akshay Agrawal, Shane Barratt, Stephen Boyd

    Abstract: A convex optimization model predicts an output from an input by solving a convex optimization problem. The class of convex optimization models is large, and includes as special cases many well-known models like linear and logistic regression. We propose a heuristic for learning the parameters in a convex optimization model given a dataset of input-output pairs, using recently developed methods for… ▽ More

    Submitted 18 June, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: Authors listed in alphabetical order

  16. arXiv:2005.13625  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Revisiting Parameter Sharing in Multi-Agent Deep Reinforcement Learning

    Authors: J. K. Terry, Nathaniel Grammel, Sanghyun Son, Benjamin Black, Aakriti Agrawal

    Abstract: Parameter sharing, where each agent independently learns a policy with fully shared parameters between all policies, is a popular baseline method for multi-agent deep reinforcement learning. Unfortunately, since all agents share the same policy network, they cannot learn different policies or tasks. This issue has been circumvented experimentally by adding an agent-specific indicator signal to obs… ▽ More

    Submitted 31 October, 2023; v1 submitted 27 May, 2020; originally announced May 2020.

  17. arXiv:2001.01861  [pdf, other

    cs.LG cs.DC stat.ML

    Vamsa: Automated Provenance Tracking in Data Science Scripts

    Authors: Mohammad Hossein Namaki, Avrilia Floratou, Fotis Psallidas, Subru Krishnan, Ashvin Agrawal, Yinghui Wu, Yiwen Zhu, Markus Weimer

    Abstract: There has recently been a lot of ongoing research in the areas of fairness, bias and explainability of machine learning (ML) models due to the self-evident or regulatory requirements of various ML applications. We make the following observation: All of these approaches require a robust understanding of the relationship between ML models and the data used to train them. In this work, we introduce t… ▽ More

    Submitted 30 July, 2020; v1 submitted 6 January, 2020; originally announced January 2020.

  18. arXiv:1910.12430  [pdf, other

    cs.LG math.OC stat.ML

    Differentiable Convex Optimization Layers

    Authors: Akshay Agrawal, Brandon Amos, Shane Barratt, Stephen Boyd, Steven Diamond, Zico Kolter

    Abstract: Recent work has shown how to embed differentiable optimization problems (that is, problems whose solutions can be backpropagated through) as layers within deep learning architectures. This method provides a useful inductive bias for certain problems, but existing software for differentiable optimization layers is rigid and difficult to apply to new settings. In this paper, we propose an approach t… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: In NeurIPS 2019. Code available at https://www.github.com/cvxgrp/cvxpylayers. Authors in alphabetical order

  19. arXiv:1909.00052  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Learning Digital Circuits: A Journey Through Weight Invariant Self-Pruning Neural Networks

    Authors: Amey Agrawal, Rohit Karlupia

    Abstract: Recently, in the paper "Weight Agnostic Neural Networks" Gaier & Ha utilized architecture search to find networks where the topology completely encodes the knowledge. However, architecture search in topology space is expensive. We use the existing framework of binarized networks to find performant topologies by constraining the weights to be either, zero or one. We show that such topologies achiev… ▽ More

    Submitted 3 May, 2020; v1 submitted 30 August, 2019; originally announced September 2019.

  20. arXiv:1907.12953  [pdf, other

    cs.LG stat.ML

    A real-time iterative machine learning approach for temperature profile prediction in additive manufacturing processes

    Authors: Arindam Paul, Mojtaba Mozaffar, Zijiang Yang, Wei-keng Liao, Alok Choudhary, Jian Cao, Ankit Agrawal

    Abstract: Additive Manufacturing (AM) is a manufacturing paradigm that builds three-dimensional objects from a computer-aided design model by successively adding material layer by layer. AM has become very popular in the past decade due to its utility for fast prototyping such as 3D printing as well as manufacturing functional parts with complex geometries using processes such as laser metal deposition that… ▽ More

    Submitted 9 August, 2019; v1 submitted 28 July, 2019; originally announced July 2019.

    Comments: 10 pages, 8 figures

    Journal ref: 6th IEEE International Conference on Data Science and Advanced Analytics (DSAA), 2019

  21. arXiv:1907.03222  [pdf, other

    physics.comp-ph cs.LG stat.ML

    IRNet: A General Purpose Deep Residual Regression Framework for Materials Discovery

    Authors: Dipendra Jha, Logan Ward, Zijiang Yang, Christopher Wolverton, Ian Foster, Wei-keng Liao, Alok Choudhary, Ankit Agrawal

    Abstract: Materials discovery is crucial for making scientific advances in many domains. Collections of data from experiments and first-principle computations have spurred interest in applying machine learning methods to create predictive models capable of mapping from composition and crystal structures to materials properties. Generally, these are regression problems with the input being a 1D vector compos… ▽ More

    Submitted 7 July, 2019; originally announced July 2019.

    Comments: 9 pages, under publication at KDD'19

  22. arXiv:1903.03178  [pdf, other

    cs.LG physics.chem-ph stat.ML

    Transfer Learning Using Ensemble Neural Networks for Organic Solar Cell Screening

    Authors: Arindam Paul, Dipendra Jha, Reda Al-Bahrani, Wei-keng Liao, Alok Choudhary, Ankit Agrawal

    Abstract: Organic Solar Cells are a promising technology for solving the clean energy crisis in the world. However, generating candidate chemical compounds for solar cells is a time-consuming process requiring thousands of hours of laboratory analysis. For a solar cell, the most important property is the power conversion efficiency which is dependent on the highest occupied molecular orbitals (HOMO) values… ▽ More

    Submitted 28 July, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

    Comments: 8 pages, 11 figures, International Joint Conference on Neural Networks

    Journal ref: International Joint Conference on Neural Networks, Budapest Hungary, 14-19 July 2019

  23. arXiv:1901.06588  [pdf, other

    cs.LG stat.ML

    Accumulation Bit-Width Scaling For Ultra-Low Precision Training Of Deep Networks

    Authors: Charbel Sakr, Naigang Wang, Chia-Yu Chen, Jungwook Choi, Ankur Agrawal, Naresh Shanbhag, Kailash Gopalakrishnan

    Abstract: Efforts to reduce the numerical precision of computations in deep learning training have yielded systems that aggressively quantize weights and activations, yet employ wide high-precision accumulators for partial sums in inner-product operations to preserve the quality of convergence. The absence of any framework to analyze the precision requirements of partial sum accumulations results in conserv… ▽ More

    Submitted 19 January, 2019; originally announced January 2019.

    Comments: Published as a conference paper in ICLR 2019

  24. arXiv:1812.00898  [pdf, other

    cs.LG cs.CL cs.CV stat.ML

    Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning

    Authors: Aishwarya Agrawal, Mateusz Malinowski, Felix Hill, Ali Eslami, Oriol Vinyals, Tejas Kulkarni

    Abstract: Advances in Deep Reinforcement Learning have led to agents that perform well across a variety of sensory-motor domains. In this work, we study the setting in which an agent must learn to generate programs for diverse scenes conditioned on a given symbolic instruction. Final goals are specified to our agent via images of the scenes. A symbolic instruction consistent with the goal images is used as… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

  25. arXiv:1712.02679  [pdf, other

    cs.LG stat.ML

    AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training

    Authors: Chia-Yu Chen, Jungwook Choi, Daniel Brand, Ankur Agrawal, Wei Zhang, Kailash Gopalakrishnan

    Abstract: Highly distributed training of Deep Neural Networks (DNNs) on future compute platforms (offering 100 of TeraOps/s of computational capacity) is expected to be severely communication constrained. To overcome this limitation, new gradient compression techniques are needed that are computationally friendly, applicable to a wide variety of layers seen in Deep Neural Networks and adaptable to variation… ▽ More

    Submitted 7 December, 2017; originally announced December 2017.

    Comments: IBM Research AI, 9 pages, 7 figures, AAAI18 accepted

  26. arXiv:1502.02551  [pdf, other

    cs.LG cs.NE stat.ML

    Deep Learning with Limited Numerical Precision

    Authors: Suyog Gupta, Ankur Agrawal, Kailash Gopalakrishnan, Pritish Narayanan

    Abstract: Training of large-scale deep neural networks is often constrained by the available computational resources. We study the effect of limited precision data representation and computation on neural network training. Within the context of low-precision fixed-point computations, we observe the rounding scheme to play a crucial role in determining the network's behavior during training. Our results show… ▽ More

    Submitted 9 February, 2015; originally announced February 2015.

    Comments: 10 pages, 6 figures, 1 table