Skip to main content

Showing 1–45 of 45 results for author: Honavar, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.20059  [pdf, ps, other

    cs.AI

    DiaLLMs: EHR Enhanced Clinical Conversational System for Clinical Test Recommendation and Diagnosis Prediction

    Authors: Weijieying Ren, Tianxiang Zhao, Lei Wang, Tianchun Wang, Vasant Honavar

    Abstract: Recent advances in Large Language Models (LLMs) have led to remarkable progresses in medical consultation. However, existing medical LLMs overlook the essential role of Electronic Health Records (EHR) and focus primarily on diagnosis recommendation, limiting their clinical applicability. We propose DiaLLM, the first medical LLM that integrates heterogeneous EHR data into clinically grounded dialog… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

    Journal ref: published in ACL 2025

  2. arXiv:2504.01995  [pdf, other

    cs.AI cs.LG

    Brains vs. Bytes: Evaluating LLM Proficiency in Olympiad Mathematics

    Authors: Hamed Mahdavi, Alireza Hashemi, Majid Daliri, Pegah Mohammadipour, Alireza Farhadi, Samira Malek, Yekta Yazdanifard, Amir Khasahmadi, Vasant Honavar

    Abstract: Recent advances in large language models (LLMs) have shown impressive progress in mathematical reasoning tasks. However, current evaluation benchmarks predominantly focus on the accuracy of final answers, often overlooking the crucial logical rigor for mathematical problem solving. The claim that state-of-the-art LLMs can solve Math Olympiad-level problems requires closer examination. To explore t… ▽ More

    Submitted 10 April, 2025; v1 submitted 31 March, 2025; originally announced April 2025.

  3. arXiv:2503.09814  [pdf

    cond-mat.mtrl-sci cs.LG

    A practical guide to machine learning interatomic potentials -- Status and future

    Authors: Ryan Jacobs, Dane Morgan, Siamak Attarian, Jun Meng, Chen Shen, Zhenghao Wu, Clare Yijia Xie, Julia H. Yang, Nongnuch Artrith, Ben Blaiszik, Gerbrand Ceder, Kamal Choudhary, Gabor Csanyi, Ekin Dogus Cubuk, Bowen Deng, Ralf Drautz, Xiang Fu, Jonathan Godwin, Vasant Honavar, Olexandr Isayev, Anders Johansson, Boris Kozinsky, Stefano Martiniani, Shyue Ping Ong, Igor Poltavsky , et al. (5 additional authors not shown)

    Abstract: The rapid development and large body of literature on machine learning interatomic potentials (MLIPs) can make it difficult to know how to proceed for researchers who are not experts but wish to use these tools. The spirit of this review is to help such researchers by serving as a practical, accessible guide to the state-of-the-art in MLIPs. This review paper covers a broad range of topics related… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Journal ref: Current Opinion in Solid State and Materials Science, 35, 101214 (2025)

  4. arXiv:2503.05079  [pdf, other

    cs.LG

    On a Connection Between Imitation Learning and RLHF

    Authors: Teng Xiao, Yige Yuan, Mingxiao Li, Zhengyu Chen, Vasant G Honavar

    Abstract: This work studies the alignment of large language models with preference data from an imitation learning perspective. We establish a close theoretical connection between reinforcement learning from human feedback RLHF and imitation learning (IL), revealing that RLHF implicitly performs imitation learning on the preference data distribution. Building on this connection, we propose DIL, a principled… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: ICLR 2025

  5. arXiv:2502.00883  [pdf, other

    cs.LG cs.CL

    SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters

    Authors: Teng Xiao, Yige Yuan, Zhengyu Chen, Mingxiao Li, Shangsong Liang, Zhaochun Ren, Vasant G Honavar

    Abstract: Existing preference optimization objectives for language model alignment require additional hyperparameters that must be extensively tuned to achieve optimal performance, increasing both the complexity and time required for fine-tuning large language models. In this paper, we propose a simple yet effective hyperparameter-free preference optimization algorithm for alignment. We observe that promisi… ▽ More

    Submitted 20 February, 2025; v1 submitted 2 February, 2025; originally announced February 2025.

    Comments: ICLR 2025

  6. arXiv:2501.16562  [pdf, other

    cs.LG stat.ME

    C-HDNet: A Fast Hyperdimensional Computing Based Method for Causal Effect Estimation from Networked Observational Data

    Authors: Abhishek Dalvi, Neil Ashtekar, Vasant Honavar

    Abstract: We consider the problem of estimating causal effects from observational data in the presence of network confounding. In this context, an individual's treatment assignment and outcomes may be affected by their neighbors within the network. We propose a novel matching technique which leverages hyperdimensional computing to model network information and improve predictive performance. We present resu… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  7. arXiv:2501.03540  [pdf, ps, other

    cs.LG cs.AI

    Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions

    Authors: Weijieying Ren, Tianxiang Zhao, Yuqing Huang, Vasant Honavar

    Abstract: Tabular data remains one of the most prevalent data types across a wide range of real-world applications, yet effective representation learning for this domain poses unique challenges due to its irregular patterns, heterogeneous feature distributions, and complex inter-column dependencies. This survey provides a comprehensive review of state-of-the-art techniques in tabular data representation lea… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  8. arXiv:2412.14516  [pdf, other

    cs.LG cs.CL

    Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment

    Authors: Teng Xiao, Yige Yuan, Huaisheng Zhu, Mingxiao Li, Vasant G Honavar

    Abstract: We study the problem of aligning large language models (LLMs) with human preference data. Contrastive preference optimization has shown promising results in aligning LLMs with available preference data by optimizing the implicit reward associated with the policy. However, the contrastive objective focuses mainly on the relative values of implicit rewards associated with two responses while ignorin… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: Accepted by NeurIPS 2024 Main

  9. arXiv:2411.10821  [pdf, other

    cs.LG q-bio.BM

    GeomCLIP: Contrastive Geometry-Text Pre-training for Molecules

    Authors: Teng Xiao, Chao Cui, Huaisheng Zhu, Vasant G. Honavar

    Abstract: Pretraining molecular representations is crucial for drug and material discovery. Recent methods focus on learning representations from geometric structures, effectively capturing 3D position information. Yet, they overlook the rich information in biomedical texts, which detail molecules' properties and substructures. With this in mind, we set up a data collection effort for 200K pairs of ground-s… ▽ More

    Submitted 16 November, 2024; originally announced November 2024.

    Comments: BIBM 2024

  10. arXiv:2410.10093  [pdf, other

    cs.CL cs.LG

    How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective

    Authors: Teng Xiao, Mingxiao Li, Yige Yuan, Huaisheng Zhu, Chao Cui, Vasant G Honavar

    Abstract: This paper introduces a novel generalized self-imitation learning ($\textbf{GSIL}$) framework, which effectively and efficiently aligns large language models with offline demonstration data. We develop $\textbf{GSIL}$ by deriving a surrogate objective of imitation learning with density ratio estimates, facilitating the use of self-generated data and optimizing the imitation learning objective with… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: EMNLP 2024 Main

  11. arXiv:2409.06080  [pdf

    cond-mat.mtrl-sci cs.LG

    Regression with Large Language Models for Materials and Molecular Property Prediction

    Authors: Ryan Jacobs, Maciej P. Polak, Lane E. Schultz, Hamed Mahdavi, Vasant Honavar, Dane Morgan

    Abstract: We demonstrate the ability of large language models (LLMs) to perform material and molecular property regression tasks, a significant deviation from the conventional LLM use case. We benchmark the Large Language Model Meta AI (LLaMA) 3 on several molecular properties in the QM9 dataset and 24 materials properties. Only composition-based input strings are used as the model input and we fine tune on… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  12. arXiv:2404.10907  [pdf, other

    cs.AI

    Causal Effect Estimation Using Random Hyperplane Tessellations

    Authors: Abhishek Dalvi, Neil Ashtekar, Vasant Honavar

    Abstract: Matching is one of the simplest approaches for estimating causal effects from observational data. Matching techniques compare the observed outcomes across pairs of individuals with similar covariate values but different treatment statuses in order to estimate causal effects. However, traditional matching techniques are unreliable given high-dimensional covariates due to the infamous curse of dimen… ▽ More

    Submitted 19 September, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: At CLeaR 2024

  13. arXiv:2403.08167  [pdf, other

    cs.LG cs.CL q-bio.QM

    MolBind: Multimodal Alignment of Language, Molecules, and Proteins

    Authors: Teng Xiao, Chao Cui, Huaisheng Zhu, Vasant G. Honavar

    Abstract: Recent advancements in biology and chemistry have leveraged multi-modal learning, integrating molecules and their natural language descriptions to enhance drug discovery. However, current pre-training frameworks are limited to two modalities, and designing a unified network to process different modalities (e.g., natural language, 2D molecular graphs, 3D molecular conformations, and 3D proteins) re… ▽ More

    Submitted 2 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Report number: 2403.08167

  14. arXiv:2403.07179  [pdf, other

    cs.LG cs.CL q-bio.BM

    3M-Diffusion: Latent Multi-Modal Diffusion for Language-Guided Molecular Structure Generation

    Authors: Huaisheng Zhu, Teng Xiao, Vasant G Honavar

    Abstract: Generating molecular structures with desired properties is a critical task with broad applications in drug discovery and materials design. We propose 3M-Diffusion, a novel multi-modal molecular graph generation method, to generate diverse, ideally novel molecular structures with desired properties. 3M-Diffusion encodes molecular graphs into a graph latent space which it then aligns with the text s… ▽ More

    Submitted 2 October, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  15. arXiv:2402.17073  [pdf, other

    cs.LG cs.AI cs.SI

    Hyperdimensional Representation Learning for Node Classification and Link Prediction

    Authors: Abhishek Dalvi, Vasant Honavar

    Abstract: We introduce Hyperdimensional Graph Learner (HDGL), a novel method for node classification and link prediction in graphs. HDGL maps node features into a very high-dimensional space (\textit{hyperdimensional} or HD space for short) using the \emph{injectivity} property of node representations in a family of Graph Neural Networks (GNNs) and then uses HD operators such as \textit{bundling} and \texti… ▽ More

    Submitted 26 February, 2025; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted by WSDM 2025

  16. arXiv:2401.05667  [pdf, other

    cs.LG cs.AI

    EsaCL: Efficient Continual Learning of Sparse Models

    Authors: Weijieying Ren, Vasant G Honavar

    Abstract: A key challenge in the continual learning setting is to efficiently learn a sequence of tasks without forgetting how to perform previously learned tasks. Many existing approaches to this problem work by either retraining the model on previous tasks or by expanding the model to accommodate new tasks. However, these approaches typically suffer from increased storage and computational requirements, a… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: SDM 2024 : SIAM International Conference on Data Mining

  17. arXiv:2307.16307  [pdf, other

    cs.AI cs.DB cs.LO

    Representing and Reasoning with Multi-Stakeholder Qualitative Preference Queries

    Authors: Samik Basu, Vasant Honavar, Ganesh Ram Santhanam, Jia Tao

    Abstract: Many decision-making scenarios, e.g., public policy, healthcare, business, and disaster response, require accommodating the preferences of multiple stakeholders. We offer the first formal treatment of reasoning with multi-stakeholder qualitative preferences in a setting where stakeholders express their preferences in a qualitative preference language, e.g., CP-net, CI-net, TCP-net, CP-Theory. We i… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: A shorter version is published in the proceeding of 26th European Conference on Artificial Intelligence ECAI 2023

  18. arXiv:2211.02789  [pdf, other

    cs.LG

    Forecasting User Interests Through Topic Tag Predictions in Online Health Communities

    Authors: Amogh Subbakrishna Adishesha, Lily Jakielaszek, Fariha Azhar, Peixuan Zhang, Vasant Honavar, Fenglong Ma, Chandra Belani, Prasenjit Mitra, Sharon Xiaolei Huang

    Abstract: The increasing reliance on online communities for healthcare information by patients and caregivers has led to the increase in the spread of misinformation, or subjective, anecdotal and inaccurate or non-specific recommendations, which, if acted on, could cause serious harm to the patients. Hence, there is an urgent need to connect users with accurate and tailored health information in a timely ma… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: Healthcare Informatics and NLP

  19. arXiv:2108.00079  [pdf, other

    cs.CR cs.LG

    Zooming Into the Darknet: Characterizing Internet Background Radiation and its Structural Changes

    Authors: Michalis Kallitsis, Vasant Honavar, Rupesh Prajapati, Dinghao Wu, John Yen

    Abstract: Network telescopes or "Darknets" provide a unique window into Internet-wide malicious activities associated with malware propagation, denial of service attacks, scanning performed for network reconnaissance, and others. Analyses of the resulting data can provide actionable insights to security analysts that can be used to prevent or mitigate cyber-threats. Large Darknets, however, observe millions… ▽ More

    Submitted 5 August, 2021; v1 submitted 28 July, 2021; originally announced August 2021.

    Comments: 30 pages, 14 tables, 12 figures, arXiv.org perpetual, non-exclusive license

  20. arXiv:2011.11631  [pdf, ps, other

    cs.LG

    Explainable Multivariate Time Series Classification: A Deep Neural Network Which Learns To Attend To Important Variables As Well As Informative Time Intervals

    Authors: Tsung-Yu Hsieh, Suhang Wang, Yiwei Sun, Vasant Honavar

    Abstract: Time series data is prevalent in a wide variety of real-world applications and it calls for trustworthy and explainable models for people to understand and fully trust decisions made by AI solutions. We consider the problem of building explainable classifiers from multi-variate time series data. A key criterion to understand such predictive models involves elucidating and quantifying the contribut… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

  21. arXiv:2010.01101  [pdf

    stat.AP cs.CY cs.LG q-bio.PE

    Commuting Network Spillovers and COVID-19 Deaths Across US Counties

    Authors: Christopher Seto, Aria Khademi, Corina Graif, Vasant G. Honavar

    Abstract: This study explored how population mobility flows form commuting networks across US counties and influence the spread of COVID-19. We utilized 3-level mixed effects negative binomial regression models to estimate the impact of network COVID-19 exposure on county confirmed cases and deaths over time. We also conducted weighting-based analyses to estimate the causal effect of network exposure. Resul… ▽ More

    Submitted 10 February, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Accepted for Presentation at The Population Association of America 2021

  22. arXiv:2008.00357  [pdf, other

    cs.LG cs.AI stat.ML

    A Causal Lens for Peeking into Black Box Predictive Models: Predictive Model Interpretation via Causal Attribution

    Authors: Aria Khademi, Vasant Honavar

    Abstract: With the increasing adoption of predictive models trained using machine learning across a wide range of high-stakes applications, e.g., health care, security, criminal justice, finance, and education, there is a growing need for effective techniques for explaining such models and their predictions. We aim to address this problem in settings where the predictive model is a black box; That is, we ca… ▽ More

    Submitted 1 August, 2020; originally announced August 2020.

  23. arXiv:2005.11770  [pdf, other

    stat.ML cs.LG

    Longitudinal Deep Kernel Gaussian Process Regression

    Authors: Junjie Liang, Yanting Wu, Dongkuan Xu, Vasant Honavar

    Abstract: Gaussian processes offer an attractive framework for predictive modeling from longitudinal data, i.e., irregularly sampled, sparse observations from a set of individuals over time. However, such methods have two key shortcomings: (i) They rely on ad hoc heuristics or expensive trial and error to choose the effective kernels, and (ii) They fail to handle multilevel correlation structure in the data… ▽ More

    Submitted 7 December, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

    Comments: Paper accepted by 35th AAAI Conference on Artificial Intelligence

  24. arXiv:1912.02390  [pdf, other

    cs.LG cs.AI stat.ML

    Towards Robust Relational Causal Discovery

    Authors: Sanghack Lee, Vasant Honavar

    Abstract: We consider the problem of learning causal relationships from relational data. Existing approaches rely on queries to a relational conditional independence (RCI) oracle to establish and orient causal relations in such a setting. In practice, queries to a RCI oracle have to be replaced by reliable tests for RCI against available data. Relational data present several unique challenges in testing for… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

    Comments: 14 pages

    Journal ref: Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence, UAI 2019

  25. arXiv:1911.10640  [pdf, other

    stat.ME cs.AI cs.LG stat.ML

    Algorithmic Bias in Recidivism Prediction: A Causal Perspective

    Authors: Aria Khademi, Vasant Honavar

    Abstract: ProPublica's analysis of recidivism predictions produced by Correctional Offender Management Profiling for Alternative Sanctions (COMPAS) software tool for the task, has shown that the predictions were racially biased against African American defendants. We analyze the COMPAS data using a causal reformulation of the underlying algorithmic fairness problem. Specifically, we assess whether COMPAS ex… ▽ More

    Submitted 24 November, 2019; originally announced November 2019.

    Comments: Accepted for publication at the Thirty Fourth AAAI conference on Artificial Intelligence (AAAI-20)

  26. arXiv:1911.04062  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    LMLFM: Longitudinal Multi-Level Factorization Machine

    Authors: Junjie Liang, Dongkuan Xu, Yiwei Sun, Vasant Honavar

    Abstract: We consider the problem of learning predictive models from longitudinal data, consisting of irregularly repeated, sparse observations from a set of individuals over time. Such data often exhibit {\em longitudinal correlation} (LC) (correlations among observations for each individual over time), {\em cluster correlation} (CC) (correlations among individuals that have similar characteristics), or bo… ▽ More

    Submitted 21 November, 2019; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: Thirty-Fourth AAAI Conference on Artificial Intelligence, accepted

  27. arXiv:1909.11630  [pdf, other

    cs.LG stat.ML

    The Dynamical Gaussian Process Latent Variable Model in the Longitudinal Scenario

    Authors: Thanh Le, Vasant Honavar

    Abstract: The Dynamical Gaussian Process Latent Variable Models provide an elegant non-parametric framework for learning the low dimensional representations of the high-dimensional time-series. Real world observational studies, however, are often ill-conditioned: the observations can be noisy, not assuming the luxury of relatively complete and equally spaced like those in time series. Such conditions make i… ▽ More

    Submitted 25 September, 2019; originally announced September 2019.

  28. arXiv:1909.06543  [pdf, other

    cs.LG cs.CR cs.SI stat.ML

    Node Injection Attacks on Graphs via Reinforcement Learning

    Authors: Yiwei Sun, Suhang Wang, Xianfeng Tang, Tsung-Yu Hsieh, Vasant Honavar

    Abstract: Real-world graph applications, such as advertisements and product recommendations make profits based on accurately classify the label of the nodes. However, in such scenarios, there are high incentives for the adversaries to attack such graph to reduce the node classification performance. Previous work on graph adversarial attacks focus on modifying existing graph structures, which is infeasible i… ▽ More

    Submitted 14 September, 2019; originally announced September 2019.

    Comments: Preprint, under review

  29. arXiv:1909.01084  [pdf, other

    cs.SI cs.LG

    MEGAN: A Generative Adversarial Network for Multi-View Network Embedding

    Authors: Yiwei Sun, Suhang Wang, Tsung-Yu Hsieh, Xianfeng Tang, Vasant Honavar

    Abstract: Data from many real-world applications can be naturally represented by multi-view networks where the different views encode different types of relationships (e.g., friendship, shared interests in music, etc.) between real-world individuals or entities. There is an urgent need for methods to obtain low-dimensional, information preserving and typically nonlinear embeddings of such multi-view network… ▽ More

    Submitted 19 August, 2019; originally announced September 2019.

    Comments: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19

  30. Fairness in Algorithmic Decision Making: An Excursion Through the Lens of Causality

    Authors: Aria Khademi, Sanghack Lee, David Foley, Vasant Honavar

    Abstract: As virtually all aspects of our lives are increasingly impacted by algorithmic decision making systems, it is incumbent upon us as a society to ensure such systems do not become instruments of unfair discrimination on the basis of gender, race, ethnicity, religion, etc. We consider the problem of determining whether the decisions made by such systems are discriminatory, through the lens of causal… ▽ More

    Submitted 27 March, 2019; originally announced March 2019.

    Comments: 7 pages, 2 figures, 2 tables.To appear in Proceedings of the International Conference on World Wide Web (WWW), 2019

  31. arXiv:1901.08942  [pdf, other

    cs.CV

    Improving Image Captioning by Leveraging Knowledge Graphs

    Authors: Yimin Zhou, Yiwei Sun, Vasant Honavar

    Abstract: We explore the use of a knowledge graphs, that capture general or commonsense knowledge, to augment the information extracted from images by the state-of-the-art methods for image captioning. The results of our experiments, on several benchmark data sets such as MS COCO, as measured by CIDEr-D, a performance metric for image captioning, show that the variants of the state-of-the-art methods for im… ▽ More

    Submitted 25 January, 2019; originally announced January 2019.

    Comments: Accepted by WACV'19

  32. arXiv:1812.04109  [pdf, ps, other

    cs.IR cs.LG stat.ML

    Top-N-Rank: A Scalable List-wise Ranking Method for Recommender Systems

    Authors: Junjie Liang, Jinlong Hu, Shoubin Dong, Vasant Honavar

    Abstract: We propose Top-N-Rank, a novel family of list-wise Learning-to-Rank models for reliably recommending the N top-ranked items. The proposed models optimize a variant of the widely used discounted cumulative gain (DCG) objective function which differs from DCG in two important aspects: (i) It limits the evaluation of DCG only on the top N items in the ranked lists, thereby eliminating the impact of l… ▽ More

    Submitted 19 December, 2018; v1 submitted 10 December, 2018; originally announced December 2018.

    Comments: paper accepted by the 2018 IEEE International Conference on Big Data

  33. arXiv:1811.02616  [pdf, ps, other

    cs.LG cs.SI

    Multi-View Network Embedding Via Graph Factorization Clustering and Co-Regularized Multi-View Agreement

    Authors: Yiwei Sun, Ngot Bui, Tsung-Yu Hsieh, Vasant Honavar

    Abstract: Real-world social networks and digital platforms are comprised of individuals (nodes) that are linked to other individuals or entities through multiple types of relationships (links). Sub-networks of such a network based on each type of link correspond to distinct views of the underlying network. In real-world applications, each node is typically linked to only a small subset of other nodes. Hence… ▽ More

    Submitted 18 February, 2019; v1 submitted 6 November, 2018; originally announced November 2018.

    Comments: ICDMW2018 -- IEEE International Conference on Data Mining workshop on Graph Analytics

  34. arXiv:1809.01225  [pdf, ps, other

    cs.LG cs.CC stat.ML

    Compositional Stochastic Average Gradient for Machine Learning and Related Applications

    Authors: Tsung-Yu Hsieh, Yasser EL-Manzalawy, Yiwei Sun, Vasant Honavar

    Abstract: Many machine learning, statistical inference, and portfolio optimization problems require minimization of a composition of expected value functions (CEVF). Of particular interest is the finite-sum versions of such compositional optimization problems (FS-CEVF). Compositional stochastic variance reduced gradient (C-SVRG) methods that combine stochastic compositional gradient descent (SCGD) and stoch… ▽ More

    Submitted 7 September, 2018; v1 submitted 4 September, 2018; originally announced September 2018.

  35. arXiv:1707.04352  [pdf

    cs.AI

    Advances in Artificial Intelligence Require Progress Across all of Computer Science

    Authors: Gregory D. Hager, Randal Bryant, Eric Horvitz, Maja Mataric, Vasant Honavar

    Abstract: Advances in Artificial Intelligence require progress across all of computer science.

    Submitted 13 July, 2017; originally announced July 2017.

    Comments: 7 pages, Computing Community Consortium White Paper

  36. arXiv:1707.00599  [pdf

    cs.CY

    Advanced Cyberinfrastructure for Science, Engineering, and Public Policy

    Authors: Vasant G. Honavar, Katherine Yelick, Klara Nahrstedt, Holly Rushmeier, Jennifer Rexford, Mark D. Hill, Elizabeth Bradley, Elizabeth Mynatt

    Abstract: Progress in many domains increasingly benefits from our ability to view the systems through a computational lens, i.e., using computational abstractions of the domains; and our ability to acquire, share, integrate, and analyze disparate types of data. These advances would not be possible without the advanced data and computational cyberinfrastructure and tools for data capture, integration, analys… ▽ More

    Submitted 30 June, 2017; originally announced July 2017.

    Comments: A Computing Community Consortium (CCC) white paper, 9 pages. arXiv admin note: text overlap with arXiv:1604.02006

  37. arXiv:1706.03102  [pdf

    cs.CY

    Big Data, Data Science, and Civil Rights

    Authors: Solon Barocas, Elizabeth Bradley, Vasant Honavar, Foster Provost

    Abstract: Advances in data analytics bring with them civil rights implications. Data-driven and algorithmic decision making increasingly determine how businesses target advertisements to consumers, how police departments monitor individuals or groups, how banks decide who gets a loan and who does not, how employers hire, how colleges and universities make admissions and financial aid decisions, and much mor… ▽ More

    Submitted 9 June, 2017; originally announced June 2017.

    Comments: A Computing Community Consortium (CCC) white paper, 8 pages

  38. arXiv:1604.02006  [pdf

    cs.CY cs.AI cs.DC cs.HC

    Accelerating Science: A Computing Research Agenda

    Authors: Vasant G. Honavar, Mark D. Hill, Katherine Yelick

    Abstract: The emergence of "big data" offers unprecedented opportunities for not only accelerating scientific advances but also enabling new modes of discovery. Scientific progress in many disciplines is increasingly enabled by our ability to examine natural phenomena through the computational lens, i.e., using algorithmic or information processing abstractions of the underlying processes; and our ability t… ▽ More

    Submitted 6 April, 2016; originally announced April 2016.

    Comments: Computing Community Consortium (CCC) white paper, 17 pages

  39. arXiv:1508.02103  [pdf, ps, other

    cs.AI cs.LG

    Lifted Representation of Relational Causal Models Revisited: Implications for Reasoning and Structure Learning

    Authors: Sanghack Lee, Vasant Honavar

    Abstract: Maier et al. (2010) introduced the relational causal model (RCM) for representing and inferring causal relationships in relational data. A lifted representation, called abstract ground graph (AGG), plays a central role in reasoning with and learning of RCM. The correctness of the algorithm proposed by Maier et al. (2013a) for learning RCM from data relies on the soundness and completeness of AGG f… ▽ More

    Submitted 17 August, 2015; v1 submitted 9 August, 2015; originally announced August 2015.

    Comments: Workshop on Advances in Causal Inference, Conference on Uncertainty in Artificial Intelligence, 2015

  40. arXiv:1507.08559  [pdf, other

    cs.AI

    CRISNER: A Practically Efficient Reasoner for Qualitative Preferences

    Authors: Ganesh Ram Santhanam, Samik Basu, Vasant Honavar

    Abstract: We present CRISNER (Conditional & Relative Importance Statement Network PrEference Reasoner), a tool that provides practically efficient as well as exact reasoning about qualitative preferences in popular ceteris paribus preference languages such as CP-nets, TCP-nets, CP-theories, etc. The tool uses a model checking engine to translate preference specifications and queries into appropriate Kripke… ▽ More

    Submitted 30 July, 2015; originally announced July 2015.

    Comments: 15 pages

  41. Representing and Reasoning with Qualitative Preferences for Compositional Systems

    Authors: Ganesh Ram Santhanam, Samik Basu, Vasant Honavar

    Abstract: Many applications, e.g., Web service composition, complex system design, team formation, etc., rely on methods for identifying collections of objects or entities satisfying some functional requirement. Among the collections that satisfy the functional requirement, it is often necessary to identify one or more collections that are optimal with respect to user preferences over a set of attributes th… ▽ More

    Submitted 16 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 42, pages 211-274, 2011

  42. arXiv:1401.3478  [pdf

    cs.LG cs.AI stat.ML

    Efficient Markov Network Structure Discovery Using Independence Tests

    Authors: Facundo Bromberg, Dimitris Margaritis, Vasant Honavar

    Abstract: We present two algorithms for learning the structure of a Markov network from data: GSMN* and GSIMN. Both algorithms use statistical independence tests to infer the structure by successively constraining the set of structures consistent with the results of these tests. Until very recently, algorithms for structure learning were based on maximum likelihood estimation, which has been proved to… ▽ More

    Submitted 15 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 35, pages 449-484, 2009

  43. arXiv:1309.6842  [pdf

    cs.AI

    Causal Transportability of Experiments on Controllable Subsets of Variables: z-Transportability

    Authors: Sanghack Lee, Vasant Honavar

    Abstract: We introduce z-transportability, the problem of estimating the causal effect of a set of variables X on another set of variables Y in a target domain from experiments on any subset of controllable variables Z where Z is an arbitrary subset of observable variables V in a source domain. z-Transportability generalizes z-identifiability, the problem of estimating in a given domain the causal effect of… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-361-370

  44. arXiv:1107.0434  [pdf, other

    cs.AI cs.FL cs.LG

    Abstraction Super-structuring Normal Forms: Towards a Theory of Structural Induction

    Authors: Adrian Silvescu, Vasant Honavar

    Abstract: Induction is the process by which we obtain predictive laws or theories or models of the world. We consider the structural aspect of induction. We answer the question as to whether we can find a finite and minmalistic set of operations on structural elements in terms of which any theory can be expressed. We identify abstraction (grouping similar entities) and super-structuring (combining topologic… ▽ More

    Submitted 3 July, 2011; originally announced July 2011.

  45. arXiv:cs/0511075  [pdf

    cs.LG cs.AI

    Identifying Interaction Sites in "Recalcitrant" Proteins: Predicted Protein and Rna Binding Sites in Rev Proteins of Hiv-1 and Eiav Agree with Experimental Data

    Authors: Michael Terribilini, Jae-Hyung Lee, Changhui Yan, Robert L. Jernigan, Susan Carpenter, Vasant Honavar, Drena Dobbs

    Abstract: Protein-protein and protein nucleic acid interactions are vitally important for a wide range of biological processes, including regulation of gene expression, protein synthesis, and replication and assembly of many viruses. We have developed machine learning approaches for predicting which amino acids of a protein participate in its interactions with other proteins and/or nucleic acids, using on… ▽ More

    Submitted 20 November, 2005; originally announced November 2005.

    Comments: Pacific Symposium on Biocomputing, Hawaii, In press, Accepted, 2006

    ACM Class: J.3