Skip to main content

Showing 1–50 of 61 results for author: Henao, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.03712  [pdf, other

    cs.LG math.ST

    Learning Survival Distributions with the Asymmetric Laplace Distribution

    Authors: Deming Sheng, Ricardo Henao

    Abstract: Probabilistic survival analysis models seek to estimate the distribution of the future occurrence (time) of an event given a set of covariates. In recent years, these models have preferred nonparametric specifications that avoid directly estimating survival distributions via discretization. Specifically, they estimate the probability of an individual event at fixed times or the time of an event at… ▽ More

    Submitted 7 May, 2025; v1 submitted 6 May, 2025; originally announced May 2025.

    Comments: Accepted to ICML 2025

  2. arXiv:2502.05933  [pdf, other

    cs.CL cs.AI

    Learning to Substitute Words with Model-based Score Ranking

    Authors: Hongye Liu, Ricardo Henao

    Abstract: Smart word substitution aims to enhance sentence quality by improving word choices; however current benchmarks rely on human-labeled data. Since word choices are inherently subjective, ground-truth word substitutions generated by a small group of annotators are often incomplete and likely not generalizable. To circumvent this issue, we instead employ a model-based score (BARTScore) to quantify sen… ▽ More

    Submitted 14 February, 2025; v1 submitted 9 February, 2025; originally announced February 2025.

    Comments: Accepted at NAACL 2025 (main, long)

  3. arXiv:2405.17333  [pdf, other

    stat.ML cs.LG

    Conditioning on Time is All You Need for Synthetic Survival Data Generation

    Authors: Mohd Ashhad, Ricardo Henao

    Abstract: Synthetic data generation holds considerable promise, offering avenues to enhance privacy, fairness, and data accessibility. Despite the availability of various methods for generating synthetic tabular data, challenges persist, particularly in specialized applications such as survival analysis. One significant obstacle in survival data generation is censoring, which manifests as not knowing the pr… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2405.17248  [pdf, other

    stat.ML cs.LG

    On Understanding Attention-Based In-Context Learning for Categorical Data

    Authors: Aaron T. Wang, William Convertino, Xiang Cheng, Ricardo Henao, Lawrence Carin

    Abstract: In-context learning based on attention models is examined for data with categorical outcomes, with inference in such models viewed from the perspective of functional gradient descent (GD). We develop a network composed of attention blocks, with each block employing a self-attention layer followed by a cross-attention layer, with associated skip connections. This model can exactly perform multi-ste… ▽ More

    Submitted 6 May, 2025; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2025

  5. Deep learning-driven pulmonary artery and vein segmentation reveals demography-associated vasculature anatomical differences

    Authors: Yuetan Chu, Gongning Luo, Longxi Zhou, Shaodong Cao, Guolin Ma, Xianglin Meng, Juexiao Zhou, Changchun Yang, Dexuan Xie, Dan Mu, Ricardo Henao, Gianluca Setti, Xigang Xiao, Lianming Wu, Zhaowen Qiu, Xin Gao

    Abstract: Pulmonary artery-vein segmentation is crucial for disease diagnosis and surgical planning and is traditionally achieved by Computed Tomography Pulmonary Angiography (CTPA). However, concerns regarding adverse health effects from contrast agents used in CTPA have constrained its clinical utility. In contrast, identifying arteries and veins using non-contrast CT, a conventional and low-cost clinical… ▽ More

    Submitted 1 December, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Journal ref: Nat Commun 16, 2262 (2025)

  6. arXiv:2310.15853  [pdf, other

    stat.ML cs.LG

    Improving Event Time Prediction by Learning to Partition the Event Time Space

    Authors: Jimmy Hickey, Ricardo Henao, Daniel Wojdyla, Michael Pencina, Matthew M. Engelhard

    Abstract: Recently developed survival analysis methods improve upon existing approaches by predicting the probability of event occurrence in each of a number pre-specified (discrete) time intervals. By avoiding placing strong parametric assumptions on the event density, this approach tends to improve prediction performance, particularly when data are plentiful. However, in clinical settings with limited ava… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 16 pages, 5 figures, 2 tables

  7. arXiv:2306.04933  [pdf, other

    cs.CL cs.LG stat.ML

    InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding

    Authors: Junda Wu, Tong Yu, Rui Wang, Zhao Song, Ruiyi Zhang, Handong Zhao, Chaochao Lu, Shuai Li, Ricardo Henao

    Abstract: Soft prompt tuning achieves superior performances across a wide range of few-shot tasks. However, the performances of prompt tuning can be highly sensitive to the initialization of the prompts. We also empirically observe that conventional prompt tuning methods cannot encode and learn sufficient task-relevant information from prompt tokens. In this work, we develop an information-theoretic framewo… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  8. arXiv:2306.01196  [pdf, other

    cs.LG cs.AI stat.ML

    An Effective Meaningful Way to Evaluate Survival Models

    Authors: Shi-ang Qi, Neeraj Kumar, Mahtab Farrokh, Weijie Sun, Li-Hao Kuan, Rajesh Ranganath, Ricardo Henao, Russell Greiner

    Abstract: One straightforward metric to evaluate a survival prediction model is based on the Mean Absolute Error (MAE) -- the average of the absolute difference between the time predicted by the model and the true event time, over all subjects. Unfortunately, this is challenging because, in practice, the test set includes (right) censored individuals, meaning we do not know when a censored individual actual… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML 2023

  9. arXiv:2305.19329  [pdf, other

    cs.CV cs.IR cs.LG

    Mitigating Test-Time Bias for Fair Image Retrieval

    Authors: Fanjie Kong, Shuai Yuan, Weituo Hao, Ricardo Henao

    Abstract: We address the challenge of generating fair and unbiased image retrieval results given neutral textual queries (with no explicit gender or race connotations), while maintaining the utility (performance) of the underlying vision-language (VL) model. Previous methods aim to disentangle learned representations of images and text queries from gender and racial characteristics. However, we show these a… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  10. arXiv:2303.05581  [pdf, other

    cs.CL cs.AI

    Open World Classification with Adaptive Negative Samples

    Authors: Ke Bai, Guoyin Wang, Jiwei Li, Sunghyun Park, Sungjin Lee, Puyang Xu, Ricardo Henao, Lawrence Carin

    Abstract: Open world classification is a task in natural language processing with key practical relevance and impact. Since the open or {\em unknown} category data only manifests in the inference phase, finding a model with a suitable decision boundary accommodating for the identification of known classes and discrimination of the open category is challenging. The performance of existing models is limited b… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted by EMNLP 2021 (Main Track, Long Paper)

  11. arXiv:2302.13136  [pdf, other

    cs.CL cs.AI

    Toward Fairness in Text Generation via Mutual Information Minimization based on Importance Sampling

    Authors: Rui Wang, Pengyu Cheng, Ricardo Henao

    Abstract: Pretrained language models (PLMs), such as GPT2, have achieved remarkable empirical performance in text generation tasks. However, pretrained on large-scale natural language corpora, the generated text from PLMs may exhibit social bias against disadvantaged demographic groups. To improve the fairness of PLMs in text generation, we propose to minimize the mutual information between the semantics in… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

  12. Neural Insights for Digital Marketing Content Design

    Authors: Fanjie Kong, Yuan Li, Houssam Nassif, Tanner Fiez, Ricardo Henao, Shreya Chakrabarti

    Abstract: In digital marketing, experimenting with new website content is one of the key levers to improve customer engagement. However, creating successful marketing content is a manual and time-consuming process that lacks clear guiding principles. This paper seeks to close the loop between content creation and online experimentation by offering marketers AI-driven actionable insights based on historical… ▽ More

    Submitted 7 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Journal ref: International Conference on Knowledge Discovery and Data Mining (KDD'23), Long Beach, CA, pp. 4320-4332, 2023

  13. arXiv:2210.12818  [pdf, other

    cs.CV

    Pushing the Efficiency Limit Using Structured Sparse Convolutions

    Authors: Vinay Kumar Verma, Nikhil Mehta, Shijing Si, Ricardo Henao, Lawrence Carin

    Abstract: Weight pruning is among the most popular approaches for compressing deep convolutional neural networks. Recent work suggests that in a randomly initialized deep neural network, there exist sparse subnetworks that achieve performance comparable to the original network. Unfortunately, finding these subnetworks involves iterative stages of training and pruning, which can be computationally expensive.… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: Accepted at the IEEE Winter Conference on Applications of Computer Vision, WACV 2023

  14. arXiv:2210.05751  [pdf, other

    cs.CV cs.LG

    Toward Sustainable Continual Learning: Detection and Knowledge Repurposing of Similar Tasks

    Authors: Sijia Wang, Yoojin Choi, Junya Chen, Mostafa El-Khamy, Ricardo Henao

    Abstract: Most existing works on continual learning (CL) focus on overcoming the catastrophic forgetting (CF) problem, with dynamic models and replay methods performing exceptionally well. However, since current works tend to assume exclusivity or dissimilarity among learning tasks, these methods require constantly accumulating task-specific knowledge in memory for each task. This results in the eventual pr… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  15. arXiv:2202.12932  [pdf, other

    stat.ML cs.LG

    Capturing Actionable Dynamics with Structured Latent Ordinary Differential Equations

    Authors: Paidamoyo Chapfuwa, Sherri Rose, Lawrence Carin, Edward Meeds, Ricardo Henao

    Abstract: End-to-end learning of dynamical systems with black-box models, such as neural ordinary differential equations (ODEs), provides a flexible framework for learning dynamics from data without prescribing a mathematical model for the dynamics. Unfortunately, this flexibility comes at the cost of understanding the dynamical system, for which ODEs are used ubiquitously. Further, experimental data are co… ▽ More

    Submitted 16 June, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: Accepted for the 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022). Github code can be found at https://github.com/paidamoyo/structured_latent_ODEs

  16. arXiv:2202.01869  [pdf, other

    cs.LG stat.ME

    Flexible Triggering Kernels for Hawkes Process Modeling

    Authors: Yamac Alican Isik, Connor Davis, Paidamoyo Chapfuwa, Ricardo Henao

    Abstract: Recently proposed encoder-decoder structures for modeling Hawkes processes use transformer-inspired architectures, which encode the history of events via embeddings and self-attention mechanisms. These models deliver better prediction and goodness-of-fit than their RNN-based counterparts. However, they often require high computational and memory complexity requirements and sometimes fail to adequa… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  17. arXiv:2202.00932  [pdf, other

    cs.SE

    Automatic Creation of Acceptance Tests by Extracting Conditionals from Requirements: NLP Approach and Case Study

    Authors: Jannik Fischbach, Julian Frattini, Andreas Vogelsang, Daniel Mendez, Michael Unterkalmsteiner, Andreas Wehrle, Pablo Restrepo Henao, Parisa Yousefi, Tedi Juricic, Jeannette Radduenz, Carsten Wiecher

    Abstract: Acceptance testing is crucial to determine whether a system fulfills end-user requirements. However, the creation of acceptance tests is a laborious task entailing two major challenges: (1) practitioners need to determine the right set of test cases that fully covers a requirement, and (2) they need to create test cases manually due to insufficient tool support. Existing approaches for automatical… ▽ More

    Submitted 13 October, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

  18. arXiv:2108.00663  [pdf, other

    cs.CL cs.SE

    Transfer Learning for Mining Feature Requests and Bug Reports from Tweets and App Store Reviews

    Authors: Pablo Restrepo Henao, Jannik Fischbach, Dominik Spies, Julian Frattini, Andreas Vogelsang

    Abstract: Identifying feature requests and bug reports in user comments holds great potential for development teams. However, automated mining of RE-related information from social media and app stores is challenging since (1) about 70% of user comments contain noisy, irrelevant information, (2) the amount of user comments grows daily making manual analysis unfeasible, and (3) user comments are written in d… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

  19. arXiv:2107.01983  [pdf, other

    cs.LG cs.AI

    Gradient Importance Learning for Incomplete Observations

    Authors: Qitong Gao, Dong Wang, Joshua D. Amason, Siyang Yuan, Chenyang Tao, Ricardo Henao, Majda Hadziahmetovic, Lawrence Carin, Miroslav Pajic

    Abstract: Though recent works have developed methods that can generate estimates (or imputations) of the missing entries in a dataset to facilitate downstream analysis, most depend on assumptions that may not align with real-world applications and could suffer from poor performance in subsequent tasks such as classification. This is particularly true if the data have large missingness rates or a small sampl… ▽ More

    Submitted 1 March, 2022; v1 submitted 5 July, 2021; originally announced July 2021.

  20. arXiv:2106.02694  [pdf, other

    cs.CV cs.LG

    Efficient Classification of Very Large Images with Tiny Objects

    Authors: Fanjie Kong, Ricardo Henao

    Abstract: An increasing number of applications in computer vision, specially, in medical imaging and remote sensing, become challenging when the goal is to classify very large images with tiny informative objects. Specifically, these classification tasks face two key challenges: $i$) the size of the input image is usually in the order of mega- or giga-pixels, however, existing deep architectures do not easi… ▽ More

    Submitted 3 December, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

  21. arXiv:2104.02652  [pdf, other

    cs.CV cs.LG

    Malignancy Prediction and Lesion Identification from Clinical Dermatological Images

    Authors: Meng Xia, Meenal K. Kheterpal, Samantha C. Wong, Christine Park, William Ratliff, Lawrence Carin, Ricardo Henao

    Abstract: We consider machine-learning-based malignancy prediction and lesion identification from clinical dermatological images, which can be indistinctly acquired via smartphone or dermoscopy capture. Additionally, we do not assume that images contain single lesions, thus the framework supports both focal or wide-field images. Specifically, we propose a two-stage approach in which we first identify all le… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

  22. arXiv:2101.03154  [pdf, other

    cs.CV cs.LG

    Quantum Tensor Network in Machine Learning: An Application to Tiny Object Classification

    Authors: Fanjie Kong, Xiao-yang Liu, Ricardo Henao

    Abstract: Tiny object classification problem exists in many machine learning applications like medical imaging or remote sensing, where the object of interest usually occupies a small region of the whole image. It is challenging to design an efficient machine learning model with respect to tiny object of interest. Current neural network structures are unable to deal with tiny object efficiently because they… ▽ More

    Submitted 8 January, 2021; originally announced January 2021.

    Comments: 8 pages, 7 figures

    Journal ref: https://tensorworkshop.github.io/NeurIPS2020/CFP.html

  23. arXiv:2012.08674  [pdf, other

    cs.LG cs.CV

    Wasserstein Contrastive Representation Distillation

    Authors: Liqun Chen, Dong Wang, Zhe Gan, Jingjing Liu, Ricardo Henao, Lawrence Carin

    Abstract: The primary goal of knowledge distillation (KD) is to encapsulate the information of a model learned from a teacher network into a student network, with the latter being more compact than the former. Existing work, e.g., using Kullback-Leibler divergence for distillation, may fail to capture important structural knowledge in the teacher network and often lacks the ability for feature generalizatio… ▽ More

    Submitted 28 March, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: Accepted by CVPR 2021

  24. arXiv:2012.03369  [pdf, other

    cs.CV cs.AI

    Proactive Pseudo-Intervention: Causally Informed Contrastive Learning For Interpretable Vision Models

    Authors: Dong Wang, Yuewei Yang, Chenyang Tao, Zhe Gan, Liqun Chen, Fanjie Kong, Ricardo Henao, Lawrence Carin

    Abstract: Deep neural networks excel at comprehending complex visual signals, delivering on par or even superior performance to that of human experts. However, ad-hoc visual explanations of model decisions often reveal an alarming level of reliance on exploiting non-causal visual cues that strongly correlate with the target label in training data. As such, deep neural nets suffer compromised generalization… ▽ More

    Submitted 29 April, 2021; v1 submitted 6 December, 2020; originally announced December 2020.

  25. arXiv:2011.12454  [pdf, other

    cs.CV

    Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer

    Authors: Zidi Xiu, Junya Chen, Ricardo Henao, Benjamin Goldstein, Lawrence Carin, Chenyang Tao

    Abstract: Dealing with severe class imbalance poses a major challenge for real-world applications, especially when the accurate classification and generalization of minority classes is of primary interest. In computer vision, learning from long tailed datasets is a recurring theme, especially for natural image datasets. While existing solutions mostly appeal to sampling or weighting adjustments to alleviate… ▽ More

    Submitted 19 November, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

  26. arXiv:2011.04794  [pdf, other

    cs.IT

    Estimating Total Correlation with Mutual Information Estimators

    Authors: Ke Bai, Pengyu Cheng, Weituo Hao, Ricardo Henao, Lawrence Carin

    Abstract: Total correlation (TC) is a fundamental concept in information theory that measures statistical dependency among multiple random variables. Recently, TC has shown noticeable effectiveness as a regularizer in many learning tasks, where the correlation among multiple latent embeddings requires to be jointly minimized or maximized. However, calculating precise TC values is challenging, especially whe… ▽ More

    Submitted 22 February, 2023; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted by AISTATS 2023

  27. arXiv:2010.12618  [pdf, other

    stat.ML cs.LG

    Counterfactual Representation Learning with Balancing Weights

    Authors: Serge Assaad, Shuxi Zeng, Chenyang Tao, Shounak Datta, Nikhil Mehta, Ricardo Henao, Fan Li, Lawrence Carin

    Abstract: A key to causal inference with observational data is achieving balance in predictive features associated with each treatment type. Recent literature has explored representation learning to achieve this goal. In this work, we discuss the pitfalls of these strategies - such as a steep trade-off between achieving balance and predictive power - and present a remedy via the integration of balancing wei… ▽ More

    Submitted 23 February, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted to International Conference on Artificial Intelligence and Statistics (AISTATS 2021)

  28. arXiv:2009.08541  [pdf, other

    stat.ML cs.LG

    Variational Disentanglement for Rare Event Modeling

    Authors: Zidi Xiu, Chenyang Tao, Michael Gao, Connor Davis, Benjamin A. Goldstein, Ricardo Henao

    Abstract: Combining the increasing availability and abundance of healthcare data and the current advances in machine learning methods have created renewed opportunities to improve clinical decision support systems. However, in healthcare risk prediction applications, the proportion of cases with the condition (label) of interest is often very low relative to the available sample size. Though very prevalent… ▽ More

    Submitted 16 June, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted to AAAI2021

  29. arXiv:2008.06597  [pdf, other

    cs.CV

    Weakly supervised cross-domain alignment with optimal transport

    Authors: Siyang Yuan, Ke Bai, Liqun Chen, Yizhe Zhang, Chenyang Tao, Chunyuan Li, Guoyin Wang, Ricardo Henao, Lawrence Carin

    Abstract: Cross-domain alignment between image objects and text sequences is key to many visual-language tasks, and it poses a fundamental challenge to both computer vision and natural language processing. This paper investigates a novel approach for the identification and optimization of fine-grained semantic similarities between image and text entities, under a weakly-supervised setup, improving performan… ▽ More

    Submitted 14 August, 2020; originally announced August 2020.

    Comments: Accepted to BMVC 2020 (Oral)

  30. arXiv:2006.11991  [pdf, other

    cs.CL cs.LG

    Students Need More Attention: BERT-based AttentionModel for Small Data with Application to AutomaticPatient Message Triage

    Authors: Shijing Si, Rui Wang, Jedrek Wosik, Hao Zhang, David Dov, Guoyin Wang, Ricardo Henao, Lawrence Carin

    Abstract: Small and imbalanced datasets commonly seen in healthcare represent a challenge when training classifiers based on deep learning models. So motivated, we propose a novel framework based on BioBERT (Bidirectional Encoder Representations from Transformers forBiomedical TextMining). Specifically, (i) we introduce Label Embeddings for Self-Attention in each layer of BERT, which we call LESA-BERT, and… ▽ More

    Submitted 21 June, 2020; originally announced June 2020.

    Comments: 20 pages, Machine Learning for Healthcare 2020 (To appear)

  31. Enabling Counterfactual Survival Analysis with Balanced Representations

    Authors: Paidamoyo Chapfuwa, Serge Assaad, Shuxi Zeng, Michael J. Pencina, Lawrence Carin, Ricardo Henao

    Abstract: Balanced representation learning methods have been applied successfully to counterfactual inference from observational data. However, approaches that account for survival outcomes are relatively limited. Survival data are frequently encountered across diverse medical applications, i.e., drug development, risk profiling, and clinical trials, and such data are also relevant in fields like manufactur… ▽ More

    Submitted 3 March, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: Accepted at ACM Conference on Health, Inference, and Learning (ACM CHIL 2021). Code at https://github.com/paidamoyo/counterfactual_survival_analysis

  32. arXiv:2004.01376  [pdf, other

    stat.ML cs.LG

    Neural Conditional Event Time Models

    Authors: Matthew Engelhard, Samuel Berchuck, Joshua D'Arcy, Ricardo Henao

    Abstract: Event time models predict occurrence times of an event of interest based on known features. Recent work has demonstrated that neural networks achieve state-of-the-art event time predictions in a variety of settings. However, standard event time models suppose that the event occurs, eventually, in all cases. Consequently, no distinction is made between a) the probability of event occurrence, and b)… ▽ More

    Submitted 3 April, 2020; originally announced April 2020.

  33. arXiv:2003.04430  [pdf, other

    stat.ML cs.LG stat.AP

    Variational Learning of Individual Survival Distributions

    Authors: Zidi Xiu, Chenyang Tao, Benjamin A. Goldstein, Ricardo Henao

    Abstract: The abundance of modern health data provides many opportunities for the use of machine learning techniques to build better statistical models to improve clinical decision making. Predicting time-to-event distributions, also known as survival analysis, plays a key role in many clinical applications. We introduce a variational time-to-event prediction model, named Variational Survival Inference (VSI… ▽ More

    Submitted 13 December, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

  34. arXiv:2003.00355  [pdf, other

    stat.ML cs.LG

    Survival Cluster Analysis

    Authors: Paidamoyo Chapfuwa, Chunyuan Li, Nikhil Mehta, Lawrence Carin, Ricardo Henao

    Abstract: Conventional survival analysis approaches estimate risk scores or individualized time-to-event distributions conditioned on covariates. In practice, there is often great population-level phenotypic heterogeneity, resulting from (unknown) subpopulations with diverse risk profiles or survival distributions. As a result, there is an unmet need in survival analysis for identifying subpopulations with… ▽ More

    Submitted 29 February, 2020; originally announced March 2020.

    Comments: Accepted at ACM CHIL 2020. Code: this https URL, https://github.com/paidamoyo/survival_cluster_analysis

  35. arXiv:2002.04752  [pdf

    eess.IV cs.CV cs.LG

    Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale Chest Computed Tomography Volumes

    Authors: Rachel Lea Draelos, David Dov, Maciej A. Mazurowski, Joseph Y. Lo, Ricardo Henao, Geoffrey D. Rubin, Lawrence Carin

    Abstract: Machine learning models for radiology benefit from large-scale data sets with high quality labels for abnormalities. We curated and analyzed a chest computed tomography (CT) data set of 36,316 volumes from 19,993 unique patients. This is the largest multiply-annotated volumetric medical imaging data set reported. To annotate this data set, we developed a rule-based method for automatically extract… ▽ More

    Submitted 12 October, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Comments: 20 pages, 3 figures, 5 tables (appendices additional). Published in Medical Image Analysis (October 2020)

  36. arXiv:2002.02913  [pdf, other

    cs.LG stat.ML

    Learning Autoencoders with Relational Regularization

    Authors: Hongteng Xu, Dixin Luo, Ricardo Henao, Svati Shah, Lawrence Carin

    Abstract: A new algorithmic framework is proposed for learning autoencoders of data distributions. We minimize the discrepancy between the model and target distributions, with a \emph{relational regularization} on the learnable latent prior. This regularization penalizes the fused Gromov-Wasserstein (FGW) distance between the latent prior and its corresponding posterior, allowing one to flexibly learn a str… ▽ More

    Submitted 25 June, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

    Journal ref: International conference on machine learning 2020

  37. arXiv:1910.04233  [pdf, other

    stat.ML cs.LG cs.NE

    Kernel-Based Approaches for Sequence Modeling: Connections to Neural Methods

    Authors: Kevin J Liang, Guoyin Wang, Yitong Li, Ricardo Henao, Lawrence Carin

    Abstract: We investigate time-dependent data analysis from the perspective of recurrent kernel machines, from which models with hidden units and gated memory cells arise naturally. By considering dynamic gating of the memory cell, a model closely related to the long short-term memory (LSTM) recurrent neural network is derived. Extending this setup to $n$-gram filters, the convolutional neural network (CNN),… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

  38. arXiv:1910.02176  [pdf, other

    cs.LG stat.ML

    Straight-Through Estimator as Projected Wasserstein Gradient Flow

    Authors: Pengyu Cheng, Chang Liu, Chunyuan Li, Dinghan Shen, Ricardo Henao, Lawrence Carin

    Abstract: The Straight-Through (ST) estimator is a widely used technique for back-propagating gradients through discrete random variables. However, this effective method lacks theoretical justification. In this paper, we show that ST can be interpreted as the simulation of the projected Wasserstein gradient flow (pWGF). Based on this understanding, a theoretical foundation is established to justify the conv… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: Accepted as NeurIPS 2018 Bayesian Deep Learning Workshop

  39. arXiv:1909.13456  [pdf, other

    cs.LG cs.CL stat.ML

    Improving Textual Network Learning with Variational Homophilic Embeddings

    Authors: Wenlin Wang, Chenyang Tao, Zhe Gan, Guoyin Wang, Liqun Chen, Xinyuan Zhang, Ruiyi Zhang, Qian Yang, Ricardo Henao, Lawrence Carin

    Abstract: The performance of many network learning applications crucially hinges on the success of network embedding algorithms, which aim to encode rich network information into low-dimensional vertex-based vector representations. This paper considers a novel variational formulation of network embeddings, with special focus on textual networks. Different from most existing methods that optimize a discrimin… ▽ More

    Submitted 30 September, 2019; originally announced September 2019.

    Comments: Accepted to NeurIPS 2019

  40. arXiv:1905.13331  [pdf, other

    cs.LG stat.ML

    Discriminative Clustering for Robust Unsupervised Domain Adaptation

    Authors: Rui Wang, Guoyin Wang, Ricardo Henao

    Abstract: Unsupervised domain adaptation seeks to learn an invariant and discriminative representation for an unlabeled target domain by leveraging the information of a labeled source dataset. We propose to improve the discriminative ability of the target domain representation by simultaneously learning tightly clustered target representations while encouraging that each cluster is assigned to a unique and… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

  41. Survival Function Matching for Calibrated Time-to-Event Predictions

    Authors: Paidamoyo Chapfuwa, Chenyang Tao, Lawrence Carin, Ricardo Henao

    Abstract: Models for predicting the time of a future event are crucial for risk assessment, across a diverse range of applications. Existing time-to-event (survival) models have focused primarily on preserving pairwise ordering of estimated event times, or relative risk. Model calibration is relatively under explored, despite its critical importance in time-to-event applications. We present a survival funct… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

  42. arXiv:1904.12739  [pdf, other

    physics.med-ph cs.CV

    Weakly Supervised Instance Learning for Thyroid Malignancy Prediction from Whole Slide Cytopathology Images

    Authors: David Dov, Shahar Ziv Kovalsky, Serge Assaad, Avani A. Pendse Jonathan Cohen, Danielle Elliott Range, Ricardo Henao, Lawrence Carin

    Abstract: We consider machine-learning-based thyroid-malignancy prediction from cytopathology whole-slide images (WSI). Multiple instance learning (MIL) approaches, typically used for the analysis of WSIs, divide the image (bag) into patches (instances), which are used to predict a single bag-level label. These approaches perform poorly in cytopathology slides due to a unique bag structure: sparsely located… ▽ More

    Submitted 30 July, 2020; v1 submitted 25 April, 2019; originally announced April 2019.

  43. arXiv:1904.00839  [pdf, other

    cs.CV cs.AI eess.IV

    Thyroid Cancer Malignancy Prediction From Whole Slide Cytopathology Images

    Authors: David Dov, Shahar Kovalsky, Jonathan Cohen, Danielle Range, Ricardo Henao, Lawrence Carin

    Abstract: We consider preoperative prediction of thyroid cancer based on ultra-high-resolution whole-slide cytopathology images. Inspired by how human experts perform diagnosis, our approach first identifies and classifies diagnostic image regions containing informative thyroid cells, which only comprise a tiny fraction of the entire image. These local estimates are then aggregated into a single prediction… ▽ More

    Submitted 29 March, 2019; originally announced April 2019.

    Journal ref: Proceedings of Machine Learning Research, 2019, Vol. 106

  44. arXiv:1808.09633  [pdf, other

    cs.CL cs.AI cs.LG cs.SI

    Improved Semantic-Aware Network Embedding with Fine-Grained Word Alignment

    Authors: Dinghan Shen, Xinyuan Zhang, Ricardo Henao, Lawrence Carin

    Abstract: Network embeddings, which learn low-dimensional representations for each vertex in a large-scale network, have received considerable attention in recent years. For a wide range of applications, vertices in a network are typically accompanied by rich textual information such as user profiles, paper abstracts, etc. We propose to incorporate semantic features into network embeddings by matching impor… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: To appear at EMNLP 2018

  45. arXiv:1806.02978  [pdf, other

    cs.LG stat.ML

    JointGAN: Multi-Domain Joint Distribution Learning with Generative Adversarial Nets

    Authors: Yunchen Pu, Shuyang Dai, Zhe Gan, Weiyao Wang, Guoyin Wang, Yizhe Zhang, Ricardo Henao, Lawrence Carin

    Abstract: A new generative adversarial network is developed for joint distribution matching. Distinct from most existing approaches, that only learn conditional distributions, the proposed model aims to learn a joint distribution of multiple random variables (domains). This is achieved by learning to sample from conditional distributions between the domains, while simultaneously learning to sample from the… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: Accepted by ICML 2018

  46. arXiv:1805.09843  [pdf, other

    cs.CL cs.AI cs.LG

    Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms

    Authors: Dinghan Shen, Guoyin Wang, Wenlin Wang, Martin Renqiang Min, Qinliang Su, Yizhe Zhang, Chunyuan Li, Ricardo Henao, Lawrence Carin

    Abstract: Many deep learning architectures have been proposed to model the compositionality in text sequences, requiring a substantial number of parameters and expensive computations. However, there has not been a rigorous evaluation regarding the added value of sophisticated compositional functions. In this paper, we conduct a point-by-point comparative study between Simple Word-Embedding-based Models (SWE… ▽ More

    Submitted 24 May, 2018; originally announced May 2018.

    Comments: To appear at ACL 2018 (code: https://github.com/dinghanshen/SWEM)

  47. arXiv:1805.05361  [pdf, other

    cs.CL cs.IR cs.LG

    NASH: Toward End-to-End Neural Architecture for Generative Semantic Hashing

    Authors: Dinghan Shen, Qinliang Su, Paidamoyo Chapfuwa, Wenlin Wang, Guoyin Wang, Lawrence Carin, Ricardo Henao

    Abstract: Semantic hashing has become a powerful paradigm for fast similarity search in many information retrieval systems. While fairly successful, previous techniques generally require two-stage training, and the binary constraints are handled ad-hoc. In this paper, we present an end-to-end Neural Architecture for Semantic Hashing (NASH), where the binary hashing codes are treated as Bernoulli latent vari… ▽ More

    Submitted 14 May, 2018; originally announced May 2018.

    Comments: To appear at ACL 2018

  48. arXiv:1805.04174  [pdf, other

    cs.CL cs.LG

    Joint Embedding of Words and Labels for Text Classification

    Authors: Guoyin Wang, Chunyuan Li, Wenlin Wang, Yizhe Zhang, Dinghan Shen, Xinyuan Zhang, Ricardo Henao, Lawrence Carin

    Abstract: Word embeddings are effective intermediate representations for capturing semantic regularities between words, when learning the representations of text sequences. We propose to view text classification as a label-word joint embedding problem: each label is embedded in the same space with the word vectors. We introduce an attention framework that measures the compatibility of embeddings between tex… ▽ More

    Submitted 10 May, 2018; originally announced May 2018.

    Comments: Published in ACL 2018; Code: https://github.com/guoyinwang/LEAM

  49. arXiv:1804.03184  [pdf, other

    stat.ML cs.LG

    Adversarial Time-to-Event Modeling

    Authors: Paidamoyo Chapfuwa, Chenyang Tao, Chunyuan Li, Courtney Page, Benjamin Goldstein, Lawrence Carin, Ricardo Henao

    Abstract: Modern health data science applications leverage abundant molecular and electronic health data, providing opportunities for machine learning to build statistical models to support clinical practice. Time-to-event analysis, also called survival analysis, stands as one of the most representative examples of such statistical models. We present a deep-network-based approach that leverages adversarial… ▽ More

    Submitted 7 June, 2018; v1 submitted 9 April, 2018; originally announced April 2018.

    Comments: Published in ICML 2018; Code: https://github.com/paidamoyo/adversarial_time_to_event

    Journal ref: Proceedings of the 35th International Conference on Machine Learning, PMLR 80:735-744, 2018

  50. arXiv:1801.05062  [pdf, other

    stat.ML cs.LG stat.AP

    Multi-Label Learning from Medical Plain Text with Convolutional Residual Models

    Authors: Xinyuan Zhang, Ricardo Henao, Zhe Gan, Yitong Li, Lawrence Carin

    Abstract: Predicting diagnoses from Electronic Health Records (EHRs) is an important medical application of multi-label learning. We propose a convolutional residual model for multi-label classification from doctor notes in EHR data. A given patient may have multiple diagnoses, and therefore multi-label learning is required. We employ a Convolutional Neural Network (CNN) to encode plain text into a fixed-le… ▽ More

    Submitted 8 August, 2018; v1 submitted 15 January, 2018; originally announced January 2018.

    Comments: Machine Learning for Healthcare 2018 spotlight paper