Skip to main content

Showing 1–17 of 17 results for author: Dong, L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.02736  [pdf, other

    stat.ME

    Estimating Optimal Dynamic Treatment Regimes Using Irregularly Observed Data: A Target Trial Emulation and Bayesian Joint Modeling Approach

    Authors: Larry Dong, Eleanor Pullenayegum, Rodolphe Thiébaut, Olli Saarela

    Abstract: An optimal dynamic treatment regime (DTR) is a sequence of decision rules aimed at providing the best course of treatments individualized to patients. While conventional DTR estimation uses longitudinal data, such data can also be irregular, where patient-level variables can affect visit times, treatment assignments and outcomes. In this work, we first extend the target trial framework - a paradig… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

  2. arXiv:2407.14065  [pdf, other

    cs.LG stat.ML

    MSCT: Addressing Time-Varying Confounding with Marginal Structural Causal Transformer for Counterfactual Post-Crash Traffic Prediction

    Authors: Shuang Li, Ziyuan Pu, Nan Zhang, Duxin Chen, Lu Dong, Daniel J. Graham, Yinhai Wang

    Abstract: Traffic crashes profoundly impede traffic efficiency and pose economic challenges. Accurate prediction of post-crash traffic status provides essential information for evaluating traffic perturbations and developing effective solutions. Previous studies have established a series of deep learning models to predict post-crash traffic conditions, however, these correlation-based methods cannot accommo… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 13 pages, 9 figures

  3. arXiv:2407.05625  [pdf, other

    stat.ME cs.LG

    New User Event Prediction Through the Lens of Causal Inference

    Authors: Henry Shaowu Yuchi, Shixiang Zhu, Li Dong, Yigit M. Arisoy, Matthew C. Spencer

    Abstract: Modeling and analysis for event series generated by users of heterogeneous behavioral patterns are closely involved in our daily lives, including credit card fraud detection, online platform user recommendation, and social network analysis. The most commonly adopted approach to this task is to assign users to behavior-based categories and analyze each of them separately. However, this requires ext… ▽ More

    Submitted 4 April, 2025; v1 submitted 8 July, 2024; originally announced July 2024.

  4. arXiv:2109.01218  [pdf, other

    stat.AP stat.ME

    Evaluating the Use of Generalized Dynamic Weighted Ordinary Least Squares for Individualized HIV Treatment Strategies

    Authors: Larry Dong, Erica E. M. Moodie, Laura Villain, Rodolphe Thiébaut

    Abstract: Dynamic treatment regimes (DTR) are a statistical paradigm in precision medicine which aim to optimize patient outcomes by individualizing treatments. At its simplest, a DTR may require only a single decision to be made; this special case is called an individualized treatment rule (ITR) and is often used to maximize short-term rewards. Generalized dynamic weighted ordinary least squares (G-dWOLS),… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

  5. arXiv:2104.07932  [pdf, other

    cs.LG cs.CE stat.ML

    Interval-censored Hawkes processes

    Authors: Marian-Andrei Rizoiu, Alexander Soen, Shidi Li, Pio Calderon, Leanne Dong, Aditya Krishna Menon, Lexing Xie

    Abstract: Interval-censored data solely records the aggregated counts of events during specific time intervals - such as the number of patients admitted to the hospital or the volume of vehicles passing traffic loop detectors - and not the exact occurrence time of the events. It is currently not understood how to fit the Hawkes point processes to this kind of data. Its typical loss function (the point proce… ▽ More

    Submitted 25 November, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Journal ref: Journal of Machine Learning Research, 23(338):1-84, 2022. https://jmlr.org/papers/v23/21-0917.html

  6. MultiImport: Inferring Node Importance in a Knowledge Graph from Multiple Input Signals

    Authors: Namyong Park, Andrey Kan, Xin Luna Dong, Tong Zhao, Christos Faloutsos

    Abstract: Given multiple input signals, how can we infer node importance in a knowledge graph (KG)? Node importance estimation is a crucial and challenging task that can benefit a lot of applications including recommendation, search, and query disambiguation. A key challenge towards this goal is how to effectively use input from different sources. On the one hand, a KG is a rich source of information, with… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: KDD 2020 Research Track. 10 pages

  7. arXiv:2004.13852  [pdf, other

    cs.CL cs.IR cs.LG stat.ML

    TXtract: Taxonomy-Aware Knowledge Extraction for Thousands of Product Categories

    Authors: Giannis Karamanolakis, Jun Ma, Xin Luna Dong

    Abstract: Extracting structured knowledge from product profiles is crucial for various applications in e-Commerce. State-of-the-art approaches for knowledge extraction were each designed for a single category of product, and thus do not apply to real-life e-Commerce scenarios, which often contain thousands of diverse categories. This paper proposes TXtract, a taxonomy-aware knowledge extraction model that a… ▽ More

    Submitted 1 May, 2020; v1 submitted 14 April, 2020; originally announced April 2020.

    Comments: Accepted to ACL 2020 (Long Paper)

  8. Improving trial generalizability using observational studies

    Authors: Dasom Lee, Shu Yang, Lin Dong, Xiaofei Wang, Donglin Zeng, Jianwen Cai

    Abstract: Complementary features of randomized controlled trials (RCTs) and observational studies (OSs) can be used jointly to estimate the average treatment effect of a target population. We propose a calibration weighting estimator that enforces the covariate balance between the RCT and OS, therefore improving the trial-based estimator's generalizability. Exploiting semiparametric efficiency theory, we pr… ▽ More

    Submitted 29 November, 2021; v1 submitted 2 March, 2020; originally announced March 2020.

  9. arXiv:2001.09223  [pdf, other

    cs.LG cs.NI stat.ML

    Stacked Auto Encoder Based Deep Reinforcement Learning for Online Resource Scheduling in Large-Scale MEC Networks

    Authors: Feibo Jiang, Kezhi Wang, Li Dong, Cunhua Pan, Kun Yang

    Abstract: An online resource scheduling framework is proposed for minimizing the sum of weighted task latency for all the Internet of things (IoT) users, by optimizing offloading decision, transmission power and resource allocation in the large-scale mobile edge computing (MEC) system. Towards this end, a deep reinforcement learning (DRL) based solution is proposed, which includes the following components.… ▽ More

    Submitted 14 April, 2020; v1 submitted 24 January, 2020; originally announced January 2020.

    Comments: Accepted by IEEE Internet of Things Journal

  10. arXiv:1905.08865  [pdf, other

    cs.LG cs.IR stat.ML

    Estimating Node Importance in Knowledge Graphs Using Graph Neural Networks

    Authors: Namyong Park, Andrey Kan, Xin Luna Dong, Tong Zhao, Christos Faloutsos

    Abstract: How can we estimate the importance of nodes in a knowledge graph (KG)? A KG is a multi-relational graph that has proven valuable for many tasks including question answering and semantic search. In this paper, we present GENI, a method for tackling the problem of estimating node importance in KGs, which enables several downstream applications such as item recommendation and resource allocation. Whi… ▽ More

    Submitted 16 June, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: KDD 2019 Research Track. 11 pages. Changelog: Type 3 font removed, and minor updates made in the Appendix (v2)

  11. arXiv:1904.12606  [pdf, other

    cs.IR cs.LG stat.ML

    OpenKI: Integrating Open Information Extraction and Knowledge Bases with Relation Inference

    Authors: Dongxu Zhang, Subhabrata Mukherjee, Colin Lockard, Xin Luna Dong, Andrew McCallum

    Abstract: In this paper, we consider advancing web-scale knowledge extraction and alignment by integrating OpenIE extractions in the form of (subject, predicate, object) triples with Knowledge Bases (KB). Traditional techniques from universal schema and from schema mapping fall in two extremes: either they perform instance-level inference relying on embedding for (subject, object) pairs, thus cannot handle… ▽ More

    Submitted 12 April, 2019; originally announced April 2019.

  12. arXiv:1904.10762  [pdf, other

    cs.LG cs.AI stat.ML

    Baconian: A Unified Open-source Framework for Model-Based Reinforcement Learning

    Authors: Linsen Dong, Guanyu Gao, Xinyi Zhang, Liangyu Chen, Yonggang Wen

    Abstract: Model-Based Reinforcement Learning (MBRL) is one category of Reinforcement Learning (RL) algorithms which can improve sampling efficiency by modeling and approximating system dynamics. It has been widely adopted in the research of robotics, autonomous driving, etc. Despite its popularity, there still lacks some sophisticated and reusable open-source frameworks to facilitate MBRL research and exper… ▽ More

    Submitted 15 March, 2021; v1 submitted 23 April, 2019; originally announced April 2019.

  13. arXiv:1902.06036  [pdf, other

    stat.ME

    Assessing Biosimilarity using Functional Metrics

    Authors: Lin Dong, Sujit K. Ghosh

    Abstract: In recent years there have been a lot of interest to test for similarity between biological drug products, commonly known as biologics. Biologics are large and complex molecule drugs that are produced by living cells and hence these are sensitive to the environmental changes. In addition, biologics usually induce antibodies which raises the safety and efficacy issues. The manufacturing process is… ▽ More

    Submitted 19 February, 2019; v1 submitted 15 February, 2019; originally announced February 2019.

  14. arXiv:1807.08447  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    LinkNBed: Multi-Graph Representation Learning with Entity Linkage

    Authors: Rakshit Trivedi, Bunyamin Sisman, Jun Ma, Christos Faloutsos, Hongyuan Zha, Xin Luna Dong

    Abstract: Knowledge graphs have emerged as an important model for studying complex multi-relational data. This has given rise to the construction of numerous large scale but incomplete knowledge graphs encoding information extracted from various resources. An effective and scalable approach to jointly learn over multiple graphs and eventually construct a unified graph is a crucial next step for the success… ▽ More

    Submitted 23 July, 2018; originally announced July 2018.

    Comments: ACL 2018

  15. arXiv:1806.01264  [pdf, other

    cs.CL cs.AI cs.IR stat.ML

    OpenTag: Open Attribute Value Extraction from Product Profiles [Deep Learning, Active Learning, Named Entity Recognition]

    Authors: Guineng Zheng, Subhabrata Mukherjee, Xin Luna Dong, Feifei Li

    Abstract: Extraction of missing attribute values is to find values describing an attribute of interest from a free text input. Most past related work on extraction of missing attribute values work with a closed world assumption with the possible set of values known beforehand, or use dictionaries of values and hand-crafted features. How can we discover new attribute values that we have never seen before? Ca… ▽ More

    Submitted 6 October, 2018; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, London, UK, August 19-23, 2018

  16. arXiv:1805.09496  [pdf, other

    cs.LG cs.AI stat.ML

    Intelligent Trainer for Model-Based Reinforcement Learning

    Authors: Yuanlong Li, Linsen Dong, Xin Zhou, Yonggang Wen, Kyle Guan

    Abstract: Model-based reinforcement learning (MBRL) has been proposed as a promising alternative solution to tackle the high sampling cost challenge in the canonical reinforcement learning (RL), by leveraging a learned model to generate synthesized data for policy training purpose. The MBRL framework, nevertheless, is inherently limited by the convoluted process of jointly learning control policy and config… ▽ More

    Submitted 5 June, 2019; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: 13 pages

  17. arXiv:1712.00328  [pdf, other

    stat.ML cs.LG

    Group Sparse Bayesian Learning for Active Surveillance on Epidemic Dynamics

    Authors: Hongbin Pei, Bo Yang, Jiming Liu, Lei Dong

    Abstract: Predicting epidemic dynamics is of great value in understanding and controlling diffusion processes, such as infectious disease spread and information propagation. This task is intractable, especially when surveillance resources are very limited. To address the challenge, we study the problem of active surveillance, i.e., how to identify a small portion of system components as sentinels to effect… ▽ More

    Submitted 21 November, 2017; originally announced December 2017.