Showing 1–50 of 90 results for author: Chi, C

Searching in archive cs.
  1. arXiv:2505.03853  [pdf, other]

    q-bio.QM cs.AI cs.LG q-bio.GN

    GRAPE: Heterogeneous Graph Representation Learning for Genetic Perturbation with Coding and Non-Coding Biotype

    Authors: Changxi Chi, Jun Xia, Jingbo Zhou, Jiabei Cheng, Chang Yu, Stan Z. Li

    Abstract: Predicting genetic perturbations enables the identification of potentially crucial genes prior to wet-lab experiments, significantly improving overall experimental efficiency. Since genes are the foundation of cellular life, building gene regulatory networks (GRN) is essential to understand and predict the effects of genetic perturbations. However, current methods fail to fully leverage gene-relat…

    Submitted 5 May, 2025; originally announced May 2025.

  2. arXiv:2505.03652  [pdf, other]

    cs.LG physics.comp-ph physics.data-an q-bio.QM stat.ML

    Mitigating mode collapse in normalizing flows by annealing with an adaptive schedule: Application to parameter estimation

    Authors: Yihang Wang, Chris Chi, Aaron R. Dinner

    Abstract: Normalizing flows (NFs) provide uncorrelated samples from complex distributions, making them an appealing tool for parameter estimation. However, the practical utility of NFs remains limited by their tendency to collapse to a single mode of a multimodal distribution. In this study, we show that annealing with an adaptive schedule based on the effective sample size (ESS) can mitigate mode collapse.…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 19 pages, 10 figures

  3. arXiv:2504.14757  [pdf, other]

    cs.SE cs.AI

    SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs

    Authors: Minh V. T. Pham, Huy N. Phan, Hoang N. Phan, Cuong Le Chi, Tien N. Nguyen, Nghi D. Q. Bui

    Abstract: Large language models (LLMs) are transforming automated program repair (APR) through agent-based approaches that localize bugs, generate patches, and verify fixes. However, the lack of high-quality, scalable training datasets, especially those with verifiable outputs and intermediate reasoning traces, limits progress, particularly for open-source models. In this work, we present SWE-Synth, a framew…

    Submitted 20 April, 2025; originally announced April 2025.

    Comments: Work in progress

  4. arXiv:2503.14381  [pdf, ps, other]

    stat.ML cs.LG math.ST stat.ME

    Optimizing High-Dimensional Oblique Splits

    Authors: Chien-Ming Chi

    Abstract: Orthogonal-split trees perform well, but evidence suggests oblique splits can enhance their performance. This paper explores optimizing high-dimensional $s$-sparse oblique splits from $\{(\vec{w}, \vec{w}^{\top}\boldsymbol{X}_{i}) : i\in \{1,\dots, n\}, \vec{w} \in \mathbb{R}^p, \| \vec{w} \|_{2} = 1, \| \vec{w} \|_{0} \leq s \}$ for growing oblique trees, where $ s $ is a user-defined sparsity pa…

    Submitted 18 March, 2025; originally announced March 2025.

    Comments: 79 pages, 9 tables

  5. arXiv:2503.08317  [pdf, other]

    cs.RO cs.NI

    Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios

    Authors: Zikang Yuan, Yuechuan Pu, Hongcheng Luo, Fengtian Lang, Cheng Chi, Teng Li, Yingying Shen, Haiyang Sun, Bing Wang, Xin Yang

    Abstract: Ensuring the safety of autonomous vehicles necessitates comprehensive simulation of multi-sensor data, encompassing inputs from both cameras and LiDAR sensors, across various dynamic driving scenarios. Neural rendering techniques, which utilize collected raw sensor data to simulate these dynamic environments, have emerged as a leading methodology. While NeRF-based approaches can uniformly represen…

    Submitted 24 March, 2025; v1 submitted 11 March, 2025; originally announced March 2025.

    Comments: 10 pages

  6. arXiv:2412.16859  [pdf, other]

    cs.CV cs.AI

    Adversarially Domain-adaptive Latent Diffusion for Unsupervised Semantic Segmentation

    Authors: Jongmin Yu, Zhongtian Sun, Chen Bene Chi, Jinhong Yang, Shan Luo

    Abstract: Semantic segmentation requires extensive pixel-level annotation, motivating unsupervised domain adaptation (UDA) to transfer knowledge from labelled source domains to unlabelled or weakly labelled target domains. One of the most efficient strategies involves using synthetic datasets generated within controlled virtual environments, such as video games or traffic simulators, which can automatically…

    Submitted 6 April, 2025; v1 submitted 21 December, 2024; originally announced December 2024.

    Comments: Accepted from CVPR 2025 Workshop PVUW

  7. arXiv:2412.12213  [pdf, other]

    cs.LG q-fin.CP stat.ML

    The AI Black-Scholes: Finance-Informed Neural Network

    Authors: Amine M. Aboussalah, Xuanze Li, Cheng Chi, Raj Patel

    Abstract: In the realm of option pricing, existing models are typically classified into principle-driven methods, such as solving partial differential equations (PDEs) that the pricing function satisfies, and data-driven approaches, such as machine learning (ML) techniques that parameterize the pricing function directly. While principle-driven models offer a rigorous theoretical framework, they often rely on un…

    Submitted 15 December, 2024; originally announced December 2024.

  8. arXiv:2412.04455  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

    Authors: Enshen Zhou, Qi Su, Cheng Chi, Zhizheng Zhang, Zhongyuan Wang, Tiejun Huang, Lu Sheng, He Wang

    Abstract: Automatic detection and prevention of open-set failures are crucial in closed-loop robotic systems. Recent studies often struggle to simultaneously identify unexpected failures reactively after they occur and prevent foreseeable ones proactively. To this end, we propose Code-as-Monitor (CaM), a novel paradigm leveraging the vision-language model (VLM) for both open-set reactive and proactive failu…

    Submitted 21 March, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: Accepted by CVPR 2025. Project page: https://zhoues.github.io/Code-as-Monitor/

  9. arXiv:2411.03351  [pdf, other]

    cs.CR cs.AI cs.DB

    Tabular Data Synthesis with Differential Privacy: A Survey

    Authors: Mengmeng Yang, Chi-Hung Chi, Kwok-Yan Lam, Jie Feng, Taolin Guo, Wei Ni

    Abstract: Data sharing is a prerequisite for collaborative innovation, enabling organizations to leverage diverse datasets for deeper insights. In real-world applications like FinTech and Smart Manufacturing, transactional data, often in tabular form, are generated and analyzed for insight generation. However, such datasets typically contain sensitive personal/business information, raising privacy concerns…

    Submitted 4 November, 2024; originally announced November 2024.

  10. arXiv:2410.09309  [pdf, other]

    cs.RO

    Adaptive Compliance Policy: Learning Approximate Compliance for Diffusion Guided Control

    Authors: Yifan Hou, Zeyi Liu, Cheng Chi, Eric Cousineau, Naveen Kuppuswamy, Siyuan Feng, Benjamin Burchfiel, Shuran Song

    Abstract: Compliance plays a crucial role in manipulation, as it balances between the concurrent control of position and force under uncertainties. Yet compliance is often overlooked by today's visuomotor policies that solely focus on position control. This paper introduces Adaptive Compliance Policy (ACP), a novel framework that learns to dynamically adjust system compliance both spatially and temporally f…

    Submitted 6 March, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

  11. arXiv:2410.05739  [pdf, other]

    cs.SD cs.AI eess.AS

    Array2BR: An End-to-End Noise-immune Binaural Audio Synthesis from Microphone-array Signals

    Authors: Cheng Chi, Xiaoyu Li, Andong Li, Yuxuan Ke, Xiaodong Li, Chengshi Zheng

    Abstract: Telepresence technology aims to provide an immersive virtual presence for remote conference applications, and it is extremely important to synthesize high-quality binaural audio signals for this aim. Because the ambient noise is often inevitable in practical application scenarios, it is highly desired that binaural audio signals without noise can be obtained from microphone-array signals directly.…

    Submitted 8 October, 2024; originally announced October 2024.

  12. arXiv:2408.05838  [pdf]

    cs.RO

    RALTPER: A Risk-Aware Local Trajectory Planner for Complex Environment with Gaussian Uncertainty

    Authors: Cheng Chi

    Abstract: In this paper, we propose a novel Risk-Aware Local Trajectory Planner (RALTPER) for autonomous vehicles in complex environments characterized by Gaussian uncertainty. The proposed method integrates risk awareness and trajectory planning by leveraging probabilistic models to evaluate the likelihood of collisions with dynamic and static obstacles. The RALTPER focuses on collision avoidance constrain…

    Submitted 11 August, 2024; originally announced August 2024.

  13. arXiv:2408.05776  [pdf]

    cs.NI eess.SP

    Convergence of Symbiotic Communications and Blockchain for Sustainable and Trustworthy 6G Wireless Networks

    Authors: Haoxiang Luo, Gang Sun, Cheng Chi, Hongfang Yu, Mohsen Guizani

    Abstract: Symbiotic communication (SC) is known as a new wireless communication paradigm, similar to the natural ecosystem population, and can enable multiple communication systems to cooperate and mutualize through service exchange and resource sharing. As a result, SC is seen as an important potential technology for future sixth-generation (6G) communications, solving the problem of lack of spectrum resou…

    Submitted 11 August, 2024; originally announced August 2024.

  14. arXiv:2407.15208  [pdf, other]

    cs.RO cs.AI

    Flow as the Cross-Domain Manipulation Interface

    Authors: Mengda Xu, Zhenjia Xu, Yinghao Xu, Cheng Chi, Gordon Wetzstein, Manuela Veloso, Shuran Song

    Abstract: We present Im2Flow2Act, a scalable learning framework that enables robots to acquire real-world manipulation skills without the need of real-world robot training data. The key idea behind Im2Flow2Act is to use object flow as the manipulation interface, bridging domain gaps between different embodiments (i.e., human and robot) and training environments (i.e., real-world and simulated). Im2Flow2Act…

    Submitted 4 October, 2024; v1 submitted 21 July, 2024; originally announced July 2024.

    Comments: Conference on Robot Learning 2024

  15. arXiv:2406.19464  [pdf, other]

    cs.RO cs.AI cs.CV cs.SD eess.AS

    ManiWAV: Learning Robot Manipulation from In-the-Wild Audio-Visual Data

    Authors: Zeyi Liu, Cheng Chi, Eric Cousineau, Naveen Kuppuswamy, Benjamin Burchfiel, Shuran Song

    Abstract: Audio signals provide rich information for the robot interaction and object properties through contact. This information can surprisingly ease the learning of contact-rich robot manipulation skills, especially when the visual information alone is ambiguous or incomplete. However, the usage of audio data in robot manipulation has been constrained to teleoperated demonstrations collected by either a…

    Submitted 3 November, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: Conference on Robot Learning (CoRL) 2024; Project website: https://maniwav.github.io/

  16. arXiv:2406.12229  [pdf, other]

    cs.AI cs.LG

    Spatially Resolved Gene Expression Prediction from Histology via Multi-view Graph Contrastive Learning with HSIC-bottleneck Regularization

    Authors: Changxi Chi, Hang Shi, Qi Zhu, Daoqiang Zhang, Wei Shao

    Abstract: The rapid development of spatial transcriptomics (ST) enables the measurement of gene expression at spatial resolution, making it possible to simultaneously profile the gene expression, spatial locations of spots, and the matched histopathological images. However, the cost for collecting ST data is much higher than acquiring histopathological images, and thus several studies attempt to predict the…

    Submitted 17 June, 2024; originally announced June 2024.

  17. arXiv:2404.10147  [pdf, other]

    cs.CV

    Eyes on the Streets: Leveraging Street-Level Imaging to Model Urban Crime Dynamics

    Authors: Zhixuan Qi, Huaiying Luo, Chen Chi

    Abstract: This study addresses the challenge of urban safety in New York City by examining the relationship between the built environment and crime rates using machine learning and a comprehensive dataset of street view images. We aim to identify how urban landscapes correlate with crime statistics, focusing on the characteristics of street views and their association with crime rates. The findings offer in…

    Submitted 15 April, 2024; originally announced April 2024.

  18. arXiv:2404.00611  [pdf, ps, other]

    cs.CV

    Object-level Copy-Move Forgery Image Detection based on Inconsistency Mining

    Authors: Jingyu Wang, Niantai Jing, Ziyao Liu, Jie Nie, Yuxin Qi, Chi-Hung Chi, Kwok-Yan Lam

    Abstract: In copy-move tampering operations, perpetrators often employ techniques, such as blurring, to conceal tampering traces, posing significant challenges to the detection of object-level targets with intact structures. Focusing on these challenges, this paper proposes an Object-level Copy-Move Forgery Image Detection based on Inconsistency Mining (IMNet). To obtain complete object-level targets, we custo…

    Submitted 3 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: 4 pages, 2 figures, Accepted to WWW 2024

  19. arXiv:2403.16446  [pdf, other]

    cs.CL

    Towards Automatic Evaluation for LLMs' Clinical Capabilities: Metric, Data, and Algorithm

    Authors: Lei Liu, Xiaoyan Yang, Fangzhou Li, Chenfei Chi, Yue Shen, Shiwei Lyu, Ming Zhang, Xiaowei Ma, Xiangguo Lyu, Liya Ma, Zhiqiang Zhang, Wei Xue, Yiran Huang, Jinjie Gu

    Abstract: Large language models (LLMs) are gaining increasing interests to improve clinical efficiency for medical diagnosis, owing to their unprecedented performance in modelling natural language. Ensuring the safe and reliable clinical applications, the evaluation of LLMs indeed becomes critical for better mitigating the potential risks, e.g., hallucinations. However, current evaluation methods heavily re…

    Submitted 25 March, 2024; originally announced March 2024.

  20. arXiv:2403.12945  [pdf, other]

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park, et al. (76 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu…

    Submitted 22 April, 2025; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  21. arXiv:2403.09566  [pdf, other]

    cs.RO

    PaperBot: Learning to Design Real-World Tools Using Paper

    Authors: Ruoshi Liu, Junbang Liang, Sruthi Sudhakar, Huy Ha, Cheng Chi, Shuran Song, Carl Vondrick

    Abstract: Paper is a cheap, recyclable, and clean material that is often used to make practical tools. Traditional tool design either relies on simulation or physical analysis, which is often inaccurate and time-consuming. In this paper, we propose PaperBot, an approach that directly learns to design and use a tool in the real world using paper without human intervention. We demonstrated the effectiveness a…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Project Website: https://paperbot.cs.columbia.edu/

  22. arXiv:2403.09096  [pdf, other]

    eess.IV cs.CV

    Deep unfolding Network for Hyperspectral Image Super-Resolution with Automatic Exposure Correction

    Authors: Yuan Fang, Yipeng Liu, Jie Chen, Zhen Long, Ao Li, Chong-Yung Chi, Ce Zhu

    Abstract: In recent years, the fusion of high spatial resolution multispectral image (HR-MSI) and low spatial resolution hyperspectral image (LR-HSI) has been recognized as an effective method for HSI super-resolution (HSI-SR). However, both HSI and MSI may be acquired under extreme conditions such as night or poorly illuminating scenarios, which may cause different exposure levels, thereby seriously downgr…

    Submitted 14 March, 2024; originally announced March 2024.

  23. arXiv:2403.02814  [pdf, other]

    cs.LG cs.AI

    InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting

    Authors: Ce Chi, Xing Wang, Kexin Yang, Zhiyan Song, Di Jin, Lin Zhu, Chao Deng, Junlan Feng

    Abstract: Transformer has become one of the most popular architectures for multivariate time series (MTS) forecasting. Recent Transformer-based MTS models generally prefer channel-independent structures with the observation that channel independence can alleviate noise and distribution drift issues, leading to more robustness. Nevertheless, it is essential to note that channel dependency remains an inherent…

    Submitted 5 March, 2024; originally announced March 2024.

  24. arXiv:2402.14840  [pdf, other]

    cs.CL cs.AI stat.AP

    RJUA-MedDQA: A Multimodal Benchmark for Medical Document Question Answering and Clinical Reasoning

    Authors: Congyun Jin, Ming Zhang, Xiaowei Ma, Li Yujiao, Yingbo Wang, Yabo Jia, Yuliang Du, Tao Sun, Haowen Wang, Cong Fan, Jinjie Gu, Chenfei Chi, Xiangguo Lv, Fangzhou Li, Wei Xue, Yiran Huang

    Abstract: Recent advancements in Large Language Models (LLMs) and Large Multi-modal Models (LMMs) have shown potential in various medical applications, such as Intelligent Medical Diagnosis. Although impressive results have been achieved, we find that existing benchmarks do not reflect the complexity of real medical reports and specialized in-depth reasoning capabilities. In this work, we introduced RJUA-Me…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 15 pages, 13 figures

  25. arXiv:2402.10329  [pdf, other]

    cs.RO

    Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

    Authors: Cheng Chi, Zhenjia Xu, Chuer Pan, Eric Cousineau, Benjamin Burchfiel, Siyuan Feng, Russ Tedrake, Shuran Song

    Abstract: We present Universal Manipulation Interface (UMI) -- a data collection and policy learning framework that allows direct skill transfer from in-the-wild human demonstrations to deployable robot policies. UMI employs hand-held grippers coupled with careful interface design to enable portable, low-cost, and information-rich data collection for challenging bimanual and dynamic manipulation demonstrati…

    Submitted 5 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Project website: https://umi-gripper.github.io

  26. arXiv:2402.04064  [pdf, other]

    cs.CV cs.AI

    Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing

    Authors: Jongmin Yu, Chen Bene Chi, Sebastiano Fichera, Paolo Paoletti, Devansh Mehta, Shan Luo

    Abstract: Road pavement detection and segmentation are critical for developing autonomous road repair systems. However, developing an instance segmentation method that simultaneously performs multi-class defect detection and segmentation is challenging due to the textural simplicity of road pavement image, the diversity of defect geometries, and the morphological ambiguity between classes. We propose a nove…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted to the ICRA 2024

  27. arXiv:2401.01836  [pdf, other]

    cs.AI

    Neural Control: Concurrent System Identification and Control Learning with Neural ODE

    Authors: Cheng Chi

    Abstract: Controlling continuous-time dynamical systems is generally a two step process: first, identify or model the system dynamics with differential equations, then, minimize the control objectives to achieve optimal control function and optimal state trajectories. However, any inaccuracy in dynamics modeling will lead to sub-optimality in the resulting control function. To address this, we propose a neu…

    Submitted 22 April, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: 9 pages, code open sourced in format of Google Colab notebooks; Resubmitted for adding missed references in the last submission

  28. arXiv:2312.09785  [pdf, other]

    cs.CL

    RJUA-QA: A Comprehensive QA Dataset for Urology

    Authors: Shiwei Lyu, Chenfei Chi, Hongbo Cai, Lei Shi, Xiaoyan Yang, Lei Liu, Xiang Chen, Deng Zhao, Zhiqiang Zhang, Xianguo Lyu, Ming Zhang, Fangzhou Li, Xiaowei Ma, Yue Shen, Jinjie Gu, Wei Xue, Yiran Huang

    Abstract: We introduce RJUA-QA, a novel medical dataset for question answering (QA) and reasoning with clinical evidence, contributing to bridge the gap between general large language models (LLMs) and medical-specific LLM applications. RJUA-QA is derived from realistic clinical scenarios and aims to facilitate LLMs in generating reliable diagnostic and advice. The dataset contains 2,132 curated Question-Co…

    Submitted 7 January, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: An initial version

  29. Privacy-preserving Federated Primal-dual Learning for Non-convex and Non-smooth Problems with Model Sparsification

    Authors: Yiwei Li, Chien-Wei Huang, Shuai Wang, Chong-Yung Chi, Tony Q. S. Quek

    Abstract: Federated learning (FL) has been recognized as a rapidly growing research area, where the model is trained over massively distributed clients under the orchestration of a parameter server (PS) without sharing clients' data. This paper delves into a class of federated problems characterized by non-convex and non-smooth loss functions, that are prevalent in FL applications but challenging to handle…

    Submitted 3 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 33 pages, 8 figures, 1 table. Accepted by IEEE Internet of Things Journal

  30. arXiv:2310.08864  [pdf, other]

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, et al. (269 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…

    Submitted 14 May, 2025; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  31. arXiv:2309.13733  [pdf, other]

    stat.ML cs.LG stat.CO

    Towards Tuning-Free Minimum-Volume Nonnegative Matrix Factorization

    Authors: Duc Toan Nguyen, Eric C. Chi

    Abstract: Nonnegative Matrix Factorization (NMF) is a versatile and powerful tool for discovering latent structures in data matrices, with many variations proposed in the literature. Recently, Leplat et al. (2019) introduced a minimum-volume NMF for the identifiable recovery of rank-deficient matrices in the presence of noise. The performance of their formulation, however, requires the selection of a tuni…

    Submitted 24 September, 2023; originally announced September 2023.

  32. arXiv:2307.16259  [pdf, ps, other]

    cs.IT cs.NI eess.SP

    Communication-Sensing Region for Cell-Free Massive MIMO ISAC Systems

    Authors: Weihao Mao, Yang Lu, Chong-Yung Chi, Bo Ai, Zhangdui Zhong, Zhiguo Ding

    Abstract: This paper investigates the system model and the transmit beamforming design for the Cell-Free massive multi-input multi-output (MIMO) integrated sensing and communication (ISAC) system. The impact of the uncertainty of the target locations on the propagation of wireless signals is considered during both uplink and downlink phases, and especially, the main statistics of the MIMO channel estimation…

    Submitted 30 July, 2023; originally announced July 2023.

  33. arXiv:2307.09955  [pdf, other]

    cs.RO cs.AI cs.LG

    XSkill: Cross Embodiment Skill Discovery

    Authors: Mengda Xu, Zhenjia Xu, Cheng Chi, Manuela Veloso, Shuran Song

    Abstract: Human demonstration videos are a widely available data source for robot learning and an intuitive user interface for expressing desired behavior. However, directly extracting reusable robot manipulation skills from unstructured human videos is challenging due to the big embodiment difference and unobserved action parameters. To bridge this embodiment gap, this paper introduces XSkill, an imitation…

    Submitted 28 September, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  34. GICI-LIB: A GNSS/INS/Camera Integrated Navigation Library

    Authors: Cheng Chi, Xin Zhang, Jiahui Liu, Yulong Sun, Zihao Zhang, Xingqun Zhan

    Abstract: Accurate navigation is essential for autonomous robots and vehicles. In recent years, the integration of the Global Navigation Satellite System (GNSS), Inertial Navigation System (INS), and camera has garnered considerable attention due to its robustness and high accuracy in diverse environments. However, leveraging the full capacity of GNSS is cumbersome because of the diverse choices of formulat…

    Submitted 12 November, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: Open-source: https://github.com/chichengcn/gici-open. Preprint version on Robotics and Automation Letters (RAL)

  35. arXiv:2306.00275  [pdf, other]

    cs.DC

    A Comprehensive Survey on Orbital Edge Computing: Systems, Applications, and Algorithms

    Authors: Changhao Wu, Yuanchun Li, Mengwei Xu, Chongbin Guo, Zengshan Yin, Weiwei Gao, Chuanxiu Chi

    Abstract: The number of satellites, especially those operating in low-earth orbit (LEO), is exploding in recent years. Additionally, the use of COTS hardware into those satellites enables a new paradigm of computing: orbital edge computing (OEC). OEC entails more technically advanced steps compared to single-satellite computing. This feature allows for vast design spaces with multiple parameters, rendering…

    Submitted 1 June, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: 18 pages, 9 figures and 5 tables

    MSC Class: 68M14 ACM Class: C.2.4

  36. arXiv:2304.13940  [pdf, other]

    stat.ML cs.LG

    A Majorization-Minimization Gauss-Newton Method for 1-Bit Matrix Completion

    Authors: Xiaoqian Liu, Xu Han, Eric C. Chi, Boaz Nadler

    Abstract: In 1-bit matrix completion, the aim is to estimate an underlying low-rank matrix from a partial set of binary observations. We propose a novel method for 1-bit matrix completion called Majorization-Minimization Gauss-Newton (MMGN). Our method is based on the majorization-minimization principle, which converts the original optimization problem into a sequence of standard low-rank matrix completion…

    Submitted 23 September, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: 30 pages, 7 figures

  37. arXiv:2304.03292  [pdf, other]

    cs.LG

    SE-shapelets: Semi-supervised Clustering of Time Series Using Representative Shapelets

    Authors: Borui Cai, Guangyan Huang, Shuiqiao Yang, Yong Xiang, Chi-Hung Chi

    Abstract: Shapelets that discriminate time series using local features (subsequences) are promising for time series clustering. Existing time series clustering methods may fail to capture representative shapelets because they discover shapelets from a large pool of uninformative subsequences, and thus result in low clustering accuracy. This paper proposes a Semi-supervised Clustering of Time Series Using Re…

    Submitted 14 November, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

  38. arXiv:2303.09858  [pdf, other]

    eess.IV cs.CR cs.CV cs.MM

    Preventing Unauthorized AI Over-Analysis by Medical Image Adversarial Watermarking

    Authors: Xingxing Wei, Bangzheng Pu, Shiji Zhao, Chen Chi, Huazhu Fu

    Abstract: The advancement of deep learning has facilitated the integration of Artificial Intelligence (AI) into clinical practices, particularly in computer-aided diagnosis. Given the pivotal role of medical images in various diagnostic procedures, it becomes imperative to ensure the responsible and secure utilization of AI techniques. However, the unauthorized utilization of AI for image analysis raises si…

    Submitted 13 September, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

  39. arXiv:2303.04137  [pdf, other]

    cs.RO

    Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

    Authors: Cheng Chi, Zhenjia Xu, Siyuan Feng, Eric Cousineau, Yilun Du, Benjamin Burchfiel, Russ Tedrake, Shuran Song

    Abstract: This paper introduces Diffusion Policy, a new way of generating robot behavior by representing a robot's visuomotor policy as a conditional denoising diffusion process. We benchmark Diffusion Policy across 12 different tasks from 4 different robot manipulation benchmarks and find that it consistently outperforms existing state-of-the-art robot learning methods with an average improvement of 46.9%.…

    Submitted 14 March, 2024; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: An extended journal version of the original RSS2023 paper

  40. arXiv:2303.02454  [pdf, other]

    cs.CV

    Exploiting Implicit Rigidity Constraints via Weight-Sharing Aggregation for Scene Flow Estimation from Point Clouds

    Authors: Yun Wang, Cheng Chi, Xin Yang

    Abstract: Scene flow estimation, which predicts the 3D motion of scene points from point clouds, is a core task in autonomous driving and many other 3D vision applications. Existing methods either suffer from structure distortion due to ignorance of rigid motion consistency or require explicit pose estimation and 3D object segmentation. Errors of estimated poses and segmented objects would yield inaccurate…

    Submitted 1 April, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

  41. arXiv:2302.11553  [pdf, other]

    cs.RO

    RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects

    Authors: Zhenjia Xu, Zhou Xian, Xingyu Lin, Cheng Chi, Zhiao Huang, Chuang Gan, Shuran Song

    Abstract: We introduce RoboNinja, a learning-based cutting system for multi-material objects (i.e., soft objects with rigid cores such as avocados or mangos). In contrast to prior works using open-loop cutting actions to cut through single-material objects (e.g., slicing a cucumber), RoboNinja aims to remove the soft part of an object while preserving the rigid core, thereby maximizing the yield. To achieve…

    Submitted 22 February, 2023; originally announced February 2023.

  42. Robust Extrinsic Self-Calibration of Camera and Solid State LiDAR

    Authors: Jiahui Liu, Xingqun Zhan, Cheng Chi, Xin Zhang, Chuanrun Zhai

    Abstract: This letter proposes an extrinsic calibration approach for a pair of monocular camera and prism-spinning solid-state LiDAR. The unique characteristics of the point cloud measured resulting from the flower-like scanning pattern is first disclosed as the vacant points, a type of outlier between foreground target and background objects. Unlike existing method using only depth continuous measurements,…

    Submitted 13 February, 2023; originally announced February 2023.

    Journal ref: Journal of Intelligent & Robotic Systems. 109 (2023) 81

  43. Differentially Private Federated Clustering over Non-IID Data

    Authors: Yiwei Li, Shuai Wang, Chong-Yung Chi, Tony Q. S. Quek

    Abstract: In this paper, we investigate federated clustering (FedC) problem, that aims to accurately partition unlabeled data samples distributed over massive clients into finite clusters under the orchestration of a parameter server, meanwhile considering data privacy. Though it is an NP-hard optimization problem involving real variables denoting cluster centroids and binary variables denoting the cluster…

    Submitted 30 October, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

    Comments: 34 pages, 4 figures, 1 table

  44. arXiv:2210.09347  [pdf, other]

    cs.RO

    Cloth Funnels: Canonicalized-Alignment for Multi-Purpose Garment Manipulation

    Authors: Alper Canberk, Cheng Chi, Huy Ha, Benjamin Burchfiel, Eric Cousineau, Siyuan Feng, Shuran Song

    Abstract: Automating garment manipulation is challenging due to extremely high variability in object configurations. To reduce this intrinsic variation, we introduce the task of "canonicalized-alignment" that simplifies downstream applications by reducing the possible garment configurations. This task can be considered as "cloth state funnel" that manipulates arbitrarily configured clothing items into a pre…

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 8 pages, 8 figures, website at https://clothfunnels.cs.columbia.edu/

    ACM Class: I.2.9

  45. arXiv:2207.02891  [pdf, other]

    cs.LG cs.AI

    Don't overfit the history -- Recursive time series data augmentation

    Authors: Amine Mohamed Aboussalah, Min-Jae Kwon, Raj G Patel, Cheng Chi, Chi-Guhn Lee

    Abstract: Time series observations can be seen as realizations of an underlying dynamical system governed by rules that we typically do not know. For time series learning tasks, we need to understand that we fit our model on available data, which is a unique realized history. Training on a single realization often induces severe overfitting lacking generalization. To address this issue, we introduce a gener…

    Submitted 28 January, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: Accepted to ICLR 2023. Resubmitted here due to major change in proofs following conference submission

  46. arXiv:2207.01678  [pdf, other]

    stat.ML cs.LG math.ST

    FACT: High-Dimensional Random Forests Inference

    Authors: Chien-Ming Chi, Yingying Fan, Jinchi Lv

    Abstract: Quantifying the usefulness of individual features in random forests learning can greatly enhance its interpretability. Existing studies have shown that some popularly used feature importance measures for random forests suffer from the bias issue. In addition, there lack comprehensive size and power analyses for most of these existing methods. In this paper, we approach the problem via hypothesis t…

    Submitted 12 November, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: 42 pages, 3 figures

  47. arXiv:2206.02743  [pdf, other]

    cs.IR

    A Neural Corpus Indexer for Document Retrieval

    Authors: Yujing Wang, Yingyan Hou, Haonan Wang, Ziming Miao, Shibin Wu, Hao Sun, Qi Chen, Yuqing Xia, Chengmin Chi, Guoshuai Zhao, Zheng Liu, Xing Xie, Hao Allen Sun, Weiwei Deng, Qi Zhang, Mao Yang

    Abstract: Current state-of-the-art document retrieval solutions mainly follow an index-retrieve paradigm, where the index is hard to be directly optimized for the final retrieval target. In this paper, we aim to show that an end-to-end deep neural network unifying training and indexing stages can significantly improve the recall performance of traditional methods. To this end, we propose Neural Corpus Index…

    Submitted 12 February, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 19 pages, 6 figures, accepted by NeurIPS 2022

  48. arXiv:2206.02568  [pdf, other]

    math.OC cs.AI cs.DM cs.LG

    A Deep Reinforcement Learning Framework For Column Generation

    Authors: Cheng Chi, Amine Mohamed Aboussalah, Elias B. Khalil, Juyoung Wang, Zoha Sherkat-Masoumi

    Abstract: Column Generation (CG) is an iterative algorithm for solving linear programs (LPs) with an extremely large number of variables (columns). CG is the workhorse for tackling large-scale \textit{integer} linear programs, which rely on CG to solve LP relaxations within a branch and price algorithm. Two canonical applications are the Cutting Stock Problem (CSP) and Vehicle Routing Problem with Time Wind…

    Submitted 12 January, 2023; v1 submitted 2 June, 2022; originally announced June 2022.

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), 2022

  49. arXiv:2204.12284  [pdf, other]

    cs.LG cs.CR

    Federated Stochastic Primal-dual Learning with Differential Privacy

    Authors: Yiwei Li, Shuai Wang, Tsung-Hui Chang, Chong-Yung Chi

    Abstract: Federated learning (FL) is a new paradigm that enables many clients to jointly train a machine learning (ML) model under the orchestration of a parameter server while keeping the local data not being exposed to any third party. However, the training of FL is an interactive process between local clients and the parameter server. Such process would cause privacy leakage since adversaries may retriev…

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: 18 pages, 6 figures

  50. arXiv:2203.12837  [pdf]

    cs.CR cs.DC

    Secure Multi-Party Delegated Authorisation For Access and Sharing of Electronic Health Records

    Authors: Kheng-Leong Tan, Chi-Hung Chi, Kwok-Yan Lam

    Abstract: Timely sharing of electronic health records (EHR) across providers is essential and significance in facilitating medical researches and prompt patients' care. With sharing, it is crucial that patients can control who can access their data and when, and guarantee the security and privacy of their data. In current literature, various system models, cryptographic techniques and access control mechani…

    Submitted 23 March, 2022; originally announced March 2022.