Skip to main content

Showing 1–50 of 66 results for author: Mi, L

.
  1. arXiv:2506.06205  [pdf, other

    cs.RO cs.AI

    Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning

    Authors: Sheng Chen, Peiyu He, Jiaxin Hu, Ziyang Liu, Yansheng Wang, Tao Xu, Chi Zhang, Chongchong Zhang, Chao An, Shiyu Cai, Duo Cao, Kangping Chen, Shuai Chu, Tianwei Chu, Mingdi Dan, Min Du, Weiwei Fang, Pengyou Fu, Junkai Hu, Xiaowei Jiang, Zhaodi Jiang, Fuxuan Li, Jun Li, Minghui Li, Mingyao Li , et al. (46 additional authors not shown)

    Abstract: Modern robot navigation systems encounter difficulties in diverse and complex indoor environments. Traditional approaches rely on multiple modules with small models or rule-based systems and thus lack adaptability to new environments. To address this, we developed Astra, a comprehensive dual-model architecture, Astra-Global and Astra-Local, for mobile robot navigation. Astra-Global, a multimodal L… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: Astra Technical Report

  2. arXiv:2505.05195  [pdf, other

    cs.LG cs.AI cs.CV

    Concept-Based Unsupervised Domain Adaptation

    Authors: Xinyue Xu, Yueying Hu, Hui Tang, Yi Qin, Lu Mi, Hao Wang, Xiaomeng Li

    Abstract: Concept Bottleneck Models (CBMs) enhance interpretability by explaining predictions through human-understandable concepts but typically assume that training and test data share the same distribution. This assumption often fails under domain shifts, leading to degraded performance and poor generalization. To address these limitations and improve the robustness of CBMs, we propose the Concept-based… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: Accepted by ICML 2025

  3. arXiv:2504.19730  [pdf, other

    cs.SE cs.CL

    Evaluate-and-Purify: Fortifying Code Language Models Against Adversarial Attacks Using LLM-as-a-Judge

    Authors: Wenhan Mu, Ling Xu, Shuren Pei, Le Mi, Huichi Zhou

    Abstract: The widespread adoption of code language models in software engineering tasks has exposed vulnerabilities to adversarial attacks, especially the identifier substitution attacks. Although existing identifier substitution attackers demonstrate high success rates, they often produce adversarial examples with unnatural code patterns. In this paper, we systematically assess the quality of adversarial e… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 25 pages, 6 figures

  4. arXiv:2504.01616  [pdf, other

    astro-ph.IM astro-ph.GA astro-ph.SR

    The Mini-SiTian Array: Imaging Processing Pipeline

    Authors: Kai Xiao, Zhirui Li, Yang Huang, Jie Zheng, Haibo Yuan, Junju Du, Linying Mi, Hongrui Gu, Yongkang Sun, Bowen Zhang, Shunxuan He, Henggeng Han, Min He, Ruifeng Shi, Yu Zhang, Chuanjie Zheng, Zexi Niu, Guiting Tian, Hu Zou, Yongna Mao, Hong Wu, Jifeng Liu

    Abstract: As a pathfinder of the SiTian project, the Mini-SiTian (MST) array, employed three commercial CMOS cameras, represents a next-generation, cost-effective optical time-domain survey project. This paper focuses primarily on the precise data processing pipeline designed for wide-field, CMOS-based devices, including the removal of instrumental effects, astrometry, photometry, and flux calibration. When… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 24 pages, 17 figures, 1 table, accepted for publication in a special issue of Research in Astronomy and Astrophysics on the Mini-SiTian Array, see main results in Figures 11, 12, and 15

  5. arXiv:2504.00488  [pdf, other

    q-bio.MN math.OC

    Dynamical model-based experiment design for drug repositioning

    Authors: Atte Aalto, La Mi, Diego A. Blanco-Mora, Jorge Goncalves

    Abstract: Computational methods in drug repositioning can help to conserve resources. In particular, methods based on biological networks are showing promise. Considering only the network topology and knowledge on drug target genes is not sufficient for quantitative predictions or predictions involving drug combinations. We propose an iterative procedure alternating between system identification and drug re… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

  6. arXiv:2503.20871  [pdf, other

    cs.CV cs.AI cs.CL

    VinaBench: Benchmark for Faithful and Consistent Visual Narratives

    Authors: Silin Gao, Sheryl Mathew, Li Mi, Sepideh Mamooler, Mengjie Zhao, Hiromi Wakaki, Yuki Mitsufuji, Syrielle Montariol, Antoine Bosselut

    Abstract: Visual narrative generation transforms textual narratives into sequences of images illustrating the content of the text. However, generating visual narratives that are faithful to the input text and self-consistent across generated images remains an open challenge, due to the lack of knowledge constraints used for planning the stories. In this work, we propose a new benchmark, VinaBench, to addres… ▽ More

    Submitted 3 April, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

    Comments: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025)

  7. arXiv:2502.00266  [pdf, other

    cs.CV cs.LG

    MCM: Multi-layer Concept Map for Efficient Concept Learning from Masked Images

    Authors: Yuwei Sun, Lu Mi, Ippei Fujisawa, Ryota Kanai

    Abstract: Masking strategies commonly employed in natural language processing are still underexplored in vision tasks such as concept learning, where conventional methods typically rely on full images. However, using masked images diversifies perceptual inputs, potentially offering significant advantages in concept learning with large-scale Transformer models. To this end, we propose Multi-layer Concept Map… ▽ More

    Submitted 31 January, 2025; originally announced February 2025.

  8. arXiv:2501.00803  [pdf, other

    cs.CL cs.AI

    Reasoning-Oriented and Analogy-Based Methods for Locating and Editing in Zero-Shot Event-Relational Reasoning

    Authors: Jingyao Tang, Lishuang Li, Liteng Mi, Haiming Wu, Hongbin Lu

    Abstract: Zero-shot event-relational reasoning is an important task in natural language processing, and existing methods jointly learn a variety of event-relational prefixes and inference-form prefixes to achieve such tasks. However, training prefixes consumes large computational resources and lacks interpretability. Additionally, learning various relational and inferential knowledge inefficiently exploits… ▽ More

    Submitted 1 January, 2025; originally announced January 2025.

  9. arXiv:2412.02529  [pdf, other

    q-bio.NC cs.LG stat.ML

    Active learning of neural population dynamics using two-photon holographic optogenetics

    Authors: Andrew Wagenmaker, Lu Mi, Marton Rozsa, Matthew S. Bull, Karel Svoboda, Kayvon Daie, Matthew D. Golub, Kevin Jamieson

    Abstract: Recent advances in techniques for monitoring and perturbing neural populations have greatly enhanced our ability to study circuits in the brain. In particular, two-photon holographic optogenetics now enables precise photostimulation of experimenter-specified groups of individual neurons, while simultaneous two-photon calcium imaging enables the measurement of ongoing and induced activity across th… ▽ More

    Submitted 8 May, 2025; v1 submitted 3 December, 2024; originally announced December 2024.

    Comments: NeurIPS 2024

  10. arXiv:2411.14816  [pdf, other

    cs.CV cs.RO eess.IV

    Unsupervised Multi-view UAV Image Geo-localization via Iterative Rendering

    Authors: Haoyuan Li, Chang Xu, Wen Yang, Li Mi, Huai Yu, Haijian Zhang

    Abstract: Unmanned Aerial Vehicle (UAV) Cross-View Geo-Localization (CVGL) presents significant challenges due to the view discrepancy between oblique UAV images and overhead satellite images. Existing methods heavily rely on the supervision of labeled datasets to extract viewpoint-invariant features for cross-view retrieval. However, these methods have expensive training costs and tend to overfit the regio… ▽ More

    Submitted 22 November, 2024; originally announced November 2024.

    Comments: 13 pages

  11. arXiv:2411.14714  [pdf, other

    astro-ph.IM astro-ph.GA astro-ph.SR

    Mining double-line spectroscopic candidates in the LAMOST medium-resolution spectroscopic survey using human-AI hybrid method

    Authors: Shan-shan Li, Chun-qian Li, Chang-hua Li, Dong-wei Fan, Yun-fei Xu, Lin-ying Mi, Chen-zhou Cui, Jian-rong Shi

    Abstract: We utilize a hybrid approach that integrates the traditional cross-correlation function (CCF) and machine learning to detect spectroscopic multi-systems, specifically focusing on double-line spectroscopic binary (SB2). Based on the ninth data release (DR9) of the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST), which includes a medium-resolution survey (MRS) containing 29,920,58… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: 18 pages, 11 figures, accepted by ApJS, Data available via China-VO PaperData repository

  12. arXiv:2411.00915  [pdf, other

    cs.CV cs.AI

    Empower Vision Applications with LoRA LMM

    Authors: Liang Mi, Weijun Wang, Wenming Tu, Qingfeng He, Rui Kong, Xinyu Fang, Yazhu Dong, Yikang Zhang, Yunchun Li, Meng Li, Haipeng Dai, Guihai Chen, Yunxin Liu

    Abstract: Large Multimodal Models (LMMs) have shown significant progress in various complex vision tasks with the solid linguistic and reasoning capacity inherited from large language models (LMMs). Low-rank adaptation (LoRA) offers a promising method to integrate external knowledge into LMMs, compensating for their limitations on domain-specific tasks. However, the existing LoRA model serving is excessivel… ▽ More

    Submitted 3 April, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: EuroSys'2025

  13. arXiv:2410.21700  [pdf, ps, other

    math-ph math.SP

    Sharp palindromic criterion for semi-uniform dynamical localization

    Authors: Svetlana Jitomirskaya, Wencai Liu, Lufang Mi

    Abstract: We develop a sharp palindromic argument for general 1D operators, that proves absence of semi-uniform localization in the regime of exponential symmetry-based resonances. This provides the first examples of operators with dynamical localization but no SULE/SUDL, as well as with nearly uniform distribution of centers of localization in absence of SULE. For the almost Mathieu operators, this also le… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

  14. arXiv:2407.16990  [pdf, other

    cs.NI

    Region-based Content Enhancement for Efficient Video Analytics at the Edge

    Authors: Weijun Wang, Liang Mi, Shaowei Cen, Haipeng Dai, Yuanchun Li, Xiaoming Fu, Yunxin Liu

    Abstract: Video analytics is widespread in various applications serving our society. Recent advances of content enhancement in video analytics offer significant benefits for the bandwidth saving and accuracy improvement. However, existing content-enhanced video analytics systems are excessively computationally expensive and provide extremely low throughput. In this paper, we present region-based content enh… ▽ More

    Submitted 3 April, 2025; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: NSDI'25

  15. arXiv:2405.03373  [pdf, other

    cs.CV

    Knowledge-aware Text-Image Retrieval for Remote Sensing Images

    Authors: Li Mi, Xianjie Dai, Javiera Castillo-Navarro, Devis Tuia

    Abstract: Image-based retrieval in large Earth observation archives is challenging because one needs to navigate across thousands of candidate matches only with the query image as a guide. By using text as information supporting the visual query, the retrieval system gains in usability, but at the same time faces difficulties due to the diversity of visual signals that cannot be summarized by a short captio… ▽ More

    Submitted 25 October, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted by IEEE TGRS

  16. arXiv:2403.13965  [pdf, other

    cs.CV

    ConGeo: Robust Cross-view Geo-localization across Ground View Variations

    Authors: Li Mi, Chang Xu, Javiera Castillo-Navarro, Syrielle Montariol, Wen Yang, Antoine Bosselut, Devis Tuia

    Abstract: Cross-view geo-localization aims at localizing a ground-level query image by matching it to its corresponding geo-referenced aerial view. In real-world scenarios, the task requires accommodating diverse ground images captured by users with varying orientations and reduced field of views (FoVs). However, existing learning pipelines are orientation-specific or FoV-specific, demanding separate model… ▽ More

    Submitted 4 September, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: ECCV2024. Project page at https://eceo-epfl.github.io/ConGeo/

  17. arXiv:2402.12846  [pdf, other

    cs.CV cs.AI

    ConVQG: Contrastive Visual Question Generation with Multimodal Guidance

    Authors: Li Mi, Syrielle Montariol, Javiera Castillo-Navarro, Xianjie Dai, Antoine Bosselut, Devis Tuia

    Abstract: Asking questions about visual environments is a crucial way for intelligent agents to understand rich multi-faceted scenes, raising the importance of Visual Question Generation (VQG) systems. Apart from being grounded to the image, existing VQG systems can use textual constraints, such as expected answers or knowledge triplets, to generate focused questions. These constraints allow VQG systems to… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: AAAI 2024. Project page at https://limirs.github.io/ConVQG

  18. arXiv:2401.14142  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations

    Authors: Xinyue Xu, Yi Qin, Lu Mi, Hao Wang, Xiaomeng Li

    Abstract: Existing methods, such as concept bottleneck models (CBMs), have been successful in providing concept-based interpretations for black-box deep learning models. They typically work by predicting concepts given the input and then predicting the final class label given the predicted concepts. However, (1) they often fail to capture the high-order, nonlinear interaction between concepts, e.g., correct… ▽ More

    Submitted 30 December, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted by ICLR 2024

  19. arXiv:2312.15740  [pdf, other

    cs.NI cs.CV cs.LG

    BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge

    Authors: Lin Sun, Weijun Wang, Tingting Yuan, Liang Mi, Haipeng Dai, Yunxin Liu, Xiaoming Fu

    Abstract: High-definition (HD) cameras for surveillance and road traffic have experienced tremendous growth, demanding intensive computation resources for real-time analytics. Recently, offloading frames from the front-end device to the back-end edge server has shown great promise. In multi-stream competitive environments, efficient bandwidth management and proper scheduling are crucial to ensure both high… ▽ More

    Submitted 4 February, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Accepted by 2024 IEEE INFOCOM

  20. arXiv:2311.06928  [pdf, other

    cs.LG stat.ME

    Attention for Causal Relationship Discovery from Biological Neural Dynamics

    Authors: Ziyu Lu, Anika Tabassum, Shruti Kulkarni, Lu Mi, J. Nathan Kutz, Eric Shea-Brown, Seung-Hwan Lim

    Abstract: This paper explores the potential of the transformer models for learning Granger causality in networks with complex nonlinear dynamics at every node, as in neurobiological and biophysical networks. Our study primarily focuses on a proof-of-concept investigation based on simulated neural dynamics, for which the ground-truth causality is known through the underlying connectivity matrix. For transfor… ▽ More

    Submitted 23 November, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: Accepted to the NeurIPS 2023 Workshop on Causal Representation Learning

  21. arXiv:2311.02258  [pdf, other

    q-bio.NC cs.LG

    Learning Time-Invariant Representations for Individual Neurons from Population Dynamics

    Authors: Lu Mi, Trung Le, Tianxing He, Eli Shlizerman, Uygar Sümbül

    Abstract: Neurons can display highly variable dynamics. While such variability presumably supports the wide range of behaviors generated by the organism, their gene expressions are relatively stable in the adult brain. This suggests that neuronal activity is a combination of its time-invariant identity and the inputs the neuron receives from the rest of the circuit. Here, we propose a self-supervised learni… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023

  22. arXiv:2310.17892  [pdf, other

    astro-ph.IM

    Astronomical Knowledge Entity Extraction in Astrophysics Journal Articles via Large Language Models

    Authors: Wujun Shao, Pengli Ji, Dongwei Fan, Yaohua Hu, Xiaoran Yan, Chenzhou Cui, Linying Mi, Lang Chen, Rui Zhang

    Abstract: Astronomical knowledge entities, such as celestial object identifiers, are crucial for literature retrieval and knowledge graph construction, and other research and applications in the field of astronomy. Traditional methods of extracting knowledge entities from texts face challenges like high manual effort, poor generalization, and costly maintenance. Consequently, there is a pressing need for im… ▽ More

    Submitted 17 January, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  23. arXiv:2309.17157  [pdf, other

    cs.CL

    LatticeGen: A Cooperative Framework which Hides Generated Text in a Lattice for Privacy-Aware Generation on Cloud

    Authors: Mengke Zhang, Tianxing He, Tianle Wang, Lu Mi, Fatemehsadat Mireshghallah, Binyi Chen, Hao Wang, Yulia Tsvetkov

    Abstract: In the current user-server interaction paradigm of prompted generation with large language models (LLM) on cloud, the server fully controls the generation process, which leaves zero options for users who want to keep the generated text to themselves. We propose LatticeGen, a cooperative framework in which the server still handles most of the computation while the user controls the sampling operati… ▽ More

    Submitted 5 April, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  24. arXiv:2303.00882  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    X-Ray2EM: Uncertainty-Aware Cross-Modality Image Reconstruction from X-Ray to Electron Microscopy in Connectomics

    Authors: Yicong Li, Yaron Meirovitch, Aaron T. Kuan, Jasper S. Phelps, Alexandra Pacureanu, Wei-Chung Allen Lee, Nir Shavit, Lu Mi

    Abstract: Comprehensive, synapse-resolution imaging of the brain will be crucial for understanding neuronal computations and function. In connectomics, this has been the sole purview of volume electron microscopy (EM), which entails an excruciatingly difficult process because it requires cutting tissue into many thin, fragile slices that then need to be imaged, aligned, and reconstructed. Unlike EM, hard X-… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted by ISBI 2023 conference. Supplementary material is available in this arXiv version

  25. arXiv:2302.03819  [pdf, other

    cs.CV cs.LG q-bio.NC

    The XPRESS Challenge: Xray Projectomic Reconstruction -- Extracting Segmentation with Skeletons

    Authors: Tri Nguyen, Mukul Narwani, Mark Larson, Yicong Li, Shuhan Xie, Hanspeter Pfister, Donglai Wei, Nir Shavit, Lu Mi, Alexandra Pacureanu, Wei-Chung Lee, Aaron T. Kuan

    Abstract: The wiring and connectivity of neurons form a structural basis for the function of the nervous system. Advances in volume electron microscopy (EM) and image segmentation have enabled mapping of circuit diagrams (connectomics) within local regions of the mouse brain. However, applying volume EM over the whole brain is not currently feasible due to technological challenges. As a result, comprehensiv… ▽ More

    Submitted 24 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 6 pages, 2 figures

  26. arXiv:2301.08664  [pdf, other

    cs.CV cs.LG cs.MM

    AccDecoder: Accelerated Decoding for Neural-enhanced Video Analytics

    Authors: Tingting Yuan, Liang Mi, Weijun Wang, Haipeng Dai, Xiaoming Fu

    Abstract: The quality of the video stream is key to neural network-based video analytics. However, low-quality video is inevitably collected by existing surveillance systems because of poor quality cameras or over-compressed/pruned video streaming protocols, e.g., as a result of upstream bandwidth limit. To address this issue, existing studies use quality enhancers (e.g., neural super-resolution) to improve… ▽ More

    Submitted 24 January, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

    Comments: Accepted by 2023 IEEE INFOCOM

  27. arXiv:2301.04877  [pdf, other

    eess.SY

    Asymptotically stable polarization of multi-agent gradient flows over manifolds

    Authors: La Mi, Jorge Gonçalves, Johan Markdahl

    Abstract: Multi-agent systems are known to exhibit stable emergent behaviors, including polarization, over $\mathbb{R}^n$ or highly symmetric nonlinear spaces. In this article, we eschew linearity and symmetry of the underlying spaces, and study the stability of polarized equilibria of multi-agent gradient flows evolving on general hypermanifolds. The agents attract or repel each other according to the part… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

  28. Photometric redshift estimation of galaxies in the DESI Legacy Imaging Surveys

    Authors: Changhua Li, Yanxia Zhang, Chenzhou Cui, Dongwei Fan, Yongheng Zhao, Xue-Bing Wu, Jing-Yi Zhang, Yihan Tao, Jun Han, Yunfei Xu, Shanshan Li, Linying Mi, Boliang He, Zihan Kang, Youfen Wang, Hanxi Yang, Sisi Yang

    Abstract: The accurate estimation of photometric redshifts plays a crucial role in accomplishing science objectives of the large survey projects. The template-fitting and machine learning are the two main types of methods applied currently. Based on the training set obtained by cross-correlating the DESI Legacy Imaging Surveys DR9 galaxy catalogue and SDSS DR16 galaxy catalogue, the two kinds of methods are… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted for publication in MNRAS. 14 pages, 9 figures, 11 tables

  29. arXiv:2209.04061  [pdf, other

    cs.CV

    im2nerf: Image to Neural Radiance Field in the Wild

    Authors: Lu Mi, Abhijit Kundu, David Ross, Frank Dellaert, Noah Snavely, Alireza Fathi

    Abstract: We propose im2nerf, a learning framework that predicts a continuous neural object representation given a single input image in the wild, supervised by only segmentation output from off-the-shelf recognition methods. The standard approach to constructing neural radiance fields takes advantage of multi-view consistency and requires many calibrated views of a scene, a requirement that cannot be satis… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: 12 pages, 8 figures, 4 tables

  30. arXiv:2207.06684  [pdf, other

    cs.LG cs.AI cs.CV cs.SI stat.ML

    Subgraph Frequency Distribution Estimation using Graph Neural Networks

    Authors: Zhongren Chen, Xinyue Xu, Shengyi Jiang, Hao Wang, Lu Mi

    Abstract: Small subgraphs (graphlets) are important features to describe fundamental units of a large network. The calculation of the subgraph frequency distributions has a wide application in multiple domains including biology and engineering. Unfortunately due to the inherent complexity of this task, most of the existing methods are computationally intensive and inefficient. In this work, we propose GNNS,… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: accepted by KDD 2022 Workshop on Deep Learning on Graphs

  31. arXiv:2110.06421  [pdf, other

    cs.LG

    Revisiting Latent-Space Interpolation via a Quantitative Evaluation Framework

    Authors: Lu Mi, Tianxing He, Core Francisco Park, Hao Wang, Yue Wang, Nir Shavit

    Abstract: Latent-space interpolation is commonly used to demonstrate the generalization ability of deep latent variable models. Various algorithms have been proposed to calculate the best trajectory between two encodings in the latent space. In this work, we show how data labeled with semantically continuous attributes can be utilized to conduct a quantitative evaluation of latent-space interpolation algori… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: 11 pages

  32. Predicate correlation learning for scene graph generation

    Authors: Leitian Tao, Li Mi, Nannan Li, Xianhang Cheng, Yaosi Hu, Zhenzhong Chen

    Abstract: For a typical Scene Graph Generation (SGG) method, there is often a large gap in the performance of the predicates' head classes and tail classes. This phenomenon is mainly caused by the semantic overlap between different predicates as well as the long-tailed data distribution. In this paper, a Predicate Correlation Learning (PCL) method for SGG is proposed to address the above two problems by tak… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  33. arXiv:2107.01181  [pdf, other

    cs.CV cs.AI

    Visual Relationship Forecasting in Videos

    Authors: Li Mi, Yangjun Ou, Zhenzhong Chen

    Abstract: Real-world scenarios often require the anticipation of object interactions in unknown future, which would assist the decision-making process of both humans and agents. To meet this challenge, we present a new task named Visual Relationship Forecasting (VRF) in videos to explore the prediction of visual relationships in a reasoning manner. Specifically, given a subject-object pair with H existing f… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

  34. arXiv:2106.14880  [pdf, other

    cs.CV

    HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps

    Authors: Lu Mi, Hang Zhao, Charlie Nash, Xiaohan Jin, Jiyang Gao, Chen Sun, Cordelia Schmid, Nir Shavit, Yuning Chai, Dragomir Anguelov

    Abstract: High Definition (HD) maps are maps with precise definitions of road lanes with rich semantics of the traffic rules. They are critical for several key stages in an autonomous driving system, including motion forecasting and planning. However, there are only a small amount of real-world road topologies and geometries, which significantly limits our ability to test out the self-driving stack to gener… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

  35. arXiv:2106.05563  [pdf, ps, other

    astro-ph.GA astro-ph.IM

    Identification of BASS DR3 Sources as Stars, Galaxies and Quasars by XGBoost

    Authors: Changhua Li, Yanxia Zhang, Chenzhou Cui, Dongwei Fan, Yongheng Zhao, Xue-Bing Wu, Boliang He, Yunfei Xu, Shanshan Li, Jun Han, Yihan Tao, Linying Mi, Hanxi Yang, Sisi Yang

    Abstract: The Beijing-Arizona Sky Survey (BASS) Data Release 3 (DR3) catalogue was released in 2019, which contains the data from all BASS and the Mosaic z-band Legacy Survey (MzLS) observations during 2015 January and 2019 March, about 200 million sources. We cross-match BASS DR3 with spectral databases from the Sloan Digital Sky Survey (SDSS) and the Large Sky Area Multi-object Fiber Spectroscopic Telesco… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: 15 pages, 10 tables, 6 figures, accepted for publication in Monthly Notices of the Royal Astronomical Society Main Journal

  36. arXiv:2105.09320  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Optical manipulation of Rashba-split 2-Dimensional Electron Gas

    Authors: M. Michiardi, F. Boschini, H. -H. Kung, M. X. Na, S. K. Y. Dufresne, A. Currie, G. Levy, S. Zhdanovich, A. K. Mills, D. J. Jones, J. L. Mi, B. B. Iversen, Ph. Hofmann, A. Damascelli

    Abstract: In spintronic devices, the two main approaches to actively control the electrons' spin degree of freedom involve either static magnetic or electric fields. An alternative avenue relies on the application of optical fields to generate spin currents, which promises to bolster spin-device performance allowing for significantly faster and more efficient spin logic. To date, research has mainly focused… ▽ More

    Submitted 2 June, 2022; v1 submitted 19 May, 2021; originally announced May 2021.

    Journal ref: Nature Communications 13, 3096 (2022)

  37. Learning Guided Electron Microscopy with Active Acquisition

    Authors: Lu Mi, Hao Wang, Yaron Meirovitch, Richard Schalek, Srinivas C. Turaga, Jeff W. Lichtman, Aravinthan D. T. Samuel, Nir Shavit

    Abstract: Single-beam scanning electron microscopes (SEM) are widely used to acquire massive data sets for biomedical study, material analysis, and fabrication inspection. Datasets are typically acquired with uniform acquisition: applying the electron beam with the same power and duration to all image pixels, even if there is great variety in the pixels' importance for eventual use. Many SEMs are now able t… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: MICCAI 2020

  38. GWOPS: A VO-technology Driven Tool to Search for the Electromagnetic Counterpart of Gravitational Wave Event

    Authors: Yunfei Xu, Dong Xu, Chenzhou Cui, Dongwei Fan, Zipei Zhu, Bangyao Yu, Changhua Li, Jun Han, Linying Mi, Shanshan Li, Boliang He, Yihan Tao, Hanxi Yang, Sisi Yang

    Abstract: The search and follow-up observation of electromagnetic (EM) counterparts of gravitational waves (GW) is a current hot topic of GW cosmology. Due to the limitation of the accuracy of the GW observation facility at this stage, we can only get a rough sky-localization region for the GW event, and the typical area of the region is between 200 and 1500 square degrees. Since GW events occur in or near… ▽ More

    Submitted 9 September, 2020; v1 submitted 7 September, 2020; originally announced September 2020.

    Comments: 12 pages, 8 figures, published by Publications of the Astronomical Society of the Pacific

    Journal ref: Publications of the Astronomical Society of the Pacific, 132(1016), 104501 (2020)

  39. arXiv:2005.10501  [pdf, other

    astro-ph.IM

    Towards an Astronomical Science Platform: Experiences and Lessons Learned from Chinese Virtual Observatory

    Authors: Chenzhou Cui, Yihan Tao, Changhua Li, Dongwei Fan, Jian Xiao, Boliang He, Shanshan Li, Ce Yu, Linying Mi, Yunfei Xu, Jun Han, Sisi Yang, Yongheng Zhao, Yanjie Xue, Jinxin Hao, Liang Liu, Xiao Chen, Junyi Chen, Hailong Zhang

    Abstract: In the era of big data astronomy, next generation telescopes and large sky surveys produce data sets at the TB or even PB level. Due to their large data volumes, these astronomical data sets are extremely difficult to transfer and analyze using personal computers or small clusters. In order to offer better access to data, data centers now generally provide online science platforms that enable anal… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: 26 pages,5 figure,accepted for publication in Astronomy and Computing

  40. arXiv:2003.02029  [pdf, ps, other

    astro-ph.IM

    IVOA HiPS Implementation in the Framework of WorldWide Telescope

    Authors: Yunfei Xu, Chenzhou Cui, Dongwei Fan, Shanshan Li, Changhua Li, Jun Han, Linying Mi, Boliang He, Hanxi Yang, Yihan Tao, Sisi Yang, Lan He

    Abstract: The WorldWide Telescope(WWT) is a scientific visualization platform which can browse deep space images, star catalogs, and planetary remote sensing data from different observation facilities in a three-dimensional virtual scene. First launched and then open-sourced by Microsoft Research, the WWT is now managed by the American Astronomical Society (AAS). Hierarchical Progressive Survey (HiPS) is an… ▽ More

    Submitted 4 March, 2020; originally announced March 2020.

    Comments: 22 pages, 15 figures

  41. arXiv:2002.10543  [pdf, other

    cs.LG stat.ML

    Variational Wasserstein Barycenters for Geometric Clustering

    Authors: Liang Mi

    Abstract: We propose to compute Wasserstein barycenters (WBs) by solving for Monge maps with variational principle. We discuss the metric properties of WBs and explore their connections, especially the connections of Monge WBs, to K-means clustering and co-clustering. We also discuss the feasibility of Monge WBs on unbalanced measures and spherical domains. We propose two new problems -- regularized K-means… ▽ More

    Submitted 29 March, 2023; v1 submitted 24 February, 2020; originally announced February 2020.

  42. arXiv:2001.11114  [pdf, other

    cs.LG cs.DM math.FA stat.ML

    A Family of Pairwise Multi-Marginal Optimal Transports that Define a Generalized Metric

    Authors: Liang Mi, Azadeh Sheikholeslami, José Bento

    Abstract: The Optimal transport (OT) problem is rapidly finding its way into machine learning. Favoring its use are its metric properties. Many problems admit solutions with guarantees only for objects embedded in metric spaces, and the use of non-metrics can complicate solving them. Multi-marginal OT (MMOT) generalizes OT to simultaneously transporting multiple distributions. It captures important relation… ▽ More

    Submitted 22 December, 2022; v1 submitted 29 January, 2020; originally announced January 2020.

    Comments: Machine Learning (2022)

  43. arXiv:1910.04858  [pdf, other

    cs.CV cs.LG

    Training-Free Uncertainty Estimation for Dense Regression: Sensitivity as a Surrogate

    Authors: Lu Mi, Hao Wang, Yonglong Tian, Hao He, Nir Shavit

    Abstract: Uncertainty estimation is an essential step in the evaluation of the robustness for deep learning models in computer vision, especially when applied in risk-sensitive areas. However, most state-of-the-art deep learning models either fail to obtain uncertainty estimation or need significant modification (e.g., formulating a proper Bayesian treatment) to obtain it. Most previous methods are not able… ▽ More

    Submitted 10 January, 2022; v1 submitted 27 September, 2019; originally announced October 2019.

    Comments: In proceedings of the 36th AAAI Conference on Artificial Intelligence

  44. arXiv:1907.12188   

    cs.HC

    Hand-Gesture-Recognition Based Text Input Method for AR/VR Wearable Devices

    Authors: Nizamuddin Maitlo, Yanbo Wang, Chao Ping Chen, Lantian Mi, Wenbo Zhang

    Abstract: Static and dynamic hand movements are basic way for human-machine interactions. To recognize and classify these movements, first these movements are captured by the cameras mounted on the augmented reality (AR) or virtual reality (VR) wearable devices. The hand is segmented using segmentation method and its gestures are passed to hand gesture recognition algorithm, which depends on depth-wise sepa… ▽ More

    Submitted 2 April, 2020; v1 submitted 28 July, 2019; originally announced July 2019.

    Comments: Information is not correct need to rewrite

  45. arXiv:1903.00127  [pdf, ps, other

    math.DS

    On the existence of full dimensional KAM torus for nonlinear Schrödinger equation

    Authors: Hongzi Cong, Lufang Mi, Yunfeng Shi, Yuan Wu

    Abstract: In this paper, we study the following nonlinear Schrödinger equation \begin{eqnarray}\label{maineq0} \textbf{i}u_{t}-u_{xx}+V*u+εf(x)|u|^4u=0,\ x\in\mathbb{T}=\mathbb{R}/2π\mathbb{Z}, \end{eqnarray} where $V*$ is the Fourier multiplier defined by $\widehat{(V* u})_n=V_{n}\widehat{u}_n, V_n\in[-1,1]$ and $f(x)$ is Gevrey smooth. It is shown that for $0\leq|ε|\ll1$, there is some… ▽ More

    Submitted 28 February, 2019; originally announced March 2019.

  46. arXiv:1812.05676  [pdf, other

    cs.LG stat.ML

    A Probe Towards Understanding GAN and VAE Models

    Authors: Lu Mi, Macheng Shen, Jingzhao Zhang

    Abstract: This project report compares some known GAN and VAE models proposed prior to 2017. There has been significant progress after we finished this report. We upload this report as an introduction to generative models and provide some personal interpretations supported by empirical evidence. Both generative adversarial network models and variational autoencoders have been widely used to approximate prob… ▽ More

    Submitted 17 December, 2018; v1 submitted 13 December, 2018; originally announced December 2018.

    Comments: 9 pages, 8 figures

  47. arXiv:1812.01157  [pdf, other

    cs.CV

    Cross-Classification Clustering: An Efficient Multi-Object Tracking Technique for 3-D Instance Segmentation in Connectomics

    Authors: Yaron Meirovitch, Lu Mi, Hayk Saribekyan, Alexander Matveev, David Rolnick, Nir Shavit

    Abstract: Pixel-accurate tracking of objects is a key element in many computer vision applications, often solved by iterated individual object tracking or instance segmentation followed by object matching. Here we introduce cross-classification clustering (3C), a technique that simultaneously tracks complex, interrelated objects in an image stack. The key idea in cross-classification is to efficiently turn… ▽ More

    Submitted 15 June, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: 11 figures

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 8425-8435

  48. arXiv:1812.00338  [pdf, other

    cs.LG stat.ML

    Regularized Wasserstein Means for Aligning Distributional Data

    Authors: Liang Mi, Wen Zhang, Yalin Wang

    Abstract: We propose to align distributional data from the perspective of Wasserstein means. We raise the problem of regularizing Wasserstein means and propose several terms tailored to tackle different problems. Our formulation is based on the variational transportation to distribute a sparse discrete measure into the target domain. The resulting sparse representation well captures the desired property of… ▽ More

    Submitted 20 February, 2020; v1 submitted 2 December, 2018; originally announced December 2018.

  49. arXiv:1806.09045  [pdf, other

    cs.CV

    Variational Wasserstein Clustering

    Authors: Liang Mi, Wen Zhang, Xianfeng Gu, Yalin Wang

    Abstract: We propose a new clustering method based on optimal transportation. We solve optimal transportation with variational principles, and investigate the use of power diagrams as transportation plans for aggregating arbitrary domains into a fixed number of clusters. We iteratively drive centroids through target domains while maintaining the minimum clustering energy by adjusting the power diagrams. Thu… ▽ More

    Submitted 26 July, 2018; v1 submitted 23 June, 2018; originally announced June 2018.

    Comments: Accepted to ECCV 2018

  50. arXiv:1801.07548  [pdf, ps, other

    cs.DC astro-ph.IM

    A hybrid architecture for astronomical computing

    Authors: Changhua Li, Chenzhou Cui, Boliang He, Dongwei Fan, Linying Mi, Shanshan Li, Sisi Yang, Yunfei Xu, Jun Han, Junyi Chen, Hailong Zhang, Ce Yu, Jian Xiao, Chuanjun Wang, Zihuang Cao, Yufeng Fan, Liang Liu, Xiao Chen, Wenming Song, Kangyu Du

    Abstract: With many large science equipment constructing and putting into use, astronomy has stepped into the big data era. The new method and infrastructure of big data processing has become a new requirement of many astronomers. Cloud computing, Map/Reduce, Hadoop, Spark, etc. many new technology has sprung up in recent years. Comparing to the high performance computing(HPC), Data is the center of these n… ▽ More

    Submitted 18 January, 2018; originally announced January 2018.

    Comments: 4 pages, 2 figures, ADASS XXVI conference