Showing 1–50 of 174 results for author: Shan, L

  1. arXiv:2506.01277  [pdf, ps, other]

    cs.AI

    GeoLocSFT: Efficient Visual Geolocation via Supervised Fine-Tuning of Multimodal Foundation Models

    Authors: Qiang Yi, Lianlei Shan

    Abstract: Accurately determining the geographic location where a single image was taken, visual geolocation, remains a formidable challenge due to the planet's vastness and the deceptive similarity among distant locations. We introduce GeoLocSFT, a framework that demonstrates how targeted supervised fine-tuning (SFT) of a large multimodal foundation model (Gemma 3) using a small, high-quality dataset can yi…

    Submitted 1 June, 2025; originally announced June 2025.

    Comments: 29 pages, 14 figures

  2. arXiv:2506.00479  [pdf, ps, other]

    cs.CL cs.CV cs.LG

    EffiVLM-BENCH: A Comprehensive Benchmark for Evaluating Training-Free Acceleration in Large Vision-Language Models

    Authors: Zekun Wang, Minghua Ma, Zexin Wang, Rongchuan Mu, Liping Shan, Ming Liu, Bing Qin

    Abstract: Large Vision-Language Models (LVLMs) have achieved remarkable success, yet their significant computational demands hinder practical deployment. While efforts to improve LVLM efficiency are growing, existing methods lack comprehensive evaluation across diverse backbones, benchmarks, and metrics. In this work, we systematically evaluate mainstream acceleration techniques for LVLMs, categorized into…

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: ACL 2025

  3. arXiv:2505.17653  [pdf, ps, other]

    cs.AI

    GeoGramBench: Benchmarking the Geometric Program Reasoning in Modern LLMs

    Authors: Shixian Luo, Zezhou Zhu, Yu Yuan, Yuncheng Yang, Lianlei Shan, Yong Wu

    Abstract: Geometric spatial reasoning forms the foundation of many applications in artificial intelligence, yet the ability of large language models (LLMs) to operate over geometric spatial information expressed in procedural code remains underexplored. In this paper, we address this gap by formalizing the Program-to-Geometry task, which challenges models to translate programmatic drawing code into accurate…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 23 pages, 13 figures

  4. arXiv:2505.13306  [pdf, other]

    cs.CV cs.IR

    GMM-Based Comprehensive Feature Extraction and Relative Distance Preservation For Few-Shot Cross-Modal Retrieval

    Authors: Chengsong Sun, Weiping Li, Xiang Li, Yuankun Liu, Lianlei Shan

    Abstract: Few-shot cross-modal retrieval focuses on learning cross-modal representations with limited training samples, enabling the model to handle unseen classes during inference. Unlike traditional cross-modal retrieval tasks, which assume that both training and testing data share the same class distribution, few-shot retrieval involves data with sparse representations across modalities. Existing methods…

    Submitted 19 May, 2025; originally announced May 2025.

  5. arXiv:2505.12396  [pdf, ps, other]

    cs.IR

    LLM-CoT Enhanced Graph Neural Recommendation with Harmonized Group Policy Optimization

    Authors: Hailong Luo, Bin Wu, Hongyong Jia, Qingqing Zhu, Lianlei Shan

    Abstract: Graph neural networks (GNNs) have advanced recommender systems by modeling interaction relationships. However, existing graph-based recommenders rely on sparse ID features and do not fully exploit textual information, resulting in low information density within representations. Furthermore, graph contrastive learning faces challenges. Random negative sampling can introduce false negative samples,…

    Submitted 18 May, 2025; originally announced May 2025.

  6. arXiv:2504.03071  [pdf, other]

    cs.CL cs.AI

    AD-GPT: Large Language Models in Alzheimer's Disease

    Authors: Ziyu Liu, Lintao Tang, Zeliang Sun, Zhengliang Liu, Yanjun Lyu, Wei Ruan, Yangshuang Xu, Liang Shan, Jiyoon Shin, Xiaohe Chen, Dajiang Zhu, Tianming Liu, Rongjie Liu, Chao Huang

    Abstract: Large language models (LLMs) have emerged as powerful tools for medical information retrieval, yet their accuracy and depth remain limited in specialized domains such as Alzheimer's disease (AD), a growing global health challenge. To address this gap, we introduce AD-GPT, a domain-specific generative pre-trained transformer designed to enhance the retrieval and analysis of AD-related genetic and n…

    Submitted 3 April, 2025; originally announced April 2025.

  7. arXiv:2504.02723  [pdf, other]

    cs.DS cs.LG math.ST stat.ML

    Computing High-dimensional Confidence Sets for Arbitrary Distributions

    Authors: Chao Gao, Liren Shan, Vaidehi Srinivas, Aravindan Vijayaraghavan

    Abstract: We study the problem of learning a high-density region of an arbitrary distribution over $\mathbb{R}^d$. Given a target coverage parameter $δ$, and sample access to an arbitrary distribution $D$, we want to output a confidence set $S \subset \mathbb{R}^d$ such that $S$ achieves $δ$ coverage of $D$, i.e., $\mathbb{P}_{y \sim D} \left[ y \in S \right] \ge δ$, and the volume of $S$ is as small as pos…

    Submitted 12 May, 2025; v1 submitted 3 April, 2025; originally announced April 2025.

    Comments: Improves volume approximation factor from $\exp(\tilde{O}(d^{2/3}))$ to $\exp(\tilde{O}(d^{1/2}))$, along with other minor edits. To appear in COLT 2025

  8. arXiv:2504.02441  [pdf, other]

    cs.CL cs.AI

    Cognitive Memory in Large Language Models

    Authors: Lianlei Shan, Shixian Luo, Zezhou Zhu, Yu Yuan, Yong Wu

    Abstract: This paper examines memory mechanisms in Large Language Models (LLMs), emphasizing their importance for context-rich responses, reduced hallucinations, and improved efficiency. It categorizes memory into sensory, short-term, and long-term, with sensory memory corresponding to input prompts, short-term memory processing immediate context, and long-term memory implemented via external databases or s…

    Submitted 23 April, 2025; v1 submitted 3 April, 2025; originally announced April 2025.

    Comments: 37 pages, 9 figures

  9. arXiv:2503.11640  [pdf, other]

    physics.optics cs.AI

    Enhancing Deep Learning Based Structured Illumination Microscopy Reconstruction with Light Field Awareness

    Authors: Long-Kun Shan, Ze-Hao Wang, Tong-Tian Weng, Xiang-Dong Chen, Fang-Wen Sun

    Abstract: Structured illumination microscopy (SIM) is a pivotal technique for dynamic subcellular imaging in live cells. Conventional SIM reconstruction algorithms depend on accurately estimating the illumination pattern and can introduce artefacts when this estimation is imprecise. Although recent deep learning-based SIM reconstruction methods have improved speed, accuracy, and robustness, they often strug…

    Submitted 14 March, 2025; originally announced March 2025.

  10. arXiv:2503.11265  [pdf, other]

    cs.CV

    DynRsl-VLM: Enhancing Autonomous Driving Perception with Dynamic Resolution Vision-Language Models

    Authors: Xirui Zhou, Lianlei Shan, Xiaolin Gui

    Abstract: Visual Question Answering (VQA) models, which fall under the category of vision-language models, conventionally execute multiple downsampling processes on image inputs to strike a balance between computational efficiency and model performance. Although this approach aids in concentrating on salient features and diminishing computational burden, it incurs the loss of vital detailed information, a d…

    Submitted 14 March, 2025; originally announced March 2025.

  11. arXiv:2503.07209  [pdf, other]

    cs.CV cs.LG

    Synthetic Lung X-ray Generation through Cross-Attention and Affinity Transformation

    Authors: Ruochen Pi, Lianlei Shan

    Abstract: Collecting and annotating medical images is a time-consuming and resource-intensive task. However, generating synthetic data through models such as Diffusion offers a cost-effective alternative. This paper introduces a new method for the automatic generation of accurate semantic masks from synthetic lung X-ray images based on a stable diffusion model trained on text-image pairs. This method uses c…

    Submitted 10 March, 2025; originally announced March 2025.

  12. arXiv:2502.20733  [pdf]

    cond-mat.supr-con cond-mat.str-el

    Symmetry-Broken Kondo Screening and Zero-Energy Mode in the Kagome Superconductor CsV3Sb5

    Authors: Yubing Tu, Zongyuan Zhang, Wenjian Lu, Tao Han, Run Lv, Zhuying Wang, Zekun Zhou, Xinyuan Hou, Ning Hao, Zhenyu Wang, Xianhui Chen, Lei Shan

    Abstract: The quantum states of matter reorganize themselves in response to defects, giving rise to emergent local excitations that imprint unique characteristics of the host states. While magnetic impurities are known to generate Kondo screening in a Fermi liquid and Yu-Shiba-Rusinov (YSR) states in a conventional superconductor, it remains unclear whether they can evoke distinct phenomena in the kagome su…

    Submitted 28 February, 2025; originally announced February 2025.

    Comments: 19 pages, 4 figures

  13. arXiv:2502.20493  [pdf]

    cs.LG cs.AI

    Unified Kernel-Segregated Transpose Convolution Operation

    Authors: Vijay Srinivas Tida, Md Imran Hossen, Liqun Shan, Sai Venkatesh Chilukoti, Sonya Hsu, Xiali Hei

    Abstract: The optimization of the transpose convolution layer for deep learning applications is achieved with the kernel segregation mechanism. However, kernel segregation has disadvantages, such as computing extra elements to obtain the output feature map with odd dimensions while launching a thread. To mitigate this problem, we introduce a unified kernel segregation approach that limits the usage of memor…

    Submitted 27 February, 2025; originally announced February 2025.

  14. arXiv:2502.16658  [pdf, other]

    cs.LG stat.ML

    Volume Optimality in Conformal Prediction with Structured Prediction Sets

    Authors: Chao Gao, Liren Shan, Vaidehi Srinivas, Aravindan Vijayaraghavan

    Abstract: Conformal Prediction is a widely studied technique to construct prediction sets of future observations. Most conformal prediction methods focus on achieving the necessary coverage guarantees, but do not provide formal guarantees on the size (volume) of the prediction sets. We first prove an impossibility of volume optimality where any distribution-free method can only find a trivial solution. We t…

    Submitted 23 February, 2025; originally announced February 2025.

    Comments: 41 pages, 19 figures, 2 tables

  15. arXiv:2502.16352  [pdf, ps, other]

    cs.LG cs.CR cs.CY cs.DS

    Verifying Classification with Limited Disclosure

    Authors: Siddharth Bhandari, Liren Shan

    Abstract: We consider the multi-party classification problem introduced by Dong, Hartline, and Vijayaraghavan (2022) motivated by electronic discovery. In this problem, our goal is to design a protocol that guarantees the requesting party receives nearly all responsive documents while minimizing the disclosure of nonresponsive documents. We develop verification protocols that certify the correctness of a cl…

    Submitted 22 February, 2025; originally announced February 2025.

    Comments: 18 pages, 0 figures

  16. arXiv:2502.12264  [pdf, ps, other]

    econ.TH cs.CY cs.GT cs.LG

    Multi-dimensional Test Design

    Authors: Xiaoyun Qiu, Liren Shan

    Abstract: How should one jointly design tests and the arrangement of agencies to administer these tests (testing procedure)? To answer this question, we analyze a model where a principal must use multiple tests to screen an agent with a multi-dimensional type, knowing that the agent can change his type at a cost. We identify a new tradeoff between setting difficult tests and using a difficult testing proced…

    Submitted 17 February, 2025; originally announced February 2025.

  17. arXiv:2502.11329  [pdf, other]

    cs.CV

    Differentially private fine-tuned NF-Net to predict GI cancer type

    Authors: Sai Venkatesh Chilukoti, Imran Hossen Md, Liqun Shan, Vijay Srinivas Tida, Xiali Hei

    Abstract: Based on global genomic status, the cancer tumor is classified as Microsatellite Instable (MSI) and Microsatellite Stable (MSS). Immunotherapy is used to diagnose MSI, whereas radiation and chemotherapy are used for MSS. Therefore, it is significant to classify a gastro-intestinal (GI) cancer tumor into MSI vs. MSS to provide appropriate treatment. The existing literature showed that deep learning…

    Submitted 16 February, 2025; originally announced February 2025.

    Comments: 10 pages, 8 tables, 2 figures

  18. arXiv:2502.08003  [pdf, other]

    cs.LG

    Heterogeneous Multi-agent Multi-armed Bandits on Stochastic Block Models

    Authors: Mengfan Xu, Liren Shan, Fatemeh Ghaffari, Xuchuang Wang, Xutong Liu, Mohammad Hajiesmaili

    Abstract: We study a novel heterogeneous multi-agent multi-armed bandit problem with a cluster structure induced by stochastic block models, influencing not only graph topology, but also reward heterogeneity. Specifically, agents are distributed on random graphs based on stochastic block models - a generalized Erdos-Renyi model with heterogeneous edge probabilities: agents are grouped into clusters (known o…

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: 55 pages

  19. arXiv:2411.11008  [pdf, other]

    physics.plasm-ph

    Structure of weakly collisional shock waves of multicomponent plasmas inside hohlraums of indirect inertial confinement fusions

    Authors: Tianyi Liang, Dong Wu, Lifeng Wang, Lianqiang Shan, Zongqiang Yuan, Hongbo Cai, Yuqiu Gu, Zhengmao Sheng, Xiantu He

    Abstract: In laser-driven indirect inertial confinement fusion (ICF), a hohlraum--a cavity constructed from high-Z materials--serves the purpose of converting laser energy into thermal x-ray energy. This process involves the interaction of low-density ablated plasmas, which can give rise to weakly collisional shock waves characterized by a Knudsen number $K_n$ on the order of 1. The Knudsen number serves as…

    Submitted 17 November, 2024; originally announced November 2024.

  20. arXiv:2410.23829  [pdf]

    physics.acc-ph hep-ex

    First Proof of Principle Experiment for Muon Production with Ultrashort High Intensity Laser

    Authors: Feng Zhang, Li Deng, Yanjie Ge, Jiaxing Wen, Bo Cui, Ke Feng, Hao Wang, Chen Wu, Ziwen Pan, Hongjie Liu, Zhigang Deng, Zongxin Zhang, Liangwen Chen, Duo Yan, Lianqiang Shan, Zongqiang Yuan, Chao Tian, Jiayi Qian, Jiacheng Zhu, Yi Xu, Yuhong Yu, Xueheng Zhang, Lei Yang, Weimin Zhou, Yuqiu Gu, et al. (4 additional authors not shown)

    Abstract: Muons, which play a crucial role in both fundamental and applied physics, have traditionally been generated through proton accelerators or from cosmic rays. With the advent of ultra-short high-intensity lasers capable of accelerating electrons to GeV levels, it has become possible to generate muons in laser laboratories. In this work, we show the first proof of principle experiment for novel muon…

    Submitted 31 October, 2024; originally announced October 2024.

    Journal ref: Nature Physics, 2025

  21. arXiv:2410.09435  [pdf, ps, other]

    econ.TH math.DS

    On the Oscillations in Cournot Games with Best Response Strategies

    Authors: Zhengyang Liu, Haolin Lu, Liang Shan, Zihe Wang

    Abstract: In this paper, we consider the dynamic oscillation in the Cournot oligopoly model, which involves multiple firms producing homogeneous products. To explore the oscillation under the updates of best response strategies, we focus on the linear price functions. In this setting, we establish the existence of oscillations. In particular, we show that for the scenario of different costs among firms, the…

    Submitted 6 May, 2025; v1 submitted 12 October, 2024; originally announced October 2024.

  22. arXiv:2409.13199  [pdf, other]

    cs.CL

    CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation Information

    Authors: Yuxin Wang, Minghua Ma, Zekun Wang, Jingchang Chen, Huiming Fan, Liping Shan, Qing Yang, Dongliang Xu, Ming Liu, Bing Qin

    Abstract: The colossal parameters and computational overhead of Large Language Models (LLMs) challenge their real-world applications. Network pruning, which targets unstructured or structured sparsity by removing redundant parameters, has recently been explored for LLM acceleration. Existing LLM pruning works focus on unstructured pruning, which typically requires special hardware support for a practical sp…

    Submitted 9 December, 2024; v1 submitted 20 September, 2024; originally announced September 2024.

    Comments: Proc. The 31st International Conference on Computational Linguistics (COLING2025)

  23. arXiv:2408.07366  [pdf, other]

    hep-ph

    Probing a light long-lived pseudo-scalar from Higgs decay via displaced taus at the LHC

    Authors: Lianyou Shan, Lei Wang, Jin Min Yang, Rui Zhu

    Abstract: A light (GeV mass) long-lived ($cτ$ around dozens of millimeters) CP-odd scalar can be readily predicted in new physics models. In this work we investigate the Higgs decay into such a light scalar plus a $Z$-boson and take the aligned two-Higgs-doublet model (2HDM) as an example. This light long-lived scalar, with the dominant decay to tau leptons, will fly over a distance from the production poin…

    Submitted 3 March, 2025; v1 submitted 14 August, 2024; originally announced August 2024.

    Comments: 18 pages, 7 figures

    Journal ref: JHEP 10 (2024) 193

  24. arXiv:2408.04963  [pdf, other]

    cs.LG

    LiD-FL: Towards List-Decodable Federated Learning

    Authors: Hong Liu, Liren Shan, Han Bao, Ronghui You, Yuhao Yi, Jiancheng Lv

    Abstract: Federated learning is often used in environments with many unverified participants. Therefore, federated learning under adversarial attacks receives significant attention. This paper proposes an algorithmic framework for list-decodable federated learning, where a central server maintains a list of models, with at least one guaranteed to perform well. The framework has no strict restriction on the…

    Submitted 26 February, 2025; v1 submitted 9 August, 2024; originally announced August 2024.

    Comments: 26 pages, 5 figures

  25. arXiv:2405.19568  [pdf, other]

    cs.CV

    Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation

    Authors: Lianlei Shan, Wenzhang Zhou, Wei Li, Xingyu Ding

    Abstract: The goal of incremental Few-shot Semantic Segmentation (iFSS) is to extend pre-trained segmentation models to new classes via few annotated images without access to old training data. During incrementally learning novel classes, the data distribution of old classes will be destroyed, leading to catastrophic forgetting. Meanwhile, the novel classes have only few samples, making models impossible to…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 10 pages, 5 figures

  26. arXiv:2405.18663  [pdf, other]

    cs.AI

    Lifelong Learning and Selective Forgetting via Contrastive Strategy

    Authors: Lianlei Shan, Wenzhang Zhou, Wei Li, Xingyu Ding

    Abstract: Lifelong learning aims to train a model with good performance for new tasks while retaining the capacity of previous tasks. However, some practical scenarios require the system to forget undesirable knowledge due to privacy issues, which is called selective forgetting. The joint task of the two is dubbed Learning with Selective Forgetting (LSF). In this paper, we propose a new framework based on c…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 10 pages, 5 figures

  27. arXiv:2405.18078  [pdf, other]

    cs.CV

    Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images

    Authors: Lianlei Shan, Weiqiang Wang, Ke Lv, Bin Luo

    Abstract: Semantic segmentation requires pixel-level annotation, which is time-consuming. Active Learning (AL) is a promising method for reducing data annotation costs. Due to the gap between aerial and natural images, the previous AL methods are not ideal, mainly caused by unreasonable labeling units and the neglect of class imbalance. Previous labeling units are based on images or regions, which does not…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  28. arXiv:2405.17776  [pdf, other]

    cs.LG

    The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention

    Authors: Xingyu Ding, Lianlei Shan, Guiqin Zhao, Meiqi Wu, Wenzhang Zhou, Wei Li

    Abstract: Deep learning-based information processing consumes long time and requires huge computing resources, especially for dense prediction tasks which require an output for each pixel, like semantic segmentation and salient object detection. There are mainly two challenges for quantization of dense prediction tasks. Firstly, directly applying the upsampling operation that dense prediction tasks require…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 30 pages, 6 figures

  29. arXiv:2404.18567  [pdf, other]

    cs.CR

    Double Backdoored: Converting Code Large Language Model Backdoors to Traditional Malware via Adversarial Instruction Tuning Attacks

    Authors: Md Imran Hossen, Sai Venkatesh Chilukoti, Liqun Shan, Sheng Chen, Yinzhi Cao, Xiali Hei

    Abstract: Instruction-tuned Large Language Models designed for coding tasks are increasingly employed as AI coding assistants. However, the cybersecurity vulnerabilities and implications arising from the widespread integration of these models are not yet fully understood due to limited research in this domain. This work investigates novel techniques for transitioning backdoors from the AI/ML domain to tradi…

    Submitted 6 March, 2025; v1 submitted 29 April, 2024; originally announced April 2024.

  30. arXiv:2403.07260  [pdf, other]

    cs.CL

    LaERC-S: Improving LLM-based Emotion Recognition in Conversation with Speaker Characteristics

    Authors: Yumeng Fu, Junjie Wu, Zhongjie Wang, Meishan Zhang, Lili Shan, Yulin Wu, Bingquan Li

    Abstract: Emotion recognition in conversation (ERC), the task of discerning human emotions for each utterance within a conversation, has garnered significant attention in human-computer interaction systems. Previous ERC studies focus on speaker-specific information that predominantly stems from relationships among utterances, which lacks sufficient information around conversations. Recent research in ERC ha…

    Submitted 3 March, 2025; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: COLING 2025

  31. arXiv:2402.14540  [pdf, other]

    cs.GT

    On Truthful Item-Acquiring Mechanisms for Reward Maximization

    Authors: Liang Shan, Shuo Zhang, Jie Zhang, Zihe Wang

    Abstract: In this research, we study the problem that a collector acquires items from the owner based on the item qualities the owner declares and an independent appraiser's assessments. The owner is interested in maximizing the probability that the collector acquires the items and is the only one who knows the items' factual quality. The appraiser performs her duties with impartiality, but her assessment m…

    Submitted 22 February, 2024; originally announced February 2024.

  32. arXiv:2402.11769  [pdf, other]

    eess.SY cs.GT math.OC

    Connection-Aware P2P Trading: Simultaneous Trading and Peer Selection

    Authors: Cheng Feng, Kedi Zheng, Lanqing Shan, Hani Alers, Qixin Chen, Lampros Stergioulas, Hongye Guo

    Abstract: Peer-to-peer (P2P) trading is seen as a viable solution to handle the growing number of distributed energy resources in distribution networks. However, when dealing with large-scale consumers, there are several challenges that must be addressed. One of these challenges is limited communication capabilities. Additionally, prosumers may have specific preferences when it comes to trading. Both can re…

    Submitted 28 October, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Paper accepted for Applied Energy. Personal use of this material is permitted. Permission from Elsevier must be obtained for all other uses

    Journal ref: Applied Energy, Volume 377, Part D, 2025, 124658, ISSN 0306-2619

  33. arXiv:2401.17952  [pdf, ps, other]

    cs.CY cs.DS cs.IR

    Error-Tolerant E-Discovery Protocols

    Authors: Jinshuo Dong, Jason D. Hartline, Liren Shan, Aravindan Vijayaraghavan

    Abstract: We consider the multi-party classification problem introduced by Dong, Hartline, and Vijayaraghavan (2022) in the context of electronic discovery (e-discovery). Based on a request for production from the requesting party, the responding party is required to provide documents that are responsive to the request except for those that are legally privileged. Our goal is to find a protocol that verifie…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 28 pages, 6 figures, CSLAW 2024

  34. arXiv:2401.00973  [pdf, other]

    cs.LG cs.CR

    Facebook Report on Privacy of fNIRS data

    Authors: Md Imran Hossen, Sai Venkatesh Chilukoti, Liqun Shan, Vijay Srinivas Tida, Xiali Hei

    Abstract: The primary goal of this project is to develop privacy-preserving machine learning model training techniques for fNIRS data. This project will build a local model in a centralized setting with both differential privacy (DP) and certified robustness. It will also explore collaborative federated learning to train a shared model between multiple clients without sharing local fNIRS datasets. To preven…

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: 15 pages, 5 figures, 3 tables

    MSC Class: I.2.0

  35. arXiv:2312.02400  [pdf, ps, other]

    cs.LG cs.CR

    DP-SGD-Global-Adapt-V2-S: Triad Improvements of Privacy, Accuracy and Fairness via Step Decay Noise Multiplier and Step Decay Upper Clipping Threshold

    Authors: Sai Venkatesh Chilukoti, Md Imran Hossen, Liqun Shan, Vijay Srinivas Tida, Mahathir Mohammad Bappy, Wenmeng Tian, Xiai Hei

    Abstract: Differentially Private Stochastic Gradient Descent (DP-SGD) has become a widely used technique for safeguarding sensitive information in deep learning applications. Unfortunately, DPSGD's per-sample gradient clipping and uniform noise addition during training can significantly degrade model utility and fairness. We observe that the latest DP-SGD-Global-Adapt's average gradient norm is the same thr…

    Submitted 5 February, 2025; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: 34 pages single column, 10 figures, 21 tables

    MSC Class: 26; 40

  36. arXiv:2311.17450  [pdf, other]

    cs.CV

    Continual Learning for Image Segmentation with Dynamic Query

    Authors: Weijia Wu, Yuzhong Zhao, Zhuang Li, Lianlei Shan, Hong Zhou, Mike Zheng Shou

    Abstract: Image segmentation based on continual learning exhibits a critical drop of performance, mainly due to catastrophic forgetting and background shift, as they are required to incorporate new classes continually. In this paper, we propose a simple, yet effective Continual Image Segmentation method with incremental Dynamic Query (CISDQ), which decouples the representation learning of both old and new k…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Code: https://github.com/weijiawu/CisDQ

    Journal ref: TCSVT 2023

  37. arXiv:2309.11075  [pdf, other]

    physics.plasm-ph

    Large-scale Kinetic Simulations of Colliding Plasmas within a Hohlraum of Indirect Drive Inertial Confinement Fusions

    Authors: Tianyi Liang, Dong Wu, Xiaochuan Ning, Lianqiang Shan, Zongqiang Yuan, Hongbo Cai, Zhengmao Sheng, Xiantu He

    Abstract: The National Ignition Facility has recently achieved successful burning plasma and ignition using the inertial confinement fusion (ICF) approach. However, there are still many fundamental physics phenomena that are not well understood, including the kinetic processes in the hohlraum. Shan et al. [Phys. Rev. Lett, 120, 195001, 2018] utilized the energy spectra of neutrons to investigate the kinetic…

    Submitted 20 September, 2023; originally announced September 2023.

  38. A General Approach to Proving Properties of Fibonacci Representations via Automata Theory

    Authors: Jeffrey Shallit, Sonja Linghui Shan

    Abstract: We provide a method, based on automata theory, to mechanically prove the correctness of many numeration systems based on Fibonacci numbers. With it, long case-based and induction-based proofs of correctness can be replaced by simply constructing a regular expression (or finite automaton) specifying the rules for valid representations, followed by a short computation. Examples of the systems that c…

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: In Proceedings AFL 2023, arXiv:2309.01126

    Journal ref: EPTCS 386, 2023, pp. 228-242

  39. arXiv:2308.12663  [pdf, other]

    hep-ph

    Dark Matter Interactions From An Extra U(1) gauge symmetry with kinetic mixing and Higgs charge

    Authors: Lianyou Shan, Zhao-Huan Yu

    Abstract: We investigate fermionic dark matter interactions with standard model particles from an additional $\mathrm{U}(1)_\mathrm{X}$ gauge symmetry, assuming kinetic mixing between the $\mathrm{U}(1)_\mathrm{X}$ and $\mathrm{U}(1)_\mathrm{Y}$ gauge fields as well as a nonzero $\mathrm{U}(1)_\mathrm{X}$ charge of the Higgs doublet. For ensuring gauge-invariant Yukawa interactions and the cancellation of g…

    Submitted 4 September, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 21 pages, 2 figures, 3 tables; v2: references added

  40. arXiv:2308.10160  [pdf, other]

    cs.DS

    Higher-Order Cheeger Inequality for Partitioning with Buffers

    Authors: Konstantin Makarychev, Yury Makarychev, Liren Shan, Aravindan Vijayaraghavan

    Abstract: We prove a new generalization of the higher-order Cheeger inequality for partitioning with buffers. Consider a graph $G=(V,E)$. The buffered expansion of a set $S \subseteq V$ with a buffer $B \subseteq V \setminus S$ is the edge expansion of $S$ after removing all the edges from set $S$ to its buffer $B$. An $\varepsilon$-buffered $k$-partitioning is a partitioning of a graph into disjoint compon…

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 45 pages

  41. arXiv:2308.08373  [pdf, ps, other]

    cs.DS

    Approximation Algorithms for Norm Multiway Cut

    Authors: Charlie Carlson, Jafar Jafarov, Konstantin Makarychev, Yury Makarychev, Liren Shan

    Abstract: We consider variants of the classic Multiway Cut problem. Multiway Cut asks to partition a graph $G$ into $k$ parts so as to separate $k$ given terminals. Recently, Chandrasekaran and Wang (ESA 2021) introduced $\ell_p$-norm Multiway, a generalization of the problem, in which the goal is to minimize the $\ell_p$ norm of the edge boundaries of $k$ parts. We provide an…

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 25 pages, ESA 2023

  42. arXiv:2307.15128  [pdf, other]

    cs.CV

    End-to-end Remote Sensing Change Detection of Unregistered Bi-temporal Images for Natural Disasters

    Authors: Guiqin Zhao, Lianlei Shan, Weiqiang Wang

    Abstract: Change detection based on remote sensing images has been a prominent area of interest in the field of remote sensing. Deep networks have demonstrated significant success in detecting changes in bi-temporal remote sensing images and have found applications in various fields. Given the degradation of natural environments and the frequent occurrence of natural disasters, accurately and swiftly identi…

    Submitted 16 August, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

  43. arXiv:2305.15033  [pdf, other]

    cs.CL

    SmartTrim: Adaptive Tokens and Attention Pruning for Efficient Vision-Language Models

    Authors: Zekun Wang, Jingchang Chen, Wangchunshu Zhou, Haichao Zhu, Jiafeng Liang, Liping Shan, Ming Liu, Dongliang Xu, Qing Yang, Bing Qin

    Abstract: Despite achieving remarkable performance on various vision-language tasks, Transformer-based Vision-Language Models (VLMs) suffer from redundancy in inputs and parameters, significantly hampering their efficiency in real-world applications. Moreover, the degree of redundancy in token representations and model parameters, such as attention heads, varies significantly for different inputs. In light…

    Submitted 26 February, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: COLING-LREC 2024

  44. arXiv:2304.12584  [pdf, other]

    physics.optics cs.LG

    Learning imaging mechanism directly from optical microscopy observations

    Authors: Ze-Hao Wang, Long-Kun Shan, Tong-Tian Weng, Tian-Long Chen, Qi-Yu Wang, Xiang-Dong Chen, Zhang-Yang Wang, Guang-Can Guo, Fang-Wen Sun

    Abstract: Optical microscopy image plays an important role in scientific research through the direct visualization of the nanoworld, where the imaging mechanism is described as the convolution of the point spread function (PSF) and emitters. Based on a priori knowledge of the PSF or equivalent PSF, it is possible to achieve more precise exploration of the nanoworld. However, it is an outstanding challenge t…

    Submitted 25 April, 2023; originally announced April 2023.

  45. arXiv:2304.09113  [pdf, ps, other]

    cs.DS

    Random Cuts are Optimal for Explainable k-Medians

    Authors: Konstantin Makarychev, Liren Shan

    Abstract: We show that the RandomCoordinateCut algorithm gives the optimal competitive ratio for explainable k-medians in l1. The problem of explainable k-medians was introduced by Dasgupta, Frost, Moshkovitz, and Rashtchian in 2020. Several groups of authors independently proposed a simple polynomial-time randomized algorithm for the problem and showed that this algorithm is O(log k loglog k) competitive.…

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 14 pages, 2 figures

  46. Optimal Pricing Schemes for Identical Items with Time-Sensitive Buyers

    Authors: Zhengyang Liu, Liang Shan, Zihe Wang

    Abstract: Time or money? That is a question! In this paper, we consider this dilemma in the pricing regime, in which we try to find the optimal pricing scheme for identical items with heterogenous time-sensitive buyers. We characterize the revenue-optimal solution and propose an efficient algorithm to find it in a Bayesian setting. Our results also demonstrate the tight ratio between the value of wasted tim…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 11 pages, 2 figures

  47. Quantum enhanced radio detection and ranging with solid spins

    Authors: Xiang-Dong Chen, En-Hui Wang, Long-Kun Shan, Shao-Chun Zhang, Ce Feng, Yu Zheng, Yang Dong, Guang-Can Guo, Fang-Wen Sun

    Abstract: The accurate radio frequency (RF) ranging and localizing of objects has benefited the researches including autonomous driving, the Internet of Things, and manufacturing. Quantum receivers have been proposed to detect the radio signal with ability that can outperform conventional measurement. As one of the most promising candidates, solid spin shows superior robustness, high spatial resolution and…

    Submitted 2 March, 2023; originally announced March 2023.

    Journal ref: Nature Communications 14, 1288 (2023)

  48. arXiv:2302.05115  [pdf]

    cond-mat.supr-con cond-mat.str-el

    Unidirectional electron-phonon coupling as a "fingerprint" of the nematic state in a kagome superconductor

    Authors: Ping Wu, Yubing Tu, Zhuying Wang, Shuikang Yu, Hongyu Li, Wanru Ma, Zuowei Liang, Yunmei Zhang, Xuechen Zhang, Zeyu Li, Ye Yang, Zhenhua Qiao, Jianjun Ying, Tao Wu, Lei Shan, Ziji Xiang, Zhenyu Wang, Xianhui Chen

    Abstract: Electronic nematicity has been commonly observed in juxtaposition with unconventional superconductivity. Understanding the nature of the nematic state, as well as its consequence on the electronic band structure and superconductivity, has become a pivotal focus in condensed matter physics. Here we use spectroscopic imaging-scanning tunneling microscopy to visualize how the interacting quasiparticl…

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: 31 pages, 4 figures plus 10 extended data figures. A final version will appear in Nature Physics

  49. arXiv:2301.12703  [pdf, other]

    physics.optics

    Simultaneous magnetic and electric Purcell enhancement in a hybrid metal-dielectric nanostructure

    Authors: Lingxiao Shan, Qi Liu, Yun Ma, Yali Jia, Hai Lin, Guowei Lu, Qihuang Gong, Ying Gu

    Abstract: Hybrid metal-dielectric structures, which combine the advantages of both metal and dielectric materials, support high-confined but low-loss magnetic and electric resonances under deliberate arrangements. However, their potential for enhancing magnetic emission has not been explored. Here, we study the simultaneous magnetic and electric Purcell enhancement supported by a hybrid structure consisting…

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: 10 pages, 6 figures, submitted to Chin. Opt. Lett

  50. arXiv:2211.15102  [pdf]

    cond-mat.supr-con cond-mat.mtrl-sci

    Emergent superconducting fluctuations in a compressed kagome superconductor

    Authors: Xikai Wen, Fanghang Yu, Zhigang Gui, Yuqing Zhang, Xingyuan Hou, Lei Shan, Tao Wu, Ziji Xiang, Zhenyu Wang, Jianjun Ying, Xianhui Chen

    Abstract: The recent discovery of superconductivity (SC) and charge density wave (CDW) in kagome metals AV3Sb5 (A = K, Rb, Cs) provides an ideal playground for the study of emergent electronic orders. Application of moderate pressure leads to a two-dome-shaped SC phase regime in CsV3Sb5 accompanied by the destabilizing of CDW phase; such unconventional evolution of SC may involve the pressure-induced format…

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: 16 pages, 4 figures

    Journal ref: Science Bulletin 68(3), 259-265 (2023)