Skip to main content

Showing 1–50 of 103 results for author: Yīng, Y

.
  1. arXiv:2506.08091  [pdf, ps, other

    quant-ph

    On whether quantum theory needs complex numbers: the foil theories perspective

    Authors: Yìlè Yīng, Maria Ciudad Alañón, Daniel Centeno, Jacopo Surace, Marina Maciel Ansanelli, Ruizhi Liu, David Schmid, Robert W. Spekkens

    Abstract: Recent work by Renou et al. (2021) has led to some controversy concerning the question of whether quantum theory requires complex numbers for its formulation. We promote the view that the main result of that work is best understood not as a claim about the relative merits of different representations of quantum theory, but rather as a claim about the possibility of experimentally adjudicating betw… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: 6+24 pages, 9 figures. Comments welcome!

  2. arXiv:2506.00112  [pdf, ps, other

    quant-ph

    Copenhagenish interpretations of quantum mechanics

    Authors: David Schmid, Yìlè Yīng, Matthew Leifer

    Abstract: We define a class of Copenhagenish interpretations encompassing modern interpretations that follow the Copenhagen spirit. These interpretations are characterized by four postulates: Observers Observe, Universality, Anti-$ψ$-ontology, and Completeness. We explain why such interpretations are not equivalent to the textbook (or orthodox) interpretation, nor to the view that one should shut up and cal… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

    Comments: 16 pages

  3. arXiv:2505.19965  [pdf, ps, other

    cs.AI

    Adaptive Location Hierarchy Learning for Long-Tailed Mobility Prediction

    Authors: Yu Wang, Junshu Dai, Yuchen Ying, Yuxuan Liang, Tongya Zheng, Mingli Song

    Abstract: Human mobility prediction is crucial for applications ranging from location-based recommendations to urban planning, which aims to forecast users' next location visits based on historical trajectories. Despite the severe long-tailed distribution of locations, the problem of long-tailed mobility prediction remains largely underexplored. Existing long-tailed learning methods primarily focus on rebal… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  4. arXiv:2504.02944  [pdf, other

    quant-ph

    Quantifiers and witnesses for the nonclassicality of measurements and of states

    Authors: Yujie Zhang, Yìlè Yīng, David Schmid

    Abstract: In a recent work, arXiv:2503.05884, we proposed a unified notion of nonclassicality that applies to arbitrary processes in quantum theory, including individual quantum states, measurements, channels, set of these, etc. This notion is derived from the principle of generalized noncontextuality, but in a novel manner that applies to individual processes rather than full experiments or theories. Here,… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

  5. arXiv:2503.05884  [pdf, other

    quant-ph

    Reassessing the boundary between classical and nonclassical for individual quantum processes

    Authors: Yujie Zhang, David Schmid, Yìlè Yīng, Robert W. Spekkens

    Abstract: There is a received wisdom about where to draw the boundary between classical and nonclassical for various types of quantum processes. For instance, for multipartite states, it is the divide between separable and entangled, for channels, the divide between entanglement-breaking and not, for sets of measurements, the divide between compatible and incompatible, and for assemblages, the divide betwee… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

  6. arXiv:2503.00482  [pdf, other

    cs.RO eess.SY

    A Navigation System for ROV's inspection on Fish Net Cage

    Authors: Zhikang Ge, Fang Yang, Wenwu Lu, Peng Wei, Yibin Ying, Chen Peng

    Abstract: Autonomous Remotely Operated Vehicles (ROVs) offer a promising solution for automating fishnet inspection, reducing labor dependency, and improving operational efficiency. In this paper, we modify an off-the-shelf ROV, the BlueROV2, into a ROS-based framework and develop a localization module, a path planning system, and a control framework. For real-time, local localization, we employ the open-so… ▽ More

    Submitted 1 March, 2025; originally announced March 2025.

  7. arXiv:2502.19783  [pdf, other

    cond-mat.mes-hall

    Hysteretic responses of nanomechanical resonators based on crumpled few-layer graphene

    Authors: Heng Lu, Chen Yang, Ce Zhang, YuBin Zhang, FengNan Chen, Yue Ying, Zhuo-Zhi Zhang, Xiang-Xiang Song, Guang-Wei Deng, Ying Yan, Joel Moser

    Abstract: Manipulating two-dimensional materials occasionally results in crumpled membranes. Their complicated morphologies feature an abundance of folds, creases and wrinkles that make each crumpled membrane unique. Here, we prepare four nanomechanical resonators based on crumpled membranes of few-layer graphene and measure their static response and the spectrum of their dynamic response. We tune both resp… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: The following article has been submitted to Applied Physics Letters

  8. arXiv:2411.16196  [pdf, other

    cs.CV cs.LG

    Learn from Foundation Model: Fruit Detection Model without Manual Annotation

    Authors: Yanan Wang, Zhenghao Fei, Ruichen Li, Yibin Ying

    Abstract: Recent breakthroughs in large foundation models have enabled the possibility of transferring knowledge pre-trained on vast datasets to domains with limited data availability. Agriculture is one of the domains that lacks sufficient data. This study proposes a framework to train effective, domain-specific, small models from foundation models without manual annotation. Our approach begins with SDM (S… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: 17 pages, 12 figures, conference or other essential info

  9. arXiv:2411.07941  [pdf, other

    eess.IV cs.AI cs.CV

    DuoLift-GAN:Reconstructing CT from Single-view and Biplanar X-Rays with Generative Adversarial Networks

    Authors: Zhaoxi Zhang, Yueliang Ying

    Abstract: Computed tomography (CT) provides highly detailed three-dimensional (3D) medical images but is costly, time-consuming, and often inaccessible in intraoperative settings (Organization et al. 2011). Recent advancements have explored reconstructing 3D chest volumes from sparse 2D X-rays, such as single-view or orthogonal double-view images. However, current models tend to process 2D images in a plana… ▽ More

    Submitted 11 December, 2024; v1 submitted 12 November, 2024; originally announced November 2024.

    Comments: 9 pages, LaTeX; removed the superscript numbers associated with the authors' names for clarity, typos corrected

  10. arXiv:2410.21809  [pdf

    physics.optics physics.med-ph

    First-in-human spinal cord tumor imaging with fast adaptive focus tracking robotic-OCT

    Authors: Bin He, Yuzhe Ying, Yejiong Shi, Zhe Meng, Zichen Yin, Zhengyu Chen, Zhangwei Hu, Ruizhi Xue, Linkai Jing, Yang Lu, Zhenxing Sun, Weitao Man, Youtu Wu, Dan Lei, Ning Zhang, Guihuai Wang, Ping Xue

    Abstract: Current surgical procedures for spinal cord tumors lack in vivo high-resolution, high-speed multifunctional imaging systems, posing challenges for precise tumor resection and intraoperative decision-making. This study introduces the Fast Adaptive Focus Tracking Robotic Optical Coherence Tomography (FACT-ROCT) system,designed to overcome these obstacles by providing real-time, artifact-free multifu… ▽ More

    Submitted 29 October, 2024; v1 submitted 29 October, 2024; originally announced October 2024.

  11. arXiv:2410.13413  [pdf, other

    cs.CL cs.AI

    Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

    Authors: Chengyu Du, Jinyi Han, Yizhou Ying, Aili Chen, Qianyu He, Haokun Zhao, Sirui Xia, Haoran Guo, Jiaqing Liang, Zulong Chen, Liangyue Li, Yanghua Xiao

    Abstract: Recent advancements in large language models (LLMs) have demonstrated that progressive refinement, rather than providing a single answer, results in more accurate and thoughtful outputs. However, existing methods often rely heavily on supervision signals to evaluate previous responses, making it difficult to assess output quality in more open-ended scenarios effectively. Additionally, these method… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 10 pages, 4 figures

  12. arXiv:2410.09156  [pdf, other

    cs.LG stat.ML

    On Discriminative Probabilistic Modeling for Self-Supervised Representation Learning

    Authors: Bokun Wang, Yunwen Lei, Yiming Ying, Tianbao Yang

    Abstract: We study the discriminative probabilistic modeling on a continuous domain for the data prediction task of (multimodal) self-supervised representation learning. To address the challenge of computing the integral in the partition function for each anchor data, we leverage the multiple importance sampling (MIS) technique for robust Monte Carlo integration, which can recover InfoNCE-based contrastive… ▽ More

    Submitted 5 March, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: To appear in ICLR 2025

  13. arXiv:2409.16030  [pdf, other

    cs.RO

    MHRC: Closed-loop Decentralized Multi-Heterogeneous Robot Collaboration with Large Language Models

    Authors: Wenhao Yu, Jie Peng, Yueliang Ying, Sai Li, Jianmin Ji, Yanyong Zhang

    Abstract: The integration of large language models (LLMs) with robotics has significantly advanced robots' abilities in perception, cognition, and task planning. The use of natural language interfaces offers a unified approach for expressing the capability differences of heterogeneous robots, facilitating communication between them, and enabling seamless task allocation and collaboration. Currently, the uti… ▽ More

    Submitted 25 September, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

  14. arXiv:2409.07537  [pdf, other

    quant-ph

    Connecting extended Wigner's friend arguments and noncontextuality

    Authors: Laurens Walleghem, Yìlè Yīng, Rafael Wagner, David Schmid

    Abstract: The Local Friendliness argument is an extended Wigner's friend no-go theorem that provides strong constraints on the nature of reality -- stronger even than those imposed by Bell's theorem or by noncontextuality arguments. In this work, we prove a variety of connections between Local Friendliness scenarios and Kochen-Specker noncontextuality. Specifically, we first show how one can derive new Loca… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: 22 pages + appendix, 16 figures, all comments welcome! The second half of this work is a follow-up work to arXiv:2310.06976v2

  15. arXiv:2408.03109  [pdf, other

    cond-mat.quant-gas quant-ph

    Exceptional point and hysteresis trajectories in cold Rydberg atomic gases

    Authors: Jun Zhang, En-Ze Li, Ya-Jun Wang, Bang Liu, Li-Hua Zhang, Zheng-Yuan Zhang, Shi-Yao Shao, Qing Li, Han-Chao Chen, Yu Ma, Tian-Yu Han, Qi-Feng Wang, Jia-Dou Nan, Yi-Ming Ying, Dong-Yang Zhu, Bao-Sen Shi, Dong-Sheng Ding

    Abstract: The interplay between strong long-range interactions and the coherent driving contribute to the formation of complex patterns, symmetry, and novel phases of matter in many-body systems. However, long-range interactions may induce an additional dissipation channel, resulting in non-Hermitian many-body dynamics and the emergence of exceptional points in spectrum. Here, we report experimental observa… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

  16. arXiv:2407.21688  [pdf, other

    quant-ph

    Twirled worlds: symmetry-induced failures of tomographic locality

    Authors: Daniel Centeno, Marco Erba, David Schmid, John H. Selby, Robert W. Spekkens, Sina Soltani, Jacopo Surace, Alex Wilce, Yìlè Yīng

    Abstract: Tomographic locality is a principle commonly used in the program of finding axioms that pick out quantum theory within the landscape of possible theories. The principle asserts the sufficiency of local measurements for achieving a tomographic characterization of any bipartite state. In this work, we explore the meaning of the principle of tomographic locality by developing a simple scheme for gene… ▽ More

    Submitted 4 October, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

    Comments: 5+11 pages, 2 figures. Comments welcome!

  17. arXiv:2407.00164  [pdf, other

    quant-ph math-ph

    Conceptual and formal groundwork for the study of resource dependence relations

    Authors: Yìlè Yīng, Tomáš Gonda, Robert Spekkens

    Abstract: A resource theory imposes a preorder over states, with one state being above another if the first can be converted to the second by a free operation, and where the set of free operations defines the notion of resourcefulness under study. In general, the location of a state in the preorder of one resource theory can constrain its location in the preorder of a different resource theory. It follows t… ▽ More

    Submitted 12 July, 2024; v1 submitted 28 June, 2024; originally announced July 2024.

    Comments: Minor corrections. 36 + 11 pages, 19 figures. Comments welcome!

  18. arXiv:2406.03798  [pdf

    physics.med-ph

    Optical biomarker of metabolism for breast tumor diagnosis: Insights from subcellular dynamics

    Authors: Zichen Yin, Shuwei Zhang, Bin He, Houpu Yang, Zhengyu Chen, Zhangwei Hu, Yejiong Shi, Ruizhi Xue, Panqi Yang, Yuzhe Ying, Chengming Wang, Shu Wang, Ping Xue

    Abstract: Label-free metabolic dynamics contrast is highly appealing but difficult to achieve in biomedical imaging. Interference offers a highly sensitive mechanism for capturing the metabolic dynamics of the subcellular scatterers. However, traditional interference detection methods fail to isolate pure metabolic dynamics, as the dynamic signals are coupled with scatterer reflectivity and other uncontroll… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  19. Kirkwood-Dirac representations beyond quantum states (and their relation to noncontextuality)

    Authors: David Schmid, Roberto D. Baldijão, Yìlè Yīng, Rafael Wagner, John H. Selby

    Abstract: Kirkwood-Dirac representations of quantum states are increasingly finding use in many areas within quantum theory. Usually, representations of this sort are only applied to provide a representation of quantum states (as complex functions over some set). We show how standard Kirkwood-Dirac representations can be extended to a fully compositional representation of all of quantum theory (including ch… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 5 pages; comments welcome!

    Journal ref: Phys. Rev. A 110, 052206 (2024)

  20. arXiv:2404.19641  [pdf

    physics.med-ph physics.bio-ph physics.optics

    Fast and label-free 3D virtual H&E histology via active modulation-assisted dynamic full-field OCT

    Authors: Zichen Yin, Bin He, Yuzhe Ying, Shuwei Zhang, Panqi Yang, Zhengyu Chen, Zhangwei Hu, Yejiong Shi, Ruizhi Xue, Chengming Wang, Shu Wang, Guihuai Wang, Ping Xue

    Abstract: Pathological features are the gold standard for tumor diagnosis, guiding treatment and prognosis. However, standard histopathological process is labor-intensive and time-consuming, while frozen sections have lower accuracy. Dynamic full-field optical coherence tomography (D-FFOCT) offers rapid histologic information by measuring the subcellular dynamics of fresh, unprocessed tissues. However, D-FF… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  21. arXiv:2403.12538  [pdf, other

    cs.RO

    Multi-View Active Sensing for Human-Robot Interaction via Hierarchically Connected Tree

    Authors: Yuanjiong Ying, Xian Huang, Wei Dong

    Abstract: Comprehensive perception of human beings is the prerequisite to ensure the safety of human-robot interaction. Currently, prevailing visual sensing approach typically involves a single static camera, resulting in a restricted and occluded field of view. In our work, we develop an active vision system using multiple cameras to dynamically capture multi-source RGB-D data. An integrated human sensing… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  22. arXiv:2403.05761  [pdf, other

    cs.RO

    CEASE: Collision-Evaluation-based Active Sense System for Collaborative Robotic Arms

    Authors: Xian Huang, Yuanjiong Ying, Wei Dong

    Abstract: Collision detection via visual fences can significantly enhance the safety of collaborative robotic arms. Existing work typically performs such detection based on pre-deployed stationary cameras outside the robotic arm's workspace. These stationary cameras can only provide a restricted detection range and constrain the mobility of the robotic system. To cope with this issue, we propose an active s… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  23. arXiv:2402.16121  [pdf, other

    cs.CV cs.AI

    Towards Accurate Post-training Quantization for Reparameterized Models

    Authors: Luoming Zhang, Yefei He, Wen Fei, Zhenyu Lou, Weijia Wu, YangWei Ying, Hong Zhou

    Abstract: Model reparameterization is a widely accepted technique for improving inference speed without compromising performance. However, current Post-training Quantization (PTQ) methods often lead to significant accuracy degradation when applied to reparameterized models. This is primarily caused by channel-specific and sample-specific outliers, which appear only at specific samples and channels and impac… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  24. arXiv:2402.11178  [pdf, other

    cs.CL

    RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural Conversations

    Authors: Haolan Zhan, Zhuang Li, Xiaoxi Kang, Tao Feng, Yuncheng Hua, Lizhen Qu, Yi Ying, Mei Rianto Chandra, Kelly Rosalin, Jureynolds Jureynolds, Suraj Sharma, Shilin Qu, Linhao Luo, Lay-Ki Soon, Zhaleh Semnani Azad, Ingrid Zukerman, Gholamreza Haffari

    Abstract: Norm violations occur when individuals fail to conform to culturally accepted behaviors, which may lead to potential conflicts. Remediating norm violations requires social awareness and cultural sensitivity of the nuances at play. To equip interactive AI systems with a remediation ability, we offer ReNoVi - a large-scale corpus of 9,258 multi-turn dialogues annotated with social norms, as well as… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: work in progress. 15 pages, 7 figures

  25. arXiv:2310.08425  [pdf, other

    cs.LG cs.CR stat.ML

    Differentially Private Non-convex Learning for Multi-layer Neural Networks

    Authors: Hanpu Shen, Cheng-Long Wang, Zihang Xiang, Yiming Ying, Di Wang

    Abstract: This paper focuses on the problem of Differentially Private Stochastic Optimization for (multi-layer) fully connected neural networks with a single output node. In the first part, we examine cases with no hidden nodes, specifically focusing on Generalized Linear Models (GLMs). We investigate the well-specific model where the random noise possesses a zero mean, and the link function is both bounded… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  26. arXiv:2310.06976  [pdf, other

    quant-ph

    Extended Wigner's friend paradoxes do not require nonlocal correlations

    Authors: Laurens Walleghem, Rafael Wagner, Yìlè Yīng, David Schmid

    Abstract: Extended Wigner's friend no-go theorems provide a modern lens for investigating the measurement problem, by making precise the challenges that arise when one attempts to model agents as dynamical quantum systems. Most such no-go theorems studied to date, such as the Frauchiger-Renner argument and the Local Friendliness argument, are explicitly constructed using quantum correlations that violate Be… ▽ More

    Submitted 24 January, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: v2: Significant changes, authors included. 7+4 pages, 1+1 figures. Comments are welcome!

  27. Relating Wigner's Friend Scenarios to Nonclassical Causal Compatibility, Monogamy Relations, and Fine Tuning

    Authors: Yìlè Yīng, Marina Maciel Ansanelli, Andrea Di Biagio, Elie Wolfe, David Schmid, Eric Gama Cavalcanti

    Abstract: Nonclassical causal modeling was developed in order to explain violations of Bell inequalities while adhering to relativistic causal structure and faithfulness -- that is, avoiding fine-tuned causal explanations. Recently, a no-go theorem that can be viewed as being stronger than Bell's theorem has been derived, based on extensions of the Wigner's friend thought experiment: the Local Friendliness… ▽ More

    Submitted 25 September, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 16+5 pages, 9 figures. Accepted in Quantum

    Journal ref: Quantum 8, 1485 (2024)

  28. arXiv:2309.05145  [pdf, other

    cs.LG cs.AI stat.ML

    Outlier Robust Adversarial Training

    Authors: Shu Hu, Zhenhuan Yang, Xin Wang, Yiming Ying, Siwei Lyu

    Abstract: Supervised learning models are challenged by the intrinsic complexities of training data such as outliers and minority subpopulations and intentional attacks at inference time with adversarial samples. While traditional robust learning methods and the recent adversarial training approaches are designed to handle each of the two challenges, to date, no work has been done to develop models that are… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: Accepted by The 15th Asian Conference on Machine Learning (ACML 2023)

  29. arXiv:2308.16220  [pdf, other

    quant-ph

    A review and analysis of six extended Wigner's friend arguments

    Authors: David Schmid, Yìlè Yīng, Matthew Leifer

    Abstract: The Wigner's friend thought experiment was intended to illustrate the difficulty one has in describing an agent as a quantum system when that agent performs a measurement. While it does pose a challenge to the orthodox interpretation of quantum theory, most modern interpretations have no trouble in resolving the difficulty. Recently, a number of extensions of Wigner's ideas have been proposed. We… ▽ More

    Submitted 10 September, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Minor changes. 32 pages, 9 figures; comments welcome!

  30. Elastic scattering and total reaction cross sections of $^{6}$Li studied with a microscopic continuum discretized coupled channels model

    Authors: Wendi Chen, D. Y. Pang, Hairui Guo, Ye Tao, Weili Sun, Yangjun Ying

    Abstract: We present a systematic study of $^{6}$Li elastic scattering and total reaction cross sections at incident energies around the Coulomb barrier within the continuum discretized coupled-channels (CDCC) framework, where $^{6}$Li is treated in an $α$+$d$ two-body model. Collisions with $^{27}$Al, $^{64}$Zn, $^{138}$Ba and $^{208}$Pa are analyzed. The microscopic optical potentials (MOP) based on Skyrm… ▽ More

    Submitted 18 October, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: 12 pages, 16 figures

  31. arXiv:2307.03357  [pdf, ps, other

    cs.LG stat.ML

    Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms

    Authors: Ming Yang, Xiyuan Wei, Tianbao Yang, Yiming Ying

    Abstract: Many machine learning tasks can be formulated as a stochastic compositional optimization (SCO) problem such as reinforcement learning, AUC maximization, and meta-learning, where the objective function involves a nested composition associated with an expectation. While a significant amount of studies has been devoted to studying the convergence behavior of SCO algorithms, there is little work on un… ▽ More

    Submitted 21 November, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

  32. Continuum-discretized coupled-channel calculations for $^{6}$Li fusion reactions with closed channels

    Authors: Wendi Chen, D. Y. Pang, Hairui Guo, Ye Tao, Weili Sun, Yangjun Ying

    Abstract: Fusion reactions induced by the weakly bound nucleus $^{6}$Li with targets $^{28}$Si, $^{64}$Ni, $^{144}$Sm and $^{209}$Bi at energies around the Coulomb barrier are investigated within a three-body model where $^{6}$Li is described with an $α+ d$ cluster model. The total fusion (TF) cross sections are calculated with the continuum-discretized coupled-channel (CDCC) method and the complete fusion… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: 15 pages; 23 figures

    Journal ref: Physical Review C 107, 064610 (2023)

  33. arXiv:2305.20057  [pdf, other

    cs.LG

    Three-Way Trade-Off in Multi-Objective Learning: Optimization, Generalization and Conflict-Avoidance

    Authors: Lisha Chen, Heshan Fernando, Yiming Ying, Tianyi Chen

    Abstract: Multi-objective learning (MOL) problems often arise in emerging machine learning problems when there are multiple learning criteria, data modalities, or learning tasks. Different from single-objective learning, one of the critical challenges in MOL is the potential conflict among different objectives during the iterative optimization process. Recent works have developed various dynamic weighting a… ▽ More

    Submitted 5 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Journal ref: Journal of Machine Learning Research 25, no. 193 (2024): 1-53

  34. arXiv:2305.16891  [pdf, other

    cs.LG stat.ML

    Generalization Guarantees of Gradient Descent for Multi-Layer Neural Networks

    Authors: Puyu Wang, Yunwen Lei, Di Wang, Yiming Ying, Ding-Xuan Zhou

    Abstract: Recently, significant progress has been made in understanding the generalization of neural networks (NNs) trained by gradient descent (GD) using the algorithmic stability approach. However, most of the existing research has focused on one-hidden-layer NNs and has not addressed the impact of different network scaling parameters. In this paper, we greatly extend the previous work \cite{lei2022stabil… ▽ More

    Submitted 29 September, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 38 pages, 2 figures

  35. arXiv:2303.09527  [pdf, other

    cs.IR cs.CR cs.LG

    Fairness-aware Differentially Private Collaborative Filtering

    Authors: Zhenhuan Yang, Yingqiang Ge, Congzhe Su, Dingxian Wang, Xiaoting Zhao, Yiming Ying

    Abstract: Recently, there has been an increasing adoption of differential privacy guided algorithms for privacy-preserving machine learning tasks. However, the use of such algorithms comes with trade-offs in terms of algorithmic fairness, which has been widely acknowledged. Specifically, we have empirically observed that the classical collaborative filtering method, trained by differentially private stochas… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  36. arXiv:2302.12383  [pdf, ps, other

    cs.LG cs.AI

    Generalization Analysis for Contrastive Representation Learning

    Authors: Yunwen Lei, Tianbao Yang, Yiming Ying, Ding-Xuan Zhou

    Abstract: Recently, contrastive learning has found impressive success in advancing the state of the art in solving various machine learning tasks. However, the existing generalization analysis is very limited or even not meaningful. In particular, the existing generalization error bounds depend linearly on the number $k$ of negative examples while it was widely shown in practice that choosing a large $k$ is… ▽ More

    Submitted 27 February, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

  37. arXiv:2211.15785  [pdf, other

    hep-ph

    Data-driven extraction of the substructure of quark and gluon jets in proton-proton and heavy-ion collisions

    Authors: Yueyang Ying

    Abstract: The modification of quark- and gluon-initiated jets in the quark-gluon plasma produced in heavy-ion collisions is a long-standing question that has not yet received a definitive answer from experiments. In particular, the size of the modifications in the quark-gluon plasma differs between theoretical models. Therefore a fully data-driven technique is crucial for an unbiased extraction of the quark… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: MEng thesis

  38. arXiv:2211.03093  [pdf, other

    cs.RO

    SRIBO: An Efficient and Resilient Single-Range and Inertia Based Odometry for Flying Robots

    Authors: Wei Dong, Zheyuan Mei, Yuanjiong Ying, Sijia Chen, Yichen ie, Xiangyang Zhu

    Abstract: Positioning with one inertial measurement unit and one ranging sensor is commonly thought to be feasible only when trajectories are in certain patterns ensuring observability. For this reason, to pursue observable patterns, it is required either exciting the trajectory or searching key nodes in a long interval, which is commonly highly nonlinear and may also lack resilience. Therefore, such a posi… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

  39. arXiv:2210.15862  [pdf

    cond-mat.mes-hall physics.app-ph

    Sliding nanomechanical resonators

    Authors: Yue Ying, Zhuo-Zhi Zhang, Joel Moser, Zi-Jia Su, Xiang-Xiang Song, Guo-Ping Guo

    Abstract: The motion of a vibrating object is determined by the way it is held. This simple observation has long inspired string instrument makers to create new sounds by devising elegant string clamping mechanisms, whereby the distance between the clamping points is modulated as the string vibrates. At the nanoscale, the simplest way to emulate this principle would be to controllably make nanoresonators sl… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Journal ref: Nature Communications 13, 6392 (2022)

  40. arXiv:2209.09298  [pdf, ps, other

    cs.LG stat.ML

    Stability and Generalization Analysis of Gradient Methods for Shallow Neural Networks

    Authors: Yunwen Lei, Rong Jin, Yiming Ying

    Abstract: While significant theoretical progress has been achieved, unveiling the generalization mystery of overparameterized neural networks still remains largely elusive. In this paper, we study the generalization behavior of shallow neural networks (SNNs) by leveraging the concept of algorithmic stability. We consider gradient descent (GD) and stochastic gradient descent (SGD) to train SNNs, for both of… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: to appear in Neural Information Processing Systems (NeurIPS 2022)

  41. arXiv:2209.08005  [pdf, ps, other

    stat.ML cs.LG

    Stability and Generalization for Markov Chain Stochastic Gradient Methods

    Authors: Puyu Wang, Yunwen Lei, Yiming Ying, Ding-Xuan Zhou

    Abstract: Recently there is a large amount of work devoted to the study of Markov chain stochastic gradient methods (MC-SGMs) which mainly focus on their convergence analysis for solving minimization problems. In this paper, we provide a comprehensive generalization analysis of MC-SGMs for both minimization and minimax problems through the lens of algorithmic stability in the framework of statistical learni… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

  42. arXiv:2209.04188  [pdf, ps, other

    stat.ML cs.CR cs.LG

    Differentially Private Stochastic Gradient Descent with Low-Noise

    Authors: Puyu Wang, Yunwen Lei, Yiming Ying, Ding-Xuan Zhou

    Abstract: Modern machine learning algorithms aim to extract fine-grained information from data to provide accurate predictions, which often conflicts with the goal of privacy protection. This paper addresses the practical and theoretical importance of developing privacy-preserving machine learning algorithms that ensure good performance while preserving privacy. In this paper, we focus on the privacy and ut… ▽ More

    Submitted 14 July, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

  43. arXiv:2208.10451  [pdf, other

    cs.LG cs.CY stat.ML

    Minimax AUC Fairness: Efficient Algorithm with Provable Convergence

    Authors: Zhenhuan Yang, Yan Lok Ko, Kush R. Varshney, Yiming Ying

    Abstract: The use of machine learning models in consequential decision making often exacerbates societal inequity, in particular yielding disparate impact on members of marginalized groups defined by race and gender. The area under the ROC curve (AUC) is widely used to evaluate the performance of a scoring function in machine learning, but is studied in algorithmic fairness less than other performance metri… ▽ More

    Submitted 28 November, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

  44. arXiv:2204.00641  [pdf, other

    hep-ph

    Data-driven extraction of the substructure of quark and gluon jets in proton-proton and heavy-ion collisions

    Authors: Yueyang Ying, Jasmine Brewer, Yi Chen, Yen-Jie Lee

    Abstract: The different modifications of quark- and gluon-initiated jets in the quark-gluon plasma (QGP) produced in heavy-ion collisions is a long-standing question that has not yet received a definitive answer from experiments. In particular, the relative sizes of the modification of quark and gluon jets differ between theoretical models. Therefore, a fully data-driven technique is crucial for an unbiased… ▽ More

    Submitted 31 January, 2023; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: added section on smearing due to backgrounds

  45. arXiv:2203.15046  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    AUC Maximization in the Era of Big Data and AI: A Survey

    Authors: Tianbao Yang, Yiming Ying

    Abstract: Area under the ROC curve, a.k.a. AUC, is a measure of choice for assessing the performance of a classifier for imbalanced data. AUC maximization refers to a learning paradigm that learns a predictive model by directly maximizing its AUC score. It has been studied for more than two decades dating back to late 90s and a huge amount of work has been devoted to AUC maximization since then. Recently, s… ▽ More

    Submitted 3 August, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: Accepted to the Journal of ACM Computing Surveys

  46. arXiv:2201.09046  [pdf, other

    cs.LG cs.CR

    Differentially Private SGDA for Minimax Problems

    Authors: Zhenhuan Yang, Shu Hu, Yunwen Lei, Kush R. Varshney, Siwei Lyu, Yiming Ying

    Abstract: Stochastic gradient descent ascent (SGDA) and its variants have been the workhorse for solving minimax problems. However, in contrast to the well-studied stochastic gradient descent (SGD) with differential privacy (DP) constraints, there is little work on understanding the generalization (utility) of SGDA with DP constraints. In this paper, we use the algorithmic stability approach to establish th… ▽ More

    Submitted 29 July, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

    Comments: To appear in UAI 2022

  47. Magnetoelectricity in two-dimensional materials

    Authors: Yìlè Yīng, Ulrich Zülicke

    Abstract: Since the initial isolation of few-layer graphene, a plethora of two-dimensional atomic crystals has become available, covering almost all known materials types including metals, semiconductors, superconductors, ferro- and antiferromagnets. These advances have augmented the already existing variety of two-dimensional materials that are routinely realized by quantum confinement in bulk-semiconducto… ▽ More

    Submitted 29 June, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

    Comments: 29 pages, 7 figures. v3: corrected typo in Fig. 1

    Journal ref: Advances in Physics: X 7, 2032343 (2022)

  48. arXiv:2112.14869  [pdf, other

    cs.LG

    Label Distributionally Robust Losses for Multi-class Classification: Consistency, Robustness and Adaptivity

    Authors: Dixian Zhu, Yiming Ying, Tianbao Yang

    Abstract: We study a family of loss functions named label-distributionally robust (LDR) losses for multi-class classification that are formulated from distributionally robust optimization (DRO) perspective, where the uncertainty in the given label information are modeled and captured by taking the worse case of distributional weights. The benefits of this perspective are several fold: (i) it provides a unif… ▽ More

    Submitted 28 June, 2023; v1 submitted 29 December, 2021; originally announced December 2021.

    Comments: To appear in ICML2023; 37 pages

  49. arXiv:2111.15192  [pdf, other

    cs.CV

    PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction

    Authors: Qingyu Wang, Baojian Ma, Wei Liu, Mingzhao Lou, Mingchuan Zhou, Huanyu Jiang, Yibin Ying

    Abstract: Stereo matching is an important task in computer vision which has drawn tremendous research attention for decades. While in terms of disparity accuracy, density and data size, public stereo datasets are difficult to meet the requirements of models. In this paper, we aim to address the issue between datasets and models and propose a large scale stereo dataset with high accuracy disparity ground tru… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

  50. arXiv:2111.12050  [pdf, other

    cs.LG

    Simple Stochastic and Online Gradient Descent Algorithms for Pairwise Learning

    Authors: Zhenhuan Yang, Yunwen Lei, Puyu Wang, Tianbao Yang, Yiming Ying

    Abstract: Pairwise learning refers to learning tasks where the loss function depends on a pair of instances. It instantiates many important machine learning tasks such as bipartite ranking and metric learning. A popular approach to handle streaming data in pairwise learning is an online gradient descent (OGD) algorithm, where one needs to pair the current instance with a buffering set of previous instances… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021 accepted