
Showing 51–100 of 356 results for author: Yin, M

  1. arXiv:2411.16746  [pdf, ps, other]

    cs.CR cs.AI cs.LG

    LoBAM: LoRA-Based Backdoor Attack on Model Merging

    Authors: Ming Yin, Jingyang Zhang, Jingwei Sun, Minghong Fang, Hai Li, Yiran Chen

    Abstract: Model merging is an emerging technique that integrates multiple models fine-tuned on different tasks to create a versatile model that excels in multiple domains. This scheme, in the meantime, may open up backdoor attack opportunities where one single malicious model can jeopardize the integrity of the merged model. Existing works try to demonstrate the risk of such attacks by assuming substantial…

    Submitted 30 May, 2025; v1 submitted 23 November, 2024; originally announced November 2024.

  2. arXiv:2411.15215  [pdf, other]

    cs.LG cs.AI q-bio.BM

    S$^2$ALM: Sequence-Structure Pre-trained Large Language Model for Comprehensive Antibody Representation Learning

    Authors: Mingze Yin, Hanjing Zhou, Jialu Wu, Yiheng Zhu, Yuxuan Zhan, Zitai Kong, Hongxia Xu, Chang-Yu Hsieh, Jintai Chen, Tingjun Hou, Jian Wu

    Abstract: Antibodies safeguard our health through their precise and potent binding to specific antigens, demonstrating promising therapeutic efficacy in the treatment of numerous diseases, including COVID-19. Recent advancements in biomedical language models have shown great potential to interpret complex biological structures and functions. However, existing antibody-specific models have a notable limi…

    Submitted 20 November, 2024; originally announced November 2024.

  3. arXiv:2411.12950   

    cs.AI

    KAAE: Numerical Reasoning for Knowledge Graphs via Knowledge-aware Attributes Learning

    Authors: Ming Yin, Qiang Zhou, Zongsheng Cao, Mei Li

    Abstract: Numerical reasoning is pivotal in various artificial intelligence applications, such as natural language processing and recommender systems, where it involves using entities, relations, and attribute values (e.g., weight, length) to infer new factual relations (e.g., the Nile is longer than the Amazon). However, existing approaches encounter two critical challenges in modeling: (1) semantic releva…

    Submitted 23 November, 2024; v1 submitted 19 November, 2024; originally announced November 2024.

    Comments: This paper has been withdrawn because of unresolved collaboration disputes or authorship issues within the research team. We are actively communicating to reach an agreement and avoid a recurrence of similar issues

  4. arXiv:2411.12593  [pdf, other]

    cs.CV cs.AI

    AdaCM$^2$: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction

    Authors: Yuanbin Man, Ying Huang, Chengming Zhang, Bingzhe Li, Wei Niu, Miao Yin

    Abstract: The advancements in large language models (LLMs) have propelled the improvement of video understanding tasks by incorporating LLMs with visual models. However, most existing LLM-based models (e.g., VideoLLaMA, VideoChat) are constrained to processing short-duration videos. Recent work attempts to understand long-term videos by extracting and compressing visual features into a fixed memory size. Neverth…

    Submitted 4 April, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

    Comments: CVPR 2025 Highlight

  5. arXiv:2411.10461  [pdf, other]

    cs.HC cs.AI

    Utilizing Human Behavior Modeling to Manipulate Explanations in AI-Assisted Decision Making: The Good, the Bad, and the Scary

    Authors: Zhuoyan Li, Ming Yin

    Abstract: Recent advances in AI models have increased the integration of AI-based decision aids into the human decision making process. To fully unlock the potential of AI-assisted decision making, researchers have computationally modeled how humans incorporate AI recommendations into their final decisions, and utilized these models to improve human-AI team performance. Meanwhile, due to the "black-box" n…

    Submitted 2 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024

  6. arXiv:2411.06019  [pdf, other]

    cs.CV cs.GR

    GaussianSpa: An "Optimizing-Sparsifying" Simplification Framework for Compact and High-Quality 3D Gaussian Splatting

    Authors: Yangming Zhang, Wenqi Jia, Wei Niu, Miao Yin

    Abstract: 3D Gaussian Splatting (3DGS) has emerged as a mainstream approach for novel view synthesis, leveraging continuous aggregations of Gaussian functions to model scene geometry. However, 3DGS suffers from substantial memory requirements to store the multitude of Gaussians, hindering its practicality. To address this challenge, we introduce GaussianSpa, an optimization-based simplification framework for compact…

    Submitted 10 April, 2025; v1 submitted 8 November, 2024; originally announced November 2024.

    Comments: CVPR 2025. Project page at https://noodle-lab.github.io/gaussianspa/

  7. arXiv:2411.04138  [pdf, other]

    cs.NI cs.AI cs.LG

    NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation

    Authors: Momin Haider, Ming Yin, Menglei Zhang, Arpit Gupta, Jing Zhu, Yu-Xiang Wang

    Abstract: Mobile devices such as smartphones, laptops, and tablets can often connect to multiple access networks (e.g., Wi-Fi, LTE, and 5G) simultaneously. Recent advancements facilitate seamless integration of these connections below the transport layer, enhancing the experience for apps that lack inherent multi-path support. This optimization hinges on dynamically determining the traffic distribution acro…

    Submitted 29 October, 2024; originally announced November 2024.

    Comments: NeurIPS (Datasets and Benchmarks)

  8. arXiv:2411.02120  [pdf, other]

    cs.LG cs.AI q-bio.BM

    Bridge-IF: Learning Inverse Protein Folding with Markov Bridges

    Authors: Yiheng Zhu, Jialu Wu, Qiuyi Li, Jiahuan Yan, Mingze Yin, Wei Wu, Mingyang Li, Jieping Ye, Zheng Wang, Jian Wu

    Abstract: Inverse protein folding is a fundamental task in computational protein design, which aims to design protein sequences that fold into the desired backbone structures. While the development of machine learning algorithms for this task has seen significant success, the prevailing approaches, which predominantly employ a discriminative formulation, frequently encounter the error accumulation issue and…

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024

  9. arXiv:2411.01016  [pdf, other]

    cs.LG cs.AI

    MoE-I$^2$: Compressing Mixture of Experts Models through Inter-Expert Pruning and Intra-Expert Low-Rank Decomposition

    Authors: Cheng Yang, Yang Sui, Jinqi Xiao, Lingyi Huang, Yu Gong, Yuanlin Duan, Wenqi Jia, Miao Yin, Yu Cheng, Bo Yuan

    Abstract: The emergence of Mixture of Experts (MoE) LLMs has significantly advanced the development of language models. Compared to traditional LLMs, MoE LLMs achieve higher performance with considerably fewer activated parameters. Despite this efficiency, their enormous parameter size still leads to high deployment costs. In this paper, we introduce a two-stage compression…

    Submitted 1 November, 2024; originally announced November 2024.

  10. arXiv:2411.00841  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    A Theoretical Perspective for Speculative Decoding Algorithm

    Authors: Ming Yin, Minshuo Chen, Kaixuan Huang, Mengdi Wang

    Abstract: Transformer-based autoregressive sampling has been the major bottleneck slowing down large language model inference. One effective way to accelerate inference is Speculative Decoding, which employs a small model to sample a sequence of draft tokens and a large model to validate. Despite its empirical effectiveness, the theoretical understanding of Speculative Decoding is falling behind.…

    Submitted 29 October, 2024; originally announced November 2024.

    Comments: NeurIPS 2024

  11. arXiv:2411.00078  [pdf, other]

    cs.CV cs.AI eess.IV

    How Good Are We? Evaluating Cell AI Foundation Models in Kidney Pathology with Human-in-the-Loop Enrichment

    Authors: Junlin Guo, Siqi Lu, Can Cui, Ruining Deng, Tianyuan Yao, Zhewen Tao, Yizhe Lin, Marilyn Lionts, Quan Liu, Juming Xiong, Yu Wang, Shilin Zhao, Catie Chang, Mitchell Wilkes, Mengmeng Yin, Haichun Yang, Yuankai Huo

    Abstract: Training AI foundation models has emerged as a promising large-scale learning approach for addressing real-world healthcare challenges, including digital pathology. While many of these models have been developed for tasks like disease diagnosis and tissue quantification using extensive and diverse training datasets, their readiness for deployment on some of the arguably simplest tasks, such as nuclei seg…

    Submitted 31 October, 2024; originally announced November 2024.

  12. arXiv:2410.24193  [pdf, ps, other]

    math.NT

    On the Tamagawa number conjecture for newforms at Eisenstein primes

    Authors: Mulun Yin

    Abstract: We extend the results of [CGLS22] to higher weight modular forms and prove a rank $0$ Tamagawa number formula (also known as the Bloch-Kato conjecture) for modular forms at good Eisenstein primes. Under standard hypotheses (i.e. the injectivity of the $p$-adic Abel-Jacobi map and the non-degeneracy of the Gillet-Soulé height pairing), we also discuss some partial results towards a rank $1$ result.…

    Submitted 31 October, 2024; originally announced October 2024.

    Comments: 32 pages. Comments welcome!

    MSC Class: 11G40

  13. arXiv:2410.23241  [pdf, ps, other]

    math.NT

    $p$-converse theorems for elliptic curves of potentially good ordinary reduction at Eisenstein primes

    Authors: Timo Keller, Mulun Yin

    Abstract: Let $E/\mathbf{Q}$ be an elliptic curve and $p\geq 3$ be a prime. We prove the $p$-converse theorems for elliptic curves of potentially good ordinary reduction at Eisenstein primes (i.e., such that the residual representation $E[p]$ is reducible) when the $p$-Selmer rank is $0$ or $1$. The key step is to obtain the anticyclotomic Iwasawa Main Conjectures for an auxiliary imaginary quadratic field…

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: 34 pages. Comments welcome!

    MSC Class: 11G40 (Primary) 11G05; 11G10; 14G10 (Secondary)

  14. arXiv:2410.20290  [pdf, other]

    cs.CL

    Fast Best-of-N Decoding via Speculative Rejection

    Authors: Hanshi Sun, Momin Haider, Ruiqi Zhang, Huitao Yang, Jiahao Qiu, Ming Yin, Mengdi Wang, Peter Bartlett, Andrea Zanette

    Abstract: The safe and effective deployment of Large Language Models (LLMs) involves a critical step called alignment, which ensures that the model's responses are in accordance with human preferences. Prevalent alignment techniques, such as DPO, PPO and their variants, align LLMs by changing the pre-trained model weights during a phase called post-training. While predominant, these post-training methods ad…

    Submitted 31 October, 2024; v1 submitted 26 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024

  15. arXiv:2410.14620  [pdf, other]

    cs.IT eess.SP

    Site-Specific Outdoor Propagation Assessment and Ray-Tracing Analysis for Wireless Digital Twins

    Authors: Morteza Ghaderi Aram, Hao Guo, Mingsheng Yin, Tommy Svensson

    Abstract: Digital twinning is becoming increasingly vital in the design and real-time control of future wireless networks by providing precise cost-effective simulations, predictive insights, and real-time data integration. This paper explores the application of digital twinning in optimizing wireless communication systems within urban environments, where building arrangements can critically impact network…

    Submitted 18 October, 2024; originally announced October 2024.

  16. arXiv:2410.05877  [pdf, other]

    cs.IR cs.LG

    MDAP: A Multi-view Disentangled and Adaptive Preference Learning Framework for Cross-Domain Recommendation

    Authors: Junxiong Tong, Mingjia Yin, Hao Wang, Qiushi Pan, Defu Lian, Enhong Chen

    Abstract: Cross-domain Recommendation systems leverage multi-domain user interactions to improve performance, especially in sparse data or new user scenarios. However, CDR faces challenges such as effectively capturing user preferences and avoiding negative transfer. To address these issues, we propose the Multi-view Disentangled and Adaptive Preference Learning (MDAP) framework. Our MDAP framework uses a m…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: The International Web Information Systems Engineering conference

  17. arXiv:2410.04545  [pdf, other]

    cs.CL

    How Does the Disclosure of AI Assistance Affect the Perceptions of Writing?

    Authors: Zhuoyan Li, Chen Liang, Jing Peng, Ming Yin

    Abstract: Recent advances in generative AI technologies like large language models have boosted the incorporation of AI assistance in writing workflows, leading to the rise of a new paradigm of human-AI co-creation in writing. To understand how people perceive writings that are produced under this paradigm, in this paper, we conduct an experimental study to understand whether and how the disclosure of the l…

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: EMNLP 2024. arXiv admin note: text overlap with arXiv:2403.12004

  18. arXiv:2410.04346  [pdf, other]

    cs.CL

    Ordinal Preference Optimization: Aligning Human Preferences via NDCG

    Authors: Yang Zhao, Yixin Wang, Mingzhang Yin

    Abstract: Aligning Large Language Models (LLMs) with diverse human preferences is a pivotal technique for controlling model behaviors and enhancing generation quality. Reinforcement Learning from Human Feedback (RLHF), Direct Preference Optimization (DPO), and their variants optimize language models by pairwise comparisons. However, when multiple responses are available, these approaches fall short of lever…

    Submitted 5 October, 2024; originally announced October 2024.

  19. Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding with LLMs

    Authors: Wei Wu, Chao Wang, Liyi Chen, Mingze Yin, Yiheng Zhu, Kun Fu, Jieping Ye, Hui Xiong, Zheng Wang

    Abstract: Proteins, as essential biomolecules, play a central role in biological processes, including metabolic reactions and DNA replication. Accurate prediction of their properties and functions is crucial in biological applications. Recent development of protein language models (pLMs) with supervised fine tuning provides a promising solution to this problem. However, the fine-tuned model is tailored for…

    Submitted 29 May, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: Accepted by KDD2025

  20. arXiv:2410.03126  [pdf, other]

    cs.HC cs.AI

    Understanding Decision Subjects' Engagement with and Perceived Fairness of AI Models When Opportunities of Qualification Improvement Exist

    Authors: Meric Altug Gemalmaz, Ming Yin

    Abstract: We explore how an AI model's decision fairness affects people's engagement with and perceived fairness of the model if they are subject to its decisions, but could repeatedly and strategically respond to these decisions. Two types of strategic responses are considered -- people could determine whether to continue interacting with the model, and whether to invest in themselves to improve their chan…

    Submitted 3 October, 2024; originally announced October 2024.

  21. arXiv:2409.18295  [pdf, other]

    cs.LG cs.AI cs.DC

    Enhancing Lossy Compression Through Cross-Field Information for Scientific Applications

    Authors: Youyuan Liu, Wenqi Jia, Taolue Yang, Miao Yin, Sian Jin

    Abstract: Lossy compression is one of the most effective methods for reducing the size of scientific data containing multiple data fields. It reduces information density through prediction or transformation techniques to compress the data. Previous approaches use local information from a single target field when predicting target data points, limiting their potential to achieve higher compression ratios. In…

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: 9 pages, 9 figures, accepted by DRBSD-10

  22. arXiv:2409.17466  [pdf, other]

    stat.ML cs.AI cs.LG

    Adjusting Regression Models for Conditional Uncertainty Calibration

    Authors: Ruijiang Gao, Mingzhang Yin, James McInerney, Nathan Kallus

    Abstract: Conformal Prediction methods have finite-sample distribution-free marginal coverage guarantees. However, they generally do not offer conditional coverage guarantees, which can be important for high-stakes decisions. In this paper, we propose a novel algorithm to train a regression function to improve the conditional coverage after applying the split conformal prediction procedure. We establish an…

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: Machine Learning Special Issue on Uncertainty Quantification

  23. arXiv:2409.07416  [pdf, other]

    cs.IR cs.AI cs.LG

    Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation

    Authors: Luo Ji, Gao Liu, Mingyang Yin, Hongxia Yang, Jingren Zhou

    Abstract: Modern listwise recommendation systems need to consider both long-term user perceptions and short-term interest shifts. Reinforcement learning can be applied to recommendation to study such a problem but is also subject to large search space, sparse user feedback and long interactive latency. Motivated by recent progress in hierarchical reinforcement learning, we propose a novel framework called m…

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: 18 pages, 4 figures

  24. arXiv:2409.05785  [pdf, other]

    cs.DC cs.AI

    NeurLZ: An Online Neural Learning-Based Method to Enhance Scientific Lossy Compression

    Authors: Wenqi Jia, Zhewen Hu, Youyuan Liu, Boyuan Zhang, Jinzhen Wang, Jinyang Liu, Wei Niu, Stavros Kalafatis, Junzhou Huang, Sian Jin, Daoce Wang, Jiannan Tian, Miao Yin

    Abstract: Large-scale scientific simulations generate massive datasets, posing challenges for storage and I/O. Traditional lossy compression struggles to advance more in balancing compression ratio, data quality, and adaptability to diverse scientific data features. While deep learning-based solutions have been explored, their common practice of relying on large models and offline training limits adaptabili…

    Submitted 18 April, 2025; v1 submitted 9 September, 2024; originally announced September 2024.

    Comments: ICS 2025

  25. arXiv:2409.02416  [pdf, other]

    cs.LG stat.ML

    Relative-Translation Invariant Wasserstein Distance

    Authors: Binshuai Wang, Qiwei Di, Ming Yin, Mengdi Wang, Quanquan Gu, Peng Wei

    Abstract: We introduce a new family of distances, relative-translation invariant Wasserstein distances ($RW_p$), for measuring the similarity of two probability distributions under distribution shift. Generalizing it from the classical optimal transport model, we show that $RW_p$ distances are also real distance metrics defined on the quotient set $\mathcal{P}_p(\mathbb{R}^n)/\sim$ and invariant to distribu…

    Submitted 3 September, 2024; originally announced September 2024.

  26. arXiv:2408.09278  [pdf, other]

    eess.IV cs.CV

    Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology

    Authors: Junchao Zhu, Mengmeng Yin, Ruining Deng, Yitian Long, Yu Wang, Yaohong Wang, Shilin Zhao, Haichun Yang, Yuankai Huo

    Abstract: Accurate delineation of the boundaries between the renal cortex and medulla is crucial for subsequent functional structural analysis and disease diagnosis. Training high-quality deep-learning models for layer segmentation relies on the availability of large amounts of annotated data. However, due to the patient's privacy of medical data and scarce clinical cases, constructing pathological datasets…

    Submitted 21 March, 2025; v1 submitted 17 August, 2024; originally announced August 2024.

  27. arXiv:2408.08598  [pdf, ps, other]

    math.CO

    On odd covers of cliques and disjoint unions

    Authors: Calum Buchanan, Alexander Clifton, Eric Culver, Péter Frankl, Jiaxi Nie, Kenta Ozeki, Puck Rombach, Mei Yin

    Abstract: Babai and Frankl posed the "odd cover problem" of finding the minimum cardinality of a collection of complete bipartite graphs such that every edge of the complete graph of order $n$ is covered an odd number of times. In a previous paper with O'Neill, some of the authors proved that this value is always $\lceil n / 2 \rceil$ or $\lceil n / 2 \rceil + 1$ and that it is the former whenever $n$ is a…

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: 19 pages, 6 figures

    MSC Class: 05C70; 05C50

  28. arXiv:2408.06381  [pdf, other]

    eess.IV cs.AI cs.CV

    Assessment of Cell Nuclei AI Foundation Models in Kidney Pathology

    Authors: Junlin Guo, Siqi Lu, Can Cui, Ruining Deng, Tianyuan Yao, Zhewen Tao, Yizhe Lin, Marilyn Lionts, Quan Liu, Juming Xiong, Yu Wang, Shilin Zhao, Catie Chang, Mitchell Wilkes, Mengmeng Yin, Haichun Yang, Yuankai Huo

    Abstract: Cell nuclei instance segmentation is a crucial task in digital kidney pathology. Traditional automatic segmentation methods often lack generalizability when applied to unseen datasets. Recently, the success of foundation models (FMs) has provided a more generalizable solution, potentially enabling the segmentation of any cell type. In this study, we perform a large-scale evaluation of three widely…

    Submitted 6 February, 2025; v1 submitted 9 August, 2024; originally announced August 2024.

  29. MABR: Multilayer Adversarial Bias Removal Without Prior Bias Knowledge

    Authors: Maxwell J. Yin, Boyu Wang, Charles Ling

    Abstract: Models trained on real-world data often mirror and exacerbate existing social biases. Traditional methods for mitigating these biases typically require prior knowledge of the specific biases to be addressed, such as gender or racial biases, and the social groups associated with each instance. In this paper, we introduce a novel adversarial training strategy that operates independently of prior bia…

    Submitted 10 May, 2025; v1 submitted 10 August, 2024; originally announced August 2024.

    Journal ref: AAAI 2025, 39(24):25724-25732

  30. arXiv:2408.02914  [pdf, other]

    cs.HC

    VirtualNexus: Enhancing 360-Degree Video AR/VR Collaboration with Environment Cutouts and Virtual Replicas

    Authors: Xincheng Huang, Michael Yin, Ziyi Xia, Robert Xiao

    Abstract: Asymmetric AR/VR collaboration systems bring a remote VR user to a local AR user's physical environment, allowing them to communicate and work within a shared virtual/physical space. Such systems often display the remote environment through 3D reconstructions or 360-degree videos. While 360-degree cameras stream an environment in higher quality, they lack spatial information, making them less inte…

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: 12 pages, 10 figures, to be published in The 37th Annual ACM Symposium on User Interface Software and Technology (UIST'24)

  31. arXiv:2407.20172  [pdf, other]

    eess.IV cs.AI cs.CV

    LatentArtiFusion: An Effective and Efficient Histological Artifacts Restoration Framework

    Authors: Zhenqi He, Wenrui Liu, Minghao Yin, Kai Han

    Abstract: Histological artifacts pose challenges for both pathologists and Computer-Aided Diagnosis (CAD) systems, leading to errors in analysis. Current approaches for histological artifact restoration, based on Generative Adversarial Networks (GANs) and pixel-level Diffusion Models, suffer from performance limitations and computational inefficiencies. In this paper, we propose a novel framework, LatentArt…

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: Accepted to DGM4MICCAI2024

  32. arXiv:2407.19296  [pdf, other]

    cs.AI

    Multi-Modal CLIP-Informed Protein Editing

    Authors: Mingze Yin, Hanjing Zhou, Yiheng Zhu, Miao Lin, Yixuan Wu, Jialu Wu, Hongxia Xu, Chang-Yu Hsieh, Tingjun Hou, Jintai Chen, Jian Wu

    Abstract: Proteins govern most biological functions essential for life, but achieving controllable protein discovery and optimization remains challenging. Recently, machine learning-assisted protein editing (MLPE) has shown promise in accelerating optimization cycles and reducing experimental workloads. However, current methods struggle with the vast combinatorial space of potential protein edits and cannot…

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: 13 pages, 7 figures, 5 tables

    Journal ref: Health Data Science, 2024

  33. arXiv:2407.18390  [pdf, other]

    eess.IV cs.CV

    GLAM: Glomeruli Segmentation for Human Pathological Lesions using Adapted Mouse Model

    Authors: Lining Yu, Mengmeng Yin, Ruining Deng, Quan Liu, Tianyuan Yao, Can Cui, Yitian Long, Yu Wang, Yaohong Wang, Shilin Zhao, Haichun Yang, Yuankai Huo

    Abstract: Moving from animal models to human applications in preclinical research encompasses a broad spectrum of disciplines in medical science. A fundamental element in the development of new drugs, treatments, diagnostic methods, and in deepening our understanding of disease processes is the accurate measurement of kidney tissues. Past studies have demonstrated the viability of translating glomeruli segm…

    Submitted 7 February, 2025; v1 submitted 25 July, 2024; originally announced July 2024.

  34. arXiv:2407.06645  [pdf, other]

    cs.LG cs.CL

    Entropy Law: The Story Behind Data Compression and LLM Performance

    Authors: Mingjia Yin, Chuhan Wu, Yufei Wang, Hao Wang, Wei Guo, Yasheng Wang, Yong Liu, Ruiming Tang, Defu Lian, Enhong Chen

    Abstract: Data is the cornerstone of large language models (LLMs), but not all data is useful for model learning. Carefully selected data can better elicit the capabilities of LLMs with much less computational overhead. Most methods concentrate on evaluating the quality of individual samples in data selection, while the combinatorial effects among samples are neglected. Even if each sample is of perfect qua…

    Submitted 10 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

  35. arXiv:2407.06309  [pdf, other]

    cs.CY cs.AI

    Multimodal Chain-of-Thought Reasoning via ChatGPT to Protect Children from Age-Inappropriate Apps

    Authors: Chuanbo Hu, Bin Liu, Minglei Yin, Yilu Zhou, Xin Li

    Abstract: Mobile applications (Apps) could expose children to inappropriate themes such as sexual content, violence, and drug use. Maturity rating offers a quick and effective method for potential users, particularly guardians, to assess the maturity levels of apps. Determining accurate maturity ratings for mobile apps is essential to protect children's health in today's saturated digital marketplace. Exist…

    Submitted 8 July, 2024; originally announced July 2024.

  36. arXiv:2407.03307  [pdf, other]

    eess.IV cs.CV

    HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization

    Authors: Yucheng Tang, Yufan He, Vishwesh Nath, Pengfeig Guo, Ruining Deng, Tianyuan Yao, Quan Liu, Can Cui, Mengmeng Yin, Ziyue Xu, Holger Roth, Daguang Xu, Haichun Yang, Yuankai Huo

    Abstract: In digital pathology, the traditional method for deep learning-based image segmentation typically involves a two-stage process: initially segmenting high-resolution whole slide images (WSI) into smaller patches (e.g., 256x256, 512x512, 1024x1024) and subsequently reconstructing them to their original scale. This method often struggles to capture the complex details and vast scope of WSIs. In this…

    Submitted 3 July, 2024; originally announced July 2024.

  37. arXiv:2407.00596  [pdf, other]

    eess.IV cs.CV

    HATs: Hierarchical Adaptive Taxonomy Segmentation for Panoramic Pathology Image Analysis

    Authors: Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Juming Xiong, Shunxing Bao, Hao Li, Mengmeng Yin, Yu Wang, Shilin Zhao, Yucheng Tang, Haichun Yang, Yuankai Huo

    Abstract: Panoramic image segmentation in computational pathology presents a remarkable challenge due to the morphologically complex and variably scaled anatomy. For instance, the intricate organization in kidney pathology spans multiple layers, from regions like the cortex and medulla to functional units such as glomeruli, tubules, and vessels, down to various cell types. In this paper, we propose a novel…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2402.19286

  38. arXiv:2407.00030  [pdf, other]

    cs.DC cs.PF

    On Orchestrating Parallel Broadcasts for Distributed Ledgers

    Authors: Peiyao Sheng, Chenyuan Wu, Dahlia Malkhi, Michael K. Reiter, Chrysoula Stathakopoulou, Michael Wei, Maofan Yin

    Abstract: This paper introduces and develops the concept of "ticketing", through which atomic broadcasts are orchestrated by nodes in a distributed system. The paper studies different ticketing regimes that allow parallelism, yet prevent slow nodes from hampering overall progress. It introduces a hybrid scheme which combines managed and unmanaged ticketing regimes, striking a balance between adaptivity an…

    Submitted 17 May, 2024; originally announced July 2024.

  39. arXiv:2406.12404  [pdf]

    cs.CV

    Scan-to-BIM for As-built Roads: Automatic Road Digital Twinning from Semantically Labeled Point Cloud Data

    Authors: Yuexiong Ding, Mengtian Yin, Ran Wei, Ioannis Brilakis, Muyang Liu, Xiaowei Luo

    Abstract: Creating geometric digital twins (gDT) for as-built roads still faces many challenges, such as low automation level and accuracy, limited asset types and shapes, and reliance on engineering experience. A novel scan-to-building information modeling (scan-to-BIM) framework is proposed for automatic road gDT creation based on semantically labeled point cloud data (PCD), which considers six asset type…

    Submitted 18 June, 2024; originally announced June 2024.

  40. arXiv:2406.05590  [pdf, other]

    cs.CR cs.AI cs.CY cs.LG

    NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security

    Authors: Minghao Shao, Sofija Jancheska, Meet Udeshi, Brendan Dolan-Gavitt, Haoran Xi, Kimberly Milner, Boyuan Chen, Max Yin, Siddharth Garg, Prashanth Krishnamurthy, Farshad Khorrami, Ramesh Karri, Muhammad Shafique

    Abstract: Large Language Models (LLMs) are being deployed across various domains today. However, their capacity to solve Capture the Flag (CTF) challenges in cybersecurity has not been thoroughly evaluated. To address this, we develop a novel method to assess LLMs in solving CTF challenges by creating a scalable, open-source benchmark database specifically designed for these applications. This database incl…

    Submitted 18 February, 2025; v1 submitted 8 June, 2024; originally announced June 2024.

  41. arXiv:2406.01838  [pdf, other]

    cs.LG cs.AI

    Learning the Target Network in Function Space

    Authors: Kavosh Asadi, Yao Liu, Shoham Sabach, Ming Yin, Rasool Fakoor

    Abstract: We focus on the task of learning the value function in the reinforcement learning (RL) setting. This task is often solved by updating a pair of online and target networks while ensuring that the parameters of these two networks are equivalent. We propose Lookahead-Replicate (LR), a new value-function approximation algorithm that is agnostic to this parameter-space equivalence. Instead, the LR algo…

    Submitted 22 September, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to International Conference on Machine Learning (ICML24)

  42. arXiv:2405.20495  [pdf, other]

    cs.CL cs.LG

    Transfer Q Star: Principled Decoding for LLM Alignment

    Authors: Souradip Chakraborty, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Singh Bedi, Furong Huang

    Abstract: Aligning foundation models is essential for their safe and trustworthy deployment. However, traditional fine-tuning methods are computationally intensive and require updating billions of model parameters. A promising alternative, alignment via decoding, adjusts the response distribution directly without model updates to maximize a target reward $r$, thus providing a lightweight and adaptable frame…

    Submitted 30 May, 2024; originally announced May 2024.

  43. arXiv:2405.20492  [pdf, ps, other]

    math.CO math.RA

    Monomial identities in the Weyl algebra

    Authors: Darij Grinberg, Tom Roby, Stephan Wagner, Mei Yin

    Abstract: Motivated by a question and some enumerative conjectures of Richard Stanley, we explore the equivalence classes of words in the Weyl algebra, $\mathbf{k} \left< D,U \mid DU - UD = 1 \right>$. We show that each class is generated by the swapping of adjacent *balanced subwords*, i.e., those which have the same number of $D$'s as $U$'s, and give several other characterizations, as well as a linear-ti…

    Submitted 22 November, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 64 pages, 13 pictures. For Richard Stanley's 80th birthday. Detailed version available as ancillary file. Comments are welcome! v3 adds Remark 2.2 and three pictures

    MSC Class: 12H05; 16S32; 05A15; 68R15

  44. Dataset Regeneration for Sequential Recommendation

    Authors: Mingjia Yin, Hao Wang, Wei Guo, Yong Liu, Suojuan Zhang, Sirui Zhao, Defu Lian, Enhong Chen

    Abstract: The sequential recommender (SR) system is a crucial component of modern recommender systems, as it aims to capture the evolving preferences of users. Significant efforts have been made to enhance the capabilities of SR systems. These methods typically follow the model-centric paradigm, which involves developing effective models based on fixed datasets. However, this approach often overlooks potent…

    Submitted 10 September, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  45. Cypher4BIM: Releasing the Power of Graph for Building Knowledge Discovery

    Authors: Junxiang Zhu, Nicholas Nisbet, Mengtian Yin, Ran Wei, Ioannis Brilakis

    Abstract: Graph is considered a promising way for managing building information. A new graphic form of IFC (Industry Foundation Classes) data has just been developed, referred to as IFC-Graph. However, understanding of IFC-Graph is insufficient, especially for information query. This study aims to explore graphic building information query and develop a graph query language tailored for IFC-Graph. A series…

    Submitted 25 May, 2024; originally announced May 2024.

    Journal ref: Automation in Construction, 2025

  46. arXiv:2405.12473  [pdf, other]

    cs.IR cs.AI

    Learning Partially Aligned Item Representation for Cross-Domain Sequential Recommendation

    Authors: Mingjia Yin, Hao Wang, Wei Guo, Yong Liu, Zhi Li, Sirui Zhao, Zhen Wang, Defu Lian, Enhong Chen

    Abstract: Cross-domain sequential recommendation (CDSR) aims to uncover and transfer users' sequential preferences across multiple recommendation domains. While significant endeavors have been made, they primarily concentrated on developing advanced transfer modules and aligning user representations using self-supervised learning techniques. However, the problem of aligning item representations has received…

    Submitted 21 August, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  47. arXiv:2404.17069  [pdf, other]

    cs.IT cs.LG eess.SP

    Channel Modeling for FR3 Upper Mid-band via Generative Adversarial Networks

    Authors: Yaqi Hu, Mingsheng Yin, Marco Mezzavilla, Hao Guo, Sundeep Rangan

    Abstract: The upper mid-band (FR3) has been recently attracting interest for new generation of mobile networks, as it provides a promising balance between spectrum availability and coverage, which are inherent limitations of the sub 6GHz and millimeter wave bands, respectively. In order to efficiently design and optimize the network, channel modeling plays a key role since FR3 systems are expected to operat…

    Submitted 25 April, 2024; originally announced April 2024.

  48. arXiv:2404.13528  [pdf, other]

    cs.LG cs.AI cs.DC

    SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile

    Authors: Wei Niu, Md Musfiqur Rahman Sanim, Zhihao Shu, Jiexiong Guan, Xipeng Shen, Miao Yin, Gagan Agrawal, Bin Ren

    Abstract: This work is motivated by recent developments in Deep Neural Networks, particularly the Transformer architectures underlying applications such as ChatGPT, and the need for performing inference on mobile devices. Focusing on emerging transformers (specifically the ones with computationally efficient Swin-like architectures) and large models (e.g., Stable Diffusion and LLMs) based on transformers, w…

    Submitted 21 April, 2024; originally announced April 2024.

  49. arXiv:2404.13470  [pdf, other]

    cs.DC cs.AI

    GWLZ: A Group-wise Learning-based Lossy Compression Framework for Scientific Data

    Authors: Wenqi Jia, Sian Jin, Jinzhen Wang, Wei Niu, Dingwen Tao, Miao Yin

    Abstract: The rapid expansion of computational capabilities and the ever-growing scale of modern HPC systems present formidable challenges in managing exascale scientific data. Faced with such vast datasets, traditional lossless compression techniques prove insufficient in reducing data size to a manageable level while preserving all information intact. In response, researchers have turned to error-bounded…

    Submitted 20 April, 2024; originally announced April 2024.

  50. arXiv:2404.11871  [pdf, ps, other]

    cs.CV

    Group-On: Boosting One-Shot Segmentation with Supportive Query

    Authors: Hanjing Zhou, Mingze Yin, Danny Chen, Jian Wu, JinTai Chen

    Abstract: One-shot semantic segmentation aims to segment query images given only ONE annotated support image of the same class. This task is challenging because target objects in the support and query images can be largely different in appearance and pose (i.e., intra-class variation). Prior works suggested that incorporating more annotated support images in few-shot settings boosts performances but increas…

    Submitted 8 June, 2025; v1 submitted 17 April, 2024; originally announced April 2024.

    Journal ref: ICME 2025