Machine Learning

Authors and titles for recent submissions

  • Fri, 5 Sep 2025
  • Thu, 4 Sep 2025
  • Wed, 3 Sep 2025
  • Mon, 1 Sep 2025
  • Fri, 29 Aug 2025

Total of 835 entries : 1-100 101-200 201-300 301-400 376-475 401-500 501-600 601-700 ... 801-835
Showing up to 100 entries per page: fewer | more | all

Wed, 3 Sep 2025 (continued, showing 100 of 380 entries)

[376] arXiv:2509.00797 [pdf, html, other]
Title: ProCause: Generating Counterfactual Outcomes to Evaluate Prescriptive Process Monitoring Methods
Jakob De Moor, Hans Weytjens, Johannes De Smedt
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[377] arXiv:2509.00772 [pdf, html, other]
Title: Flow Matters: Directional and Expressive GNNs for Heterophilic Graphs
Arman Gupta, Govind Waghmare, Gaurav Oberoi, Nitish Srivastava
Subjects: Machine Learning (cs.LG)
[378] arXiv:2509.00754 [pdf, html, other]
Title: Attribute Fusion-based Classifier on Framework of Belief Structure
Qiying Hu, Yingying Liang, Qianli Zhou, Witold Pedrycz
Subjects: Machine Learning (cs.LG)
[379] arXiv:2509.00735 [pdf, html, other]
Title: Task-Aware Adaptive Modulation: A Replay-Free and Resource-Efficient Approach For Continual Graph Learning
Jingtao Liu, Xinming Zhang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[380] arXiv:2509.00704 [pdf, html, other]
Title: Why Pool When You Can Flow? Active Learning with GFlowNets
Renfei Zhang, Mohit Pandey, Artem Cherkasov, Martin Ester
Comments: 6 pages; 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[381] arXiv:2509.00703 [pdf, html, other]
Title: Robust Spatiotemporal Forecasting Using Adaptive Deep-Unfolded Variational Mode Decomposition
Osama Ahmad, Lukas Wesemann, Fabian Waschkowski, Zubair Khalid
Comments: Under review at IEEE Signal Processing Letters
Subjects: Machine Learning (cs.LG)
[382] arXiv:2509.00693 [pdf, html, other]
Title: DELTA: Variational Disentangled Learning for Privacy-Preserving Data Reprogramming
Arun Vignesh Malarkkan, Haoyue Bai, Anjali Kaushik, Yanjie Fu
Comments: 10 pages, 5 figures, 3 Tables. Accepted at IEEE International Conference on Data Mining (ICDM) 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[383] arXiv:2509.00684 [pdf, html, other]
Title: Valid Property-Enhanced Contrastive Learning for Targeted Optimization & Resampling for Novel Drug Design
Amartya Banerjee, Somnath Kar, Anirban Pal, Debabrata Maiti
Comments: Code: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[384] arXiv:2509.00663 [pdf, html, other]
Title: An Evolutionary Multi-objective Optimization for Replica-Exchange-based Physics-informed Operator Learning Network
Binghang Lu, Changhong Mou, Guang Lin
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[385] arXiv:2509.00653 [pdf, html, other]
Title: IndiaWeatherBench: A Dataset and Benchmark for Data-Driven Regional Weather Forecasting over India
Tung Nguyen, Harkanwar Singh, Nilay Naharas, Lucas Bandarkar, Aditya Grover
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Atmospheric and Oceanic Physics (physics.ao-ph)
[386] arXiv:2509.00651 [pdf, html, other]
Title: Missing Data Imputation using Neural Cellular Automata
Tin Luu, Binh Nguyen, Man Ngo
Comments: 20 pages, 4 figures
Subjects: Machine Learning (cs.LG)
[387] arXiv:2509.00648 [pdf, html, other]
Title: Context-Action Embedding Learning for Off-Policy Evaluation in Contextual Bandits
Kushagra Chandak, Vincent Liu, Haanvid Lee
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[388] arXiv:2509.00641 [pdf, html, other]
Title: AMCR: A Framework for Assessing and Mitigating Copyright Risks in Generative Models
Zhipeng Yin, Zichong Wang, Avash Palikhe, Zhen Liu, Jun Liu, Wenbin Zhang
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[389] arXiv:2509.00639 [pdf, html, other]
Title: Disentangling Slow and Fast Temporal Dynamics in Degradation Inference with Hierarchical Differential Models
Mengjie Zhao, Olga Fink
Subjects: Machine Learning (cs.LG)
[390] arXiv:2509.00631 [pdf, html, other]
Title: Forecasting the Ionosphere from Sparse GNSS Data with Temporal-Fusion Transformers
Giacomo Acciarini, Simone Mestici, Halil Kelebek, Linnea Wolniewicz, Michael Vergalla, Madhulika Guhathakurta, Umaa Rebbapragada, Bala Poduval, Atılım Güneş Baydin, Frank Soboczenski
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Atmospheric and Oceanic Physics (physics.ao-ph)
[391] arXiv:2509.00616 [pdf, html, other]
Title: TimeCopilot
Azul Garza, Reneé Rosillo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[392] arXiv:2509.00614 [pdf, html, other]
Title: RoFt-Mol: Benchmarking Robust Fine-Tuning with Molecular Graph Foundation Models
Shikun Liu, Deyu Zou, Nima Shoghi, Victor Fung, Kai Liu, Pan Li
Subjects: Machine Learning (cs.LG); Chemical Physics (physics.chem-ph)
[393] arXiv:2509.00602 [pdf, html, other]
Title: TranCIT: Transient Causal Interaction Toolbox
Salar Nouri, Kaidi Shao, Shervin Safavi
Subjects: Machine Learning (cs.LG)
[394] arXiv:2509.00560 [pdf, html, other]
Title: An Efficient GNNs-to-KANs Distillation via Self-Attention Dynamic Sampling with Potential for Consumer Electronics Edge Deployment
Can Cui, Zilong Fu, Penghe Huang, Yuanyuan Li, Wu Deng, Dongyan Li
Subjects: Machine Learning (cs.LG); Performance (cs.PF)
[395] arXiv:2509.00550 [pdf, other]
Title: Integrated Multivariate Segmentation Tree for the Analysis of Heterogeneous Credit Data in Small and Medium-Sized Enterprises
Lu Han, Xiuying Wang
Comments: 26 pages, 11 figures, 5 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[396] arXiv:2509.00546 [pdf, other]
Title: Advanced spectral clustering for heterogeneous data in credit risk monitoring systems
Lu Han, Mengyan Li, Jiping Qiang, Zhi Su
Comments: 25 pages, 7 figures, 6 tables
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[397] arXiv:2509.00540 [pdf, html, other]
Title: FedThief: Harming Others to Benefit Oneself in Self-Centered Federated Learning
Xiangyu Zhang, Mang Ye
Comments: 12 pages, 5 figures
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[398] arXiv:2509.00524 [pdf, html, other]
Title: Biological Pathway Informed Models with Graph Attention Networks (GATs)
Gavin Wong, Ping Shu Ho, Ivan Au Yeung, Ka Chun Cheung, Simon See
Comments: 5 pages, 3 figures
Subjects: Machine Learning (cs.LG); Molecular Networks (q-bio.MN)
[399] arXiv:2509.00515 [pdf, html, other]
Title: Graph Convolutional Network With Pattern-Spatial Interactive and Regional Awareness for Traffic Forecasting
Xinyu Ji, Chengcheng Yan, Jibiao Yuan, Fiefie Zhao
Subjects: Machine Learning (cs.LG)
[400] arXiv:2509.00488 [pdf, html, other]
Title: Localizing and Mitigating Memorization in Image Autoregressive Models
Aditya Kasliwal, Franziska Boenisch, Adam Dziedzic
Comments: Accepted at ICML 2025 Workshop on the Impact of Memorization on Trustworthy Foundation Models
Subjects: Machine Learning (cs.LG)
[401] arXiv:2509.00454 [pdf, html, other]
Title: Universal Properties of Activation Sparsity in Modern Large Language Models
Filip Szatkowski, Patryk Będkowski, Alessio Devoto, Jan Dubiński, Pasquale Minervini, Mikołaj Piórczyński, Simone Scardapane, Bartosz Wójcik
Subjects: Machine Learning (cs.LG)
[402] arXiv:2509.00421 [pdf, html, other]
Title: Memory Limitations of Prompt Tuning in Transformers
Maxime Meyer, Mario Michelessa, Caroline Chaux, Vincent Y. F. Tan
Subjects: Machine Learning (cs.LG)
[403] arXiv:2509.00415 [pdf, html, other]
Title: Lagrangian Relaxation for Multi-Action Partially Observable Restless Bandits: Heuristic Policies and Indexability
Rahul Meshram, Kesav Kaza
Comments: 13 pages
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[404] arXiv:2509.00404 [pdf, html, other]
Title: Metis: Training Large Language Models with Advanced Low-Bit Quantization
Hengjie Cao, Mengyi Chen, Yifeng Yang, Ruijun Huang, Fang Dong, Jixian Zhou, Anrui Chen, Mingzhi Dong, Yujiang Wang, Jinlong Hou, Yuan Cheng, Fan Wu, Fan Yang, Tun Lu, Ning Gu, Li Shang
Subjects: Machine Learning (cs.LG)
[405] arXiv:2509.00402 [pdf, html, other]
Title: Curriculum Guided Personalized Subgraph Federated Learning
Minku Kang, Hogun Park
Comments: Accepted to CIKM 2025. This is an extended version of the original submission
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[406] arXiv:2509.00387 [pdf, html, other]
Title: Unifying Adversarial Perturbation for Graph Neural Networks
Jinluan Yang, Ruihao Zhang, Zhengyu Chen, Fei Wu, Kun Kuang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[407] arXiv:2509.00362 [pdf, html, other]
Title: Optimized Weight Initialization on the Stiefel Manifold for Deep ReLU Neural Networks
Hyungu Lee, Taehyeong Kim, Hayoung Choi
Comments: 16 pages, 3 figures, 3 tables
Subjects: Machine Learning (cs.LG)
[408] arXiv:2509.00348 [pdf, other]
Title: Theory Foundation of Physics-Enhanced Residual Learning
Shixiao Liang, Wang Chen, Keke Long, Peng Zhang, Xiaopeng Li, Jintao Ke
Comments: 24 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[409] arXiv:2509.00347 [pdf, html, other]
Title: LLM-Driven Policy Diffusion: Enhancing Generalization in Offline Reinforcement Learning
Hanping Zhang, Yuhong Guo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[410] arXiv:2509.00338 [pdf, html, other]
Title: Scalable Option Learning in High-Throughput Environments
Mikael Henaff, Scott Fujimoto, Michael Rabbat
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[411] arXiv:2509.00336 [pdf, html, other]
Title: Are We Really Learning the Score Function? Reinterpreting Diffusion Models Through Wasserstein Gradient Flow Matching
An B. Vuong, Michael T. McCann, Javier E. Santos, Yen Ting Lin
Subjects: Machine Learning (cs.LG)
[412] arXiv:2509.00333 [pdf, html, other]
Title: Counterfactual Risk Minimization with IPS-Weighted BPR and Self-Normalized Evaluation in Recommender Systems
Rahul Raja, Arpita Vats
Comments: Accepted at the Causality, Counterfactuals & Sequential Decision-Making Workshop (CONSEQUENCES) at the ACM Recommender Systems Conference (RecSys 25), Prague, Czech Republic
Subjects: Machine Learning (cs.LG)
[413] arXiv:2509.00326 [pdf, html, other]
Title: Chunked TabPFN: Exact Training-Free In-Context Learning for Long-Context Tabular Data
Renat Sergazinov, Shao-An Yin
Comments: 14 pages, 6 figures
Subjects: Machine Learning (cs.LG)
[414] arXiv:2509.00316 [pdf, html, other]
Title: Continuously Tempered Diffusion Samplers
Ezra Erives, Bowen Jing, Peter Holderrieth, Tommi Jaakkola
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[415] arXiv:2509.00280 [pdf, html, other]
Title: ReLATE: Learning Efficient Sparse Encoding for High-Performance Tensor Decomposition
Ahmed E. Helal, Fabio Checconi, Jan Laukemann, Yongseok Soh, Jesmin Jahan Tithi, Fabrizio Petrini, Jee Choi
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[416] arXiv:2509.00259 [pdf, html, other]
Title: Quantum-Optimized Selective State Space Model for Efficient Time Series Prediction
Stefan-Alexandru Jura, Mihai Udrescu, Alexandru Topirceanu
Subjects: Machine Learning (cs.LG)
[417] arXiv:2509.00221 [pdf, html, other]
Title: Speech Foundation Models Generalize to Time Series Tasks from Wearable Sensor Data
Jaya Narain, Zakaria Aldeneh, Shirley Ren
Comments: Preprint, under review
Subjects: Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[418] arXiv:2509.00217 [pdf, html, other]
Title: Learning to Shard: RL for Co-optimizing the Parallelism Degrees and Per-operator Sharding Dimensions in Distributed LLM Inference
Ruokai Yin, Sattwik Deb Mishra, Xuan Zuo, Hokchhay Tann, Preyas Shah, Apala Guha
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[419] arXiv:2509.00203 [pdf, html, other]
Title: Estimating Parameter Fields in Multi-Physics PDEs from Scarce Measurements
Xuyang Li, Mahdi Masmoudi, Rami Gharbi, Nizar Lajnef, Vishnu Naresh Boddeti
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE)
[420] arXiv:2509.00202 [pdf, html, other]
Title: From TLinFormer to TConstFormer: The Leap to Constant-Time Transformer Attention: Achieving O(1) Computation and O(1) KV Cache during Autoregressive Inference
Zhongpan Tang
Subjects: Machine Learning (cs.LG)
[421] arXiv:2509.00195 [pdf, html, other]
Title: Democratizing Agentic AI with Fast Test-Time Scaling on the Edge
Hao Mark Chen, Zhiwen Mo, Guanxi Lu, Shuang Liang, Lingxiao Ma, Wayne Luk, Hongxiang Fan
Subjects: Machine Learning (cs.LG)
[422] arXiv:2509.00183 [pdf, html, other]
Title: FNODE: Flow-Matching for data-driven simulation of constrained multibody systems
Hongyu Wang, Jingquan Wang, Dan Negrut
Comments: 36 pages, 19 figures
Subjects: Machine Learning (cs.LG)
[423] arXiv:2509.00174 [pdf, html, other]
Title: Principled Approximation Methods for Efficient and Scalable Deep Learning
Pedro Savarese
Comments: PhD thesis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[424] arXiv:2509.00103 [pdf, other]
Title: Pre-trained knowledge elevates large language models beyond traditional chemical reaction optimizers
Robert MacKnight, Jose Emilio Regio, Jeffrey G. Ethier, Luke A. Baldwin, Gabe Gomes
Comments: 19 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Chemical Physics (physics.chem-ph)
[425] arXiv:2509.00102 [pdf, html, other]
Title: Exploiting a Mixture-of-Layers in an Electrocardiography Foundation Model
Phu X. Nguyen, Huy Phan, Hieu Pham, Christos Chatzichristos, Bert Vandenberk, Maarten De Vos
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[426] arXiv:2509.00099 [pdf, html, other]
Title: LLM-QUBO: An End-to-End Framework for Automated QUBO Transformation from Natural Language Problem Descriptions
Huixiang Zhang, Mahzabeen Emu, Salimur Choudhury
Subjects: Machine Learning (cs.LG); Quantum Physics (quant-ph)
[427] arXiv:2509.00097 [pdf, html, other]
Title: Progressive Element-wise Gradient Estimation for Neural Network Quantization
Kaiqi Zhao
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[428] arXiv:2509.00096 [pdf, html, other]
Title: Pruning Weights but Not Truth: Safeguarding Truthfulness While Pruning LLMs
Yao Fu, Runchao Li, Xianxuan Long, Haotian Yu, Xiaotian Han, Yu Yin, Pan Li
Comments: Accepted to EMNLP 2025 Findings (poster)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[429] arXiv:2509.00095 [pdf, html, other]
Title: Financial Decision Making using Reinforcement Learning with Dirichlet Priors and Quantum-Inspired Genetic Optimization
Prasun Nandy, Debjit Dhar, Rik Das
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[430] arXiv:2509.00092 [pdf, other]
Title: Robust Detection of Synthetic Tabular Data under Schema Variability
G. Charbel N. Kindji (MALT), Elisa Fromont (MALT), Lina Maria Rojas-Barahona, Tanguy Urvoy
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[431] arXiv:2509.00089 [pdf, html, other]
Title: Learning from Peers: Collaborative Ensemble Adversarial Training
Li Dengjin, Guo Yanming, Xie Yuxiang, Li Zheng, Chen Jiangming, Li Xiaolong, Lao Mingrui
Subjects: Machine Learning (cs.LG)
[432] arXiv:2509.00087 [pdf, other]
Title: Yet Unnoticed in LSTM: Binary Tree Based Input Reordering, Weight Regularization, and Gate Nonlinearization
Mojtaba Moattari
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[433] arXiv:2509.00086 [pdf, other]
Title: Centralized vs. Federated Learning for Educational Data Mining: A Comparative Study on Student Performance Prediction with SAEB Microdata
Rodrigo Tertulino
Comments: This paper has been prepared for submission to the Brazilian Journal of Informatics in Education (RBIE)
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[434] arXiv:2509.00084 [pdf, html, other]
Title: Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs
Qibin Wang, Pu Zhao, Shaohan Huang, Fangkai Yang, Lu Wang, Furu Wei, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[435] arXiv:2509.00083 [pdf, html, other]
Title: Data Cartography for Detecting Memorization Hotspots and Guiding Data Interventions in Generative Models
Laksh Patel, Neel Shanbhag
Comments: 6 pages, 2 figures, 1 table; Presented at the 42nd International Conference on Machine Learning (ICML), winning the "Best Poster" award at ICML's workshop for data in generative models (DIG-BUGS)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[436] arXiv:2509.00076 [pdf, other]
Title: Experimental Assessment of a Multi-Class AI/ML Architecture for Real-Time Characterization of Cyber Events in a Live Research Reactor
Zachery Dahm, Konstantinos Vasili, Vasileios Theos, Konstantinos Gkouliaras, William Richards, True Miller, Brian Jowers, Stylianos Chatzidakis
Subjects: Machine Learning (cs.LG)
[437] arXiv:2509.00073 [pdf, other]
Title: Mitigating Clinician Information Overload: Generative AI for Integrated EHR and RPM Data Analysis
Ankit Shetgaonkar, Dipen Pradhan, Lakshit Arora, Sanjay Surendranath Girija, Shashank Kapoor, Aman Raj
Comments: Accepted at IEEE COMPSAC 2025
Journal-ref: 2025 IEEE 49th Annual Computers, Software, and Applications Conference (COMPSAC)
Subjects: Machine Learning (cs.LG)
[438] arXiv:2509.00071 [pdf, html, other]
Title: SynCircuit: Automated Generation of New Synthetic RTL Circuits Can Enable Big Data in Circuits
Shang Liu, Jing Wang, Wenji Fang, Zhiyao Xie
Comments: Accepted by DAC'25
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[439] arXiv:2509.00069 [pdf, other]
Title: AnomalyExplainer Explainable AI for LLM-based anomaly detection using BERTViz and Captum
Prasasthy Balasubramanian, Dumindu Kankanamge, Ekaterina Gilman, Mourad Oussalah
Subjects: Machine Learning (cs.LG)
[440] arXiv:2509.00066 [pdf, html, other]
Title: T-MLP: Tailed Multi-Layer Perceptron for Level-of-Detail Signal Representation
Chuanxiang Yang, Yuanfeng Zhou, Guangshun Wei, Siyu Ren, Yuan Liu, Junhui Hou, Wenping Wang
Subjects: Machine Learning (cs.LG); Graphics (cs.GR); Image and Video Processing (eess.IV)
[441] arXiv:2509.00057 [pdf, html, other]
Title: From Data to Decision: A Multi-Stage Framework for Class Imbalance Mitigation in Optical Network Failure Analysis
Yousuf Moiz Ali, Jaroslaw E. Prilepsky, Nicola Sambo, Joao Pedro, Mohammad M. Hosseini, Antonio Napoli, Sergei K. Turitsyn, Pedro Freire
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[442] arXiv:2509.00050 [pdf, html, other]
Title: Applying Deep Learning to Anomaly Detection of Russian Satellite Activity for Indications Prior to Military Activity
David Kurtenbach, Megan Manly, Zach Metzinger
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[443] arXiv:2509.00049 [pdf, html, other]
Title: Adaptive Physics-Informed Neural Networks with Multi-Category Feature Engineering for Hydrogen Sorption Prediction in Clays, Shales, and Coals
Mohammad Nooraiepour, Mohammad Masoudi, Zezhang Song, Helge Hellevang
Subjects: Machine Learning (cs.LG)
[444] arXiv:2509.00047 [pdf, html, other]
Title: Teaching AI to Remember: Insights from Brain-Inspired Replay in Continual Learning
Jina Kim
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[445] arXiv:2509.00046 [pdf, other]
Title: Exploring and Reshaping the Weight Distribution in LLM
Chunming Ye, Songzhou Li, Xu Xu
Comments: 19 pages, 16 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[446] arXiv:2509.00036 [pdf, html, other]
Title: A-FloPS: Accelerating Diffusion Sampling with Adaptive Flow Path Sampler
Cheng Jin, Zhenyu Xiao, Yuantao Gu
Comments: 14 pages, 9 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[447] arXiv:2509.00035 [pdf, html, other]
Title: Transfer Learning for Minimum Operating Voltage Prediction in Advanced Technology Nodes: Leveraging Legacy Data and Silicon Odometer Sensing
Yuxuan Yin, Rebecca Chen, Boxun Xu, Chen He, Peng Li
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[448] arXiv:2509.00034 [pdf, other]
Title: Industrial Steel Slag Flow Data Loading Method for Deep Learning Applications
Mert Sehri, Ana Cardoso, Francisco de Assis Boldt, Patrick Dumond
Subjects: Machine Learning (cs.LG)
[449] arXiv:2509.00031 [pdf, html, other]
Title: ZeroQAT: Your Quantization-aware Training but Efficient
Qitao Tan, Xiaoying Song, Jin Lu, Guoming Li, Jun Liu, Lingzi Hong, Caiwen Ding, Jundong Li, Xiaoming Zhai, Shaoyi Huang, Wei Niu, Geng Yuan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[450] arXiv:2509.00027 [pdf, other]
Title: Mitigating Data Exfiltration Attacks through Layer-Wise Learning Rate Decay Fine-Tuning
Elie Thellier (EPIONE), Huiyu Li (EPIONE), Nicholas Ayache (EPIONE), Hervé Delingette (EPIONE)
Journal-ref: 6th MICCAI Workshop on "Distributed, Collaborative and Federated Learning", Sep 2025, Daejeon, South Korea
Subjects: Machine Learning (cs.LG)
[451] arXiv:2509.00026 [pdf, html, other]
Title: Diagnosing Psychiatric Patients: Can Large Language and Machine Learning Models Perform Effectively in Emergency Cases?
Abu Shad Ahammed, Sayeri Mukherjee, Roman Obermaisser
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[452] arXiv:2509.02551 (cross-list from cs.NI) [pdf, html, other]
Title: On Transferring, Merging, and Splitting Task-Oriented Network Digital Twins
Zifan Zhang, Minghong Fang, Mingzhe Chen, Yuchen Liu
Comments: Accepted by IEEE MobiWac 2025
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG)
[453] arXiv:2509.02535 (cross-list from stat.ML) [pdf, html, other]
Title: Probabilities of Causation and Root Cause Analysis with Quasi-Markovian Models
Eduardo Rocha Laurentino, Fabio Gagliardi Cozman, Denis Deratani Maua, Daniel Angelo Esteves Lawand, Davi Goncalves Bezerra Coelho, Lucas Martins Marques
Comments: Accepted at the 35th Brazilian Conference on Intelligent Systems (BRACIS 2025)
Journal-ref: Springer Proceedings, 2025
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[454] arXiv:2509.02534 (cross-list from cs.CL) [pdf, html, other]
Title: Jointly Reinforcing Diversity and Quality in Language Model Generations
Tianjian Li, Yiming Zhang, Ping Yu, Swarnadeep Saha, Daniel Khashabi, Jason Weston, Jack Lanchantin, Tianlu Wang
Comments: 29 pages, 11 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[455] arXiv:2509.02523 (cross-list from cs.CL) [pdf, html, other]
Title: Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices
Evan King, Adam Sabra, Manjunath Kudlur, James Wang, Pete Warden
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[456] arXiv:2509.02522 (cross-list from cs.CL) [pdf, html, other]
Title: Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR
Jiaming Li, Longze Chen, Ze Gong, Yukun Chen, Lu Wang, Wanwei He, Run Luo, Min Yang
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[457] arXiv:2509.02514 (cross-list from cs.CL) [pdf, html, other]
Title: Comparative Study of Pre-Trained BERT and Large Language Models for Code-Mixed Named Entity Recognition
Mayur Shirke, Amey Shembade, Pavan Thorat, Madhushri Wagh, Raviraj Joshi
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[458] arXiv:2509.02503 (cross-list from cs.CL) [pdf, html, other]
Title: L3Cube-IndicHeadline-ID: A Dataset for Headline Identification and Semantic Evaluation in Low-Resource Indian Languages
Nishant Tanksale, Tanmay Kokate, Darshan Gohad, Sarvadnyaa Barate, Raviraj Joshi
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[459] arXiv:2509.02492 (cross-list from cs.CL) [pdf, other]
Title: GRAM-R$^2$: Self-Training Generative Foundation Reward Models for Reward Reasoning
Chenglong Wang, Yongyu Mu, Hang Zhou, Yifu Huo, Ziming Zhu, Jiali Zeng, Murun Yang, Bei Li, Tong Xiao, Xiaoyang Hao, Chunliang Zhang, Fandong Meng, Jingbo Zhu
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[460] arXiv:2509.02488 (cross-list from cs.CV) [pdf, html, other]
Title: Anisotropic Fourier Features for Positional Encoding in Medical Imaging
Nabil Jabareen, Dongsheng Yuan, Dingming Liu, Foo-Wei Ten, Sören Lukassen
Comments: 13 pages, 3 figures, 2 tables, to be published in ShapeMI MICCAI 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[461] arXiv:2509.02480 (cross-list from cs.DC) [pdf, html, other]
Title: MLP-Offload: Multi-Level, Multi-Path Offloading for LLM Pre-training to Break the GPU Memory Wall
Avinash Maurya, M. Mustafa Rafique, Franck Cappello, Bogdan Nicolae
Comments: SC'25: The International Conference for High Performance Computing, Networking, Storage and Analysis
Journal-ref: SC'25: The International Conference for High Performance Computing, Networking, Storage and Analysis, 2025
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[462] arXiv:2509.02476 (cross-list from stat.ML) [pdf, html, other]
Title: Wild Refitting for Model-Free Excess Risk Evaluation of Opaque ML/AI Models under Bregman Loss
Haichen Hu, David Simchi-Levi
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[463] arXiv:2509.02474 (cross-list from cs.GR) [pdf, html, other]
Title: Unifi3D: A Study on 3D Representations for Generation and Reconstruction in a Common Framework
Nina Wiedemann, Sainan Liu, Quentin Leboutet, Katelyn Gao, Benjamin Ummenhofer, Michael Paulitsch, Kai Yuan
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[464] arXiv:2509.02471 (cross-list from cs.SD) [pdf, html, other]
Title: ESTM: An Enhanced Dual-Branch Spectral-Temporal Mamba for Anomalous Sound Detection
Chengyuan Ma, Peng Jia, Hongyue Guo, Wenming Yang
Comments: Accepted in IEEE Signal Processing Letters 2025
Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[465] arXiv:2509.02452 (cross-list from cs.CL) [pdf, html, other]
Title: Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions
Seyedali Mohammadi, Bhaskara Hanuma Vedula, Hemank Lamba, Edward Raff, Ponnurangam Kumaraguru, Francis Ferraro, Manas Gaur
Comments: To appear in EMNLP 2025, Main Conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[466] arXiv:2509.02450 (cross-list from cs.CL) [pdf, html, other]
Title: EmoPerso: Enhancing Personality Detection with Self-Supervised Emotion-Aware Modelling
Lingzhi Shen, Xiaohao Cai, Yunfei Long, Imran Razzak, Guanming Chen, Shoaib Jameel
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[467] arXiv:2509.02446 (cross-list from cs.CL) [pdf, html, other]
Title: An Ensemble Classification Approach in A Multi-Layered Large Language Model Framework for Disease Prediction
Ali Hamdi, Malak Mohamed, Rokaia Emad, Khaled Shaban
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[468] arXiv:2509.02351 (cross-list from cs.CV) [pdf, html, other]
Title: Ordinal Adaptive Correction: A Data-Centric Approach to Ordinal Image Classification with Noisy Labels
Alireza Sedighi Moghaddam, Mohammad Reza Mohammadi
Comments: 10 pages, 5 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[469] arXiv:2509.02349 (cross-list from cs.SD) [pdf, html, other]
Title: AudioCodecBench: A Comprehensive Benchmark for Audio Codec Evaluation
Lu Wang, Hao Chen, Siyu Wu, Zhiyue Wu, Hao Zhou, Chengfeng Zhang, Ting Wang, Haodi Zhang
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[470] arXiv:2509.02337 (cross-list from stat.ML) [pdf, html, other]
Title: Distribution estimation via Flow Matching with Lipschitz guarantees
Lea Kunkel
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[471] arXiv:2509.02333 (cross-list from cs.CL) [pdf, html, other]
Title: DCPO: Dynamic Clipping Policy Optimization
Shihui Yang, Chengfeng Dou, Peidong Guo, Kai Lu, Qiang Ju, Fei Deng, Rihui Xin
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[472] arXiv:2509.02327 (cross-list from stat.ML) [pdf, other]
Title: Variational Uncertainty Decomposition for In-Context Learning
I. Shavindra Jayasekera, Jacob Si, Filippo Valdettaro, Wenlong Chen, A. Aldo Faisal, Yingzhen Li
Comments: Fixing author order; typo p.20
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[473] arXiv:2509.02259 (cross-list from cs.SD) [pdf, html, other]
Title: Speech transformer models for extracting information from baby cries
Guillem Bonafos, Jéremy Rouch, Lény Lego, David Reby, Hugues Patural, Nicolas Mathevon, Rémy Emonet
Comments: Accepted to WOCCI 2025 (Interspeech 2025 workshop)
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Applications (stat.AP)
[474] arXiv:2509.02237 (cross-list from cs.CE) [pdf, other]
Title: Autoencoder-based non-intrusive model order reduction in continuum mechanics
Jannick Kehls, Ellen Kuhl, Tim Brepols, Kevin Linka, Hagen Holthusen
Subjects: Computational Engineering, Finance, and Science (cs.CE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[475] arXiv:2509.02192 (cross-list from eess.SY) [pdf, html, other]
Title: Selection of Optimal Number and Location of PMUs for CNN Based Fault Location and Identification
Khalid Daud Khattak, Muhammad A. Choudhry
Comments: Paper submitted to 57th North American Power Symposium (NAPS) 2025
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)