Artificial Intelligence

Authors and titles for March 2025

Total of 3476 entries
[3001] arXiv:2503.19753 (cross-list from cs.GR) [pdf, other]
Title: A Survey on Event-driven 3D Reconstruction: Development under Different Categories
Chuanzhi Xu, Haoxian Zhou, Haodong Chen, Vera Chung, Qiang Qu
Comments: We have decided not to submit this article and plan to withdraw it from public display. The content of this article will be presented in a more comprehensive form in another work
Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3002] arXiv:2503.19786 (cross-list from cs.CL) [pdf, html, other]
Title: Gemma 3 Technical Report
Gemma Team: Aishwarya Kamath, Johan Ferret, Shreya Pathak, Nino Vieillard, Ramona Merhej, Sarah Perrin, Tatiana Matejovicova, Alexandre Ramé, Morgane Rivière, Louis Rouillard, Thomas Mesnard, Geoffrey Cideron, Jean-bastien Grill, Sabela Ramos, Edouard Yvinec, Michelle Casbon, Etienne Pot, Ivo Penchev, Gaël Liu, Francesco Visin, Kathleen Kenealy, Lucas Beyer, Xiaohai Zhai, Anton Tsitsulin, Robert Busa-Fekete, Alex Feng, Noveen Sachdeva, Benjamin Coleman, Yi Gao, Basil Mustafa, Iain Barr, Emilio Parisotto, David Tian, Matan Eyal, Colin Cherry, Jan-Thorsten Peter, Danila Sinopalnikov, Surya Bhupatiraju, Rishabh Agarwal, Mehran Kazemi, Dan Malkin, Ravin Kumar, David Vilar, Idan Brusilovsky, Jiaming Luo, Andreas Steiner, Abe Friesen, Abhanshu Sharma, Abheesht Sharma, Adi Mayrav Gilady, Adrian Goedeckemeyer, Alaa Saade, Alex Feng, Alexander Kolesnikov, Alexei Bendebury, Alvin Abdagic, Amit Vadi, András György, André Susano Pinto, Anil Das, Ankur Bapna, Antoine Miech, Antoine Yang, Antonia Paterson, Ashish Shenoy, Ayan Chakrabarti, Bilal Piot, Bo Wu, Bobak Shahriari, Bryce Petrini, Charlie Chen, Charline Le Lan, Christopher A. Choquette-Choo, CJ Carey, Cormac Brick, Daniel Deutsch, Danielle Eisenbud, Dee Cattle, Derek Cheng, Dimitris Paparas, Divyashree Shivakumar Sreepathihalli, Doug Reid, Dustin Tran, Dustin Zelle, Eric Noland, Erwin Huizenga, Eugene Kharitonov, Frederick Liu, Gagik Amirkhanyan, Glenn Cameron, Hadi Hashemi, Hanna Klimczak-Plucińska, Harman Singh, Harsh Mehta, Harshal Tushar Lehri, Hussein Hazimeh, Ian Ballantyne, Idan Szpektor, Ivan Nardini
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3003] arXiv:2503.19794 (cross-list from cs.CV) [pdf, html, other]
Title: PAVE: Patching and Adapting Video Large Language Models
Zhuoming Liu, Yiquan Li, Khoi Duc Nguyen, Yiwu Zhong, Yin Li
Comments: CVPR2025 Camera Ready
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3004] arXiv:2503.19801 (cross-list from cs.CV) [pdf, html, other]
Title: SeLIP: Similarity Enhanced Contrastive Language Image Pretraining for Multi-modal Head MRI
Zhiyang Liu, Dong Yang, Minghao Zhang, Hanyu Sun, Hong Wu, Huiying Wang, Wen Shen, Chao Chai, Shuang Xia
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3005] arXiv:2503.19804 (cross-list from cs.CV) [pdf, html, other]
Title: LENVIZ: A High-Resolution Low-Exposure Night Vision Benchmark Dataset
Manjushree Aithal, Rosaura G. VidalMata, Manikandtan Kartha, Gong Chen, Eashan Adhikarla, Lucas N. Kirsten, Zhicheng Fu, Nikhil A. Madhusudhana, Joe Nasti
Comments: Dataset will be released upon publication
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3006] arXiv:2503.19817 (cross-list from cs.CR) [pdf, html, other]
Title: Bitstream Collisions in Neural Image Compression via Adversarial Perturbations
Jordan Madden, Lhamo Dorje, Xiaohua Li
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
[3007] arXiv:2503.19823 (cross-list from q-bio.NC) [pdf, html, other]
Title: GyralNet Subnetwork Partitioning via Differentiable Spectral Modularity Optimization
Yan Zhuang, Minheng Chen, Chao Cao, Tong Chen, Jing Zhang, Xiaowei Yu, Yanjun Lyu, Lu Zhang, Tianming Liu, Dajiang Zhu
Comments: 10 pages, 3 figures
Subjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3008] arXiv:2503.19844 (cross-list from cs.CL) [pdf, html, other]
Title: A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950
Zhao Fang, Liang-Chun Wu, Xuening Kong, Spencer Dean Stewart
Comments: Accepted to NLP4DH 2025 at NAACL 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3009] arXiv:2503.19848 (cross-list from cs.DL) [pdf, html, other]
Title: Guarding against artificial intelligence–hallucinated citations: the case for full-text reference deposit
Alex Glynn
Comments: 3 pages
Journal-ref: Glynn A. Guarding against artificial intelligence–hallucinated citations: The case for full-text reference deposit. Eur Sci Ed. 2025;51:e153973
Subjects: Digital Libraries (cs.DL); Artificial Intelligence (cs.AI)
[3010] arXiv:2503.19867 (cross-list from cs.LG) [pdf, html, other]
Title: Geometric Meta-Learning via Coupled Ricci Flow: Unifying Knowledge Representation and Quantum Entanglement
Ming Lei, Christophe Baehr
Comments: 9 pages, submitted to IEEE PAMI
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Signal Processing (eess.SP); Geometric Topology (math.GT); Quantum Physics (quant-ph)
[3011] arXiv:2503.19868 (cross-list from cs.IR) [pdf, html, other]
Title: GENIUS: A Generative Framework for Universal Multimodal Search
Sungyeon Kim, Xinliang Zhu, Xiaofan Lin, Muhammet Bastan, Douglas Gray, Suha Kwak
Comments: Accepted to CVPR 2025
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[3012] arXiv:2503.19885 (cross-list from cs.NE) [pdf, html, other]
Title: Dynamics of Structured Complex-Valued Hopfield Neural Networks
Rama Murthy Garimella, Marcos Eduardo Valle, Guilherme Vieira, Anil Rayala, Dileep Munugoti
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI)
[3013] arXiv:2503.19887 (cross-list from cs.CY) [pdf, html, other]
Title: AI threats to national security can be countered through an incident regime
Alejandro Ortega
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI)
[3014] arXiv:2503.19900 (cross-list from cs.CV) [pdf, html, other]
Title: CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning
Hao Yu, Zhuokai Zhao, Shen Yan, Lukasz Korycki, Jianyu Wang, Baosheng He, Jiayi Liu, Lizhu Zhang, Xiangjun Fan, Hanchao Yu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[3015] arXiv:2503.19933 (cross-list from econ.GN) [pdf, other]
Title: Role of AI Innovation, Clean Energy and Digital Economy towards Net Zero Emission in the United States: An ARDL Approach
Adita Sultana, Abdullah Al Abrar Chowdhury, Azizul Hakim Rafi, Abdulla All Noman
Comments: 24 pages, 8 tables, 1 figure
Journal-ref: Journal of Environmental and Energy Economics, 2025
Subjects: General Economics (econ.GN); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[3016] arXiv:2503.19937 (cross-list from cs.CV) [pdf, html, other]
Title: Reverse Prompt: Cracking the Recipe Inside Text-to-Image Generation
Zhiyao Ren, Yibing Zhan, Baosheng Yu, Dacheng Tao
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3017] arXiv:2503.19940 (cross-list from physics.ao-ph) [pdf, html, other]
Title: FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling
Qiusheng Huang, Xiaohui Zhong, Xu Fan, Lei Chen, Hao Li
Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3018] arXiv:2503.19941 (cross-list from cs.RO) [pdf, html, other]
Title: Body Discovery of Embodied AI
Zhe Sun, Pengfei Tian, Xiaozhu Hu, Xiaoyu Zhao, Huiying Li, Zhenliang Zhang
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[3019] arXiv:2503.19943 (cross-list from eess.IV) [pdf, other]
Title: A Spatiotemporal Radar-Based Precipitation Model for Water Level Prediction and Flood Forecasting
Sakshi Dhankhar, Stefan Wittek, Hamidreza Eivazi, Andreas Rausch
Comments: 28 pages, 11 figures, 6 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3020] arXiv:2503.19945 (cross-list from eess.IV) [pdf, html, other]
Title: Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification
Daniel G. P. Petrini, Hae Yong Kim
Comments: 8 pages
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3021] arXiv:2503.19947 (cross-list from cs.CV) [pdf, html, other]
Title: Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders
Paul Koch, Jörg Krüger, Ankit Chowdhury, Oliver Heimann
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3022] arXiv:2503.19948 (cross-list from cs.CV) [pdf, other]
Title: Test-Time Reasoning Through Visual Human Preferences with VLMs and Soft Rewards
Alexander Gambashidze, Konstantin Sobolev, Andrey Kuznetsov, Ivan Oseledets
Comments: We are withdrawing this paper because the main contributions and methodology have significantly changed after further research and experimental updates. The current version no longer reflects our results and main contribution / topic
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3023] arXiv:2503.19950 (cross-list from cs.LG) [pdf, html, other]
Title: LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation
Han Chen, Zicong Jiang, Zining Zhang, Bingsheng He, Pingyi Luo, Mian Lu, Yuqiang Chen
Comments: Accepted by ICLR 2025 Workshop on Sparsity in LLMs (SLLM)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[3024] arXiv:2503.19951 (cross-list from cs.CV) [pdf, html, other]
Title: ACVUBench: Audio-Centric Video Understanding Benchmark
Yudong Yang, Jimin Zhuang, Guangzhi Sun, Changli Tang, Yixuan Li, Peihan Li, Yifan Jiang, Wei Li, Zejun Ma, Chao Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3025] arXiv:2503.19988 (cross-list from cs.LG) [pdf, html, other]
Title: ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback
Bohan Zhai, Canwen Xu, Yuxiong He, Zhewei Yao
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB)
[3026] arXiv:2503.20018 (cross-list from cs.LG) [pdf, html, other]
Title: Experience Replay Addresses Loss of Plasticity in Continual Learning
Jiuqi Wang, Rohan Chandra, Shangtong Zhang
Comments: 14 pages, 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[3027] arXiv:2503.20036 (cross-list from cs.SE) [pdf, html, other]
Title: BugCraft: End-to-End Crash Bug Reproduction Using LLM Agents in Minecraft
Eray Yapağcı, Yavuz Alp Sencer Öztürk, Eray Tüzün
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
[3028] arXiv:2503.20074 (cross-list from cs.PF) [pdf, html, other]
Title: Adaptive Orchestration for Large-Scale Inference on Heterogeneous Accelerator Systems Balancing Cost, Performance, and Resilience
Yahav Biran, Imry Kissos
Comments: 14 pages, 7 figures
Subjects: Performance (cs.PF); Artificial Intelligence (cs.AI)
[3029] arXiv:2503.20078 (cross-list from cs.LG) [pdf, other]
Title: Abstracting Geo-specific Terrains to Scale Up Reinforcement Learning
Volkan Ustun, Soham Hans, Rajay Kumar, Yunzhe Wang
Comments: 10 pages, 6 figures, 2024 Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[3030] arXiv:2503.20084 (cross-list from cs.CV) [pdf, html, other]
Title: Can Multi-modal (reasoning) LLMs work as deepfake detectors?
Simiao Ren, Yao Yao, Kidus Zewde, Zisheng Liang, Tsang (Dennis) Ng, Ning-Yau Cheng, Xiaoou Zhan, Qinzhe Liu, Yifei Chen, Hengwei Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3031] arXiv:2503.20099 (cross-list from cs.CY) [pdf, other]
Title: AI Identity, Empowerment, and Mindfulness in Mitigating Unethical AI Use
Mayssam Tarighi Shaayesteh, Sara Memarian Esfahani, Hossein Mohit
Comments: We have identified a critical methodological error in the data analysis section, which substantially impacts the validity of the main results and conclusions. We are currently revising the analysis and intend to submit a corrected version in the future
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI)
[3032] arXiv:2503.20110 (cross-list from cs.CL) [pdf, html, other]
Title: Efficient Model Development through Fine-tuning Transfer
Pin-Jie Lin, Rishab Balasubramanian, Fengyuan Liu, Nikhil Kandpal, Tu Vu
Comments: 21 pages, 4 figures, 13 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3033] arXiv:2503.20118 (cross-list from cs.GR) [pdf, html, other]
Title: Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors
Yuke Lou, Yiming Wang, Zhen Wu, Rui Zhao, Wenjia Wang, Mingyi Shi, Taku Komura
Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3034] arXiv:2503.20126 (cross-list from cs.SE) [pdf, html, other]
Title: Can We Make Code Green? Understanding Trade-Offs in LLMs vs. Human Code Optimizations
Pooja Rani, Jan-Andrea Bard, June Sallou, Alexander Boll, Timo Kehrer, Alberto Bacchelli
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Performance (cs.PF)
[3035] arXiv:2503.20138 (cross-list from cs.LG) [pdf, html, other]
Title: Unlocking the Value of Decentralized Data: A Federated Dual Learning Approach for Model Aggregation
Junyi Zhu, Ruicong Yao, Taha Ceritli, Savas Ozkan, Matthew B. Blaschko, Eunchung Noh, Jeongwon Min, Cho Jung Min, Mete Ozay
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3036] arXiv:2503.20139 (cross-list from cs.LG) [pdf, html, other]
Title: Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu, Xin Liu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3037] arXiv:2503.20176 (cross-list from cs.LG) [pdf, html, other]
Title: Offline Reinforcement Learning with Discrete Diffusion Skills
RuiXi Qiao, Jie Cheng, Xingyuan Dai, Yonglin Tian, Yisheng Lv
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[3038] arXiv:2503.20182 (cross-list from cs.CL) [pdf, other]
Title: Leveraging Implicit Sentiments: Enhancing Reliability and Validity in Psychological Trait Evaluation of LLMs
Huanhuan Ma, Haisong Gong, Xiaoyuan Yi, Xing Xie, Dongkuan Xu
Comments: Code available via this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3039] arXiv:2503.20199 (cross-list from cs.CV) [pdf, html, other]
Title: Assessing SAM for Tree Crown Instance Segmentation from Drone Imagery
Mélisande Teng, Arthur Ouaknine, Etienne Laliberté, Yoshua Bengio, David Rolnick, Hugo Larochelle
Comments: ICLR 2025 ML4RS workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3040] arXiv:2503.20202 (cross-list from cs.CL) [pdf, html, other]
Title: SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain
Nan Gao, Yihua Bao, Dongdong Weng, Jiayi Zhao, Jia Li, Yan Zhou, Pengfei Wan, Di Zhang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
[3041] arXiv:2503.20205 (cross-list from cs.LG) [pdf, html, other]
Title: Generalized Phase Pressure Control Enhanced Reinforcement Learning for Traffic Signal Control
Xiao-Cheng Liao, Yi Mei, Mengjie Zhang, Xiang-Ling Chen
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3042] arXiv:2503.20208 (cross-list from cs.RO) [pdf, html, other]
Title: Learning Adaptive Dexterous Grasping from Single Demonstrations
Liangzhi Shi, Yulin Liu, Lingqi Zeng, Bo Ai, Zhengdong Hong, Hao Su
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3043] arXiv:2503.20227 (cross-list from cs.CL) [pdf, html, other]
Title: Advancements in Natural Language Processing: Exploring Transformer-Based Architectures for Text Understanding
Tianhao Wu, Yu Wang, Ngoc Quach
Comments: This paper has been accepted by the 5th International Conference on Artificial Intelligence and Industrial Technology Applications (AIITA 2025)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3044] arXiv:2503.20230 (cross-list from cs.CV) [pdf, html, other]
Title: TraNCE: Transformative Non-linear Concept Explainer for CNNs
Ugochukwu Ejike Akpudo, Yongsheng Gao, Jun Zhou, Andrew Lewis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3045] arXiv:2503.20231 (cross-list from physics.soc-ph) [pdf, html, other]
Title: Dynamics of Algorithmic Content Amplification on TikTok
Fabian Baumann, Nipun Arora, Iyad Rahwan, Agnieszka Czaplicka
Comments: 34 pages
Subjects: Physics and Society (physics.soc-ph); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[3046] arXiv:2503.20233 (cross-list from cs.SI) [pdf, html, other]
Title: Dynamic Learning and Productivity for Data Analysts: A Bayesian Hidden Markov Model Perspective
Yue Yin
Comments: 29 pages; a shorter 11-page version is accepted by HCI International (HCII) 2025
Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Human-Computer Interaction (cs.HC)
[3047] arXiv:2503.20241 (cross-list from cs.RO) [pdf, html, other]
Title: LGR: LLM-Guided Ranking of Frontiers for Object Goal Navigation
Mitsuaki Uno, Kanji Tanaka, Daiki Iwata, Yudai Noda, Shoya Miyazaki, Kouki Terashima
Comments: 10 pages, 11 figures, technical report
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI)
[3048] arXiv:2503.20245 (cross-list from cs.AR) [pdf, html, other]
Title: ESSR: An 8K@30FPS Super-Resolution Accelerator With Edge Selective Network
Chih-Chia Hsu, Tian-Sheuan Chang
Journal-ref: in IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 71, no. 4, pp. 1693-1705, April 2024
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[3049] arXiv:2503.20252 (cross-list from cs.CV) [pdf, html, other]
Title: LogicQA: Logical Anomaly Detection with Vision Language Model Generated Questions
Yejin Kwon, Daeun Moon, Youngje Oh, Hyunsoo Yoon
Comments: Accepted Industry Track at ACL 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3050] arXiv:2503.20258 (cross-list from cs.CV) [pdf, html, other]
Title: Mamba-3D as Masked Autoencoders for Accurate and Data-Efficient Analysis of Medical Ultrasound Videos
Jiaheng Zhou, Yanfeng Zhou, Wei Fang, Yuxing Tang, Le Lu, Ge Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3051] arXiv:2503.20279 (cross-list from cs.CL) [pdf, html, other]
Title: sudo rm -rf agentic_security
Sejin Lee, Jian Kim, Haon Park, Ashkan Yousefpour, Sangyoon Yu, Min Song
Comments: Accepted ACL 2025 Industry track
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[3052] arXiv:2503.20281 (cross-list from cs.CR) [pdf, html, other]
Title: Are We There Yet? Unraveling the State-of-the-Art Graph Network Intrusion Detection Systems
Chenglong Wang, Pujia Zheng, Jiaping Gui, Cunqing Hua, Wajih Ul Hassan
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
[3053] arXiv:2503.20282 (cross-list from cs.CV) [pdf, html, other]
Title: Faster Parameter-Efficient Tuning with Token Redundancy Reduction
Kwonyoung Kim, Jungin Park, Jin Kim, Hyeongjun Kwon, Kwanghoon Sohn
Comments: CVPR 2025 Camera-ready
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3054] arXiv:2503.20285 (cross-list from cs.LG) [pdf, html, other]
Title: Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
Hongye Cao, Fan Feng, Jing Huo, Shangdong Yang, Meng Fang, Tianpei Yang, Yang Gao
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3055] arXiv:2503.20290 (cross-list from eess.AS) [pdf, html, other]
Title: QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions
Siyin Wang, Wenyi Yu, Xianzhao Chen, Xiaohai Tian, Jun Zhang, Lu Lu, Yu Tsao, Junichi Yamagishi, Yuxuan Wang, Chao Zhang
Comments: 22 pages, 10 figures
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Sound (cs.SD)
[3056] arXiv:2503.20291 (cross-list from cs.CV) [pdf, html, other]
Title: CryoSAMU: Enhancing 3D Cryo-EM Density Maps of Protein Structures at Intermediate Resolution with Structure-Aware Multimodal U-Nets
Chenwei Zhang, Khanh Dao Duc
Comments: 19 pages, 6 main figures, 2 supplementary figures, 3 main tables, 4 supplementary tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[3057] arXiv:2503.20294 (cross-list from cs.CV) [pdf, html, other]
Title: Context-Aware Weakly Supervised Image Manipulation Localization with SAM Refinement
Xinghao Wang, Tao Gong, Qi Chu, Bin Liu, Nenghai Yu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3058] arXiv:2503.20302 (cross-list from cs.CL) [pdf, html, other]
Title: A Multilingual, Culture-First Approach to Addressing Misgendering in LLM Applications
Sunayana Sitaram, Adrian de Wynter, Isobel McCrum, Qilong Gu, Si-Qing Chen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3059] arXiv:2503.20320 (cross-list from cs.CL) [pdf, other]
Title: Iterative Prompting with Persuasion Skills in Jailbreaking Large Language Models
Shih-Wen Ke, Guan-Yu Lai, Guo-Lin Fang, Hsi-Yuan Kao
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET)
[3060] arXiv:2503.20341 (cross-list from cs.LG) [pdf, html, other]
Title: Wasserstein Distributionally Robust Bayesian Optimization with Continuous Context
Francesco Micheli, Efe C. Balta, Anastasios Tsiamis, John Lygeros
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[3061] arXiv:2503.20348 (cross-list from cs.CV) [pdf, html, other]
Title: VideoGEM: Training-free Action Grounding in Videos
Felix Vogel, Walid Bousselham, Anna Kukleva, Nina Shvetsova, Hilde Kuehne
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[3062] arXiv:2503.20384 (cross-list from cs.RO) [pdf, html, other]
Title: MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation
Rongyu Zhang, Menghang Dong, Yuan Zhang, Liang Heng, Xiaowei Chi, Gaole Dai, Li Du, Yuan Du, Shanghang Zhang
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI)
[3063] arXiv:2503.20394 (cross-list from cs.LG) [pdf, html, other]
Title: FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies
Tianqi He, Xiaohan Huang, Yi Du, Qingqing Long, Ziyue Qiao, Min Wu, Yanjie Fu, Yuanchun Zhou, Meng Xiao
Comments: 14 pages, Accepted by ICDE 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3064] arXiv:2503.20398 (cross-list from cs.LG) [pdf, html, other]
Title: Including local feature interactions in deep non-negative matrix factorization networks improves performance
Mahbod Nouri, David Rotermund, Alberto Garcia-Ortiz, Klaus R. Pawelzik
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3065] arXiv:2503.20428 (cross-list from cs.CV) [pdf, html, other]
Title: Evaluating Facial Expression Recognition Datasets for Deep Learning: A Benchmark Study with Novel Similarity Metrics
F. Xavier Gaya-Morey, Cristina Manresa-Yee, Célia Martinie, Jose M. Buades-Rubio
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3066] arXiv:2503.20446 (cross-list from eess.IV) [pdf, html, other]
Title: Attention Xception UNet (AXUNet): A Novel Combination of CNN and Self-Attention for Brain Tumor Segmentation
Farzan Moodi, Fereshteh Khodadadi Shoushtari, Gelareh Valizadeh, Dornaz Mazinani, Hanieh Mobarak Salari, Hamidreza Saligheh Rad
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3067] arXiv:2503.20472 (cross-list from cs.CV) [pdf, html, other]
Title: From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment
Yucheng Suo, Fan Ma, Linchao Zhu, Tianyi Wang, Fengyun Rao, Yi Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3068] arXiv:2503.20479 (cross-list from physics.app-ph) [pdf, html, other]
Title: A multi-agentic framework for real-time, autonomous freeform metasurface design
Robert Lupoiu, Yixuan Shao, Tianxiang Dai, Chenkai Mao, Kofi Edee, Jonathan A. Fan
Comments: 32 pages, 5 figures
Subjects: Applied Physics (physics.app-ph); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Computational Physics (physics.comp-ph)
[3069] arXiv:2503.20484 (cross-list from cs.CV) [pdf, html, other]
Title: Contrastive Learning Guided Latent Diffusion Model for Image-to-Image Translation
Qi Si, Bo Wang, Zhao Zhang
Comments: 11 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3070] arXiv:2503.20485 (cross-list from eess.IV) [pdf, html, other]
Title: Underwater Image Enhancement by Convolutional Spiking Neural Networks
Vidya Sudevan, Fakhreddine Zayer, Rizwana Kausar, Sajid Javed, Hamad Karki, Giulia De Masi, Jorge Dias
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Performance (cs.PF)
[3071] arXiv:2503.20492 (cross-list from cs.CV) [pdf, html, other]
Title: Towards Efficient and General-Purpose Few-Shot Misclassification Detection for Vision-Language Models
Fanhu Zeng, Zhen Cheng, Fei Zhu, Xu-Yao Zhang
Comments: preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3072] arXiv:2503.20500 (cross-list from eess.SP) [pdf, html, other]
Title: Novel Deep Neural OFDM Receiver Architectures for LLR Estimation
Erhan Karakoca, Hüseyin Çevik, İbrahim Hökelek, Ali Görçin
Comments: Submitted to IEEE Globecom 2025
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI)
[3073] arXiv:2503.20523 (cross-list from cs.CV) [pdf, html, other]
Title: GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving
Lloyd Russell, Anthony Hu, Lorenzo Bertoni, George Fedoseev, Jamie Shotton, Elahe Arani, Gianluca Corrado
Comments: Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[3074] arXiv:2503.20527 (cross-list from cs.CL) [pdf, html, other]
Title: StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs
Zhicheng Guo, Sijie Cheng, Yuchen Niu, Hao Wang, Sicheng Zhou, Wenbing Huang, Yang Liu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3075] arXiv:2503.20607 (cross-list from quant-ph) [pdf, html, other]
Title: A decision-theoretic approach to dealing with uncertainty in quantum mechanics
Keano De Vos, Gert de Cooman, Alexander Erreygers, Jasper De Bock
Comments: 52 pages
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Probability (math.PR)
[3076] arXiv:2503.20613 (cross-list from cs.LG) [pdf, html, other]
Title: State-Aware Perturbation Optimization for Robust Deep Reinforcement Learning
Zongyuan Zhang, Tianyang Duan, Zheng Lin, Dong Huang, Zihan Fang, Zekai Sun, Ling Xiong, Hongbin Liang, Heming Cui, Yong Cui
Comments: 15 pages, 11 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Networking and Internet Architecture (cs.NI); Systems and Control (eess.SY)
[3077] arXiv:2503.20623 (cross-list from cs.CL) [pdf, html, other]
Title: Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions
Alessandro Maisto
Comments: 17 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3078] arXiv:2503.20630 (cross-list from cs.LG) [pdf, html, other]
Title: $β$-GNN: A Robust Ensemble Approach Against Graph Structure Perturbation
Haci Ismail Aslan, Philipp Wiesner, Ping Xiong, Odej Kao
Comments: This is the author's version of the paper accepted at EuroMLSys 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3079] arXiv:2503.20648 (cross-list from cs.CL) [pdf, html, other]
Title: TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes
Raj Sanjay Shah, Lei Xu, Qianchu Liu, Jon Burnsky, Drew Bertagnolli, Chaitanya Shivade
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3080] arXiv:2503.20654 (cross-list from cs.CV) [pdf, html, other]
Title: AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports
Xiangwen Zhang, Qian Zhang, Longfei Han, Qiang Qu, Xiaoming Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3081] arXiv:2503.20658 (cross-list from eess.SP) [pdf, html, other]
Title: Probabilistic Forecasting for Network Resource Analysis in Integrated Terrestrial and Non-Terrestrial Networks
Cristian J. Vaca-Rubio, Vaishnavi Kasuluru, Engin Zeydan, Luis Blanco, Roberto Pereira, Marius Caus, Kapal Dev
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[3082] arXiv:2503.20685 (cross-list from cs.CV) [pdf, html, other]
Title: Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound
Yuhao Huang, Ao Chang, Haoran Dou, Xing Tao, Xinrui Zhou, Yan Cao, Ruobing Huang, Alejandro F Frangi, Lingyun Bao, Xin Yang, Dong Ni
Comments: Accepted by Medical Image Analysis. 24 pages, 13 figures, 20 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3083] arXiv:2503.20739 (cross-list from cs.CV) [pdf, other]
Title: Emotion Detection and Music Recommendation System
Swetha Kambham, Hubert Jhonson, Sai Prathap Reddy Kambham
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3084] arXiv:2503.20742 (cross-list from cs.LG) [pdf, other]
Title: Quantum Neural Network Restatement of Markov Jump Process
Z.Zarezadeh, N.Zarezadeh
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Numerical Analysis (math.NA)
[3085] arXiv:2503.20744 (cross-list from cs.CV) [pdf, html, other]
Title: High Quality Diffusion Distillation on a Single GPU with Relative and Absolute Position Matching
Guoqiang Zhang, Kenta Niwa, J.P. Lewis, Cedric Mesnage, W. Bastiaan Kleijn
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3086] arXiv:2503.20750 (cross-list from cs.LG) [pdf, html, other]
Title: Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework
Soham Sane
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3087] arXiv:2503.20752 (cross-list from cs.CV) [pdf, html, other]
Title: Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning
Huajie Tan, Yuheng Ji, Xiaoshuai Hao, Minglan Lin, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang
Comments: 35 pages, 22 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3088] arXiv:2503.20756 (cross-list from cs.CL) [pdf, html, other]
Title: ADS-Edit: A Multimodal Knowledge Editing Dataset for Autonomous Driving Systems
Chenxi Wang, Jizhan Fang, Xiang Chen, Bozhong Tian, Ziwen Xu, Huajun Chen, Ningyu Zhang
Comments: ACM MM 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[3089] arXiv:2503.20783 (cross-list from cs.LG) [pdf, html, other]
Title: Understanding R1-Zero-Like Training: A Critical Perspective
Zichen Liu, Changyu Chen, Wenjun Li, Penghui Qi, Tianyu Pang, Chao Du, Wee Sun Lee, Min Lin
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[3090] arXiv:2503.20786 (cross-list from cs.CL) [pdf, html, other]
Title: Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark
Sondos Mahmoud Bsharat, Mukul Ranjan, Aidar Myrzakhan, Jiacheng Liu, Bowei Guo, Shengkun Tang, Zhuang Liu, Yuanzhi Li, Zhiqiang Shen
Comments: An order-invariant and mobile-centric benchmark. Code and data are available at: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3091] arXiv:2503.20790 (cross-list from cs.HC) [pdf, html, other]
Title: Toward a Human-Centered AI-assisted Colonoscopy System in Australia
Hsiang-Ting Chen, Yuan Zhang, Gustavo Carneiro, Rajvinder Singh
Comments: 4 pages, accepted by CHI '25 workshop Envisioning the Future of Interactive Health
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI)
[3092] arXiv:2503.20796 (cross-list from cs.CR) [pdf, html, other]
Title: EXPLICATE: Enhancing Phishing Detection through Explainable AI and LLM-Powered Interpretability
Bryan Lim, Roman Huerta, Alejandro Sotelo, Anthonie Quintela, Priyanka Kumar
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
[3093] arXiv:2503.20798 (cross-list from cs.CR) [pdf, html, other]
Title: Payload-Aware Intrusion Detection with CMAE and Large Language Models
Yongcheol Kim, Chanjae Lee, Young Yoon
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
[3094] arXiv:2503.20800 (cross-list from cs.CR) [pdf, html, other]
Title: Evidencing Unauthorized Training Data from AI Generated Content using Information Isotopes
Qi Tao, Yin Jinhua, Cai Dongqi, Xie Yueqi, Wang Huili, Hu Zhiyang, Yang Peiru, Nan Guoshun, Zhou Zhili, Wang Shangguang, Lyu Lingjuan, Huang Yongfeng, Lane Nicholas
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3095] arXiv:2503.20802 (cross-list from cs.CR) [pdf, html, other]
Title: CEFW: A Comprehensive Evaluation Framework for Watermark in Large Language Models
Shuhao Zhang, Bo Cheng, Jiale Han, Yuli Chen, Zhixuan Wu, Changbao Li, Pingli Gu
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
[3096] arXiv:2503.20804 (cross-list from cs.CR) [pdf, html, other]
Title: AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models
Le Qiu, Zelai Xu, Qixin Tan, Wenhao Tang, Chao Yu, Yu Wang
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3097] arXiv:2503.20807 (cross-list from stat.ML) [pdf, html, other]
Title: Fundamental Safety-Capability Trade-offs in Fine-tuning Large Language Models
Pin-Yu Chen, Han Shen, Payel Das, Tianyi Chen
Comments: The first two authors contribute equally to this work and are listed in alphabetical order
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[3098] arXiv:2503.20822 (cross-list from eess.IV) [pdf, html, other]
Title: Synthetic Video Enhances Physical Fidelity in Video Synthesis
Qi Zhao, Xingyu Ni, Ziyu Wang, Feng Cheng, Ziyan Yang, Lu Jiang, Bohan Wang
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[3099] arXiv:2503.20824 (cross-list from eess.IV) [pdf, html, other]
Title: Exploiting Temporal State Space Sharing for Video Semantic Segmentation
Syed Ariff Syed Hesham, Yun Liu, Guolei Sun, Henghui Ding, Jing Yang, Ender Konukoglu, Xue Geng, Xudong Jiang
Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3100] arXiv:2503.20825 (cross-list from stat.ML) [pdf, html, other]
Title: Debiasing Kernel-Based Generative Models
Tian Qin, Wei-Min Huang
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI)
[3101] arXiv:2503.20831 (cross-list from cs.CR) [pdf, html, other]
Title: Advancing Vulnerability Classification with BERT: A Multi-Objective Learning Model
Himanshu Tiwari
Comments: 9 Pages
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
[3102] arXiv:2503.20842 (cross-list from cs.RO) [pdf, other]
Title: Anti Robot Speciesism
Julian De Freitas, Noah Castelo, Bernd Schmitt, Miklos Sarvary
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI)
[3103] arXiv:2503.20844 (cross-list from cs.LG) [pdf, html, other]
Title: Robust Deep Reinforcement Learning in Robotics via Adaptive Gradient-Masked Adversarial Attacks
Zongyuan Zhang, Tianyang Duan, Zheng Lin, Dong Huang, Zihan Fang, Zekai Sun, Ling Xiong, Hongbin Liang, Heming Cui, Yong Cui, Yue Gao
Comments: 9 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Networking and Internet Architecture (cs.NI); Robotics (cs.RO)
[3104] arXiv:2503.20848 (cross-list from cs.GT) [pdf, html, other]
Title: The Backfiring Effect of Weak AI Safety Regulation
Benjamin Laufer, Jon Kleinberg, Hoda Heidari
Comments: 35 pages, 5 figures
Subjects: Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Theoretical Economics (econ.TH)
[3105] arXiv:2503.20853 (cross-list from cs.CV) [pdf, html, other]
Title: Unified Multimodal Discrete Diffusion
Alexander Swerdlow, Mihir Prabhudesai, Siddharth Gandhi, Deepak Pathak, Katerina Fragkiadaki
Comments: Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[3106] arXiv:2503.20871 (cross-list from cs.CV) [pdf, html, other]
Title: VinaBench: Benchmark for Faithful and Consistent Visual Narratives
Silin Gao, Sheryl Mathew, Li Mi, Sepideh Mamooler, Mengjie Zhao, Hiromi Wakaki, Yuki Mitsufuji, Syrielle Montariol, Antoine Bosselut
Comments: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[3107] arXiv:2503.20884 (cross-list from cs.CR) [pdf, html, other]
Title: Robust Federated Learning Against Poisoning Attacks: A GAN-Based Defense Framework
Usama Zafar, André Teixeira, Salman Toor
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[3108] arXiv:2503.20903 (cross-list from cs.LG) [pdf, html, other]
Title: Assessing Generative Models for Structured Data
Reilly Cannon, Nicolette M. Laird, Caesar Vazquez, Andy Lin, Amy Wagler, Tony Chiang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3109] arXiv:2503.20914 (cross-list from cs.IR) [pdf, html, other]
Title: D4R -- Exploring and Querying Relational Graphs Using Natural Language and Large Language Models -- the Case of Historical Documents
Michel Boeglin, David Kahn, Josiane Mothe, Diego Ortiz, David Panzoli
Comments: 8 pages, 7 figures
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[3110] arXiv:2503.20925 (cross-list from cs.CV) [pdf, html, other]
Title: Prototype Guided Backdoor Defense
Venkat Adithya Amula, Sunayana Samavedam, Saurabh Saini, Avani Gupta, Narayanan P J
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3111] arXiv:2503.20936 (cross-list from cs.CV) [pdf, html, other]
Title: LATTE-MV: Learning to Anticipate Table Tennis Hits from Monocular Videos
Daniel Etaat, Dvij Kalaria, Nima Rahmanian, Shankar Sastry
Comments: CVPR 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3112] arXiv:2503.20952 (cross-list from cs.LG) [pdf, html, other]
Title: TS-Inverse: A Gradient Inversion Attack Tailored for Federated Time Series Forecasting Models
Caspar Meijer, Jiyue Huang, Shreshtha Sharma, Elena Lazovik, Lydia Y. Chen
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3113] arXiv:2503.20959 (cross-list from cs.CL) [pdf, html, other]
Title: Sociotechnical Effects of Machine Translation
Joss Moorkens, Andy Way, Séamus Lankford
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3114] arXiv:2503.20975 (cross-list from cs.GT) [pdf, html, other]
Title: Competitive Multi-armed Bandit Games for Resource Sharing
Hongbo Li, Lingjie Duan
Comments: This paper has been accepted by IEEE TMC
Subjects: Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI)
[3115] arXiv:2503.20981 (cross-list from cs.CL) [pdf, html, other]
Title: Patients Speak, AI Listens: LLM-based Analysis of Online Reviews Uncovers Key Drivers for Urgent Care Satisfaction
Xiaoran Xu, Zhaoqian Xue, Chi Zhang, Jhonatan Medri, Junjie Xiong, Jiayan Zhou, Jin Jin, Yongfeng Zhang, Siyuan Ma, Lingyao Li
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[3116] arXiv:2503.20990 (cross-list from cs.CE) [pdf, html, other]
Title: FinAudio: A Benchmark for Audio Large Language Models in Financial Applications
Yupeng Cao, Haohang Li, Yangyang Yu, Shashidhar Reddy Javaji, Yueru He, Jimin Huang, Zining Zhu, Qianqian Xie, Xiao-yang Liu, Koduvayur Subbalakshmi, Meikang Qiu, Sophia Ananiadou, Jian-Yun Nie
Subjects: Computational Engineering, Finance, and Science (cs.CE); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[3117] arXiv:2503.21000 (cross-list from cs.LG) [pdf, html, other]
Title: Improving User Behavior Prediction: Leveraging Annotator Metadata in Supervised Machine Learning Models
Lynnette Hui Xian Ng, Kokil Jaidka, Kaiyuan Tay, Hansin Ahuja, Niyati Chhaya
Comments: Accepted at CSCW 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3118] arXiv:2503.21011 (cross-list from cs.CL) [pdf, html, other]
Title: Can Large Language Models Predict Associations Among Human Attitudes?
Ana Ma, Derek Powell
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3119] arXiv:2503.21074 (cross-list from cs.CV) [pdf, other]
Title: Rerouting Connection: Hybrid Computer Vision Analysis Reveals Visual Similarity Between Indus and Tibetan-Yi Corridor Writing Systems
Ooha Lakkadi Reddy
Comments: 107 pages (43 main text, 6 references, 58 appendices). 21 figures, 4 tables in main text; 106 figures, 8 tables total. Code available at this https URL. Undergraduate thesis at Duke Kunshan University. Accepted for presentation at the 52nd International Conference for Computer Applications & Quantitative Methods in Archaeology (CAA 2025), Athens, Greece
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[3120] arXiv:2503.21088 (cross-list from cs.CL) [pdf, html, other]
Title: ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging
Haoming Xu, Shuxun Wang, Yanqiu Zhao, Yi Zhong, Ziyan Jiang, Ningyuan Zhao, Shumin Deng, Huajun Chen, Ningyu Zhang
Comments: SemEval@ACL 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[3121] arXiv:2503.21095 (cross-list from cs.LG) [pdf, html, other]
Title: Confidence Adjusted Surprise Measure for Active Resourceful Trials (CA-SMART): A Data-driven Active Learning Framework for Accelerating Material Discovery under Resource Constraints
Ahmed Shoyeb Raihan, Zhichao Liu, Tanveer Hossain Bhuiyan, Imtiaz Ahmed
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP)
[3122] arXiv:2503.21098 (cross-list from cs.IR) [pdf, other]
Title: Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search
Yedan Shen, Kaixin Wu, Yuechen Ding, Jingyuan Wen, Hong Liu, Mingjie Zhong, Zhouhan Lin, Jia Xu, Linjian Mo
Comments: Accepted by SIGIR 2025
Journal-ref: SIGIR 2025
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
[3123] arXiv:2503.21109 (cross-list from cs.DC) [pdf, html, other]
Title: Optimizing Multi-DNN Inference on Mobile Devices through Heterogeneous Processor Co-Execution
Yunquan Gao, Zhiguo Zhang, Praveen Kumar Donta, Chinmaya Kumar Dehury, Xiujun Wang, Dusit Niyato, Qiyang Zhang
Comments: 14 pages, 12 figures, 5 tables
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI)
[3124] arXiv:2503.21150 (cross-list from cs.CV) [pdf, html, other]
Title: The Devil is in Low-Level Features for Cross-Domain Few-Shot Segmentation
Yuhan Liu, Yixiong Zou, Yuhua Li, Ruixuan Li
Comments: Accepted by CVPR 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3125] arXiv:2503.21154 (cross-list from cs.LG) [pdf, html, other]
Title: Federated Learning with Differential Privacy: An Utility-Enhanced Approach
Kanishka Ranaweera, Dinh C. Nguyen, Pubudu N. Pathirana, David Smith, Ming Ding, Thierry Rakotoarivelo, Aruna Seneviratne
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3126] arXiv:2503.21159 (cross-list from cs.LG) [pdf, html, other]
Title: Multi-Objective Optimization for Privacy-Utility Balance in Differentially Private Federated Learning
Kanishka Ranaweera, David Smith, Pubudu N. Pathirana, Ming Ding, Thierry Rakotoarivelo, Aruna Seneviratne
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3127] arXiv:2503.21164 (cross-list from cs.CV) [pdf, other]
Title: Adversarial Wear and Tear: Exploiting Natural Damage for Generating Physical-World Adversarial Examples
Samra Irshad, Seungkyu Lee, Nassir Navab, Hong Joo Lee, Seong Tae Kim
Comments: 11 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3128] arXiv:2503.21200 (cross-list from cs.LG) [pdf, html, other]
Title: Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation
Sicong Liu, Yang Shu, Chenjuan Guo, Bin Yang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[3129] arXiv:2503.21219 (cross-list from cs.CV) [pdf, html, other]
Title: GenFusion: Closing the Loop between Reconstruction and Generation via Videos
Sibo Wu, Congrong Xu, Binbin Huang, Andreas Geiger, Anpei Chen
Comments: CVPR 2025, project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3130] arXiv:2503.21237 (cross-list from cs.IR) [pdf, html, other]
Title: Bias-Aware Agent: Enhancing Fairness in AI-Driven Knowledge Retrieval
Karanbir Singh, William Ngu
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[3131] arXiv:2503.21241 (cross-list from cs.LG) [pdf, html, other]
Title: Feature-Enhanced Machine Learning for All-Cause Mortality Prediction in Healthcare Data
HyeYoung Lee, Pavel Tsoi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[3132] arXiv:2503.21244 (cross-list from cs.LG) [pdf, html, other]
Title: Improving $(α, f)$-Byzantine Resilience in Federated Learning via layerwise aggregation and cosine distance
Mario García-Márquez, Nuria Rodríguez-Barroso, M.Victoria Luzón, Francisco Herrera
Comments: Submitted to Knowledge-Based Systems
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3133] arXiv:2503.21248 (cross-list from cs.CL) [pdf, html, other]
Title: ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition
Yujie Liu, Zonglin Yang, Tong Xie, Jinjie Ni, Ben Gao, Yuqiang Li, Shixiang Tang, Wanli Ouyang, Erik Cambria, Dongzhan Zhou
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE)
[3134] arXiv:2503.21251 (cross-list from cs.LG) [pdf, html, other]
Title: Dual-Splitting Conformal Prediction for Multi-Step Time Series Forecasting
Qingdi Yu, Zhiwei Cao, Ruihang Wang, Zhen Yang, Lijun Deng, Min Hu, Yong Luo, Xin Zhou
Comments: 28 pages, 13 figures, 3 tables. Submitted to Applied Soft Computing (status: With Editor). This is the first public release of the work
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3135] arXiv:2503.21254 (cross-list from cs.CV) [pdf, html, other]
Title: Vision-to-Music Generation: A Survey
Zhaokai Wang, Chenxi Bao, Le Zhuo, Jingrui Han, Yang Yue, Yihong Tang, Victor Shea-Jay Huang, Yue Liao
Journal-ref: ISMIR 2025 "A Survey on Vision to Music Generation: Methods, Datasets, Evaluation, and Challenges"
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[3136] arXiv:2503.21257 (cross-list from cs.RO) [pdf, html, other]
Title: OminiAdapt: Learning Cross-Task Invariance for Robust and Environment-Aware Robotic Manipulation
Yongxu Wang, Weiyun Yi, Xinhao Kong, Wanting Li
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI)
[3137] arXiv:2503.21258 (cross-list from cs.CV) [pdf, html, other]
Title: Learn by Reasoning: Analogical Weight Generation for Few-Shot Class-Incremental Learning
Jizhou Han, Chenhao Ding, Yuhang He, Songlin Dong, Qiang Wang, Xinyuan Gao, Yihong Gong
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3138] arXiv:2503.21284 (cross-list from cs.CV) [pdf, html, other]
Title: Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression
Hanyue Tu, Siqi Wu, Li Li, Wengang Zhou, Houqiang Li
Comments: Accepted for publication in IEEE Transactions on Multimedia 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3139] arXiv:2503.21305 (cross-list from cs.CR) [pdf, html, other]
Title: DeBackdoor: A Deductive Framework for Detecting Backdoor Attacks on Deep Models with Limited Data
Dorde Popovic, Amin Sadeghi, Ting Yu, Sanjay Chawla, Issa Khalil
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
[3140] arXiv:2503.21307 (cross-list from cs.CV) [pdf, html, other]
Title: InternVL-X: Advancing and Accelerating InternVL Series with Efficient Visual Token Compression
Dongchen Lu, Yuyao Sun, Zilu Zhang, Leping Huang, Jianliang Zeng, Mao Shu, Huo Cao
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3141] arXiv:2503.21309 (cross-list from cs.CV) [pdf, html, other]
Title: FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval
Zixu Li, Zhiheng Fu, Yupeng Hu, Zhiwei Chen, Haokun Wen, Liqiang Nie
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3142] arXiv:2503.21332 (cross-list from cs.CL) [pdf, html, other]
Title: ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback
Taewon Yun, Jihwan Oh, Hyangsuk Min, Yuho Lee, Jihwan Bang, Jason Cai, Hwanjun Song
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3143] arXiv:2503.21335 (cross-list from cs.AR) [pdf, html, other]
Title: A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices
Ci-Hao Wu, Tian-Sheuan Chang
Journal-ref: in IEEE Open Journal of Circuits and Systems, vol. 5, pp. 128-140, 2024
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[3144] arXiv:2503.21337 (cross-list from cs.AR) [pdf, html, other]
Title: A 71.2-$μ$W Speech Recognition Accelerator with Recurrent Spiking Neural Network
Chih-Chyau Yang, Tian-Sheuan Chang
Journal-ref: in IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 71, no. 7, pp. 3203-3213, July 2024
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[3145] arXiv:2503.21347 (cross-list from cs.NE) [pdf, html, other]
Title: Residual Learning Inspired Crossover Operator and Strategy Enhancements for Evolutionary Multitasking
Ruilin Wang, Xiang Feng, Huiqun Yu, Edmund M-K Lai
Comments: 9 pages, 4 figures
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI)
[3146] arXiv:2503.21356 (cross-list from cs.LG) [pdf, html, other]
Title: Investigating the Duality of Interpretability and Explainability in Machine Learning
Moncef Garouani, Josiane Mothe, Ayah Barhrhouj, Julien Aligon
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3147] arXiv:2503.21392 (cross-list from cs.LG) [pdf, html, other]
Title: HybridoNet-Adapt: A Domain-Adapted Framework for Accurate Lithium-Ion Battery RUL Prediction
Khoa Tran, Bao Huynh, Tri Le, Lam Pham, Vy-Rin Nguyen, Hung-Cuong Trinh, Duong Tran Anh
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3148] arXiv:2503.21393 (cross-list from cs.CL) [pdf, html, other]
Title: An evaluation of LLMs and Google Translate for translation of selected Indian languages via sentiment and semantic analyses
Rohitash Chandra, Aryan Chaudhari, Yeshwanth Rayavarapu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3149] arXiv:2503.21422 (cross-list from q-fin.CP) [pdf, html, other]
Title: From Deep Learning to LLMs: A survey of AI in Quantitative Investment
Bokai Cao, Saizhuo Wang, Xinyi Lin, Xiaojun Wu, Haohan Zhang, Lionel M. Ni, Jian Guo
Subjects: Computational Finance (q-fin.CP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Statistical Finance (q-fin.ST); Trading and Market Microstructure (q-fin.TR)
[3150] arXiv:2503.21463 (cross-list from cs.CR) [pdf, html, other]
Title: Unveiling Latent Information in Transaction Hashes: Hypergraph Learning for Ethereum Ponzi Scheme Detection
Junhao Wu, Yixin Yang, Chengxiang Jin, Silu Mu, Xiaolei Qian, Jiajun Zhou, Shanqing Yu, Qi Xuan
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
[3151] arXiv:2503.21464 (cross-list from cs.CL) [pdf, html, other]
Title: Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection
Ryan Marinelli, Josef Pichlmeier, Tamas Bisztray
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Performance (cs.PF)
[3152] arXiv:2503.21465 (cross-list from cs.CV) [pdf, html, other]
Title: Retinal Fundus Multi-Disease Image Classification using Hybrid CNN-Transformer-Ensemble Architectures
Deependra Singh, Saksham Agarwal, Subhankar Mishra
Comments: 17 pages, 3 figures, 7 tables. Conference paper presented at the International Health Informatics Conference (IHIC 2023)
Journal-ref: In: Proceedings of the International Health Informatics Conference (IHIC 2023). Lecture Notes in Networks and Systems, vol. 1113, Springer, Singapore, pp. 103-120 (2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3153] arXiv:2503.21495 (cross-list from cs.LG) [pdf, html, other]
Title: Adaptive Resampling with Bootstrap for Noisy Multi-Objective Optimization Problems
Timo Budszuhn, Mark Joachim Krallmann, Daniel Horn
Comments: 14 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[3154] arXiv:2503.21504 (cross-list from cs.CL) [pdf, html, other]
Title: Keyword-Oriented Multimodal Modeling for Euphemism Identification
Yuxue Hu, Junsong Li, Meixuan Chen, Dongyu Su, Tongguan Wang, Ying Sha
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3155] arXiv:2503.21514 (cross-list from quant-ph) [pdf, html, other]
Title: Quantitative Evaluation of Quantum/Classical Neural Network Using a Game Solver Metric
Suzukaze Kamei, Hideaki Kawaguchi, Shin Nishio, Tatakahiko Satoh
Comments: 11 pages, 16 figures
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3156] arXiv:2503.21522 (cross-list from cs.SE) [pdf, html, other]
Title: MONO2REST: Identifying and Exposing Microservices: a Reusable RESTification Approach
Matthéo Lecrivain, Hanifa Barry, Dalila Tamzalit, Houari Sahraoui
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
[3157] arXiv:2503.21530 (cross-list from cs.CL) [pdf, html, other]
Title: Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models
Umer Butt, Stalin Veranasi, Günter Neumann
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3158] arXiv:2503.21541 (cross-list from cs.CV) [pdf, html, other]
Title: LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing
Achint Soni, Meet Soni, Sirisha Rambhatla
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3159] arXiv:2503.21544 (cross-list from cs.CL) [pdf, html, other]
Title: SWI: Speaking with Intent in Large Language Models
Yuwei Yin, EunJeong Hwang, Giuseppe Carenini
Comments: Code: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3160] arXiv:2503.21558 (cross-list from cs.SI) [pdf, html, other]
Title: A Local Perspective-based Model for Overlapping Community Detection
Gaofeng Zhou, Rui-Feng Wang, Kangning Cui
Comments: 10 pages, 3 figures, 3 tables
Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI)
[3161] arXiv:2503.21571 (cross-list from cs.SD) [pdf, html, other]
Title: Magnitude-Phase Dual-Path Speech Enhancement Network based on Self-Supervised Embedding and Perceptual Contrast Stretch Boosting
Alimjan Mattursun, Liejun Wang, Yinfeng Yu, Chunyang Ma
Comments: Main paper (6 pages). Accepted for publication by ICME 2025
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[3162] arXiv:2503.21581 (cross-list from cs.CV) [pdf, html, other]
Title: AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion
Liuyue Xie, Jiancong Guo, Ozan Cakmakci, Andre Araujo, Laszlo A. Jeni, Zhiheng Jia
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3163] arXiv:2503.21592 (cross-list from cs.LG) [pdf, html, other]
Title: Simple and Critical Iterative Denoising: A Recasting of Discrete Diffusion in Graph Generation
Yoann Boget
Comments: ICML 2025 Accepted paper
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3164] arXiv:2503.21598 (cross-list from cs.CR) [pdf, other]
Title: Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing
Johan Wahréus, Ahmed Hussain, Panos Papadimitratos
Comments: 22 pages; 26 figures
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3165] arXiv:2503.21615 (cross-list from cs.HC) [pdf, html, other]
Title: A Measure Based Generalizable Approach to Understandability
Vikas Kushwaha, Sruti Srinivasa Ragavan, Subhajit Roy
Comments: 6 pages
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[3166] arXiv:2503.21634 (cross-list from cs.LG) [pdf, html, other]
Title: When Astronomy Meets AI: Manazel For Crescent Visibility Prediction in Morocco
Yassir Lairgi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3167] arXiv:2503.21657 (cross-list from cs.LG) [pdf, html, other]
Title: Model Assembly Learning with Heterogeneous Layer Weight Merging
Yi-Kai Zhang, Jin Wang, Xu-Xiang Zhong, De-Chuan Zhan, Han-Jia Ye
Comments: ICLR 2025 Workshop on Neural Network Weights as a New Data Modality
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[3168] arXiv:2503.21670 (cross-list from cs.CL) [pdf, html, other]
Title: COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing
Rajvee Sheth, Himanshu Beniwal, Mayank Singh
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3169] arXiv:2503.21674 (cross-list from cs.CR) [pdf, html, other]
Title: Intelligent IoT Attack Detection Design via ODLLM with Feature Ranking-based Knowledge Base
Satvik Verma, Qun Wang, E. Wes Bethel
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Networking and Internet Architecture (cs.NI)
[3170] arXiv:2503.21677 (cross-list from cs.LG) [pdf, html, other]
Title: A tale of two goals: leveraging sequentiality in multi-goal scenarios
Olivier Serris, Stéphane Doncieux, Olivier Sigaud
Comments: 14 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3171] arXiv:2503.21694 (cross-list from cs.GR) [pdf, html, other]
Title: Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
Zhiyuan Ma, Xinyue Liang, Rongyuan Wu, Xiangyu Zhu, Zhen Lei, Lei Zhang
Comments: Accepted to CVPR 2025. Code: this https URL. Demo: this https URL
Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3172] arXiv:2503.21695 (cross-list from cs.CV) [pdf, html, other]
Title: AMA-SAM: Adversarial Multi-Domain Alignment of Segment Anything Model for High-Fidelity Histology Nuclei Segmentation
Jiahe Qian, Yaoyu Fang, Jinkui Hao, Bo Zhou
Comments: 13 pages, 4 tables, 2 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3173] arXiv:2503.21699 (cross-list from cs.MM) [pdf, html, other]
Title: MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX
Liuyue Xie, George Z. Wei, Avik Kuthiala, Ce Zheng, Ananya Bal, Mosam Dabhi, Liting Wen, Taru Rustagi, Ethan Lai, Sushil Khyalia, Rohan Choudhury, Morteza Ziyadi, Xu Zhang, Hao Yang, László A. Jeni
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[3174] arXiv:2503.21708 (cross-list from cs.LG) [pdf, html, other]
Title: The Mathematical Relationship Between Layer Normalization and Dynamic Activation Functions
Felix Stollenwerk
Comments: New title, renamed DyISRU, added missing parentheses in proof of theorem 3, minor language corrections
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[3175] arXiv:2503.21718 (cross-list from cs.CL) [pdf, html, other]
Title: Outlier dimensions favor frequent tokens in language models
Iuri Macocco, Nora Graichen, Gemma Boleda, Marco Baroni
Comments: 9 pages, 4 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3176] arXiv:2503.21720 (cross-list from cs.CL) [pdf, html, other]
Title: Collab: Controlled Decoding using Mixture of Agents for LLM Alignment
Souradip Chakraborty, Sujay Bhatt, Udari Madhushani Sehwag, Soumya Suvra Ghosal, Jiahao Qiu, Mengdi Wang, Dinesh Manocha, Furong Huang, Alec Koppel, Sumitra Ganesh
Comments: Accepted to ICLR 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3177] arXiv:2503.21729 (cross-list from cs.CL) [pdf, html, other]
Title: ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation
Zhicheng Lee, Shulin Cao, Jinxin Liu, Jiajie Zhang, Weichuan Liu, Xiaoyin Che, Lei Hou, Juanzi Li
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3178] arXiv:2503.21735 (cross-list from cs.SE) [pdf, html, other]
Title: GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics
Arsham Gholamzadeh Khoee, Shuai Wang, Yinan Yu, Robert Feldt, Dhasarathy Parthasarathy
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multiagent Systems (cs.MA)
[3179] arXiv:2503.21747 (cross-list from cs.CV) [pdf, html, other]
Title: CTRL-O: Language-Controllable Object-Centric Visual Representation Learning
Aniket Didolkar, Andrii Zadaianchuk, Rabiul Awal, Maximilian Seitzer, Efstratios Gavves, Aishwarya Agrawal
Comments: Accepted at CVPR 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3180] arXiv:2503.21757 (cross-list from cs.CV) [pdf, html, other]
Title: Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck
Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3181] arXiv:2503.21761 (cross-list from cs.CV) [pdf, html, other]
Title: Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
David Yifan Yao, Albert J. Zhai, Shenlong Wang
Comments: CVPR 2025. Project page (with code): this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3182] arXiv:2503.21766 (cross-list from cs.CV) [pdf, html, other]
Title: Stable-SCore: A Stable Registration-based Framework for 3D Shape Correspondence
Haolin Liu, Xiaohang Zhan, Zizheng Yan, Zhongjin Luo, Yuxin Wen, Xiaoguang Han
Comments: Accepted by CVPR 2025. Homepage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3183] arXiv:2503.21775 (cross-list from cs.CV) [pdf, html, other]
Title: StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion
Ziyu Guo, Young Yoon Lee, Joseph Liu, Yizhak Ben-Shabat, Victor Zordan, Mubbasir Kapadia
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Graphics (cs.GR); Machine Learning (cs.LG)
[3184] arXiv:2503.21790 (cross-list from stat.AP) [pdf, html, other]
Title: March Madness Tournament Predictions Model: A Mathematical Modeling Approach
Christian McIver, Karla Avalos, Nikhil Nayak
Comments: 7 pages, 5 figures
Subjects: Applications (stat.AP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3185] arXiv:2503.21793 (cross-list from cs.NE) [pdf, html, other]
Title: Input-Triggered Hardware Trojan Attack on Spiking Neural Networks
Spyridon Raptis, Paul Kling, Ioannis Kaskampas, Ihsen Alouani, Haralampos-G. Stratigopoulos
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI)
[3186] arXiv:2503.21794 (cross-list from cs.NE) [pdf, other]
Title: Architecture of Information
Yurii Parzhyn
Comments: 81 pages, 5 figures
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG)
[3187] arXiv:2503.21795 (cross-list from cs.NE) [pdf, html, other]
Title: Threshold Adaptation in Spiking Networks Enables Shortest Path Finding and Place Disambiguation
Robin Dietrich, Tobias Fischer, Nicolai Waniek, Nico Reeb, Michael Milford, Alois Knoll, Adam D. Hines
Comments: Appears in the proceedings of the 2025 Neuro Inspired Computational Elements Conference (NICE)
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[3188] arXiv:2503.21797 (cross-list from cs.NE) [pdf, html, other]
Title: A Novel Two-Phase Cooperative Co-evolution Framework for Large-Scale Global Optimization with Complex Overlapping
Wenjie Qiu, Hongshu Guo, Zeyuan Ma, Yue-Jiao Gong
Comments: Accepted at ACM GECCO 2025
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI)
[3189] arXiv:2503.21800 (cross-list from cs.CL) [pdf, html, other]
Title: ELM: Ensemble of Language Models for Predicting Tumor Group from Pathology Reports
Lovedeep Gondara, Jonathan Simkin, Shebnum Devji, Gregory Arbour, Raymond Ng
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3190] arXiv:2503.21801 (cross-list from cs.LG) [pdf, html, other]
Title: Efficient Joint Prediction of Multiple Future Tokens
Kwangjun Ahn, Alex Lamb, John Langford
Comments: Technical report; comments welcome!
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[3191] arXiv:2503.21803 (cross-list from cs.LG) [pdf, other]
Title: Forecasting Volcanic Radiative Power (VPR) at Fuego Volcano Using Bayesian Regularized Neural Network
Snehamoy Chatterjee, Greg Waite, Sidike Paheding, Luke Bowman
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Signal Processing (eess.SP); Atmospheric and Oceanic Physics (physics.ao-ph)
[3192] arXiv:2503.21804 (cross-list from cs.LG) [pdf, html, other]
Title: Comparison of Metadata Representation Models for Knowledge Graph Embeddings
Shusaku Egami, Kyoumoto Matsushita, Takanori Ugai, Ken Fukuda
Comments: 11 pages, 9 Figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[3193] arXiv:2503.21805 (cross-list from cs.CL) [pdf, html, other]
Title: ImF: Implicit Fingerprint for Large Language Models
Jiaxuan Wu, Wanli Peng, Hang Fu, Yiming Xue, Juan Wen
Comments: 13 pages, 6 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3194] arXiv:2503.21806 (cross-list from cs.CL) [pdf, html, other]
Title: Large Language Models Meet Contrastive Learning: Zero-Shot Emotion Recognition Across Languages
Heqing Zou, Fengmao Lv, Desheng Zheng, Eng Siong Chng, Deepu Rajan
Comments: Accepted to ICME 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3195] arXiv:2503.21807 (cross-list from cs.LG) [pdf, html, other]
Title: LERO: LLM-driven Evolutionary framework with Hybrid Rewards and Enhanced Observation for Multi-Agent Reinforcement Learning
Yuan Wei, Xiaohan Shan, Jianmin Li
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[3196] arXiv:2503.21810 (cross-list from cs.DB) [pdf, html, other]
Title: Taxonomy Inference for Tabular Data Using Large Language Models
Zhenyu Wu, Jiaoyan Chen, Norman W. Paton
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[3197] arXiv:2503.21813 (cross-list from cs.CL) [pdf, html, other]
Title: OAEI-LLM-T: A TBox Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching
Zhangcheng Qiang, Kerry Taylor, Weiqing Wang, Jing Jiang
Comments: 14 pages, 4 figures, 4 tables, 2 prompt templates
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[3198] arXiv:2503.21815 (cross-list from quant-ph) [pdf, html, other]
Title: ATP: Adaptive Threshold Pruning for Efficient Data Encoding in Quantum Neural Networks
Mohamed Afane, Gabrielle Ebbrecht, Ying Wang, Juntao Chen, Junaid Farooq
Comments: Accepted at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3199] arXiv:2503.21834 (cross-list from cs.CV) [pdf, html, other]
Title: A Multi-Modal Knowledge-Enhanced Framework for Vessel Trajectory Prediction
Haomin Yu, Tianyi Li, Kristian Torp, Christian S. Jensen
Comments: 8 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3200] arXiv:2503.21838 (cross-list from cs.CL) [pdf, html, other]
Title: MSPLoRA: A Multi-Scale Pyramid Low-Rank Adaptation for Efficient Model Fine-Tuning
Jiancheng Zhao, Xingda Yu, Zhen Yang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3201] arXiv:2503.21839 (cross-list from cs.CV) [pdf, html, other]
Title: M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?
Haolong Yan, Kaijun Tan, Yeqing Shen, Xin Huang, Zheng Ge, Xiangyu Zhang, Si Li, Daxin Jiang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3202] arXiv:2503.21843 (cross-list from cs.CV) [pdf, html, other]
Title: CMD-HAR: Cross-Modal Disentanglement for Wearable Human Activity Recognition
Hanyu Liu, Siyao Li, Ying Yu, Yixuan Jiang, Hang Xiao, Jingxi Long, Haotian Tang, Chao Li
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3203] arXiv:2503.21846 (cross-list from cs.NE) [pdf, html, other]
Title: LightSNN: Lightweight Architecture Search for Sparse and Accurate Spiking Neural Networks
Yesmine Abdennadher, Giovanni Perin, Riccardo Mazzieri, Jacopo Pegoraro, Michele Rossi
Comments: Accepted to AMLDS 2025 (Tokyo, July 2025). 6 pages, 3 figures, 2 tables
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Signal Processing (eess.SP)
[3204] arXiv:2503.21847 (cross-list from cs.GR) [pdf, html, other]
Title: ReCoM: Realistic Co-Speech Motion Generation with Recurrent Embedded Transformer
Yong Xie, Yunlian Sun, Hongwen Zhang, Yebin Liu, Jinhui Tang
Comments: 8 pages, 6 figures, Project Page: this https URL
Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI)
[3205] arXiv:2503.21848 (cross-list from cs.CV) [pdf, html, other]
Title: Comparative Analysis of Image, Video, and Audio Classifiers for Automated News Video Segmentation
Jonathan Attard, Dylan Seychell
Comments: Preprint for paper in CAI 2025, 7 pages, 5 tables, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3206] arXiv:2503.21854 (cross-list from cs.CV) [pdf, html, other]
Title: Foveated Instance Segmentation
Hongyi Zeng, Wenxuan Liu, Tianhua Xia, Jinhui Chen, Ziyun Li, Sai Qian Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3207] arXiv:2503.21888 (cross-list from cs.CL) [pdf, html, other]
Title: RedditESS: A Mental Health Social Support Interaction Dataset -- Understanding Effective Social Support to Refine AI-Driven Support Tools
Zeyad Alghamdi, Tharindu Kumarage, Garima Agrawal, Mansooreh Karami, Ibrahim Almuteb, Huan Liu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3208] arXiv:2503.21889 (cross-list from cs.CV) [pdf, html, other]
Title: StarFlow: Generating Structured Workflow Outputs From Sketch Images
Patrice Bechard, Chao Wang, Amirhossein Abaskohi, Juan Rodriguez, Christopher Pal, David Vazquez, Spandana Gella, Sai Rajeswar, Perouz Taslakian
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3209] arXiv:2503.21893 (cross-list from cs.CV) [pdf, html, other]
Title: Exponentially Weighted Instance-Aware Repeat Factor Sampling for Long-Tailed Object Detection Model Training in Unmanned Aerial Vehicles Surveillance Scenarios
Taufiq Ahmed, Abhishek Kumar, Constantino Álvarez Casado, Anlan Zhang, Tuomo Hänninen, Lauri Loven, Miguel Bordallo López, Sasu Tarkoma
Comments: 7 pages, 2 figures, 9 tables, 6 formulas, conference paper, code available
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3210] arXiv:2503.21910 (cross-list from cs.CL) [pdf, html, other]
Title: JEEM: Vision-Language Understanding in Four Arabic Dialects
Karima Kadaoui, Hanin Atwany, Hamdan Al-Ali, Abdelrahman Mohamed, Ali Mekky, Sergei Tilga, Natalia Fedorova, Ekaterina Artemova, Hanan Aldarmaki, Yova Kementchedjhieva
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3211] arXiv:2503.21911 (cross-list from cs.CL) [pdf, html, other]
Title: AutoPsyC: Automatic Recognition of Psychodynamic Conflicts from Semi-structured Interviews with Large Language Models
Sayed Muddashir Hossain, Simon Ostermann, Patrick Gebhard, Cord Benecke, Josef van Genabith, Philipp Müller
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3212] arXiv:2503.21928 (cross-list from cs.LG) [pdf, html, other]
Title: An Efficient Training Algorithm for Models with Block-wise Sparsity
Ding Zhu, Zhiqun Zuo, Mohammad Mahdi Khalili
Comments: 24 pages, submitted to Transactions on Machine Learning Research
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3213] arXiv:2503.21937 (cross-list from cs.PL) [pdf, html, other]
Title: Lobster: A GPU-Accelerated Framework for Neurosymbolic Programming
Paul Biberstein, Ziyang Li, Joseph Devietti, Mayur Naik
Subjects: Programming Languages (cs.PL); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[3214] arXiv:2503.21943 (cross-list from cs.CV) [pdf, html, other]
Title: Parametric Shadow Control for Portrait Generation in Text-to-Image Diffusion Models
Haoming Cai, Tsung-Wei Huang, Shiv Gehlot, Brandon Y. Feng, Sachin Shah, Guan-Ming Su, Christopher Metzler
Comments: ShadowDirector Arxiv Version. Fix the arxiv title text issue
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[3215] arXiv:2503.21961 (cross-list from cs.CL) [pdf, html, other]
Title: Entropy-Aware Branching for Improved Mathematical Reasoning
Xianzhi Li, Ethan Callanan, Xiaodan Zhu, Mathieu Sibue, Antony Papadimitriou, Mahmoud Mahfouz, Zhiqiang Ma, Xiaomo Liu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3216] arXiv:2503.21969 (cross-list from cs.RO) [pdf, other]
Title: Embodied Long Horizon Manipulation with Closed-loop Code Generation and Incremental Few-shot Adaptation
Yuan Meng, Xiangtong Yao, Haihui Ye, Yirui Zhou, Shengqiang Zhang, Zhenguo Sun, Xukun Li, Zhenshan Bing, Alois Knoll
Comments: update ICRA 6 page
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI)
[3217] arXiv:2503.21975 (cross-list from cs.RO) [pdf, other]
Title: Pretrained Bayesian Non-parametric Knowledge Prior in Robotic Long-Horizon Reinforcement Learning
Yuan Meng, Xiangtong Yao, Kejia Chen, Yansong Wu, Liding Zhang, Zhenshan Bing, Alois Knoll
Comments: initial upload 8 pages
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI)
[3218] arXiv:2503.21991 (cross-list from cs.CV) [pdf, html, other]
Title: BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
Hang Zhou, Xinxin Zuo, Rui Ma, Li Cheng
Comments: CVPR 2025. Project page: this https URL, code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[3219] arXiv:2503.22020 (cross-list from cs.CV) [pdf, html, other]
Title: CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
Qingqing Zhao, Yao Lu, Moo Jin Kim, Zipeng Fu, Zhuoyang Zhang, Yecheng Wu, Zhaoshuo Li, Qianli Ma, Song Han, Chelsea Finn, Ankur Handa, Ming-Yu Liu, Donglai Xiang, Gordon Wetzstein, Tsung-Yi Lin
Comments: Project website: this https URL
Journal-ref: CVPR 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[3220] arXiv:2503.22023 (cross-list from cs.CY) [pdf, html, other]
Title: Safeguarding Autonomy: a Focus on Machine Learning Decision Systems
Paula Subías-Beltrán, Oriol Pujol, Itziar de Lecuona
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3221] arXiv:2503.22036 (cross-list from cs.CL) [pdf, html, other]
Title: Cognitive Prompts Using Guilford's Structure of Intellect Model
Oliver Kramer
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3222] arXiv:2503.22051 (cross-list from cs.CL) [pdf, html, other]
Title: Non-Monotonic Attention-based Read/Write Policy Learning for Simultaneous Translation
Zeeshan Ahmed, Frank Seide, Zhe Liu, Rastislav Rabatin, Jachym Kolar, Niko Moritz, Ruiming Xie, Simone Merello, Christian Fuegen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3223] arXiv:2503.22068 (cross-list from cs.LG) [pdf, html, other]
Title: A Proposal for Networks Capable of Continual Learning
Zeki Doruk Erden, Boi Faltings
Comments: Published at ICLR 2025 World Models Workshop
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3224] arXiv:2503.22069 (cross-list from cs.CV) [pdf, other]
Title: Contrasting Low and High-Resolution Features for HER2 Scoring using Deep Learning
Ekansh Chauhan, Anila Sharma, Amit Sharma, Vikas Nishadham, Asha Ghughtyal, Ankur Kumar, Gurudutt Gupta, Anurag Mehta, C.V. Jawahar, P.K. Vinod
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3225] arXiv:2503.22074 (cross-list from cs.CL) [pdf, html, other]
Title: Penrose Tiled Low-Rank Compression and Section-Wise Q&A Fine-Tuning: A General Framework for Domain-Specific Large Language Model Adaptation
Chuan-Wei Kuo, Siyu Chen, Chenqi Yan, Yu Yang Fredrik Liu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3226] arXiv:2503.22093 (cross-list from cs.CV) [pdf, html, other]
Title: How Well Can Vision-Language Models Understand Humans' Intention? An Open-ended Theory of Mind Question Evaluation Benchmark
Ximing Wen, Mallika Mainali, Anik Sen
Comments: 4 pages, accepted by ToM@AAAI25
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3227] arXiv:2503.22115 (cross-list from cs.CL) [pdf, html, other]
Title: Beyond Single-Sentence Prompts: Upgrading Value Alignment Benchmarks with Dialogues and Stories
Yazhou Zhang, Qimeng Liu, Qiuchi Li, Peng Zhang, Jing Qin
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[3228] arXiv:2503.22122 (cross-list from cs.RO) [pdf, html, other]
Title: REMAC: Self-Reflective and Self-Evolving Multi-Agent Collaboration for Long-Horizon Robot Manipulation
Puzhen Yuan, Angyuan Ma, Yunchao Yao, Huaxiu Yao, Masayoshi Tomizuka, Mingyu Ding
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[3229] arXiv:2503.22141 (cross-list from cs.SE) [pdf, html, other]
Title: Integrating Artificial Intelligence with Human Expertise: An In-depth Analysis of ChatGPT's Capabilities in Generating Metamorphic Relations
Yifan Zhang (1), Dave Towey (1), Matthew Pike (1), Quang-Hung Luu (2), Huai Liu (2), Tsong Yueh Chen (2) ((1) University of Nottingham Ningbo China, (2) Swinburne University of Technology)
Comments: Submitted to Information and Software Technology
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
[3230] arXiv:2503.22143 (cross-list from eess.SP) [pdf, other]
Title: A Self-Supervised Learning of a Foundation Model for Analog Layout Design Automation
Sungyu Jeong, Won Joon Choi, Junung Choi, Anik Biswas, Byungsub Kim
Comments: 8 pages, 11 figures
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[3231] arXiv:2503.22144 (cross-list from cs.CL) [pdf, html, other]
Title: FRASE: Structured Representations for Generalizable SPARQL Query Generation
Papa Abdou Karim Karou Diallo, Amal Zouaq
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3232] arXiv:2503.22151 (cross-list from cs.CY) [pdf, other]
Title: When Autonomy Breaks: The Hidden Existential Risk of AI
Joshua Krook
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[3233] arXiv:2503.22152 (cross-list from cs.CV) [pdf, html, other]
Title: EgoToM: Benchmarking Theory of Mind Reasoning from Egocentric Videos
Yuxuan Li, Vijay Veerabadran, Michael L. Iuzzolino, Brett D. Roads, Asli Celikyilmaz, Karl Ridgeway
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3234] arXiv:2503.22164 (cross-list from q-bio.BM) [pdf, html, other]
Title: PharmAgents: Building a Virtual Pharma with Large Language Model Agents
Bowen Gao, Yanwen Huang, Yiqiao Liu, Wenxuan Xie, Wei-Ying Ma, Ya-Qin Zhang, Yanyan Lan
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI)
[3235] arXiv:2503.22178 (cross-list from cs.LG) [pdf, html, other]
Title: AdaRank: Adaptive Rank Pruning for Enhanced Model Merging
Chanhyuk Lee, Jiho Choi, Chanryeol Lee, Donggyun Kim, Seunghoon Hong
Comments: Code Available at: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3236] arXiv:2503.22181 (cross-list from cs.CY) [pdf, other]
Title: e-person Architecture and Framework for Human-AI Co-adventure Relationship
Kanako Esaki, Tadayuki Matsumura, Yang Shao, Hiroyuki Mizuno
Comments: 24 pages, 4 figures, 1 table
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[3237] arXiv:2503.22182 (cross-list from cs.IR) [pdf, html, other]
Title: Sell It Before You Make It: Revolutionizing E-Commerce with Personalized AI-Generated Items
Jianghao Lin, Peng Du, Jiaqi Liu, Weite Li, Yong Yu, Weinan Zhang, Yang Cao
Comments: Under Review
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3238] arXiv:2503.22215 (cross-list from cs.CV) [pdf, html, other]
Title: Learning to Instruct for Visual Instruction Tuning
Zhihan Zhou, Feng Hong, Jiaan Luo, Jiangchao Yao, Dongsheng Li, Bo Han, Ya Zhang, Yanfeng Wang
Comments: 16 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[3239] arXiv:2503.22228 (cross-list from cs.SE) [pdf, other]
Title: MFH: A Multi-faceted Heuristic Algorithm Selection Approach for Software Verification
Jie Su, Liansai Deng, Cheng Wen, Rong Wang, Zhi Ma, Nan Zhang, Cong Tian, Zhenhua Duan, Shengchao Qin
Comments: The decision to withdraw the paper is driven by two reasons: 1. A conflict of interest arises from the proposed methods overlapping with pending patent applications by other authors. 2. Upon thorough review, it has been discovered that the paper contains ambiguities and inaccuracies in describing the method, potentially hindering readers' comprehension of the content
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[3240] arXiv:2503.22233 (cross-list from cs.LG) [pdf, html, other]
Title: Process Reward Modeling with Entropy-Driven Uncertainty
Lang Cao, Renhong Chen, Yingtian Zou, Chao Peng, Wu Ning, Huacong Xu, Qian Chen, Yuxian Wang, Peishuo Su, Mofan Peng, Zijie Chen, Yitong Li
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[3241] arXiv:2503.22235 (cross-list from cs.LG) [pdf, html, other]
Title: WeatherMesh-3: Fast and accurate operational global weather forecasting
Haoxing Du, Lyna Kim, Joan Creus-Costa, Jack Michaels, Anuj Shetty, Todd Hutchinson, Christopher Riedel, John Dean
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3242] arXiv:2503.22250 (cross-list from cs.HC) [pdf, html, other]
Title: Modeling Challenging Patient Interactions: LLMs for Medical Communication Training
Anna Bodonhelyi, Christian Stegemann-Philipps, Alessandra Sonanini, Lea Herschbach, Marton Szep, Anne Herrmann-Werner, Teresa Festl-Wietek, Enkelejda Kasneci, Friederike Holderried
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI)
[3243] arXiv:2503.22275 (cross-list from eess.AS) [pdf, html, other]
Title: Make Some Noise: Towards LLM audio reasoning and generation using sound tokens
Shivam Mehta, Nebojsa Jojic, Hannes Gamper
Comments: 5 pages, 2 figures, Accepted at ICASSP 2025
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Sound (cs.SD)
[3244] arXiv:2503.22276 (cross-list from cs.LG) [pdf, html, other]
Title: Machine Learning Models for Soil Parameter Prediction Based on Satellite, Weather, Clay and Yield Data
Calvin Kammerlander, Viola Kolb, Marinus Luegmair, Lou Scheermann, Maximilian Schmailzl, Marco Seufert, Jiayun Zhang, Denis Dalic, Torsten Schön
Comments: This technical report is the documentation of a student project collaboration between Technische Hochschule Ingolstadt and MI4People
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3245] arXiv:2503.22324 (cross-list from cs.CV) [pdf, html, other]
Title: AH-GS: Augmented 3D Gaussian Splatting for High-Frequency Detail Representation
Chenyang Xu, XingGuo Deng, Rui Zhong
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3246] arXiv:2503.22328 (cross-list from cs.CV) [pdf, html, other]
Title: VoteFlow: Enforcing Local Rigidity in Self-Supervised Scene Flow
Yancong Lin, Shiming Wang, Liangliang Nan, Julian Kooij, Holger Caesar
Comments: CVPR 2025. Code is available at this https URL. Yancong Lin and Shiming Wang have equal contributions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3247] arXiv:2503.22353 (cross-list from cs.CL) [pdf, html, other]
Title: Firm or Fickle? Evaluating Large Language Models Consistency in Sequential Interactions
Yubo Li, Yidi Miao, Xueying Ding, Ramayya Krishnan, Rema Padman
Comments: 8 pages, 5 figures
Journal-ref: Published at ACL 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3248] arXiv:2503.22358 (cross-list from cs.DB) [pdf, other]
Title: Shapley Revisited: Tractable Responsibility Measures for Query Answers
Meghyn Bienvenu, Diego Figueira, Pierre Lafourcade
Comments: Long version of PODS'25 paper, with corrected error on Shapley symmetry axiom statement
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[3249] arXiv:2503.22363 (cross-list from cs.CV) [pdf, html, other]
Title: ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection
Nandakishor M, Vrinda Govind V, Anuradha Puthalath, Anzy L, Swathi P S, Aswathi R, Devaprabha A R, Varsha Raj, Midhuna Krishnan K, Akhila Anilkumar T V, Yamuna P V
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3250] arXiv:2503.22374 (cross-list from cs.CV) [pdf, html, other]
Title: ViSketch-GPT: Collaborative Multi-Scale Feature Extraction for Sketch Recognition and Generation
Giulio Federico, Giuseppe Amato, Fabio Carrara, Claudio Gennaro, Marco Di Benedetto
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)