Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.PF

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Performance

Authors and titles for recent submissions

  • Tue, 26 Aug 2025
  • Mon, 25 Aug 2025
  • Fri, 22 Aug 2025
  • Thu, 21 Aug 2025
  • Wed, 20 Aug 2025

See today's new changes

Total of 25 entries
Showing up to 50 entries per page: fewer | more | all

Tue, 26 Aug 2025 (showing 13 of 13 entries )

[1] arXiv:2508.17518 [pdf, html, other]
Title: Evaluating Compiler Optimization Impacts on zkVM Performance
Thomas Gassmann, Stefanos Chaliasos, Thodoris Sotiropoulos, Zhendong Su
Subjects: Performance (cs.PF)
[2] arXiv:2508.17372 [pdf, html, other]
Title: The Unwritten Contract of Cloud-based Elastic Solid-State Drives
Yingjia Wang, Ming-Chang Yang
Comments: Accepted and to appear in DAC 2025
Subjects: Performance (cs.PF)
[3] arXiv:2508.16996 [pdf, other]
Title: Evaluación y modelado del rendimiento de los sistemas informáticos
Xavier Molero, Carlos Juiz, Miguel Jesus Rodeno
Comments: in Spanish language
Subjects: Performance (cs.PF)
[4] arXiv:2508.16712 [pdf, html, other]
Title: Systematic Characterization of LLM Quantization: A Performance, Energy, and Quality Perspective
Tianyao Shi, Yi Ding
Comments: 14 pages, 10 figures, 4 tables
Subjects: Performance (cs.PF); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[5] arXiv:2508.16703 [pdf, html, other]
Title: Dynamic Sparse Attention on Mobile SoCs
Wangsong Yin, Daliang Xu, Mengwei Xu, Gang Huang, Xuanzhe Liu
Comments: Technical Report
Subjects: Performance (cs.PF); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[6] arXiv:2508.16653 [pdf, html, other]
Title: H2EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference
Zizhuo Fu, Xiaotian Guo, Wenxuan Zeng, Shuzhang Zhong, Yadong Zhang, Peiyu Chen, Runsheng Wang, Le Ye, Meng Li
Comments: International Conference on Computer-Aided Design (ICCAD) 2025
Subjects: Performance (cs.PF)
[7] arXiv:2508.17493 (cross-list from cs.DC) [pdf, html, other]
Title: Easy Acceleration with Distributed Arrays
Jeremy Kepner, Chansup Byun, LaToya Anderson, William Arcand, David Bestor, William Bergeron, Alex Bonn, Daniel Burrill, Vijay Gadepally, Ryan Haney, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Piotr Luszczek, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Charles Yee, Peter Michaleas
Comments: 8 pages, 4 figures, 2 tables, 2 algorithm listings, 2 code listings, to appear in IEEE HPEC 2025
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computational Engineering, Finance, and Science (cs.CE); Mathematical Software (cs.MS); Performance (cs.PF)
[8] arXiv:2508.17467 (cross-list from cs.LG) [pdf, html, other]
Title: MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models
Krishna Teja Chitty-Venkata, Sylvia Howland, Golara Azar, Daria Soboleva, Natalia Vassilieva, Siddhisanket Raskar, Murali Emani, Venkatram Vishwanath
Comments: Preprint
Subjects: Machine Learning (cs.LG); Performance (cs.PF)
[9] arXiv:2508.17344 (cross-list from cs.SE) [pdf, html, other]
Title: Who Wins the Race? (R Vs Python) - An Exploratory Study on Energy Consumption of Machine Learning Algorithms
Rajrupa Chattaraj, Sridhar Chimalakonda, Vibhu Saujanya Sharma, Vikrant Kaulgud
Comments: 18 pages including references, 5 figures
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG); Performance (cs.PF); Programming Languages (cs.PL)
[10] arXiv:2508.17311 (cross-list from cs.DC) [pdf, html, other]
Title: Bine Trees: Enhancing Collective Operations by Optimizing Communication Locality
Daniele De Sensi, Saverio Pasqualoni, Lorenzo Piarulli, Tommaso Bonato, Seydou Ba, Matteo Turisini, Jens Domke, Torsten Hoefler
Journal-ref: Proceedings of The International Conference for High Performance Computing Networking, Storage, and Analysis (SC '25) (2025)
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Performance (cs.PF)
[11] arXiv:2508.16809 (cross-list from cs.DC) [pdf, html, other]
Title: PICO: Performance Insights for Collective Operations
Saverio Pasqualoni, Lorenzo Piarulli, Daniele De Sensi
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[12] arXiv:2508.16700 (cross-list from cs.AR) [pdf, html, other]
Title: GPT-OSS-20B: A Comprehensive Deployment-Centric Analysis of OpenAI's Open-Weight Mixture of Experts Model
Deepak Kumar, Divakar Yadav, Yash Patel
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[13] arXiv:2508.16592 (cross-list from cs.DC) [pdf, other]
Title: Performance measurements of modern Fortran MPI applications with Score-P
Gregor Corbin
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Mathematical Software (cs.MS); Performance (cs.PF)

Mon, 25 Aug 2025 (showing 2 of 2 entries )

[14] arXiv:2508.16449 [pdf, html, other]
Title: GreenLLM: SLO-Aware Dynamic Frequency Scaling for Energy-Efficient LLM Serving
Qunyou Liu, Darong Huang, Marina Zapater, David Atienza
Subjects: Performance (cs.PF)
[15] arXiv:2508.16293 [pdf, html, other]
Title: Two-Timescale Dynamic Service Deployment and Task Scheduling with Spatiotemporal Collaboration in Mobile Edge Networks
Yang Li, Xing Zhang, Yunji Zhao, Wenbo Wang
Comments: This paper is accepted by IEEE Globecom 2025
Subjects: Performance (cs.PF)

Fri, 22 Aug 2025 (showing 3 of 3 entries )

[16] arXiv:2508.15601 (cross-list from cs.DC) [pdf, html, other]
Title: Efficient Mixed-Precision Large Language Model Inference with TurboMind
Li Zhang, Youhe Jiang, Guoliang He, Xin Chen, Han Lv, Qian Yao, Fangcheng Fu, Kai Chen
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[17] arXiv:2508.15478 (cross-list from cs.CL) [pdf, html, other]
Title: SLM-Bench: A Comprehensive Benchmark of Small Language Models on Environmental Impacts -- Extended Version
Nghiem Thanh Pham, Tung Kieu, Duc-Manh Nguyen, Son Ha Xuan, Nghia Duong-Trung, Danh Le-Phuoc
Comments: 24 pages. An extended version of "SLM-Bench: A Comprehensive Benchmark of Small Language Models on Environmental Impacts" accepted at EMNLP 2025
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Performance (cs.PF)
[18] arXiv:2508.15357 (cross-list from cs.CL) [pdf, html, other]
Title: KG-EDAS: A Meta-Metric Framework for Evaluating Knowledge Graph Completion Models
Haji Gul, Abul Ghani Naim, Ajaz Ahmad Bhat
Subjects: Computation and Language (cs.CL); Performance (cs.PF)

Thu, 21 Aug 2025 (showing 2 of 2 entries )

[19] arXiv:2508.14209 (cross-list from math.NA) [pdf, html, other]
Title: A High Performance GPU CountSketch Implementation and Its Application to Multisketching and Least Squares Problems
Andrew J. Higgins, Erik G. Boman, Ichitaro Yamazaki
Comments: 8 pages
Subjects: Numerical Analysis (math.NA); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[20] arXiv:2508.14117 (cross-list from astro-ph.IM) [pdf, html, other]
Title: SYCL for Energy-Efficient Numerical Astrophysics: the case of DPEcho
Salvatore Cielo, Alexander Pöppl, Ivan Pribec
Comments: 11 pages, 6 figures, 2 tables
Journal-ref: PECS workshop proceedings at EUROPAR 2025
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Performance (cs.PF)

Wed, 20 Aug 2025 (showing 5 of 5 entries )

[21] arXiv:2508.13249 [pdf, other]
Title: Multi-Metric Algorithmic Complexity: Beyond Asymptotic Analysis
Sergii Kavun
Comments: 24 pages, 12 figures, 3 tables
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR); Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS)
[22] arXiv:2508.13523 (cross-list from cs.DC) [pdf, html, other]
Title: LAMMPS-KOKKOS: Performance Portable Molecular Dynamics Across Exascale Architectures
Anders Johansson, Evan Weinberg, Christian R. Trott, Megan J. McCarthy, Stan G. Moore
Comments: 14 pages, 6 figures
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF); Computational Physics (physics.comp-ph)
[23] arXiv:2508.13298 (cross-list from cs.DC) [pdf, html, other]
Title: Harnessing the Full Potential of RRAMs through Scalable and Distributed In-Memory Computing with Integrated Error Correction
Huynh Q. N. Vo, Md Tawsif Rahman Chowdhury, Paritosh Ramanan, Murat Yildirim, Gozde Tutuncuoglu
Comments: Submitted to Nature Communication Contact authors for any info
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR); Emerging Technologies (cs.ET); Performance (cs.PF); Systems and Control (eess.SY)
[24] arXiv:2508.13231 (cross-list from cs.AR) [pdf, html, other]
Title: Accelerating LLM Inference via Dynamic KV Cache Placement in Heterogeneous Memory System
Yunhua Fang, Rui Xie, Asad Ul Haq, Linsen Ma, Kaoutar El Maghraoui, Naigang Wang, Meng Wang, Liu Liu, Tong Zhang
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Performance (cs.PF)
[25] arXiv:2508.13159 (cross-list from cs.AR) [pdf, html, other]
Title: Accelerating Transistor-Level Simulation of Integrated Circuits via Equivalence of RC Long-Chain Structures
Ruibai Tang, Wenlai Zhao
Subjects: Hardware Architecture (cs.AR); Performance (cs.PF)
Total of 25 entries
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack