Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.PF

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Performance

Authors and titles for April 2023

Total of 41 entries
Showing up to 50 entries per page: fewer | more | all
[1] arXiv:2304.00899 [pdf, html, other]
Title: Load Balancing with Job-Size Testing: Performance Improvement or Degradation?
Jonatha Anselmi, Josu Doncel
Subjects: Performance (cs.PF)
[2] arXiv:2304.03013 [pdf, other]
Title: Tensor Slicing and Optimization for Multicore NPUs
Rafael Sousa, Marcio Pereira, Yongin Kwon, Taeho Kim, Namsoon Jung, Chang Soo Kim, Michael Frank, Guido Araujo
Journal-ref: Journal of Parallel and Distributed Computing Journal of Parallel and Distributed Computing, Volume 175, May 2023, Pages 66-79
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)
[3] arXiv:2304.09568 [pdf, other]
Title: WASEF: Web Acceleration Solutions Evaluation Framework
Moumena Chaqfeh, Rashid Tahir, Ayaz Rehman, Jesutofunmi Kupoluyi, Saad Ullah, Russell Coke, Muhammad Junaid, Muhammad Arham, Marc Wiggerman, Abijith Radhakrishnan, Ivano Malavolta, Fareed Zaffar, Yasir Zaki
Comments: 15 pages, 4 figures
Subjects: Performance (cs.PF)
[4] arXiv:2304.10218 [pdf, other]
Title: An Analysis of the Completion Time of the BB84 Protocol
Sounak Kar, Jean-Yves Le Boudec
Subjects: Performance (cs.PF); Quantum Physics (quant-ph)
[5] arXiv:2304.11219 [pdf, other]
Title: LightningSim: Fast and Accurate Trace-Based Simulation for High-Level Synthesis
Rishov Sarkar, Cong Hao
Comments: 11 pages, 7 figures. Accepted at FCCM 2023
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR)
[6] arXiv:2304.13110 [pdf, html, other]
Title: Analysis and Mitigation of Shared Resource Contention on Heterogeneous Multicore: An Industrial Case Study
Michael Bechtel, Heechul Yun
Subjects: Performance (cs.PF)
[7] arXiv:2304.13231 [pdf, other]
Title: Performance of the Gittins Policy in the G/G/1 and G/G/k, With and Without Setup Times
Yige Hong, Ziv Scully
Journal-ref: Performance Evaluation 163 (2024), 102377
Subjects: Performance (cs.PF); Probability (math.PR)
[8] arXiv:2304.00396 (cross-list from cs.DC) [pdf, other]
Title: Managing Cold-start in The Serverless Cloud with Temporal Convolutional Networks
Tam N. Nguyen
Comments: 8 pages, 7 figures, 3 tables
Journal-ref: Future Generation Computer Systems (FGCS), 2024
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Performance (cs.PF); Systems and Control (eess.SY)
[9] arXiv:2304.00990 (cross-list from cs.CV) [pdf, other]
Title: Efficient human-in-loop deep learning model training with iterative refinement and statistical result validation
Manuel Zahn, Douglas P. Perrin
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Performance (cs.PF); Machine Learning (stat.ML)
[10] arXiv:2304.01433 (cross-list from cs.AR) [pdf, other]
Title: TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
Norman P. Jouppi, George Kurian, Sheng Li, Peter Ma, Rahul Nagarajan, Lifeng Nai, Nishant Patil, Suvinay Subramanian, Andy Swing, Brian Towles, Cliff Young, Xiang Zhou, Zongwei Zhou, David Patterson
Comments: 15 pages; 16 figures; to be published at ISCA 2023 (the International Symposium on Computer Architecture)
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)
[11] arXiv:2304.03487 (cross-list from cs.DC) [pdf, other]
Title: ParaGraph: Weighted Graph Representation for Performance Optimization of HPC Kernels
Ali TehraniJamsaz, Alok Mishra, Akash Dutta, Abid M. Malik, Barbara Chapman, Ali Jannesari
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Performance (cs.PF)
[12] arXiv:2304.05219 (cross-list from cs.LG) [pdf, html, other]
Title: BanditQ: Fair Bandits with Guaranteed Rewards
Abhishek Sinha
Comments: To appear in the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024), Barcelona, Spain
Subjects: Machine Learning (cs.LG); Performance (cs.PF)
[13] arXiv:2304.05237 (cross-list from cs.CR) [pdf, other]
Title: TREBUCHET: Fully Homomorphic Encryption Accelerator for Deep Computation
David Bruce Cousins, Yuriy Polyakov, Ahmad Al Badawi, Matthew French, Andrew Schmidt, Ajey Jacob, Benedict Reynwar, Kellie Canida, Akhilesh Jaiswal, Clynn Mathew, Homer Gamil, Negar Neda, Deepraj Soni, Michail Maniatakos, Brandon Reagen, Naifeng Zhang, Franz Franchetti, Patrick Brinich, Jeremy Johnson, Patrick Broderick, Mike Franusich, Bo Zhang, Zeming Cheng, Massoud Pedram
Comments: 6 pages, 5 figures and 2 tables
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[14] arXiv:2304.05544 (cross-list from cs.LG) [pdf, other]
Title: MEMA Runtime Framework: Minimizing External Memory Accesses for TinyML on Microcontrollers
Andrew Sabot, Vikas Natesh, H.T. Kung, Wei-Te Ting
Comments: Accepted as a full paper by the TinyML Research Symposium 2023
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Performance (cs.PF); Programming Languages (cs.PL)
[15] arXiv:2304.06441 (cross-list from math.NA) [pdf, other]
Title: Fast And Automatic Floating Point Error Analysis With CHEF-FP
Garima Singh, Baidyanath Kundu, Harshitha Menon, Alexander Penev, David J. Lange, Vassil Vassilev
Comments: 11 pages, to appear in the 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS'23)
Subjects: Numerical Analysis (math.NA); Hardware Architecture (cs.AR); Performance (cs.PF)
[16] arXiv:2304.06460 (cross-list from cs.PL) [pdf, other]
Title: Repositioning Tiered HotSpot Execution Performance Relative to the Interpreter
Jonathan Lambert, Kevin Casey, Rosemary Monahan
Comments: 17 pages
Subjects: Programming Languages (cs.PL); Performance (cs.PF)
[17] arXiv:2304.07741 (cross-list from cs.LG) [pdf, html, other]
Title: Canvas: End-to-End Kernel Architecture Search in Neural Networks
Chenggang Zhao, Genghan Zhang, Ao Shen, Mingyu Gao
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[18] arXiv:2304.08319 (cross-list from cs.LG) [pdf, html, other]
Title: Towards Computational Performance Engineering for Unsupervised Concept Drift Detection -- Complexities, Benchmarking, Performance Analysis
Elias Werner, Nishant Kumar, Matthias Lieber, Sunna Torge, Stefan Gumhold, Wolfgang E. Nagel
Comments: Accepted at 13th International Conference on Data Science, Technology and Applications (DATA). Source code: this https URL
Subjects: Machine Learning (cs.LG); Performance (cs.PF)
[19] arXiv:2304.08359 (cross-list from cs.LG) [pdf, other]
Title: Energy Efficiency Considerations for Popular AI Benchmarks
Raphael Fischer, Matthias Jakobs, Katharina Morik
Comments: Accepted at AAAI Conference on Artificial Intelligence 2023 - AI for Energy Innovation Workshop
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Performance (cs.PF)
[20] arXiv:2304.08532 (cross-list from cs.DB) [pdf, other]
Title: Hybrid Materialization in a Disk-Based Column-Store
Evgeniy Klyuchikov, Elena Mikhailova, George Chernishev
Subjects: Databases (cs.DB); Performance (cs.PF)
[21] arXiv:2304.08569 (cross-list from cs.DC) [pdf, other]
Title: Diagnosing applications' I/O behavior through system call observability
Tânia Esteves, Ricardo Macedo, Rui Oliveira, João Paulo
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Operating Systems (cs.OS); Performance (cs.PF)
[22] arXiv:2304.08697 (cross-list from cs.NI) [pdf, other]
Title: Performance Analysis and Comparison of Non-ideal Wireless PBFT and RAFT Consensus Networks in 6G Communications
Haoxiang Luo, Xiangyue Yang, Hongfang Yu, Gang Sun, Bo Lei, Mohsen Guizani
Comments: arXiv admin note: substantial text overlap with arXiv:2303.15759
Subjects: Networking and Internet Architecture (cs.NI); Performance (cs.PF); Signal Processing (eess.SP)
[23] arXiv:2304.08925 (cross-list from cs.LG) [pdf, other]
Title: Understand Data Preprocessing for Effective End-to-End Training of Deep Neural Networks
Ping Gong, Yuxin Ma, Cheng Li, Xiaosong Ma, Sam H. Noh
Subjects: Machine Learning (cs.LG); Performance (cs.PF)
[24] arXiv:2304.09511 (cross-list from cs.DC) [pdf, other]
Title: Morpheus unleashed: Fast cross-platform SpMV on emerging architectures
Christodoulos Stylianou, Mark Klaisoongnoen, Ricardo Jesus, Nick Brown, Michele Weiland
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[25] arXiv:2304.10162 (cross-list from math.PR) [pdf, other]
Title: Poly-Exp Bounds in Tandem Queues
Florin Ciucu, Sima Mehri
Subjects: Probability (math.PR); Performance (cs.PF)
[26] arXiv:2304.11136 (cross-list from cs.AR) [pdf, other]
Title: Integrating Per-Stream Stat Tracking into Accel-Sim
Shichen Qiao, Xin Su, Matthew D. Sinclair
Comments: 13 pages
Subjects: Hardware Architecture (cs.AR); Performance (cs.PF)
[27] arXiv:2304.11277 (cross-list from cs.DC) [pdf, other]
Title: PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel
Yanli Zhao, Andrew Gu, Rohan Varma, Liang Luo, Chien-Chin Huang, Min Xu, Less Wright, Hamid Shojanazeri, Myle Ott, Sam Shleifer, Alban Desmaison, Can Balioglu, Pritam Damania, Bernard Nguyen, Geeta Chauhan, Yuchen Hao, Ajit Mathews, Shen Li
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)
[28] arXiv:2304.11921 (cross-list from cs.DC) [pdf, other]
Title: Performance Evaluation of a Next-Generation SX-Aurora TSUBASA Vector Supercomputer
Keichi Takahashi, Soya Fujimoto, Satoru Nagase, Yoko Isobe, Yoichi Shimomura, Ryusuke Egawa, Hiroyuki Takizawa
Comments: This paper has been accepted at ISC 2023
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[29] arXiv:2304.12568 (cross-list from cs.DC) [pdf, other]
Title: Performance Optimization using Multimodal Modeling and Heterogeneous GNN
Akash Dutta, Jordi Alcaraz, Ali TehraniJamsaz, Eduardo Cesar, Anna Sikora, Ali Jannesari
Comments: 14 pages, 9 figures, 3 tables
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Performance (cs.PF)
[30] arXiv:2304.12673 (cross-list from quant-ph) [pdf, other]
Title: Tools for the analysis of quantum protocols requiring state generation within a time window
Bethany Davies, Thomas Beauchamp, Gayane Vardoyan, Stephanie Wehner
Subjects: Quantum Physics (quant-ph); Performance (cs.PF)
[31] arXiv:2304.12678 (cross-list from cs.NI) [pdf, other]
Title: Critical Comparative Analysis and Recommendation in MAC Protocols for Wireless Mesh Networks Using Multi-objective Optimization and Statistical Testing
Ankita Singh, Sudhakar Singh, Shiv Prakash
Comments: 20 pages, 4 figures, Wireless Personal Communication
Journal-ref: Wireless Pers Commun 129, 2319-2344 (2023)
Subjects: Networking and Internet Architecture (cs.NI); Performance (cs.PF)
[32] arXiv:2304.12829 (cross-list from cs.LG) [pdf, other]
Title: Improving Robustness Against Adversarial Attacks with Deeply Quantized Neural Networks
Ferheen Ayaz, Idris Zakariyya, José Cano, Sye Loong Keoh, Jeremy Singer, Danilo Pau, Mounia Kharbouche-Harrari
Comments: Accepted at IJCNN 2023. 8 pages, 5 figures
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Performance (cs.PF)
[33] arXiv:2304.13039 (cross-list from eess.SY) [pdf, other]
Title: Optimizing Deep Learning Models For Raspberry Pi
Salem Ameen, Kangaranmulle Siriwardana, Theo Theodoridis
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)
[34] arXiv:2304.13278 (cross-list from cs.CR) [pdf, other]
Title: Understanding the Security and Performance of the Web Presence of Hospitals: A Measurement Study
Mohammed Alkinoon, Abdulrahman Alabduljabbar, Hattan Althebeiti, Rhongho Jang, DaeHun Nyang, David Mohaisen
Comments: 10 pages, 5 tables, 10 figures
Subjects: Cryptography and Security (cs.CR); Computers and Society (cs.CY); Performance (cs.PF)
[35] arXiv:2304.13302 (cross-list from cs.DC) [pdf, other]
Title: HiQ -- A Declarative, Non-intrusive, Dynamic and Transparent Observability and Optimization System
Fuheng Wu, Ivan Davchev, Jun Qian
Comments: 7 pages, 12 figures, opensource
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)
[36] arXiv:2304.13458 (cross-list from cs.CR) [pdf, other]
Title: Thwarting Code-Reuse and Side-Channel Attacks in Embedded Systems
Rodothea Myrsini Tsoupidi, Elena Troubitsyna, Panagiotis Papadimitratos
Subjects: Cryptography and Security (cs.CR); Performance (cs.PF)
[37] arXiv:2304.13541 (cross-list from cs.DC) [pdf, other]
Title: D-STACK: High Throughput DNN Inference by Effective Multiplexing and Spatio-Temporal Scheduling of GPUs
Aditya Dhakal, Sameer G. Kulkarni, K. K. Ramakrishnan
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF); Systems and Control (eess.SY)
[38] arXiv:2304.13551 (cross-list from cs.MM) [pdf, other]
Title: Latency Target based Analysis of the DASH.js Player
Piers O'Hanlon, Adil Aslam
Comments: To be published in Proceedings of the 14th ACM Multimedia Systems Conference (MMSys '23), June 7-10, 2023, Vancouver, BC, Canada
Subjects: Multimedia (cs.MM); Networking and Internet Architecture (cs.NI); Performance (cs.PF)
[39] arXiv:2304.14226 (cross-list from cs.LG) [pdf, other]
Title: TorchBench: Benchmarking PyTorch with High API Surface Coverage
Yueming Hao, Xu Zhao, Bin Bao, David Berard, Will Constable, Adnan Aziz, Xu Liu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Performance (cs.PF)
[40] arXiv:2304.14359 (cross-list from cs.CY) [pdf, other]
Title: Measuring and Modeling the Free Content Web
Abdulrahman Alabduljabbar, Runyu Ma, Ahmed Abusnaina, Rhongho Jang, Songqing Chen, DaeHun Nyang, and David Mohaisen
Comments: 30 pages, 3 tables, 9 figures. Under review by Computer Networks
Subjects: Computers and Society (cs.CY); Cryptography and Security (cs.CR); Performance (cs.PF)
[41] arXiv:2304.14790 (cross-list from cs.SE) [pdf, other]
Title: A Benchmarking Proposal for DevOps Practices on Open Source Software Projects
José Manuel Sánchez Ruiz, Francisco José Domínguez Mayo, Xavier Oriol, José Francisco Crespo, David Benavides, Ernest Teniente
Comments: 18 pages, 10 figures
Subjects: Software Engineering (cs.SE); Performance (cs.PF)
Total of 41 entries
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack