Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.AR

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Hardware Architecture

Authors and titles for March 2023

Total of 83 entries : 1-25 26-50 51-75 76-83
Showing up to 25 entries per page: fewer | more | all
[51] arXiv:2303.04739 (cross-list from cs.CV) [pdf, other]
Title: Advancing Direct Convolution using Convolution Slicing Optimization and ISA Extensions
Victor Ferrari, Rafael Sousa, Marcio Pereira, João P. L. de Carvalho, José Nelson Amaral, José Moreira, Guido Araujo
Comments: 15 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR); Machine Learning (cs.LG); Performance (cs.PF)
[52] arXiv:2303.05919 (cross-list from cs.PF) [pdf, other]
Title: eBPF-based Working Set Size Estimation in Memory Management
Zhilu Lian, Yangzi Li, Zhixiang Chen, Shiwen Shan, Baoxin Han, Yuxin Su
Comments: 8 pages, 6 figures
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[53] arXiv:2303.06150 (cross-list from cs.CE) [pdf, other]
Title: Improving computation efficiency using input and architecture features for a virtual screening application
Gianmarco Accordi, Emanuele Vitali, Davide Gadioli, Luigi Crisci, Biagio Cosenza, Mauro Bisson, Massimiliano Fatica, Andrea Beccari, Gianluca Palermo
Subjects: Computational Engineering, Finance, and Science (cs.CE); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[54] arXiv:2303.06153 (cross-list from cs.PF) [pdf, html, other]
Title: CXLMemSim: A pure software simulated CXL.mem for performance characterization
Yiwei Yang, Brian Zhao, Yusheng Zheng, Pooneh Safayenikoo, Tanvir Ahmed Khan, Andi Quinn
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR); Operating Systems (cs.OS)
[55] arXiv:2303.06169 (cross-list from cs.LG) [pdf, other]
Title: MOELA: A Multi-Objective Evolutionary/Learning Design Space Exploration Framework for 3D Heterogeneous Manycore Platforms
Sirui Qi, Yingheng Li, Sudeep Pasricha, Ryan Gary Kim
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[56] arXiv:2303.06182 (cross-list from cs.DC) [pdf, other]
Title: Towards MoE Deployment: Mitigating Inefficiencies in Mixture-of-Expert (MoE) Inference
Haiyang Huang, Newsha Ardalani, Anna Sun, Liu Ke, Hsien-Hsin S. Lee, Anjali Sridhar, Shruti Bhosale, Carole-Jean Wu, Benjamin Lee
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[57] arXiv:2303.06420 (cross-list from cs.DC) [pdf, other]
Title: Design and Evaluation of a Rack-Scale Disaggregated Memory Architecture For Data Centers
Amit Puri, John Jose, Tamarapalli Venkatesh
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[58] arXiv:2303.06931 (cross-list from cs.LG) [pdf, other]
Title: DeepVigor: Vulnerability Value Ranges and Factors for DNNs' Reliability Assessment
Mohammad Hasan Ahmadilivani, Mahdi Taheri, Jaan Raik, Masoud Daneshtalab, Maksim Jenihhin
Comments: 6 pages, 6 figures, 2 tables, accepted at ETS 2023 (this http URL)
Journal-ref: ETS 2023
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[59] arXiv:2303.07470 (cross-list from cs.LG) [pdf, other]
Title: X-Former: In-Memory Acceleration of Transformers
Shrihari Sridharan, Jacob R. Stevens, Kaushik Roy, Anand Raghunathan
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[60] arXiv:2303.08226 (cross-list from cs.LG) [pdf, other]
Title: DeepAxe: A Framework for Exploration of Approximation and Reliability Trade-offs in DNN Accelerators
Mahdi Taheri, Mohammad Riazati, Mohammad Hasan Ahmadilivani, Maksim Jenihhin, Masoud Daneshtalab, Jaan Raik, Mikael Sjodin, Bjorn Lisper
Comments: This paper is accepted at the 24th International Symposium on Quality Electronic Design (ISQED) 2023, 8 pages, 4 figures, 4 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[61] arXiv:2303.08396 (cross-list from cs.DC) [pdf, other]
Title: Workload Behavior Driven Memory Subsystem Design for Hyperscale
Suyash Mahar (UC San Diego), Hao Wang (NVIDIA), Wei Shu (Tenstorrent), Abhishek Dhanotia (Meta Inc.)
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[62] arXiv:2303.08706 (cross-list from eess.SY) [pdf, other]
Title: Hybrid Modular Redundancy: Exploring Modular Redundancy Approaches in RISC-V Multi-Core Computing Clusters for Reliable Processing in Space
Michael Rogenmoser, Yvan Tortorella, Davide Rossi, Francesco Conti, Luca Benini
Subjects: Systems and Control (eess.SY); Hardware Architecture (cs.AR)
[63] arXiv:2303.09565 (cross-list from cs.SE) [pdf, other]
Title: A SysML-based language for evaluating the integrity of simulation and physical embodiments of Cyber-Physical systems
Wojciech Dudek, Narcis Miguel, Tomasz Winiarski
Journal-ref: Robotics and Autonomous Systems, vol. 185, pp. 104884, 2025
Subjects: Software Engineering (cs.SE); Hardware Architecture (cs.AR); Multiagent Systems (cs.MA); Robotics (cs.RO); Systems and Control (eess.SY)
[64] arXiv:2303.10702 (cross-list from cs.LG) [pdf, other]
Title: Evaluation of Convolution Primitives for Embedded Neural Networks on 32-bit Microcontrollers
Baptiste Nguyen, Pierre-Alain Moellic, Sylvain Blayac
Comments: ISDA 2022
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[65] arXiv:2303.10901 (cross-list from cs.DC) [pdf, other]
Title: E2C: A Visual Simulator to Reinforce Education of Heterogeneous Computing Systems
Ali Mokhtari, Drake Rawls, Tony Huynh, Jeremiah Green, Mohsen Amini Salehi
Comments: Accepted in Edupar '23, as part of IPDPS '23 Conference. arXiv admin note: text overlap with arXiv:2212.11333
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR); Operating Systems (cs.OS)
[66] arXiv:2303.11049 (cross-list from cs.CY) [pdf, other]
Title: Nanomodular Electronics
Michael Filler, Benjamin Reinhardt
Comments: 55 pages, 15 figures
Subjects: Computers and Society (cs.CY); Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
[67] arXiv:2303.11499 (cross-list from cs.DC) [pdf, html, other]
Title: CELLO: Co-designing Schedule and Hybrid Implicit/Explicit Buffer for Complex Tensor Reuse
Raveesh Garg, Michael Pellauer, Sivasankaran Rajamanickam, Tushar Krishna
Comments: Accepted for publication at the 39th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2025)
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[68] arXiv:2303.11733 (cross-list from cs.PF) [pdf, other]
Title: DIPPM: a Deep Learning Inference Performance Predictive Model using Graph Neural Networks
Karthick Panner Selvam, Mats Brorsson
Subjects: Performance (cs.PF); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[69] arXiv:2303.12258 (cross-list from cs.PF) [pdf, other]
Title: How does SSD Cluster Perform for Distributed File Systems: An Empirical Study
Jiashu Wu, Yang Wang, Jinpeng Wang, Hekang Wang, Taorui Lin
Comments: Accepted by Concurrency and Computation: Practice and Experience
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Applications (stat.AP)
[70] arXiv:2303.12529 (cross-list from cs.CV) [pdf, other]
Title: DevelSet: Deep Neural Level Set for Instant Mask Optimization
Guojin Chen, Ziyang Yu, Hongduo Liu, Yuzhe Ma, Bei Yu
Comments: Accepted by ICCAD21
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[71] arXiv:2303.12901 (cross-list from cs.DC) [pdf, other]
Title: Dynasparse: Accelerating GNN Inference through Dynamic Sparsity Exploitation
Bingyi Zhang, Viktor Prasanna
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[72] arXiv:2303.12910 (cross-list from cs.LG) [pdf, other]
Title: Cross-Layer Design for AI Acceleration with Non-Coherent Optical Computing
Febin Sunny, Mahdi Nikdast, Sudeep Pasricha
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
[73] arXiv:2303.12914 (cross-list from cs.LG) [pdf, other]
Title: TRON: Transformer Neural Network Acceleration with Non-Coherent Silicon Photonics
Salma Afifi, Febin Sunny, Mahdi Nikdast, Sudeep Pasricha
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[74] arXiv:2303.13060 (cross-list from cs.CV) [pdf, other]
Title: DiffPattern: Layout Pattern Generation via Discrete Diffusion
Zixiao Wang, Yunheng Shen, Wenqian Zhao, Yang Bai, Guojin Chen, Farzan Farnia, Bei Yu
Comments: DAC2023 Accepted
Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR)
[75] arXiv:2303.13601 (cross-list from eess.IV) [pdf, other]
Title: Scaled Quantization for the Vision Transformer
Yangyang Chang, Gerald E. Sobelman
Comments: 9 pages, 0 figure
Journal-ref: Electrical and Electronics Engineering: An International Journal (ELELIJ), Vol.12, No.1, February 2023
Subjects: Image and Video Processing (eess.IV); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)
Total of 83 entries : 1-25 26-50 51-75 76-83
Showing up to 25 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack