Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.AR

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Hardware Architecture

Authors and titles for July 2025

Total of 127 entries
Showing up to 2000 entries per page: fewer | more | all
[1] arXiv:2507.00367 [pdf, html, other]
Title: Presto: Hardware Acceleration of Ciphers for Hybrid Homomorphic Encryption
Yeonsoo Jeon, Mattan Erez, Michael Orshansky
Subjects: Hardware Architecture (cs.AR); Cryptography and Security (cs.CR)
[2] arXiv:2507.00642 [pdf, html, other]
Title: ChatHLS: Towards Systematic Design Automation and Optimization for High-Level Synthesis
Runkai Li, Jia Xiong, Xiuyuan He, Jiaqi Lv, Jieru Zhao, Xi Wang
Subjects: Hardware Architecture (cs.AR)
[3] arXiv:2507.00797 [pdf, html, other]
Title: VEDA: Efficient LLM Generation Through Voting-based KV Cache Eviction and Dataflow-flexible Accelerator
Zhican Wang, Hongxiang Fan, Haroon Waris, Gang Wang, Zhenyu Li, Jianfei Jiang, Yanan Sun, Guanghui He
Comments: DAC 2025
Subjects: Hardware Architecture (cs.AR)
[4] arXiv:2507.01145 [pdf, html, other]
Title: CarbonClarity: Understanding and Addressing Uncertainty in Embodied Carbon for Sustainable Computing
Xuesi Chen, Leo Han, Anvita Bhagavathula, Udit Gupta
Subjects: Hardware Architecture (cs.AR)
[5] arXiv:2507.01309 [pdf, html, other]
Title: SD-Acc: Accelerating Stable Diffusion through Phase-aware Sampling and Hardware Co-Optimizations
Zhican Wang, Guanghui He, Hongxiang Fan
Comments: Under Review
Subjects: Hardware Architecture (cs.AR)
[6] arXiv:2507.02067 [pdf, html, other]
Title: Advanced Printed Sensors for Environmental Applications: A Path Towards Sustainable Monitoring Solutions
Nikolaos Papanikolaou, Doha Touhafi, Jurgen Vandendriessche, Danial Karimi, Sohail Fatimi, Gianluca Cornetta, Abdellah Touhafi
Subjects: Hardware Architecture (cs.AR)
[7] arXiv:2507.02456 [pdf, html, other]
Title: System-performance and cost modeling of Large Language Model training and inference
Wenzhe Guo, Joyjit Kundu, Uras Tos, Weijiang Kong, Giuliano Sisto, Timon Evenblij, Manu Perumkunnil
Subjects: Hardware Architecture (cs.AR)
[8] arXiv:2507.02598 [pdf, html, other]
Title: AC-Refiner: Efficient Arithmetic Circuit Optimization Using Conditional Diffusion Models
Chenhao Xue, Kezhi Li, Jiaxing Zhang, Yi Ren, Zhengyuan Shi, Chen Zhang, Yibo Lin, Lining Zhang, Qiang Xu, Guangyu Sun
Comments: 8 pages, 12 figures, to appear in ASP-DAC'26
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[9] arXiv:2507.02654 [pdf, html, other]
Title: Breaking the HBM Bit Cost Barrier: Domain-Specific ECC for AI Inference Infrastructure
Rui Xie, Asad Ul Haq, Yunhua Fang, Linsen Ma, Sanchari Sen, Swagath Venkataramani, Liu Liu, Tong Zhang
Subjects: Hardware Architecture (cs.AR)
[10] arXiv:2507.03255 [pdf, html, other]
Title: ForgeHLS: A Large-Scale, Open-Source Dataset for High-Level Synthesis
Zedong Peng, Zeju Li, Mingzhe Gao, Qiang Xu, Chen Zhang, Jieru Zhao
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[11] arXiv:2507.03308 [pdf, html, other]
Title: Hummingbird: A Smaller and Faster Large Language Model Accelerator on Embedded FPGA
Jindong Li, Tenglong Li, Ruiqi Chen, Guobin Shen, Dongcheng Zhao, Qian Zhang, Yi Zeng
Comments: Accepted by ICCAD2025
Subjects: Hardware Architecture (cs.AR)
[12] arXiv:2507.03522 [pdf, html, other]
Title: A Flexible Instruction Set Architecture for Efficient GEMMs
Alexandre de Limas Santana, Adrià Armejach, Francesc Martinez, Erich Focht, Marc Casas
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[13] arXiv:2507.04276 [pdf, html, other]
Title: FIXME: Towards End-to-End Benchmarking of LLM-Aided Design Verification
Gwok-Waa Wan, Shengchu Su, Ruihu Wang, Qixiang Chen, Sam-Zaak Wong, Mengnv Xing, Hefei Feng, Yubo Wang, Yinan Zhu, Jingyi Zhang, Jianmin Ye, Xinlai Wan, Tao Ni, Qiang Xu, Nan Guan, Zhe Jiang, Xi Wang, Yang Jun
Subjects: Hardware Architecture (cs.AR)
[14] arXiv:2507.04315 [pdf, html, other]
Title: HLStrans: Dataset for LLM-Driven C-to-HLS Hardware Code Synthesis
Qingyun Zou, Nuo Chen, Yao Chen, Bingsheng He, WengFei Wong
Subjects: Hardware Architecture (cs.AR)
[15] arXiv:2507.04535 [pdf, html, other]
Title: da4ml: Distributed Arithmetic for Real-time Neural Networks on FPGAs
Chang Sun, Zhiqiang Que, Vladimir Loncar, Wayne Luk, Maria Spiropulu
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG); High Energy Physics - Experiment (hep-ex)
[16] arXiv:2507.04677 [pdf, html, other]
Title: NeuroPDE: A Neuromorphic PDE Solver Based on Spintronic and Ferroelectric Devices
Siqing Fu, Lizhou Wu, Tiejun Li, Chunyuan Zhang, Sheng Ma, Jianmin Zhang, Yuhan Tang, Jixuan Tang
Comments: 9 pages, 12 figures, accepted at ICCAD 2025 (The 2025 IEEE/ACM International Conference on Computer-Aided Design)
Subjects: Hardware Architecture (cs.AR)
[17] arXiv:2507.04772 [pdf, html, other]
Title: Jack Unit: An Area- and Energy-Efficient Multiply-Accumulate (MAC) Unit Supporting Diverse Data Formats
Seock-Hwan Noh, Sungju Kim, Seohyun Kim, Daehoon Kim, Jaeha Kung, Yeseong Kim
Comments: Accepted for publication at the 30th ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED 2025)
Subjects: Hardware Architecture (cs.AR)
[18] arXiv:2507.05012 [pdf, html, other]
Title: Optimizing Scalable Multi-Cluster Architectures for Next-Generation Wireless Sensing and Communication
Samuel Riedel, Yichao Zhang, Marco Bertuletti, Luca Benini
Comments: 6 pages, 8 figures, accepted at IWASI 2025
Subjects: Hardware Architecture (cs.AR)
[19] arXiv:2507.05081 [pdf, html, other]
Title: ViPSN 2.0: A Reconfigurable Battery-free IoT Platform for Vibration Energy Harvesting
Xin Li, Mianxin Xiao, Xi Shen, Jiaqing Chu, Weifeng Huang, Jiashun Li, Yaoyi Li, Mingjing Cai, Jiaming Chen, Xinming Zhang, Daxing Zhang, Congsi Wang, Hong Tang, Bao Zhao, Qitao Lu, Yilong Wang, Jianjun Wang, Minyi Xu, Shitong Fang, Xuanyu Huang. Chaoyang Zhao, Zicheng Liu, Yaowen Yang, Guobiao Hu, Junrui Liang, Wei-Hsin Liao
Subjects: Hardware Architecture (cs.AR)
[20] arXiv:2507.05556 [pdf, html, other]
Title: Per-Row Activation Counting on Real Hardware: Demystifying Performance Overheads
Jumin Kim, Seungmin Baek, Minbok Wi, Hwayong Nam, Michael Jaemin Kim, Sukhan Lee, Kyomin Sohn, Jung Ho Ahn
Comments: 4 pages, 4 figures, to appear at IEEE Computer Architecture Letters
Subjects: Hardware Architecture (cs.AR); Cryptography and Security (cs.CR)
[21] arXiv:2507.05681 [pdf, html, other]
Title: GATMesh: Clock Mesh Timing Analysis using Graph Neural Networks
Muhammad Hadir Khan, Matthew Guthaus
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[22] arXiv:2507.06069 [pdf, html, other]
Title: RTGPU: Real-Time Computing with Graphics Processing Units
Atiyeh Gheibi-Fetrat, Amirsaeed Ahmadi-Tonekaboni, Farzam Koohi-Ronaghi, Pariya Hajipour, Sana Babayan-Vanestan, Fatemeh Fotouhi, Elahe Mortazavian-Farsani, Pouria Khajehpour-Dezfouli, Sepideh Safari, Shaahin Hessabi, Hamid Sarbazi-Azad
Comments: This document provides a concise summary of the book RTGPU, submitted to Synthesis Lectures on Computer Architecture. Due to copyright restrictions, the full content is not reproduced here; readers are referred to the complete book for more comprehensive details
Subjects: Hardware Architecture (cs.AR)
[23] arXiv:2507.06127 [pdf, html, other]
Title: PrefixAgent: An LLM-Powered Design Framework for Efficient Prefix Adder Optimization
Dongsheng Zuo, Jiadong Zhu, Yang Luo, Yuzhe Ma
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[24] arXiv:2507.06376 [pdf, html, other]
Title: SLDB: An End-To-End Heterogeneous System-on-Chip Benchmark Suite for LLM-Aided Design
Elisavet Lydia Alvanaki, Kevin Lee, Luca P. Carloni
Subjects: Hardware Architecture (cs.AR)
[25] arXiv:2507.06512 [pdf, html, other]
Title: Towards LLM-based Root Cause Analysis of Hardware Design Failures
Siyu Qiu, Muzhi Wang, Raheel Afsharmazayejani, Mohammad Moradi Shahmiri, Benjamin Tan, Hammond Pearce
Comments: 6 pages. Accepted for publication in IEEE COINS 2025 Special Session on LLMs for EDA and Security
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[26] arXiv:2507.07044 [pdf, html, other]
Title: Opto-ViT: Architecting a Near-Sensor Region of Interest-Aware Vision Transformer Accelerator with Silicon Photonics
Mehrdad Morsali, Chengwei Zhou, Deniz Najafi, Sreetama Sarkar, Pietro Mercati, Navid Khoshavi, Peter Beerel, Mahdi Nikdast, Gourav Datta, Shaahin Angizi
Subjects: Hardware Architecture (cs.AR)
[27] arXiv:2507.07683 [pdf, html, other]
Title: Accelerating Transposed Convolutions on FPGA-based Edge Devices
Jude Haris, José Cano
Comments: Accepted to 35th International Conference on Field-Programmable Logic and Applications (FPL) 2025
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[28] arXiv:2507.08406 [pdf, other]
Title: CCSS: Hardware-Accelerated RTL Simulation with Fast Combinational Logic Computing and Sequential Logic Synchronization
Weigang Feng, Yijia Zhang, Zekun Wang, Zhengyang Wang, Yi Wang, Peijun Ma, Ningyi Xu
Comments: We plan to add more experiments and refine the figures in the paper. In addition, the overall structure needs significant revision to improve its readability
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[29] arXiv:2507.08658 [pdf, html, other]
Title: Fast and Efficient Merge of Sorted Input Lists in Hardware Using List Offset Merge Sorters
Robert B. Kent, Marios S. Pattichis
Subjects: Hardware Architecture (cs.AR); Data Structures and Algorithms (cs.DS); Image and Video Processing (eess.IV)
[30] arXiv:2507.08923 [pdf, html, other]
Title: CEO-DC: Driving Decarbonization in HPC Data Centers with Actionable Insights
Rubén Rodríguez Álvarez, Denisa-Andreea Constantinescu, Miguel Peón-Quirós, David Atienza
Comments: 17 pages, 7 figures, 8 tables
Subjects: Hardware Architecture (cs.AR); Computers and Society (cs.CY); Performance (cs.PF)
[31] arXiv:2507.09010 [pdf, html, other]
Title: Hybrid Systolic Array Accelerator with Optimized Dataflow for Edge Large Language Model Inference
Chun-Ting Chen, HanGyeol Mun, Jian Meng, Mohamed S. Abdelfattah, Jae-sun Seo
Comments: Accepted as a conference paper at the 2025 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[32] arXiv:2507.09201 [pdf, html, other]
Title: SLIM: A Heterogeneous Accelerator for Edge Inference of Sparse Large Language Model via Adaptive Thresholding
Weihong Xu, Haein Choi, Po-kai Hsu, Shimeng Yu, Tajana Rosing
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[33] arXiv:2507.09660 [pdf, html, other]
Title: Tools and Methodologies for System-Level Design
Shuvra S. Bhattacharyya, Marilyn Wolf
Comments: This is a preprint of a chapter to appear in the forthcoming volume Handbook on Electronic Design Automation (third edition), published by Taylor & Francis. The final version may differ
Subjects: Hardware Architecture (cs.AR)
[34] arXiv:2507.09730 [pdf, html, other]
Title: Efficient FRW Transitions via Stochastic Finite Differences for Handling Non-Stratified Dielectrics
Jiechen Huang, Wenjian Yu
Comments: 5 pages, 6 figures
Subjects: Hardware Architecture (cs.AR); Numerical Analysis (math.NA)
[35] arXiv:2507.09774 [pdf, html, other]
Title: Low-Cost Fuel Dispenser Prototype Using STM32 and an H-bridge motor driver
MD Zobaer Hossain Bhuiyan, Abir Bin Faruque, Mahtab Newaz, Mohammad Abdul Qayum
Subjects: Hardware Architecture (cs.AR)
[36] arXiv:2507.09780 [pdf, html, other]
Title: BitParticle: Partializing Sparse Dual-Factors to Build Quasi-Synchronizing MAC Arrays for Energy-efficient DNNs
Feilong Qiaoyuan, Jihe Wang, Zhiyu Sun, Linying Wu, Yuanhua Xiao, Danghui Wang
Comments: 9 pages, 13 figures, 3 Tables
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[37] arXiv:2507.10178 [pdf, html, other]
Title: Pimba: A Processing-in-Memory Acceleration for Post-Transformer Large Language Model Serving
Wonung Kim, Yubin Lee, Yoonsung Kim, Jinwoo Hwang, Seongryong Oh, Jiyong Jung, Aziz Huseynov, Woong Gyu Park, Chang Hyun Park, Divya Mahajan, Jongse Park
Journal-ref: MICRO 2025
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[38] arXiv:2507.10573 [pdf, html, other]
Title: Device-Level Optimization Techniques for Solid-State Drives: A Survey
Tianyu Ren, Yajuan Du, Jinhua Cui, Yina Lv, Qiao Li, Chun Jason Xue
Subjects: Hardware Architecture (cs.AR)
[39] arXiv:2507.10639 [pdf, html, other]
Title: SPICEAssistant: LLM using SPICE Simulation Tools for Schematic Design of Switched-Mode Power Supplies
Simon Nau, Jan Krummenauer, André Zimmermann
Comments: 11 pages, 10 figures
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET)
[40] arXiv:2507.10748 [pdf, html, other]
Title: LASANA: Large-Scale Surrogate Modeling for Analog Neuromorphic Architecture Exploration
Jason Ho, James A. Boyle, Linshen Liu, Andreas Gerstlauer
Subjects: Hardware Architecture (cs.AR)
[41] arXiv:2507.10849 [pdf, html, other]
Title: OpenGCRAM: An Open-Source Gain Cell Compiler Enabling Design-Space Exploration for AI Workloads
Xinxin Wang, Lixian Yan, Shuhan Liu, Luke Upton, Zhuoqi Cai, Yiming Tan, Shengman Li, Koustav Jana, Peijing Li, Jesse Cirimelli-Low, Thierry Tambe, Matthew Guthaus, H.-S. Philip Wong
Subjects: Hardware Architecture (cs.AR); Systems and Control (eess.SY)
[42] arXiv:2507.10912 [pdf, html, other]
Title: Mapping Fusion: Improving FPGA Technology Mapping with ASIC Mapper
Cunxi Yu
Comments: 7 pages. to appear at MLCAD 2025
Subjects: Hardware Architecture (cs.AR)
[43] arXiv:2507.10971 [pdf, html, other]
Title: Security Enclave Architecture for Heterogeneous Security Primitives for Supply-Chain Attacks
Kshitij Raj, Atri Chatterjee, Patanjali SLPSK, Swarup Bhunia, Sandip Ray
Subjects: Hardware Architecture (cs.AR); Cryptography and Security (cs.CR)
[44] arXiv:2507.11331 [pdf, html, other]
Title: SystolicAttention: Fusing FlashAttention within a Single Systolic Array
Jiawei Lin, Guokai Chen, Yuanlong Li, Thomas Bourgeat
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[45] arXiv:2507.11506 [pdf, html, other]
Title: ELK: Exploring the Efficiency of Inter-core Connected AI Chips with Deep Learning Compiler Techniques
Yiqi Liu, Yuqi Xue, Noelle Crawford, Jilong Xue, Jian Huang
Comments: This paper is accepted at the 58th IEEE/ACM International Symposium on Microarchitecture (MICRO'25)
Journal-ref: In Proceedings of the 58th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'25), Seoul, Korea, October, 2025
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[46] arXiv:2507.11709 [pdf, html, other]
Title: Double Duty: FPGA Architecture to Enable Concurrent LUT and Adder Chain Usage
Junius Pun, Xilai Dai, Grace Zgheib, Mahesh A. Iyer, Andrew Boutros, Vaughn Betz, Mohamed S. Abdelfattah
Comments: accepted at FPL 2025
Subjects: Hardware Architecture (cs.AR)
[47] arXiv:2507.12028 [pdf, html, other]
Title: MOFCO: Mobility- and Migration-Aware Task Offloading in Three-Layer Fog Computing Environments
Soheil Mahdizadeh, Elyas Oustad, Mohsen Ansari
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI)
[48] arXiv:2507.12418 [pdf, html, other]
Title: High-Performance Pipelined NTT Accelerators with Homogeneous Digit-Serial Modulo Arithmetic
George Alexakis, Dimitrios Schoinianakis, Giorgos Dimitrakopoulos
Comments: 28th Euromicro Conference Series on Digital System Design (DSD 2025)
Subjects: Hardware Architecture (cs.AR)
[49] arXiv:2507.12442 [pdf, html, other]
Title: Characterizing State Space Model (SSM) and SSM-Transformer Hybrid Language Model Performance with Long Context Length
Saptarshi Mitra, Rachid Karami, Haocheng Xu, Sitao Huang, Hyoukjun Kwon
Comments: 12 pages, 7 figures
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Systems and Control (eess.SY)
[50] arXiv:2507.12471 [pdf, html, other]
Title: Modular SAIL: dream or reality?
Petr Kourzanov, Anmol
Subjects: Hardware Architecture (cs.AR)
[51] arXiv:2507.12904 [pdf, other]
Title: An ultra-low-power CGRA for accelerating Transformers at the edge
Rohit Prasad
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[52] arXiv:2507.13281 [pdf, html, other]
Title: WIP: Turning Fake Chips into Learning Opportunities
Haniye Mehraban, Saad Azmeen-ur-Rahman, John Hu
Comments: This is the accepted version of a paper accepted for presentation at the 2025 IEEE Frontiers in Education Conference (FIE). The final version will be available via IEEE Xplore at:this https URL
Subjects: Hardware Architecture (cs.AR)
[53] arXiv:2507.13355 [pdf, html, other]
Title: PGR-DRC: Pre-Global Routing DRC Violation Prediction Using Unsupervised Learning
Riadul Islam, Dhandeep Challagundla
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Systems and Control (eess.SY)
[54] arXiv:2507.13369 [pdf, html, other]
Title: VerilogDB: The Largest, Highest-Quality Dataset with a Preprocessing Framework for LLM-based RTL Generation
Paul E. Calzada, Zahin Ibnat, Tanvir Rahman, Kamal Kandula, Danyu Lu, Sujan Kumar Saha, Farimah Farahmandi, Mark Tehranipoor
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[55] arXiv:2507.13375 [pdf, html, other]
Title: GAP-LA: GPU-Accelerated Performance-Driven Layer Assignment
Chunyuan Zhao, Zizheng Guo, Zuodong Zhang, Yibo Lin
Subjects: Hardware Architecture (cs.AR)
[56] arXiv:2507.13631 [pdf, other]
Title: 4T2R X-ReRAM CiM Array for Variation-tolerant, Low-power, Massively Parallel MAC Operation
Fuyuki Kihara, Seiji Uenohara, Satoshi Awamura, Naoko Misawa, Chihiro Matsui, Ken Takeuchi
Comments: 4 pages
Subjects: Hardware Architecture (cs.AR)
[57] arXiv:2507.14139 [pdf, html, other]
Title: SpeedLLM: An FPGA Co-design of Large Language Model Inference Accelerator
Peipei Wang, Wu Guan, Liping Liang, Zhijun Wang, Hanqing Luo, Zhibin Zhang
Subjects: Hardware Architecture (cs.AR)
[58] arXiv:2507.14397 [pdf, html, other]
Title: Efficient LLM Inference: Bandwidth, Compute, Synchronization, and Capacity are all you need
Michael Davies, Neal Crago, Karthikeyan Sankaralingam, Christos Kozyrakis
Subjects: Hardware Architecture (cs.AR)
[59] arXiv:2507.14651 [pdf, html, other]
Title: Enabling Efficient Hardware Acceleration of Hybrid Vision Transformer (ViT) Networks at the Edge
Joren Dumoulin, Pouya Houshmand, Vikram Jain, Marian Verhelst
Subjects: Hardware Architecture (cs.AR)
[60] arXiv:2507.15300 [pdf, html, other]
Title: GCC: A 3DGS Inference Architecture with Gaussian-Wise and Cross-Stage Conditional Processing
Minnan Pei, Gang Li, Junwen Si, Zeyu Zhu, Zitao Mo, Peisong Wang, Zhuoran Song, Xiaoyao Liang, Jian Cheng
Comments: Accepted to MICRO 2025
Subjects: Hardware Architecture (cs.AR)
[61] arXiv:2507.15465 [pdf, html, other]
Title: The New LLM Bottleneck: A Systems Perspective on Latent Attention and Mixture-of-Experts
Sungmin Yun, Seonyong Park, Hwayong Nam, Younjoo Lee, Gunjun Lee, Kwanhee Kyung, Sangpyo Kim, Nam Sung Kim, Jongmin Kim, Hyungyo Kim, Juhwan Cho, Seungmin Baek, Jung Ho Ahn
Comments: 15 pages, 11 figures
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[62] arXiv:2507.15603 [pdf, html, other]
Title: When Pipelined In-Memory Accelerators Meet Spiking Direct Feedback Alignment: A Co-Design for Neuromorphic Edge Computing
Haoxiong Ren, Yangu He, Kwunhang Wong, Rui Bao, Ning Lin, Zhongrui Wang, Dashan Shang
Comments: International Conference on Computer-Aided Design 2025
Subjects: Hardware Architecture (cs.AR)
[63] arXiv:2507.15664 [pdf, html, other]
Title: VeriRAG: A Retrieval-Augmented Framework for Automated RTL Testability Repair
Haomin Qi, Yuyang Du, Lihao Zhang, Soung Chang Liew, Kexin Chen, Yining Du
Comments: 8 pages, 5 figures
Subjects: Hardware Architecture (cs.AR)
[64] arXiv:2507.16177 [pdf, html, other]
Title: A Sparsity-Aware Autonomous Path Planning Accelerator with HW/SW Co-Design and Multi-Level Dataflow Optimization
Yifan Zhang, Xiaoyu Niu, Hongzheng Tian, Yanjun Zhang, Bo Yu, Shaoshan Liu, Sitao Huang
Comments: Accepted by ACM Transactions on Architecture and Code Optimization (ACM TACO)
Subjects: Hardware Architecture (cs.AR)
[65] arXiv:2507.16326 [pdf, html, other]
Title: Hourglass Sorting: A novel parallel sorting algorithm and its implementation
Daniel Bascones, Borja Morcillo
Comments: 6 pages, 5 figures
Subjects: Hardware Architecture (cs.AR)
[66] arXiv:2507.16379 [pdf, html, other]
Title: ApproxGNN: A Pretrained GNN for Parameter Prediction in Design Space Exploration for Approximate Computing
Ondrej Vlcek, Vojtech Mrazek
Comments: To appear at ICCAD 2025
Subjects: Hardware Architecture (cs.AR)
[67] arXiv:2507.16391 [pdf, html, other]
Title: Ironman: Accelerating Oblivious Transfer Extension for Privacy-Preserving AI with Near-Memory Processing
Chenqi Lin, Kang Yang, Tianshi Xu, Ling Liang, Yufei Wang, Zhaohui Chen, Runsheng Wang, Mingyu Gao, Meng Li
Subjects: Hardware Architecture (cs.AR)
[68] arXiv:2507.16628 [pdf, html, other]
Title: Augmenting Von Neumann's Architecture for an Intelligent Future
Rajpreet Singh, Vidhi Kothari
Comments: 6 pages, 2 figures
Subjects: Hardware Architecture (cs.AR)
[69] arXiv:2507.16793 [pdf, html, other]
Title: MTU: The Multifunction Tree Unit in zkSpeed for Accelerating HyperPlonk
Jianqiao Mo, Alhad Daftardar, Joey Ah-kiow, Kaiyue Guo, Benedikt Bünz, Siddharth Garg, Brandon Reagen
Subjects: Hardware Architecture (cs.AR)
[70] arXiv:2507.17953 [pdf, html, other]
Title: Clo-HDnn: A 4.66 TFLOPS/W and 3.78 TOPS/W Continual On-Device Learning Accelerator with Energy-efficient Hyperdimensional Computing via Progressive Search
Chang Eun Song, Weihong Xu, Keming Fan, Soumil Jain, Gopabandhu Hota, Haichao Yang, Leo Liu, Kerem Akarvardar, Meng-Fan Chang, Carlos H. Diaz, Gert Cauwenberghs, Tajana Rosing, Mingu Kang
Comments: Published in 2025 Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits), Kyoto, Japan, 2025
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[71] arXiv:2507.18040 [pdf, other]
Title: Designing High-Performance and Thermally Feasible Multi-Chiplet Architectures enabled by Non-bendable Glass Interposer
Harsh Sharma, Janardhan Rao Doppa, Umit Y. Ogras, Partha Pratim Pande
Comments: Paper accepted at ACM Transactions on Embedded Computing Systems. To be presented in Taiwan, Sept. 2025
Subjects: Hardware Architecture (cs.AR)
[72] arXiv:2507.18454 [pdf, html, other]
Title: Sandwich: Separating Prefill-Decode Compilation for Efficient CPU LLM Serving
Juntao Zhao, Jiuru Li, Chuan Wu
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Programming Languages (cs.PL)
[73] arXiv:2507.18581 [pdf, html, other]
Title: PRACtical: Subarray-Level Counter Update and Bank-Level Recovery Isolation for Efficient PRAC Rowhammer Mitigation
Ravan Nazaraliyev, Saber Ganjisaffar, Nurlan Nazaraliyev, Nael Abu-Ghazaleh
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
[74] arXiv:2507.18889 [pdf, html, other]
Title: RailX: A Flexible, Scalable, and Low-Cost Network Architecture for Hyper-Scale LLM Training Systems
Yinxiao Feng, Tiancheng Chen, Yuchen Wei, Siyuan Shen, Shiju Wang, Wei Li, Kaisheng Ma, Torsten Hoefler
Comments: 25 pages, 21 figures, 6 tables
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI)
[75] arXiv:2507.19133 [pdf, other]
Title: 3DGauCIM: Accelerating Static/Dynamic 3D Gaussian Splatting via Digital CIM for High Frame Rate Real-Time Edge Rendering
Wei-Hsing Huang, Cheng-Jhih Shih, Jian-Wei Su, Samuel Wade Wang, Vaidehi Garg, Yuyao Kong, Jen-Chun Tien, Nealson Li, Arijit Raychowdhury, Meng-Fan Chang, Yingyan (Celine)Lin, Shimeng Yu
Subjects: Hardware Architecture (cs.AR)
[76] arXiv:2507.19142 [pdf, other]
Title: A3D-MoE: Acceleration of Large Language Models with Mixture of Experts via 3D Heterogeneous Integration
Wei-Hsing Huang, Janak Sharda, Cheng-Jhih Shih, Yuyao Kong, Faaiq Waqar, Pin-Jun Chen, Yingyan (Celine)Lin, Shimeng Yu
Subjects: Hardware Architecture (cs.AR)
[77] arXiv:2507.19570 [pdf, other]
Title: MCP4EDA: LLM-Powered Model Context Protocol RTL-to-GDSII Automation with Backend Aware Synthesis Optimization
Yiting Wang, Wanghao Ye, Yexiao He, Yiran Chen, Gang Qu, Ang Li
Comments: 7 pages, 5 figures Keywords: Model Context Protocol, Electronic Design Automation, Large Language Models, Synthesis Optimization
Subjects: Hardware Architecture (cs.AR); Multiagent Systems (cs.MA)
[78] arXiv:2507.19819 [pdf, html, other]
Title: ChipletPart: Cost-Aware Partitioning for 2.5D Systems
Alexander Graening, Puneet Gupta, Andrew B. Kahng, Bodhisatta Pramanik, Zhiang Wang
Comments: 14 pages, 13 figures
Subjects: Hardware Architecture (cs.AR)
[79] arXiv:2507.20007 [pdf, html, other]
Title: AxOSyn: An Open-source Framework for Synthesizing Novel Approximate Arithmetic Operators
Siva Satyendra Sahoo, Salim Ullah, Akash Kumar
Comments: Under review with ACM TRETS
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET); Logic in Computer Science (cs.LO)
[80] arXiv:2507.20412 [pdf, html, other]
Title: RoCE BALBOA: Service-enhanced Data Center RDMA for SmartNICs
Maximilian Jakob Heer, Benjamin Ramhorst, Yu Zhu, Luhao Liu, Zhiyi Hu, Jonas Dann, Gustavo Alonso
Subjects: Hardware Architecture (cs.AR); Networking and Internet Architecture (cs.NI)
[81] arXiv:2507.20420 [pdf, other]
Title: Demystifying the 7-D Convolution Loop Nest for Data and Instruction Streaming in Reconfigurable AI Accelerators
Md Rownak Hossain Chowdhury, Mostafizur Rahman
Subjects: Hardware Architecture (cs.AR)
[82] arXiv:2507.21430 [pdf, other]
Title: Automated HEMT Model Construction from Datasheets via Multi-Modal Intelligence and Prior-Knowledge-Free Optimization
Yuang Peng, Jiarui Zhong, Yang Zhang, Hong Cai Chen
Comments: 12 pages, 12 figures, 2 tables
Subjects: Hardware Architecture (cs.AR)
[83] arXiv:2507.21499 [pdf, html, other]
Title: SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity
Xingyang Li, Jie Jiang, Yu Feng, Yiming Gan, Jieru Zhao, Zihan Liu, Jingwen Leng, Minyi Guo
Subjects: Hardware Architecture (cs.AR)
[84] arXiv:2507.21572 [pdf, html, other]
Title: No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering
Linye Wei, Jiajun Tang, Fan Fei, Boxin Shi, Runsheng Wang, Meng Li
Comments: Accepted by International Conference on Computer-Aided Design (ICCAD) 2025
Subjects: Hardware Architecture (cs.AR)
[85] arXiv:2507.21694 [pdf, other]
Title: A Multi-Agent Generative AI Framework for IC Module-Level Verification Automation
Wenbo Liu, Forbes Hou, Jon Zhang, Hong Liu, Allen Lei
Comments: 20 pages, 12 figures. DVCon China 2025
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[86] arXiv:2507.22221 [pdf, other]
Title: A Customized Memory-aware Architecture for Biological Sequence Alignment
Nasrin Akbari, Mehdi Modarressi, Alireza Khadem
Comments: 20 pages, 11 figures
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
[87] arXiv:2507.00855 (cross-list from cs.DC) [pdf, html, other]
Title: A New Family of Thread to Core Allocation Policies for an SMT ARM Processor
Marta Navarro, Josué Feliu, Salvador Petit, María E. Gómez, Julio Sahuquillo
Comments: 13 pages
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[88] arXiv:2507.00937 (cross-list from cs.RO) [pdf, html, other]
Title: RaGNNarok: A Light-Weight Graph Neural Network for Enhancing Radar Point Clouds on Unmanned Ground Vehicles
David Hunt, Shaocheng Luo, Spencer Hallyburton, Shafii Nillongo, Yi Li, Tingjun Chen, Miroslav Pajic
Comments: 8 pages, accepted by IROS 2025
Subjects: Robotics (cs.RO); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[89] arXiv:2507.00949 (cross-list from cs.DC) [pdf, html, other]
Title: How Fast Can Graph Computations Go on Fine-grained Parallel Architectures
Yuqing Wang, Charles Colley, Brian Wheatman, Jiya Su, David F. Gleich, Andrew A. Chien
Comments: 13 pages, 11 figures, 6 tables
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[90] arXiv:2507.01429 (cross-list from cs.ET) [pdf, html, other]
Title: Hardware-software co-exploration with racetrack memory based in-memory computing for CNN inference in embedded systems
Benjamin Chen Ming Choong, Tao Luo, Cheng Liu, Bingsheng He, Wei Zhang, Joey Tianyi Zhou
Subjects: Emerging Technologies (cs.ET); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[91] arXiv:2507.01676 (cross-list from cs.DC) [pdf, html, other]
Title: Deep Recommender Models Inference: Automatic Asymmetric Data Flow Optimization
Giuseppe Ruggeri, Renzo Andri, Daniele Jahier Pagliari, Lukas Cavigelli
Comments: 5 pages, 4 figures, conference: IEEE ICCD24
Journal-ref: 2024 IEEE 42nd International Conference on Computer Design (ICCD), Milan, Italy, 2024, pp. 517-520
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Information Retrieval (cs.IR)
[92] arXiv:2507.02164 (cross-list from cs.MS) [pdf, html, other]
Title: Hardware-Accelerated Algorithm for Complex Function Roots Density Graph Plotting
Ruibai Tang, Chengbin Quan
Subjects: Mathematical Software (cs.MS); Hardware Architecture (cs.AR)
[93] arXiv:2507.02226 (cross-list from cs.PL) [pdf, html, other]
Title: DecoRTL: A Run-time Decoding Framework for RTL Code Generation with LLMs
Mohammad Akyash, Kimia Azar, Hadi Kamali
Comments: Accepted to the International Conference on Computer-Aided Design (ICCAD 2025)
Subjects: Programming Languages (cs.PL); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[94] arXiv:2507.02660 (cross-list from cs.AI) [pdf, html, other]
Title: Hey AI, Generate Me a Hardware Code! Agentic AI-based Hardware Design & Verification
Deepak Narayan Gadde, Keerthan Kopparam Radhakrishna, Vaisakh Naduvodi Viswambharan, Aman Kumar, Djones Lettnin, Wolfgang Kunz, Sebastian Simon
Comments: To appear at the 38th SBC/SBMicro/IEEE Symposium on Integrated Circuits and Systems Design (SBCCI), August 25-29, 2025, Manaus, BRAZIL
Subjects: Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[95] arXiv:2507.02871 (cross-list from cs.DC) [pdf, other]
Title: ZettaLith: An Architectural Exploration of Extreme-Scale AI Inference Acceleration
Kia Silverbrook
Comments: 53 pages, 15 figures, 23 tables
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[96] arXiv:2507.04736 (cross-list from cs.AI) [pdf, html, other]
Title: ChipSeek-R1: Generating Human-Surpassing RTL with LLM via Hierarchical Reward-Driven Reinforcement Learning
Zhirong Chen, Kaiyan Chang, Zhuolin Li, Xinyang He, Chujie Chen, Cangyuan Li, Mengdi Wang, Haobo Xu, Yinhe Han, Ying Wang
Subjects: Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Programming Languages (cs.PL)
[97] arXiv:2507.05531 (cross-list from cs.LG) [pdf, html, other]
Title: Bit-Flip Fault Attack: Crushing Graph Neural Networks via Gradual Bit Search
Sanaz Kazemi Abharian, Sai Manoj Pudukotai Dinakarrao
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[98] arXiv:2507.05576 (cross-list from cs.CR) [pdf, html, other]
Title: iThermTroj: Exploiting Intermittent Thermal Trojans in Multi-Processor System-on-Chips
Mehdi Elahi, Mohamed R. Elshamy, Abdel-Hameed Badawy, Ahmad Patooghy
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[99] arXiv:2507.05876 (cross-list from cs.NI) [pdf, html, other]
Title: OLAF: Programmable Data Plane Acceleration for Asynchronous Distributed Reinforcement Learning
Nehal Baganal Krishna, Anam Tahir, Firas Khamis, Mina Tahmasbi Arashloo, Michael Zink, Amr Rizk
Comments: 17 pages, 11 figures
Subjects: Networking and Internet Architecture (cs.NI); Hardware Architecture (cs.AR)
[100] arXiv:2507.06349 (cross-list from cs.DS) [pdf, html, other]
Title: Multi-Queue SSD I/O Modeling & Its Implications for Data Structure Design
Erin Ransom, Andrew Lim, Michael Mitzenmacher
Subjects: Data Structures and Algorithms (cs.DS); Hardware Architecture (cs.AR)
[101] arXiv:2507.06549 (cross-list from cs.LG) [pdf, html, other]
Title: Deep-Learning-Based Pre-Layout Parasitic Capacitance Prediction on SRAM Designs
Shan Shen, Dingcheng Yang, Yuyang Xie, Chunyan Pei, Wenjian Yu, Bei Yu
Comments: Published in Proceedings of GLSVLSI2024
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Systems and Control (eess.SY)
[102] arXiv:2507.07223 (cross-list from cs.DC) [pdf, other]
Title: Compute Can't Handle the Truth: Why Communication Tax Prioritizes Memory and Interconnects in Modern AI Infrastructure
Myoungsoo Jung
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[103] arXiv:2507.09776 (cross-list from eess.SP) [pdf, html, other]
Title: Compute SNR-Optimal Analog-to-Digital Converters for Analog In-Memory Computing
Mihir Kavishwar, Naresh Shanbhag
Comments: Code available at: this https URL
Subjects: Signal Processing (eess.SP); Hardware Architecture (cs.AR)
[104] arXiv:2507.09948 (cross-list from cs.LG) [pdf, html, other]
Title: Iceberg: Enhancing HLS Modeling with Synthetic Data
Zijian Ding, Tung Nguyen, Weikai Li, Aditya Grover, Yizhou Sun, Jason Cong
Comments: 9 pages. accepted to ICLAD'25
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[105] arXiv:2507.09965 (cross-list from cs.MA) [pdf, html, other]
Title: AnalogTester: A Large Language Model-Based Framework for Automatic Testbench Generation in Analog Circuit Design
Weiyu Chen, Chengjie Liu, Wenhao Huang, Jinyang Lyu, Mingqian Yang, Yuan Du, Li Du, Jun Yang
Comments: accepted by ISEDA 2025
Subjects: Multiagent Systems (cs.MA); Hardware Architecture (cs.AR)
[106] arXiv:2507.10338 (cross-list from cs.SE) [pdf, html, other]
Title: AssertCoder: LLM-Based Assertion Generation via Multimodal Specification Extraction
Enyuan Tian, Yiwei Ci, Qiusong Yang, Yufeng Li, Zhichao Lyu
Comments: 7 pages, 3 figures
Subjects: Software Engineering (cs.SE); Hardware Architecture (cs.AR); Logic in Computer Science (cs.LO)
[107] arXiv:2507.10463 (cross-list from cs.ET) [pdf, html, other]
Title: Solving the compute crisis with physics-based ASICs
Maxwell Aifer, Zach Belateche, Suraj Bramhavar, Kerem Y. Camsari, Patrick J. Coles, Gavin Crooks, Douglas J. Durian, Andrea J. Liu, Anastasia Marchenkova, Antonio J. Martinez, Peter L. McMahon, Faris Sbahi, Benjamin Weiner, Logan G. Wright
Comments: 16 pages, 5 figures
Subjects: Emerging Technologies (cs.ET); Hardware Architecture (cs.AR)
[108] arXiv:2507.10606 (cross-list from cs.LG) [pdf, html, other]
Title: DALI-PD: Diffusion-based Synthetic Layout Heatmap Generation for ML in Physical Design
Bing-Yue Wu, Vidya A. Chhabria
Comments: Under review at Asia and South Pacific Design Automation Conference (ASP-DAC'26)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[109] arXiv:2507.11134 (cross-list from cs.ET) [pdf, html, other]
Title: Fault-Free Analog Computing with Imperfect Hardware
Zhicheng Xu, Jiawei Liu, Sitao Huang, Zefan Li, Shengbo Wang, Bo Wen, Ruibin Mao, Mingrui Jiang, Giacomo Pedretti, Jim Ignowski, Kaibin Huang, Can Li
Subjects: Emerging Technologies (cs.ET); Hardware Architecture (cs.AR)
[110] arXiv:2507.12308 (cross-list from cs.CL) [pdf, html, other]
Title: Chain-of-Descriptions: Improving Code LLMs for VHDL Code Generation and Summarization
Prashanth Vijayaraghavan, Apoorva Nitsure, Charles Mackin, Luyao Shi, Stefano Ambrogio, Arvind Haran, Viresh Paruthi, Ali Elzein, Dan Coops, David Beymer, Tyler Baldwin, Ehsan Degan
Comments: 10 pages (6 content pages + 4 supplementary), 5 figures, Proceedings of the 2024 ACM/IEEE International Symposium on Machine Learning for CAD. 2024 (MLCAD'24)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[111] arXiv:2507.12445 (cross-list from cs.NI) [pdf, html, other]
Title: CRAFT: Latency and Cost-Aware Genetic-Based Framework for Node Placement in Edge-Fog Environments
Soheil Mahdizadeh, Amir Mahdi Rasouli, Mohammad Pourashory, Sadra Galavani, Mohsen Ansari
Subjects: Networking and Internet Architecture (cs.NI); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[112] arXiv:2507.12935 (cross-list from cs.LG) [pdf, html, other]
Title: MC$^2$A: Enabling Algorithm-Hardware Co-Design for Efficient Markov Chain Monte Carlo Acceleration
Shirui Zhao, Jun Yin, Lingyun Yao, Martin Andraud, Wannes Meert, Marian Verhelst
Comments: 14 pages, 15 figures, IEEE journal paper
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[113] arXiv:2507.13736 (cross-list from cs.LG) [pdf, html, other]
Title: An End-to-End DNN Inference Framework for the SpiNNaker2 Neuromorphic MPSoC
Matthias Jobst, Tim Langer, Chen Liu, Mehmet Alici, Hector A. Gonzalez, Christian Mayr
Comments: Poster at ACM ICONS 2025 - International Conference on Neuromorphic Systems
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[114] arXiv:2507.16200 (cross-list from cs.LG) [pdf, html, other]
Title: RealBench: Benchmarking Verilog Generation Models with Real-World IP Designs
Pengwei Jin, Di Huang, Chongxiao Li, Shuyao Cheng, Yang Zhao, Xinyao Zheng, Jiaguo Zhu, Shuyi Xing, Bohan Dou, Rui Zhang, Zidong Du, Qi Guo, Xing Hu
Comments: The benchmark is open-sourced at this https URL
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[115] arXiv:2507.16203 (cross-list from cs.CR) [pdf, html, other]
Title: SVAgent: AI Agent for Hardware Security Verification Assertion
Rui Guo, Avinash Ayalasomayajula, Henian Li, Jingbo Zhou, Sujan Kumar Saha, Farimah Farahmandi
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[116] arXiv:2507.16556 (cross-list from cs.CV) [pdf, html, other]
Title: Optimization of DNN-based HSI Segmentation FPGA-based SoC for ADS: A Practical Approach
Jon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe
Journal-ref: 2025 ACM Transactions on Embedded Computing Systems (TECS)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[117] arXiv:2507.16676 (cross-list from cs.LG) [pdf, html, other]
Title: Custom Algorithm-based Fault Tolerance for Attention Layers in Transformers
Vasileios Titopoulos, Kosmas Alexandridis, Giorgos Dimitrakopoulos
Comments: IEEE International System-on-Chip Conference (IEEE SOCC 2025)
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[118] arXiv:2507.17886 (cross-list from cs.NE) [pdf, html, other]
Title: Neuromorphic Computing: A Theoretical Framework for Time, Space, and Energy Scaling
James B Aimone
Comments: True pre-print; to be submitted at future date
Subjects: Neural and Evolutionary Computing (cs.NE); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[119] arXiv:2507.18174 (cross-list from cs.CV) [pdf, other]
Title: Real-Time Object Detection and Classification using YOLO for Edge FPGAs
Rashed Al Amin, Roman Obermaisser
Comments: This paper has been accepted for the 67th International Symposium on ELMAR 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR)
[120] arXiv:2507.18179 (cross-list from cs.NE) [pdf, html, other]
Title: Explicit Sign-Magnitude Encoders Enable Power-Efficient Multipliers
Felix Arnold, Maxence Bouvier, Ryan Amaudruz, Renzo Andri, Lukas Cavigelli
Comments: Accepted and presented at the 34th International Workshop on Logic & Synthesis June 2025
Subjects: Neural and Evolutionary Computing (cs.NE); Hardware Architecture (cs.AR); Performance (cs.PF)
[121] arXiv:2507.18989 (cross-list from cs.LG) [pdf, html, other]
Title: GENIAL: Generative Design Space Exploration via Network Inversion for Low Power Algorithmic Logic Units
Maxence Bouvier, Ryan Amaudruz, Felix Arnold, Renzo Andri, Lukas Cavigelli
Comments: Under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[122] arXiv:2507.19795 (cross-list from cs.CV) [pdf, other]
Title: Smaller, Faster, Cheaper: Architectural Designs for Efficient Machine Learning
Steven Walton
Comments: Ph.D. Thesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[123] arXiv:2507.19963 (cross-list from cs.NI) [pdf, other]
Title: A Scalable Resource Management Layer for FPGA SoCs in 6G Radio Units
Nikolaos Bartzoudis, José Rubio Fernández, David López-Bueno, Antonio Román Villarroel
Comments: Paper accepted to the "XL Simposio Nacional de la Unión Científica Internacional de Radio (URSI 2025)", Tarragona, Spain, 3-5 September 2025. Proceedings are not published. Also part of the worj appears in Deliverables 2.2 and 5.2 of the SNS JU project VERGE
Subjects: Networking and Internet Architecture (cs.NI); Hardware Architecture (cs.AR)
[124] arXiv:2507.20399 (cross-list from eess.SY) [pdf, html, other]
Title: ACCESS-AV: Adaptive Communication-Computation Codesign for Sustainable Autonomous Vehicle Localization in Smart Factories
Rajat Bhattacharjya, Arnab Sarkar, Ish Kool, Sabur Baidya, Nikil Dutt
Comments: 28 pages, 9 figures
Subjects: Systems and Control (eess.SY); Hardware Architecture (cs.AR); Networking and Internet Architecture (cs.NI); Robotics (cs.RO); Signal Processing (eess.SP)
[125] arXiv:2507.23035 (cross-list from cs.LG) [pdf, html, other]
Title: KLLM: Fast LLM Inference with K-Means Quantization
Xueying Wu, Baijun Zhou, Zhihui Gao, Yuzhe Fu, Qilin Zheng, Yintao He, Hai Li
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[126] arXiv:2507.23398 (cross-list from eess.IV) [pdf, html, other]
Title: Smart Video Capsule Endoscopy: Raw Image-Based Localization for Enhanced GI Tract Investigation
Oliver Bause, Julia Werner, Paul Palomero Bernardo, Oliver Bringmann
Comments: Accepted at the 32nd International Conference on Neural Information Processing - ICONIP 2025
Subjects: Image and Video Processing (eess.IV); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)
[127] arXiv:2507.23562 (cross-list from cs.LG) [pdf, html, other]
Title: Hardware-Aware Fine-Tuning of Spiking Q-Networks on the SpiNNaker2 Neuromorphic Platform
Sirine Arfa, Bernhard Vogginger, Christian Mayr
Comments: 8 pages, 5 figures, 3 tables
Journal-ref: ACM ICONS 2025 - International Conference on Neuromorphic Systems
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
Total of 127 entries
Showing up to 2000 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack