Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.AR

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Hardware Architecture

Authors and titles for July 2020

Total of 61 entries : 1-25 26-50 51-61
Showing up to 25 entries per page: fewer | more | all
[1] arXiv:2007.00060 [pdf, other]
Title: TDO-CIM: Transparent Detection and Offloading for Computation In-memory
Kanishkan Vadivel, Lorenzo Chelini, Ali BanaGozar, Gagandeep Singh, Stefano Corda, Roel Jordans, Henk Corporaal
Comments: Full version of DATE2020 publication
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
[2] arXiv:2007.00156 [pdf, other]
Title: Enabling Compute-Communication Overlap in Distributed Deep Learning Training Platforms
Saeed Rashidi, Matthew Denton, Srinivas Sridharan, Sudarshan Srinivasan, Amoghavarsha Suresh, Jade Ni, Tushar Krishna
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[3] arXiv:2007.00864 [pdf, other]
Title: Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave, Riyadh Baghdadi, Tony Nowatzki, Sasikanth Avancha, Aviral Shrivastava, Baoxin Li
Comments: Accepted for publication in Proceedings of the IEEE
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[4] arXiv:2007.01348 [pdf, other]
Title: Efficient Neural Network Deployment for Microcontroller
Hasan Unlu
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[5] arXiv:2007.01465 [pdf, other]
Title: Deep-PowerX: A Deep Learning-Based Framework for Low-Power Approximate Logic Synthesis
Ghasem Pasandi, Mackenzie Peterson, Moises Herrera, Shahin Nazarian, Massoud Pedram
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[6] arXiv:2007.01530 [pdf, other]
Title: FPnew: An Open-Source Multi-Format Floating-Point Unit Architecture for Energy-Proportional Transprecision Computing
Stefan Mach, Fabian Schuiki, Florian Zaruba, Luca Benini
Subjects: Hardware Architecture (cs.AR)
[7] arXiv:2007.01820 [pdf, other]
Title: A Machine Learning Pipeline Stage for Adaptive Frequency Adjustment
Arash Fouman Ajirlou, Inna Partin-Vaisband
Comments: 12 pages, 8 figures, 5 tables, IEEE transaction on computers. arXiv admin note: substantial text overlap with arXiv:2006.07450
Subjects: Hardware Architecture (cs.AR); Performance (cs.PF)
[8] arXiv:2007.02242 [pdf, other]
Title: A Ring Router Microarchitecture for NoCs
Wo-Tak Wu
Subjects: Hardware Architecture (cs.AR)
[9] arXiv:2007.03152 [pdf, other]
Title: The gem5 Simulator: Version 20.0+
Jason Lowe-Power, Abdul Mutaal Ahmad, Ayaz Akram, Mohammad Alian, Rico Amslinger, Matteo Andreozzi, Adrià Armejach, Nils Asmussen, Brad Beckmann, Srikant Bharadwaj, Gabe Black, Gedare Bloom, Bobby R. Bruce, Daniel Rodrigues Carvalho, Jeronimo Castrillon, Lizhong Chen, Nicolas Derumigny, Stephan Diestelhorst, Wendy Elsasser, Carlos Escuin, Marjan Fariborz, Amin Farmahini-Farahani, Pouya Fotouhi, Ryan Gambord, Jayneel Gandhi, Dibakar Gope, Thomas Grass, Anthony Gutierrez, Bagus Hanindhito, Andreas Hansson, Swapnil Haria, Austin Harris, Timothy Hayes, Adrian Herrera, Matthew Horsnell, Syed Ali Raza Jafri, Radhika Jagtap, Hanhwi Jang, Reiley Jeyapaul, Timothy M. Jones, Matthias Jung, Subash Kannoth, Hamidreza Khaleghzadeh, Yuetsu Kodama, Tushar Krishna, Tommaso Marinelli, Christian Menard, Andrea Mondelli, Miquel Moreto, Tiago Mück, Omar Naji, Krishnendra Nathella, Hoa Nguyen, Nikos Nikoleris, Lena E. Olson, Marc Orr, Binh Pham, Pablo Prieto, Trivikram Reddy, Alec Roelke, Mahyar Samani, Andreas Sandberg, Javier Setoain, Boris Shingarov, Matthew D. Sinclair, Tuan Ta, Rahul Thakur, Giacomo Travaglini, Michael Upton, Nilay Vaish, Ilias Vougioukas, William Wang, Zhengrong Wang, Norbert Wehn, Christian Weis, David A. Wood, Hongil Yoon, Éder F. Zulian
Comments: Source, comments, and feedback: this https URL
Subjects: Hardware Architecture (cs.AR)
[10] arXiv:2007.04292 [pdf, other]
Title: HALCONE : A Hardware-Level Timestamp-based Cache Coherence Scheme for Multi-GPU systems
Saiful A. Mojumder, Yifan Sun, Leila Delshadtehrani, Yenai Ma, Trinayan Baruah, José L. Abellán, John Kim, David Kaeli, Ajay Joshi
Comments: 13 pages, 9 figures
Subjects: Hardware Architecture (cs.AR)
[11] arXiv:2007.04552 [pdf, other]
Title: IOCA: High-Speed I/O-Aware LLC Management for Network-Centric Multi-Tenant Platform
Yifan Yuan, Mohammad Alian, Yipeng Wang, Ilia Kurakin, Ren Wang, Charlie Tai, Nam Sung Kim
Comments: Accepted by the 48th IEEE/ACM International Symposium on Computer Architecture (ISCA'21). The title is "Don't Forget the I/O When Allocating Your LLC"
Subjects: Hardware Architecture (cs.AR); Operating Systems (cs.OS)
[12] arXiv:2007.05657 [pdf, other]
Title: Hardware Implementation of Deep Network Accelerators Towards Healthcare and Biomedical Applications
Mostafa Rahimi Azghadi, Corey Lammie, Jason K. Eshraghian, Melika Payvand, Elisa Donati, Bernabe Linares-Barranco, Giacomo Indiveri
Comments: Accepted by IEEE Transactions on Biomedical Circuits and Systems (21 pages, 10 figures, 5 tables)
Journal-ref: IEEE Transactions on Biomedical Circuits and Systems, 2020
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG); Signal Processing (eess.SP)
[13] arXiv:2007.06563 [pdf, other]
Title: HOBFLOPS CNNs: Hardware Optimized Bitslice-Parallel Floating-Point Operations for Convolutional Neural Networks
James Garland, David Gregg
Comments: 14 pages, 3 tables, 9 figures
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[14] arXiv:2007.07131 [pdf, other]
Title: Irregular Accesses Reorder Unit: Improving GPGPU Memory Coalescing for Graph-Based Workloads
Albert Segura, Jose-Maria Arnau, Antonio Gonzalez
Subjects: Hardware Architecture (cs.AR)
[15] arXiv:2007.07759 [pdf, other]
Title: Enabling Mixed-Precision Quantized Neural Networks in Extreme-Edge Devices
Nazareno Bruschi, Angelo Garofalo, Francesco Conti, Giuseppe Tagliavini, Davide Rossi
Comments: 4 pages, 6 figures, published in 17th ACM International Conference on Computing Frontiers (CF '20), May 11--13, 2020, Catania, Italy
Subjects: Hardware Architecture (cs.AR); Image and Video Processing (eess.IV)
[16] arXiv:2007.07829 [pdf, other]
Title: A Survey of Aging Monitors and Reconfiguration Techniques
Leonardo Rezende Juracy, Matheus Trevisan Moreira, Alexandre de Morais Amory, Fernando Gehm Moraes
Comments: 19 pages, 65 references, 7 tables, 5 figures
Subjects: Hardware Architecture (cs.AR)
[17] arXiv:2007.08622 [pdf, other]
Title: Dagger: Towards Efficient RPCs in Cloud Microservices with Near-Memory Reconfigurable NICs
Nikita Lazarev, Neil Adit, Shaojie Xiang, Zhiru Zhang, Christina Delimitrou
Comments: 4 pages, 7 figures
Subjects: Hardware Architecture (cs.AR); Networking and Internet Architecture (cs.NI)
[18] arXiv:2007.08952 [pdf, other]
Title: Always-On 674uW @ 4GOP/s Error Resilient Binary Neural Networks with Aggressive SRAM Voltage Scaling on a 22nm IoT End-Node
Alfio Di Mauro, Francesco Conti, Pasquale Davide Schiavone, Davide Rossi, Luca Benini
Comments: Submitted to ISICAS2020 journal special issue
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG); Signal Processing (eess.SP)
[19] arXiv:2007.09109 [pdf, other]
Title: Klessydra-T: Designing Vector Coprocessors for Multi-Threaded Edge-Computing Cores
Abdallah Cheikh, Stefano Sordillo, Antonio Mastrandrea, Francesco Menichelli, Giuseppe Scotti, Mauro Olivieri
Comments: Final revision accepted for publication on IEEE Micro Journal
Journal-ref: IEEE Micro, 2021
Subjects: Hardware Architecture (cs.AR)
[20] arXiv:2007.09361 [pdf, other]
Title: Runtime Task Scheduling using Imitation Learning for Heterogeneous Many-Core Systems
Anish Krishnakumar, Samet E. Arda, A. Alper Goksoy, Sumit K. Mandal, Umit Y. Ogras, Anderson L. Sartor, Radu Marculescu
Comments: 14 pages, 12 figures, 8 tables. Accepted for publication in Embedded Systems Week CODES+ISSS 2020 (Special Issue in IEEE TCAD)
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[21] arXiv:2007.09363 [pdf, other]
Title: Design Space Exploration of Algorithmic Multi-Port Memories in High-Performance Application-Specific Accelerators
Khushal Sethi
Subjects: Hardware Architecture (cs.AR)
[22] arXiv:2007.09490 [pdf, other]
Title: DeepDive: An Integrative Algorithm/Architecture Co-Design for Deep Separable Convolutional Neural Networks
Mohammadreza Baharani, Ushma Sunil, Kaustubh Manohar, Steven Furgurson, Hamed Tabkhi
Subjects: Hardware Architecture (cs.AR)
[23] arXiv:2007.09578 [pdf, other]
Title: NeuroMAX: A High Throughput, Multi-Threaded, Log-Based Accelerator for Convolutional Neural Networks
Mahmood Azhar Qureshi, Arslan Munir
Comments: To be published in ICCAD 2020
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[24] arXiv:2007.09822 [pdf, other]
Title: UVMBench: A Comprehensive Benchmark Suite for Researching Unified Virtual Memory in GPUs
Yongbin Gu, Wenxuan Wu, Yunfan Li, Lizhong Chen
Comments: 10 pages
Subjects: Hardware Architecture (cs.AR)
[25] arXiv:2007.09976 [pdf, other]
Title: Energy Efficient Computing Systems: Architectures, Abstractions and Modeling to Techniques and Standards
Rajeev Muralidhar, Renata Borovica-Gajic, Rajkumar Buyya
Comments: 63 pages, 6 figures. arXiv admin note: text overlap with arXiv:1404.4629 by other authors
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
Total of 61 entries : 1-25 26-50 51-61
Showing up to 25 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack