Skip to main content

Showing 1–14 of 14 results for author: Ranganathan, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.23076  [pdf, other

    cs.AR cs.LG cs.PF

    Concorde: Fast and Accurate CPU Performance Modeling with Compositional Analytical-ML Fusion

    Authors: Arash Nasr-Esfahany, Mohammad Alizadeh, Victor Lee, Hanna Alam, Brett W. Coon, David Culler, Vidushi Dadu, Martin Dixon, Henry M. Levy, Santosh Pandey, Parthasarathy Ranganathan, Amir Yazdanbakhsh

    Abstract: Cycle-level simulators such as gem5 are widely used in microarchitecture design, but they are prohibitively slow for large-scale design space explorations. We present Concorde, a new methodology for learning fast and accurate performance models of microarchitectures. Unlike existing simulators and learning approaches that emulate each instruction, Concorde predicts the behavior of a program based… ▽ More

    Submitted 29 March, 2025; originally announced March 2025.

    Comments: 15 pages, 17 figures, To be published in ISCA 2025

  2. arXiv:2503.15669  [pdf, other

    cs.SE

    ECO: An LLM-Driven Efficient Code Optimizer for Warehouse Scale Computers

    Authors: Hannah Lin, Martin Maas, Maximilian Roquemore, Arman Hasanzadeh, Fred Lewis, Yusuf Simonson, Tzu-Wei Yang, Amir Yazdanbakhsh, Deniz Altinbüken, Florin Papa, Maggie Nolan Edmonds, Aditya Patil, Don Schwarz, Satish Chandra, Chris Kennelly, Milad Hashemi, Parthasarathy Ranganathan

    Abstract: With the end of Moore's Law, optimizing code for performance has become paramount for meeting ever-increasing compute demands, particularly in hyperscale data centers where even small efficiency gains translate to significant resource and energy savings. Traditionally, this process requires significant programmer effort to identify optimization opportunities, modify the code to implement the optim… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

  3. arXiv:2502.01671  [pdf, other

    cs.AR cs.AI

    Life-Cycle Emissions of AI Hardware: A Cradle-To-Grave Approach and Generational Trends

    Authors: Ian Schneider, Hui Xu, Stephan Benecke, David Patterson, Keguo Huang, Parthasarathy Ranganathan, Cooper Elsworth

    Abstract: Specialized hardware accelerators aid the rapid advancement of artificial intelligence (AI), and their efficiency impacts AI's environmental sustainability. This study presents the first publication of a comprehensive AI accelerator life-cycle assessment (LCA) of greenhouse gas emissions, including the first publication of manufacturing emissions of an AI accelerator. Our analysis of five Tensor… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

  4. Adaptive Hoeffding Tree with Transfer Learning for Streaming Synchrophasor Data Sets

    Authors: Zakaria El Mrabet, Daisy Flora Selvaraj, Prakash Ranganathan

    Abstract: Synchrophasor technology or phasor measurement units (PMUs) are known to detect multiple type of oscillations or faults better than Supervisory Control and Data Acquisition (SCADA) systems, but the volume of Bigdata (e.g., 30-120 samples per second on a single PMU) generated by these sensors at the aggregator level (e.g., several PMUs) requires special handling. Conventional machine learning or da… ▽ More

    Submitted 19 January, 2025; originally announced January 2025.

    Journal ref: 2019 IEEE International Conference on Big Data (Big Data)

  5. arXiv:2404.19124  [pdf, other

    cs.CL

    Accelerating Production LLMs with Combined Token/Embedding Speculators

    Authors: Davis Wertheimer, Joshua Rosenkranz, Thomas Parnell, Sahil Suneja, Pavithra Ranganathan, Raghu Ganti, Mudhakar Srivatsa

    Abstract: This technical report describes the design and training of novel speculative decoding draft models, for accelerating the inference speeds of large language models in a production environment. By conditioning draft predictions on both context vectors and sampled tokens, we can train our speculators to efficiently predict high-quality n-grams, which the base model then accepts or rejects. This allow… ▽ More

    Submitted 6 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Original upload 4/29/24, updated 6/6/24 with additional references to concurrent work

  6. arXiv:2306.03964  [pdf

    cs.AR

    Fifty Years of ISCA: A data-driven retrospective on key trends

    Authors: Gaurang Upasani, Matthew D. Sinclair, Adrian Sampson, Parthasarathy Ranganathan, David Patterson, Shaan Shah, Nidhi Parthasarathy, Rutwik Jain

    Abstract: Computer Architecture, broadly, involves optimizing hardware and software for current and future processing systems. Although there are several other top venues to publish Computer Architecture research, including ASPLOS, HPCA, and MICRO, ISCA (the International Symposium on Computer Architecture) is one of the oldest, longest running, and most prestigious venues for publishing Computer Architectu… ▽ More

    Submitted 18 November, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 34 pages, 16 figures

  7. arXiv:2302.07867  [pdf, other

    cs.SE cs.AI cs.LG cs.PF

    Learning Performance-Improving Code Edits

    Authors: Alexander Shypula, Aman Madaan, Yimeng Zeng, Uri Alon, Jacob Gardner, Milad Hashemi, Graham Neubig, Parthasarathy Ranganathan, Osbert Bastani, Amir Yazdanbakhsh

    Abstract: With the decline of Moore's law, optimizing program performance has become a major focus of software research. However, high-level optimizations such as API and algorithm changes remain elusive due to the difficulty of understanding the semantics of code. Simultaneously, pretrained large language models (LLMs) have demonstrated strong capabilities at solving a wide range of programming tasks. To t… ▽ More

    Submitted 26 April, 2024; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: Published as a conference paper at ICLR 2024 (Spotlight). Project website: https://pie4perf.com/

  8. arXiv:2208.05297  [pdf, other

    cs.SE cs.LG

    Learning to Improve Code Efficiency

    Authors: Binghong Chen, Daniel Tarlow, Kevin Swersky, Martin Maas, Pablo Heiber, Ashish Naik, Milad Hashemi, Parthasarathy Ranganathan

    Abstract: Improvements in the performance of computing systems, driven by Moore's Law, have transformed society. As such hardware-driven gains slow down, it becomes even more important for software developers to focus on performance and efficiency during development. While several studies have demonstrated the potential from such improved code efficiency (e.g., 2x better generational improvements compared t… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

  9. arXiv:2108.06738  [pdf, other

    cs.CY

    Socio-Technological Challenges and Opportunities: Paths Forward

    Authors: Carole-Jean Wu, Srilatha Manne, Parthasarathy Ranganathan, Sarah Bird, Shane Greenstein

    Abstract: Advancements in digital technologies have a bootstrapping effect. The past fifty years of technological innovations from the computer architecture community have brought innovations and orders-of-magnitude efficiency improvements that engender use cases that were not previously possible -- stimulating novel application domains and increasing uses and deployments at an ever-faster pace. Consequentl… ▽ More

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: This article is intended to capture the ISCA panel and the following discussions on the Microprocessor 50: Societal Challenges from the lens of computer architects

  10. arXiv:2006.16239  [pdf, other

    cs.LG cs.AR stat.ML

    An Imitation Learning Approach for Cache Replacement

    Authors: Evan Zheran Liu, Milad Hashemi, Kevin Swersky, Parthasarathy Ranganathan, Junwhan Ahn

    Abstract: Program execution speed critically depends on increasing cache hits, as cache hits are orders of magnitude faster than misses. To increase cache hits, we focus on the problem of cache replacement: choosing which cache line to evict upon inserting a new line. This is challenging because it requires planning far ahead and currently there is no known practical solution. As a result, current replaceme… ▽ More

    Submitted 9 July, 2020; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: International Conference on Machine Learning (ICML), 2020

  11. arXiv:2006.08084  [pdf, other

    cs.LG cs.NE cs.PL stat.ML

    Neural Execution Engines: Learning to Execute Subroutines

    Authors: Yujun Yan, Kevin Swersky, Danai Koutra, Parthasarathy Ranganathan, Milad Hashemi

    Abstract: A significant effort has been made to train neural networks that replicate algorithmic reasoning, but they often fail to learn the abstract concepts underlying these algorithms. This is evidenced by their inability to generalize to data distributions that are outside of their restricted training sets, namely larger inputs and unseen data. We study these generalization issues at the level of numeri… ▽ More

    Submitted 22 October, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: Accepted at 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

  12. arXiv:1906.07181  [pdf, other

    cs.LG cs.AI cs.PL stat.ML

    Learning Execution through Neural Code Fusion

    Authors: Zhan Shi, Kevin Swersky, Daniel Tarlow, Parthasarathy Ranganathan, Milad Hashemi

    Abstract: As the performance of computer systems stagnates due to the end of Moore's Law, there is a need for new models that can understand and optimize the execution of general purpose code. While there is a growing body of work on using Graph Neural Networks (GNNs) to learn representations of source code, these representations do not understand how code dynamically executes. In this work, we propose a ne… ▽ More

    Submitted 10 March, 2020; v1 submitted 17 June, 2019; originally announced June 2019.

    Comments: 14 pages,7 figures

  13. arXiv:1803.02329  [pdf, other

    cs.LG stat.ML

    Learning Memory Access Patterns

    Authors: Milad Hashemi, Kevin Swersky, Jamie A. Smith, Grant Ayers, Heiner Litz, Jichuan Chang, Christos Kozyrakis, Parthasarathy Ranganathan

    Abstract: The explosion in workload complexity and the recent slow-down in Moore's law scaling call for new approaches towards efficient computing. Researchers are now beginning to use recent advances in machine learning in software optimizations, augmenting or replacing traditional heuristics and data structures. However, the space of machine learning for computer hardware architecture is only lightly expl… ▽ More

    Submitted 6 March, 2018; originally announced March 2018.

  14. arXiv:0909.1784  [pdf

    cs.DB cs.PF

    Energy Efficiency: The New Holy Grail of Data Management Systems Research

    Authors: Stavros Harizopoulos, Mehul Shah, Justin Meza, Parthasarathy Ranganathan

    Abstract: Energy costs are quickly rising in large-scale data centers and are soon projected to overtake the cost of hardware. As a result, data center operators have recently started turning into using more energy-friendly hardware. Despite the growing body of research in power management techniques, there has been little work to date on energy efficiency from a data management software perspective. In… ▽ More

    Submitted 9 September, 2009; originally announced September 2009.

    Comments: CIDR 2009