Showing 1–7 of 7 results for author: Miriyala, V

  1. arXiv:2505.04135  [pdf, ps, other]

    cs.CL cs.LG

    Enhancing Granular Sentiment Classification with Chain-of-Thought Prompting in Large Language Models

    Authors: Vihaan Miriyala, Smrithi Bukkapatnam, Lavanya Prahallad

    Abstract: We explore the use of Chain-of-Thought (CoT) prompting with large language models (LLMs) to improve the accuracy of granular sentiment categorization in app store reviews. Traditional numeric and polarity-based ratings often fail to capture the nuanced sentiment embedded in user feedback. We evaluated the effectiveness of CoT prompting versus simple prompting on 2000 Amazon app reviews by comparin…

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 5 pages

  2. arXiv:2502.06304  [pdf, other]

    cs.DC cs.AR

    Data-aware Dynamic Execution of Irregular Workloads on Heterogeneous Systems

    Authors: Zhenyu Bai, Dan Wu, Pranav Dangi, Dhananjaya Wijerathne, Venkata Pavan Kumar Miriyala, Tulika Mitra

    Abstract: Current approaches to scheduling workloads on heterogeneous systems with specialized accelerators often rely on manual partitioning, offloading tasks with specific compute patterns to accelerators. This method requires extensive experimentation and human effort to identify the tasks suitable for the accelerator. To solve this problem, we introduce DyPe, a scheduling framework tailored for heteroge…

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: 10 pages

  3. arXiv:2204.09797  [pdf, other]

    cs.AR cs.AI

    Multiply-and-Fire (MNF): An Event-driven Sparse Neural Network Accelerator

    Authors: Miao Yu, Tingting Xiang, Venkata Pavan Kumar Miriyala, Trevor E. Carlson

    Abstract: Machine learning, particularly deep neural network inference, has become a vital workload for many computing systems, from data centers and HPC systems to edge-based computing. As advances in sparsity have helped improve the efficiency of AI acceleration, there is a continued need for improved system efficiency for both high-performance and system-level acceleration. This work takes a unique loo…

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 12 pages, 9 figures and 5 tables

  4. arXiv:2010.11741  [pdf, other]

    eess.AS cond-mat.dis-nn cs.AR cs.LG cs.SD

    Ultra-low power on-chip learning of speech commands with phase-change memories

    Authors: Venkata Pavan Kumar Miriyala, Masatoshi Ishii

    Abstract: Embedding artificial intelligence at the edge (edge-AI) is an elegant solution to tackle the power and latency issues in the rapidly expanding Internet of Things. As edge devices typically spend most of their time in sleep mode and only wake up infrequently to collect and process sensor data, non-volatile in-memory computing (NVIMC) is a promising approach to design the next generation of edge-AI…

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: This work has been submitted to the IEEE for possible publication

  5. arXiv:2006.09982  [pdf, other]

    cs.NE cs.AI cs.AR cs.LG

    You Only Spike Once: Improving Energy-Efficient Neuromorphic Inference to ANN-Level Accuracy

    Authors: Srivatsa P, Kyle Timothy Ng Chu, Burin Amornpaisannon, Yaswanth Tavva, Venkata Pavan Kumar Miriyala, Jibin Wu, Malu Zhang, Haizhou Li, Trevor E. Carlson

    Abstract: In the past decade, advances in Artificial Neural Networks (ANNs) have allowed them to perform extremely well for a wide range of tasks. In fact, they have reached human parity when performing image recognition, for example. Unfortunately, the accuracy of these ANNs comes at the expense of a large number of cache and/or memory accesses and compute operations. Spiking Neural Networks (SNNs), a type…

    Submitted 8 November, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: 10 pages, 4 figures. This work has been submitted to the IEEE for possible publication. This work is an extended version of the paper accepted to the 2nd Workshop on Accelerated Machine Learning (AccML 2020)

  6. arXiv:2003.11837  [pdf, other]

    cs.NE cs.LG

    Rectified Linear Postsynaptic Potential Function for Backpropagation in Deep Spiking Neural Networks

    Authors: Malu Zhang, Jiadong Wang, Burin Amornpaisannon, Zhixuan Zhang, VPK Miriyala, Ammar Belatreche, Hong Qu, Jibin Wu, Yansong Chua, Trevor E. Carlson, Haizhou Li

    Abstract: Spiking Neural Networks (SNNs) use spatio-temporal spike patterns to represent and transmit information, which is not only biologically realistic but also suitable for ultra-low-power event-driven neuromorphic implementation. Motivated by the success of deep learning, the study of Deep Spiking Neural Networks (DeepSNNs) provides promising directions for artificial intelligence applications. Howeve…

    Submitted 3 November, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  7. arXiv:2003.05132  [pdf, other]

    cs.ET cond-mat.dis-nn

    SIMBA: A Skyrmionic In-Memory Binary Neural Network Accelerator

    Authors: Venkata Pavan Kumar Miriyala, Kale Rahul Vishwanath, Xuanyao Fong

    Abstract: Magnetic skyrmions are emerging as potential candidates for next-generation non-volatile memories. In this paper, we propose an in-memory binary neural network (BNN) accelerator based on non-volatile skyrmionic memory, which we call SIMBA. SIMBA consumes 26.7 mJ of energy and incurs 2.7 ms of latency when running an inference on a VGG-like BNN. Furthermore, we demonstrate improvements in the perfo…

    Submitted 11 March, 2020; originally announced March 2020.

    Journal ref: IEEE Transactions on Magnetics 2020