
Showing 1–24 of 24 results for author: Pedram, A

  1. arXiv:2503.23705  [pdf, other]

    cs.LG stat.ML

    Steering Large Agent Populations using Mean-Field Schrodinger Bridges with Gaussian Mixture Models

    Authors: George Rapakoulias, Ali Reza Pedram, Panagiotis Tsiotras

    Abstract: The Mean-Field Schrodinger Bridge (MFSB) problem is an optimization problem aiming to find the minimum effort control policy to drive a McKean-Vlasov stochastic differential equation from one probability measure to another. In the context of multiagent control, the objective is to control the configuration of a swarm of identical, interacting cooperative agents, as captured by the time-varying pr…

    Submitted 3 April, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

  2. arXiv:2503.10843  [pdf, other]

    cs.RO

    Communication-Aware Iterative Map Compression for Online Path-Planning

    Authors: Evangelos Psomiadis, Ali Reza Pedram, Dipankar Maity, Panagiotis Tsiotras

    Abstract: This paper addresses the problem of optimizing communicated information among heterogeneous, resource-aware robot teams to facilitate their navigation. In such operations, a mobile robot compresses its local map to assist another robot in reaching a target within an uncharted environment. The primary challenge lies in ensuring that the map compression step balances network load while transmitting…

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 7 pages, 6 figures

  3. arXiv:2412.16432  [pdf, other]

    cs.AR

    DFModel: Design Space Optimization of Large-Scale Systems Exploiting Dataflow Mappings

    Authors: Sho Ko, Nathan Zhang, Olivia Hsu, Ardavan Pedram, Kunle Olukotun

    Abstract: We propose DFModel, a modeling framework for mapping dataflow computation graphs onto large-scale systems. Mapping a workload to a system requires optimizing dataflow mappings at various levels, including the inter-chip (between chips) level and the intra-chip (within a chip) level. DFModel is, to the best of our knowledge, the first framework to perform the optimization at multiple levels of the…

    Submitted 20 December, 2024; originally announced December 2024.

  4. arXiv:2412.09059  [pdf, ps, other]

    cs.LG

    Go With the Flow: Fast Diffusion for Gaussian Mixture Models

    Authors: George Rapakoulias, Ali Reza Pedram, Fengjiao Liu, Lingjiong Zhu, Panagiotis Tsiotras

    Abstract: Schrodinger Bridges (SBs) are diffusion processes that steer, in finite time, a given initial distribution to another final one while minimizing a suitable cost functional. Although various methods for computing SBs have recently been proposed in the literature, most of these approaches require computationally expensive training schemes, even for solving low-dimensional problems. In this work, we…

    Submitted 30 May, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

  5. arXiv:2306.09552  [pdf, other]

    cs.AR

    Retrospective: EIE: Efficient Inference Engine on Sparse and Compressed Neural Network

    Authors: Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark A. Horowitz, William J. Dally

    Abstract: EIE proposed to accelerate pruned and compressed neural networks, exploiting weight sparsity, activation sparsity, and 4-bit weight-sharing in neural network accelerators. Since its publication at ISCA'16, it has opened a new design space to accelerate pruned and sparse neural networks and spawned many algorithm-hardware co-designs for model compression and acceleration, both in academia and commercial AI c…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Invited retrospective paper at ISCA 2023

  6. arXiv:2306.00264  [pdf, ps, other]

    cs.RO

    Optimal Sampling-based Motion Planning in Gaussian Belief Space for Minimum Sensing Navigation

    Authors: Vrushabh Zinage, Ali Reza Pedram, Takashi Tanaka

    Abstract: In this paper, we consider the motion planning problem in Gaussian belief space for minimum sensing navigation. Despite the extensive use of sampling-based algorithms and their rigorous analysis in the deterministic setting, there has been little formal analysis of the quality of the solutions returned by sampling algorithms in Gaussian belief space. This paper aims to address this lack of resea…

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: 19 pages, 10 figures

  7. arXiv:2303.07326  [pdf, other]

    cs.RO math.OC

    A Smoothing Algorithm for Minimum Sensing Path Plans in Gaussian Belief Space

    Authors: Ali Reza Pedram, Takashi Tanaka

    Abstract: This paper explores minimum sensing navigation of robots in environments cluttered with obstacles. The general objective is to find a path plan to a goal region that requires minimal sensing effort. In [1], the information-geometric RRT* (IG-RRT*) algorithm was proposed to efficiently find such a path. However, like any stochastic sampling-based planner, the computational complexity of IG-RRT* gro…

    Submitted 13 March, 2023; originally announced March 2023.

  8. arXiv:2109.13976  [pdf, other]

    cs.RO cs.IT math.OC

    Gaussian Belief Space Path Planning for Minimum Sensing Navigation

    Authors: Ali Reza Pedram, Riku Funada, Takashi Tanaka

    Abstract: We propose a path planning methodology for a mobile robot navigating through an obstacle-filled environment to generate a reference path that is traceable with moderate sensing efforts. The desired reference path is characterized as the shortest path in an obstacle-filled Gaussian belief manifold equipped with a novel information-geometric distance function. The distance function we introduce is s…

    Submitted 7 December, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

  9. arXiv:2109.13146  [pdf, other]

    cs.RO cs.IT eess.SY

    Dynamic Allocation of Visual Attention for Vision-based Autonomous Navigation under Data Rate Constraints

    Authors: Ali Reza Pedram, Riku Funada, Takashi Tanaka

    Abstract: This paper considers the problem of task-dependent (top-down) attention allocation for vision-based autonomous navigation using known landmarks. Unlike the existing paradigm in which landmark selection is formulated as a combinatorial optimization problem, we model it as a resource allocation problem where the decision-maker (DM) is granted extra freedom to control the degree of attention to each…

    Submitted 27 September, 2021; originally announced September 2021.

  10. arXiv:2107.12922  [pdf, other]

    cs.AR

    Griffin: Rethinking Sparse Optimization for Deep Learning Architectures

    Authors: Jong Hoon Shin, Ali Shafiee, Ardavan Pedram, Hamzah Abdel-Aziz, Ling Li, Joseph Hassoun

    Abstract: This paper examines the design space trade-offs of DNN accelerators aiming to achieve competitive performance and efficiency metrics for all four combinations of dense or sparse activation/weight tensors. To do so, we systematically examine the overheads of supporting sparsity on top of an optimized dense core. These overheads are modeled based on parameters that indicate how a multiplier can bor…

    Submitted 1 November, 2021; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: Accepted at the 28th IEEE International Symposium on High-Performance Computer Architecture - HPCA 2022

  11. arXiv:2106.04001  [pdf, other]

    math.OC cs.IT

    Optimized Data Rate Allocation for Dynamic Sensor Fusion over Resource Constrained Communication Networks

    Authors: Hyunho Jung, Ali Reza Pedram, Travis Craig Cuvelier, Takashi Tanaka

    Abstract: This paper presents a new method to solve a dynamic sensor fusion problem. We consider a large number of remote sensors which measure a common Gauss-Markov process and encoders that transmit the measurements to a data fusion center through the resource restricted communication network. The proposed approach heuristically minimizes a weighted sum of communication costs subject to a constraint on th…

    Submitted 18 October, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

  12. arXiv:2101.11748  [pdf, other]

    cs.AR cs.LG

    Rethinking Floating Point Overheads for Mixed Precision DNN Accelerators

    Authors: Hamzah Abdel-Aziz, Ali Shafiee, Jong Hoon Shin, Ardavan Pedram, Joseph H. Hassoun

    Abstract: In this paper, we propose a mixed-precision convolution unit architecture which supports different integer and floating point (FP) precisions. The proposed architecture is based on low-bit inner product units and realizes higher precision based on temporal decomposition. We illustrate how to integrate FP computations on integer-based architecture and evaluate overheads incurred by FP arithmetic su…

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: Accepted to appear in 4th Conference on Machine Learning and Systems 2021

  13. arXiv:2003.12548  [pdf, other]

    eess.SY cs.LG

    Closed-loop Parameter Identification of Linear Dynamical Systems through the Lens of Feedback Channel Coding Theory

    Authors: Ali Reza Pedram, Takashi Tanaka

    Abstract: This paper considers the problem of closed-loop identification of linear scalar systems with Gaussian process noise, where the system input is determined by a deterministic state feedback policy. The regularized least-square estimate (LSE) algorithm is adopted, seeking to find the best estimate of unknown model parameters based on noiseless measurements of the state. We are interested in the funda…

    Submitted 27 March, 2020; originally announced March 2020.

  14. arXiv:2002.12494  [pdf, ps, other]

    cs.RO eess.SY

    Rationally Inattentive Path-Planning via RRT*

    Authors: Jeb Stefan, Ali Reza Pedram, Riku Funada, Takashi Tanaka

    Abstract: We consider a path-planning scenario for a mobile robot traveling in a configuration space with obstacles under the presence of stochastic disturbances. A novel path length metric is proposed on the uncertain configuration space and then integrated with the existing RRT* algorithm. The metric is a weighted sum of two terms which capture both the Euclidean distance traveled by the robot and the per…

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: 10 pages, 5 figures

  15. arXiv:2002.02921  [pdf, other]

    cs.CV cs.LG cs.RO eess.IV

    Temporal Segmentation of Surgical Sub-tasks through Deep Learning with Multiple Data Sources

    Authors: Yidan Qin, Sahba Aghajani Pedram, Seyedshams Feyzabadi, Max Allan, A. Jonathan McLeod, Joel W. Burdick, Mahdi Azizian

    Abstract: Many tasks in robot-assisted surgeries (RAS) can be represented by finite-state machines (FSMs), where each state represents either an action (such as picking up a needle) or an observation (such as bleeding). A crucial step towards the automation of such surgical tasks is the temporal perception of the current surgical scene, which requires a real-time estimation of the states in the FSMs. The ob…

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: Accepted to ICRA 2020

  16. arXiv:2001.03253  [pdf, other]

    cs.LG stat.ML

    Campfire: Compressible, Regularization-Free, Structured Sparse Training for Hardware Accelerators

    Authors: Noah Gamboa, Kais Kudrolli, Anand Dhoot, Ardavan Pedram

    Abstract: This paper studies structured sparse training of CNNs with a gradual pruning technique that leads to fixed, sparse weight matrices after a set number of epochs. We simplify the structure of the enforced sparsity so that it reduces overhead caused by regularization. The proposed training methodology Campfire explores pruning at granularities within a convolutional kernel and filter. We study vari…

    Submitted 12 January, 2020; v1 submitted 9 January, 2020; originally announced January 2020.

  17. arXiv:1910.03398  [pdf, other]

    cs.RO cs.AI cs.LG

    Toward Synergic Learning for Autonomous Manipulation of Deformable Tissues via Surgical Robots: An Approximate Q-Learning Approach

    Authors: Sahba Aghajani Pedram, Peter Walker Ferguson, Changyeob Shin, Ankur Mehta, Erik P. Dutson, Farshid Alambeigi, Jacob Rosen

    Abstract: In this paper, we present a synergic learning algorithm to address the task of indirect manipulation of an unknown deformable tissue. Tissue manipulation is a common yet challenging task in various surgical interventions, which makes it a good candidate for robotic automation. We propose using a linear approximate Q-learning method in which human knowledge contributes to selecting useful yet simpl…

    Submitted 11 October, 2019; v1 submitted 8 October, 2019; originally announced October 2019.

  18. arXiv:1905.07459  [pdf, other]

    cs.IT math.OC

    Bidirectional Information Flow and the Roles of Privacy Masks in Cloud-Based Control

    Authors: Ali Reza Pedram, Takashi Tanaka, Matthew Hale

    Abstract: We consider a cloud-based control architecture for a linear plant with Gaussian process noise, where the state of the plant contains a client's sensitive information. We assume that the cloud tries to estimate the state while executing a designated control algorithm. The mutual information between the client's actual state and the cloud's estimate is adopted as a measure of privacy loss. We discus…

    Submitted 17 May, 2019; originally announced May 2019.

  19. Autonomous Tissue Manipulation via Surgical Robot Using Learning Based Model Predictive Control

    Authors: Changyeob Shin, Peter Walker Ferguson, Sahba Aghajani Pedram, Ji Ma, Erik P. Dutson, Jacob Rosen

    Abstract: Tissue manipulation is a frequently used fundamental subtask of any surgical procedure, and in some cases it may require the involvement of a surgeon's assistant. The complex dynamics of soft tissue as an unstructured environment is one of the main challenges in any attempt to automate the manipulation of it via a surgical robotic system. Two AI learning based model predictive control algorithms…

    Submitted 2 March, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

    Comments: Accepted to ICRA 2019

  20. arXiv:1901.04588  [pdf, other]

    cs.RO

    Optimal Needle Diameter, Shape, and Path in Autonomous Suturing

    Authors: S. Aghajani Pedram, P. Ferguson, J. Ma, E. Dutson, J. Rosen

    Abstract: Needle shape, diameter, and path are critical parameters that directly affect suture depth and tissue trauma in autonomous suturing. This paper presents an optimization-based approach to specify these parameters. Given clinical suturing guidelines, a kinematic model of needle-tissue interaction was developed to quantify suture parameters and constraints. The model was further used to formulate con…

    Submitted 14 January, 2019; originally announced January 2019.

    Comments: 2 pages, 2 figures, 2017 SoCal Robotics Symposium

  21. arXiv:1706.00517  [pdf, other]

    cs.DC cs.LG cs.NE

    CATERPILLAR: Coarse Grain Reconfigurable Architecture for Accelerating the Training of Deep Neural Networks

    Authors: Yuanfang Li, Ardavan Pedram

    Abstract: Accelerating the inference of a trained DNN is a well studied subject. In this paper we switch the focus to the training of DNNs. The training phase is compute intensive, demands complicated data communication, and contains multiple levels of data dependencies and parallelism. This paper presents an algorithm/architecture space exploration of efficient accelerators to achieve better network conver…

    Submitted 8 June, 2017; v1 submitted 1 June, 2017; originally announced June 2017.

    Comments: ASAP 2017: The 28th Annual IEEE International Conference on Application-specific Systems, Architectures and Processors

  22. arXiv:1606.04209  [pdf, other]

    cs.DC cs.NE

    A Systematic Approach to Blocking Convolutional Neural Networks

    Authors: Xuan Yang, Jing Pu, Blaine Burton Rister, Nikhil Bhagdikar, Stephen Richardson, Shahar Kvatinsky, Jonathan Ragan-Kelley, Ardavan Pedram, Mark Horowitz

    Abstract: Convolutional Neural Networks (CNNs) are the state of the art solution for many computer vision problems, and many researchers have explored optimized implementations. Most implementations heuristically block the computation to deal with the large data sizes and high data reuse of CNNs. This paper explores how to block CNN computations for memory locality by creating an analytical model for CNN-li…

    Submitted 14 June, 2016; originally announced June 2016.

  23. Dark Memory and Accelerator-Rich System Optimization in the Dark Silicon Era

    Authors: Ardavan Pedram, Stephen Richardson, Sameh Galal, Shahar Kvatinsky, Mark A. Horowitz

    Abstract: The key challenge to improving performance in the age of Dark Silicon is how to leverage transistors when they cannot all be used at the same time. In modern SOCs, these transistors are often used to create specialized accelerators which improve energy efficiency for some applications by 10-1000X. While this might seem like the magic bullet we need, for most CPU applications more energy is dissipa…

    Submitted 26 April, 2016; v1 submitted 12 February, 2016; originally announced February 2016.

    Comments: 8 pages, To appear in IEEE Design and Test Journal

    Journal ref: IEEE Design & Test ( Volume: 34, Issue: 2, April 2017 )

  24. arXiv:1602.01528  [pdf, other]

    cs.CV cs.AR

    EIE: Efficient Inference Engine on Compressed Deep Neural Network

    Authors: Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark A. Horowitz, William J. Dally

    Abstract: State-of-the-art deep neural networks (DNNs) have hundreds of millions of connections and are both computationally and memory intensive, making them difficult to deploy on embedded systems with limited hardware resources and power budgets. While custom hardware helps the computation, fetching weights from DRAM is two orders of magnitude more expensive than ALU operations, and dominates the require…

    Submitted 3 May, 2016; v1 submitted 3 February, 2016; originally announced February 2016.

    Comments: External Links: TheNextPlatform: http://goo.gl/f7qX0L ; O'Reilly: https://goo.gl/Id1HNT ; Hacker News: https://goo.gl/KM72SV ; Embedded-vision: http://goo.gl/joQNg8 ; Talk at NVIDIA GTC'16: http://goo.gl/6wJYvn ; Talk at Embedded Vision Summit: https://goo.gl/7abFNe ; Talk at Stanford University: https://goo.gl/6lwuer. Published as a conference paper in ISCA 2016