Skip to main content

Showing 1–11 of 11 results for author: Odema, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.16007  [pdf, other

    cs.AR cs.AI cs.DC cs.PF

    Performance Implications of Multi-Chiplet Neural Processing Units on Autonomous Driving Perception

    Authors: Mohanad Odema, Luke Chen, Hyoukjun Kwon, Mohammad Abdullah Al Faruque

    Abstract: We study the application of emerging chiplet-based Neural Processing Units to accelerate vehicular AI perception workloads in constrained automotive settings. The motivation stems from how chiplets technology is becoming integral to emerging vehicular architectures, providing a cost-effective trade-off between performance, modularity, and customization; and from perception models being the most co… ▽ More

    Submitted 24 November, 2024; originally announced November 2024.

    Comments: DATE'2025

  2. arXiv:2405.00790  [pdf, other

    cs.AR cs.AI cs.DC cs.LG cs.PF

    SCAR: Scheduling Multi-Model AI Workloads on Heterogeneous Multi-Chiplet Module Accelerators

    Authors: Mohanad Odema, Luke Chen, Hyoukjun Kwon, Mohammad Abdullah Al Faruque

    Abstract: Emerging multi-model workloads with heavy models like recent large language models significantly increased the compute and memory demands on hardware. To address such increasing demands, designing a scalable hardware architecture became a key problem. Among recent solutions, the 2.5D silicon interposer multi-chip module (MCM)-based AI accelerator has been actively explored as a promising scalable… ▽ More

    Submitted 14 September, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: MICRO'24

  3. arXiv:2312.09401  [pdf, other

    cs.AR cs.AI cs.DC

    Inter-Layer Scheduling Space Exploration for Multi-model Inference on Heterogeneous Chiplets

    Authors: Mohanad Odema, Hyoukjun Kwon, Mohammad Abdullah Al Faruque

    Abstract: To address increasing compute demand from recent multi-model workloads with heavy models like large language models, we propose to deploy heterogeneous chiplet-based multi-chip module (MCM)-based accelerators. We develop an advanced scheduling framework for heterogeneous MCM accelerators that comprehensively consider complex heterogeneity and inter-chiplet pipelining. Our experiments using our fra… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted poster abstract to the IBM IEEE AI Compute Symposium (AICS'23)

  4. arXiv:2307.08065  [pdf, other

    cs.DC cs.CV cs.LG

    MaGNAS: A Mapping-Aware Graph Neural Architecture Search Framework for Heterogeneous MPSoC Deployment

    Authors: Mohanad Odema, Halima Bouzidi, Hamza Ouarnoughi, Smail Niar, Mohammad Abdullah Al Faruque

    Abstract: Graph Neural Networks (GNNs) are becoming increasingly popular for vision-based applications due to their intrinsic capacity in modeling structural and contextual relations between various parts of an image frame. On another front, the rising popularity of deep vision-based applications at the edge has been facilitated by the recent advancements in heterogeneous multi-processor Systems on Chips (M… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: This article appears as part of the ESWEEK-TECS special issue and was presented in the International Conference on Compilers, Architectures, and Synthesis for Embedded Systems (CASES), 2023

  5. arXiv:2302.12926  [pdf, other

    cs.DC cs.AR cs.LG

    Map-and-Conquer: Energy-Efficient Mapping of Dynamic Neural Nets onto Heterogeneous MPSoCs

    Authors: Halima Bouzidi, Mohanad Odema, Hamza Ouarnoughi, Smail Niar, Mohammad Abdullah Al Faruque

    Abstract: Heterogeneous MPSoCs comprise diverse processing units of varying compute capabilities. To date, the mapping strategies of neural networks (NNs) onto such systems are yet to exploit the full potential of processing parallelism, made possible through both the intrinsic NNs' structure and underlying hardware composition. In this paper, we propose a novel framework to effectively map NNs onto heterog… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: Accepted to the 60th ACM/IEEE Design Automation Conference (DAC 2023)

  6. arXiv:2302.12493  [pdf, other

    eess.SY cs.DC cs.LG

    SEO: Safety-Aware Energy Optimization Framework for Multi-Sensor Neural Controllers at the Edge

    Authors: Mohanad Odema, James Ferlez, Yasser Shoukry, Mohammad Abdullah Al Faruque

    Abstract: Runtime energy management has become quintessential for multi-sensor autonomous systems at the edge for achieving high performance given the platform constraints. Typical for such systems, however, is to have their controllers designed with formal guarantees on safety that precede in priority such optimizations, which in turn limits their application in real settings. In this paper, we propose a n… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: Accepted to the 60th ACM/IEEE Design Automation Conference (DAC 2023)

  7. arXiv:2302.06572  [pdf, other

    eess.SY cs.DC cs.LG cs.RO

    EnergyShield: Provably-Safe Offloading of Neural Network Controllers for Energy Efficiency

    Authors: Mohanad Odema, James Ferlez, Goli Vaisi, Yasser Shoukry, Mohammad Abdullah Al Faruque

    Abstract: To mitigate the high energy demand of Neural Network (NN) based Autonomous Driving Systems (ADSs), we consider the problem of offloading NN controllers from the ADS to nearby edge-computing infrastructure, but in such a way that formal vehicle safety properties are guaranteed. In particular, we propose the EnergyShield framework, which repurposes a controller ''shield'' as a low-power runtime safe… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted to be published in the proceedings of the 14th ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS 2023)

  8. arXiv:2212.03354  [pdf, other

    cs.LG cs.AR cs.NE cs.PF

    HADAS: Hardware-Aware Dynamic Neural Architecture Search for Edge Performance Scaling

    Authors: Halima Bouzidi, Mohanad Odema, Hamza Ouarnoughi, Mohammad Abdullah Al Faruque, Smail Niar

    Abstract: Dynamic neural networks (DyNNs) have become viable techniques to enable intelligence on resource-constrained edge devices while maintaining computational efficiency. In many cases, the implementation of DyNNs can be sub-optimal due to its underlying backbone architecture being developed at the design stage independent of both: (i) the dynamic computing features, e.g. early exiting, and (ii) the re… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: To be published in the 26th IEEE/ACM Design, Automation & Test in Europe Conference & Exhibition (DATE), April 2023, Antwerp, Belgium

  9. arXiv:2207.08865  [pdf, other

    cs.DC cs.CV cs.LG

    Romanus: Robust Task Offloading in Modular Multi-Sensor Autonomous Driving Systems

    Authors: Luke Chen, Mohanad Odema, Mohammad Abdullah Al Faruque

    Abstract: Due to the high performance and safety requirements of self-driving applications, the complexity of modern autonomous driving systems (ADS) has been growing, instigating the need for more sophisticated hardware which could add to the energy footprint of the ADS platform. Addressing this, edge computing is poised to encompass self-driving applications, enabling the compute-intensive autonomy-relate… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: This paper has been accepted to the 2022 International Conference On Computer-Aided Design (ICCAD 2022)

  10. arXiv:2107.10895  [pdf, other

    eess.SP cs.CV eess.SY

    SAGE: A Split-Architecture Methodology for Efficient End-to-End Autonomous Vehicle Control

    Authors: Arnav Malawade, Mohanad Odema, Sebastien Lajeunesse-DeGroot, Mohammad Abdullah Al Faruque

    Abstract: Autonomous vehicles (AV) are expected to revolutionize transportation and improve road safety significantly. However, these benefits do not come without cost; AVs require large Deep-Learning (DL) models and powerful hardware platforms to operate reliably in real-time, requiring between several hundred watts to one kilowatt of power. This power consumption can dramatically reduce vehicles' driving… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: This article appears as part of the ESWEEK-TECS special issue and was presented in the International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS), 2021

  11. arXiv:2107.09309  [pdf, other

    cs.LG cs.DC cs.NE

    LENS: Layer Distribution Enabled Neural Architecture Search in Edge-Cloud Hierarchies

    Authors: Mohanad Odema, Nafiul Rashid, Berken Utku Demirel, Mohammad Abdullah Al Faruque

    Abstract: Edge-Cloud hierarchical systems employing intelligence through Deep Neural Networks (DNNs) endure the dilemma of workload distribution within them. Previous solutions proposed to distribute workloads at runtime according to the state of the surroundings, like the wireless conditions. However, such conditions are usually overlooked at design time. This paper addresses this issue for DNN architectur… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: To appear at the 58th IEEE/ACM Design Automation Conference (DAC), December 2021, San Francisco, CA, USA