
Showing 1–5 of 5 results for author: Abdelfattah, M S

Searching in archive eess.
  1. arXiv:2108.08305

    eess.IV cs.AI cs.CV cs.LG

    Temporal Kernel Consistency for Blind Video Super-Resolution

    Authors: Lichuan Xiang, Royson Lee, Mohamed S. Abdelfattah, Nicholas D. Lane, Hongkai Wen

    Abstract: Deep learning-based blind super-resolution (SR) methods have recently achieved unprecedented performance in upscaling frames with unknown degradation. These models are able to accurately estimate the unknown downscaling kernel from a given low-resolution (LR) image in order to leverage the kernel during restoration. Although these approaches have largely been successful, they are predominantly ima…

    Submitted 18 August, 2021; originally announced August 2021.

  2. arXiv:2007.08668

    cs.LG eess.SP stat.ML

    BRP-NAS: Prediction-based NAS using GCNs

    Authors: Łukasz Dudziak, Thomas Chau, Mohamed S. Abdelfattah, Royson Lee, Hyeji Kim, Nicholas D. Lane

    Abstract: Neural architecture search (NAS) enables researchers to automatically explore broad design spaces in order to improve efficiency of neural networks. This efficiency is especially important in the case of on-device deployment, where improvements in accuracy should be balanced out with computational demands of a model. In practice, performance metrics of model are computationally expensive to obtain…

    Submitted 19 January, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Published at NeurIPS 2020

  3. arXiv:2002.05022

    eess.SP cs.LG

    Best of Both Worlds: AutoML Codesign of a CNN and its Hardware Accelerator

    Authors: Mohamed S. Abdelfattah, Łukasz Dudziak, Thomas Chau, Royson Lee, Hyeji Kim, Nicholas D. Lane

    Abstract: Neural architecture search (NAS) has been very successful at outperforming human-designed convolutional neural networks (CNN) in accuracy, and when hardware information is present, latency as well. However, NAS-designed CNNs typically have a complicated topology, therefore, it may be difficult to design a custom hardware (HW) accelerator for such CNNs. We automate HW-CNN codesign using NAS by incl…

    Submitted 6 March, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Comments: accepted at DAC 2020

  4. arXiv:1907.03540

    cs.LG cs.AI eess.AS stat.ML

    ShrinkML: End-to-End ASR Model Compression Using Reinforcement Learning

    Authors: Łukasz Dudziak, Mohamed S. Abdelfattah, Ravichander Vipperla, Stefanos Laskaridis, Nicholas D. Lane

    Abstract: End-to-end automatic speech recognition (ASR) models are increasingly large and complex to achieve the best possible accuracy. In this paper, we build an AutoML system that uses reinforcement learning (RL) to optimize the per-layer compression ratios when applied to a state-of-the-art attention based end-to-end ASR model composed of several LSTM layers. We use singular value decomposition (SVD) lo…

    Submitted 24 September, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: INTERSPEECH 2019

  5. arXiv:1807.06434

    cs.DC cs.AR eess.SP

    DLA: Compiler and FPGA Overlay for Neural Network Inference Acceleration

    Authors: Mohamed S. Abdelfattah, David Han, Andrew Bitar, Roberto DiCecco, Shane OConnell, Nitika Shanker, Joseph Chu, Ian Prins, Joshua Fender, Andrew C. Ling, Gordon R. Chiu

    Abstract: Overlays have shown significant promise for field-programmable gate-arrays (FPGAs) as they allow for fast development cycles and remove many of the challenges of the traditional FPGA hardware design flow. However, this often comes with a significant performance burden resulting in very little adoption of overlays for practical applications. In this paper, we tailor an overlay to a specific applica…

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: Accepted in the International Conference on Field-Programmable Logic and Applications (FPL 2018)