Skip to main content

Showing 1–20 of 20 results for author: Salameh, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.00636  [pdf, other

    cs.LG cs.CV

    Applying Graph Explanation to Operator Fusion

    Authors: Keith G. Mills, Muhammad Fetrat Qharabagh, Weichen Qiu, Fred X. Han, Mohammad Salameh, Wei Lu, Shangling Jui, Di Niu

    Abstract: Layer fusion techniques are critical to improving the inference efficiency of deep neural networks (DNN) for deployment. Fusion aims to lower inference costs by reducing data transactions between an accelerator's on-chip buffer and DRAM. This is accomplished by grouped execution of multiple operations like convolution and activations together into single execution units - fusion groups. However, o… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: DAC'23 WIP Poster; 8 pages, 5 Figures 5 Tables

  2. arXiv:2412.14628  [pdf, other

    cs.CV cs.LG

    Qua$^2$SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models

    Authors: Keith G. Mills, Mohammad Salameh, Ruichen Chen, Negar Hassanpour, Wei Lu, Di Niu

    Abstract: Diffusion Models (DM) have democratized AI image generation through an iterative denoising process. Quantization is a major technique to alleviate the inference cost and reduce the size of DM denoiser networks. However, as denoisers evolve from variants of convolutional U-Nets toward newer Transformer architectures, it is of growing importance to understand the quantization sensitivity of differen… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: AAAI 2025; version includes supplementary material; 22 Pages, 18 Figures, 8 Tables

  3. arXiv:2412.14283  [pdf, other

    cs.CV cs.AI cs.GR

    PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation

    Authors: Liyao Jiang, Negar Hassanpour, Mohammad Salameh, Mohammadreza Samadi, Jiao He, Fengyu Sun, Di Niu

    Abstract: Recent research explores the potential of Diffusion Models (DMs) for consistent object editing, which aims to modify object position, size, and composition, etc., while preserving the consistency of objects and background without changing their texture and attributes. Current inference-time methods often rely on DDIM inversion, which inherently compromises efficiency and the achievable consistency… ▽ More

    Submitted 29 January, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

    Comments: AAAI 2025; version includes supplementary material; 27 Pages, 15 Figures, 6 Tables

  4. arXiv:2410.03936  [pdf, other

    cs.CV cs.AI cs.LG

    Learning Truncated Causal History Model for Video Restoration

    Authors: Amirhosein Ghasemabadi, Muhammad Kamran Janjua, Mohammad Salameh, Di Niu

    Abstract: One key challenge to video restoration is to model the transition dynamics of video frames governed by motion. In this work, we propose TURTLE to learn the truncated causal history model for efficient and high-performing video restoration. Unlike traditional methods that process a range of contextual frames in parallel, TURTLE enhances efficiency by storing and summarizing a truncated history of t… ▽ More

    Submitted 15 October, 2024; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: Accepted to NeurIPS 2024. 24 pages

  5. arXiv:2408.11706  [pdf, other

    cs.CV

    FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting

    Authors: Liyao Jiang, Negar Hassanpour, Mohammad Salameh, Mohan Sai Singamsetti, Fengyu Sun, Wei Lu, Di Niu

    Abstract: Text-to-image (T2I) diffusion models have demonstrated impressive capabilities in generating high-quality images given a text prompt. However, ensuring the prompt-image alignment remains a considerable challenge, i.e., generating images that faithfully align with the prompt's semantics. Recent works attempt to improve the faithfulness by optimizing the latent code, which potentially could cause th… ▽ More

    Submitted 6 April, 2025; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: TMLR 2025

  6. arXiv:2408.08495  [pdf, other

    cs.CV

    FunEditor: Achieving Complex Image Edits via Function Aggregation with Diffusion Models

    Authors: Mohammadreza Samadi, Fred X. Han, Mohammad Salameh, Hao Wu, Fengyu Sun, Chunhua Zhou, Di Niu

    Abstract: Diffusion models have demonstrated outstanding performance in generative tasks, making them ideal candidates for image editing. Recent studies highlight their ability to apply desired edits effectively by following textual instructions, yet with two key challenges remaining. First, these models struggle to apply multiple edits simultaneously, resulting in computational inefficiencies due to their… ▽ More

    Submitted 17 December, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

  7. arXiv:2404.07383  [pdf, other

    cs.RO cs.AI

    Incorporating Explanations into Human-Machine Interfaces for Trust and Situation Awareness in Autonomous Vehicles

    Authors: Shahin Atakishiyev, Mohammad Salameh, Randy Goebel

    Abstract: Autonomous vehicles often make complex decisions via machine learning-based predictive models applied to collected sensor data. While this combination of methods provides a foundation for real-time actions, self-driving behavior primarily remains opaque to end users. In this sense, explainability of real-time decisions is a crucial and natural requirement for building trust in autonomous vehicles.… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted to IEEE IV-2024

  8. arXiv:2403.13293  [pdf, other

    cs.CV cs.AI cs.LG

    Building Optimal Neural Architectures using Interpretable Knowledge

    Authors: Keith G. Mills, Fred X. Han, Mohammad Salameh, Shengyao Lu, Chunhua Zhou, Jiao He, Fengyu Sun, Di Niu

    Abstract: Neural Architecture Search is a costly practice. The fact that a search space can span a vast number of design choices with each architecture evaluation taking nontrivial overhead makes it hard for an algorithm to sufficiently explore candidate networks. In this paper, we propose AutoBuild, a scheme which learns to align the latent embeddings of operations and architecture modules with the ground-… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: CVPR'24; 18 Pages, 18 Figures, 3 Tables

  9. arXiv:2403.12176  [pdf, ps, other

    cs.RO cs.AI

    Safety Implications of Explainable Artificial Intelligence in End-to-End Autonomous Driving

    Authors: Shahin Atakishiyev, Mohammad Salameh, Randy Goebel

    Abstract: The end-to-end learning pipeline is gradually creating a paradigm shift in the ongoing development of highly autonomous vehicles (AVs), largely due to advances in deep learning, the availability of large-scale training datasets, and improvements in integrated sensor devices. However, a lack of explainability in real-time decisions with contemporary learning methods impedes user trust and attenuate… ▽ More

    Submitted 29 May, 2025; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in IEEE Transactions on Intelligent Transportation Systems

  10. arXiv:2401.15235  [pdf, other

    eess.IV cs.CV cs.LG

    CascadedGaze: Efficiency in Global Context Extraction for Image Restoration

    Authors: Amirhosein Ghasemabadi, Muhammad Kamran Janjua, Mohammad Salameh, Chunhua Zhou, Fengyu Sun, Di Niu

    Abstract: Image restoration tasks traditionally rely on convolutional neural networks. However, given the local nature of the convolutional operator, they struggle to capture global information. The promise of attention mechanisms in Transformers is to circumvent this problem, but it comes at the cost of intensive computational overhead. Many recent studies in image restoration have focused on solving the c… ▽ More

    Submitted 7 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Published in Transactions on Machine Learning Research (TMLR), 2024. 20 pages

  11. arXiv:2307.10408  [pdf, other

    cs.CV cs.AI

    Explaining Autonomous Driving Actions with Visual Question Answering

    Authors: Shahin Atakishiyev, Mohammad Salameh, Housam Babiker, Randy Goebel

    Abstract: The end-to-end learning ability of self-driving vehicles has achieved significant milestones over the last decade owing to rapid advances in deep learning and computer vision algorithms. However, as autonomous driving technology is a safety-critical application of artificial intelligence (AI), road accidents and established regulatory principles necessitate the need for the explainability of intel… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted to the 2023 IEEE International Conference on Intelligent Transportation Systems (IEEE ITSC-2023)

  12. arXiv:2303.02733  [pdf, other

    cs.LG cs.AI cs.CV

    Reparameterization through Spatial Gradient Scaling

    Authors: Alexander Detkov, Mohammad Salameh, Muhammad Fetrat Qharabagh, Jialin Zhang, Wei Lui, Shangling Jui, Di Niu

    Abstract: Reparameterization aims to improve the generalization of deep neural networks by transforming convolutional layers into equivalent multi-branched structures during training. However, there exists a gap in understanding how reparameterization may change and benefit the learning process of neural networks. In this paper, we present a novel spatial gradient scaling method to redistribute learning foc… ▽ More

    Submitted 6 March, 2023; v1 submitted 5 March, 2023; originally announced March 2023.

    Comments: Published at ICLR 2023. Code available at https://github.com/Ascend-Research/Reparameterization

  13. A General-Purpose Transferable Predictor for Neural Architecture Search

    Authors: Fred X. Han, Keith G. Mills, Fabian Chudak, Parsa Riahi, Mohammad Salameh, Jialin Zhang, Wei Lu, Shangling Jui, Di Niu

    Abstract: Understanding and modelling the performance of neural architectures is key to Neural Architecture Search (NAS). Performance predictors have seen widespread use in low-cost NAS and achieve high ranking correlations between predicted and ground truth performance in several NAS benchmarks. However, existing predictors are often designed based on network encodings specific to a predefined search space… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted to SDM2023; version includes supplementary material; 12 Pages, 3 Figures, 6 Tables

  14. AIO-P: Expanding Neural Performance Predictors Beyond Image Classification

    Authors: Keith G. Mills, Di Niu, Mohammad Salameh, Weichen Qiu, Fred X. Han, Puyuan Liu, Jialin Zhang, Wei Lu, Shangling Jui

    Abstract: Evaluating neural network performance is critical to deep neural network design but a costly procedure. Neural predictors provide an efficient solution by treating architectures as samples and learning to estimate their performance on a given task. However, existing predictors are task-dependent, predominantly estimating neural network performance on image classification benchmarks. They are also… ▽ More

    Submitted 24 April, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: AAAI 2023 Oral Presentation; version includes supplementary material; 16 Pages, 4 Figures, 22 Tables

  15. GENNAPE: Towards Generalized Neural Architecture Performance Estimators

    Authors: Keith G. Mills, Fred X. Han, Jialin Zhang, Fabian Chudak, Ali Safari Mamaghani, Mohammad Salameh, Wei Lu, Shangling Jui, Di Niu

    Abstract: Predicting neural architecture performance is a challenging task and is crucial to neural architecture design and search. Existing approaches either rely on neural performance predictors which are limited to modeling architectures in a predefined design space involving specific sets of operators and connection rules, and cannot generalize to unseen architectures, or resort to zero-cost proxies whi… ▽ More

    Submitted 24 April, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: AAAI 2023 Oral Presentation; includes supplementary materials with more details on introduced benchmarks; 14 Pages, 6 Figures, 10 Tables

  16. arXiv:2112.11561  [pdf, other

    cs.AI cs.CY

    Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions

    Authors: Shahin Atakishiyev, Mohammad Salameh, Hengshuai Yao, Randy Goebel

    Abstract: Autonomous driving has achieved significant milestones in research and development over the last two decades. There is increasing interest in the field as the deployment of autonomous vehicles (AVs) promises safer and more ecologically friendly transportation systems. With the rapid progress in computationally powerful artificial intelligence (AI) techniques, AVs can sense their environment with h… ▽ More

    Submitted 25 April, 2024; v1 submitted 21 December, 2021; originally announced December 2021.

  17. arXiv:2111.10518  [pdf, other

    cs.AI

    Towards Safe, Explainable, and Regulated Autonomous Driving

    Authors: Shahin Atakishiyev, Mohammad Salameh, Hengshuai Yao, Randy Goebel

    Abstract: There has been recent and growing interest in the development and deployment of autonomous vehicles, encouraged by the empirical successes of powerful artificial intelligence techniques (AI), especially in the applications of deep learning and reinforcement learning. However, as demonstrated by recent traffic accidents, autonomous driving technology is not fully reliable for safe deployment. As AI… ▽ More

    Submitted 26 May, 2023; v1 submitted 20 November, 2021; originally announced November 2021.

    Comments: Accepted for publication in the Explainable AI for Intelligent Transportation Systems book

  18. L$^{2}$NAS: Learning to Optimize Neural Architectures via Continuous-Action Reinforcement Learning

    Authors: Keith G. Mills, Fred X. Han, Mohammad Salameh, Seyed Saeed Changiz Rezaei, Linglong Kong, Wei Lu, Shuo Lian, Shangling Jui, Di Niu

    Abstract: Neural architecture search (NAS) has achieved remarkable results in deep neural network design. Differentiable architecture search converts the search over discrete architectures into a hyperparameter optimization problem which can be solved by gradient descent. However, questions have been raised regarding the effectiveness and generalizability of gradient methods for solving non-convex architect… ▽ More

    Submitted 25 September, 2021; originally announced September 2021.

    Comments: Accepted as a Full Research Paper at CIKM 2021; 10 pages, 3 Figures, 5 Tables

  19. arXiv:2105.09356  [pdf, other

    cs.LG cs.CV

    Generative Adversarial Neural Architecture Search

    Authors: Seyed Saeed Changiz Rezaei, Fred X. Han, Di Niu, Mohammad Salameh, Keith Mills, Shuo Lian, Wei Lu, Shangling Jui

    Abstract: Despite the empirical success of neural architecture search (NAS) in deep learning applications, the optimality, reproducibility and cost of NAS schemes remain hard to assess. In this paper, we propose Generative Adversarial NAS (GA-NAS) with theoretically provable convergence guarantees, promoting stability and reproducibility in neural architecture search. Inspired by importance sampling, GA-NAS… ▽ More

    Submitted 23 June, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

    Comments: 17 pages, 9 figures, 13 Tables

  20. Neural Architecture Search For Keyword Spotting

    Authors: Tong Mo, Yakun Yu, Mohammad Salameh, Di Niu, Shangling Jui

    Abstract: Deep neural networks have recently become a popular solution to keyword spotting systems, which enable the control of smart devices via voice. In this paper, we apply neural architecture search to search for convolutional neural network models that can help boost the performance of keyword spotting based on features extracted from acoustic signals while maintaining an acceptable memory footprint.… ▽ More

    Submitted 2 September, 2020; v1 submitted 31 August, 2020; originally announced September 2020.

    Comments: will be presented in INTERSPEECH 2020

    Journal ref: Proc. Interspeech 2020, 1982-1986