Skip to main content

Showing 1–4 of 4 results for author: Salamah, A H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.17507  [pdf, other

    cs.IR

    Dense Passage Retrieval in Conversational Search

    Authors: Ahmed H. Salamah, Pierre McWhannel, Nicole Yan

    Abstract: Information retrieval systems have traditionally relied on exact term match methods such as BM25 for first-stage retrieval. However, recent advancements in neural network-based techniques have introduced a new method called dense retrieval. This approach uses a dual-encoder to create contextual embeddings that can be indexed and clustered efficiently at run-time, resulting in improved retrieval pe… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  2. arXiv:2410.07081  [pdf, other

    cs.CV

    JPEG Inspired Deep Learning

    Authors: Ahmed H. Salamah, Kaixiang Zheng, Yiwen Liu, En-Hui Yang

    Abstract: Although it is traditionally believed that lossy image compression, such as JPEG compression, has a negative impact on the performance of deep neural networks (DNNs), it is shown by recent works that well-crafted JPEG compression can actually improve the performance of deep learning (DL). Inspired by this, we propose JPEG-DL, a novel DL framework that prepends any underlying DNN architecture with… ▽ More

    Submitted 20 March, 2025; v1 submitted 9 October, 2024; originally announced October 2024.

    Journal ref: The Thirteenth International Conference on Learning Representations 2025 (ICLR 2025)

  3. arXiv:2407.18041  [pdf, other

    cs.LG

    How to Train the Teacher Model for Effective Knowledge Distillation

    Authors: Shayan Mohajer Hamidi, Xizhen Deng, Renhao Tan, Linfeng Ye, Ahmed Hussein Salamah

    Abstract: Recently, it was shown that the role of the teacher in knowledge distillation (KD) is to provide the student with an estimate of the true Bayes conditional probability density (BCPD). Notably, the new findings propose that the student's error rate can be upper-bounded by the mean squared error (MSE) between the teacher's output and BCPD. Consequently, to enhance KD efficacy, the teacher should be… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: The paper was accepted at ECCV2024

  4. arXiv:2104.08314  [pdf, other

    cs.CV

    High Performance Convolution Using Sparsity and Patterns for Inference in Deep Convolutional Neural Networks

    Authors: Hossam Amer, Ahmed H. Salamah, Ahmad Sajedi, En-hui Yang

    Abstract: Deploying deep Convolutional Neural Networks (CNNs) is impacted by their memory footprint and speed requirements, which mainly come from convolution. Widely-used convolution algorithms, im2col and MEC, produce a lowered matrix from an activation map by redundantly storing the map's elements included at horizontal and/or vertical kernel overlappings without considering the sparsity of the map. Usin… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: 34 pages