Skip to main content

Showing 1–12 of 12 results for author: Mortaheb, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.04695  [pdf, other

    cs.LG cs.CV cs.IR cs.IT

    Re-ranking the Context for Multimodal Retrieval Augmented Generation

    Authors: Matin Mortaheb, Mohammad A. Amir Khojastepour, Srimat T. Chakradhar, Sennur Ulukus

    Abstract: Retrieval-augmented generation (RAG) enhances large language models (LLMs) by incorporating external knowledge to generate a response within a context with improved accuracy and reduced hallucinations. However, multi-modal RAG systems face unique challenges: (i) the retrieval process may select irrelevant entries to user query (e.g., images, documents), and (ii) vision-language models or multi-mod… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

  2. arXiv:2501.03995  [pdf, other

    cs.LG cs.CV cs.IR cs.IT

    RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance

    Authors: Matin Mortaheb, Mohammad A. Amir Khojastepour, Srimat T. Chakradhar, Sennur Ulukus

    Abstract: Retrieval-augmented generation (RAG) improves large language models (LLMs) by using external knowledge to guide response generation, reducing hallucinations. However, RAG, particularly multi-modal RAG, can introduce new hallucination sources: (i) the retrieval process may select irrelevant pieces (e.g., documents, images) as raw context from the database, and (ii) retrieved images are processed in… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  3. arXiv:2412.01817  [pdf, other

    cs.LG cs.CV cs.IT eess.SP

    Efficient Semantic Communication Through Transformer-Aided Compression

    Authors: Matin Mortaheb, Mohammad A. Amir Khojastepour, Sennur Ulukus

    Abstract: Transformers, known for their attention mechanisms, have proven highly effective in focusing on critical elements within complex data. This feature can effectively be used to address the time-varying channels in wireless communication systems. In this work, we introduce a channel-aware adaptive framework for semantic communication, where different regions of the image are encoded and compressed ba… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

  4. arXiv:2410.22192  [pdf, other

    cs.LG cs.IT eess.SP stat.ML

    $r$Age-$k$: Communication-Efficient Federated Learning Using Age Factor

    Authors: Matin Mortaheb, Priyanka Kaswan, Sennur Ulukus

    Abstract: Federated learning (FL) is a collaborative approach where multiple clients, coordinated by a parameter server (PS), train a unified machine-learning model. The approach, however, suffers from two key challenges: data heterogeneity and communication overhead. Data heterogeneity refers to inconsistencies in model training arising from heterogeneous data at different clients. Communication overhead a… ▽ More

    Submitted 29 October, 2024; originally announced October 2024.

  5. arXiv:2409.16285  [pdf, other

    cs.IT cs.NI eess.SP eess.SY

    Age of Gossip in Networks with Multiple Views of a Source

    Authors: Kian J. Khojastepour, Matin Mortaheb, Sennur Ulukus

    Abstract: We consider the version age of information (AoI) in a network where a subset of nodes act as sensing nodes, sampling a source that in general can follow a continuous distribution. Any sample of the source constitutes a new version of the information and the version age of the information is defined with respect to the most recent version of the information available for the whole network. We deriv… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

  6. arXiv:2405.01521  [pdf, other

    cs.CV cs.IT cs.LG eess.SP

    Transformer-Aided Semantic Communications

    Authors: Matin Mortaheb, Erciyes Karakaya, Mohammad A. Amir Khojastepour, Sennur Ulukus

    Abstract: The transformer structure employed in large language models (LLMs), as a specialized category of deep neural networks (DNNs) featuring attention mechanisms, stands out for their ability to identify and highlight the most relevant aspects of input data. Such a capability is particularly beneficial in addressing a variety of communication challenges, notably in the realm of semantic communication wh… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  7. arXiv:2311.12918  [pdf, other

    eess.IV cs.IT cs.LG cs.NI eess.SY

    Deep Learning-Based Real-Time Quality Control of Standard Video Compression for Live Streaming

    Authors: Matin Mortaheb, Mohammad A. Amir Khojastepour, Srimat T. Chakradhar, Sennur Ulukus

    Abstract: Ensuring high-quality video content for wireless users has become increasingly vital. Nevertheless, maintaining a consistent level of video quality faces challenges due to the fluctuating encoded bitrate, primarily caused by dynamic video content, especially in live streaming scenarios. Video compression is typically employed to eliminate unnecessary redundancies within and between video frames, t… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2310.06857

  8. arXiv:2310.06857  [pdf, other

    cs.NI cs.IT cs.LG eess.SP eess.SY

    Deep Learning-Based Real-Time Rate Control for Live Streaming on Wireless Networks

    Authors: Matin Mortaheb, Mohammad A. Amir Khojastepour, Srimat T. Chakradhar, Sennur Ulukus

    Abstract: Providing wireless users with high-quality video content has become increasingly important. However, ensuring consistent video quality poses challenges due to variable encoded bitrate caused by dynamic video content and fluctuating channel bitrate caused by wireless fading effects. Suboptimal selection of encoder parameters can lead to video quality loss due to underutilized bandwidth or the intro… ▽ More

    Submitted 27 September, 2023; originally announced October 2023.

  9. arXiv:2308.11604  [pdf, other

    cs.LG cs.IT eess.SP

    Semantic Multi-Resolution Communications

    Authors: Matin Mortaheb, Mohammad A. Amir Khojastepour, Srimat T. Chakradhar, Sennur Ulukus

    Abstract: Deep learning based joint source-channel coding (JSCC) has demonstrated significant advancements in data reconstruction compared to separate source-channel coding (SSCC). This superiority arises from the suboptimality of SSCC when dealing with finite block-length data. Moreover, SSCC falls short in reconstructing data in a multi-user and/or multi-resolution fashion, as it only tries to satisfy the… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  10. arXiv:2212.11268  [pdf, ps, other

    cs.LG cs.IT eess.SP

    Personalized Decentralized Multi-Task Learning Over Dynamic Communication Graphs

    Authors: Matin Mortaheb, Sennur Ulukus

    Abstract: Decentralized and federated learning algorithms face data heterogeneity as one of the biggest challenges, especially when users want to learn a specific task. Even when personalized headers are used concatenated to a shared network (PF-MTL), aggregating all the networks with a decentralized algorithm can result in performance degradation as a result of heterogeneity in the data. Our algorithm uses… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

  11. arXiv:2212.07414  [pdf, ps, other

    cs.LG cs.IT eess.SP stat.ML

    Hierarchical Over-the-Air FedGradNorm

    Authors: Cemil Vahapoglu, Matin Mortaheb, Sennur Ulukus

    Abstract: Multi-task learning (MTL) is a learning paradigm to learn multiple related tasks simultaneously with a single shared network where each task has a distinct personalized header network for fine-tuning. MTL can be integrated into a federated learning (FL) setting if tasks are distributed across clients and clients have a single shared network, leading to personalized federated learning (PFL). To cop… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

  12. arXiv:2203.13663  [pdf, ps, other

    cs.LG cs.IT eess.SP stat.ML

    FedGradNorm: Personalized Federated Gradient-Normalized Multi-Task Learning

    Authors: Matin Mortaheb, Cemil Vahapoglu, Sennur Ulukus

    Abstract: Multi-task learning (MTL) is a novel framework to learn several tasks simultaneously with a single shared network where each task has its distinct personalized header network for fine-tuning. MTL can be implemented in federated learning settings as well, in which tasks are distributed across clients. In federated settings, the statistical heterogeneity due to different task complexities and data h… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.