Showing 1–9 of 9 results for author: Chakradhar, S T

  1. arXiv:2501.14101  [pdf, other]

    cs.CV

    StreamingRAG: Real-time Contextual Retrieval and Generation Framework

    Authors: Murugan Sankaradas, Ravi K. Rajendran, Srimat T. Chakradhar

    Abstract: Extracting real-time insights from multi-modal data streams from various domains such as healthcare, intelligent transportation, and satellite remote sensing remains a challenge. High computational demands and limited knowledge scope restrict the applicability of Multi-Modal Large Language Models (MM-LLMs) on these data streams. Traditional Retrieval-Augmented Generation (RAG) systems address know…

    Submitted 23 January, 2025; originally announced January 2025.

    Comments: Accepted and Presented at AI4Sys, HPDC 2024

  2. arXiv:2501.04695  [pdf, other]

    cs.LG cs.CV cs.IR cs.IT

    Re-ranking the Context for Multimodal Retrieval Augmented Generation

    Authors: Matin Mortaheb, Mohammad A. Amir Khojastepour, Srimat T. Chakradhar, Sennur Ulukus

    Abstract: Retrieval-augmented generation (RAG) enhances large language models (LLMs) by incorporating external knowledge to generate a response within a context with improved accuracy and reduced hallucinations. However, multi-modal RAG systems face unique challenges: (i) the retrieval process may select irrelevant entries to user query (e.g., images, documents), and (ii) vision-language models or multi-mod…

    Submitted 8 January, 2025; originally announced January 2025.

  3. arXiv:2501.03995  [pdf, other]

    cs.LG cs.CV cs.IR cs.IT

    RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance

    Authors: Matin Mortaheb, Mohammad A. Amir Khojastepour, Srimat T. Chakradhar, Sennur Ulukus

    Abstract: Retrieval-augmented generation (RAG) improves large language models (LLMs) by using external knowledge to guide response generation, reducing hallucinations. However, RAG, particularly multi-modal RAG, can introduce new hallucination sources: (i) the retrieval process may select irrelevant pieces (e.g., documents, images) as raw context from the database, and (ii) retrieved images are processed in…

    Submitted 7 January, 2025; originally announced January 2025.

  4. arXiv:2311.12918  [pdf, other]

    eess.IV cs.IT cs.LG cs.NI eess.SY

    Deep Learning-Based Real-Time Quality Control of Standard Video Compression for Live Streaming

    Authors: Matin Mortaheb, Mohammad A. Amir Khojastepour, Srimat T. Chakradhar, Sennur Ulukus

    Abstract: Ensuring high-quality video content for wireless users has become increasingly vital. Nevertheless, maintaining a consistent level of video quality faces challenges due to the fluctuating encoded bitrate, primarily caused by dynamic video content, especially in live streaming scenarios. Video compression is typically employed to eliminate unnecessary redundancies within and between video frames, t…

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2310.06857

  5. arXiv:2310.06857  [pdf, other]

    cs.NI cs.IT cs.LG eess.SP eess.SY

    Deep Learning-Based Real-Time Rate Control for Live Streaming on Wireless Networks

    Authors: Matin Mortaheb, Mohammad A. Amir Khojastepour, Srimat T. Chakradhar, Sennur Ulukus

    Abstract: Providing wireless users with high-quality video content has become increasingly important. However, ensuring consistent video quality poses challenges due to variable encoded bitrate caused by dynamic video content and fluctuating channel bitrate caused by wireless fading effects. Suboptimal selection of encoder parameters can lead to video quality loss due to underutilized bandwidth or the intro…

    Submitted 27 September, 2023; originally announced October 2023.

  6. arXiv:2308.11604  [pdf, other]

    cs.LG cs.IT eess.SP

    Semantic Multi-Resolution Communications

    Authors: Matin Mortaheb, Mohammad A. Amir Khojastepour, Srimat T. Chakradhar, Sennur Ulukus

    Abstract: Deep learning based joint source-channel coding (JSCC) has demonstrated significant advancements in data reconstruction compared to separate source-channel coding (SSCC). This superiority arises from the suboptimality of SSCC when dealing with finite block-length data. Moreover, SSCC falls short in reconstructing data in a multi-user and/or multi-resolution fashion, as it only tries to satisfy the…

    Submitted 22 August, 2023; originally announced August 2023.

  7. arXiv:2212.04061  [pdf, other]

    cs.CV cs.MA

    Elixir: A system to enhance data quality for multiple analytics on a video stream

    Authors: Sibendu Paul, Kunal Rao, Giuseppe Coviello, Murugan Sankaradas, Oliver Po, Y. Charlie Hu, Srimat T. Chakradhar

    Abstract: IoT sensors, especially video cameras, are ubiquitously deployed around the world to perform a variety of computer vision tasks in several verticals including retail, healthcare, safety and security, transportation, manufacturing, etc. To amortize their high deployment effort and cost, it is desirable to perform multiple video analytics tasks, which we refer to as Analytical Units (AUs), off the v…

    Submitted 7 December, 2022; originally announced December 2022.

  8. arXiv:2107.03964  [pdf, other]

    cs.LG cs.CV

    Enhancing Video Analytics Accuracy via Real-time Automated Camera Parameter Tuning

    Authors: Sibendu Paul, Kunal Rao, Giuseppe Coviello, Murugan Sankaradas, Oliver Po, Y. Charlie Hu, Srimat T. Chakradhar

    Abstract: In Video Analytics Pipelines (VAP), Analytics Units (AUs) such as object detection and face recognition running on remote servers critically rely on surveillance cameras to capture high-quality video streams in order to achieve high accuracy. Modern IP cameras come with a large number of camera parameters that directly affect the quality of the video stream capture. While a few of such parameters,…

    Submitted 15 September, 2022; v1 submitted 8 July, 2021; originally announced July 2021.

  9. arXiv:2101.09752  [pdf, other]

    eess.IV cs.CV cs.LG

    AQuA: Analytical Quality Assessment for Optimizing Video Analytics Systems

    Authors: Sibendu Paul, Utsav Drolia, Y. Charlie Hu, Srimat T. Chakradhar

    Abstract: Millions of cameras at edge are being deployed to power a variety of different deep learning applications. However, the frames captured by these cameras are not always pristine - they can be distorted due to lighting issues, sensor noise, compression etc. Such distortions not only deteriorate visual quality, they impact the accuracy of deep learning applications that process such video streams. In…

    Submitted 25 October, 2021; v1 submitted 24 January, 2021; originally announced January 2021.