Showing 1–1 of 1 results for author: Palavesam, K V

  1. arXiv:2501.14802

    cs.DC cs.LG

    DNN-Powered MLOps Pipeline Optimization for Large Language Models: A Framework for Automated Deployment and Resource Management

    Authors: Mahesh Vaijainthymala Krishnamoorthy, Kuppusamy Vellamadam Palavesam, Siva Venkatesh Arcot, Rajarajeswari Chinniah Kuppuswami

    Abstract: The exponential growth in the size and complexity of Large Language Models (LLMs) has introduced unprecedented challenges in their deployment and operational management. Traditional MLOps approaches often fail to efficiently handle the scale, resource requirements, and dynamic nature of these models. This research presents a novel framework that leverages Deep Neural Networks (DNNs) to optimize ML…

    Submitted 14 January, 2025; originally announced January 2025.

    Comments: 22 pages, 15 figures, submitting to an AI journal