
Showing 1–3 of 3 results for author: Nayak, N S

  1. arXiv:2504.07097  [pdf, other]

    cs.LG cs.AI cs.CL math.PR stat.ML

    Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning

    Authors: Nikhil Shivakumar Nayak, Krishnateja Killamsetty, Ligong Han, Abhishek Bhandwaldar, Prateek Chanda, Kai Xu, Hao Wang, Aldo Pareja, Oleg Silkin, Mustafa Eyceoz, Akash Srivastava

    Abstract: Continual learning in large language models (LLMs) is prone to catastrophic forgetting, where adapting to new tasks significantly degrades performance on previously learned ones. Existing methods typically rely on low-rank, parameter-efficient updates that limit the model's expressivity and introduce additional parameters per task, leading to scalability issues. To address these limitations, we pr…

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: 25 pages, 13 figures, 6 tables

    MSC Class: 68T50 ACM Class: I.2.0; G.3

  2. arXiv:2504.03175  [pdf, other]

    math.NA cs.LG math.PR q-fin.CP

    Mathematical Modeling of Option Pricing with an Extended Black-Scholes Framework

    Authors: Nikhil Shivakumar Nayak

    Abstract: This study investigates enhancing option pricing by extending the Black-Scholes model to include stochastic volatility and interest rate variability within the Partial Differential Equation (PDE). The PDE is solved using the finite difference method. The extended Black-Scholes model and a machine learning-based LSTM model are developed and evaluated for pricing Google stock options. Both models we…

    Submitted 13 April, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

    Comments: 7 pages, 3 figures

    MSC Class: 60G07 ACM Class: G.1.0; G.1.8; G.1.7; G.3; I.2.0

  3. arXiv:2504.02938  [pdf, other]

    cs.LG cs.AI cs.DM math.DG stat.ML

    Graph Attention for Heterogeneous Graphs with Positional Encoding

    Authors: Nikhil Shivakumar Nayak

    Abstract: Graph Neural Networks (GNNs) have emerged as the de facto standard for modeling graph data, with attention mechanisms and transformers significantly enhancing their performance on graph-based tasks. Despite these advancements, the performance of GNNs on heterogeneous graphs often remains complex, with networks generally underperforming compared to their homogeneous counterparts. This work benchmar…

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: 10 pages, 3 figures

    MSC Class: 53-02 ACM Class: G.2.2; I.2.0; I.2.4; G.3