
Showing 1–3 of 3 results for author: Vij, A

  1. arXiv:2502.02028

    cs.CL cs.AI

    Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study

    Authors: Anneketh Vij, Changhao Liu, Rahul Anil Nair, Theodore Eugene Ho, Edward Shi, Ayan Bhowmick

    Abstract: This research presents an exploration and study of the recipe generation task by fine-tuning various very small language models, with a focus on developing robust evaluation metrics and comparing across different language models the open-ended task of recipe generation. This study presents extensive experiments with multiple model architectures, ranging from T5-small (Raffel et al., 2023) and Smol…

    Submitted 16 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: 18 pages, 10 figures, 14 tables

  2. arXiv:2406.14971

    cs.CL cs.AI cs.LG

    Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation

    Authors: Shamane Siriwardhana, Mark McQuade, Thomas Gauthier, Lucas Atkins, Fernando Fernandes Neto, Luke Meyers, Anneketh Vij, Tyler Odenthal, Charles Goddard, Mary MacCarthy, Jacob Solawetz

    Abstract: We conducted extensive experiments on domain adaptation of the Meta-Llama-3-70B-Instruct model on SEC data, exploring its performance on both general and domain-specific benchmarks. Our focus included continual pre-training (CPT) and model merging, aiming to enhance the model's domain-specific capabilities while mitigating catastrophic forgetting. Through this study, we evaluated the impact of int…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 8 pages, 6 figures

  3. Choice modelling in the age of machine learning -- discussion paper

    Authors: S. Van Cranenburgh, S. Wang, A. Vij, F. Pereira, J. Walker

    Abstract: Since its inception, the choice modelling field has been dominated by theory-driven modelling approaches. Machine learning offers an alternative data-driven approach for modelling choice behaviour and is increasingly drawing interest in our field. Cross-pollination of machine learning models, techniques and practices could help overcome problems and limitations encountered in the current theory-dr…

    Submitted 24 November, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 40 pages, 2 tables, 0 figures

    Journal ref: Journal of Choice Modelling 42 (2022): 100340