Skip to main content

Showing 1–4 of 4 results for author: Thakur, U

.
  1. arXiv:2506.22977  [pdf, ps, other

    cs.CL cs.LG

    On the Generalizability of "Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals"

    Authors: Asen Dotsinski, Udit Thakur, Marko Ivanov, Mohammad Hafeez Khan, Maria Heuss

    Abstract: We present a reproduction study of "Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals" (Ortu et al., 2024), which investigates competition of mechanisms in language models between factual recall and counterfactual in-context repetition. Our study successfully reproduces their primary findings regarding the localization of factual and counterfactual information… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

    Comments: 22 pages, 25 figures. For an interactive dashboard with all figures, see https://comp-mech-generalizability.streamlit.app/ . For the accompanying code, see https://github.com/asendotsinski/comp-mech-generalizability . To be published in proceedings of the 2025 Machine Learning Reproducibility Challenge

    Journal ref: TMLR (2835-8856) 2025

  2. arXiv:2208.09662  [pdf, other

    cs.CV

    Offline Handwritten Mathematical Recognition using Adversarial Learning and Transformers

    Authors: Ujjwal Thakur, Anuj Sharma

    Abstract: Offline Handwritten Mathematical Expression Recognition (HMER) is a major area in the field of mathematical expression recognition. Offline HMER is often viewed as a much harder problem as compared to online HMER due to a lack of temporal information and variability in writing style. In this paper, we purpose a encoder-decoder model that uses paired adversarial learning. Semantic-invariant feature… ▽ More

    Submitted 20 August, 2022; originally announced August 2022.

  3. arXiv:2112.01401  [pdf, other

    cs.LG cs.AI

    Newton methods based convolution neural networks using parallel processing

    Authors: Ujjwal Thakur, Anuj Sharma

    Abstract: Training of convolutional neural networks is a high dimensional and a non-convex optimization problem. At present, it is inefficient in situations where parametric learning rates can not be confidently set. Some past works have introduced Newton methods for training deep neural networks. Newton methods for convolutional neural networks involve complicated operations. Finding the Hessian matrix in… ▽ More

    Submitted 5 April, 2023; v1 submitted 2 December, 2021; originally announced December 2021.

  4. arXiv:2104.00138  [pdf, other

    eess.IV cs.CV cs.LG

    Rapid quantification of COVID-19 pneumonia burden from computed tomography with convolutional LSTM networks

    Authors: Kajetan Grodecki, Aditya Killekar, Andrew Lin, Sebastien Cadet, Priscilla McElhinney, Aryabod Razipour, Cato Chan, Barry D. Pressman, Peter Julien, Judit Simon, Pal Maurovich-Horvat, Nicola Gaibazzi, Udit Thakur, Elisabetta Mancini, Cecilia Agalbato, Jiro Munechika, Hidenari Matsumoto, Roberto Menè, Gianfranco Parati, Franco Cernigliaro, Nitesh Nerlekar, Camilla Torlasco, Gianluca Pontone, Damini Dey, Piotr J. Slomka

    Abstract: Quantitative lung measures derived from computed tomography (CT) have been demonstrated to improve prognostication in coronavirus disease (COVID-19) patients, but are not part of the clinical routine since required manual segmentation of lung lesions is prohibitively time-consuming. We propose a new fully automated deep learning framework for rapid quantification and differentiation between lung l… ▽ More

    Submitted 16 July, 2021; v1 submitted 31 March, 2021; originally announced April 2021.

    Comments: Fixed some typing mistakes in v2. No other results changed