-
Joint Flow And Feature Refinement Using Attention For Video Restoration
Authors:
Ranjith Merugu,
Mohammad Sameer Suhail,
Akshay P Sarashetti,
Venkata Bharath Reddy Reddem,
Pankaj Kumar Bajpai,
Amit Satish Unde
Abstract:
Recent advancements in video restoration have focused on recovering high-quality video frames from low-quality inputs. Compared with static images, the performance of video restoration significantly depends on efficient exploitation of temporal correlations among successive video frames. The numerous techniques make use of temporal information via flow-based strategies or recurrent architectures.…
▽ More
Recent advancements in video restoration have focused on recovering high-quality video frames from low-quality inputs. Compared with static images, the performance of video restoration significantly depends on efficient exploitation of temporal correlations among successive video frames. The numerous techniques make use of temporal information via flow-based strategies or recurrent architectures. However, these methods often encounter difficulties in preserving temporal consistency as they utilize degraded input video frames. To resolve this issue, we propose a novel video restoration framework named Joint Flow and Feature Refinement using Attention (JFFRA). The proposed JFFRA is based on key philosophy of iteratively enhancing data through the synergistic collaboration of flow (alignment) and restoration. By leveraging previously enhanced features to refine flow and vice versa, JFFRA enables efficient feature enhancement using temporal information. This interplay between flow and restoration is executed at multiple scales, reducing the dependence on precise flow estimation. Moreover, we incorporate an occlusion-aware temporal loss function to enhance the network's capability in eliminating flickering artifacts. Comprehensive experiments validate the versatility of JFFRA across various restoration tasks such as denoising, deblurring, and super-resolution. Our method demonstrates a remarkable performance improvement of up to 1.62 dB compared to state-of-the-art approaches.
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
Multilingual Test-Time Scaling via Initial Thought Transfer
Authors:
Prasoon Bajpai,
Tanmoy Chakraborty
Abstract:
Test-time scaling has emerged as a widely adopted inference-time strategy for boosting reasoning performance. However, its effectiveness has been studied almost exclusively in English, leaving its behavior in other languages largely unexplored. We present the first systematic study of test-time scaling in multilingual settings, evaluating DeepSeek-R1-Distill-LLama-8B and DeepSeek-R1-Distill-Qwen-7…
▽ More
Test-time scaling has emerged as a widely adopted inference-time strategy for boosting reasoning performance. However, its effectiveness has been studied almost exclusively in English, leaving its behavior in other languages largely unexplored. We present the first systematic study of test-time scaling in multilingual settings, evaluating DeepSeek-R1-Distill-LLama-8B and DeepSeek-R1-Distill-Qwen-7B across both high- and low-resource Latin-script languages. Our findings reveal that the relative gains from test-time scaling vary significantly across languages. Additionally, models frequently switch to English mid-reasoning, even when operating under strictly monolingual prompts. We further show that low-resource languages not only produce initial reasoning thoughts that differ significantly from English but also have lower internal consistency across generations in their early reasoning. Building on our findings, we introduce MITT (Multilingual Initial Thought Transfer), an unsupervised and lightweight reasoning prefix-tuning approach that transfers high-resource reasoning prefixes to enhance test-time scaling across all languages, addressing inconsistencies in multilingual reasoning performance. MITT significantly boosts DeepSeek-R1-Distill-Qwen-7B's reasoning performance, especially for underrepresented languages.
△ Less
Submitted 21 May, 2025;
originally announced May 2025.
-
Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks
Authors:
Amey Hengle,
Prasoon Bajpai,
Soham Dan,
Tanmoy Chakraborty
Abstract:
Existing multilingual long-context benchmarks, often based on the popular needle-in-a-haystack test, primarily evaluate a model's ability to locate specific information buried within irrelevant texts. However, such a retrieval-centric approach is myopic and inherently limited, as successful recall alone does not indicate a model's capacity to reason over extended contexts. Moreover, these benchmar…
▽ More
Existing multilingual long-context benchmarks, often based on the popular needle-in-a-haystack test, primarily evaluate a model's ability to locate specific information buried within irrelevant texts. However, such a retrieval-centric approach is myopic and inherently limited, as successful recall alone does not indicate a model's capacity to reason over extended contexts. Moreover, these benchmarks are susceptible to data leakage, short-circuiting, and risk making the evaluation a priori identifiable. To address these limitations, we introduce MLRBench, a new synthetic benchmark for multilingual long-context reasoning. Unlike existing benchmarks, MLRBench goes beyond surface-level retrieval by including tasks that assess multi-hop inference, aggregation, and epistemic reasoning. Spanning seven languages, MLRBench is designed to be parallel, resistant to leakage, and scalable to arbitrary context lengths. Our extensive experiments with an open-weight large language model (LLM) reveal a pronounced gap between high- and low-resource languages, particularly for tasks requiring the model to aggregate multiple facts or predict the absence of information. We also find that, in multilingual settings, LLMs effectively utilize less than 30% of their claimed context length. Although off-the-shelf Retrieval Augmented Generation helps alleviate this to a certain extent, it does not solve the long-context problem. We open-source MLRBench to enable future research in improved evaluation and training of multilingual LLMs.
△ Less
Submitted 17 April, 2025;
originally announced April 2025.
-
Information Anxiety in Large Language Models
Authors:
Prasoon Bajpai,
Sarah Masud,
Tanmoy Chakraborty
Abstract:
Large Language Models (LLMs) have demonstrated strong performance as knowledge repositories, enabling models to understand user queries and generate accurate and context-aware responses. Extensive evaluation setups have corroborated the positive correlation between the retrieval capability of LLMs and the frequency of entities in their pretraining corpus. We take the investigation further by condu…
▽ More
Large Language Models (LLMs) have demonstrated strong performance as knowledge repositories, enabling models to understand user queries and generate accurate and context-aware responses. Extensive evaluation setups have corroborated the positive correlation between the retrieval capability of LLMs and the frequency of entities in their pretraining corpus. We take the investigation further by conducting a comprehensive analysis of the internal reasoning and retrieval mechanisms of LLMs. Our work focuses on three critical dimensions - the impact of entity popularity, the models' sensitivity to lexical variations in query formulation, and the progression of hidden state representations across LLM layers. Our preliminary findings reveal that popular questions facilitate early convergence of internal states toward the correct answer. However, as the popularity of a query increases, retrieved attributes across lexical variations become increasingly dissimilar and less accurate. Interestingly, we find that LLMs struggle to disentangle facts, grounded in distinct relations, from their parametric memory when dealing with highly popular subjects. Through a case study, we explore these latent strains within LLMs when processing highly popular queries, a phenomenon we term information anxiety. The emergence of information anxiety in LLMs underscores the adversarial injection in the form of linguistic variations and calls for a more holistic evaluation of frequently occurring entities.
△ Less
Submitted 16 November, 2024;
originally announced November 2024.
-
Effective approximation to complex algebraic numbers by quadratic numbers
Authors:
Prajeet Bajpai,
Yann Bugeaud
Abstract:
We establish an effective improvement on the Liouville inequality for approximation to complex non-real algebraic numbers by quadratic complex algebraic numbers.
We establish an effective improvement on the Liouville inequality for approximation to complex non-real algebraic numbers by quadratic complex algebraic numbers.
△ Less
Submitted 14 November, 2024;
originally announced November 2024.
-
Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science Communicators
Authors:
Prasoon Bajpai,
Niladri Chatterjee,
Subhabrata Dutta,
Tanmoy Chakraborty
Abstract:
Large Language Models (LLMs) and AI assistants driven by these models are experiencing exponential growth in usage among both expert and amateur users. In this work, we focus on evaluating the reliability of current LLMs as science communicators. Unlike existing benchmarks, our approach emphasizes assessing these models on scientific questionanswering tasks that require a nuanced understanding and…
▽ More
Large Language Models (LLMs) and AI assistants driven by these models are experiencing exponential growth in usage among both expert and amateur users. In this work, we focus on evaluating the reliability of current LLMs as science communicators. Unlike existing benchmarks, our approach emphasizes assessing these models on scientific questionanswering tasks that require a nuanced understanding and awareness of answerability. We introduce a novel dataset, SCiPS-QA, comprising 742 Yes/No queries embedded in complex scientific concepts, along with a benchmarking suite that evaluates LLMs for correctness and consistency across various criteria. We benchmark three proprietary LLMs from the OpenAI GPT family and 13 open-access LLMs from the Meta Llama-2, Llama-3, and Mistral families. While most open-access models significantly underperform compared to GPT-4 Turbo, our experiments identify Llama-3-70B as a strong competitor, often surpassing GPT-4 Turbo in various evaluation aspects. We also find that even the GPT models exhibit a general incompetence in reliably verifying LLM responses. Moreover, we observe an alarming trend where human evaluators are deceived by incorrect responses from GPT-4 Turbo.
△ Less
Submitted 21 September, 2024;
originally announced September 2024.
-
Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language Models
Authors:
Amey Hengle,
Prasoon Bajpai,
Soham Dan,
Tanmoy Chakraborty
Abstract:
While recent large language models (LLMs) demonstrate remarkable abilities in responding to queries in diverse languages, their ability to handle long multilingual contexts is unexplored. As such, a systematic evaluation of the long-context capabilities of LLMs in multilingual settings is crucial, specifically in the context of information retrieval. To address this gap, we introduce the MultiLing…
▽ More
While recent large language models (LLMs) demonstrate remarkable abilities in responding to queries in diverse languages, their ability to handle long multilingual contexts is unexplored. As such, a systematic evaluation of the long-context capabilities of LLMs in multilingual settings is crucial, specifically in the context of information retrieval. To address this gap, we introduce the MultiLingual Needle-in-a-Haystack (MLNeedle) test, designed to assess a model's ability to retrieve relevant information (the needle) from a collection of multilingual distractor texts (the haystack). This test serves as an extension of the multilingual question-answering task, encompassing both monolingual and cross-lingual retrieval. We evaluate four state-of-the-art LLMs on MLNeedle. Our findings reveal that model performance can vary significantly with language and needle position. Specifically, we observe that model performance is the lowest when the needle is (i) in a language outside the English language family and (ii) located in the middle of the input context. Furthermore, although some models claim a context size of $8k$ tokens or greater, none demonstrate satisfactory cross-lingual retrieval performance as the context length increases. Our analysis provides key insights into the long-context behavior of LLMs in multilingual settings to guide future evaluation protocols. To our knowledge, this is the first study to investigate the multilingual long-context behavior of LLMs.
△ Less
Submitted 19 August, 2024;
originally announced August 2024.
-
An annotated bibliography for comparative prime number theory
Authors:
Greg Martin,
Pu Justin Scarfy Yang,
Aram Bahrini,
Prajeet Bajpai,
Kübra Benli,
Jenna Downey,
Yuan Yuan Li,
Xiaoxuan Liang,
Amir Parvardi,
Reginald Simpson,
Ethan Patrick White,
Chi Hoi Yip
Abstract:
The goal of this annotated bibliography is to record every publication on the topic of comparative prime number theory (through mid-2024) together with a summary of its results. We use a unified system of notation for the quantities being studied and for the hypotheses under which results are obtained.
The goal of this annotated bibliography is to record every publication on the topic of comparative prime number theory (through mid-2024) together with a summary of its results. We use a unified system of notation for the quantities being studied and for the hypotheses under which results are obtained.
△ Less
Submitted 11 December, 2024; v1 submitted 15 September, 2023;
originally announced September 2023.
-
Effective $S$-unit Equations Beyond 3 Terms : Newman's Conjecture
Authors:
Prajeet Bajpai,
Michael A. Bennett
Abstract:
We show how to effectively solve 5-term $S$-unit equations when the set of primes $S$ has cardinality at most 3, and use this to provide an explicit answer to an old question of D.J. Newman on representations of integers as sums of $S$-units.
We show how to effectively solve 5-term $S$-unit equations when the set of primes $S$ has cardinality at most 3, and use this to provide an explicit answer to an old question of D.J. Newman on representations of integers as sums of $S$-units.
△ Less
Submitted 9 August, 2023;
originally announced August 2023.
-
Distributed Coordination of Multi-Microgrids in Active Distribution Networks for Provisioning Ancillary Services
Authors:
Arghya Mallick,
Abhishek Mishra,
Ashish R. Hota,
Prabodh Bajpai
Abstract:
With the phenomenal growth in renewable energy generation, the conventional synchronous generator-based power plants are gradually getting replaced by renewable energy sources-based microgrids. Such transition gives rise to the challenges of procuring various ancillary services from microgrids. We propose a distributed optimization framework that coordinates multiple microgrids in an active distri…
▽ More
With the phenomenal growth in renewable energy generation, the conventional synchronous generator-based power plants are gradually getting replaced by renewable energy sources-based microgrids. Such transition gives rise to the challenges of procuring various ancillary services from microgrids. We propose a distributed optimization framework that coordinates multiple microgrids in an active distribution network for provisioning passive voltage support-based ancillary services while satisfying operational constraints. Specifically, we exploit the reactive power support capability of the inverters and the flexibility offered by storage systems available with microgrids for provisioning ancillary service support to the transmission grid. We develop novel mixed-integer inequalities to represent the set of feasible active and reactive power exchange with the transmission grid that ensures passive voltage support. The proposed alternating direction method of multipliers-based algorithm is fully distributed, and does not require the presence of a centralized entity to achieve coordination among the microgrids. We present detailed numerical results on the IEEE 33-bus distribution test system to demonstrate the effectiveness of the proposed approach and examine the scalability and convergence behavior of the distributed algorithm for different choice of hyperparameters and network sizes.
△ Less
Submitted 2 July, 2024; v1 submitted 8 May, 2023;
originally announced May 2023.
-
MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results
Authors:
Qingpeng Zhu,
Wenxiu Sun,
Yuekun Dai,
Chongyi Li,
Shangchen Zhou,
Ruicheng Feng,
Qianhui Sun,
Chen Change Loy,
Jinwei Gu,
Yi Yu,
Yangke Huang,
Kang Zhang,
Meiya Chen,
Yu Wang,
Yongchao Li,
Hao Jiang,
Amrit Kumar Muduli,
Vikash Kumar,
Kunal Swami,
Pankaj Kumar Bajpai,
Yunchao Ma,
Jiajun Xiao,
Zhi Ling
Abstract:
Depth completion from RGB images and sparse Time-of-Flight (ToF) measurements is an important problem in computer vision and robotics. While traditional methods for depth completion have relied on stereo vision or structured light techniques, recent advances in deep learning have enabled more accurate and efficient completion of depth maps from RGB images and sparse ToF measurements. To evaluate t…
▽ More
Depth completion from RGB images and sparse Time-of-Flight (ToF) measurements is an important problem in computer vision and robotics. While traditional methods for depth completion have relied on stereo vision or structured light techniques, recent advances in deep learning have enabled more accurate and efficient completion of depth maps from RGB images and sparse ToF measurements. To evaluate the performance of different depth completion methods, we organized an RGB+sparse ToF depth completion competition. The competition aimed to encourage research in this area by providing a standardized dataset and evaluation metrics to compare the accuracy of different approaches. In this report, we present the results of the competition and analyze the strengths and weaknesses of the top-performing methods. We also discuss the implications of our findings for future research in RGB+sparse ToF depth completion. We hope that this competition and report will help to advance the state-of-the-art in this important area of research. More details of this challenge and the link to the dataset can be found at https://mipi-challenge.org/MIPI2023.
△ Less
Submitted 26 April, 2023;
originally announced April 2023.
-
Effective Approximation to Complex Algebraic Numbers by Algebraic Numbers of Bounded Degree
Authors:
Prajeet Bajpai,
Yann Bugeaud
Abstract:
We establish the first effective improvements on the Liouville inequality for approximation to complex non-real algebraic numbers by complex algebraic numbers of degree at most 4.
We establish the first effective improvements on the Liouville inequality for approximation to complex non-real algebraic numbers by complex algebraic numbers of degree at most 4.
△ Less
Submitted 14 November, 2024; v1 submitted 10 February, 2023;
originally announced February 2023.
-
Arithmetic Progressions in Squarefull Numbers
Authors:
Prajeet Bajpai,
Michael A. Bennett,
Tsz Ho Chan
Abstract:
We answer a number of questions of Erdős on the existence of arithmetic progressions in $k$-full numbers (i.e. integers with the property that every prime divisor necessarily occurs to at least the $k$-th power). Further, we deduce a variety of arithmetic constraints upon such progressions, under the assumption of the $abc$-conjecture of Masser and Oesterlé.
We answer a number of questions of Erdős on the existence of arithmetic progressions in $k$-full numbers (i.e. integers with the property that every prime divisor necessarily occurs to at least the $k$-th power). Further, we deduce a variety of arithmetic constraints upon such progressions, under the assumption of the $abc$-conjecture of Masser and Oesterlé.
△ Less
Submitted 6 February, 2023;
originally announced February 2023.
-
Effective Methods for Norm-Form Equations
Authors:
Prajeet Bajpai
Abstract:
While effective resolution of Thue equations has been well understood since the work of Baker in the 1960s, similar results for norm-form equations in more than two variables have proven difficult to achieve. In 1983, Vojta was able to address the case of three variables over totally complex and Galois number fields. In this paper, we extend his results to effectively resolve several new classes o…
▽ More
While effective resolution of Thue equations has been well understood since the work of Baker in the 1960s, similar results for norm-form equations in more than two variables have proven difficult to achieve. In 1983, Vojta was able to address the case of three variables over totally complex and Galois number fields. In this paper, we extend his results to effectively resolve several new classes of norm-form equations. In particular, we completely and effectively settle the question of norm-form equations over totally complex Galois sextic fields.
△ Less
Submitted 3 November, 2022; v1 submitted 26 December, 2021;
originally announced December 2021.
-
A note on pencils of norm-form equations
Authors:
Prajeet Bajpai,
Michael A. Bennett
Abstract:
We find all solutions to the parametrized family of norm-form equations $x^3-(t^3-1)y^3+3(t^3-1)xy+(t^3-1)^2 = \pm 1$ studied by Amoroso, Masser and Zannier. Our proof relies upon an appeal to lower bounds for linear forms in logarithms and various elementary arguments.
We find all solutions to the parametrized family of norm-form equations $x^3-(t^3-1)y^3+3(t^3-1)xy+(t^3-1)^2 = \pm 1$ studied by Amoroso, Masser and Zannier. Our proof relies upon an appeal to lower bounds for linear forms in logarithms and various elementary arguments.
△ Less
Submitted 9 April, 2021;
originally announced April 2021.
-
AcED: Accurate and Edge-consistent Monocular Depth Estimation
Authors:
Kunal Swami,
Prasanna Vishnu Bondada,
Pankaj Kumar Bajpai
Abstract:
Single image depth estimation is a challenging problem. The current state-of-the-art method formulates the problem as that of ordinal regression. However, the formulation is not fully differentiable and depth maps are not generated in an end-to-end fashion. The method uses a naïve threshold strategy to determine per-pixel depth labels, which results in significant discretization errors. For the fi…
▽ More
Single image depth estimation is a challenging problem. The current state-of-the-art method formulates the problem as that of ordinal regression. However, the formulation is not fully differentiable and depth maps are not generated in an end-to-end fashion. The method uses a naïve threshold strategy to determine per-pixel depth labels, which results in significant discretization errors. For the first time, we formulate a fully differentiable ordinal regression and train the network in end-to-end fashion. This enables us to include boundary and smoothness constraints in the optimization function, leading to smooth and edge-consistent depth maps. A novel per-pixel confidence map computation for depth refinement is also proposed. Extensive evaluation of the proposed model on challenging benchmarks reveals its superiority over recent state-of-the-art methods, both quantitatively and qualitatively. Additionally, we demonstrate practical utility of the proposed method for single camera bokeh solution using in-house dataset of challenging real-life images.
△ Less
Submitted 16 June, 2020;
originally announced June 2020.
-
DISCO: Depth Inference from Stereo using Context
Authors:
Kunal Swami,
Kaushik Raghavan,
Nikhilanj Pelluri,
Rituparna Sarkar,
Pankaj Bajpai
Abstract:
Recent deep learning based approaches have outperformed classical stereo matching methods. However, current deep learning based end-to-end stereo matching methods adopt a generic encoder-decoder style network with skip connections. To limit computational requirement, many networks perform excessive down sampling, which results in significant loss of useful low-level information. Additionally, many…
▽ More
Recent deep learning based approaches have outperformed classical stereo matching methods. However, current deep learning based end-to-end stereo matching methods adopt a generic encoder-decoder style network with skip connections. To limit computational requirement, many networks perform excessive down sampling, which results in significant loss of useful low-level information. Additionally, many network designs do not exploit the rich multi-scale contextual information. In this work, we address these aforementioned problems by carefully designing the network architecture to preserve required spatial information throughout the network, while at the same time achieve large effective receptive field to extract multiscale contextual information. For the first time, we create a synthetic disparity dataset reflecting real life images captured using a smartphone; this enables us to obtain state-of-the-art results on common real life images. The proposed model DISCO is pre-trained on the synthetic Scene Flow dataset and evaluated on popular benchmarks and our in-house dataset of challenging real life images. The proposed model outperforms existing state-of-the-art methods in terms of quality as well as quantitative metrics.
△ Less
Submitted 31 May, 2019;
originally announced June 2019.
-
Virtual Reality: Blessings and Risk Assessment
Authors:
Ayush Sharma,
Piyush Bajpai,
Sukhdev Singh,
Kiran Khatter
Abstract:
Objectives: This paper presents an up-to-date overview of research performed in the Virtual Reality (VR) environment ranging from definitions, its presence in the various fields, and existing market players and their projects in the VR technology. Further an attempt is made to gain an insight on the psychological mechanism underlying experience in using VR device. Methods: Our literature survey is…
▽ More
Objectives: This paper presents an up-to-date overview of research performed in the Virtual Reality (VR) environment ranging from definitions, its presence in the various fields, and existing market players and their projects in the VR technology. Further an attempt is made to gain an insight on the psychological mechanism underlying experience in using VR device. Methods: Our literature survey is based on the research articles, analysis of the projects of various companies and their findings for different areas of interest. Findings: In our literature survey we observed that the recent advances in virtual reality enabling technologies have led to variety of virtual devices that facilitate people to interact with the digital world. In fact in the past two decades researchers have tried to integrate reality and VR in the form of intuitive computer interface. Improvements: This has led to variety of potential benefits of VR in many applications such as News, Healthcare, Entertainment, Tourism, Military and Defence etc. However despite the extensive research efforts in creating virtual system environments it is yet to become apparent in normal daily life.
△ Less
Submitted 30 August, 2017;
originally announced August 2017.
-
Equivariant division
Authors:
Prajeet Bajpai,
Peter G. Doyle
Abstract:
Let C be a non-empty finite set, and Gamma a subgroup of the symmetric group S(C). Given a bijection f:A cross C to B cross C, the problem of Gamma-equivariant division is to find a quotient bijection h:A to B respecting whatever symmetries f may have under the action of S(A) cross S(B) cross Gamma. Say that Gamma is fully cancelling if this is possible for any f, and finitely cancelling if it is…
▽ More
Let C be a non-empty finite set, and Gamma a subgroup of the symmetric group S(C). Given a bijection f:A cross C to B cross C, the problem of Gamma-equivariant division is to find a quotient bijection h:A to B respecting whatever symmetries f may have under the action of S(A) cross S(B) cross Gamma. Say that Gamma is fully cancelling if this is possible for any f, and finitely cancelling if it is possible providing A,B are finite. Feldman and Propp showed that a permutation group is finitely cancelling just if it has a globally fixed point. We show here that a permutation group is fully cancelling just if it is trivial. This sheds light on the fact that all known division algorithms that eschew the Axiom of Choice depend on fixing an ordering for the elements of C.
△ Less
Submitted 13 April, 2017;
originally announced April 2017.