-
DocReRank: Single-Page Hard Negative Query Generation for Training Multi-Modal RAG Rerankers
Authors:
Navve Wasserman,
Oliver Heinimann,
Yuval Golbari,
Tal Zimbalist,
Eli Schwartz,
Michal Irani
Abstract:
Rerankers play a critical role in multimodal Retrieval-Augmented Generation (RAG) by refining ranking of an initial set of retrieved documents. Rerankers are typically trained using hard negative mining, whose goal is to select pages for each query which rank high, but are actually irrelevant. However, this selection process is typically passive and restricted to what the retriever can find in the…
▽ More
Rerankers play a critical role in multimodal Retrieval-Augmented Generation (RAG) by refining ranking of an initial set of retrieved documents. Rerankers are typically trained using hard negative mining, whose goal is to select pages for each query which rank high, but are actually irrelevant. However, this selection process is typically passive and restricted to what the retriever can find in the available corpus, leading to several inherent limitations. These include: limited diversity, negative examples which are often not hard enough, low controllability, and frequent false negatives which harm training. Our paper proposes an alternative approach: Single-Page Hard Negative Query Generation, which goes the other way around. Instead of retrieving negative pages per query, we generate hard negative queries per page. Using an automated LLM-VLM pipeline, and given a page and its positive query, we create hard negatives by rephrasing the query to be as similar as possible in form and context, yet not answerable from the page. This paradigm enables fine-grained control over the generated queries, resulting in diverse, hard, and targeted negatives. It also supports efficient false negative verification. Our experiments show that rerankers trained with data generated using our approach outperform existing models and significantly improve retrieval performance.
△ Less
Submitted 28 May, 2025;
originally announced May 2025.
-
KernelFusion: Assumption-Free Blind Super-Resolution via Patch Diffusion
Authors:
Oliver Heinimann,
Assaf Shocher,
Tal Zimbalist,
Michal Irani
Abstract:
Traditional super-resolution (SR) methods assume an ``ideal'' downscaling SR-kernel (e.g., bicubic downscaling) between the high-resolution (HR) image and the low-resolution (LR) image. Such methods fail once the LR images are generated differently. Current blind-SR methods aim to remove this assumption, but are still fundamentally restricted to rather simplistic downscaling SR-kernels (e.g., anis…
▽ More
Traditional super-resolution (SR) methods assume an ``ideal'' downscaling SR-kernel (e.g., bicubic downscaling) between the high-resolution (HR) image and the low-resolution (LR) image. Such methods fail once the LR images are generated differently. Current blind-SR methods aim to remove this assumption, but are still fundamentally restricted to rather simplistic downscaling SR-kernels (e.g., anisotropic Gaussian kernels), and fail on more complex (out of distribution) downscaling degradations. However, using the correct SR-kernel is often more important than using a sophisticated SR algorithm. In ``KernelFusion'' we introduce a zero-shot diffusion-based method that makes no assumptions about the kernel. Our method recovers the unique image-specific SR-kernel directly from the LR input image, while simultaneously recovering its corresponding HR image. KernelFusion exploits the principle that the correct SR-kernel is the one that maximizes patch similarity across different scales of the LR image. We first train an image-specific patch-based diffusion model on the single LR input image, capturing its unique internal patch statistics. We then reconstruct a larger HR image with the same learned patch distribution, while simultaneously recovering the correct downscaling SR-kernel that maintains this cross-scale relation between the HR and LR images. Empirical results show that KernelFusion vastly outperforms all SR baselines on complex downscaling degradations, where existing SotA Blind-SR methods fail miserably. By breaking free from predefined kernel assumptions, KernelFusion pushes Blind-SR into a new assumption-free paradigm, handling downscaling kernels previously thought impossible.
△ Less
Submitted 27 March, 2025;
originally announced March 2025.
-
Don't Judge Before You CLIP: A Unified Approach for Perceptual Tasks
Authors:
Amit Zalcher,
Navve Wasserman,
Roman Beliy,
Oliver Heinimann,
Michal Irani
Abstract:
Visual perceptual tasks aim to predict human judgment of images (e.g., emotions invoked by images, image quality assessment). Unlike objective tasks such as object/scene recognition, perceptual tasks rely on subjective human assessments, making its data-labeling difficult. The scarcity of such human-annotated data results in small datasets leading to poor generalization. Typically, specialized mod…
▽ More
Visual perceptual tasks aim to predict human judgment of images (e.g., emotions invoked by images, image quality assessment). Unlike objective tasks such as object/scene recognition, perceptual tasks rely on subjective human assessments, making its data-labeling difficult. The scarcity of such human-annotated data results in small datasets leading to poor generalization. Typically, specialized models were designed for each perceptual task, tailored to its unique characteristics and its own training dataset. We propose a unified architectural framework for solving multiple different perceptual tasks leveraging CLIP as a prior. Our approach is based on recent cognitive findings which indicate that CLIP correlates well with human judgment. While CLIP was explicitly trained to align images and text, it implicitly also learned human inclinations. We attribute this to the inclusion of human-written image captions in CLIP's training data, which contain not only factual image descriptions, but inevitably also human sentiments and emotions. This makes CLIP a particularly strong prior for perceptual tasks. Accordingly, we suggest that minimal adaptation of CLIP suffices for solving a variety of perceptual tasks. Our simple unified framework employs a lightweight adaptation to fine-tune CLIP to each task, without requiring any task-specific architectural changes. We evaluate our approach on three tasks: (i) Image Memorability Prediction, (ii) No-reference Image Quality Assessment, and (iii) Visual Emotion Analysis. Our model achieves state-of-the-art results on all three tasks, while demonstrating improved generalization across different datasets.
△ Less
Submitted 17 March, 2025;
originally announced March 2025.
-
Core-collapse supernovae in the hall of mirrors. A three-dimensional code-comparison project
Authors:
Rubén M. Cabezón,
Kuo-Chuan Pan,
Matthias Liebendörfer,
Takami Kuroda,
Kevin Ebinger,
Oliver Heinimann,
Friedrich-Karl Thielemann,
Albino Perego
Abstract:
Modeling core-collapse supernovae (CCSNe) with neutrino transport in three dimensions (3D) requires tremendous computing resources and some level of approximation. We present a first comparison study of CCSNe in 3D with different physics approximations and hydrodynamics codes. We aim to assess the impact of the hydrodynamics code, approximations for the neutrino and gravity treatments, and rotatio…
▽ More
Modeling core-collapse supernovae (CCSNe) with neutrino transport in three dimensions (3D) requires tremendous computing resources and some level of approximation. We present a first comparison study of CCSNe in 3D with different physics approximations and hydrodynamics codes. We aim to assess the impact of the hydrodynamics code, approximations for the neutrino and gravity treatments, and rotation on the simulation of CCSNe in 3D. We use four different hydrodynamics codes in this work (ELEPHANT, FLASH, fGR1, and SPHYNX) in combination with two different neutrino treatments, the isotropic diffusion source approximation (IDSA) and two-moment M1, and three different gravity treatments: Newtonian, 1D General Relativity (GR) correction, and full GR). Additional parameters discussed in this study are the inclusion of neutrino-electron scattering via a parametrized deleptonization (PD) and the influence of rotation. The four codes compared in this work include Eulerian and fully Lagrangian (smoothed particle hydrodynamics) codes for the first time. They show agreement in the overall evolution of the collapse phase and early post-bounce within the range of 10% (20% in some cases). The comparison of the different neutrino treatments highlights the need to further investigate the antineutrino luminosities in IDSA, which tend to be relatively high. We also demonstrate the requirement for a more detailed heavy-lepton neutrino leakage. When comparing with a full GR code, including an M1 transport method, we confirm the influence of neutrino-electron scattering during the collapse phase, which is adequately captured by the PD scheme. Also, the effective GR potential reproduces the overall dynamic evolution correctly in all Newtonian codes. Additionally, we verify that rotation aids the shock expansion and estimate the overall angular momentum losses for each code in rotating scenarios.
△ Less
Submitted 18 December, 2018; v1 submitted 24 June, 2018;
originally announced June 2018.
-
Towards generating a new supernova equation of state: A systematic analysis of cold hybrid stars
Authors:
Oliver Heinimann,
Matthias Hempel,
Friedrich-Karl Thielemann
Abstract:
The hadron-quark phase transition in core-collapse supernovae (CCSNe) has the potential to trigger explosions in otherwise nonexploding models. However, those hybrid supernova equations of state (EOS) shown to trigger an explosion do not support the observational 2 M$_\odot$ neutron star maximum mass constraint. In this work, we analyze cold hybrid stars by the means of a systematic parameter scan…
▽ More
The hadron-quark phase transition in core-collapse supernovae (CCSNe) has the potential to trigger explosions in otherwise nonexploding models. However, those hybrid supernova equations of state (EOS) shown to trigger an explosion do not support the observational 2 M$_\odot$ neutron star maximum mass constraint. In this work, we analyze cold hybrid stars by the means of a systematic parameter scan for the phase transition properties, with the aim to develop a new hybrid supernova EOS. The hadronic phase is described with the state-of-the-art supernova EOS HS(DD2), and quark matter by an EOS with a constant speed of sound (CSS) of $c_{QM}^2=1/3$. We find promising cases which meet the 2 M$_\odot$ criterion and are interesting for CCSN explosions. We show that the very simple CSS EOS is transferable into the well-known thermodynamic bag model, important for future application in CCSN simulations. In the second part, the occurrence of reconfinement and multiple phase transitions is discussed. In the last part, the influence of hyperons in our parameter scan is studied. Including hyperons no change in the general behavior is found, except for overall lower maximum masses. In both cases (with and without hyperons) we find that quark matter with $c_{QM}^2=1/3$ can increase the maximum mass only if reconfinement is suppressed or if quark matter is absolutely stable.
△ Less
Submitted 22 November, 2016; v1 submitted 31 August, 2016;
originally announced August 2016.
-
Hot third family of compact stars and the possibility of core-collapse supernova explosions
Authors:
Matthias Hempel,
Oliver Heinimann,
Andrey Yudin,
Igor Iosilevskiy,
Matthias Liebendörfer,
Friedrich-Karl Thielemann
Abstract:
A phase transition to quark matter can lead to interesting phenomenological consequences in core-collapse supernovae, e.g., triggering an explosion in spherically symmetric models. However, until now this explosion mechanism was only shown to be working for equations of state that are in contradiction with recent pulsar mass measurements. Here, we identify that this explosion mechanism is related…
▽ More
A phase transition to quark matter can lead to interesting phenomenological consequences in core-collapse supernovae, e.g., triggering an explosion in spherically symmetric models. However, until now this explosion mechanism was only shown to be working for equations of state that are in contradiction with recent pulsar mass measurements. Here, we identify that this explosion mechanism is related to the existence of a third family of compact stars. For the equations of state investigated, the third family is only pronounced in the hot, early stages of the protocompact star and absent or negligibly small at zero temperature and thus represents a novel kind of third family. This interesting behavior is a result of unusual thermal properties induced by the phase transition, e.g., characterized by a decrease of temperature with increasing density for isentropes, and can be related to a negative slope of the phase transition line in the temperature-pressure phase diagram.
△ Less
Submitted 11 November, 2016; v1 submitted 20 November, 2015;
originally announced November 2015.