-
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
Authors:
Zhilin Wang,
Jaehun Jung,
Ximing Lu,
Shizhe Diao,
Ellie Evans,
Jiaqi Zeng,
Pavlo Molchanov,
Yejin Choi,
Jan Kautz,
Yi Dong
Abstract:
Evaluating progress in large language models (LLMs) is often constrained by the challenge of verifying responses, limiting assessments to tasks like mathematics, programming, and short-form question-answering. However, many real-world applications require evaluating LLMs in processing professional documents, synthesizing information, and generating comprehensive reports in response to user queries. We introduce ProfBench: a set of over 7000 response-criterion pairs as evaluated by human experts with professional knowledge across Physics PhD, Chemistry PhD, Finance MBA and Consulting MBA. We build robust and affordable LLM-Judges to evaluate ProfBench rubrics, by mitigating self-enhancement bias and reducing the cost of evaluation by 2-3 orders of magnitude, to make it fair and accessible to the broader community. Our findings reveal that ProfBench poses significant challenges even for state-of-the-art LLMs, with top-performing models like GPT-5-high achieving only 65.9% overall performance. Furthermore, we identify notable performance disparities between proprietary and open-weight models and provide insights into the role that extended thinking plays in addressing complex, professional-domain tasks. Data: https://huggingface.co/datasets/nvidia/ProfBench and Code: https://github.com/NVlabs/ProfBench
Submitted 21 October, 2025;
originally announced October 2025.
-
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
Authors:
Hanrong Ye,
Chao-Han Huck Yang,
Arushi Goel,
Wei Huang,
Ligeng Zhu,
Yuanhang Su,
Sean Lin,
An-Chieh Cheng,
Zhen Wan,
Jinchuan Tian,
Yuming Lou,
Dong Yang,
Zhijian Liu,
Yukang Chen,
Ambrish Dantrey,
Ehsan Jahangiri,
Sreyan Ghosh,
Daguang Xu,
Ehsan Hosseini-Asl,
Danial Mohseni Taheri,
Vidya Murali,
Sifei Liu,
Jason Lu,
Oluwatobi Olabiyi,
Frank Wang
, et al. (7 additional authors not shown)
Abstract:
Advancing machine intelligence requires developing the ability to perceive across multiple modalities, much as humans sense the world. We introduce OmniVinci, an initiative to build a strong, open-source, omni-modal LLM. We carefully study the design choices across model architecture and data curation. For model architecture, we present three key innovations: (i) OmniAlignNet for strengthening alignment between vision and audio embeddings in a shared omni-modal latent space; (ii) Temporal Embedding Grouping for capturing relative temporal alignment between vision and audio signals; and (iii) Constrained Rotary Time Embedding for encoding absolute temporal information in omni-modal embeddings. We introduce a curation and synthesis pipeline that generates 24M single-modal and omni-modal conversations. We find that modalities reinforce one another in both perception and reasoning. Our model, OmniVinci, outperforms Qwen2.5-Omni with +19.05 on DailyOmni (cross-modal understanding), +1.7 on MMAR (audio), and +3.9 on Video-MME (vision), while using just 0.2T training tokens - a 6 times reduction compared to Qwen2.5-Omni's 1.2T. We finally demonstrate omni-modal advantages in downstream applications spanning robotics, medical AI, and smart factory.
Submitted 17 October, 2025;
originally announced October 2025.
-
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning
Authors:
Shih-Yang Liu,
Xin Dong,
Ximing Lu,
Shizhe Diao,
Mingjie Liu,
Min-Hung Chen,
Hongxu Yin,
Yu-Chiang Frank Wang,
Kwang-Ting Cheng,
Yejin Choi,
Jan Kautz,
Pavlo Molchanov
Abstract:
Reasoning language models such as OpenAI-o1, DeepSeek-R1, and Qwen achieve strong performance via extended chains of thought but often generate unnecessarily long outputs. Maximizing intelligence per token--accuracy relative to response length--remains an open problem. We revisit reinforcement learning (RL) with the simplest length penalty--truncation--and show that accuracy degradation arises not from the lack of sophisticated penalties but from inadequate RL optimization. We identify three key challenges: (i) large bias in advantage estimation, (ii) entropy collapse, and (iii) sparse reward signal. We address them with Doing Length pEnalty Right (DLER), a training recipe combining batch-wise reward normalization, higher clipping, dynamic sampling, and a simple truncation length penalty. DLER achieves state-of-the-art accuracy--efficiency trade-offs, cutting output length by over 70 percent while surpassing all previous baseline accuracy. It also improves test-time scaling: compared to DeepSeek-R1-7B, DLER-7B generates multiple concise responses in parallel with 28 percent higher accuracy and lower latency. We further introduce Difficulty-Aware DLER, which adaptively tightens truncation on easier questions for additional efficiency gains. We also propose an update-selective merging method that preserves baseline accuracy while retaining the concise reasoning ability of the DLER model, which is useful for scenarios where RL training data is scarce.
Submitted 16 October, 2025;
originally announced October 2025.
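The two simplest ingredients named in the DLER abstract above, a truncation length penalty and batch-wise reward normalization, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the 0/1 correctness reward, the 4000-token budget, and the rule that an over-budget response earns zero reward are all hypothetical choices made for the example.

```python
from statistics import mean, pstdev


def truncated_reward(is_correct: bool, num_tokens: int, max_tokens: int) -> float:
    """Truncation as a length penalty (assumed form): a response that
    exceeds the token budget is cut off and therefore earns no reward."""
    if num_tokens > max_tokens:
        return 0.0
    return 1.0 if is_correct else 0.0


def batch_normalize(rewards: list[float], eps: float = 1e-8) -> list[float]:
    """Normalize rewards with the mean and standard deviation of the
    whole batch (assumed form of batch-wise reward normalization)."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical batch of (is_correct, num_tokens) rollouts.
batch = [(True, 900), (True, 4500), (False, 1200), (True, 1500)]
rewards = [truncated_reward(c, n, max_tokens=4000) for c, n in batch]
advantages = batch_normalize(rewards)
```

In this toy batch the second rollout is correct but over budget, so it is scored like an incorrect answer and receives a negative normalized advantage.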
-
Measurement of $C\!P$ asymmetry in $D^0 \to K^0_{\rm S} K^0_{\rm S}$ decays with the LHCb Upgrade I detector
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
M. Akthar,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1187 additional authors not shown)
Abstract:
A measurement of $C\!P$ asymmetry in $D^0 \to K^0_{\rm S} K^0_{\rm S}$ decays is reported, based on a data sample of proton-proton collisions collected with the LHCb Upgrade I detector in 2024 at a centre-of-mass energy of $13.6\,$TeV, corresponding to an integrated luminosity of $6.2\,\mathrm{fb}^{-1}$. The $D^0 \to K^0_{\rm S} π^+ π^-$ decay is used as calibration channel to cancel residual detection and production asymmetries. The time-integrated $C\!P$ asymmetry for the $D^0 \to K^0_{\rm S} K^0_{\rm S}$ mode is measured to be $$ {\cal A}^{C\!P} (D^0 \to K^0_{\rm S} K^0_{\rm S}) = (1.86 \pm 1.04\pm 0.41)\%, $$ where the first uncertainty is statistical, and the second is systematic. This is the most precise determination of this quantity to date.
Submitted 16 October, 2025;
originally announced October 2025.
-
Searches for $B^0\to K^+π^-τ^+τ^-$ and $B_s^0\to K^+K^-τ^+τ^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
M. Akthar,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1182 additional authors not shown)
Abstract:
The first searches for $B^0\to K^+π^-τ^+τ^-$ and $B^0_s\to K^+K^-τ^+τ^-$ decays at the LHCb experiment are conducted with $pp$ collision data corresponding to an integrated luminosity of $5.4\textrm{ fb}^{-1}$. The tau leptons are reconstructed using the $τ^+\to μ^+\overlineν_τν_μ$ decay and the results are presented in bins of $K^+π^-$ or $K^+K^-$ mass. No signal is observed and upper limits are set on the branching fractions. The searches result in the first upper limits for $B^0\to K^+π^-τ^+τ^-$ decays outside the $K^*(892)^0$ region in $K^+π^-$ mass and the first limits for $B^0_s\to K^+K^-τ^+τ^-$ decays. The searches are recast into limits on the decays $B^0\to K^*(892)^0τ^+τ^-$ and $B^0_s\to φ(1020)τ^+τ^-$, yielding $2.8\times10^{-4}$ ($2.5\times10^{-4}$) and $4.7\times10^{-4}$ ($4.1\times10^{-4}$) at the $95\%$ ($90\%$) confidence level, respectively. For the decay $B^0\to K^*(892)^0τ^+τ^-$, this result improves on the current best upper limit by an order of magnitude.
Submitted 15 October, 2025;
originally announced October 2025.
-
Study of charm mixing and CP violation with $D^0\to K^\pmπ^\mpπ^\pmπ^\mp$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1186 additional authors not shown)
Abstract:
A study of charm mixing and CP violation in $D^0\to K^\pmπ^\mpπ^\pmπ^\mp$ decays is performed using data collected by the LHCb experiment in proton-proton collisions from 2015 to 2018, corresponding to an integrated luminosity of 6 $\text{fb}^{-1}$. The ratio of promptly produced $D^0\to K^+π^- π^+π^-$ to $D^0\to K^-π^+ π^-π^+$ decay rates is measured as a function of $D^0$ decay time, both inclusive over phase space and in bins of phase space. Taking external inputs for the $D^0 -\overline{D}^0$ mixing parameters $x$ and $y$ allows constraints to be obtained on the hadronic parameters of the charm decay. When combined with previous measurements from charm-threshold experiments and at LHCb, improved knowledge is obtained for these parameters, which is valuable for studies of the angle $γ$ of the Unitarity Triangle. An alternative analysis is also performed, in which external inputs are taken for the hadronic parameters, and the mixing parameters are determined, including $Δx$ and $Δy$, which are nonzero in the presence of CP violation. It is found that $x=\left(0.85^{+0.15}_{-0.24}\right)\%$, $y=\left( 0.21^{+0.29}_{-0.27} \right)\%$, $Δx=\left( -0.02\pm 0.04 \right)\%$ and $Δy=\left( 0.02^{+0.04}_{-0.03} \right)\%$. These results are consistent with previous measurements and the hypothesis of CP conservation.
Submitted 6 October, 2025;
originally announced October 2025.
-
RLP: Reinforcement as a Pretraining Objective
Authors:
Ali Hatamizadeh,
Syeda Nahida Akter,
Shrimai Prabhumoye,
Jan Kautz,
Mostofa Patwary,
Mohammad Shoeybi,
Bryan Catanzaro,
Yejin Choi
Abstract:
The dominant paradigm for training large reasoning models starts with pre-training using next-token prediction loss on vast amounts of data. Reinforcement learning, while powerful in scaling reasoning, is introduced only as the very last phase of post-training, preceded by supervised fine-tuning. While dominant, is this an optimal way of training? In this paper, we present RLP, an information-driven reinforcement pretraining objective, that brings the core spirit of reinforcement learning -- exploration -- to the last phase of pretraining. The key idea is to treat chain-of-thought as an exploratory action, with rewards computed based on the information gain it provides for predicting future tokens. This training objective essentially encourages the model to think for itself before predicting what comes next, thus teaching an independent thinking behavior earlier in the pretraining. More concretely, the reward signal measures the increase in log-likelihood of the next token when conditioning on both context and a sampled reasoning chain, compared to conditioning on context alone. This approach yields a verifier-free dense reward signal, allowing for efficient training for the full document stream during pretraining. Specifically, RLP reframes reinforcement learning for reasoning as a pretraining objective on ordinary text, bridging the gap between next-token prediction and the emergence of useful chain-of-thought reasoning. Pretraining with RLP on Qwen3-1.7B-Base lifts the overall average across an eight-benchmark math-and-science suite by 19%. With identical post-training, the gains compound, with the largest improvements on reasoning-heavy tasks such as AIME25 and MMLU-Pro. Applying RLP to the hybrid Nemotron-Nano-12B-v2 increases the overall average from 42.81% to 61.32% and raises the average on scientific reasoning by 23%, demonstrating scalability across architectures and model sizes.
Submitted 26 September, 2025;
originally announced October 2025.
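The reward described in the RLP abstract above, the increase in next-token log-likelihood when conditioning on a sampled reasoning chain in addition to the context, can be written as a one-line difference. The sketch below is only an illustration of that sentence: the function name and the example probabilities are hypothetical, and the paper may define the no-thought baseline differently than a plain second forward pass.

```python
import math


def information_gain_reward(logp_with_thought: float,
                            logp_without_thought: float) -> float:
    """Reward = log p(next | context, thought) - log p(next | context).

    Positive when the sampled reasoning chain makes the observed next
    token more likely, negative when it makes it less likely.
    """
    return logp_with_thought - logp_without_thought


# Hypothetical next-token probabilities, for illustration only.
p_with_thought = 0.60      # model conditioned on context + reasoning chain
p_without_thought = 0.30   # model conditioned on context alone

reward = information_gain_reward(math.log(p_with_thought),
                                 math.log(p_without_thought))
```

Because the signal is computed from the model's own likelihoods on ordinary text, it needs no external verifier and is defined at every token position, which matches the abstract's description of a verifier-free dense reward.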
-
BroRL: Scaling Reinforcement Learning via Broadened Exploration
Authors:
Jian Hu,
Mingjie Liu,
Ximing Lu,
Fang Wu,
Zaid Harchaoui,
Shizhe Diao,
Yejin Choi,
Pavlo Molchanov,
Jun Yang,
Jan Kautz,
Yi Dong
Abstract:
Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a key ingredient for unlocking complex reasoning capabilities in large language models. Recent work ProRL has shown promise in scaling RL by increasing the number of training steps. However, performance plateaus after thousands of steps, with clear diminishing returns from allocating more computation to additional training. In this work, we investigate a complementary paradigm for scaling RL, BroRL, increasing the number of rollouts per example to hundreds to exhaustively Broaden exploration, which yields continuous performance gains beyond the saturation point observed in ProRL when scaling the number of training steps. Our approach is motivated by a mass balance equation analysis allowing us to characterize the rate of change in probability mass for correct and incorrect tokens during the reinforcement process. We show that under a one-step RL assumption, sampled rollout tokens always contribute to correct-mass expansion, while unsampled tokens outside rollouts may lead to gains or losses depending on their distribution and the net reward balance. Importantly, as the number of rollouts per example N increases, the effect of unsampled terms diminishes, ensuring overall correct-mass expansion. To validate our theoretical analysis, we conduct simulations under more relaxed conditions and find that a sufficiently large rollout size N, corresponding to ample exploration, guarantees an increase in the probability mass of all correct tokens. Empirically, BroRL revives models saturated after 3K ProRL training steps and demonstrates robust, continuous improvement, achieving state-of-the-art results for the 1.5B model across diverse benchmarks.
Submitted 1 October, 2025;
originally announced October 2025.
-
Measurement of the $W \to μν_μ$ cross-sections as a function of the muon transverse momentum in $pp$ collisions at 5.02 TeV
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1184 additional authors not shown)
Abstract:
The $pp \to W^{\pm} (\to μ^{\pm} ν_μ) X$ cross-sections are measured at a proton-proton centre-of-mass energy $\sqrt{s} = 5.02$ TeV using a dataset corresponding to an integrated luminosity of 100 pb$^{-1}$ recorded by the LHCb experiment. Considering muons in the pseudorapidity range $2.2 < η< 4.4$, the cross-sections are measured differentially in twelve intervals of muon transverse momentum between $28 < p_\mathrm{T} < 52$ GeV. Integrated over $p_\mathrm{T}$, the measured cross-sections are \begin{align*} σ_{W^+ \to μ^+ ν_μ} &= 300.9 \pm 2.4 \pm 3.8 \pm 6.0~\text{pb}, \\ σ_{W^- \to μ^- \barν_μ} &= 236.9 \pm 2.1 \pm 2.7 \pm 4.7~\text{pb}, \end{align*} where the first uncertainties are statistical, the second are systematic, and the third are associated with the luminosity calibration. These integrated results are consistent with theoretical predictions.
This analysis introduces a new method to determine the $W$-boson mass using the measured differential cross-sections corrected for detector effects. The measurement is performed on this statistically limited dataset as a proof of principle and yields \begin{align*} m_W = 80369 \pm 130 \pm 33~\text{MeV}, \end{align*} where the first uncertainty is experimental and the second is theoretical.
Submitted 23 September, 2025;
originally announced September 2025.
-
First evidence of $CP$ violation in beauty baryon to charmonium decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1172 additional authors not shown)
Abstract:
A study of the difference in the $CP$ asymmetries between $Λ^0_b \rightarrow J / ψp π^-$ and $Λ^0_b \rightarrow J / ψp K^-$ decays, $Δ{\cal A}_{CP}$, is performed using proton-proton collision data collected by the LHCb experiment in the years 2015--2018, corresponding to an integrated luminosity of $6 {\rm fb}^{-1}$. This quantity is measured to be $ Δ{\cal A}_{CP}=(4.03\pm 1.18\pm 0.23)\%$, where the first uncertainty is statistical and the second is systematic. When combined with the previous LHCb result, a value of $Δ{\cal A}_{CP} = (4.31 \pm 1.06 \pm 0.28)\%$ is obtained, corresponding to a significance of $3.9σ$ against the $CP$ symmetry hypothesis. Studies of triple-product asymmetries, which provide an additional probe of $CP$ violation, show no significant deviation from $CP$ symmetry.
Submitted 19 September, 2025;
originally announced September 2025.
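As a rough cross-check of the significance quoted in the abstract above, the combined value can be divided by its statistical and systematic uncertainties added in quadrature. This is a naive Gaussian estimate that assumes the two uncertainties are independent; the collaboration's quoted significance may come from a fuller likelihood treatment, so agreement here is only indicative. The variable names are hypothetical.

```python
import math

# Combined result quoted in the abstract, in percent.
value = 4.31
stat_unc = 1.06
syst_unc = 0.28

# Assumption: independent uncertainties, combined in quadrature.
total_unc = math.hypot(stat_unc, syst_unc)

# Naive Gaussian significance against the CP-symmetry hypothesis (value = 0).
significance = value / total_unc
```

Under these assumptions the estimate rounds to the 3.9 standard deviations stated in the abstract.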
-
Observation of $B_c^+ \to D h^+ h^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1184 additional authors not shown)
Abstract:
Searches are presented for $B_{c}^{+} \to D h^+ h^-$ decays, where $D$ is a charmed meson and $h^{\pm}$ is a charged pion or kaon, using $pp$ collision data collected by the LHCb experiment corresponding to an integrated luminosity of $9~\text{fb}^{-1}$. The decays $B_c^+\to D^+ K^+π^-$, $B_c^+\to D^{*+} K^+π^-$ and $B_c^+\to D_s^+ K^+ K^-$ are observed for the first time. Their branching fractions, expressed as ratios relative to that of the $B_c^+\to B_s^0π^+$ decay, are determined to be \begin{align*} \mathcal{R}(B_c^+\to D^+ K^+π^-) =(1.96 \pm 0.23\pm 0.08 \pm 0.10)\times 10^{-3},&\\ \mathcal{R}(B_c^+\to D^{*+} K^+π^-) =(3.67 \pm 0.55 \pm 0.24\pm 0.20)\times 10^{-3},&\\ \mathcal{R}(B_c^+\to D_s^+ K^+ K^-) =(1.61 \pm 0.35\pm 0.13\pm 0.07)\times 10^{-3}, \end{align*} where the first uncertainty is statistical, the second is systematic, and the third is due to the limited precision on the $D$-meson branching fractions. The decay channels proceed primarily through excited $K^0$ or $D^0$ resonances or $φ$ mesons, and open a new avenue for studies of charge-parity violation in beauty mesons.
Submitted 19 September, 2025;
originally announced September 2025.
-
A model-independent measurement of the CKM angle $γ$ in the decays $B^\pm\to[K^+K^-π^+π^-]_D h^\pm$ and $B^\pm\to[π^+π^-π^+π^-]_D h^\pm$ ($h = K, π$)
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1163 additional authors not shown)
Abstract:
A model-independent determination of the CKM angle $γ$ is presented, using the $B^\pm\to[K^+K^-π^+π^-]_D h^\pm$ and $B^\pm\to[π^+π^-π^+π^-]_D h^\pm$ decays, with $h=K,π$. This measurement is the first phase-space-binned study of these decay modes, and uses a sample of proton-proton collision data collected by the LHCb experiment, corresponding to an integrated luminosity of $9$fb$^{-1}$. The phase-space bins are optimised for sensitivity to $γ$, and in each bin external inputs from the BESIII experiment are used to constrain the charm strong-phase parameters. The result of this binned analysis is $γ= (53.9_{-8.9}^{+9.5})^\circ$, where the uncertainty includes both statistical and systematic contributions. Furthermore, when combining with existing phase-space-integrated measurements of the same decay modes, a value of $γ= (52.6_{-6.4}^{+8.5})^\circ$ is obtained, which is one of the most precise determinations of $γ$ to date.
Submitted 18 September, 2025;
originally announced September 2025.
-
3D Aware Region Prompted Vision Language Model
Authors:
An-Chieh Cheng,
Yang Fu,
Yukang Chen,
Zhijian Liu,
Xiaolong Li,
Subhashree Radhakrishnan,
Song Han,
Yao Lu,
Jan Kautz,
Pavlo Molchanov,
Hongxu Yin,
Xiaolong Wang,
Sifei Liu
Abstract:
We present a Spatial Region 3D (SR-3D) aware vision-language model that connects single-view 2D images and multi-view 3D data through a shared visual token space. SR-3D supports flexible region prompting, allowing users to annotate regions with bounding boxes, segmentation masks on any frame, or directly in 3D, without the need for exhaustive multi-frame labeling. We achieve this by enriching 2D visual features with 3D positional embeddings, which allows the 3D model to draw upon strong 2D priors for more accurate spatial reasoning across frames, even when objects of interest do not co-occur within the same view. Extensive experiments on both general 2D vision language and specialized 3D spatial benchmarks demonstrate that SR-3D achieves state-of-the-art performance, underscoring its effectiveness for unifying 2D and 3D representation space on scene understanding. Moreover, we observe applicability to in-the-wild videos without sensory 3D inputs or ground-truth 3D annotations, where SR-3D accurately infers spatial relationships and metric measurements.
Submitted 16 September, 2025;
originally announced September 2025.
-
Measurement of the branching fraction of the $Λ_b^0\to J/ψΛ$ decay and isospin asymmetry of $B\to J/ψK$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
M. Akthar,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1191 additional authors not shown)
Abstract:
This paper describes a measurement of the $Λ_b^0\to J/ψΛ$ branching fraction using data collected with the LHCb experiment in proton-proton collisions from 2016 to 2018. The dataset corresponds to an integrated luminosity of 5.4$\,\text{fb}^{-1}$. The branching fraction is determined relative to that of $B^0\to J/ψK^0_\text{S}$ decays, $\frac{\mathcal{B}(Λ_b^0\to J/ψΛ)}{\mathcal{B}(B^0\to J/ψK^0_\text{S})} = 0.750 \pm 0.005 \pm 0.022 \pm 0.005 \pm 0.062\,,$ yielding $\mathcal{B}(Λ_b^0\to J/ψΛ) = (3.34 \pm 0.02 \pm 0.10 \pm 0.08 \pm 0.28)\times 10^{-4}$, where the first uncertainty is statistical, the second systematic, the third due to external inputs on branching fractions and the fourth due to the ratio of $Λ_b^0$ baryon and $B^0$ meson hadronisation fractions. In addition, the isospin asymmetry between the rates of $B^0\to J/ψK^0_\text{S}$ and $B^+\to J/ψK^+$ decays is measured to be $A_{\rm I} = -0.0135 \pm 0.0004 \pm 0.0133$, where the first uncertainty is statistical and the second systematic.
Submitted 22 September, 2025; v1 submitted 16 September, 2025;
originally announced September 2025.
-
Amplitude analysis of $B^0 \rightarrow η_c(1S) K^+ π^- $ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1184 additional authors not shown)
Abstract:
An amplitude analysis of the $B^0 \rightarrow η_{c}(1S) K^+ π^- $ decays with $η_{c}(1S) \to p \bar{p}$ is performed using a sample corresponding to an integrated luminosity of 9$\text{fb}^{-1}$ of $pp$ collision data collected by the LHCb detector at centre-of-mass energies of $\sqrt{s}$ = 7, 8 and 13TeV. The data are described with a model including only intermediate contributions from known $K^{0\star}$ resonances. Evidence for an exotic resonance in the $η_{c}(1S) π^{-} $ system, reported in a previous analysis of this decay channel, is not confirmed. The inclusive branching fraction of the $B^0 \rightarrow η_{c}(1S) K^+ π^- $ decays is measured to be \begin{align*} \mathcal{B}(B^0 \rightarrow η_{c}(1S) K^+ π^- ) = (5.82 \pm 0.20 \pm 0.23 \pm 0.55) \times 10^{-4}, \end{align*} where the first uncertainty is statistical, the second systematic, and the third arises from the limited knowledge of external branching fractions.
Submitted 3 September, 2025;
originally announced September 2025.
-
Inclusive $B$-meson flavour-tagging algorithm at LHCb
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1178 additional authors not shown)
Abstract:
A new algorithm is developed to identify the flavour of neutral $B$ mesons at production in $pp$ collisions by utilising all tracks from the hadronisation process. The algorithm is calibrated separately for $B^0$ and $B^{0}_{s}$ mesons using $B^{0}\to J/ψK^{+}π^-$ and $B^{0}_{s}\to D_{s}^{-}π^+$ decays from $pp$ collision data collected by the LHCb experiment at a centre-of-mass energy of 13\,TeV. This new algorithm improves the tagging power by 35\% for $B^{0}$ mesons and 20\% for $B^{0}_{s}$ mesons when compared to the combined performance of the existing LHCb flavour-tagging algorithms.
Submitted 27 August, 2025;
originally announced August 2025.
-
Measurement of branching fractions and $CP$ asymmetries in $\mathitΛ_b^0(\mathitΞ_b^0)\!\to pK_{\mathrm S}^0h^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1159 additional authors not shown)
Abstract:
A study of $\mathitΛ_b^0$ and $\mathitΞ_b^0$ baryon decays to the final states $pK_{\mathrm S}^0π^-$ and $pK_{\mathrm S}^0K^-$ is performed using $pp$ collision data collected by the LHCb experiment, corresponding to an integrated luminosity of $9\,\mathrm{fb}^{-1}$. The decays $\mathitΛ_b^0\!\to pK_{\mathrm S}^0K^-$ and $\mathitΞ_b^0\!\to pK_{\mathrm S}^0K^-$ are observed for the first time, with significances reaching eight standard deviations. The branching fractions and integrated $CP$ asymmetries are measured for the $\mathitΛ_b^0\!\to pK_{\mathrm S}^0π^-$, $\mathitΛ_b^0\!\to pK_{\mathrm S}^0K^-$, and $\mathitΞ_b^0\!\to pK_{\mathrm S}^0K^-$ decays. For the decay $\mathitΛ_b^0\!\to pK_{\mathrm S}^0π^-$, the $CP$ asymmetries are measured in different regions of the Dalitz plot. No evidence of $CP$ violation is observed.
Submitted 25 August, 2025;
originally announced August 2025.
-
First observation of the charmless baryonic decay $B^+\to\barΛp\bar{p}p$
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1184 additional authors not shown)
Abstract:
A search for the charmless baryonic decay $B^+\to \barΛ p\bar{p}p$ is performed using proton-proton collision data recorded by the LHCb experiment, corresponding to an integrated luminosity of 5.4~$\text{fb}^{-1}$. The branching fraction for this decay is measured for the first time relative to that of the topologically similar decay $B^+\to J/ψK^+$, with $J/ψ\to \barΛ p K^-$. The branching fraction is measured to be $\mathcal{B}(B^+\to \barΛ p\bar{p}p) = (2.08 \pm 0.34 \pm 0.12 \pm 0.26) \times 10^{-7}$, where the first uncertainty is statistical, the second is systematic, and the third arises from the uncertainty in the normalization channel branching fraction. The $CP$ asymmetry is measured to be $\mathcal{A}_{CP}=(5.4\pm 15.6\pm 2.4)\%$, where the uncertainties are statistical and systematic. The background-subtracted invariant-mass distributions of $\barΛp$ and $\bar{p}p$ pairs exhibit pronounced enhancements at both kinematic thresholds, in contrast to a uniform phase-space distribution.
Submitted 22 August, 2025;
originally announced August 2025.
-
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Authors:
NVIDIA,
Aarti Basant,
Abhijit Khairnar,
Abhijit Paithankar,
Abhinav Khattar,
Adithya Renduchintala,
Aditya Malte,
Akhiad Bercovich,
Akshay Hazare,
Alejandra Rico,
Aleksander Ficek,
Alex Kondratenko,
Alex Shaposhnikov,
Alexander Bukharin,
Ali Taghibakhshi,
Amelia Barton,
Ameya Sunil Mahabaleshwarkar,
Amy Shen,
Andrew Tao,
Ann Guan,
Anna Shors,
Anubhav Mandarwal,
Arham Mehta,
Arun Venkatesan
, et al. (192 additional authors not shown)
Abstract:
We introduce Nemotron-Nano-9B-v2, a hybrid Mamba-Transformer language model designed to increase throughput for reasoning workloads while achieving state-of-the-art accuracy compared to similarly-sized models. Nemotron-Nano-9B-v2 builds on the Nemotron-H architecture, in which the majority of the self-attention layers in the common Transformer architecture are replaced with Mamba-2 layers, to achieve improved inference speed when generating the long thinking traces needed for reasoning. We create Nemotron-Nano-9B-v2 by first pre-training a 12-billion-parameter model (Nemotron-Nano-12B-v2-Base) on 20 trillion tokens using an FP8 training recipe. After aligning Nemotron-Nano-12B-v2-Base, we employ the Minitron strategy to compress and distill the model with the goal of enabling inference on up to 128k tokens on a single NVIDIA A10G GPU (22 GiB of memory, bfloat16 precision). Compared to existing similarly-sized models (e.g., Qwen3-8B), we show that Nemotron-Nano-9B-v2 achieves on-par or better accuracy on reasoning benchmarks while achieving up to 6x higher inference throughput in reasoning settings like 8k input and 16k output tokens. We are releasing Nemotron-Nano-9B-v2, Nemotron-Nano-12B-v2-Base, and Nemotron-Nano-9B-v2-Base checkpoints along with the majority of our pre- and post-training datasets on Hugging Face.
Submitted 2 September, 2025; v1 submitted 20 August, 2025;
originally announced August 2025.
-
First observation of $CP$ violation and measurement of polarization in $B^+\toρ(770)^0 K^*(892)^+$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1182 additional authors not shown)
Abstract:
An amplitude analysis of the $B^+\to(π^+π^-)(K^0_{\mathrm{S}}π^+)$ decay is performed in the mass regions $0.30 < m_{π^+π^-} < 1.10\,\mathrm{GeV}/c^2$ and $0.75 < m_{K^0_{\mathrm{S}}π^+} < 1.20\,\mathrm{GeV}/c^2$, using $pp$ collision data recorded with the LHCb detector corresponding to an integrated luminosity of $9\,\mathrm{fb}^{-1}$. The polarization fractions and $CP$ asymmetries for $B^+\toρ(770)^0K^*(892)^+$ decays are measured. Violation of the $CP$ symmetry in the decay $B^+\toρ(770)^0K^*(892)^+$ is observed for the first time, with a significance exceeding nine standard deviations. The $CP$ asymmetry is measured to be ${\cal A}_{CP} = 0.507 \pm 0.062\ \text{(stat)} \pm 0.017\ \text{(syst)}$ and the $CP$-averaged longitudinal polarization fraction to be $f_L = 0.720 \pm 0.028\ \text{(stat)} \pm 0.009\ \text{(syst)}$. The measurements help to shed light on the polarization puzzle of $B$ mesons decaying to two vector mesons.
Submitted 19 August, 2025;
originally announced August 2025.
-
First Beam Neutrinos Observed with an LAPPD in the ANNIE Experiment
Authors:
B. W. Adams,
S. Abubakar,
D. Ajana,
M. A. Aman,
M. Ascencio-Sosa,
A. Augusthy,
Z. Bagdasarian,
J. Beacom,
M. Bergevin,
D. Bick,
M. Breisch,
E. Brunner-Huber,
G. Caceres Vera,
S. Dazeley,
S. Deng,
S. Donnelly,
S. Doran,
E. Drakopoulou,
S. Edayath,
R. Edwards,
J. Eisch,
Y. Feng,
V. Fischer,
R. Foster,
S. Gardiner
, et al. (48 additional authors not shown)
Abstract:
The Accelerator Neutrino Neutron Interaction Experiment (ANNIE) probes the physics of neutrino-nucleus interactions in a gadolinium-loaded water (Gd-water) target while serving as a flexible testbed for advanced next-generation optical neutrino detection technologies. These advanced technologies include novel detection media (particularly Gd-water and hybrid Cherenkov-scintillation through water-based liquid scintillator) and novel photosensors. In this paper we demonstrate the first implementation of a fully-integrated setup for Large Area Picosecond PhotoDetectors (LAPPDs) in a neutrino experiment. Details are presented regarding the design, commissioning, and deployment of an LAPPD and the supporting systems. We also present the first neutrino interactions ever observed with an LAPPD.
Submitted 14 August, 2025;
originally announced August 2025.
-
HumanOLAT: A Large-Scale Dataset for Full-Body Human Relighting and Novel-View Synthesis
Authors:
Timo Teufel,
Pulkit Gera,
Xilong Zhou,
Umar Iqbal,
Pramod Rao,
Jan Kautz,
Vladislav Golyanik,
Christian Theobalt
Abstract:
Simultaneous relighting and novel-view rendering of digital human representations is an important yet challenging task with numerous applications. Progress in this area has been significantly limited due to the lack of publicly available, high-quality datasets, especially for full-body human captures. To address this critical gap, we introduce the HumanOLAT dataset, the first publicly accessible large-scale dataset of multi-view One-Light-at-a-Time (OLAT) captures of full-body humans. The dataset includes HDR RGB frames under various illuminations, such as white light, environment maps, color gradients and fine-grained OLAT illuminations. Our evaluations of state-of-the-art relighting and novel-view synthesis methods underscore both the dataset's value and the significant challenges still present in modeling complex human-centric appearance and lighting interactions. We believe HumanOLAT will significantly facilitate future research, enabling rigorous benchmarking and advancements in both general and human-specific relighting and rendering techniques.
Submitted 12 August, 2025;
originally announced August 2025.
-
Deuteron identification via time of flight with LHCb
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
M. Akthar,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1182 additional authors not shown)
Abstract:
It is shown that the timing capabilities of the LHCb detector operated during the LHC Run 2 can be used to identify light ion particles with momenta of a few GeV/$c$. This is achieved by estimating the particle time of flight through a newly developed technique. A dedicated reconstruction procedure and a neural-network-based estimator of the particle speed have been developed to enable deuteron identification by suppressing the abundant background from lighter particles. The performance of the identification procedure is demonstrated in a sample of proton-helium collisions at $\sqrt{s_{\text{NN}}}=110$ GeV, where the production of deuteron and triton particles is observed. This novel approach opens the way to study deuteron and antideuteron production for different collision systems at different energy scales, exploiting the rich dataset collected by the LHCb experiment.
Submitted 8 August, 2025;
originally announced August 2025.
-
Measurement of transverse $Λ$ and $\barΛ$ hyperon polarization in $p$Pb collisions at $\sqrt{s_{NN}} = 5.02$ TeV
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1128 additional authors not shown)
Abstract:
The transverse polarization of $Λ$ and $\barΛ$ hyperons is measured in $p$Pb collisions collected by the LHCb experiment at a nucleon-nucleon center-of-mass energy of $5.02 $ TeV. The polarization is averaged over hyperon transverse momentum in the range $0.15 < p_{T} < 6.00 $ GeV/$c$, and Feynman-$x$ in the ranges $0.005 < x_{F} < 0.040$ (forward region) and $-0.10 < x_{F} < -0.01$ (backward region) defined relative to the proton beam direction. The transverse polarization is found to be compatible with zero for both $Λ$ and $\barΛ$ hyperons. The results are also measured as a function of $p_{T}$ and $x_{F}$ with no significant dependence on these variables observed. The results are compared with previous experimental measurements at different center-of-mass energies and collision environments.
Submitted 3 August, 2025;
originally announced August 2025.
-
Amplitude analysis of the $Ξ^+_c\to pK^-π^+$ decay and $Ξ^+_c$ baryon polarization measurement in semileptonic beauty-hadron decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1123 additional authors not shown)
Abstract:
An amplitude analysis of the $Ξ^+_c\to pK^-π^+$ decay together with a measurement of the $Ξ^+_c$ polarization vector in semileptonic beauty-hadron decays is presented. The analysis is performed using proton-proton collision data collected by the LHCb experiment, corresponding to an integrated luminosity of 9 ${\rm fb}^{-1}$. An amplitude model is developed and the resonance fractions as well as two- and three-body decay parameters are reported. A sizeable $Ξ^+_c$ polarization is found. A large sensitivity of the $Ξ^+_c\to pK^-π^+$ decay to the polarization is seen, making the amplitude model suitable for $Ξ^+_c$ polarization measurements in other systems.
Submitted 1 August, 2025;
originally announced August 2025.
-
Search for the decay $B^0 \rightarrow φφ$
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1159 additional authors not shown)
Abstract:
A search for the decay $B^0 \rightarrow φφ$ is made using $pp$ collision data collected with the LHCb detector at centre-of-mass energies of 7, 8 and 13 TeV, corresponding to an integrated luminosity of $9$ fb$^{-1}$. No significant signal is observed, and an upper limit on the branching fraction of $1.3~(1.4)\times 10^{-8}$ at $90 ~(95) \%$ confidence level is set. This result supersedes the previous LHCb study and improves the upper limit by a factor of two.
Submitted 28 July, 2025;
originally announced July 2025.
-
Measurement of the $B^0\rightarrow ρ(770)^{0}γ$ branching fraction
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1159 additional authors not shown)
Abstract:
The ratio between the branching fractions of the $B^0\rightarrow ρ(770)^{0}γ$ and $B^{0}\rightarrow K^{*}(892)^{0}γ$ decays is measured with proton-proton collision data collected by the LHCb experiment at centre-of-mass energies of 7, 8, and 13 TeV, corresponding to an integrated luminosity of 9 fb${}^{-1}$. The measured value is \begin{equation*} \frac{{\cal B}(B^0\rightarrow ρ(770)^{0}γ)}{{\cal B}(B^0\rightarrow K^{*}(892)^{0}γ)}=0.0189\pm 0.0007\pm 0.0005, \end{equation*} where the first uncertainty is statistical and the second systematic. The branching fraction for $B^0\rightarrow ρ(770)^{0}γ$ decays is hence obtained as \begin{equation*} {\cal{B}}(B^0\rightarrow ρ(770)^{0}γ) =(7.9\pm 0.3\pm 0.2\pm 0.2) \times 10^{-7}, \end{equation*} where the last uncertainty is due to the branching fraction of the normalisation mode. This result assumes that both the $ρ(770)^0$ and $K^{*}(892)^0$ decays saturate the dihadron mass spectra considered in the analysis. It is consistent with the current world-average value and by far the most precise measurement to date.
Submitted 18 July, 2025;
originally announced July 2025.
-
Search for resonances decaying to photon pairs with masses between 4.9 and 19.4 GeV
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1142 additional authors not shown)
Abstract:
A search is presented for axion-like particles (ALPs) with masses between 4.9 and 19.4 GeV decaying to a pair of photons, using proton-proton collisions collected with the LHCb detector during 2018 at a centre-of-mass energy of 13 TeV, corresponding to an integrated luminosity of 2.1 fb$^{-1}$. The same strategy and sample is used to search for the decays of the $B^0_s$, $B^0$ and $η_b$ mesons into photon pairs. No significant excess is found. Upper limits on the photon-pair branching fraction times the cross-section of ALP production are determined as a function of the ALP mass. Limits on the branching fractions of the beauty states are determined to be $\mathcal{B}(B^0_s\toγγ)<2.7\times10^{-5}$, $\mathcal{B}(B^0\toγγ)<0.83\times10^{-5}$, and $σ(pp\toη_b X)\times\mathcal{B}(η_b\toγγ)<765\,\text{pb}$ at 95 % confidence level.
Submitted 18 July, 2025;
originally announced July 2025.
-
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
Authors:
Dachuan Shi,
Yonggan Fu,
Xiangchi Yuan,
Zhongzhi Yu,
Haoran You,
Sixu Li,
Xin Dong,
Jan Kautz,
Pavlo Molchanov,
Yingyan Lin
Abstract:
Recent advancements in Large Language Models (LLMs) have spurred interest in numerous applications requiring robust long-range capabilities, essential for processing extensive input contexts and continuously generating extended outputs. As sequence lengths increase, the number of Key-Value (KV) pairs in LLMs escalates, creating a significant efficiency bottleneck. In this paper, we propose a new KV cache optimization paradigm called LaCache, a training-free method for efficient and accurate generative inference of LLMs. LaCache enables LLMs to simultaneously address both of the critical challenges in long-range modeling: robust long-range capabilities and continuous generation without running out of memory (OOM). Specifically, LaCache integrates two key innovations: (1) a ladder-shaped KV cache pattern that stores KV pairs not only sequentially (left-to-right within each layer) but also across layers (from shallow to deep), providing an extended span for capturing long-range dependencies under a fixed storage budget, thereby boosting long-range capabilities; and (2) an iterative compaction mechanism that progressively compresses older caches, freeing up space for new tokens within a fixed cache size. This token distance-based dynamic compression enables more effective continuous generation under constrained cache budgets. Experiments across various tasks, benchmarks, and LLMs consistently validate LaCache's effectiveness in enhancing LLMs' long-range capabilities. Our code is available at https://github.com/GATECH-EIC/LaCache.
Submitted 14 July, 2025;
originally announced July 2025.
-
Improved measurement of $η/ η^{\prime}$ mixing in $B^{0}_{(s)} \rightarrow J/ψη^{(\prime)}$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1181 additional authors not shown)
Abstract:
Branching fraction ratios between the decays $B^{0}_{(s)} \rightarrow J/ψη^{(\prime)}$ are measured using proton-proton collision data collected by the LHCb experiment at centre-of-mass energies of $7$, $8$ and $13~\textrm{TeV}$, corresponding to an integrated luminosity of $9~ \textrm{fb}^{-1}$. The measured ratios of these branching fractions are $\frac{BF(B^{0} \rightarrow J/ψη^{\prime})}{BF(B^{0} \rightarrow J/ψη)} = 0.48 \pm 0.06 \pm 0.02 \pm 0.01$ and $\frac{BF(B^{0}_{s} \rightarrow J/ψη^{\prime})}{BF(B^{0}_{s} \rightarrow J/ψη)} = 0.80 \pm 0.02 \pm 0.02 \pm 0.01$, where the uncertainties are statistical, systematic and related to the precision of the $η^{(\prime)}$ branching fractions, respectively. They are used to constrain the $η/η^{\prime}$ mixing angle, $φ_{P}$, and to probe the presence of a possible glueball component in the $η^{\prime}$ meson, described by the gluonic mixing angle $φ_{G}$. The obtained results are $φ_{P} = (41.6^{+1.0}_{-1.2})^\circ$ and $φ_{G} = (28.1^{+3.9}_{-4.0})^\circ$, where the uncertainties are statistically dominated. While the value of $φ_{P}$ is compatible with existing experimental determinations and theoretical calculations, the angle $φ_{G}$ differs from zero by more than four standard deviations, which points to a substantial glueball component in the $η^{\prime}$ meson and/or unexpectedly large contributions from gluon-mediated processes in these decays. The absolute branching fractions are also measured relative to that of the well-established $B^{0}_{s} \rightarrow J/ψφ$ decay, which serves as the normalisation channel. These results supersede the previous LHCb measurements and are the most precise to date.
Submitted 18 July, 2025;
originally announced July 2025.
-
Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training
Authors:
Mingjie Liu,
Shizhe Diao,
Jian Hu,
Ximing Lu,
Xin Dong,
Hao Zhang,
Alexander Bukharin,
Shaokun Zhang,
Jiaqi Zeng,
Makesh Narsimhan Sreedhar,
Gerald Shen,
David Mosallanezhad,
Di Zhang,
Jonas Yang,
June Yang,
Oleksii Kuchaiev,
Guilin Liu,
Zhiding Yu,
Pavlo Molchanov,
Yejin Choi,
Jan Kautz,
Yi Dong
Abstract:
Recent advancements in reasoning-focused language models such as OpenAI's O1 and DeepSeek-R1 have shown that scaling test-time computation (through chain-of-thought reasoning and iterative exploration) can yield substantial improvements on complex tasks like mathematics and code generation. These breakthroughs have been driven by large-scale reinforcement learning (RL), particularly when combined with verifiable reward signals that provide objective and grounded supervision. In this report, we investigate the effects of prolonged reinforcement learning on a small language model across a diverse set of reasoning domains. Our work identifies several key ingredients for effective training, including the use of verifiable reward tasks, enhancements to Group Relative Policy Optimization (GRPO), and practical techniques to improve training stability and generalization. We introduce controlled KL regularization, clipping ratio, and periodic reference policy resets as critical components for unlocking long-term performance gains. Our model achieves significant improvements over strong baselines, including +14.7% on math, +13.9% on coding, and +54.8% on logic puzzle tasks. To facilitate continued research, we release our model publicly.
Submitted 16 July, 2025;
originally announced July 2025.
-
Precision measurement of the ${\itΞ}_b^0$ baryon lifetime
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1175 additional authors not shown)
Abstract:
A sample of $pp$ collision data, corresponding to an integrated luminosity of 5.4 fb$^{-1}$ and collected by the LHCb experiment during LHC Run 2, is used to measure the ratio of the lifetime of the ${\itΞ}_b^0$ baryon to that of the ${\itΛ}_b^0$ baryon, $r_τ\equivτ_{{\itΞ}_b^0}/τ_{{\itΛ}_b^0}$. The value ${r_τ^{\rm Run\,2}=1.004\pm0.009\pm0.006}$ is obtained, where the first uncertainty is statistical and the second systematic. This value is averaged with the corresponding value from Run 1 to obtain ${r_τ = 1.004\pm0.008\pm0.005}$. Multiplying by the known value of the ${\itΛ}_b^0$ lifetime yields ${{τ_{{\itΞ}_b^0}} = 1.475\pm0.012\pm0.008\pm0.009~{\rm ps}}$, where the last uncertainty is due to the limited knowledge of the ${\itΛ}_b^0$ lifetime. This measurement improves the precision of the current world average of the ${\itΞ}_b^0$ lifetime by about a factor of two, and is in good agreement with the most recent theoretical predictions.
Submitted 30 September, 2025; v1 submitted 16 July, 2025;
originally announced July 2025.
-
First observation of the $\mathitΛ_b^{0}\!\rightarrow\mathitΛ_{c}^{+}D_{s}^{-}K^{+}K^{-}$ decay and search for pentaquarks in the $\mathitΛ_{c}^{+}D_{s}^{-}$ system
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1175 additional authors not shown)
Abstract:
The $\mathitΛ_b^{0}\!\rightarrow\mathitΛ_{c}^{+}D_{s}^{-}K^{+}K^{-}$ decay is observed for the first time using the data sample from proton-proton collisions recorded at a center-of-mass energy of $13\,\text{TeV}$ with the LHCb detector, corresponding to an integrated luminosity of $6\,\text{fb}^{-1}$. The ratio of branching fraction to that of $\mathitΛ_b^{0} \!\rightarrow\mathitΛ_{c}^{+}D_{s}^{-}$ decays is measured as $0.0141 \pm 0.0019 \pm 0.0012$, where the first uncertainty is statistical and the second systematic. A search for hidden-charm pentaquarks with strangeness is performed in the $\mathitΛ_{c}^{+}D_{s}^{-}$ system. No evidence is found, and upper limits on the production ratio of $P_{c\bar{c}s}(4338)^0$ and $P_{c\bar{c}s}(4459)^0$ pentaquarks relative to the $\mathitΛ_{c}^{+}D_{s}^{-}$ final state are set at the $95\%$ confidence level as $0.12$ and $0.20$, respectively.
Submitted 30 September, 2025; v1 submitted 14 July, 2025;
originally announced July 2025.
-
Scaling RL to Long Videos
Authors:
Yukang Chen,
Wei Huang,
Baifeng Shi,
Qinghao Hu,
Hanrong Ye,
Ligeng Zhu,
Zhijian Liu,
Pavlo Molchanov,
Jan Kautz,
Xiaojuan Qi,
Sifei Liu,
Hongxu Yin,
Yao Lu,
Song Han
Abstract:
We introduce a full-stack framework that scales up reasoning in vision-language models (VLMs) to long videos, leveraging reinforcement learning. We address the unique challenges of long video reasoning by integrating three critical components: (1) a large-scale dataset, LongVideo-Reason, comprising 104K long video QA pairs with high-quality reasoning annotations across diverse domains such as sports, games, and vlogs; (2) a two-stage training pipeline that extends VLMs with chain-of-thought supervised fine-tuning (CoT-SFT) and reinforcement learning (RL); and (3) a training infrastructure for long video RL, named Multi-modal Reinforcement Sequence Parallelism (MR-SP), which incorporates sequence parallelism and a vLLM-based engine tailored for long video, using cached video embeddings for efficient rollout and prefilling. In our experiments, LongVILA-R1-7B achieves strong performance on video benchmarks, reaching 65.1% and 71.1% accuracy on VideoMME without and with subtitles, respectively, and consistently outperforming LongVILA-7B across multiple benchmarks. Moreover, LongVILA-R1-7B supports processing up to 8,192 video frames per video, with configurable FPS settings. Notably, our MR-SP system achieves up to 2.1x speedup on long video RL training. In addition, we publicly release our training system, which supports RL training on various modalities (video, text, and audio), various models (VILA and Qwen series), and even image and video generation models. On a single A100 node (8 GPUs), it supports RL training on hour-long videos (e.g., 3,600 frames).
Submitted 30 September, 2025; v1 submitted 10 July, 2025;
originally announced July 2025.
-
Observation of orbitally excited $B_{c}^{+}$ states
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1154 additional authors not shown)
Abstract:
The observation of a wide peaking structure in the $B_{c}^{+} γ$ mass spectrum is reported using proton-proton collision data collected by the LHCb detector at center-of-mass energies of $7$, $8$ and $13~\text{TeV}$, corresponding to a total integrated luminosity of $9~\text{fb}^{-1}$. The statistical significance over the background-only hypothesis exceeds seven standard deviations. The width of the observed structure is larger than the expectation from a single-peak hypothesis, and is well described by an effective minimal model consisting of two narrow peaks located at $6704.8 \pm 5.5 \pm 2.8 \pm 0.3~\mathrm{Me\kern -0.1em V\!/}c^2$ and $6752.4 \pm 9.5 \pm 3.1 \pm 0.3~\mathrm{Me\kern -0.1em V\!/}c^2$. The uncertainty terms are statistical, systematic, and associated to the knowledge of the $B_{c}^{+}$ mass, respectively. The measured peak locations are in line with theoretical predictions for lowest excited $P$-wave $B_{c}^{+}$ states, marking the first observation of orbitally excited beauty-charm mesons and providing important insights into the internal dynamics of hadrons containing two heavy quarks.
Submitted 4 July, 2025; v1 submitted 2 July, 2025;
originally announced July 2025.
-
Study of $B_{c}(1P)^{+}$ states in the $B_{c}^{+} γ$ mass spectrum
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1154 additional authors not shown)
Abstract:
The study of a wide peaking structure in the $B_{c}^{+} γ$ mass spectrum is reported using a data sample of proton-proton collisions collected by the LHCb detector at center-of-mass energies of $7$, $8$ and $13~\text{TeV}$, corresponding to an integrated luminosity of $9~\text{fb}^{-1}$. The observed structure is consistent with the lowest excited $P$-wave $B_{c}^{+}$ states and exhibits a statistical significance exceeding seven standard deviations relative to the background-only hypothesis. A two-peak model serves as an effective description of the data, with various theory-constrained models further explored to provide physical interpretation. Based on the predictions for the $B_{c}(1P)^{+}$ spectrum, the relative production cross-section of the overall $B_{c}(1P)^{+}$ states with respect to the $B_{c}^{+}$ ground state with the transverse momentum $p_{\text{T}}$ and rapidity $y$ of $B_{c}^{+}$ mesons in the regions $p_{\text{T}}<20~\mathrm{Ge\kern -0.1em V\!/}c$ and $2.0<y<4.5$ at $\sqrt{s}=13~\text{TeV}$ is measured to be $0.20 \pm 0.03 \pm 0.02 \pm 0.03$, where the uncertainty terms represent statistical, systematic, and uncertainties related to the choice of theoretical models, respectively. The results provide a test of theoretical models and deepen our understanding of quantum chromodynamics.
Submitted 4 July, 2025; v1 submitted 2 July, 2025;
originally announced July 2025.
-
Updated measurement of $CP$ violation and polarisation in $B^0_s \rightarrow J/ψ\overline{K}{}^{*}\kern-1pt(892)^{0}$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
R. Aleksiejunas,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1168 additional authors not shown)
Abstract:
A time-integrated angular analysis of the decay $B^0_s \rightarrow J/ψ\overline{K}{}^{*}\kern-1pt(892)^{0}$, with $J/ψ\rightarrow μ^{+} μ^{-}$ and $\overline{K}{}^{*}\kern-1pt(892)^{0} \rightarrow K^{-} π^{+}$, is presented. The analysis employs a sample of proton-proton collision data collected by the LHCb experiment during 2015-2018 at a centre-of-mass energy of $13\,\text{TeV}$, corresponding to an integrated luminosity of $6\,\text{fb}^{-1}$. A simultaneous maximum-likelihood fit is performed to the angular distributions in bins of the $K^{-} π^{+}$ mass. This fit yields measurements of the $CP$-averaged polarisation fractions and $CP$ asymmetries for the P-wave component of the $K^{-} π^{+}$ system. The longitudinal and parallel polarisation fractions are determined to be $f_{0} = 0.534 \pm 0.012 \pm 0.009$ and $f_{\parallel} = 0.211 \pm 0.014 \pm 0.005$, respectively, where the first uncertainty is statistical and the second is systematic. The $CP$ asymmetries are measured with $3$-$7\%$ precision and are found to be consistent with zero. These measurements, along with an updated determination of the branching fraction relative to the $B^0 \rightarrow J/ψK^{*0}$ decay, are combined with previous LHCb results, providing the most precise values for these observables to date.
Submitted 23 October, 2025; v1 submitted 27 June, 2025;
originally announced June 2025.
-
Minifinetuning: Low-Data Generation Domain Adaptation through Corrective Self-Distillation
Authors:
Peter Belcak,
Greg Heinrich,
Jan Kautz,
Pavlo Molchanov
Abstract:
Finetuning language models for a new domain inevitably leads to the deterioration of their general performance. This becomes more pronounced the more limited the finetuning data resource.
We introduce minifinetuning (MFT), a method for language model domain adaptation that considerably reduces the effects of overfitting-induced degeneralization in low-data settings and which does so in the absence of any pre-training data for replay. MFT demonstrates 2-10x more favourable specialization-to-degeneralization ratios than standard finetuning across a wide range of models and domains and exhibits an intrinsic robustness to overfitting when data in the new domain is scarce and down to as little as 500 samples.
Employing corrective self-distillation that is individualized on the sample level, MFT outperforms parameter-efficient finetuning methods, demonstrates replay-like degeneralization mitigation properties, and is composable with either for a combined effect.
Submitted 29 May, 2025;
originally announced June 2025.
-
Search for the lepton-flavour-violating decays $B^0 \to K^{*0} τ^\pm e^\mp$
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1128 additional authors not shown)
Abstract:
A first search for the lepton-flavour-violating decays $B^0\to K^{*0}τ^\pm e^\mp$ is presented. The analysis is performed using a sample of proton-proton collision data, collected with the LHCb detector at a centre-of-mass energy of 13 TeV between 2016 and 2018, corresponding to an integrated luminosity of 5.4 fb$^{-1}$. No significant signal is observed, and upper limits on the branching fractions are determined to be $\mathcal{B}(B^0 \to K^{*0}τ^-e^+) < 5.9\,(7.1)\times 10^{-6}$ and $\mathcal{B}(B^0 \to K^{*0}τ^+e^-) < 4.9\,(5.9)\times 10^{-6}$ at the 90% (95%) confidence level.
Submitted 18 June, 2025;
originally announced June 2025.
-
Measurement of the $Ω_c^0$ and $Ξ_c^0$ baryon lifetimes using hadronic $b$-baryon decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1141 additional authors not shown)
Abstract:
The lifetimes of the $Ω_c^0$ and $Ξ_c^0$ baryons are measured using a $pp$ collision dataset collected by the LHCb experiment, corresponding to an integrated luminosity of $9~\rm{fb^{-1}}$. The charm baryons are produced in the fully reconstructed decay chains $Ω_b^- \rightarrow Ω_c^0 (\rightarrow pK^-K^-π^+)~π^-$ and $Ξ_b^- \rightarrow Ξ_c^0 (\rightarrow pK^-K^-π^+)~π^-$. The measurement uses topologically and kinematically similar $B^- \rightarrow D^0(\rightarrow K^-K^+π^-π^+)~π^-$ decays for normalisation. The measured lifetimes are
$τ_{Ω_c^0} = 276.3 \pm 19.4~\rm{(stat)} \pm 1.8~\rm{(syst)} \pm 0.7~(τ_{D^0})~\rm{fs}$,
$τ_{Ξ_c^0} = 149.2 \pm ~\,2.5~\rm{(stat)} \pm 0.9~\rm{(syst)} \pm 0.4~(τ_{D^0})~\rm{fs}$,
where the first uncertainty is statistical, the second systematic and the third due to the uncertainty of the $D^0$ lifetime. These results are consistent with previous measurements performed by the LHCb experiment.
Submitted 1 October, 2025; v1 submitted 16 June, 2025;
originally announced June 2025.
-
Measurement of $ψ(2S)$ to $J/ψ$ cross-section ratio as function of multiplicity in $p$Pb collisions at $\sqrt{s_{NN}} = 8.16$ TeV
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1137 additional authors not shown)
Abstract:
The production ratio of $ψ(2S)$ to $J/ψ$ charmonium states is presented as a function of multiplicity in proton-lead collisions at a centre-of-mass energy of $\sqrt{s_{NN}}=8.16$ TeV, for both prompt and nonprompt sources. The total luminosity recorded by the LHCb experiment corresponds to 13.6 pb$^{-1}$ for $p$Pb collisions and 20.8 pb$^{-1}$ for Pb$p$ collisions, where the first particle indicates the forward direction of the detector. Measurements are performed in the dimuon final state at forward (backward) centre-of-mass rapidity $1.5<y^*<4.0$ ($-5.0<y^*<-2.5$) for $p$Pb (Pb$p$) collisions. A multiplicity dependence of the prompt production ratio is observed in $p$Pb collisions, whereas no dependence is found in nonprompt production, nor in either prompt or nonprompt production in Pb$p$ collisions. These results suggest that in the Pb-going direction additional suppression mechanisms beyond comover effects may be present, possibly related to the formation of quark-gluon plasma. This highlights a transition from small to large collision systems and provides important insight into the suppression of charmonia in proton-nucleus collisions.
Submitted 12 June, 2025; v1 submitted 10 June, 2025;
originally announced June 2025.
-
Coherent photoproduction of $ρ^0, ω$ and excited vector mesons in ultraperipheral PbPb collisions
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1127 additional authors not shown)
Abstract:
The invariant-mass distribution for the coherent photoproduction of dipions in ultraperipheral PbPb collisions is measured using data, corresponding to an integrated luminosity of $ 224.6 \pm 9.6\ μ$b$^{-1}$, collected by the LHCb experiment in 2018 at a nucleon-nucleon centre-of-mass energy $\sqrt{s_{\rm NN}}=5.02$ TeV. The dominant contribution is due to the $ρ^0$ meson but a consistent description across the full invariant-mass range requires accounting for the $ω$ meson and introducing two resonances at masses of $1350\pm20$ MeV and $1790\pm20$ MeV with widths of about 300 MeV. The cross-section for each meson is measured differentially in twelve bins of rapidity from 2.05 to 4.90. Significant nuclear suppression is observed for the $ρ^0$ meson compared to expectations based on photoproduction on the proton.
Submitted 6 June, 2025;
originally announced June 2025.
-
Three-pion Bose-Einstein correlations measured in proton-proton collisions
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1125 additional authors not shown)
Abstract:
A study on the Bose-Einstein correlations for triplets of same-sign pions is presented. The analysis is performed using proton-proton collisions at a centre-of-mass energy of $\sqrt{s}$ = 7 TeV, recorded by the LHCb experiment, corresponding to an integrated luminosity of 1.0 fb$^{-1}$. For the first time, the results are interpreted in the core-halo model. The parameters of the model are determined in regions of charged-particle multiplicity. This measurement provides insight into the nature of hadronisation in terms of coherence, showing a coherent emission of pions.
Submitted 29 August, 2025; v1 submitted 3 June, 2025;
originally announced June 2025.
-
AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion
Authors:
Yangyi Huang,
Ye Yuan,
Xueting Li,
Jan Kautz,
Umar Iqbal
Abstract:
Existing methods for image-to-3D avatar generation struggle to produce highly detailed, animation-ready avatars suitable for real-world applications. We introduce AdaHuman, a novel framework that generates high-fidelity animatable 3D avatars from a single in-the-wild image. AdaHuman incorporates two key innovations: (1) A pose-conditioned 3D joint diffusion model that synthesizes consistent multi-view images in arbitrary poses alongside corresponding 3D Gaussian Splats (3DGS) reconstruction at each diffusion step; (2) A compositional 3DGS refinement module that enhances the details of local body parts through image-to-image refinement and seamlessly integrates them using a novel crop-aware camera ray map, producing a cohesive detailed 3D avatar. These components allow AdaHuman to generate highly realistic standardized A-pose avatars with minimal self-occlusion, enabling rigging and animation with any input motion. Extensive evaluation on public benchmarks and in-the-wild images demonstrates that AdaHuman significantly outperforms state-of-the-art methods in both avatar reconstruction and reposing. Code and models will be publicly available for research purposes.
Submitted 30 May, 2025;
originally announced May 2025.
-
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
Authors:
Mingjie Liu,
Shizhe Diao,
Ximing Lu,
Jian Hu,
Xin Dong,
Yejin Choi,
Jan Kautz,
Yi Dong
Abstract:
Recent advances in reasoning-centric language models have highlighted reinforcement learning (RL) as a promising method for aligning models with verifiable rewards. However, it remains contentious whether RL truly expands a model's reasoning capabilities or merely amplifies high-reward outputs already latent in the base model's distribution, and whether continually scaling up RL compute reliably leads to improved reasoning performance. In this work, we challenge prevailing assumptions by demonstrating that prolonged RL (ProRL) training can uncover novel reasoning strategies that are inaccessible to base models, even under extensive sampling. We introduce ProRL, a novel training methodology that incorporates KL divergence control, reference policy resetting, and a diverse suite of tasks. Our empirical analysis reveals that RL-trained models consistently outperform base models across a wide range of pass@k evaluations, including scenarios where base models fail entirely regardless of the number of attempts. We further show that reasoning boundary improvements correlate strongly with the task competence of the base model and with training duration, suggesting that RL can explore and populate new regions of solution space over time. These findings offer new insights into the conditions under which RL meaningfully expands reasoning boundaries in language models and establish a foundation for future work on long-horizon RL for reasoning. We release model weights to support further research: https://huggingface.co/nvidia/Nemotron-Research-Reasoning-Qwen-1.5B
Submitted 30 May, 2025;
originally announced May 2025.
-
Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought
Authors:
Yunze Man,
De-An Huang,
Guilin Liu,
Shiwei Sheng,
Shilong Liu,
Liang-Yan Gui,
Jan Kautz,
Yu-Xiong Wang,
Zhiding Yu
Abstract:
Recent advances in multimodal large language models (MLLMs) have demonstrated remarkable capabilities in vision-language tasks, yet they often struggle with vision-centric scenarios where precise visual focus is needed for accurate reasoning. In this paper, we introduce Argus to address these limitations with a new visual attention grounding mechanism. Our approach employs object-centric grounding as visual chain-of-thought signals, enabling more effective goal-conditioned visual attention during multimodal reasoning tasks. Evaluations on diverse benchmarks demonstrate that Argus excels in both multimodal reasoning tasks and referring object grounding tasks. Extensive analysis further validates various design choices of Argus, and reveals the effectiveness of explicit language-guided visual region-of-interest engagement in MLLMs, highlighting the importance of advancing multimodal intelligence from a visual-centric perspective. Project page: https://yunzeman.github.io/argus/
Submitted 29 May, 2025;
originally announced May 2025.
-
Measurement of the Lund plane for light- and beauty-quark jets
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1133 additional authors not shown)
Abstract:
The substructure of jets in quantum chromodynamics (QCD) has garnered significant attention with the advent of infrared- and collinear-safe clustering algorithms and observables. A key question emerging from these studies is how in-jet emissions at soft and hard energy scales, across collinear and wide angles relative to the emitter, differ with the mass of the emitting parton. The Lund jet plane (LJP) is a perturbatively well-defined substructure observable that maps the radiation pattern of jets onto a plane, visually distinguishing emissions with different kinematic properties. Comparing LJP for jets containing hadrons of low versus high mass enables the testing of QCD splitting functions from first-principles calculations across both soft and hard regimes and at different radiation angles. This article presents the first measurement of the LJP for light-quark-enriched and beauty-initiated jets at a center-of-mass energy of 13 TeV at LHCb. This marks the first direct observation of the dead-cone effect in beauty-quark jets, measured in the collinear region of the LJP.
Submitted 29 May, 2025;
originally announced May 2025.
-
GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion
Authors:
Gwanghyun Kim,
Xueting Li,
Ye Yuan,
Koki Nagano,
Tianye Li,
Jan Kautz,
Se Young Chun,
Umar Iqbal
Abstract:
Estimating accurate and temporally consistent 3D human geometry from videos is a challenging problem in computer vision. Existing methods, primarily optimized for single images, often suffer from temporal inconsistencies and fail to capture fine-grained dynamic details. To address these limitations, we present GeoMan, a novel architecture designed to produce accurate and temporally consistent depth and normal estimations from monocular human videos. GeoMan addresses two key challenges: the scarcity of high-quality 4D training data and the need for metric depth estimation to accurately model human size. To overcome the first challenge, GeoMan employs an image-based model to estimate depth and normals for the first frame of a video, which then conditions a video diffusion model, reframing the video geometry estimation task as an image-to-video generation problem. This design offloads the heavy lifting of geometric estimation to the image model and simplifies the video model's role to focus on intricate details while using priors learned from large-scale video datasets. Consequently, GeoMan improves temporal consistency and generalizability while requiring minimal 4D training data. To address the challenge of accurate human size estimation, we introduce a root-relative depth representation that retains critical human-scale details and is easier to estimate from monocular inputs, overcoming the limitations of traditional affine-invariant and metric depth representations. GeoMan achieves state-of-the-art performance in both qualitative and quantitative evaluations, demonstrating its effectiveness in overcoming longstanding challenges in 3D human geometry estimation from videos.
Submitted 29 May, 2025;
originally announced May 2025.
-
FLARE: Robot Learning with Implicit World Modeling
Authors:
Ruijie Zheng,
Jing Wang,
Scott Reed,
Johan Bjorck,
Yu Fang,
Fengyuan Hu,
Joel Jang,
Kaushil Kundalia,
Zongyu Lin,
Loic Magne,
Avnish Narayan,
You Liang Tan,
Guanzhi Wang,
Qi Wang,
Jiannan Xiang,
Yinzhen Xu,
Seonghyeon Ye,
Jan Kautz,
Furong Huang,
Yuke Zhu,
Linxi Fan
Abstract:
We introduce $\textbf{F}$uture $\textbf{LA}$tent $\textbf{RE}$presentation Alignment ($\textbf{FLARE}$), a novel framework that integrates predictive latent world modeling into robot policy learning. By aligning features from a diffusion transformer with latent embeddings of future observations, $\textbf{FLARE}$ enables a diffusion transformer policy to anticipate latent representations of future observations, allowing it to reason about long-term consequences while generating actions. Remarkably lightweight, $\textbf{FLARE}$ requires only minimal architectural modifications -- adding a few tokens to standard vision-language-action (VLA) models -- yet delivers substantial performance gains. Across two challenging multitask simulation imitation learning benchmarks spanning single-arm and humanoid tabletop manipulation, $\textbf{FLARE}$ achieves state-of-the-art performance, outperforming prior policy learning baselines by up to 26%. Moreover, $\textbf{FLARE}$ unlocks the ability to co-train with human egocentric video demonstrations without action labels, significantly boosting policy generalization to a novel object with unseen geometry with as few as a single robot demonstration. Our results establish $\textbf{FLARE}$ as a general and scalable approach for combining implicit world modeling with high-frequency robotic control.
Submitted 21 May, 2025;
originally announced May 2025.
-
Measurement of the Z-boson mass
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1126 additional authors not shown)
Abstract:
The first dedicated $Z$-boson mass measurement at the LHC with $Z \to μ^+μ^-$ decays is reported. The dataset uses proton-proton collisions at a centre-of-mass energy of $13$ TeV, recorded in 2016 by the LHCb experiment, and corresponds to an integrated luminosity of $1.7$ fb$^{-1}$. A template fit to the $μ^+μ^-$ mass distribution yields the following result for the $Z$-boson mass,
\begin{equation*}
m_{Z} = 91185.7 \pm 8.3 \pm 3.9 \,\mathrm{MeV},
\end{equation*}
where the first uncertainty is statistical and the second systematic. This result is consistent with previous measurements and predictions from global electroweak fits.
Submitted 19 October, 2025; v1 submitted 21 May, 2025;
originally announced May 2025.