-
XRISM Reveals a Remnant Torus in the Low-Luminosity AGN M81*
Authors:
Jon M. Miller,
Ehud Behar,
Hisamitsu Awaki,
Ann Hornschemeier,
Jesse Bluem,
Luigi Gallo,
Shogo B. Kobayashi,
Richard Mushotzky,
Masanori Ohno,
Robert Petre,
Kosuke Sato,
Yuichi Terashima,
Mihoko Yukita
Abstract:
Up to 40% of galaxies in the local universe host a low-luminosity active galactic nucleus (LLAGN), making it vital to understand this mode of black hole accretion. However, the presence or absence of Seyfert-like geometries - an accretion disk close to the black hole, an optical broad line region (BLR), and a molecular torus - remains uncertain owing to the low flux levels of sources within this c…
▽ More
Up to 40% of galaxies in the local universe host a low-luminosity active galactic nucleus (LLAGN), making it vital to understand this mode of black hole accretion. However, the presence or absence of Seyfert-like geometries - an accretion disk close to the black hole, an optical broad line region (BLR), and a molecular torus - remains uncertain owing to the low flux levels of sources within this class. Herein, we present an analysis of a XRISM/Resolve spectrum of M81*, the LLAGN in the heart of the nearby spiral galaxy M81. A weak, neutral Fe K emission line is detected and resolved into K$_{α,1}$ and K$_{α,2}$ components. It shows a negligible velocity shift, and weak broadening (FWHM$=460^{+260}_{-160}~{\rm km}~{\rm s}^{-1}$) that corresponds to an inner emission radius of ${\rm r} \geq 2.7\times 10^{4}~GM/c^{2}$ for likely inclinations. The Fe K$_α$ line likely traces a torus. The upper limit on additional splitting of the Fe K$_α$ line components translates to a limit on the local magnetic field of ${\rm B} \leq 3.5\times 10^{8}$ Gauss, assuming Zeeman splitting. The spectra also reveal ionized plasma(s) through He-like Fe XXV and H-like Fe XXVI emission lines. These can be fit equally well assuming photoionization and collisional excitation. The H-like Fe XXVI line is better described when a second component is included with a red-shift of ${\rm v} = 1600~{\rm km}~{\rm s}^{-1}$, but this addition is of marginal statistical significance. We discuss these results in the context of radiatively inefficient accretion flow models, magnetically arrested disks, and possible links to the Fermi bubbles in the Milky Way.
△ Less
Submitted 19 May, 2025;
originally announced May 2025.
-
Study of magneto-thermal resistance effect in a Co50Fe50/Cu multilayer through the analysis of electron and lattice thermal conductivities
Authors:
Fuya Makino,
Takamasa Hirai,
Takuma Shiga,
Hirofumi Suto,
Hiroshi Fujihisa,
Koichi Oyanagi,
Satoru Kobayashi,
Taisuke Sasaki,
Takashi Yagi,
Ken-ichi Uchida,
Yuya Sakuraba
Abstract:
This study investigates the giant magneto-thermal resistance (GMTR) effect in a fully-bcc epitaxial Co50Fe50/Cu multilayer through both experimental and theoretical approaches. The applied magnetic field results in a giant change of the cross-plane thermal conductivity (Δ\k{appa}) of 37 W m-1 K-1, which reaches 1.5 times larger than the previously reported value for a magnetic multilayer and recor…
▽ More
This study investigates the giant magneto-thermal resistance (GMTR) effect in a fully-bcc epitaxial Co50Fe50/Cu multilayer through both experimental and theoretical approaches. The applied magnetic field results in a giant change of the cross-plane thermal conductivity (Δ\k{appa}) of 37 W m-1 K-1, which reaches 1.5 times larger than the previously reported value for a magnetic multilayer and record the highest value at room temperature among the other solid-state thermal switching materials working on different principles. We investigated the electron thermal conductivity for exploring the remarkable Δ\k{appa} by the two-current-series-resistor model combined with the Wiedemann-Franz (WF) law. However, the result shows the electron contribution accounts for only 35% of the Δ\k{appa}, indicating the presence of additional spin-dependent heat carriers. Further investigation of the lattice thermal conductivity, which is expected to be spin-independent, using non-equilibrium molecular dynamics (NEMD) simulations suggests a striking contrast: the additional spin-dependent heat carrier contribution is significantly enhanced in the parallel magnetization configuration but nearly negligible in the antiparallel configuration. These findings provide a fundamental insight into the origin of large GMTR effect and highlight its potential of active thermal management technologies for future electronic devices.
△ Less
Submitted 14 May, 2025;
originally announced May 2025.
-
Constraining gas motion and non-thermal pressure beyond the core of the Abell 2029 galaxy cluster with XRISM
Authors:
XRISM Collaboration,
Marc Audard,
Hisamitsu Awaki,
Ralf Ballhausen,
Aya Bamba,
Ehud Behar,
Rozenn Boissay-Malaquin,
Laura Brenneman,
Gregory Brown,
Lia Corrales,
Elisa Costantini,
Renata Cumbee,
Maria Diaz Trigo,
Chris Done,
Tadayasu Dotani,
Ken Ebisawa,
Megan Eckart,
Dominique Eckert,
Satoshi Eguchi,
Teruaki Enoto,
Yuichiro Ezoe,
Adam Foster,
Ryuichi Fujimoto,
Yutaka Fujita,
Yasushi Fukazawa
, et al. (115 additional authors not shown)
Abstract:
We report a detailed spectroscopic study of the gas dynamics and hydrostatic mass bias of the galaxy cluster Abell 2029, utilizing high-resolution observations from XRISM Resolve. Abell 2029, known for its cool core and relaxed X-ray morphology, provides an excellent opportunity to investigate the influence of gas motions beyond the central region. Expanding upon prior studies that revealed low tu…
▽ More
We report a detailed spectroscopic study of the gas dynamics and hydrostatic mass bias of the galaxy cluster Abell 2029, utilizing high-resolution observations from XRISM Resolve. Abell 2029, known for its cool core and relaxed X-ray morphology, provides an excellent opportunity to investigate the influence of gas motions beyond the central region. Expanding upon prior studies that revealed low turbulence and bulk motions within the core, our analysis covers regions out to the scale radius $R_{2500}$ (670~kpc) based on three radial pointings extending from the cluster center toward the northern side. We obtain accurate measurements of bulk and turbulent velocities along the line of sight. The results indicate that non-thermal pressure accounts for no more than 2% of the total pressure at all radii, with a gradual decrease outward. The observed radial trend differs from many numerical simulations, which often predict an increase in non-thermal pressure fraction at larger radii. These findings suggest that deviations from hydrostatic equilibrium are small, leading to a hydrostatic mass bias of around 2% across the observed area.
△ Less
Submitted 10 May, 2025;
originally announced May 2025.
-
XRISM forecast for the Coma cluster: stormy, with a steep power spectrum
Authors:
XRISM Collaboration,
Marc Audard,
Hisamitsu Awaki,
Ralf Ballhausen,
Aya Bamba,
Ehud Behar,
Rozenn Boissay-Malaquin,
Laura Brenneman,
Gregory V. Brown,
Lia Corrales,
Elisa Costantini,
Renata Cumbee,
Maria Diaz Trigo,
Chris Done,
Tadayasu Dotani,
Ken Ebisawa,
Megan E. Eckart,
Dominique Eckert,
Satoshi Eguchi,
Teruaki Enoto,
Yuichiro Ezoe,
Adam Foster,
Ryuichi Fujimoto,
Yutaka Fujita,
Yasushi Fukazawa
, et al. (120 additional authors not shown)
Abstract:
The XRISM Resolve microcalorimeter array measured the velocities of hot intracluster gas at two positions in the Coma galaxy cluster: 3'x3' squares at the center and at 6' (170 kpc) to the south. We find the line-of-sight velocity dispersions in those regions to be sigma_z=208+-12 km/s and 202+-24 km/s, respectively. The central value corresponds to a 3D Mach number of M=0.24+-0.015 and the ratio…
▽ More
The XRISM Resolve microcalorimeter array measured the velocities of hot intracluster gas at two positions in the Coma galaxy cluster: 3'x3' squares at the center and at 6' (170 kpc) to the south. We find the line-of-sight velocity dispersions in those regions to be sigma_z=208+-12 km/s and 202+-24 km/s, respectively. The central value corresponds to a 3D Mach number of M=0.24+-0.015 and the ratio of the kinetic pressure of small-scale motions to thermal pressure in the intracluster plasma of only 3.1+-0.4%, at the lower end of predictions from cosmological simulations for merging clusters like Coma, and similar to that observed in the cool core of the relaxed cluster A2029. Meanwhile, the gas in both regions exhibits high line-of-sight velocity differences from the mean velocity of the cluster galaxies, Delta v_z=450+-15 km/s and 730+-30 km/s, respectively. A small contribution from an additional gas velocity component, consistent with the cluster optical mean, is detected along a sightline near the cluster center. The combination of the observed velocity dispersions and bulk velocities is not described by a Kolmogorov velocity power spectrum of steady-state turbulence; instead, the data imply a much steeper effective slope (i.e., relatively more power at larger linear scales). This may indicate either a very large dissipation scale resulting in the suppression of small-scale motions, or a transient dynamic state of the cluster, where large-scale gas flows generated by an ongoing merger have not yet cascaded down to small scales.
△ Less
Submitted 29 April, 2025;
originally announced April 2025.
-
A duality for minimal surfaces in the Heisenberg group
Authors:
Shimpei Kobayashi
Abstract:
We introduce and study the notion of a transformation surface associated with a nowhere-vertical minimal surface in the three-dimensional Heisenberg group, and prove its minimality and duality. Furthermore, by using the logarithmic derivative of the moving frame with respect to the spectral parameter, we derive the Sym formula for the dual minimal surface.
We introduce and study the notion of a transformation surface associated with a nowhere-vertical minimal surface in the three-dimensional Heisenberg group, and prove its minimality and duality. Furthermore, by using the logarithmic derivative of the moving frame with respect to the spectral parameter, we derive the Sym formula for the dual minimal surface.
△ Less
Submitted 29 April, 2025;
originally announced April 2025.
-
Impossibility via W states and feasibility via W-like states for perfect quantum teleportation
Authors:
Sora Kobayashi,
Kei-Ichi Kondo
Abstract:
We examine the two-party perfect quantum teleportation of an unknown 1-qubit state in the case of sharing various 3-qubit entangled states between a sender and a receiver: GHZ state, W state and W-like state. We give an impossibility proof that the W state cannot be used as the sharing state to realize the perfect quantum teleportation for transmitting an arbitrary 1-qubit state, in sharp contrast…
▽ More
We examine the two-party perfect quantum teleportation of an unknown 1-qubit state in the case of sharing various 3-qubit entangled states between a sender and a receiver: GHZ state, W state and W-like state. We give an impossibility proof that the W state cannot be used as the sharing state to realize the perfect quantum teleportation for transmitting an arbitrary 1-qubit state, in sharp contrast with the GHZ state which is well known to realize the perfect quantum transportation. Moreover, we give a procedure of obtaining a modified entangled state which we call the W-like state to achieve the perfect quantum transportation under a prescribed measurement basis.
△ Less
Submitted 28 April, 2025;
originally announced April 2025.
-
When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars
Authors:
Rei Higuchi,
Ryotaro Kawata,
Naoki Nishikawa,
Kazusato Oko,
Shoichiro Yamaguchi,
Sosuke Kobayashi,
Seiya Tokui,
Kohei Hayashi,
Daisuke Okanohara,
Taiji Suzuki
Abstract:
The ability to acquire latent semantics is one of the key properties that determines the performance of language models. One convenient approach to invoke this ability is to prepend metadata (e.g. URLs, domains, and styles) at the beginning of texts in the pre-training data, making it easier for the model to access latent semantics before observing the entire text. Previous studies have reported t…
▽ More
The ability to acquire latent semantics is one of the key properties that determines the performance of language models. One convenient approach to invoke this ability is to prepend metadata (e.g. URLs, domains, and styles) at the beginning of texts in the pre-training data, making it easier for the model to access latent semantics before observing the entire text. Previous studies have reported that this technique actually improves the performance of trained models in downstream tasks; however, this improvement has been observed only in specific downstream tasks, without consistent enhancement in average next-token prediction loss. To understand this phenomenon, we closely investigate how prepending metadata during pre-training affects model performance by examining its behavior using artificial data. Interestingly, we found that this approach produces both positive and negative effects on the downstream tasks. We demonstrate that the effectiveness of the approach depends on whether latent semantics can be inferred from the downstream task's prompt. Specifically, through investigations using data generated by probabilistic context-free grammars, we show that training with metadata helps improve model's performance when the given context is long enough to infer the latent semantics. In contrast, the technique negatively impacts performance when the context lacks the necessary information to make an accurate posterior inference.
△ Less
Submitted 24 April, 2025;
originally announced April 2025.
-
Dust-obscured Galaxies with Broken Power-law Spectral Energy Distributions Discovered by UNIONS
Authors:
Taketo Yoshida,
Tohru Nagao,
Yoshiki Toba,
Akatoki Noboriguchi,
Kohei Ichikawa,
Hendrik Hildebrandt,
Naomichi Yutani,
Kenneth C. Chambers,
Ryo Iwamoto,
Seira Kobayashi,
Masamune Oguri,
Ken Osato,
Kohei Shibata,
Yuxing Zhong
Abstract:
We report on the spectral energy distributions (SEDs) of infrared-bright dust-obscured galaxies (DOGs) with $(i - [22])_{\rm AB} \geq 7.0$. Using photometry from the deep and wide Ultraviolet Near-Infrared Optical Northern Survey, combined with near-IR and mid-IR data from the UKIRT Infrared Deep Sky Survey and the Wide-field Infrared Survey Explorer, we successfully identified 382 DOGs in $\sim$…
▽ More
We report on the spectral energy distributions (SEDs) of infrared-bright dust-obscured galaxies (DOGs) with $(i - [22])_{\rm AB} \geq 7.0$. Using photometry from the deep and wide Ultraviolet Near-Infrared Optical Northern Survey, combined with near-IR and mid-IR data from the UKIRT Infrared Deep Sky Survey and the Wide-field Infrared Survey Explorer, we successfully identified 382 DOGs in $\sim$ 170 deg$^2$. Among them, the vast majority (376 DOGs) were classified into two subclasses: bump DOGs (132/376) and power-law (PL) DOGs (244/376), which are dominated by star formation and active galactic nucleus (AGN), respectively. Through the SED analysis, we found that roughly half (120/244) of the PL DOGs show ``broken'' power-law SEDs. The significant red slope from optical to near-IR in the SEDs of these ``broken power-law DOGs'' (BPL DOGs) probably reflects their large amount of dust extinction. In other words, BPL DOGs are more heavily obscured AGNs, compared to PL DOGs with non-broken power-law SEDs.
△ Less
Submitted 15 May, 2025; v1 submitted 21 April, 2025;
originally announced April 2025.
-
Exceptionally large winding number of a finite-size topological superconductor
Authors:
Satoshi Ikegaya,
Shingo Kobayashi,
Yasuhiro Asano
Abstract:
We study finite-size-induced topological phenomena in unconventional superconductors. Specifically, we focus on a thin film with a persistent spin texture, fabricated on a high-$T_{\text{c}}$ cuprate $d_{xy}$-wave superconductors. In two-dimensional $d_{xy}$-wave superconductors, flat-band Andreev bound states appear at the edges. As the system narrows, these bound states acquire an energy gap due…
▽ More
We study finite-size-induced topological phenomena in unconventional superconductors. Specifically, we focus on a thin film with a persistent spin texture, fabricated on a high-$T_{\text{c}}$ cuprate $d_{xy}$-wave superconductors. In two-dimensional $d_{xy}$-wave superconductors, flat-band Andreev bound states appear at the edges. As the system narrows, these bound states acquire an energy gap due to finite-size hybridization and spin-orbit coupling of the persistent spin texture. This induced gap gives rise to the emergence of a topological phase, characterized by an exceptionally large one-dimensional winding number that scales with the film width. We demonstrate the appearance of highly degenerate zero-energy states, leading to anomalous perfect charge transport in dirty superconducting junctions. These findings provide a promising platform for exploring fascinating topological superconducting phases driven by gapped Andreev bound states.
△ Less
Submitted 8 April, 2025;
originally announced April 2025.
-
The evolution of a curve induced by the Pohlmeyer-Lund-Regge equation
Authors:
Shimpei Kobayashi,
Yuhei Kogo,
Nozomu Matsuura
Abstract:
This paper investigates the evolution of space curves governed by the Pohlmeyer-Lund-Regge (PLR) equation, an integrable extension of the sine-Gordon equation. We examine a specific type of curve evolution, known as the Lund-Regge evolution, and derive its representation in the Frenet frame. We show the Frenet frame evolution aligns with the Lax system of the PLR equation and develop a constructio…
▽ More
This paper investigates the evolution of space curves governed by the Pohlmeyer-Lund-Regge (PLR) equation, an integrable extension of the sine-Gordon equation. We examine a specific type of curve evolution, known as the Lund-Regge evolution, and derive its representation in the Frenet frame. We show the Frenet frame evolution aligns with the Lax system of the PLR equation and develop a construction method for curve families via the Sym formula. In conclusion, we describe the Lund-Regge evolution corresponding to Date's multi-soliton solutions to the PLR equation, with illustrations of curves and surfaces.
△ Less
Submitted 8 April, 2025;
originally announced April 2025.
-
Higher-order topological phases for time-reversal-symmetry breaking superconductivity in UTe$_2$
Authors:
Yuki Yamazaki,
Shingo Kobayashi
Abstract:
The recent discovery of heavy-fermion superconductor UTe$_2$ has broadened the possibility of realizing exotic time-reversal-symmetry-breaking superconductivity. However, a comprehensive understanding of the topological phases in the superconducting states of UTe$_2$ is still lacking. Here, we present an exhaustive classification of topological phases for all time-reversal symmetry breaking pairin…
▽ More
The recent discovery of heavy-fermion superconductor UTe$_2$ has broadened the possibility of realizing exotic time-reversal-symmetry-breaking superconductivity. However, a comprehensive understanding of the topological phases in the superconducting states of UTe$_2$ is still lacking. Here, we present an exhaustive classification of topological phases for all time-reversal symmetry breaking pairing symmetries of UTe$_2$. Using the K theoretical classification approach, we uncover that 25 out of 36 possible pairing states are classified as higher-order topological phases, with some demonstrating hybrid-order topology through an intricate interplay of hinge and corner states. Furthermore, under the weak-coupling condition of the pair potentials, the possible pairing symmetries are constrained to $B_{ju} + i B_{ku}$, $A_{u} + i B_{j u}$, and $B_{j g} + iA_u$ ($j,k = 1,2,3$; $j \neq k$), where these symbols denote the irreducible representations of the point group $D_{2h}$. For these pairing states, the topological invariants are related to the Fermi surface topology via the Fermi-surface formula, enabling us to systematically diagnose higher-order topological phases. Using a tight-binding model, we demonstrate the higher-order topological phases of the mixed-parity $A_u + iB_{1g}$ superconductors, where the second-order and hybrid-order topological phases emerge as the number of Fermi surfaces enclosing the time-reversal invariant momentum evolves from two to four. The findings suggest that UTe$_2$ serves as a compelling platform for exploring higher-order topological superconductors with diverse topological surface states.
△ Less
Submitted 2 April, 2025;
originally announced April 2025.
-
Efficient Construction of Model Family through Progressive Training Using Model Expansion
Authors:
Kazuki Yano,
Sho Takase,
Sosuke Kobayashi,
Shun Kiyono,
Jun Suzuki
Abstract:
As Large Language Models (LLMs) gain widespread practical application, providing the model family of different parameter sizes has become standard practice to address diverse computational requirements. Conventionally, each model in a family is trained independently, resulting in computational costs that scale additively with the number of models. We propose an efficient method for constructing th…
▽ More
As Large Language Models (LLMs) gain widespread practical application, providing the model family of different parameter sizes has become standard practice to address diverse computational requirements. Conventionally, each model in a family is trained independently, resulting in computational costs that scale additively with the number of models. We propose an efficient method for constructing the model family through progressive training, where smaller models are incrementally expanded to larger sizes to create a complete model family. Through extensive experiments with a model family ranging from 1B to 8B parameters, we demonstrate that our method reduces computational costs by approximately 25% while maintaining comparable performance to independently trained models. Furthermore, by strategically adjusting maximum learning rates based on model size, our method outperforms the independent training across various metrics. Beyond performance gains, our approach offers an additional advantage: models in our family tend to yield more consistent behavior across different model sizes.
△ Less
Submitted 1 April, 2025;
originally announced April 2025.
-
In-orbit Performance of the Soft X-ray Imaging Telescope Xtend aboard XRISM
Authors:
Hiroyuki Uchida,
Koji Mori,
Hiroshi Tomida,
Hiroshi Nakajima,
Hirofumi Noda,
Takaaki Tanaka,
Hiroshi Murakami,
Hiromasa Suzuki,
Shogo Benjamin Kobayashi,
Tomokage Yoneyama,
Kouichi Hagino,
Kumiko Kawabata Nobukawa,
Hideki Uchiyama,
Masayoshi Nobukawa,
Hironori Matsumoto,
Takeshi Go Tsuru,
Makoto Yamauchi,
Isamu Hatsukade,
Hirokazu Odaka,
Takayoshi Kohmura,
Kazutaka Yamaoka,
Tessei Yoshida,
Yoshiaki Kanemaru,
Daiki Ishi,
Tadayasu Dotani
, et al. (40 additional authors not shown)
Abstract:
We present a summary of the in-orbit performance of the soft X-ray imaging telescope Xtend onboard the XRISM mission, based on in-flight observation data, including first-light celestial objects, calibration sources, and results from the cross-calibration campaign with other currently-operating X-ray observatories. XRISM/Xtend has a large field of view of $38.5'\times38.5'$, covering an energy ran…
▽ More
We present a summary of the in-orbit performance of the soft X-ray imaging telescope Xtend onboard the XRISM mission, based on in-flight observation data, including first-light celestial objects, calibration sources, and results from the cross-calibration campaign with other currently-operating X-ray observatories. XRISM/Xtend has a large field of view of $38.5'\times38.5'$, covering an energy range of 0.4--13 keV, as demonstrated by the first-light observation of the galaxy cluster Abell 2319. It also features an energy resolution of 170--180 eV at 6 keV, which meets the mission requirement and enables to resolve He-like and H-like Fe K$α$ lines. Throughout the observation during the performance verification phase, we confirm that two issues identified in SXI onboard the previous Hitomi mission -- light leakage and crosstalk events -- are addressed and suppressed in the case of Xtend. A joint cross-calibration observation of the bright quasar 3C273 results in an effective area measured to be $\sim420$ cm$^{2}[email protected] keV and $\sim310$ cm$^{2}[email protected] keV, which matches values obtained in ground tests. We also continuously monitor the health of Xtend by analyzing overclocking data, calibration source spectra, and day-Earth observations: the readout noise is stable and low, and contamination is negligible even one year after launch. A low background level compared to other major X-ray instruments onboard satellites, combined with the largest grasp ($Ω_{\rm eff}\sim60$ ${\rm cm^2~degree^2}$) of Xtend, will not only support Resolve analysis, but also enable significant scientific results on its own. This includes near future follow-up observations and transient searches in the context of time-domain and multi-messenger astrophysics.
△ Less
Submitted 25 March, 2025;
originally announced March 2025.
-
Prompt Periodicity in the GRB 211211A Precursor: Black-hole or magnetar engine?
Authors:
Gavin P. Lamb,
Thomas Baxter,
Conor M. B. Omand,
Dimple,
Zoë McGrath,
Cairns Turnbull,
Eric Burns,
Hamid Hamidani,
Ilya Mandel,
Kim L. Page,
Stephan Rosswog,
Nikhil Sarin,
Andrew Blain,
Laurence Datrier,
Shiho Kobayashi,
Andrew Levan,
Rhaana Starling,
Benjamin Gompertz,
Nusrin Habeeb,
Khang Nguyen,
Nial Tanvir
Abstract:
The merger origin long GRB 211211A was a class (re-)defining event. A precursor was identified with a $\sim 1$ s separation from the main burst, as well as a claimed candidate quasi-periodic oscillation (QPO) with a frequency $\sim20$ Hz. Here, we explore the implications of the precursor, assuming the quasi-periodicity is real. The precursor variability timescale requires relativistic motion with…
▽ More
The merger origin long GRB 211211A was a class (re-)defining event. A precursor was identified with a $\sim 1$ s separation from the main burst, as well as a claimed candidate quasi-periodic oscillation (QPO) with a frequency $\sim20$ Hz. Here, we explore the implications of the precursor, assuming the quasi-periodicity is real. The precursor variability timescale requires relativistic motion with a Lorentz factor $Γ\gtrsim80$, and implies an engine driven jetted outflow. The declining amplitude of the consecutive pulses requires an episodic engine with an `on/off' cycle consistent with the QPO. For a black-hole central engine, the QPO can have its origin in Lense-Thirring precession of the inner disk at $\sim6-9$ $r_g$ (gravitational radii) for a mass $M_\bullet\leq4.5$ M$_{\odot}$, and $\lesssim 7$ $r_g$ for $M_\bullet>4.5$ M$_{\odot}$ and dimensionless spin $χ\sim 0.3 - 0.9$. Alternatively, at a disk density of $\sim10^{8 - 12}$ g cm$^{-3}$, the required magnetic field strength for a QPO via magnetohydrodynamic effects will be on the order $B\sim10^{12 - 14}$ G. If the central engine is a short lived magnetar or hypermassive neutron star, then a low-frequency QPO can be produced via instabilities within the disk at a radius of $\sim20 - 70$ km, for a disk density $\sim10^{9 - 12}$ g cm$^{-3}$ and magnetic field $\gtrsim10^{13 - 14}$ G. The QPO cannot be coupled to the neutron star spin, as the co-rotation radius is beyond the scale of the disk. Neither engine can be ruled out -- however, we favour an origin for the precursor candidate QPO as early jet-disk coupling for a neutron star -- black hole merger remnant with mass $M_\bullet>4.5$ M$_{\odot}$.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
Probing the axial symmetry of gamma-ray burst jets using afterglow polarimetry
Authors:
Thomas Baxter,
Shiho Kobayashi
Abstract:
Polarisation measurements of gamma-ray burst afterglows provide a powerful tool for probing the structure of relativistic jets. In this study, we revisit polarisation signals observed in gamma-ray burst afterglows, focusing on the effects of non-axisymmetric jet structures. To characterize these non-axisymmetric jets, we adopt a simple elliptical jet head model and investigate how deviations from…
▽ More
Polarisation measurements of gamma-ray burst afterglows provide a powerful tool for probing the structure of relativistic jets. In this study, we revisit polarisation signals observed in gamma-ray burst afterglows, focusing on the effects of non-axisymmetric jet structures. To characterize these non-axisymmetric jets, we adopt a simple elliptical jet head model and investigate how deviations from axisymmetry influence the temporal evolution of polarisation properties, particularly around the jet break. Our results show that the polarisation degree curve typically exhibits two peaks for top-hat jets or a single peak for structured jets, even in the presence of an elliptical jet head. In non-axisymmetric jets, a complete drop in polarisation between peaks is generally absent, and the position angle rotation between the peaks can deviate significantly from 90 degrees. In single-peak cases, the polarisation position angle evolves gradually, contrasting with the constant position angle expected in axisymmetric jets. We also explore the implications of these findings for recent GRB events, including GRB 121024A, GRB 091018, GRB 020813, and GRB 210610B.
△ Less
Submitted 10 March, 2025;
originally announced March 2025.
-
New CCD Driving Technique to Suppress Anomalous Charge Intrusion from Outside the Imaging Area for Soft X-ray Imager of Xtend onboard XRISM
Authors:
Hirofumi Noda,
Mio Aoyagi,
Koji Mori,
Hiroshi Tomida,
Hiroshi Nakajima,
Takaaki Tanaka,
Hiromasa Suzuki,
Hiroshi Murakami,
Hiroyuki Uchida,
Takeshi G. Tsuru,
Keitaro Miyazaki,
Kohei Kusunoki,
Yoshiaki Kanemaru,
Yuma Aoki,
Kumiko Nobukawa,
Masayoshi Nobukawa,
Kohei Shima,
Marina Yoshimoto,
Kazunori Asakura,
Hironori Matsumoto,
Tomokage Yoneyama,
Shogo B. Kobayashi,
Kouichi Hagino,
Hideki Uchiyama,
Kiyoshi Hayashida
Abstract:
The Soft X-ray Imager (SXI) is an X-ray CCD camera of the Xtend system onboard the X-Ray Imaging and Spectroscopy Mission (XRISM), which was successfully launched on September 7, 2023 (JST). During ground cooling tests of the CCDs in 2020/2021, using the flight-model detector housing, electronic boards, and a mechanical cooler, we encountered an unexpected issue. Anomalous charges appeared outside…
▽ More
The Soft X-ray Imager (SXI) is an X-ray CCD camera of the Xtend system onboard the X-Ray Imaging and Spectroscopy Mission (XRISM), which was successfully launched on September 7, 2023 (JST). During ground cooling tests of the CCDs in 2020/2021, using the flight-model detector housing, electronic boards, and a mechanical cooler, we encountered an unexpected issue. Anomalous charges appeared outside the imaging area of the CCDs and intruded into the imaging area, causing pulse heights to stick to the maximum value over a wide region. Although this issue has not occurred in subsequent tests or in orbit so far, it could seriously affect the imaging and spectroscopic performance of the SXI if it were to happen in the future. Through experiments with non-flight-model detector components, we successfully reproduced the issue and identified that the anomalous charges intrude via the potential structure created by the charge injection electrode at the top of the imaging area. To prevent anomalous charge intrusion and maintain imaging and spectroscopic performance that satisfies the requirements, even if this issue occurs in orbit, we developed a new CCD driving technique. This technique is different from the normal operation in terms of potential structure and its changes during imaging and charge injection. In this paper, we report an overview of the anomalous charge issue, the related potential structures, the development of the new CCD driving technique to prevent the issue, the imaging and spectroscopic performance of the new technique, and the results of experiments to investigate the cause of anomalous charges.
△ Less
Submitted 9 March, 2025;
originally announced March 2025.
-
EP240801a/XRF 240801B: An X-ray Flash Detected by the Einstein Probe and Implications of its Multiband Afterglow
Authors:
Shuai-Qing Jiang,
Dong Xu,
Agnes P. C. van Hoof,
Wei-Hua Lei,
Yuan Liu,
Hao Zhou,
Yong Chen,
Shao-Yu Fu,
Jun Yang,
Xing Liu,
Zi-Pei Zhu,
Alexei V. Filippenko,
Peter G. Jonker,
A. S. Pozanenko,
He Gao,
Xue-Feng Wu,
Bing Zhang,
Gavin P Lamb,
Massimiliano De Pasquale,
Shiho Kobayashi,
Franz Erik Bauer,
Hui Sun,
Giovanna Pugliese,
Jie An,
Valerio D'Elia
, et al. (67 additional authors not shown)
Abstract:
We present multiband observations and analysis of EP240801a, a low-energy, extremely soft gamma-ray burst (GRB) discovered on August 1, 2024 by the Einstein Probe (EP) satellite, with a weak contemporaneous signal also detected by Fermi/GBM. Optical spectroscopy of the afterglow, obtained by GTC and Keck, identified the redshift of $z = 1.6734$. EP240801a exhibits a burst duration of 148 s in X-ra…
▽ More
We present multiband observations and analysis of EP240801a, a low-energy, extremely soft gamma-ray burst (GRB) discovered on August 1, 2024 by the Einstein Probe (EP) satellite, with a weak contemporaneous signal also detected by Fermi/GBM. Optical spectroscopy of the afterglow, obtained by GTC and Keck, identified the redshift of $z = 1.6734$. EP240801a exhibits a burst duration of 148 s in X-rays and 22.3 s in gamma-rays, with X-rays leading by 80.61 s. Spectral lag analysis indicates the gamma-ray signal arrived 8.3 s earlier than the X-rays. Joint spectral fitting of EP/WXT and Fermi/GBM data yields an isotropic energy $E_{γ,\rm{iso}} = (5.57^{+0.54}_{-0.50})\times 10^{51}\,\rm{erg}$, a peak energy $E_{\rm{peak}} = 14.90^{+7.08}_{-4.71}\,\rm{keV}$, a fluence ratio $\rm S(25-50\,\rm{keV})/S(50-100\,\rm{keV}) = 1.67^{+0.74}_{-0.46}$, classifying EP240801a as an X-ray flash (XRF). The host-galaxy continuum spectrum, inferred using Prospector, was used to correct its contribution for the observed outburst optical data. Unusual early $R$-band behavior and EP/FXT observations suggest multiple components in the afterglow. Three models are considered: two-component jet model, forward-reverse shock model and forward-shock model with energy injection. Both three provide reasonable explanations. The two-component jet model and the energy injection model imply a relatively small initial energy and velocity of the jet in the line of sight, while the forward-reverse shock model remains typical. Under the two-component jet model, EP240801a may resemble GRB 221009A (BOAT) if the bright narrow beam is viewed on-axis. Therefore, EP240801a can be interpreted as an off-beam (narrow) jet or an intrinsically weak GRB jet. Our findings provide crucial clues for uncovering the origin of XRFs.
△ Less
Submitted 6 March, 2025;
originally announced March 2025.
-
Nonuniform superconducting states caused by odd-frequency Cooper pairs
Authors:
Takumi Sato,
Satoru Hayami,
Shingo Kobayashi,
Yasuhiro Asano
Abstract:
We discuss the origin of a nonuniform superconducting state in which Cooper pairs have a finite center of mass momentum. The instability to such a nonuniform superconducting state is analyzed by a pole of the pair fluctuation propagator for weak coupling superconductors. The results show that odd(even)-frequency Cooper pairs stabilize a nonuniform (uniform) superconducting phase below the transiti…
▽ More
We discuss the origin of a nonuniform superconducting state in which Cooper pairs have a finite center of mass momentum. The instability to such a nonuniform superconducting state is analyzed by a pole of the pair fluctuation propagator for weak coupling superconductors. The results show that odd(even)-frequency Cooper pairs stabilize a nonuniform (uniform) superconducting phase below the transition temperature. We provide a theoretical framework that explains the reasons for appearing the nonuniform superconducting states.
△ Less
Submitted 3 March, 2025;
originally announced March 2025.
-
Soft X-ray Imager of the Xtend system onboard XRISM
Authors:
Hirofumi Noda,
Koji Mori,
Hiroshi Tomida,
Hiroshi Nakajima,
Takaaki Tanaka,
Hiroshi Murakami,
Hiroyuki Uchida,
Hiromasa Suzuki,
Shogo Benjamin Kobayashi,
Tomokage Yoneyama,
Kouichi Hagino,
Kumiko Nobukawa,
Hideki Uchiyama,
Masayoshi Nobukawa,
Hironori Matsumoto,
Takeshi Go Tsuru,
Makoto Yamauchi,
Isamu Hatsukade,
Hirokazu Odaka,
Takayoshi Kohmura,
Kazutaka Yamaoka,
Tessei Yoshida,
Yoshiaki Kanemaru,
Junko Hiraga,
Tadayasu Dotani
, et al. (35 additional authors not shown)
Abstract:
The Soft X-ray Imager (SXI) is the X-ray charge-coupled device (CCD) camera for the soft X-ray imaging telescope Xtend installed on the X-ray Imaging and Spectroscopy Mission (XRISM), which was adopted as a recovery mission for the Hitomi X-ray satellite and was successfully launched on 2023 September 7 (JST). In order to maximize the science output of XRISM, we set the requirements for Xtend and…
▽ More
The Soft X-ray Imager (SXI) is the X-ray charge-coupled device (CCD) camera for the soft X-ray imaging telescope Xtend installed on the X-ray Imaging and Spectroscopy Mission (XRISM), which was adopted as a recovery mission for the Hitomi X-ray satellite and was successfully launched on 2023 September 7 (JST). In order to maximize the science output of XRISM, we set the requirements for Xtend and find that the CCD set employed in the Hitomi/SXI or similar, i.e., a $2 \times 2$ array of back-illuminated CCDs with a $200~μ$m-thick depletion layer, would be practically best among available choices, when used in combination with the X-ray mirror assembly. We design the XRISM/SXI, based on the Hitomi/SXI, to have a wide field of view of $38' \times 38'$ in the $0.4-13$ keV energy range. We incorporated several significant improvements from the Hitomi/SXI into the CCD chip design to enhance the optical-light blocking capability and to increase the cosmic-ray tolerance, reducing the degradation of charge-transfer efficiency in orbit. By the time of the launch of XRISM, the imaging and spectroscopic capabilities of the SXI has been extensively studied in on-ground experiments with the full flight-model configuration or equivalent setups and confirmed to meet the requirements. The optical blocking capability, the cooling and temperature control performance, and the transmissivity and quantum efficiency to incident X-rays of the CCDs are also all confirmed to meet the requirements. Thus, we successfully complete the pre-flight development of the SXI for XRISM.
△ Less
Submitted 11 February, 2025;
originally announced February 2025.
-
Quantitative noncontact measurement of thermal Hall angle and transverse thermal conductivity by lock-in thermography
Authors:
Takumi Imamura,
Takamasa Hirai,
Koichi Oyanagi,
Ryo Iguchi,
Kenta Takamori,
Satoru Kobayashi,
Ken-ichi Uchida
Abstract:
We propose and demonstrate a quantitative noncontact measurement method for the thermal Hall effect (THE) based on magnetic-field-modulated lock-in thermography. This method enables visualization of THE-induced temperature change and quantitative estimation of the thermal Hall angle $θ_{\rm THE}$ by applying periodic magnetic fields to a sample and obtaining the first harmonic response of thermal…
▽ More
We propose and demonstrate a quantitative noncontact measurement method for the thermal Hall effect (THE) based on magnetic-field-modulated lock-in thermography. This method enables visualization of THE-induced temperature change and quantitative estimation of the thermal Hall angle $θ_{\rm THE}$ by applying periodic magnetic fields to a sample and obtaining the first harmonic response of thermal images. By combining this method with LIT-based measurement techniques for the longitudinal thermal conductivity $κ_{xx}$, we also quantify the transverse thermal conductivity $κ_{xy}$. We validate our measurement methods by estimating $θ_{\rm THE}$, $κ_{xx}$, and $κ_{xy}$ in a ferromagnetic Heusler alloy Co$_2$MnGa slab showing large THE.
△ Less
Submitted 6 February, 2025; v1 submitted 17 January, 2025;
originally announced January 2025.
-
Simultaneous achievement of large anomalous Nernst effect and reduced thermal conductivity in sintered polycrystalline topological Heusler ferromagnets
Authors:
Koichi Oyanagi,
Hossein Sepehri-Amin,
Kenta Takamori,
Terumasa Tadano,
Takumi Imamura,
Ren Nagasawa,
Krishnan Mahalingam,
Takamasa Hirai,
Fuyuki Ando,
Yuya Sakuraba,
Satoru Kobayashi,
Ken-ichi Uchida
Abstract:
This study reports the observation of the large anomalous Nernst effect in polycrystalline ferromagnetic Co$_{2}$MnGa (CMG) slabs prepared by a spark plasma sintering method. By optimizing the sintering conditions, the anomalous Nernst coefficient reaches ~7.5 $μ$V K$^{-1}$ at room temperature, comparable to the highest value reported in the single-crystalline CMG slabs. Owing to the sizable anoma…
▽ More
This study reports the observation of the large anomalous Nernst effect in polycrystalline ferromagnetic Co$_{2}$MnGa (CMG) slabs prepared by a spark plasma sintering method. By optimizing the sintering conditions, the anomalous Nernst coefficient reaches ~7.5 $μ$V K$^{-1}$ at room temperature, comparable to the highest value reported in the single-crystalline CMG slabs. Owing to the sizable anomalous Nernst coefficient and reduced thermal conductivity, the dimensionless figure of merit in our optimized CMG slab shows the record-high value of ~8$\times$10$^{-4}$ at room temperature. With the aid of the nano/microstructure characterization and first-principles phonon calculation, this study discusses the dependence of the transport properties on the degree of crystalline ordering and morphology of crystal-domain boundaries in the sintered CMG slabs. The results reveal a potential of polycrystalline topological materials for transverse thermoelectric applications, enabling the construction of large-scale modules.
△ Less
Submitted 17 January, 2025;
originally announced January 2025.
-
In-situ high voltage generation with Cockcroft-Walton multiplier for xenon gas time projection chamber
Authors:
Shinichi Akiyama,
Junya Hikida,
Masashi Yoshida,
Kazuhiro Nakamura,
Sei Ban,
Masanori Hirose,
Atsuko K. Ichikawa,
Yoshihisa Iwashita,
Tatsuya Kikawa,
Yasuhiro Nakajima,
Kiseki D. Nakamura,
Tsuyoshi Nakaya,
Shuhei Obara,
Ken Sakashita,
Hiroyuki Sekiya,
Bungo Sugashima,
Soki Urano,
Sota Hatsumi,
Sota Kobayashi,
Hayato Sasaki
Abstract:
We have newly developed a Cockcroft-Walton (CW) multiplier that can be used in a gas time projection chamber (TPC). A TPC requires a high voltage to form an electric field that drifts ionization electrons. Supplying the high voltage from outside the pressure vessel requires a dedicated high-voltage feedthrough. An alternative approach is to generate the high voltage inside the pressure vessel with…
▽ More
We have newly developed a Cockcroft-Walton (CW) multiplier that can be used in a gas time projection chamber (TPC). A TPC requires a high voltage to form an electric field that drifts ionization electrons. Supplying the high voltage from outside the pressure vessel requires a dedicated high-voltage feedthrough. An alternative approach is to generate the high voltage inside the pressure vessel with a relatively low voltage introduced from outside. A CW multiplier can convert a low AC voltage input to a high DC voltage output, making it suitable for this purpose.
We have integrated a CW multiplier into the AXEL (A Xenon ElectroLuminescence detector), a high pressure xenon gas TPC to search for neutrinoless double beta decay of $^{136}$Xe. It uses silicon photomultipliers to detect the ionization electrons through elecrtoluminescence, making it strong against electronic noise. Operation of the CW multiplier was successfully demonstrated; the TPC was operated for 40 days at 6.8 bar, and an energy resolution as high as (0.67 $\pm$ 0.08) % (FWHM) at 2615 keV was obtained.
△ Less
Submitted 7 May, 2025; v1 submitted 14 January, 2025;
originally announced January 2025.
-
Gauss maps of Möbius surfaces in the $n$-dimensional sphere
Authors:
David Brander,
Shimpei Kobayashi,
Peng Wang
Abstract:
In this note we discuss Gauss maps for Möbius surfaces in the $n$-sphere, and their applications in the study of Willmore surfaces. One such ``Gauss map'', naturally associated to a Willmore surface that has a dual Willmore surface, is the Lorentzian $2$-plane bundle given by a lift of the suface and its dual. More generally, we define the concept of a Lorentzian $2$-plane lift for an arbitrary Mö…
▽ More
In this note we discuss Gauss maps for Möbius surfaces in the $n$-sphere, and their applications in the study of Willmore surfaces. One such ``Gauss map'', naturally associated to a Willmore surface that has a dual Willmore surface, is the Lorentzian $2$-plane bundle given by a lift of the suface and its dual. More generally, we define the concept of a Lorentzian $2$-plane lift for an arbitrary Möbius surface, and show that the conformal harmonicity of this lift is equivalent to the Willmore condition for the surface. This clarifies some previous work of F. Hélein, Q. Xia-Y Shen, X. Ma and others, and, for instance, allows for the treatment of the Björling problem for Willmore surfaces in the presence of umbilics.
△ Less
Submitted 15 December, 2024;
originally announced December 2024.
-
Revisiting Volterra defects: Geometrical relation between edge dislocations and wedge disclinations
Authors:
Shunsuke Kobayashi,
Katsumi Takemasa,
Ryuichi Tarumi
Abstract:
This study presents a comprehensive mathematical model for Volterra defects and explores their relations using differential geometry on Riemann--Cartan manifolds. Following the standard Volterra process, we derived the Cartan moving frame, a geometric representation of plastic fields, and the associated Riemannian metric using exterior algebra. Although the analysis naturally defines the geometry…
▽ More
This study presents a comprehensive mathematical model for Volterra defects and explores their relations using differential geometry on Riemann--Cartan manifolds. Following the standard Volterra process, we derived the Cartan moving frame, a geometric representation of plastic fields, and the associated Riemannian metric using exterior algebra. Although the analysis naturally defines the geometry of three types of dislocations and the wedge disclination, it fails to classify twist disclinations owing to the persistent torsion component, suggesting the need for modifications to the Volterra process. By leveraging the interchangeability of the Weitzenböck and Levi-Civita connections and applying an analytical solution for plasticity derived from the Biot--Savart law, we provide a rigorous mathematical proof of the long-standing phenomenological relationship between edge dislocations and wedge disclinations. Additionally, we showcase the effectiveness of novel mathematical tools, including Riemannian holonomy for analysing the Frank vector and complex potentials that encapsulate the topological properties of wedge disclinations as jump discontinuities. Furthermore, we derive analytical expressions for the linearized stress fields of wedge disclinations and confirm their consistency with existing results. These findings demonstrate that the present geometrical framework extends and generalizes the classical theory of Volterra defects.
△ Less
Submitted 17 December, 2024; v1 submitted 12 December, 2024;
originally announced December 2024.
-
Detection of extended X-ray emission around the PeVatron microquasar V4641 Sgr with XRISM
Authors:
Hiromasa Suzuki,
Naomi Tsuji,
Yoshiaki Kanemaru,
Megumi Shidatsu,
Laura Olivera-Nieto,
Samar Safi-Harb,
Shigeo S. Kimura,
Eduardo de la Fuente,
Sabrina Casanova,
Kaya Mori,
Xiaojie Wang,
Sei Kato,
Dai Tateishi,
Hideki Uchiyama,
Takaaki Tanaka,
Hiroyuki Uchida,
Shun Inoue,
Dezhi Huang,
Marianne Lemoine-Goumard,
Daiki Miura,
Shoji Ogawa,
Shogo B. Kobayashi,
Chris Done,
Maxime Parra,
María Díaz Trigo
, et al. (4 additional authors not shown)
Abstract:
A recent report on the detection of very-high-energy gamma rays from V4641 Sagittarii (V4641 Sgr) up to ~0.8 peta-electronvolt has made it the second confirmed "PeVatron" microquasar. Here we report on the observation of V4641 Sgr with X-Ray Imaging and Spectroscopy Mission (XRISM) in September 2024. Thanks to the large field of view and low background, the CCD imager Xtend successfully detected f…
▽ More
A recent report on the detection of very-high-energy gamma rays from V4641 Sagittarii (V4641 Sgr) up to ~0.8 peta-electronvolt has made it the second confirmed "PeVatron" microquasar. Here we report on the observation of V4641 Sgr with X-Ray Imaging and Spectroscopy Mission (XRISM) in September 2024. Thanks to the large field of view and low background, the CCD imager Xtend successfully detected for the first time X-ray extended emission around V4641 Sgr with a significance of > 4.5 sigma and > 10 sigma based on our imaging and spectral analysis, respectively. The spatial extent is estimated to have a radius of $7 \pm 3$ arcmin ($13 \pm 5$ pc at a distance of 6.2 kpc) assuming a Gaussian-like radial distribution, which suggests that the particle acceleration site is within ~10 pc of the microquasar. If the X-ray morphology traces the diffusion of accelerated electrons, this spatial extent can be explained by either an enhanced magnetic field (~80 uG) or a suppressed diffusion coefficient (~$10^{27}$ cm$^2$ s$^{-1}$ at 100 TeV). The integrated X-ray flux, (4-6)$\times 10^{-12}$ erg s$^{-1}$ cm$^{-2}$ (2-10 keV), would require a magnetic field strength higher than the galactic mean (> 8 uG) if the diffuse X-ray emission originates from synchrotron radiation and the gamma-ray emission is predominantly hadronic. If the X-rays are of thermal origin, the measured extension, temperature, and plasma density can be explained by a jet with a luminosity of ~$2\times 10^{39}$ erg s$^{-1}$, which is comparable to the Eddington luminosity of this system.
△ Less
Submitted 19 December, 2024; v1 submitted 10 December, 2024;
originally announced December 2024.
-
Black $p$-Branes in Heterotic String Theory
Authors:
Masaki Fukuda,
Shun K. Kobayashi,
Kento Watanabe,
Kazuya Yonekura
Abstract:
Heterotic string theory has nonsupersymmetric branes whose existence is suggested by the cobordism conjecture. We numerically construct static, spherically symmetric, and asymptotically flat black brane solutions in ten-dimensional heterotic superstring theories for 0- and 4-branes. These branes carry charges that are measured by Chern classes on the sphere surrounding the branes. For the extremal…
▽ More
Heterotic string theory has nonsupersymmetric branes whose existence is suggested by the cobordism conjecture. We numerically construct static, spherically symmetric, and asymptotically flat black brane solutions in ten-dimensional heterotic superstring theories for 0- and 4-branes. These branes carry charges that are measured by Chern classes on the sphere surrounding the branes. For the extremal case, the solutions have a throat region with a linear dilaton profile as expected from the corresponding world-sheet theory. We also construct non-extremal solutions by compactifying the time direction. To verify the reliability of our numerical calculations, we confirm that they reproduce the known analytical solutions for the 6-brane. Our black brane solutions provide evidence supporting the existence of such branes in heterotic string theory.
△ Less
Submitted 3 December, 2024;
originally announced December 2024.
-
Thermoelectric effect in a superconductor with Bogoliubov Fermi surfaces
Authors:
Tomoya Sano,
Takumi Sato,
Akihiro Sasaki,
Satoshi Ikegaya,
Shingo Kobayashi,
Yasuhiro Asano
Abstract:
We study theoretically the thermoelectric effect in a superconducting state having the Bogoliubov-Fermi surfaces which stays in a thin superconducting layer between a conventional superconductor and an insulator. The thermoelectric coefficients calculated based on the linear response theory show the remarkable anisotropy in real space, which are explained well by the anisotropic shape of the Bogol…
▽ More
We study theoretically the thermoelectric effect in a superconducting state having the Bogoliubov-Fermi surfaces which stays in a thin superconducting layer between a conventional superconductor and an insulator. The thermoelectric coefficients calculated based on the linear response theory show the remarkable anisotropy in real space, which are explained well by the anisotropic shape of the Bogoliubov-Fermi surface in momentum space. Our results indicate a way to check the existence of the Bogoliubov-Fermi surfaces in a stable superconducting state because the anisotropy is controlled by the direction of an applied magnetic field.
△ Less
Submitted 16 March, 2025; v1 submitted 10 November, 2024;
originally announced November 2024.
-
Weight decay induces low-rank attention layers
Authors:
Seijin Kobayashi,
Yassir Akram,
Johannes Von Oswald
Abstract:
The effect of regularizers such as weight decay when training deep neural networks is not well understood. We study the influence of weight decay as well as $L2$-regularization when training neural network models in which parameter matrices interact multiplicatively. This combination is of particular interest as this parametrization is common in attention layers, the workhorse of transformers. Her…
▽ More
The effect of regularizers such as weight decay when training deep neural networks is not well understood. We study the influence of weight decay as well as $L2$-regularization when training neural network models in which parameter matrices interact multiplicatively. This combination is of particular interest as this parametrization is common in attention layers, the workhorse of transformers. Here, key-query, as well as value-projection parameter matrices, are multiplied directly with each other: $W_K^TW_Q$ and $PW_V$. We extend previous results and show on one hand that any local minimum of a $L2$-regularized loss of the form $L(AB^\top) + λ(\|A\|^2 + \|B\|^2)$ coincides with a minimum of the nuclear norm-regularized loss $L(AB^\top) + λ\|AB^\top\|_*$, and on the other hand that the 2 losses become identical exponentially quickly during training. We thus complement existing works linking $L2$-regularization with low-rank regularization, and in particular, explain why such regularization on the matrix product affects early stages of training. Based on these theoretical insights, we verify empirically that the key-query and value-projection matrix products $W_K^TW_Q, PW_V$ within attention layers, when optimized with weight decay, as usually done in vision tasks and language modelling, indeed induce a significant reduction in the rank of $W_K^TW_Q$ and $PW_V$, even in fully online training. We find that, in accordance with existing work, inducing low rank in attention matrix products can damage language model performance, and observe advantages when decoupling weight decay in attention layers from the rest of the parameters.
△ Less
Submitted 31 October, 2024;
originally announced October 2024.
-
Multi-agent cooperation through learning-aware policy gradients
Authors:
Alexander Meulemans,
Seijin Kobayashi,
Johannes von Oswald,
Nino Scherrer,
Eric Elmoznino,
Blake Richards,
Guillaume Lajoie,
Blaise Agüera y Arcas,
João Sacramento
Abstract:
Self-interested individuals often fail to cooperate, posing a fundamental challenge for multi-agent learning. How can we achieve cooperation among self-interested, independent learning agents? Promising recent work has shown that in certain tasks cooperation can be established between learning-aware agents who model the learning dynamics of each other. Here, we present the first unbiased, higher-d…
▽ More
Self-interested individuals often fail to cooperate, posing a fundamental challenge for multi-agent learning. How can we achieve cooperation among self-interested, independent learning agents? Promising recent work has shown that in certain tasks cooperation can be established between learning-aware agents who model the learning dynamics of each other. Here, we present the first unbiased, higher-derivative-free policy gradient algorithm for learning-aware reinforcement learning, which takes into account that other agents are themselves learning through trial and error based on multiple noisy trials. We then leverage efficient sequence models to condition behavior on long observation histories that contain traces of the learning dynamics of other agents. Training long-context policies with our algorithm leads to cooperative behavior and high returns on standard social dilemmas, including a challenging environment where temporally-extended action coordination is required. Finally, we derive from the iterated prisoner's dilemma a novel explanation for how and when cooperation arises among self-interested learning-aware agents.
△ Less
Submitted 19 March, 2025; v1 submitted 24 October, 2024;
originally announced October 2024.
-
Curling morphology of knitted fabrics: Structure and Mechanics
Authors:
Kotone Tajiri,
Riki Murakami,
Shunsuke Kobayashi,
Ryuichi Tarumi,
Tomohiko G. Sano
Abstract:
Knitted fabrics are two-dimensional-like structures formed by stitching one-dimensional yarn into three-dimensional curves. Plain stitch or stockinette stitch, one of the most fundamental knitting stitches, consists of periodic lattices of bent yarns, where three-dimensional (3D) curling behavior naturally emerges at the edges. The elasticity and geometry of knitted fabrics have been studied in pr…
▽ More
Knitted fabrics are two-dimensional-like structures formed by stitching one-dimensional yarn into three-dimensional curves. Plain stitch or stockinette stitch, one of the most fundamental knitting stitches, consists of periodic lattices of bent yarns, where three-dimensional (3D) curling behavior naturally emerges at the edges. The elasticity and geometry of knitted fabrics have been studied in previous studies, primarily based on 2D modeling. Still, the relation between 3D geometry and the mechanics of knitted fabrics has not been clarified so far. The curling behavior of knits is intricately related to the forces and moments acting on the yarns, geometry of the unit knitted loops, mechanical properties, and contacts, hence requiring a 3D analysis. Here, we show that the curling of plain knits emerges through the elasticity and geometry of the knitted loops, combining desktop-scale experiments and reduced elasticity-based simulations. We find that by changing the horizontal and vertical knitting numbers, three types of curl shapes emerge: side curl and top/bottom curl shapes, which are curled only horizontally and vertically, and double curl shape, in which both curl shapes appear together. The fundamental mechanism of intricate shape deformation is clarified through the force and moment balance along yarn whose centerline shape is discretized through the B-spline curves where elastic stretching, bending, and contact mechanics are taken into account. We reveal that the 3D structure of the single-knitted loop plays a critical role in the curling behavior. Our results imply that the change in shape per a single knitted loop has the potential to control the 3D natural overall shape of knitted fabrics, and could be applied in predicting or designing more complex 3D shapes made of knitted fabrics.
△ Less
Submitted 17 October, 2024;
originally announced October 2024.
-
Snap and Jump: How Elastic Shells Pop Out
Authors:
Takara Abe,
Isamu Hashiguchi,
Yukitake Nakahara,
Shunsuke Kobayashi,
Ryuichi Tarumi,
Hidetoshi Takahashi,
Genya Ishigami,
Tomohiko G. Sano
Abstract:
Grip, walk, crawl, and jump. Soft robots are integrated functional structures composed of compliant mechanisms, whose activity spans various industrial applications such as surgery, healthcare, surveillance, and even planetary exploration. One of their promising mobility mechanism is snap-buckling; the instability mode of flexible structures passing from one equilibrium state to another can instan…
▽ More
Grip, walk, crawl, and jump. Soft robots are integrated functional structures composed of compliant mechanisms, whose activity spans various industrial applications such as surgery, healthcare, surveillance, and even planetary exploration. One of their promising mobility mechanism is snap-buckling; the instability mode of flexible structures passing from one equilibrium state to another can instantaneously generate large power for its motion. Predicting their performance with even simple geometry requires disentangling material, geometric nonlinearity, and contact, thereby still being a challenging problem to date. Here, we study the jumping dynamics of hemispherical elastic shells driven by snap-buckling, as a model system of soft jumping mechanisms, combining experiments, simulations, and analytical theory. We find that the contact transition dynamics trigger the jumping phenomenon upon snap-buckling by constructing the analytical predictions with shell elasticity in excellent agreement with both experiments and simulations. Despite the simple geometry of the shell, its dynamical performance primarily relies on a complex interplay between elasticity, geometry, and contact friction. By elucidating the dynamics of the building blocks of soft robots that undergo large deformations, we can build their predictive experimental and numerical framework. Our research paves the way for designing soft robots suitable for the required loading conditions or structural requirements without empirical methods.
△ Less
Submitted 10 December, 2024; v1 submitted 11 October, 2024;
originally announced October 2024.
-
Coherence influx is indispensable for quantum reservoir computing
Authors:
Shumpei Kobayashi,
Quoc Hoan Tran,
Kohei Nakajima
Abstract:
Echo state property (ESP) is a fundamental property that allows an input-driven dynamical system to perform information processing tasks. Recently, extensions of ESP to potentially nonstationary systems and subsystems, that is, nonstationary ESP and subset/subspace ESP, have been proposed. In this paper, we theoretically and numerically analyze the sufficient and necessary conditions for a quantum…
▽ More
Echo state property (ESP) is a fundamental property that allows an input-driven dynamical system to perform information processing tasks. Recently, extensions of ESP to potentially nonstationary systems and subsystems, that is, nonstationary ESP and subset/subspace ESP, have been proposed. In this paper, we theoretically and numerically analyze the sufficient and necessary conditions for a quantum system to satisfy nonstationary ESP and subset/subspace nonstationary ESP. Based on extensive usage of the Pauli transfer matrix (PTM) form, we find that (1) the interaction with a quantum-coherent environment, termed $\textit{coherence influx}$, is indispensable in realizing nonstationary ESP, and (2) the spectral radius of PTM can characterize the fading memory property of quantum reservoir computing (QRC). Our numerical experiment, involving a system with a Hamiltonian that entails a spin-glass/many-body localization phase, reveals that the spectral radius of PTM can describe the dynamical phase transition intrinsic to such a system. To comprehensively understand the mechanisms under ESP of QRC, we propose a simplified model, multiplicative reservoir computing (mRC), which is a reservoir computing (RC) system with a one-dimensional multiplicative input. Theoretically and numerically, we show that the parameters corresponding to the spectral radius and coherence influx in mRC directly correlates with its linear memory capacity (MC). Our findings about QRC and mRC will provide a theoretical aspect of PTM and the input multiplicativity of QRC. The results will lead to a better understanding of QRC and information processing in open quantum systems.
△ Less
Submitted 22 September, 2024; v1 submitted 19 September, 2024;
originally announced September 2024.
-
Dynamic Link and Flow Prediction in Bank Transfer Networks
Authors:
Shu Takahashi,
Kento Yamamoto,
Shumpei Kobayashi,
Ryoma Kondo,
Ryohei Hisano
Abstract:
The prediction of both the existence and weight of network links at future time points is essential as complex networks evolve over time. Traditional methods, such as vector autoregression and factor models, have been applied to small, dense networks, but become computationally impractical for large-scale, sparse, and complex networks. Some machine learning models address dynamic link prediction,…
▽ More
The prediction of both the existence and weight of network links at future time points is essential as complex networks evolve over time. Traditional methods, such as vector autoregression and factor models, have been applied to small, dense networks, but become computationally impractical for large-scale, sparse, and complex networks. Some machine learning models address dynamic link prediction, but few address the simultaneous prediction of both link presence and weight. Therefore, we introduce a novel model that dynamically predicts link presence and weight by dividing the task into two sub-tasks: predicting remittance ratios and forecasting the total remittance volume. We use a self-attention mechanism that combines temporal-topological neighborhood features to predict remittance ratios and use a separate model to forecast the total remittance volume. We achieve the final prediction by multiplying the outputs of these models. We validated our approach using two real-world datasets: a cryptocurrency network and bank transfer network.
△ Less
Submitted 14 October, 2024; v1 submitted 13 September, 2024;
originally announced September 2024.
-
Biot-Savart law in the geometrical theory of dislocations
Authors:
Shunsuke Kobayashi,
Ryuichi Tarumi
Abstract:
Universal mechanical principles may exist behind seemingly unrelated physical phenomena, providing novel insights into these phenomena. This study sheds light on the geometrical theory of dislocations through an analogy with electromagnetics. In this theory, solving Cartan's first structure equation is essential for connecting the dislocation density to the plastic deformation field of the disloca…
▽ More
Universal mechanical principles may exist behind seemingly unrelated physical phenomena, providing novel insights into these phenomena. This study sheds light on the geometrical theory of dislocations through an analogy with electromagnetics. In this theory, solving Cartan's first structure equation is essential for connecting the dislocation density to the plastic deformation field of the dislocations. The additional constraint of a divergence-free condition, derived from the Helmholtz decomposition, forms the governing equations that mirror Ampère's and Gauss' law in electromagnetics. This allows for the analytical integration of the equations using the Biot-Savart law. The plastic deformation fields of screw and edge dislocations obtained through this process form both a vortex and an orthogonal coordinate system on the cross-section perpendicular to the dislocation line. This orthogonality is rooted in the conformal property of the corresponding complex function that satisfies the Cauchy-Riemann equations, leading to the complex potential of plastic deformation. We validate the results through a comparison with the classical dislocation theory. The incompatibility tensor is crucial in the generation of the mechanical field. These findings reveal a profound unification of dislocation theories, electromagnetics, and complex functions through their underlying mathematical parallels.
△ Less
Submitted 8 September, 2024; v1 submitted 5 September, 2024;
originally announced September 2024.
-
New results on the gamma-ray burst variability-luminosity relation
Authors:
C. Guidorzi,
R. Maccary,
A. Tsvetkova,
S. Kobayashi,
L. Amati,
L. Bazzanini,
M. Bulla,
A. E. Camisasca,
L. Ferro,
D. Frederiks,
F. Frontera,
A. Lysenko,
M. Maistrello,
A. Ridnaia,
D. Svinkin,
M. Ulanov
Abstract:
At the dawn of the gamma-ray burst (GRB) afterglow era, a Cepheid-like correlation was discovered between time variability V and isotropic-equivalent peak luminosity Liso of the prompt emission of about a dozen long GRBs with measured redshift available at that time. Soon afterwards, the correlation was confirmed against a sample of about 30 GRBs, despite being affected by significant scatter. Unl…
▽ More
At the dawn of the gamma-ray burst (GRB) afterglow era, a Cepheid-like correlation was discovered between time variability V and isotropic-equivalent peak luminosity Liso of the prompt emission of about a dozen long GRBs with measured redshift available at that time. Soon afterwards, the correlation was confirmed against a sample of about 30 GRBs, despite being affected by significant scatter. Unlike the minimum variability timescale (MVT), V measures the relative power of short-to-intermediate timescales. We aim to test the correlation using about two hundred long GRBs with spectroscopically measured redshift, detected by Swift, Fermi, and Konus/WIND, for which both observables can be accurately estimated. For all the selected GRBs, variability was calculated according to the original definition using the 64-ms background-subtracted light curves of Swift/BAT (Fermi/GBM) in the 15-150 (8-900) keV energy passband. Peak luminosities were either taken from literature or derived from modelling broad-band spectra acquired with either Konus/WIND or Fermi/GBM. The statistical significance of the correlation has weakened to <~2%, mostly due to the appearance of a number of smooth and luminous GRBs characterised by a relatively small V. At odds with most long GRBs, 3 out of 4 long-duration merger candidates have high V and low Liso. Luminosity is more tightly connected with shortest timescales measured by MVT rather than short-to-intermediate ones, measured by V. We discuss the implications on internal dissipation models and the role of the e+- photosphere. We identified a few, smooth GRBs with a single broad pulse and low V, that might have an external shock origin, in contrast with most GRBs. The combination of high variability (V>~0.1), low luminosity (Liso<~10^51 erg s^-1) and short MVT (<~ 0.1 s) could be a good indicator for a compact binary merger origin.
△ Less
Submitted 4 September, 2024; v1 submitted 3 September, 2024;
originally announced September 2024.
-
A millimeter rebrightening in GRB 210702A
Authors:
Simon de Wet,
Tanmoy Laskar,
Paul J. Groot,
Rodolfo Barniol Duran,
Edo Berger,
Shivani Bhandari,
Tarraneh Eftekhari,
C. Guidorzi,
Shiho Kobayashi,
Daniel A. Perley,
Re'em Sari,
Genevieve Schroeder
Abstract:
We present X-ray to radio frequency observations of the bright long gamma-ray burst GRB 210702A. Our ALMA 97.5 GHz observations show a significant rebrightening by a factor of ~2 beginning at 8.2 days post-burst and rising to peak brightness at 18.1 days before declining again. This is the first such rebrightening seen in a millimeter afterglow light curve. A standard forward shock model in a stel…
▽ More
We present X-ray to radio frequency observations of the bright long gamma-ray burst GRB 210702A. Our ALMA 97.5 GHz observations show a significant rebrightening by a factor of ~2 beginning at 8.2 days post-burst and rising to peak brightness at 18.1 days before declining again. This is the first such rebrightening seen in a millimeter afterglow light curve. A standard forward shock model in a stellar wind circumburst medium can explain most of our X-ray, optical and millimeter observations prior to the rebrightening, but significantly over-predicts the self-absorbed radio emission, and cannot explain the millimeter rebrightening. We investigate possible explanations for the millimeter rebrightening and find that energy injection or a reverse shock from a late-time shell collision are plausible causes. Similar to other bursts, our radio data may require alternative scenarios such as a thermal electron population or a structured jet to explain the data. Our observations demonstrate that millimeter light curves can exhibit some of the rich features more commonly seen in optical and X-ray afterglow light curves, motivating further millimeter wavelength studies of GRB afterglows.
△ Less
Submitted 26 August, 2024;
originally announced August 2024.
-
Hecke $L$-values, definite Shimura sets and Mod $\ell$ non-vanishing
Authors:
Ashay A. Burungale,
Wei He,
Shinichi Kobayashi,
Kazuto Ota
Abstract:
Let $λ$ be a self-dual Hecke character over an imaginary quadratic field $K$ of infinity type $(1,0)$. Let $\ell$ and $p$ be primes which are coprime to $6N_{K/\mathbb{Q}}({\mathrm cond}(λ))$. We determine the $\ell$-adic valuation of Hecke $L$-values $L(1,λχ)/Ω_K$ as $χ$ varies over $p$-power order anticyclotomic characters over $K$. As an application, for $p$ inert in $K$, we prove the vanishing…
▽ More
Let $λ$ be a self-dual Hecke character over an imaginary quadratic field $K$ of infinity type $(1,0)$. Let $\ell$ and $p$ be primes which are coprime to $6N_{K/\mathbb{Q}}({\mathrm cond}(λ))$. We determine the $\ell$-adic valuation of Hecke $L$-values $L(1,λχ)/Ω_K$ as $χ$ varies over $p$-power order anticyclotomic characters over $K$. As an application, for $p$ inert in $K$, we prove the vanishing of the $μ$-invariant of Rubin's $p$-adic $L$-function, leading to the first results on the $μ$-invariant of imaginary quadratic fields at non-split primes.
Our approach and results complement the work of Hida and Finis. The approach is rooted in the arithmetic of a CM form on a definite Shimura set.The application to Rubin's $p$-adic $L$-function also relies on the proof of his conjecture. Along the way, we present an automorphic view on Rubin's theory.
△ Less
Submitted 8 April, 2025; v1 submitted 25 August, 2024;
originally announced August 2024.
-
Learning Randomized Algorithms with Transformers
Authors:
Johannes von Oswald,
Seijin Kobayashi,
Yassir Akram,
Angelika Steger
Abstract:
Randomization is a powerful tool that endows algorithms with remarkable properties. For instance, randomized algorithms excel in adversarial settings, often surpassing the worst-case performance of deterministic algorithms with large margins. Furthermore, their success probability can be amplified by simple strategies such as repetition and majority voting. In this paper, we enhance deep neural ne…
▽ More
Randomization is a powerful tool that endows algorithms with remarkable properties. For instance, randomized algorithms excel in adversarial settings, often surpassing the worst-case performance of deterministic algorithms with large margins. Furthermore, their success probability can be amplified by simple strategies such as repetition and majority voting. In this paper, we enhance deep neural networks, in particular transformer models, with randomization. We demonstrate for the first time that randomized algorithms can be instilled in transformers through learning, in a purely data- and objective-driven manner. First, we analyze known adversarial objectives for which randomized algorithms offer a distinct advantage over deterministic ones. We then show that common optimization techniques, such as gradient descent or evolutionary strategies, can effectively learn transformer parameters that make use of the randomness provided to the model. To illustrate the broad applicability of randomization in empowering neural networks, we study three conceptual tasks: associative recall, graph coloring, and agents that explore grid worlds. In addition to demonstrating increased robustness against oblivious adversaries through learned randomization, our experiments reveal remarkable performance improvements due to the inherently random nature of the neural networks' computation and predictions.
△ Less
Submitted 20 August, 2024;
originally announced August 2024.
-
Discovery of a hyperluminous quasar at z = 1.62 with Eddington ratio > 3 in the eFEDS field confirmed by KOOLS-IFU on Seimei Telescope
Authors:
Yoshiki Toba,
Keito Masu,
Naomi Ota,
Zhen-Kai Gao,
Masatoshi Imanishi,
Anri Yanagawa,
Satoshi Yamada,
Itsuki Dosaka,
Takumi Kakimoto,
Seira Kobayashi,
Neiro Kurokawa,
Aika Oki,
Sorami Soga,
Kohei Shibata,
Sayaka Takeuchi,
Yukana Tsujita,
Tohru Nagao,
Masayuki Tanaka,
Yoshihiro Ueda,
Wei-Hao Wang
Abstract:
We report the discovery of a hyperluminous type 1 quasar (eFEDS J082826.9-013911; eFEDSJ0828-0139) at $z_{\rm spec}$ = 1.622 with a super-Eddington ratio ($λ_{\rm Edd}$). We perform the optical spectroscopic observations with KOOLS-IFU on the Seimei Telescope. The black hole mass ($M_{\rm BH}$) based on the single-epoch method with MgII $λ$2798 is estimated to be…
▽ More
We report the discovery of a hyperluminous type 1 quasar (eFEDS J082826.9-013911; eFEDSJ0828-0139) at $z_{\rm spec}$ = 1.622 with a super-Eddington ratio ($λ_{\rm Edd}$). We perform the optical spectroscopic observations with KOOLS-IFU on the Seimei Telescope. The black hole mass ($M_{\rm BH}$) based on the single-epoch method with MgII $λ$2798 is estimated to be $M_{\rm BH} = (6.2 \pm 1.2) \times 10^8$ $M_{\odot}$. To measure the precise infrared luminosity ($L_{\rm IR}$), we obtain submillimeter data taken by SCUBA-2 on JCMT and conduct the spectral energy distribution analysis with X-ray to submillimeter data. We find that $L_{\rm IR}$ of eFEDSJ0828-0139 is $L_{\rm IR} = (6.8 \pm 1.8) \times 10^{13}$ $L_{\odot}$, confirming the existence of a hypeluminous infrared galaxy (HyLIRG). $λ_{\rm Edd}$ is estimated to be $λ_{\rm Edd} = 3.6 \pm 0.7$, making it one of the quasars with the highest BH mass accretion rate at cosmic noon.
△ Less
Submitted 15 August, 2024;
originally announced August 2024.
-
Photon energy reconstruction with the MEG II liquid xenon calorimeter
Authors:
Kensuke Yamamoto,
Sei Ban,
Lukas Gerritzen,
Toshiyuki Iwamoto,
Satoru Kobayashi,
Ayaka Matsushita,
Toshinori Mori,
Rina Onda,
Wataru Ootani,
Atsushi Oya
Abstract:
The MEG II experiment searches for a charged-lepton-flavour-violating $μ\to e γ$ with the target sensitivity of $6 \times 10^{-14}$. A liquid xenon calorimeter with VUV-sensitive photosensors measures photon position, timing, and energy. This paper concentrates on the precise photon energy reconstruction with the MEG II liquid xenon calorimeter. Since a muon beam rate is…
▽ More
The MEG II experiment searches for a charged-lepton-flavour-violating $μ\to e γ$ with the target sensitivity of $6 \times 10^{-14}$. A liquid xenon calorimeter with VUV-sensitive photosensors measures photon position, timing, and energy. This paper concentrates on the precise photon energy reconstruction with the MEG II liquid xenon calorimeter. Since a muon beam rate is $3\text{-}5 \times 10^{7}~\text{s}^{-1}$, multi-photon elimination analysis is performed using waveform analysis techniques such as a template waveform fit. As a result, background events in the energy range of 48-58 MeV were reduced by 34 %. The calibration of the energy scale of the calorimeter with several calibration sources is also discussed to achieve a high resolution of 1.8 %.
△ Less
Submitted 17 December, 2024; v1 submitted 28 July, 2024;
originally announced July 2024.
-
When can transformers compositionally generalize in-context?
Authors:
Seijin Kobayashi,
Simon Schug,
Yassir Akram,
Florian Redhardt,
Johannes von Oswald,
Razvan Pascanu,
Guillaume Lajoie,
João Sacramento
Abstract:
Many tasks can be composed from a few independent components. This gives rise to a combinatorial explosion of possible tasks, only some of which might be encountered during training. Under what circumstances can transformers compositionally generalize from a subset of tasks to all possible combinations of tasks that share similar components? Here we study a modular multitask setting that allows us…
▽ More
Many tasks can be composed from a few independent components. This gives rise to a combinatorial explosion of possible tasks, only some of which might be encountered during training. Under what circumstances can transformers compositionally generalize from a subset of tasks to all possible combinations of tasks that share similar components? Here we study a modular multitask setting that allows us to precisely control compositional structure in the data generation process. We present evidence that transformers learning in-context struggle to generalize compositionally on this task despite being in principle expressive enough to do so. Compositional generalization becomes possible only when introducing a bottleneck that enforces an explicit separation between task inference and task execution.
△ Less
Submitted 16 July, 2024;
originally announced July 2024.
-
Majorana multipole response with magnetic point group symmetry
Authors:
Yuki Yamazaki,
Shingo Kobayashi
Abstract:
Majorana fermions (MFs) in a topological superconductor exhibit anisotropic electromagnetic responses, called Majorana multipole responses, when MFs are degenerate under time-reversal and crystalline symmetries. In time-reversal symmetric systems, the Majorana multipole response relates to Cooper pair symmetry in the underlying superconducting material, which provides a way to identify pairing sym…
▽ More
Majorana fermions (MFs) in a topological superconductor exhibit anisotropic electromagnetic responses, called Majorana multipole responses, when MFs are degenerate under time-reversal and crystalline symmetries. In time-reversal symmetric systems, the Majorana multipole response relates to Cooper pair symmetry in the underlying superconducting material, which provides a way to identify pairing symmetries through surface-spin-sensitive measurements. Here, we extend the concept of Majorana multipole response to systems with magnetic point group symmetry that break time-reversal symmetry and clarify how the response of MFs includes information about underlying superconductors. From a topological classification of symmetry-protected MFs and an effective surface theory, we classify possible magnetic and electric responses for MFs, which manifests a direct connection to Cooper pair symmetry for a symmetry-enforced pair of MFs. Additionally, we find several time-reversal-even higher-order multipole responses, such as the quadrupole response, which are forbidden in time-reversal symmetric systems, whereby indicating breaking of time-reversal symmetry. The theory is applied to the odd-parity chiral superconductor UTe$_2$ and the ferromagnetic superconductor UCoGe, demonstrating the appearance of a magnetic quadrupole response on a surface.
△ Less
Submitted 14 October, 2024; v1 submitted 1 July, 2024;
originally announced July 2024.
-
Status of Xtend telescope onboard X-Ray Imaging and Spectroscopy Mission (XRISM)
Authors:
Koji Mori,
Hiroshi Tomida,
Hiroshi Nakajima,
Takashi Okajima,
Hirofumi Noda,
Hiroyuki Uchida,
Hiromasa Suzuki,
Shogo Benjamin Kobayashi,
Tomokage Yoneyama,
Kouichi Hagino,
Kumiko Nobukawa,
Takaaki Tanaka,
Hiroshi Murakami,
Hideki Uchiyama,
Masayoshi Nobukawa,
Hironori Matsumoto,
Takeshi Tsuru,
Makoto Yamauchi,
Isamu Hatsukade,
Hirokazu Odaka,
Takayoshi Kohmura,
Kazutaka Yamaoka,
Manabu Ishida,
Yoshitomo Maeda,
Takayuki Hayashi
, et al. (38 additional authors not shown)
Abstract:
Xtend is one of the two telescopes onboard the X-ray imaging and spectroscopy mission (XRISM), which was launched on September 7th, 2023. Xtend comprises the Soft X-ray Imager (SXI), an X-ray CCD camera, and the X-ray Mirror Assembly (XMA), a thin-foil-nested conically approximated Wolter-I optics. A large field of view of $38^{\prime}\times38^{\prime}$ over the energy range from 0.4 to 13 keV is…
▽ More
Xtend is one of the two telescopes onboard the X-ray imaging and spectroscopy mission (XRISM), which was launched on September 7th, 2023. Xtend comprises the Soft X-ray Imager (SXI), an X-ray CCD camera, and the X-ray Mirror Assembly (XMA), a thin-foil-nested conically approximated Wolter-I optics. A large field of view of $38^{\prime}\times38^{\prime}$ over the energy range from 0.4 to 13 keV is realized by the combination of the SXI and XMA with a focal length of 5.6 m. The SXI employs four P-channel, back-illuminated type CCDs with a thick depletion layer of 200 $μ$m. The four CCD chips are arranged in a 2$\times$2 grid and cooled down to $-110$ $^{\circ}$C with a single-stage Stirling cooler. Before the launch of XRISM, we conducted a month-long spacecraft thermal vacuum test. The performance verification of the SXI was successfully carried out in a course of multiple thermal cycles of the spacecraft. About a month after the launch of XRISM, the SXI was carefully activated and the soundness of its functionality was checked by a step-by-step process. Commissioning observations followed the initial operation. We here present pre- and post-launch results verifying the Xtend performance. All the in-orbit performances are consistent with those measured on ground and satisfy the mission requirement. Extensive calibration studies are ongoing.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Initial operations of the Soft X-ray Imager onboard XRISM
Authors:
Hiromasa Suzuki,
Tomokage Yoneyama,
Shogo B. Kobayashi,
Hirofumi Noda,
Hiroyuki Uchida,
Kumiko K. Nobukawa,
Kouichi Hagino,
Koji Mori,
Hiroshi Tomida,
Hiroshi Nakajima,
Takaaki Tanaka,
Hiroshi Murakami,
Hideki Uchiyama,
Masayoshi Nobukawa,
Yoshiaki Kanemaru,
Yoshinori Otsuka,
Haruhiko Yokosu,
Wakana Yonemaru,
Hanako Nakano,
Kazuhiro Ichikawa,
Reo Takemoto,
Tsukasa Matsushima,
Marina Yoshimoto,
Mio Aoyagi,
Kohei Shima
, et al. (30 additional authors not shown)
Abstract:
XRISM (X-Ray Imaging and Spectroscopy Mission) is an astronomical satellite with the capability of high-resolution spectroscopy with the X-ray microcalorimeter, Resolve, and wide field-of-view imaging with the CCD camera, Xtend. Xtend consists of the mirror assembly (XMA: X-ray Mirror Assembly) and detector (SXI: Soft X-ray Imager). The SXI is composed of CCDs, analog and digital electronics, and…
▽ More
XRISM (X-Ray Imaging and Spectroscopy Mission) is an astronomical satellite with the capability of high-resolution spectroscopy with the X-ray microcalorimeter, Resolve, and wide field-of-view imaging with the CCD camera, Xtend. Xtend consists of the mirror assembly (XMA: X-ray Mirror Assembly) and detector (SXI: Soft X-ray Imager). The SXI is composed of CCDs, analog and digital electronics, and a mechanical cooler. After the successful launch on September 6th, 2023 (UT) and subsequent critical operations, the mission instruments were turned on and set up. The CCDs have been kept at the designed operating temperature of $-110^\circ$C after the electronics and cooling system were successfully set up. During the initial operation phase, which continued for more than a month after the critical operations, we verified the observation procedure, stability of the cooling system, all the observation options with different imaging areas and/or timing resolutions, and time-tagged and automated operations including those for South Atlantic Anomaly passages. We optimized the operation procedure and observation parameters including the cooler settings, imaging areas for the small window modes, and event selection algorithm. We summarize our policy and procedure of the initial operations for the SXI. We also report on a couple of issues we faced during the initial operations and lessons learned from them.
△ Less
Submitted 14 February, 2025; v1 submitted 28 June, 2024;
originally announced June 2024.
-
Representation-protected topology of spin-singlet $s$-wave superconductors
Authors:
Shingo Kobayashi,
Akira Furusaki
Abstract:
We show that spin-singlet $s$-wave multi-band superconductors have a topological phase protected by rotation symmetry and time-reversal symmetry without spin-orbit coupling in two and three dimensions. This topological phase, an example of a representation-protected topological phase, has a $\mathbb{Z}_2$ topological index and is stable as long as the bands at the Fermi energy are formed by a doub…
▽ More
We show that spin-singlet $s$-wave multi-band superconductors have a topological phase protected by rotation symmetry and time-reversal symmetry without spin-orbit coupling in two and three dimensions. This topological phase, an example of a representation-protected topological phase, has a $\mathbb{Z}_2$ topological index and is stable as long as the bands at the Fermi energy are formed by a doublet of orbital states with finite angular momenta. In the limit of weak superconducting pair potential, the $\mathbb{Z}_2$ index gives a Fermi-surface formula and is related to the winding number of three-dimensional strong topological superconductors of class CI. We present a model of a topological $s_{\pm}$-wave superconductor that has gapless surface states with a quadratic dispersion and suggest a connection with iron-based superconductors.
△ Less
Submitted 7 October, 2024; v1 submitted 12 June, 2024;
originally announced June 2024.
-
Attention as a Hypernetwork
Authors:
Simon Schug,
Seijin Kobayashi,
Yassir Akram,
João Sacramento,
Razvan Pascanu
Abstract:
Transformers can under some circumstances generalize to novel problem instances whose constituent parts might have been encountered during training, but whose compositions have not. What mechanisms underlie this ability for compositional generalization? By reformulating multi-head attention as a hypernetwork, we reveal that a composable, low-dimensional latent code specifies key-query specific ope…
▽ More
Transformers can under some circumstances generalize to novel problem instances whose constituent parts might have been encountered during training, but whose compositions have not. What mechanisms underlie this ability for compositional generalization? By reformulating multi-head attention as a hypernetwork, we reveal that a composable, low-dimensional latent code specifies key-query specific operations. We find empirically that this latent code is predictive of the subtasks the network performs on unseen task compositions, revealing that latent codes acquired during training are reused to solve unseen problem instances. To further examine the hypothesis that the intrinsic hypernetwork of multi-head attention supports compositional generalization, we ablate whether making the hypernetwork-generated linear value network nonlinear strengthens compositionality. We find that this modification improves compositional generalization on abstract reasoning tasks. In particular, we introduce a symbolic version of the Raven's Progressive Matrices human intelligence test, which gives us precise control over the problem compositions encountered during training and evaluation. We demonstrate on this task how scaling model size and data enables compositional generalization in transformers and gives rise to a functionally structured latent space.
△ Less
Submitted 17 February, 2025; v1 submitted 9 June, 2024;
originally announced June 2024.
-
Half-dimensional immersions into the para-complex projective space and Ruh-Vilms type theorems
Authors:
Josef F. Dorfmeister,
Roland Hildebrand,
Shimpei Kobayashi
Abstract:
In this paper we study isometric immersions $f:M^n \to {\mathbb {C}^{\prime}}\!P^n$ of an $n$-dimensional pseudo-Riemannian manifold $M^n$ into the $n$-dimensional para-complex projective space ${\mathbb {C}^{\prime}}\!P^n$. We study the immersion $f$ by means of a lift $\mathfrak f$ of $f$ into a quadric hypersurface in ${S^{2n+1}_{n+1}}$. We find the frame equations and compatibility conditions.…
▽ More
In this paper we study isometric immersions $f:M^n \to {\mathbb {C}^{\prime}}\!P^n$ of an $n$-dimensional pseudo-Riemannian manifold $M^n$ into the $n$-dimensional para-complex projective space ${\mathbb {C}^{\prime}}\!P^n$. We study the immersion $f$ by means of a lift $\mathfrak f$ of $f$ into a quadric hypersurface in ${S^{2n+1}_{n+1}}$. We find the frame equations and compatibility conditions. We specialize these results to dimension $n = 2$ and a definite metric on $M^2$ in isothermal coordinates and consider the special cases of Lagrangian surface immersions and minimal surface immersions. We characterize surface immersions with special properties in terms of primitive harmonicity of the Gauss maps.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Phase-Dependent Spectral Shape Changes in the Ultraluminous X-Ray Pulsar NGC 5907 ULX1
Authors:
Daiki Miura,
Shogo B. Kobayashi,
Hiroya Yamaguchi
Abstract:
Discovery of coherent pulsations from several ultraluminous X-ray pulsars (ULXPs) has provided direct evidence of super-critical accretion flow. However, geometrical structure of such accretion flow onto the central neutron star remains poorly understood. NGC 5907 ULX1 is one of the most luminous ULXPs with the luminosity exceeding $10^{41}~{\rm erg~s^{-1}}$. Here we present a broadband X-ray stud…
▽ More
Discovery of coherent pulsations from several ultraluminous X-ray pulsars (ULXPs) has provided direct evidence of super-critical accretion flow. However, geometrical structure of such accretion flow onto the central neutron star remains poorly understood. NGC 5907 ULX1 is one of the most luminous ULXPs with the luminosity exceeding $10^{41}~{\rm erg~s^{-1}}$. Here we present a broadband X-ray study of this ULXP using the data from simultaneous observations with XMM-Newton and NuSTAR conducted in July 2014. The phase-resolved spectra are well reproduced by a model consisting of a multicolor disk blackbody emission with a temperature gradient of $p = 0.5~(T \propto r^{-p})$ and a power law with an exponential cutoff. The disk component is phase-invariant, and has an innermost temperature of $\sim 0.3~{\rm keV}$. Its normalization suggests a relatively low inclination angle of the disk, in contrast to the previous claim in other literature. The power law component, attributed to the emission from the accretion flow inside the magnetosphere of the neutron star, indicates phase-dependent spectral shape changes; the spectrum is slightly harder in the pre-peak phase than in the post-peak phase. This implies that the magnetosphere has an asymmetric geometry around the magnetic axis, and that hotter regions close to the magnetic pole become visible before the pulse peak.
△ Less
Submitted 30 April, 2024;
originally announced April 2024.
-
Electromagnetic response of spinful Majorana fermions
Authors:
Shingo Kobayashi,
Masatoshi Sato
Abstract:
A remarkable feature of topological superconductors is the emergence of Majorana fermions in electron systems. Whereas the emergent Majorana fermions share the self-anti-particle property with Majorana fermions in particle physics, they may have essentially different electromagnetic properties. In this paper, we argue the electromagnetic response of spinful Majorana fermions in topological superco…
▽ More
A remarkable feature of topological superconductors is the emergence of Majorana fermions in electron systems. Whereas the emergent Majorana fermions share the self-anti-particle property with Majorana fermions in particle physics, they may have essentially different electromagnetic properties. In this paper, we argue the electromagnetic response of spinful Majorana fermions in topological superconductors. We present a general theory of the electromagnetic response of spinful Majorana fermions in topological superconductors and clarify how the pairing symmetry is encoded in the electromagnetic response. As an application, we predict the sublattice-dependent dipole (Ising)-type magnetic response of corner Majorana fermions in iron-based superconductors.
△ Less
Submitted 9 July, 2024; v1 submitted 30 April, 2024;
originally announced April 2024.
-
The complex landslide flow and the method of integrable systems
Authors:
Shimpei Kobayashi
Abstract:
We investigate a connection between the complex landslide flow, defined on a pair of Teichmüller spaces, and the integrable system approach to harmonic maps into a symmetric space. We will prove that the holonomy of the complex landslide flow can be derived from the holonomy of the family of flat connections determined by a harmonic map into the hyperbolic two-space.
We investigate a connection between the complex landslide flow, defined on a pair of Teichmüller spaces, and the integrable system approach to harmonic maps into a symmetric space. We will prove that the holonomy of the complex landslide flow can be derived from the holonomy of the family of flat connections determined by a harmonic map into the hyperbolic two-space.
△ Less
Submitted 18 February, 2025; v1 submitted 22 April, 2024;
originally announced April 2024.