-
Hidden memory and stochastic fluctuations in science
Authors:
Keisuke Okamura
Abstract:
Understanding the statistical laws governing citation dynamics remains a fundamental challenge in network theory and the science of science. Citation networks typically exhibit in-degree distributions well approximated by log-normal distributions, yet they also display power-law behaviour in the high-citation regime, presenting an apparent contradiction that lacks a unified explanation. Here, we identify a previously unrecognised phenomenon: the variance of the logarithm of citation counts per unit time follows a power law with respect to time since publication, scaling as $t^{H}$. This discovery introduces a new challenge while simultaneously offering a crucial clue to resolving this discrepancy. We develop a stochastic model in which latent attention to publications evolves through a memory-driven process incorporating cumulative advantage. This process is characterised by the Hurst parameter $H$, derived from fractional Brownian motion, and volatility. Our framework reconciles this contradiction by demonstrating that anti-persistent fluctuations ($H<\tfrac{1}{2}$) give rise to log-normal citation distributions, whereas persistent dynamics ($H>\tfrac{1}{2}$) favour heavy-tailed power laws. Numerical simulations confirm our model's explanatory and predictive power, interpolating between log-normal and power-law distributions while reproducing the $t^{H}$ law. Empirical analysis of arXiv e-prints further supports our theory, revealing an intrinsically anti-persistent nature with an upper bound of approximately $H=0.13$. By linking memory effects and stochastic fluctuations to broader network dynamics, our findings provide a unifying framework for understanding the evolution of collective attention in science and other attention-driven processes.
Submitted 4 March, 2025;
originally announced March 2025.
-
Evolving interdisciplinary contributions to global societal challenges: A 50-year overview
Authors:
Keisuke Okamura
Abstract:
Addressing global societal challenges necessitates insights and expertise that transcend the boundaries of individual disciplines. In recent decades, interdisciplinary collaboration has been recognised as a vital driver of innovation and effective problem-solving, with the potential to profoundly influence policy and practice worldwide. However, quantitative evidence remains limited regarding how cross-disciplinary efforts contribute to societal challenges, as well as the evolving roles and relevance of specific disciplines in addressing these issues. To fill this gap, this study examines the long-term evolution of interdisciplinary contributions to the United Nations' Sustainable Development Goals (SDGs), drawing on extensive bibliometric data from OpenAlex. By analysing publication and citation trends across 19 research fields from 1970 to 2022, we reveal how the relative presence of different disciplines in addressing particular SDGs has shifted over time. Our results also provide unique evidence of the increasing interconnection between fields since the 2000s, coinciding with the United Nations' initiative to tackle global societal challenges through interdisciplinary efforts. These insights will benefit policymakers and practitioners as they reflect on past progress and plan for future action, particularly with the SDG target deadline approaching in the next five years.
Submitted 27 October, 2024;
originally announced October 2024.
-
Evolving landscape of US-China science collaboration: Convergence and divergence
Authors:
Kensei Kitajima,
Keisuke Okamura
Abstract:
International research collaboration among global scientific powerhouses has exhibited a discernible trend towards convergence in recent decades. Notably, the US and China have significantly fortified their collaboration across diverse scientific disciplines, solidifying their status as a national-level duopoly in global scientific knowledge production. However, recent reports hint at a potential decline in collaboration between these two giants, even amidst the backdrop of advancing global convergence. Understanding the intricate interplay between cooperation and disparity within the US-China relationship is vital for both academia and policy leaders, as it provides invaluable insights into the potential future trajectory of global science collaboration. Despite its significance, there remains a noticeable dearth of quantitative evidence that adequately encapsulates the dynamism across disciplines and over time. To bridge this knowledge gap, this study delves into the evolving landscape of interaction between the US and China over recent decades. This investigation employs two approaches, one based on paper identifiers and the other on researcher identifiers, both obtained from bibliometric data sourced from OpenAlex. From both approaches, our findings unveil the unique and dynamic nature of the US-China relationship, characterised by a collaboration pattern initially marked by rapid convergence, followed by a recent phase of divergence.
Submitted 10 September, 2023;
originally announced September 2023.
-
Atlas of Science Collaboration, 1971-2020
Authors:
Keisuke Okamura
Abstract:
The evolving landscape of interinstitutional collaborative research across 15 natural science disciplines is explored using the open data sourced from OpenAlex. This extensive exploration spans the years from 1971 to 2020, facilitating a thorough investigation of leading scientific output producers and their collaborative relationships based on coauthorships. The findings are visually presented on world maps and other diagrams, offering a clear and insightful portrayal of notable variations in both national and international collaboration patterns across various fields and time periods. These visual representations serve as valuable resources for science policymakers, diplomats and institutional researchers, providing them with a comprehensive overview of global collaboration and aiding their intuitive grasp of the evolving nature of these partnerships over time.
Submitted 22 August, 2023;
originally announced August 2023.
-
Metrization of powers of the Jensen-Shannon divergence
Authors:
Kazuki Okamura
Abstract:
Metrization of statistical divergences is valuable in both theoretical and practical aspects. One approach to obtaining metrics associated with divergences is to consider their fractional powers. Motivated by this idea, Osán, Bussandri, and Lamberti (2018) studied the metrization of fractional powers of the Jensen-Shannon divergence between multinomial distributions and posed an open problem. In this short note, we provide an affirmative answer to their conjecture. Moreover, our method is also applicable to fractional powers of $f$-divergences between Cauchy distributions.
Submitted 30 March, 2025; v1 submitted 20 February, 2023;
originally announced February 2023.
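The idea in the abstract above, that a fractional power of a divergence can be a metric, can be illustrated with the best-known case: the square root of the Jensen-Shannon divergence satisfies the triangle inequality. The sketch below is only a numerical spot check on random discrete distributions, not the note's proof and not its more general result. The function names, the number of trials, and the seed are illustrative assumptions.

```python
import math
import random


def kl(p, q):
    """Kullback-Leibler divergence between discrete distributions (natural log)."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def jsd(p, q):
    """Jensen-Shannon divergence: mean KL divergence to the midpoint mixture."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    # Clamp at zero to guard against tiny negative rounding errors.
    return max(0.0, 0.5 * kl(p, m) + 0.5 * kl(q, m))


def random_dist(rng, k):
    """Draw a random probability vector of length k (illustrative sampler)."""
    w = [rng.random() for _ in range(k)]
    s = sum(w)
    return [x / s for x in w]


def max_triangle_violation(trials=2000, k=4, seed=0, power=0.5):
    """Largest value of d(p,r) - d(p,q) - d(q,r) over random triples, d = JSD**power.

    A value <= 0 means no violation of the triangle inequality was observed.
    """
    rng = random.Random(seed)
    worst = float("-inf")
    for _ in range(trials):
        p, q, r = (random_dist(rng, k) for _ in range(3))
        d_pr = jsd(p, r) ** power
        d_pq = jsd(p, q) ** power
        d_qr = jsd(q, r) ** power
        worst = max(worst, d_pr - d_pq - d_qr)
    return worst


if __name__ == "__main__":
    print("max violation for power 1/2:", max_triangle_violation())
```

A non-positive result is consistent with the metric property but does not establish it; a random search can only fail to find counterexamples.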
-
A half-century of global collaboration in science and the 'Shrinking World'
Authors:
Keisuke Okamura
Abstract:
Recent decades have witnessed a dramatic shift in the cross-border collaboration mode of researchers, with countries increasingly cooperating and competing with one another. It is crucial for leaders in academia and policy to understand the full extent of international research collaboration, their country's position within it, and its evolution over time. However, evidence for such world-scale dynamism is still scarce. This paper provides unique evidence of how international collaboration clusters have formed and evolved over the past 50 years across various scientific publications, using data from OpenAlex, a large-scale Open Bibliometrics platform launched in 2022. We first examine how the global presence of top-tier countries has changed in 15 natural science disciplines over time, as measured by publication volumes and international collaboration rates. Notably, we observe that the US and China have been rapidly moving closer together for decades but began moving apart after 2019. We then perform a hierarchical clustering to analyse and visualise the international collaboration clusters for each discipline and period. Finally, we provide quantitative evidence of a 'Shrinking World' of research collaboration at a global scale over the past half-century. Our results provide valuable insights into the big picture of past, present and future international collaboration.
Submitted 7 September, 2023; v1 submitted 8 November, 2022;
originally announced November 2022.
-
Information measures and geometry of the hyperbolic exponential families of Poincaré and hyperboloid distributions
Authors:
Frank Nielsen,
Kazuki Okamura
Abstract:
We study various information-theoretic measures and the information geometry of the Poincaré distributions and the related hyperboloid distributions, and prove that their statistical mixture models are universal density estimators of smooth densities in hyperbolic spaces. The Poincaré and the hyperboloid distributions are two types of hyperbolic probability distributions defined using different models of hyperbolic geometry. Namely, the Poincaré distributions form a triparametric bivariate exponential family whose sample space is the hyperbolic Poincaré upper-half plane and whose natural parameter space is the open 3D convex cone of two-by-two positive-definite matrices. The family of hyperboloid distributions forms another exponential family whose sample space is the forward sheet of the two-sheeted unit hyperboloid modeling hyperbolic geometry. In the first part, we prove that all $f$-divergences between Poincaré distributions can be expressed using three canonical terms via Eaton's framework of maximal group invariance. We also show that the $f$-divergences between any two Poincaré distributions are asymmetric except when those distributions belong to the same leaf of a particular foliation of the parameter space. We report closed-form formulas for the Fisher information matrix, Shannon's differential entropy, the Kullback-Leibler divergence, and the Bhattacharyya distances between such distributions using the framework of exponential families. In the second part, we state the corresponding results for the exponential family of hyperboloid distributions by highlighting a parameter correspondence between the Poincaré and the hyperboloid distributions. Finally, we describe a random generator to draw variates and present two Monte Carlo methods to stochastically estimate $f$-divergences between hyperbolic distributions.
Submitted 24 November, 2024; v1 submitted 27 May, 2022;
originally announced May 2022.
-
A note on the $f$-divergences between multivariate location-scale families with either prescribed scale matrices or location parameters
Authors:
Frank Nielsen,
Kazuki Okamura
Abstract:
We first extend the result of Ali and Silvey [Journal of the Royal Statistical Society: Series B, 28.1 (1966), 131-142], who first reported that any $f$-divergence between two isotropic multivariate Gaussian distributions amounts to a strictly increasing scalar function of their Mahalanobis distance. We report sufficient conditions on the standard probability density function generating a multivariate location family and on the function generator $f$ in order to generalize this result. This property is useful in practice, as it allows one to compare $f$-divergences between densities of these location families exactly via their corresponding Mahalanobis distances, even when the $f$-divergences are not available in closed form, as is the case, for example, for the Jensen-Shannon divergence or the total variation distance between densities of a normal location family. Second, we consider $f$-divergences between densities of multivariate scale families: we recall Ali and Silvey's result that for normal scale families we get matrix spectral divergences, and we extend this result to densities of a scale family.
Submitted 30 May, 2022; v1 submitted 22 April, 2022;
originally announced April 2022.
-
Scientometric engineering: Exploring citation dynamics via arXiv eprints
Authors:
Keisuke Okamura
Abstract:
Scholarly communications have been rapidly integrated into digitised and networked open ecosystems, where preprint servers have played a pivotal role in accelerating the knowledge transfer processes. However, quantitative evidence is scarce regarding how this paradigm shift beyond the traditional journal publication system has affected the dynamics of collective attention on science. To address this issue, we investigate the citation data of more than 1.5 million eprints on arXiv (https://arxiv.boxedpaper.com/) and analyse the long-term citation trend for each discipline involved. We find that the typical growth and obsolescence patterns vary across disciplines, reflecting different publication and communication practices. The results provide unique evidence on the attention dynamics shaped by the research community today, including the dramatic growth and fast obsolescence of Computer Science eprints, which has not been captured in previous studies relying on the citation data of journal papers. Subsequently, we develop a quantitatively-and-temporally normalised citation index with an approximately normal distribution, which is useful for comparing citational attention across disciplines and time periods. Further, we derive a stochastic model consistent with the observed quantitative and temporal characteristics of citation growth and obsolescence. The findings and the developed framework open a new avenue for understanding the nature of citation dynamics.
Submitted 5 February, 2022; v1 submitted 9 June, 2021;
originally announced June 2021.
-
On $f$-divergences between Cauchy distributions
Authors:
Frank Nielsen,
Kazuki Okamura
Abstract:
We prove that the $f$-divergences between univariate Cauchy distributions are all symmetric, and can be expressed as strictly increasing scalar functions of the symmetric chi-squared divergence. We report the corresponding scalar functions for the total variation distance, the Kullback-Leibler divergence, the squared Hellinger divergence, and the Jensen-Shannon divergence, among others. Next, we give conditions to expand the $f$-divergences as converging infinite series of higher-order power chi divergences, and illustrate the criterion for converging Taylor series expressing the $f$-divergences between Cauchy distributions. We then show that the symmetric property of $f$-divergences holds for multivariate location-scale families with prescribed matrix scales provided that the standard density is even, which includes the cases of the multivariate normal and Cauchy families. However, the $f$-divergences between multivariate Cauchy densities with different scale matrices are shown to be asymmetric. Finally, we present several metrizations of $f$-divergences between univariate Cauchy distributions and further report geometric embedding properties of the Kullback-Leibler divergence.
Submitted 7 December, 2021; v1 submitted 29 January, 2021;
originally announced January 2021.
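The symmetry claimed in the abstract above can be spot-checked numerically for the Kullback-Leibler divergence. The sketch below integrates the divergence in both directions and compares it with the standard closed form for two univariate Cauchy densities, log(((s1 + s2)^2 + (l1 - l2)^2) / (4 s1 s2)), which is symmetric in its arguments. That formula is a known result from the literature and is not quoted from this listing. The function names, the parameter values, and the choice of quadrature (midpoint rule after the substitution x = l1 + s1 tan θ) are illustrative assumptions.

```python
import math


def cauchy_pdf(x, loc, scale):
    """Density of a univariate Cauchy distribution with the given location and scale."""
    return scale / (math.pi * (scale ** 2 + (x - loc) ** 2))


def kl_closed_form(l1, s1, l2, s2):
    """Closed-form KL divergence between two Cauchy densities (symmetric in its arguments)."""
    return math.log(((s1 + s2) ** 2 + (l1 - l2) ** 2) / (4.0 * s1 * s2))


def kl_numeric(l1, s1, l2, s2, n=20000):
    """Midpoint-rule estimate of KL(p1 : p2).

    The substitution x = l1 + s1 * tan(theta) maps the real line to
    (-pi/2, pi/2) and turns p1(x) dx into d(theta) / pi, so the integrand
    reduces to log(p1 / p2), which stays bounded on the whole interval.
    """
    h = math.pi / n
    total = 0.0
    for i in range(n):
        theta = -math.pi / 2 + (i + 0.5) * h
        x = l1 + s1 * math.tan(theta)
        total += math.log(cauchy_pdf(x, l1, s1) / cauchy_pdf(x, l2, s2))
    return total * h / math.pi


if __name__ == "__main__":
    forward = kl_numeric(0.0, 1.0, 2.0, 3.0)
    backward = kl_numeric(2.0, 3.0, 0.0, 1.0)
    print(forward, backward, kl_closed_form(0.0, 1.0, 2.0, 3.0))
```

Agreement of the two directions for a handful of parameter choices is only a sanity check; the general statement for all $f$-divergences is what the paper proves.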
-
Category theory as a foundation for soft robotics
Authors:
Hayato Saigo,
Makoto Naruse,
Kazuya Okamura,
Hirokazu Hori,
Izumi Ojima
Abstract:
Soft robotics is an emerging field of research where the robot body is composed of compliant and soft materials. It allows the body to bend, twist, and deform to move or to adapt its shape to the environment for grasping, all of which are difficult for traditional hard robots with rigid bodies. However, the theoretical basis and design principles for soft robotics are not well-founded despite their recognized importance. For example, the control of soft robots is outsourced to morphological attributes and natural processes; thus, the coupled relations between a robot and its environment are particularly crucial. In this paper, we propose a mathematical foundation for soft robotics based on category theory, which is a branch of abstract math where any notions can be described by objects and arrows. It allows for a rigorous description of the inherent characteristics of soft robots and their relation to the environment as well as the differences compared to conventional hard robots. We present a notion called the category of mobility that well describes the subject matter. The theory was applied to a model system and analysis to highlight the adaptation behavior observed in universal grippers, which are a typical example of soft robotics. This paper paves the way to developing a theoretical background and design principles for soft robotics.
Submitted 25 September, 2018; v1 submitted 16 May, 2018;
originally announced May 2018.
-
Local reservoir model for choice-based learning
Authors:
Makoto Naruse,
Eiji Yamamoto,
Takashi Nakao,
Takuma Akimoto,
Hayato Saigo,
Kazuya Okamura,
Izumi Ojima,
Georg Northoff,
Hirokazu Hori
Abstract:
Decision making based on behavioral and neural observations of living systems has been extensively studied in brain science, psychology, and other disciplines. Decision-making mechanisms have also been experimentally implemented in physical processes, such as single photons and chaotic lasers. The findings of these experiments suggest that there is a certain common basis in describing decision making, regardless of its physical realizations. In this study, we propose a local reservoir model to account for choice-based learning (CBL). CBL describes decision consistency as a phenomenon where making a certain decision increases the possibility of making that same decision again later, which has been intensively investigated in neuroscience, psychology, etc. Our proposed model is inspired by the viewpoint that a decision is affected by its local environment, which is referred to as a local reservoir. If the size of the local reservoir is large enough, consecutive decision making will not be affected by previous decisions, thus showing lower degrees of decision consistency in CBL. In contrast, if the size of the local reservoir decreases, a biased distribution occurs within it, which leads to higher degrees of decision consistency in CBL. In this study, an analytical approach to local reservoirs is presented, as well as several numerical demonstrations. Furthermore, a physical architecture for CBL based on single photons is discussed, and the effects of local reservoirs are numerically demonstrated. Decision consistency in human decision-making tasks and in recruiting empirical data is evaluated based on the local reservoir. In summary, the proposed local reservoir model paves a path toward establishing a foundation for computational mechanisms and the systematic analysis of decision making on different levels.
Submitted 12 April, 2018;
originally announced April 2018.
-
Entangled-photon decision maker
Authors:
Nicolas Chauvet,
David Jegouso,
Benoît Boulanger,
Hayato Saigo,
Kazuya Okamura,
Hirokazu Hori,
Aurélien Drezet,
Serge Huant,
Guillaume Bachelier,
Makoto Naruse
Abstract:
The competitive multi-armed bandit (CMAB) problem is related to social issues such as maximizing total social benefits while preserving equality among individuals by overcoming conflicts between individual decisions, which could seriously decrease social benefits. The study described herein provides experimental evidence that entangled photons physically resolve the CMAB in the 2-arms 2-players case, maximizing the social rewards while ensuring equality. Moreover, we demonstrated that deception, or outperforming the other player by receiving a greater reward, cannot be accomplished in a polarization-entangled-photon-based system, while deception is achievable in systems based on classical polarization-correlated photons with fixed polarizations. Besides, random polarization-correlated photons have been studied numerically and shown to ensure equality between players and deception prevention as well, although the CMAB maximum performance is reduced as compared with entangled photon experiments. Autonomous alignment schemes for polarization bases were also experimentally demonstrated based only on decision conflict information observed by an individual without communications between players. This study paves a way for collective decision making in uncertain dynamically changing environments based on entangled quantum states, a crucial step toward utilizing quantum systems for intelligent functionalities.
Submitted 27 August, 2019; v1 submitted 12 April, 2018;
originally announced April 2018.
-
Scalable photonic reinforcement learning by time-division multiplexing of laser chaos
Authors:
Makoto Naruse,
Takatomo Mihana,
Hirokazu Hori,
Hayato Saigo,
Kazuya Okamura,
Mikio Hasegawa,
Atsushi Uchida
Abstract:
Reinforcement learning involves decision making in dynamic and uncertain environments and constitutes a crucial element of artificial intelligence. In our previous work, we experimentally demonstrated that the ultrafast chaotic oscillatory dynamics of lasers can be used to solve the two-armed bandit problem efficiently, which requires decision making concerning a class of difficult trade-offs called the exploration-exploitation dilemma. However, only two selections were employed in that research; thus, the scalability of the laser-chaos-based reinforcement learning should be clarified. In this study, we demonstrated a scalable, pipelined principle of resolving the multi-armed bandit problem by introducing time-division multiplexing of chaotically oscillated ultrafast time-series. The experimental demonstrations in which bandit problems with up to 64 arms were successfully solved are presented in this report. Detailed analyses are also provided that include performance comparisons among laser chaos signals generated in different physical conditions, which coincide with the diffusivity inherent in the time series. This study paves the way for ultrafast reinforcement learning by taking advantage of the ultrahigh bandwidths of light wave and practical enabling technologies.
Submitted 26 March, 2018;
originally announced March 2018.