Curiosity as filling, compressing, and reconfiguring knowledge networks
Authors:
Shubhankar P. Patankar,
Dale Zhou,
Christopher W. Lynn,
Jason Z. Kim,
Mathieu Ouellet,
Harang Ju,
Perry Zurn,
David M. Lydon-Staley,
Dani S. Bassett
Abstract:
Due to the significant role that curiosity plays in our lives, several theoretical constructs, such as the information gap theory and compression progress theory, have sought to explain how we engage in its practice. According to the former, curiosity is the drive to acquire information that is missing from our understanding of the world. According to the latter, curiosity is the drive to construct an increasingly parsimonious mental model of the world. To complement the densification processes inherent to these theories, we propose the conformational change theory, wherein we posit that curiosity results in mental models with marked conceptual flexibility. We formalize curiosity as the process of building a growing knowledge network to quantitatively investigate information gap theory, compression progress theory, and the conformational change theory of curiosity. In knowledge networks, gaps can be identified as topological cavities, compression progress can be quantified using network compressibility, and flexibility can be measured as the number of conformational degrees of freedom. We leverage data acquired from the online encyclopedia Wikipedia to determine the degree to which each theory explains the growth of knowledge networks built by individuals and by collectives. Our findings lend support to a pluralistic view of curiosity, wherein intrinsically motivated information acquisition fills knowledge gaps and simultaneously leads to increasingly compressible and flexible knowledge networks. Across individuals and collectives, we determine the contexts in which each theoretical account may be explanatory, thereby clarifying their complementary and distinct explanations of curiosity. Our findings offer a novel network theoretical perspective on intrinsically motivated information acquisition that may harmonize with or compel an expansion of the traditional taxonomy of curiosity.
Submitted 3 April, 2022;
originally announced April 2022.
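As a rough illustration of how gaps in a growing knowledge network might be tracked, the sketch below counts independent loops (the cycle rank, edges minus nodes plus connected components) after each new edge is added. This is only a crude one-dimensional stand-in for the topological cavities mentioned in the abstract, not the authors' method; the function name `cycle_rank_over_time` and the toy concept labels are hypothetical, and the edge list is assumed to contain no repeated edges.

```python
def cycle_rank_over_time(edges):
    """Return the number of independent loops after each edge is added.

    Assumption: `edges` is an ordered list of (concept, concept) pairs
    with no duplicates, representing the order in which links were learned.
    """
    parent = {}  # union-find forest over the concepts seen so far

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    loops = 0
    history = []
    for u, v in edges:
        root_u, root_v = find(u), find(v)
        if root_u == root_v:
            loops += 1  # both ends already connected: the edge closes a loop
        else:
            parent[root_u] = root_v  # the edge merges two components
        history.append(loops)
    return history


# Toy growth sequence (hypothetical concepts).
growth = [("a", "b"), ("b", "c"), ("c", "a"), ("c", "d"), ("d", "a")]
print(cycle_rank_over_time(growth))
```

A fuller analysis in the spirit of the abstract would need to track when such loops are later filled in, which plain loop counting on a graph does not capture.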
Network embedding unveils the hidden interactions in the mammalian virome
Authors:
Timothée Poisot,
Marie-Andrée Ouellet,
Nardus Mollentze,
Maxwell J. Farrell,
Daniel J. Becker,
Liam Brierly,
Gregory F. Albery,
Rory J. Gibb,
Stephanie N. Seifert,
Colin J. Carlson
Abstract:
At most 1-2% of the global virome has been sampled to date. Recent work has shown that predicting which host-virus interactions are possible but undiscovered or unrealized is, fundamentally, a network science problem. Here, we develop a novel method that combines a coarse recommender system (Linear Filtering; LF) with an imputation algorithm based on low-rank graph embedding (Singular Value Decomposition; SVD) to infer host-virus associations. This combination of techniques results in informed initial guesses based on directly measurable network properties (density, degree distribution) that are refined through SVD (which is able to leverage emerging features). Using this method, we recovered highly plausible undiscovered interactions with a strong signal of viral coevolutionary history, and revealed a global hotspot of unusually unique but unsampled (or unrealized) host-virus interactions in the Amazon rainforest. We develop several tests for quantifying the bias and realism of these predictions, and show that the LF-SVD method is robust in each aspect. We finally show that graph embedding of the imputed network can be used to improve predictions of human infection from viral genome features, showing that the global structure of the mammal-virus network provides additional insights into human disease emergence.
Submitted 24 March, 2022; v1 submitted 31 May, 2021;
originally announced May 2021.
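The two-stage idea described in the abstract above (a linear filter supplying initial guesses, then a low-rank SVD refining them) can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: the filter weights, the rank, the iteration count, and the function names `linear_filter`, `low_rank`, and `lf_svd_impute` are all hypothetical, and every unobserved entry is imputed at once rather than one candidate interaction at a time.

```python
import numpy as np


def linear_filter(A, weights=(0.0, 1.0, 1.0, 1.0)):
    """Initial scores from directly measurable network properties.

    Assumed form: a weighted mix of the entry itself, its row mean
    (host degree), its column mean (virus degree), and overall density.
    """
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    row_mean = A.mean(axis=1, keepdims=True)
    col_mean = A.mean(axis=0, keepdims=True)
    return w[0] * A + w[1] * row_mean + w[2] * col_mean + w[3] * A.mean()


def low_rank(M, rank):
    """Best rank-`rank` approximation of M via truncated SVD."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return (U[:, :rank] * s[:rank]) @ Vt[:rank]


def lf_svd_impute(A, rank=2, n_iter=10):
    """Score unobserved host-virus pairs; observed pairs stay fixed at 1."""
    A = np.asarray(A, dtype=float)
    observed = A > 0
    # Stage 1: seed unobserved entries with the linear-filter guess.
    X = np.where(observed, 1.0, linear_filter(A))
    # Stage 2: repeatedly replace unobserved entries with their
    # low-rank reconstruction, clipped to the [0, 1] score range.
    for _ in range(n_iter):
        X = np.where(observed, 1.0, np.clip(low_rank(X, rank), 0.0, 1.0))
    return X


# Toy host-by-virus matrix (hypothetical data, 1 = known interaction).
A = np.array([[1, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 1, 1]])
scores = lf_svd_impute(A, rank=2, n_iter=5)
print(np.round(scores, 2))
```

Higher scores on unobserved pairs would be read as more plausible candidate interactions; the rank and iteration count here are arbitrary and would need tuning against held-out interactions.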