-
Entropy of Co-Enrolment Networks Reveal Disparities in High School STEM Participation
Authors:
Steven Martin Turnbull,
Dion R. J. O'Neale
Abstract:
The current study uses a network analysis approach to explore the STEM pathways that students take through their final year of high school in Aotearoa New Zealand. By accessing individual-level microdata from New Zealand's Integrated Data Infrastructure, we are able to create a co-enrolment network comprised of all STEM assessment standards taken by students in New Zealand between 2010 and 2016. We explore the structure of this co-enrolment network through use of community detection and a novel measure of entropy. We then investigate how network structure differs across sub-populations based on students' sex, ethnicity, and the socio-economic status (SES) of the high school they attended. Results show the structure of the STEM co-enrolment network differs across these sub-populations, and also changes over time. We find that, while female students were more likely to have been enrolled in life science standards, they were less well represented in physics, calculus, and vocational (e.g., agriculture, practical technology) standards. Our results also show that the enrolment patterns of the Maori and Pacific Islands sub-populations had higher levels of entropy, an observation that may be explained by fewer enrolments in key science and mathematics standards. Through further investigation of this disparity, we find that ethnic group differences in entropy are moderated by high school SES, such that the difference in entropy between Maori and Pacific Islands students, and European and Asian students is even greater. We discuss these findings in the context of the New Zealand education system and policy changes that occurred between 2010 and 2016.
Submitted 27 August, 2020;
originally announced August 2020.
-
Transitivity and degree assortativity explained: The bipartite structure of social networks
Authors:
Demival Vasques Filho,
Dion R. J. O'Neale
Abstract:
Dynamical processes, such as the diffusion of knowledge, opinions, pathogens, "fake news", innovation, and others, are highly dependent on the structure of the social network on which they occur. However, questions on why most social networks present some particular structural features, namely high levels of transitivity and degree assortativity, when compared to other types of networks remain open. First, we argue that every one-mode network can be regarded as a projection of a bipartite network, and show that this is the case using two simple examples solved with the generating functions formalism. Second, using synthetic and empirical data, we reveal how the combination of the degree distribution of both sets of nodes of the bipartite network --- together with the presence of cycles of length four and six --- explains the observed levels of transitivity and degree assortativity in the one-mode projected network. Bipartite networks with top node degrees that display a more right-skewed distribution than the bottom nodes result in highly transitive and degree assortative projections, especially if a large number of small cycles are present in the bipartite structure.
Submitted 6 December, 2019;
originally announced December 2019.
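As a rough illustration of the one-mode projection and transitivity discussed in the abstract above, the following sketch projects a tiny bipartite network onto one node set and computes the global transitivity of the result. It is not the authors' code: the `groups` data and the function names `project` and `transitivity` are hypothetical, and the example uses plain Python rather than any particular network library.

```python
from itertools import combinations


def project(memberships):
    """Project a bipartite network onto its member nodes.

    `memberships` maps each group node to the set of member nodes attached
    to it. Two members are linked in the projection if they share a group.
    """
    adjacency = {}
    for members in memberships.values():
        for node in members:
            adjacency.setdefault(node, set())
        for u, v in combinations(sorted(members), 2):
            adjacency[u].add(v)
            adjacency[v].add(u)
    return adjacency


def transitivity(adjacency):
    """Global transitivity: closed triples divided by all connected triples."""
    closed = 0   # each triangle is counted three times, once per centre node
    triples = 0  # pairs of neighbours around each centre node
    for neighbours in adjacency.values():
        k = len(neighbours)
        triples += k * (k - 1) // 2
        for u, v in combinations(sorted(neighbours), 2):
            if v in adjacency[u]:
                closed += 1
    return closed / triples if triples else 0.0


# Hypothetical toy data: one group of three members and one group of two.
groups = {
    "g1": {"a", "b", "c"},
    "g2": {"c", "d"},
}
net = project(groups)
print(transitivity(net))
```

Every group with three or more members becomes a clique in the projection, which is one intuitive reason why projected networks tend to show many closed triangles.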
-
The role of bipartite structure in R&D collaboration networks
Authors:
D. Vasques Filho,
Dion R. J. O'Neale
Abstract:
A number of real-world networks are, in fact, one-mode projections of bipartite networks comprised of two types of nodes. For institutions engaging in collaboration for technological innovation, the underlying network is bipartite with institutions (agents) linked to the patents they have filed (artifacts), while the projection is the co-patenting network. Projected network topology is highly affected by the underlying bipartite structure, hence a lack of understanding of the bipartite network has consequences for the information that might be drawn from the one-mode co-patenting network. Here, we create an empirical bipartite network using data from 2.7 million patents. We project this network onto the agents (institutions) and look at properties of both the bipartite and projected networks that may play a role in knowledge sharing and collaboration. We compare these empirical properties to those of synthetic bipartite networks and their projections in order to understand the processes that might operate in the network formation. A good understanding of the topology is critical for investigating the potential flow of technological knowledge. We show how degree distributions and small cycles affect the topology of the one-mode projected network - specifically degree and clustering distributions, and assortativity. We propose new network-based metrics to quantify how collaborative agents are in the co-patenting network. We find that several large corporations are the most collaborative agents in the network; however, such organisations tend to have a low diversity of collaborators. In contrast, the most prolific institutions tend to collaborate relatively little but with a diverse set of collaborators. This indicates that they concentrate the knowledge of their core technical research, while seeking specific complementary knowledge via collaboration with smaller companies.
Submitted 4 May, 2020; v1 submitted 24 September, 2019;
originally announced September 2019.
-
Evolution of interdependent co-authorship and citation networks
Authors:
Chakresh Kr. Singh,
Demival Vasques Filho,
Shivakumar Jolad,
Dion R. J. O'Neale
Abstract:
Studies of bibliographic data suggest a strong correlation between the growth of citation networks and their corresponding co-authorship networks. We explore the interdependence between evolving citation and co-authorship networks focused on the publications, by Indian authors, in American Physical Society journals between 1970 and 2013. We record interactions between each possible pair of authors in two ways: first, by tracing the change in citations they exchanged and, second, by tracing the shortest path between authors in the co-authorship network. We create these data for every year of the period of our analysis. We use probability methods to quantify the correlation between citations and shortest paths, and the effect on the dynamics of the citation-co-authorship system. We find that author pairs who have a co-authorship distance $d \leq 3$ significantly affect each other's citations, but that this effect falls off rapidly for longer distances in the co-authorship network. The exchange of citations between pairs with $d=1$ exhibits a sudden increase at the time of first co-authorship events and decays thereafter, indicating an aging effect in collaboration. This suggests that the dynamics of the co-authorship network appear to be driving those of the citation network rather than vice versa. Moreover, the majority of citations received by most authors are due to reciprocal citations from current, or past, co-authors. We conclude that, in order to answer questions on the nature and dynamics of scientific collaboration, it is necessary to study both co-authorship and citation networks simultaneously.
Submitted 31 August, 2019;
originally announced September 2019.
-
Degree distributions of bipartite networks and their projections
Authors:
Demival Vasques Filho,
Dion R. J. O'Neale
Abstract:
Bipartite (two-mode) networks are important in the analysis of social and economic systems as they explicitly show conceptual links between different types of entities. However, applications of such networks often work with a projected (one-mode) version of the original bipartite network. The topology of the projected network, and the dynamics that take place on it, are highly dependent on the degree distributions of the two different node types from the original bipartite structure. To date, the interaction between the degree distributions of bipartite networks and their one-mode projections is well understood for only a few cases, or for networks that satisfy a restrictive set of assumptions. Here we show a broader analysis in order to fill the gap left by previous studies. We use the formalism of generating functions to prove that the degree distributions of both node types in the original bipartite network affect the degree distribution in the projected version. To support our analysis, we simulate several types of synthetic bipartite networks using a configuration model where node degrees are assigned from specific probability distributions, ranging from peaked to heavy-tailed distributions. Our findings show that when projecting a bipartite network onto a particular set of nodes, the degree distribution for the resulting one-mode network follows the distribution of the nodes being projected onto, but only so long as the degree distribution for the opposite set of nodes does not have a heavier tail. Furthermore, we show that bipartite degree distributions are not the only feature driving topology formation of projected networks, in contrast to what is commonly described in the literature.
Submitted 3 March, 2019; v1 submitted 13 February, 2018;
originally announced February 2018.
-
Power Law Distributions of Patents as Indicators of Innovation
Authors:
D. R. J. O'Neale,
S. C. Hendy
Abstract:
The total number of patents produced by a country (or the number of patents produced per capita) is often used as an indicator for innovation. Here we present evidence that the distribution of patents amongst applicants within many OECD countries is well-described by power laws with exponents that vary between 1.66 (Japan) and 2.37 (Poland). Using simulations based on simple preferential attachment-type rules that generate power laws, we find we can explain some of the variation in exponents between countries, with countries that have larger numbers of patents per applicant generally exhibiting smaller exponents in both the simulated and actual data. Similarly we find that the exponents for most countries are inversely correlated with other indicators of innovation, such as R&D intensity or the ubiquity of export baskets. This suggests that in more advanced economies, which tend to have smaller values of the exponent, a greater proportion of the total number of patents are filed by large companies than in less advanced countries.
Submitted 30 April, 2012;
originally announced April 2012.
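The abstract above does not say how the power-law exponents were estimated. As a hedged sketch of one common approach, the continuous maximum-likelihood estimator $\hat{\alpha} = 1 + n / \sum_i \ln(x_i / x_{\min})$ can be checked against synthetic data as follows. The function names, the choice of `xmin`, and the synthetic sample are assumptions made for illustration; real patent counts are discrete, so this continuous form would only be an approximation there.

```python
import math
import random


def powerlaw_mle(values, xmin):
    """Continuous maximum-likelihood estimate of a power-law exponent.

    Uses only observations at or above `xmin` (the assumed lower cut-off).
    """
    tail = [x for x in values if x >= xmin]
    if not tail:
        raise ValueError("no observations at or above xmin")
    log_sum = sum(math.log(x / xmin) for x in tail)
    if log_sum == 0:
        raise ValueError("all observations equal xmin; exponent is undefined")
    return 1.0 + len(tail) / log_sum


def sample_powerlaw(alpha, xmin, n, rng):
    """Draw n samples with density proportional to x**(-alpha) for x >= xmin."""
    # Inverse-transform sampling; 1 - rng.random() lies in (0, 1].
    return [xmin * (1.0 - rng.random()) ** (-1.0 / (alpha - 1.0)) for _ in range(n)]


# Synthetic check: samples drawn with exponent 2.0 should give an estimate near 2.0.
rng = random.Random(0)
synthetic = sample_powerlaw(alpha=2.0, xmin=1.0, n=20000, rng=rng)
estimate = powerlaw_mle(synthetic, xmin=1.0)
print(round(estimate, 2))
```

A smaller exponent means a heavier tail, which matches the abstract's reading that a greater share of patents is held by the largest applicants in countries with smaller exponents.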