Supporting novel biomedical research via multilayer collaboration networks
Authors:
Konstantin Kuzmin,
Xiaoyan Lu,
Partha Sarathi Mukherjee,
Juntao Zhuang,
Chris Gaiteri,
Boleslaw K Szymanski
Abstract:
The value of research containing novel combinations of molecules can be seen in many innovative and award-winning research programs. Despite calls to use innovative approaches to address common diseases, an increasing majority of research funding goes toward "safe" incremental research. Counteracting this trend by nurturing novel and potentially transformative scientific research is challenging because such research must be supported in competition with established research programs. Therefore, we propose a tool that helps to resolve the tension between safe but fundable research and high-risk but potentially transformational research. It does this by identifying hidden overlapping interests around novel molecular research topics. Specifically, it identifies paths of molecular interactions that connect research topics and hypotheses that would not typically be associated, and uses these paths as the basis for scientific collaboration. Because these collaborations are related to the scientists' present trajectory, they are low risk and can be initiated rapidly. Unlike most incremental steps, these collaborations have the potential for leaps in understanding, as they reposition research for novel disease applications. We demonstrate the use of this tool to identify scientists who could contribute to understanding the cellular role of genes with novel associations with Alzheimer's disease, which have not been thoroughly characterized, in part due to the funding emphasis on established research.
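The path-finding idea in this abstract can be illustrated with a minimal sketch. It is not the authors' tool: the function name, the placeholder molecule names, and the toy edge list are all hypothetical, and a plain breadth-first search stands in for whatever path search the paper actually uses. It looks for the shortest chain of molecular interactions linking the molecules studied by one researcher to those studied by another.

```python
from collections import deque


def shortest_interaction_path(edges, sources, targets):
    """Return the shortest path from any source molecule to any target
    molecule in an undirected interaction graph, or None if none exists."""
    # Build an undirected adjacency map from the edge list.
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)

    targets = set(targets)
    start = sorted(set(sources))  # sorted for reproducible results
    parent = {s: None for s in start}
    queue = deque(start)

    while queue:
        node = queue.popleft()
        if node in targets:
            # Walk the parent links back to a source molecule.
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for neighbor in sorted(graph.get(node, ())):
            if neighbor not in parent:
                parent[neighbor] = node
                queue.append(neighbor)
    return None


# Hypothetical toy data: placeholder names, not real interactions.
EDGES = [
    ("gene_a", "gene_b"),
    ("gene_b", "gene_c"),
    ("gene_c", "gene_d"),
    ("gene_x", "gene_y"),
]
RESEARCH_FOCUS = {
    "lab_1": ["gene_a"],
    "lab_2": ["gene_d"],
    "lab_3": ["gene_y"],
}

path = shortest_interaction_path(
    EDGES, RESEARCH_FOCUS["lab_1"], RESEARCH_FOCUS["lab_2"]
)
```

In this toy setup, a short path between two labs' molecules would suggest a possible shared interest, while a result of None means the two research areas are not connected in the interaction data.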
Submitted 28 October, 2016;
originally announced October 2016.
Identifying robust communities and multi-community nodes by combining top-down and bottom-up approaches to clustering
Authors:
Chris Gaiteri,
Mingming Chen,
Boleslaw Szymanski,
Konstantin Kuzmin,
Jierui Xie,
Changkyu Lee,
Timothy Blanche,
Elias Chaibub Neto,
Su-Chun Huang,
Thomas Grabowski,
Tara Madhyastha,
Vitalina Komashko
Abstract:
Biological functions are carried out by groups of interacting molecules, cells or tissues, known as communities. Membership in these communities may overlap when biological components are involved in multiple functions. However, traditional clustering methods detect non-overlapping communities. These detected communities may also be unstable and difficult to replicate, because traditional methods are sensitive to noise and parameter settings. These aspects of traditional clustering methods limit our ability to detect biological communities, and therefore our ability to understand biological functions.
To address these limitations and detect robust overlapping biological communities, we propose an unorthodox clustering method called SpeakEasy, which identifies communities using top-down and bottom-up approaches simultaneously. Specifically, nodes join communities based on their local connections, as well as global information about the network structure. This method can quantify the stability of each community, automatically identify the number of communities, and quickly cluster networks with hundreds of thousands of nodes.
SpeakEasy shows top performance on synthetic clustering benchmarks and accurately identifies meaningful biological communities in a range of datasets, including gene microarrays, protein interactions, sorted cell populations, electrophysiology, and fMRI brain imaging.
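The combination of local connections and global network information described above can be illustrated with a simplified label-propagation sketch. This is not the SpeakEasy implementation: the scoring rule (observed neighbor count minus the count expected from global label frequency), the tie-break, and the toy graph are assumptions chosen for illustration, and the sketch yields non-overlapping communities only.

```python
from collections import Counter


def propagate_labels(adjacency, max_rounds=20):
    """Assign community labels by repeated local voting, corrected by
    how common each label is across the whole network."""
    # Bottom-up start: every node begins in its own community.
    labels = {node: node for node in adjacency}
    total = len(labels)

    for _ in range(max_rounds):
        # Top-down information: global label frequencies, taken once
        # at the start of the round (a simplification).
        global_freq = Counter(labels.values())
        changed = False

        for node in sorted(adjacency):
            neighbors = adjacency[node]
            if not neighbors:
                continue
            local = Counter(labels[n] for n in neighbors)

            def score(label):
                # Observed count among neighbors minus the count expected
                # if labels were spread according to global frequency.
                expected = len(neighbors) * global_freq[label] / total
                return local[label] - expected

            # Ties are broken by the larger label, only for reproducibility.
            best = max(local, key=lambda lab: (score(lab), lab))
            if best != labels[node]:
                labels[node] = best
                changed = True

        if not changed:
            break
    return labels


# Hypothetical toy network: two triangles joined by the edge 2-3.
TOY_NETWORK = {
    0: [1, 2],
    1: [0, 2],
    2: [0, 1, 3],
    3: [2, 4, 5],
    4: [3, 5],
    5: [3, 4],
}

communities = propagate_labels(TOY_NETWORK)
```

On this toy network the two triangles end up with different labels. The global correction penalizes labels that are already widespread, which is one way to discourage a single label from absorbing the whole graph.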
Submitted 25 February, 2015; v1 submitted 19 January, 2015;
originally announced January 2015.