-
A systematic review of guidelines for the use of race, ethnicity, and ancestry reveals widespread consensus but also points of ongoing disagreement
Authors:
Madelyn Mauro,
Danielle S. Allen,
Bege Dauda,
Santiago J. Molina,
Benjamin M. Neale,
Anna C. F. Lewis
Abstract:
The use of population descriptors like race, ethnicity, and ancestry in science, medicine and public health has a long, complicated, and at times dark history, particularly for genetics, given the field's perceived importance for understanding between-group differences. The historical and potential harms that come with irresponsible use of these categories suggests a clear need for definitive guid…
▽ More
The use of population descriptors like race, ethnicity, and ancestry in science, medicine and public health has a long, complicated, and at times dark history, particularly for genetics, given the field's perceived importance for understanding between-group differences. The historical and potential harms that come with irresponsible use of these categories suggests a clear need for definitive guidance about when and how they can be used appropriately. However, while many prior authors have provided such guidance, no established consensus exists, and the extant literature has not been examined for implied consensus and sources of disagreement. Here we present the results of a systematic review of published normative recommendations regarding the use of population categories, particularly in genetics research. Following PRISMA guidelines, we extracted recommendations from n=121 articles matching inclusion criteria. Articles were published consistently throughout the time period examined and in a broad range of journals, demonstrating an ongoing and interdisciplinary perceived need for guidance. Examined recommendations fall under one of eight themes identified during analysis. Seven are characterized by broad agreement across articles; one, Appropriate definitions of population categories and contexts for use, revealed substantial fundamental disagreement among articles. While many articles focus on the inappropriate use of race, none fundamentally problematize ancestry. This work can be a resource to researchers looking for normative guidance on the use of population descriptors, and can orient authors of future guidelines to this complex field, contributing to the development of more effective future guidelines for genetics research.
△ Less
Submitted 21 April, 2022;
originally announced April 2022.
-
Getting Genetic Ancestry Right for Science and Society
Authors:
Anna C. F. Lewis,
Santiago J. Molina,
Paul S Appelbaum,
Bege Dauda,
Anna Di Rienzo,
Agustin Fuentes,
Stephanie M. Fullerton,
Nanibaa' A. Garrison,
Nayanika Ghosh,
Evelynn M. Hammonds,
David S. Jones,
Eimear E. Kenny,
Peter Kraft,
Sandra S. -J. Lee,
Madelyn Mauro,
John Novembre,
Aaron Panofsky,
Mashaal Sohail,
Benjamin M. Neale,
Danielle S. Allen
Abstract:
There is a scientific and ethical imperative to embrace a multidimensional, continuous view of ancestry and move away from continental ancestry categories
There is a scientific and ethical imperative to embrace a multidimensional, continuous view of ancestry and move away from continental ancestry categories
△ Less
Submitted 14 October, 2021; v1 submitted 12 October, 2021;
originally announced October 2021.
-
SIG-DB: leveraging homomorphic encryption to Securely Interrogate privately held Genomic DataBases
Authors:
Alexander J. Titus,
Audrey Flower,
Patrick Hagerty,
Paul Gamble,
Charlie Lewis,
Todd Stavish,
Kevin P. OConnell,
Greg Shipley,
Stephanie M. Rogers
Abstract:
Genomic data are becoming increasingly valuable as we develop methods to utilize the information at scale and gain a greater understanding of how genetic information relates to biological function. Advances in synthetic biology and the decreased cost of sequencing are increasing the amount of privately held genomic data. As the quantity and value of private genomic data grows, so does the incentiv…
▽ More
Genomic data are becoming increasingly valuable as we develop methods to utilize the information at scale and gain a greater understanding of how genetic information relates to biological function. Advances in synthetic biology and the decreased cost of sequencing are increasing the amount of privately held genomic data. As the quantity and value of private genomic data grows, so does the incentive to acquire and protect such data, which creates a need to store and process these data securely. We present an algorithm for the Secure Interrogation of Genomic DataBases (SIG-DB). The SIG-DB algorithm enables databases of genomic sequences to be searched with an encrypted query sequence without revealing the query sequence to the Database Owner or any of the database sequences to the Querier. SIG-DB is the first application of its kind to take advantage of locality-sensitive hashing and homomorphic encryption to allow generalized sequence-to-sequence comparisons of genomic data.
△ Less
Submitted 26 March, 2018;
originally announced March 2018.
-
Monodisperse self-assembly in a model with protein-like interactions
Authors:
Alex W. Wilber,
Jonathan P. K. Doye,
Ard A. Louis,
Anna C. F. Lewis
Abstract:
We study the self-assembly behaviour of patchy particles with `protein-like' interactions that can be considered as a minimal model for the assembly of viral capsids and other shell-like protein complexes. We thoroughly explore the thermodynamics and dynamics of self assembly as a function of the parameters of the model and find robust assembly of all target structures considered. Optimal assemb…
▽ More
We study the self-assembly behaviour of patchy particles with `protein-like' interactions that can be considered as a minimal model for the assembly of viral capsids and other shell-like protein complexes. We thoroughly explore the thermodynamics and dynamics of self assembly as a function of the parameters of the model and find robust assembly of all target structures considered. Optimal assembly occurs in the region of parameter space where a free energy barrier regulates the rate of nucleation, thus preventing the premature exhaustion of the supply of monomers that can lead to the formation of incomplete shells. The interactions also need to be specific enough to prevent the assembly of malformed shells, but whilst maintaining kinetic accessibility. Free-energy landscapes computed for our model have a funnel-like topography guiding the system to form the target structure, and show that the torsional component of the interparticle interactions prevents the formation of disordered aggregates that would otherwise act as kinetic traps.
△ Less
Submitted 27 July, 2009;
originally announced July 2009.
-
The Function of Communities in Protein Interaction Networks at Multiple Scales
Authors:
Anna C. F. Lewis,
Nick S. Jones,
Mason A. Porter,
Charlotte M. Deane
Abstract:
Background: If biology is modular then clusters, or communities, of proteins derived using only protein interaction network structure should define protein modules with similar biological roles. We investigate the link between biological modules and network communities in yeast and its relationship to the scale at which we probe the network.
Results: Our results demonstrate that the functional…
▽ More
Background: If biology is modular then clusters, or communities, of proteins derived using only protein interaction network structure should define protein modules with similar biological roles. We investigate the link between biological modules and network communities in yeast and its relationship to the scale at which we probe the network.
Results: Our results demonstrate that the functional homogeneity of communities depends on the scale selected, and that almost all proteins lie in a functionally homogeneous community at some scale. We judge functional homogeneity using a novel test and three independent characterizations of protein function, and find a high degree of overlap between these measures. We show that a high mean clustering coefficient of a community can be used to identify those that are functionally homogeneous. By tracing the community membership of a protein through multiple scales we demonstrate how our approach could be useful to biologists focusing on a particular protein.
Conclusions: We show that there is no one scale of interest in the community structure of the yeast protein interaction network, but we can identify the range of resolution parameters that yield the most functionally coherent communities, and predict which communities are most likely to be functionally homogeneous.
△ Less
Submitted 12 March, 2010; v1 submitted 6 April, 2009;
originally announced April 2009.
-
The self-assembly and evolution of homomeric protein complexes
Authors:
Gabriel Villar,
Alex W. Wilber,
Alex J. Williamson,
Parvinder Thiara,
Jonathan P. K. Doye,
Ard A. Louis,
Mara N. Jochum,
Anna C. F. Lewis,
Emmanuel D. Levy
Abstract:
We introduce a simple "patchy particle" model to study the thermodynamics and dynamics of self-assembly of homomeric protein complexes. Our calculations allow us to rationalize recent results for dihedral complexes. Namely, why evolution of such complexes naturally takes the system into a region of interaction space where (i) the evolutionarily newer interactions are weaker, (ii) subcomplexes in…
▽ More
We introduce a simple "patchy particle" model to study the thermodynamics and dynamics of self-assembly of homomeric protein complexes. Our calculations allow us to rationalize recent results for dihedral complexes. Namely, why evolution of such complexes naturally takes the system into a region of interaction space where (i) the evolutionarily newer interactions are weaker, (ii) subcomplexes involving the stronger interactions are observed to be thermodynamically stable on destabilization of the protein-protein interactions and (iii) the self-assembly dynamics are hierarchical with these same subcomplexes acting as kinetic intermediates.
△ Less
Submitted 22 November, 2008;
originally announced November 2008.