-
Latent Feature Representation via Unsupervised Learning for Pattern Discovery in Massive Electron Microscopy Image Volumes
Authors:
Gary B Huang,
Huei-Fang Yang,
Shin-ya Takemura,
Pat Rivlin,
Stephen M Plaza
Abstract:
We propose a method to facilitate exploration and analysis of new large data sets. In particular, we give an unsupervised deep learning approach to learning a latent representation that captures semantic similarity in the data set. The core idea is to use data augmentations that preserve semantic meaning to generate synthetic examples of elements whose feature representations should be close to one another.
We demonstrate the utility of our method applied to nano-scale electron microscopy data, where even relatively small portions of animal brains can require terabytes of image data. Although supervised methods can be used to predict and identify known patterns of interest, the scale of the data makes it difficult to mine and analyze patterns that are not known a priori. We show the ability of our learned representation to enable query by example, so that if a scientist notices an interesting pattern in the data, they can be presented with other locations with matching patterns. We also demonstrate that clustering of data in the learned space correlates with biologically-meaningful distinctions. Finally, we introduce a visualization tool and software ecosystem to facilitate user-friendly interactive analysis and uncover interesting biological patterns. In short, our work opens possible new avenues in understanding of and discovery in large data sets, arising in domains such as EM analysis.
Submitted 22 December, 2020;
originally announced December 2020.
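The query-by-example idea in the abstract above can be illustrated with a minimal sketch. It assumes each image patch has already been mapped to a feature vector by some learned encoder, and that similarity is measured with cosine similarity; the abstract specifies neither, so the function names, the toy vectors, and the choice of metric are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def query_by_example(features, query_index, k=3):
    """Return indices of the k vectors most similar to the query vector."""
    query = features[query_index]
    scored = [
        (cosine_similarity(query, vec), idx)
        for idx, vec in enumerate(features)
        if idx != query_index  # do not return the query itself
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [idx for _, idx in scored[:k]]


# Hypothetical latent vectors: rows 0-2 point one way, rows 3-4 another.
features = [
    [1.00, 0.02, 0.00],
    [0.98, 0.05, 0.01],
    [0.97, 0.00, 0.03],
    [0.01, 1.00, 0.02],
    [0.03, 0.99, 0.00],
]
matches = query_by_example(features, query_index=0, k=2)
print(matches)
```

At the terabyte scale the abstract describes, an exhaustive scan like this would likely be replaced by an approximate nearest-neighbor index; the linear scan is used here only to keep the idea visible.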
-
Fully-Automatic Synapse Prediction and Validation on a Large Data Set
Authors:
Gary B. Huang,
Louis K. Scheffer,
Stephen M. Plaza
Abstract:
Extracting a connectome from an electron microscopy (EM) data set requires identification of neurons and determination of synapses between neurons. As manual extraction of this information is very time-consuming, there has been extensive research effort to automatically segment the neurons to help guide and eventually replace manual tracing. Until recently, there has been comparatively less research on automatically detecting the actual synapses between neurons. This discrepancy can, in part, be attributed to several factors: obtaining neuronal shapes is a prerequisite first step in extracting a connectome, manual tracing is much more time-consuming than annotating synapses, and neuronal contact area can be used as a proxy for synapses in determining connections.
However, recent research has demonstrated that contact area alone is not a sufficient predictor of synaptic connection. Moreover, as segmentation has improved, we have observed that synapse annotation is consuming a more significant fraction of overall reconstruction time. This ratio will only get worse as segmentation improves, gating overall possible speed-up. Therefore, we address this problem by developing algorithms that automatically detect pre-synaptic neurons and their post-synaptic partners. In particular, pre-synaptic structures are detected using a Deep and Wide Multiscale Recursive Network, and post-synaptic partners are detected using an MLP with features conditioned on the local segmentation.
This work is novel because it requires a minimal amount of training, leverages advances in image segmentation directly, and provides a complete solution for polyadic synapse detection. We further introduce novel metrics to evaluate our algorithm on connectomes of meaningful size. These metrics demonstrate that complete automatic prediction can be used to effectively characterize most connectivity correctly.
Submitted 11 April, 2016;
originally announced April 2016.
-
Large-Scale Electron Microscopy Image Segmentation in Spark
Authors:
Stephen M. Plaza,
Stuart E. Berg
Abstract:
The emerging field of connectomics aims to unlock the mysteries of the brain by understanding the connectivity between neurons. To map this connectivity, we acquire thousands of electron microscopy (EM) images with nanometer-scale resolution. After aligning these images, the resulting dataset has the potential to reveal the shapes of neurons and the synaptic connections between them. However, imaging the brain of even a tiny organism like the fruit fly yields terabytes of data. It can take years of manual effort to examine such image volumes and trace their neuronal connections. One solution is to apply image segmentation algorithms to help automate the tracing tasks. In this paper, we propose a novel strategy to apply such segmentation on very large datasets that exceed the capacity of a single machine. Our solution is robust to potential segmentation errors which could otherwise severely compromise the quality of the overall segmentation, for example those due to poor classifier generalizability or anomalies in the image dataset. We implement our algorithms in a Spark application which minimizes disk I/O, and apply them to a few large EM datasets, revealing both their effectiveness and scalability. We hope this work will encourage external contributions to EM segmentation by providing 1) a flexible plugin architecture that deploys easily on different cluster environments and 2) an in-memory representation of segmentation that could be conducive to new advances.
Submitted 1 April, 2016;
originally announced April 2016.
-
Large-scale EM Analysis of the Drosophila Antennal Lobe with Automatically Computed Synapse Point Clouds
Authors:
Ting Zhao,
Shin-ya Takemura,
Gary B. Huang,
Jane Anne Horne,
William T. Katz,
Kazunori Shinomiya,
Louis K. Scheffer,
Ian A. Meinertzhagen,
Patricia K. Rivlin,
Stephen M. Plaza
Abstract:
The promise of extracting connectomes and performing useful analysis on large electron microscopy (EM) datasets has been an elusive dream for many years. Tracing in even the smallest portions of neuropil requires copious human annotation, the rate-limiting step for generating a connectome. While a combination of improved imaging and automatic segmentation will lead to the analysis of increasingly large volumes, machines still fail to reach the quality of human tracers. Unfortunately, small errors in image segmentation can lead to catastrophic distortions of the connectome.
In this paper, to analyze very large datasets, we explore different mechanisms that are less sensitive to errors in automation. Namely, we advocate and deploy extensive synapse detection on the entire antennal lobe (AL) neuropil in the brain of the fruit fly Drosophila, a region much larger than any densely annotated to date. The resulting synapse point cloud produced is invaluable for determining compartment boundaries in the AL and choosing specific regions for subsequent analysis. We introduce our methodology in this paper for region selection and show both manual and automatic synapse annotation results. Finally, we note the correspondence between image datasets obtained using the synaptic marker, antibody nc82, and our datasets enabling registration between light and EM image modalities.
Submitted 25 August, 2015;
originally announced August 2015.
-
Automatic Neuron Type Identification by Neurite Localization in the Drosophila Medulla
Authors:
Ting Zhao,
Stephen M Plaza
Abstract:
Mapping the connectivity of neurons in the brain (i.e., connectomics) is a challenging problem due to both the number of connections in even the smallest organisms and the nanometer resolution required to resolve them. Because of this, previous connectomes contain only hundreds of neurons, such as in the C. elegans connectome. Recent technological advances will unlock the mysteries of increasingly large connectomes (or partial connectomes). However, the value of these maps is limited by our ability to reason with this data and understand any underlying motifs. To aid connectome analysis, we introduce algorithms to cluster similarly-shaped neurons, where 3D neuronal shapes are represented as skeletons. In particular, we propose a novel location-sensitive clustering algorithm. We show clustering results on neurons reconstructed from the Drosophila medulla that show high accuracy.
Submitted 5 September, 2014;
originally announced September 2014.
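As a rough illustration of what comparing skeletons by where their neurites lie could look like, the sketch below summarizes each skeleton as a histogram of node depths and compares the histograms. This is not the algorithm from the paper, which the abstract does not describe; the depth-histogram feature, the L1 distance, and all names are assumptions made for this example.

```python
def depth_histogram(skeleton, num_bins, depth_min, depth_max):
    """Fraction of skeleton nodes in each depth bin, using the z coordinate."""
    counts = [0] * num_bins
    width = (depth_max - depth_min) / num_bins
    for _, _, z in skeleton:
        # Clamp so nodes on or outside the boundary fall in an end bin.
        bin_index = int((z - depth_min) / width)
        bin_index = max(0, min(bin_index, num_bins - 1))
        counts[bin_index] += 1
    total = len(skeleton)
    return [count / total for count in counts]


def histogram_distance(h1, h2):
    """L1 distance between two histograms (0 = identical, 2 = disjoint)."""
    return sum(abs(a - b) for a, b in zip(h1, h2))


# Hypothetical skeletons as lists of (x, y, z) nodes.
# neuron_a and neuron_b branch at shallow depth; neuron_c branches deep.
neuron_a = [(0, 0, 1), (1, 0, 2), (2, 1, 2), (2, 2, 3)]
neuron_b = [(5, 5, 1), (6, 5, 2), (6, 6, 3), (7, 6, 3)]
neuron_c = [(0, 0, 7), (1, 1, 8), (1, 2, 9), (2, 2, 9)]

hist_a = depth_histogram(neuron_a, num_bins=5, depth_min=0, depth_max=10)
hist_b = depth_histogram(neuron_b, num_bins=5, depth_min=0, depth_max=10)
hist_c = depth_histogram(neuron_c, num_bins=5, depth_min=0, depth_max=10)

print(histogram_distance(hist_a, hist_b))
print(histogram_distance(hist_a, hist_c))
```

Pairwise distances like these could then be passed to any standard clustering routine. The toy feature deliberately ignores lateral position, so neuron_a and neuron_b compare as similar despite being offset in x and y.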
-
Annotating Synapses in Large EM Datasets
Authors:
Stephen M. Plaza,
Toufiq Parag,
Gary B. Huang,
Donald J. Olbris,
Mathew A. Saunders,
Patricia K. Rivlin
Abstract:
Reconstructing neuronal circuits at the level of synapses is a central problem in neuroscience and becoming a focus of the emerging field of connectomics. To date, electron microscopy (EM) is the most proven technique for identifying and quantifying synaptic connections. As advances in EM make acquiring larger datasets possible, subsequent manual synapse identification (i.e., proofreading) for deciphering a connectome becomes a major time bottleneck. Here we introduce a large-scale, high-throughput, and semi-automated methodology to efficiently identify synapses. We successfully applied our methodology to the Drosophila medulla optic lobe, annotating many more synapses than previous connectome efforts. Our approaches are extensible and will make the often complicated process of synapse identification accessible to a wider community of potential proofreaders.
Submitted 4 December, 2014; v1 submitted 5 September, 2014;
originally announced September 2014.
-
Focused Proofreading: Efficiently Extracting Connectomes from Segmented EM Images
Authors:
Stephen M. Plaza
Abstract:
Identifying complex neural circuitry from electron microscopic (EM) images may help unlock the mysteries of the brain. However, identifying this circuitry requires time-consuming, manual tracing (proofreading) due to the size and intricacy of these image datasets, thus limiting state-of-the-art analysis to very small brain regions. Potential avenues to improve scalability include automatic image segmentation and crowd sourcing, but current efforts have had limited success. In this paper, we propose a new strategy, focused proofreading, that works with automatic segmentation and aims to limit proofreading to the regions of a dataset that are most impactful to the resulting circuit. We then introduce a novel workflow, which exploits biological information such as synapses, and apply it to a large dataset in the fly optic lobe. With our techniques, we achieve significant tracing speedups of 3-5x without sacrificing the quality of the resulting circuit. Furthermore, our methodology makes the task of proofreading much more accessible and hence potentially enhances the effectiveness of crowd sourcing.
Submitted 3 September, 2014;
originally announced September 2014.
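The idea of limiting proofreading to the decisions that matter most for the circuit can be sketched as a simple priority queue. The scoring rule below (segmentation uncertainty multiplied by the smaller synapse count of the two segments) is an assumption chosen for illustration, not the measure used in the paper; the class and function names are likewise hypothetical.

```python
from dataclasses import dataclass


@dataclass
class CandidateEdge:
    """A possible merge between two adjacent segments (hypothetical schema)."""
    segment_a: str
    segment_b: str
    merge_probability: float  # classifier confidence that the two should merge
    synapses_a: int           # synapses annotated on segment_a
    synapses_b: int           # synapses annotated on segment_b


def impact_score(edge):
    """Assumed heuristic: uncertain decisions that involve many synapses rank first."""
    uncertainty = min(edge.merge_probability, 1.0 - edge.merge_probability)
    return uncertainty * min(edge.synapses_a, edge.synapses_b)


def proofreading_queue(edges, budget):
    """Return the `budget` edges with the highest assumed impact on the circuit."""
    ranked = sorted(edges, key=impact_score, reverse=True)
    return ranked[:budget]


edges = [
    CandidateEdge("s1", "s2", 0.50, 40, 30),  # uncertain, many synapses
    CandidateEdge("s3", "s4", 0.95, 50, 60),  # confident, many synapses
    CandidateEdge("s5", "s6", 0.50, 1, 0),    # uncertain, almost no synapses
    CandidateEdge("s7", "s8", 0.60, 20, 25),  # fairly uncertain, some synapses
]
queue = proofreading_queue(edges, budget=2)
for edge in queue:
    print(edge.segment_a, edge.segment_b, impact_score(edge))
```

Under this heuristic, a confident decision or one between segments with few synapses is skipped even when its neighbors are large, which is the sense in which the review effort is "focused".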