-
Training-Free Guidance for Discrete Diffusion Models for Molecular Generation
Authors:
Thomas J. Kerby,
Kevin R. Moon
Abstract:
Training-free guidance methods for continuous data have seen an explosion of interest due to the fact that they enable foundation diffusion models to be paired with interchangable guidance models. Currently, equivalent guidance methods for discrete diffusion models are unknown. We present a framework for applying training-free guidance to discrete data and demonstrate its utility on molecular grap…
▽ More
Training-free guidance methods for continuous data have seen an explosion of interest due to the fact that they enable foundation diffusion models to be paired with interchangable guidance models. Currently, equivalent guidance methods for discrete diffusion models are unknown. We present a framework for applying training-free guidance to discrete data and demonstrate its utility on molecular graph generation tasks using the discrete diffusion model architecture of DiGress. We pair this model with guidance functions that return the proportion of heavy atoms that are a specific atom type and the molecular weight of the heavy atoms and demonstrate our method's ability to guide the data generation.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
3D Graph Contrastive Learning for Molecular Property Prediction
Authors:
Kisung Moon,
Sunyoung Kwon
Abstract:
Self-supervised learning (SSL) is a method that learns the data representation by utilizing supervision inherent in the data. This learning method is in the spotlight in the drug field, lacking annotated data due to time-consuming and expensive experiments. SSL using enormous unlabeled data has shown excellent performance for molecular property prediction, but a few issues exist. (1) Existing SSL…
▽ More
Self-supervised learning (SSL) is a method that learns the data representation by utilizing supervision inherent in the data. This learning method is in the spotlight in the drug field, lacking annotated data due to time-consuming and expensive experiments. SSL using enormous unlabeled data has shown excellent performance for molecular property prediction, but a few issues exist. (1) Existing SSL models are large-scale; there is a limitation to implementing SSL where the computing resource is insufficient. (2) In most cases, they do not utilize 3D structural information for molecular representation learning. The activity of a drug is closely related to the structure of the drug molecule. Nevertheless, most current models do not use 3D information or use it partially. (3) Previous models that apply contrastive learning to molecules use the augmentation of permuting atoms and bonds. Therefore, molecules having different characteristics can be in the same positive samples. We propose a novel contrastive learning framework, small-scale 3D Graph Contrastive Learning (3DGCL) for molecular property prediction, to solve the above problems. 3DGCL learns the molecular representation by reflecting the molecule's structure through the pre-training process that does not change the semantics of the drug. Using only 1,128 samples for pre-train data and 1 million model parameters, we achieved the state-of-the-art or comparable performance in four regression benchmark datasets. Extensive experiments demonstrate that 3D structural information based on chemical knowledge is essential to molecular representation learning for property prediction.
△ Less
Submitted 18 August, 2022; v1 submitted 31 May, 2022;
originally announced August 2022.
-
Coarse Graining of Data via Inhomogeneous Diffusion Condensation
Authors:
Nathan Brugnone,
Alex Gonopolskiy,
Mark W. Moyle,
Manik Kuchroo,
David van Dijk,
Kevin R. Moon,
Daniel Colon-Ramos,
Guy Wolf,
Matthew J. Hirn,
Smita Krishnaswamy
Abstract:
Big data often has emergent structure that exists at multiple levels of abstraction, which are useful for characterizing complex interactions and dynamics of the observations. Here, we consider multiple levels of abstraction via a multiresolution geometry of data points at different granularities. To construct this geometry we define a time-inhomogeneous diffusion process that effectively condense…
▽ More
Big data often has emergent structure that exists at multiple levels of abstraction, which are useful for characterizing complex interactions and dynamics of the observations. Here, we consider multiple levels of abstraction via a multiresolution geometry of data points at different granularities. To construct this geometry we define a time-inhomogeneous diffusion process that effectively condenses data points together to uncover nested groupings at larger and larger granularities. This inhomogeneous process creates a deep cascade of intrinsic low pass filters on the data affinity graph that are applied in sequence to gradually eliminate local variability while adjusting the learned data geometry to increasingly coarser resolutions. We provide visualizations to exhibit our method as a continuously-hierarchical clustering with directions of eliminated variation highlighted at each step. The utility of our algorithm is demonstrated via neuronal data condensation, where the constructed multiresolution data geometry uncovers the organization, grouping, and connectivity between neurons.
△ Less
Submitted 9 March, 2020; v1 submitted 9 July, 2019;
originally announced July 2019.
-
Making Sense of Consciousness as Integrated Information: Evolution and Issues of IIT
Authors:
Kyumin Moon,
Hongju Pae
Abstract:
The purpose of this article is to provide an overall critical appraisal of Integrated Information Theory(IIT) of consciousness. We explore how it has evolved and what problems are involved in the theory. IIT is a hypothesis that consciousness can be explained in terms of integrated information. It argues that a number of fundamental properties of experience can be properly analyzed and explained b…
▽ More
The purpose of this article is to provide an overall critical appraisal of Integrated Information Theory(IIT) of consciousness. We explore how it has evolved and what problems are involved in the theory. IIT is a hypothesis that consciousness can be explained in terms of integrated information. It argues that a number of fundamental properties of experience can be properly analyzed and explained by physical systems' informational properties. Throughout the last decade, there have been many advances in IIT's theoretical structure and mathematical model. In addition, like all hypotheses in the field of science of consciousness, IIT has given rise to several controversies and issues. In this context, a critical survey for IIT is urgently needed. To this end, we first introduce fundamental concepts of IIT and related issues. Thereafter, we discuss major transitions IIT has been through and point out related intra-model issues. Finally, in the last section, some theoretical, extra-model issues involved in IIT's principles are presented. The article concludes by suggesting that, for the sake of future development, IIT should more seriously take metacognitive accessibility to experience.
△ Less
Submitted 30 September, 2018; v1 submitted 29 June, 2018;
originally announced July 2018.
-
The intrinsic value of HFO features as a biomarker of epileptic activity
Authors:
Stephen V. Gliske,
Kevin R. Moon,
William C. Stacey,
Alfred O. Hero III
Abstract:
High frequency oscillations (HFOs) are a promising biomarker of epileptic brain tissue and activity. HFOs additionally serve as a prototypical example of challenges in the analysis of discrete events in high-temporal resolution, intracranial EEG data. Two primary challenges are 1) dimensionality reduction, and 2) assessing feasibility of classification. Dimensionality reduction assumes that the da…
▽ More
High frequency oscillations (HFOs) are a promising biomarker of epileptic brain tissue and activity. HFOs additionally serve as a prototypical example of challenges in the analysis of discrete events in high-temporal resolution, intracranial EEG data. Two primary challenges are 1) dimensionality reduction, and 2) assessing feasibility of classification. Dimensionality reduction assumes that the data lie on a manifold with dimension less than that of the feature space. However, previous HFO analyses have assumed a linear manifold, global across time, space (i.e. recording electrode/channel), and individual patients. Instead, we assess both a) whether linear methods are appropriate and b) the consistency of the manifold across time, space, and patients. We also estimate bounds on the Bayes classification error to quantify the distinction between two classes of HFOs (those occurring during seizures and those occurring due to other processes). This analysis provides the foundation for future clinical use of HFO features and buides the analysis for other discrete events, such as individual action potentials or multi-unit activity.
△ Less
Submitted 12 October, 2015;
originally announced October 2015.