Skip to main content

Showing 1–50 of 78 results for author: Park, J

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2506.08059  [pdf, ps, other

    q-bio.QM cs.AI cs.LG

    CaliciBoost: Performance-Driven Evaluation of Molecular Representations for Caco-2 Permeability Prediction

    Authors: Huong Van Le, Weibin Ren, Junhong Kim, Yukyung Yun, Young Bin Park, Young Jun Kim, Bok Kyung Han, Inho Choi, Jong IL Park, Hwi-Yeol Yun, Jae-Mun Choi

    Abstract: Caco-2 permeability serves as a critical in vitro indicator for predicting the oral absorption of drug candidates during early-stage drug discovery. To enhance the accuracy and efficiency of computational predictions, we systematically investigated the impact of eight molecular feature representation types including 2D/3D descriptors, structural fingerprints, and deep learning-based embeddings com… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: 49 pages, 11 figures

  2. arXiv:2505.22134  [pdf

    q-bio.PE

    Infection dynamics for fluctuating infection or removal rates regarding the number of infected and susceptible individuals

    Authors: Seong Jun Park, M. Y. Choi

    Abstract: In general, the rates of infection and removal (whether through recovery or death) are nonlinear functions of the number of infected and susceptible individuals. One of the simplest models for the spread of infectious diseases is the SIR model, which categorizes individuals as susceptible, infectious, recovered or deceased. In this model, the infection rate, governing the transition from susceptib… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  3. arXiv:2504.03732  [pdf, other

    cs.AR cs.DC q-bio.GN

    SAGe: A Lightweight Algorithm-Architecture Co-Design for Mitigating the Data Preparation Bottleneck in Large-Scale Genome Analysis

    Authors: Nika Mansouri Ghiasi, Talu Güloglu, Harun Mustafa, Can Firtina, Konstantina Koliogeorgi, Konstantinos Kanellopoulos, Haiyu Mao, Rakesh Nadig, Mohammad Sadrosadati, Jisung Park, Onur Mutlu

    Abstract: Given the exponentially growing volumes of genomic data, there are extensive efforts to accelerate genome analysis. We demonstrate a major bottleneck that greatly limits and diminishes the benefits of state-of-the-art genome analysis accelerators: the data preparation bottleneck, where genomic data is stored in compressed form and needs to be decompressed and formatted first before an accelerator… ▽ More

    Submitted 21 April, 2025; v1 submitted 31 March, 2025; originally announced April 2025.

  4. arXiv:2503.20767  [pdf, ps, other

    cs.LG q-bio.QM stat.ML

    Reliable algorithm selection for machine learning-guided design

    Authors: Clara Fannjiang, Ji Won Park

    Abstract: Algorithms for machine learning-guided design, or design algorithms, use machine learning-based predictions to propose novel objects with desired property values. Given a new design task -- for example, to design novel proteins with high binding affinity to a therapeutic target -- one must choose a design algorithm and specify any hyperparameters and predictive and/or generative models involved. H… ▽ More

    Submitted 2 July, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

    Comments: ICML 2025

  5. arXiv:2503.19924  [pdf, other

    q-bio.NC nlin.AO

    EEG relative phase-based analysis unveils the complexity and universality of human brain dynamics: integrative insights from general anesthesia and ADHD

    Authors: Athokpam Langlen Chanu, Youngjai Park, Younghwa Cha, UnCheol Lee, Joon-Young Moon, Jong-Min Park

    Abstract: Understanding brain wave patterns is fundamental to uncovering neural information processing mechanisms, making quantifying complexity across brain states an important line of investigation. We present a comprehensive analysis of the complexity of electroencephalography (EEG) signals, integrating data from seven distinct states experienced by participants undergoing general anesthesia, and resting… ▽ More

    Submitted 21 April, 2025; v1 submitted 12 March, 2025; originally announced March 2025.

  6. arXiv:2502.04892  [pdf, other

    cs.LG q-bio.NC stat.ML

    A Foundational Brain Dynamics Model via Stochastic Optimal Control

    Authors: Joonhyeong Park, Byoungwoo Park, Chang-Bae Bang, Jungwon Choi, Hyungjin Chung, Byung-Hoon Kim, Juho Lee

    Abstract: We introduce a foundational model for brain dynamics that utilizes stochastic optimal control (SOC) and amortized inference. Our method features a continuous-discrete state space model (SSM) that can robustly handle the intricate and noisy nature of fMRI signals. To address computational limitations, we implement an approximation strategy grounded in the SOC framework. Additionally, we present a s… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: The first two authors contributed equally

  7. arXiv:2501.15208  [pdf

    q-bio.MN

    Advancing Understanding of Long COVID Pathophysiology Through Quantum Walk-Based Network Analysis

    Authors: Jaesub Park, Woochang Hwang, Seokjun Lee, Hyun Chang Lee, Méabh MacMahon, Matthias Zilbauer, Namshik Han

    Abstract: Long COVID is a multisystem condition characterized by persistent symptoms such as fatigue, cognitive impairment, and systemic inflammation, following COVID-19 infection, yet its mechanisms remain poorly understood. In this study, we applied quantum walk (QW), a computational approach leveraging quantum interference, to explore large-scale SARS-CoV-2-induced protein (SIP) networks. Compared to the… ▽ More

    Submitted 29 January, 2025; v1 submitted 25 January, 2025; originally announced January 2025.

    Comments: 25 pages, 6 figures and 3 tables

  8. arXiv:2501.14790  [pdf, other

    q-bio.NC cs.AI cs.SD eess.AS

    Towards Dynamic Neural Communication and Speech Neuroprosthesis Based on Viseme Decoding

    Authors: Ji-Ha Park, Seo-Hyun Lee, Soowon Kim, Seong-Whan Lee

    Abstract: Decoding text, speech, or images from human neural signals holds promising potential both as neuroprosthesis for patients and as innovative communication tools for general users. Although neural signals contain various information on speech intentions, movements, and phonetic details, generating informative outputs from them remains challenging, with mostly focusing on decoding short intentions or… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: 5 pages, 5 figures, 1 table, Name of Conference: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing

  9. arXiv:2411.00871  [pdf, other

    cs.LG cs.AI q-bio.MN

    LLaMo: Large Language Model-based Molecular Graph Assistant

    Authors: Jinyoung Park, Minseong Bae, Dohwan Ko, Hyunwoo J. Kim

    Abstract: Large Language Models (LLMs) have demonstrated remarkable generalization and instruction-following capabilities with instruction tuning. The advancements in LLMs and instruction tuning have led to the development of Large Vision-Language Models (LVLMs). However, the competency of the LLMs and instruction tuning have been less explored in the molecular domain. Thus, we propose LLaMo: Large Language… ▽ More

    Submitted 30 October, 2024; originally announced November 2024.

    Comments: NeurIPS 2024

  10. arXiv:2410.20255  [pdf, other

    cs.LG cs.AI physics.chem-ph q-bio.BM

    Equivariant Blurring Diffusion for Hierarchical Molecular Conformer Generation

    Authors: Jiwoong Park, Yang Shen

    Abstract: How can diffusion models process 3D geometries in a coarse-to-fine manner, akin to our multiscale view of the world? In this paper, we address the question by focusing on a fundamental biochemical problem of generating 3D molecular conformers conditioned on molecular graphs in a multiscale manner. Our approach consists of two hierarchical stages: i) generation of coarse-grained fragment-level 3D s… ▽ More

    Submitted 26 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024

  11. arXiv:2410.17270  [pdf, other

    q-bio.BM cond-mat.mtrl-sci cs.LG

    MOFFlow: Flow Matching for Structure Prediction of Metal-Organic Frameworks

    Authors: Nayoung Kim, Seongsu Kim, Minsu Kim, Jinkyoo Park, Sungsoo Ahn

    Abstract: Metal-organic frameworks (MOFs) are a class of crystalline materials with promising applications in many areas such as carbon capture and drug delivery. In this work, we introduce MOFFlow, the first deep generative model tailored for MOF structure prediction. Existing approaches, including ab initio calculations and even deep generative models, struggle with the complexity of MOF structures due to… ▽ More

    Submitted 19 March, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: 10 pages, 6 figures

    Journal ref: International Conference on Learning Representations (ICLR) 2025

  12. arXiv:2410.04542  [pdf, other

    q-bio.BM cs.LG

    Generative Flows on Synthetic Pathway for Drug Design

    Authors: Seonghwan Seo, Minsu Kim, Tony Shen, Martin Ester, Jinkyoo Park, Sungsoo Ahn, Woo Youn Kim

    Abstract: Generative models in drug discovery have recently gained attention as efficient alternatives to brute-force virtual screening. However, most existing models do not account for synthesizability, limiting their practical use in real-world scenarios. In this paper, we propose RxnFlow, which sequentially assembles molecules using predefined molecular building blocks and chemical reaction templates to… ▽ More

    Submitted 6 March, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

    Comments: Accepted to ICLR 2025, 32 pages, 17 figures, code: https://github.com/SeonghwanSeo/RxnFlow

  13. arXiv:2410.04461  [pdf, ps, other

    cs.LG q-bio.BM

    Improved Off-policy Reinforcement Learning in Biological Sequence Design

    Authors: Hyeonah Kim, Minsu Kim, Taeyoung Yun, Sanghyeok Choi, Emmanuel Bengio, Alex Hernández-García, Jinkyoo Park

    Abstract: Designing biological sequences with desired properties is challenging due to vast search spaces and limited evaluation budgets. Although reinforcement learning methods use proxy models for rapid reward evaluation, insufficient training data can cause proxy misspecification on out-of-distribution inputs. To address this, we propose a novel off-policy search, $δ$-Conservative Search, that enhances r… ▽ More

    Submitted 16 June, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

    Comments: ICML 2025

  14. arXiv:2409.05484  [pdf, other

    cs.LG cs.AI q-bio.GN q-bio.QM

    CRADLE-VAE: Enhancing Single-Cell Gene Perturbation Modeling with Counterfactual Reasoning-based Artifact Disentanglement

    Authors: Seungheun Baek, Soyon Park, Yan Ting Chok, Junhyun Lee, Jueon Park, Mogan Gim, Jaewoo Kang

    Abstract: Predicting cellular responses to various perturbations is a critical focus in drug discovery and personalized therapeutics, with deep learning models playing a significant role in this endeavor. Single-cell datasets contain technical artifacts that may hinder the predictability of such models, which poses quality control issues highly regarded in this area. To address this, we propose CRADLE-VAE,… ▽ More

    Submitted 9 September, 2024; v1 submitted 9 September, 2024; originally announced September 2024.

  15. arXiv:2408.12907  [pdf, other

    q-bio.QM

    Bundling instability of lophotrichous bacteria

    Authors: Jeungeun Park, Yongsam Kim, Wanho Lee, Veronika Pfeifer, Valeriia Muraveva, Carsten Beta, Sookkyung Lim

    Abstract: We present a mathematical model of lophotrichous bacteria, motivated by Pseudomonas putida, which swim through fluid by rotating a cluster of multiple flagella extended from near one pole of the cell body. Although the flagella rotate individually, they are typically bundled together, enabling the bacterium to exhibit three primary modes of motility: push, pull, and wrapping. One key determinant o… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    MSC Class: 92-10; 92-08; 76-10; 76Z10

  16. arXiv:2407.21028  [pdf, other

    q-bio.BM cs.LG

    Antibody DomainBed: Out-of-Distribution Generalization in Therapeutic Protein Design

    Authors: Nataša Tagasovska, Ji Won Park, Matthieu Kirchmeyer, Nathan C. Frey, Andrew Martin Watkins, Aya Abdelsalam Ismail, Arian Rokkum Jamasb, Edith Lee, Tyler Bryson, Stephen Ra, Kyunghyun Cho

    Abstract: Machine learning (ML) has demonstrated significant promise in accelerating drug design. Active ML-guided optimization of therapeutic molecules typically relies on a surrogate model predicting the target property of interest. The model predictions are used to determine which designs to evaluate in the lab, and the model is updated on the new measurements to inform the next cycle of decisions. A key… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  17. arXiv:2406.19113  [pdf, other

    cs.AR cs.DC q-bio.GN

    MegIS: High-Performance, Energy-Efficient, and Low-Cost Metagenomic Analysis with In-Storage Processing

    Authors: Nika Mansouri Ghiasi, Mohammad Sadrosadati, Harun Mustafa, Arvid Gollwitzer, Can Firtina, Julien Eudine, Haiyu Mao, Joël Lindegger, Meryem Banu Cavlak, Mohammed Alser, Jisung Park, Onur Mutlu

    Abstract: Metagenomics has led to significant advances in many fields. Metagenomic analysis commonly involves the key tasks of determining the species present in a sample and their relative abundances. These tasks require searching large metagenomic databases. Metagenomic analysis suffers from significant data movement overhead due to moving large amounts of low-reuse data from the storage system. In-storag… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: To appear in ISCA 2024. arXiv admin note: substantial text overlap with arXiv:2311.12527

  18. arXiv:2403.20109  [pdf, ps, other

    cs.LG cs.AI q-bio.BM

    Mol-AIR: Molecular Reinforcement Learning with Adaptive Intrinsic Rewards for Goal-directed Molecular Generation

    Authors: Jinyeong Park, Jaegyoon Ahn, Jonghwan Choi, Jibum Kim

    Abstract: Optimizing techniques for discovering molecular structures with desired properties is crucial in artificial intelligence(AI)-based drug discovery. Combining deep generative models with reinforcement learning has emerged as an effective strategy for generating molecules with specific properties. Despite its potential, this approach is ineffective in exploring the vast chemical space and optimizing… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  19. arXiv:2402.05982  [pdf, other

    q-bio.QM cs.LG

    Decoupled Sequence and Structure Generation for Realistic Antibody Design

    Authors: Nayoung Kim, Minsu Kim, Sungsoo Ahn, Jinkyoo Park

    Abstract: Recently, deep learning has made rapid progress in antibody design, which plays a key role in the advancement of therapeutics. A dominant paradigm is to train a model to jointly generate the antibody sequence and the structure as a candidate. However, the joint generation requires the model to generate both the discrete amino acid categories and the continuous 3D coordinates; this limits the space… ▽ More

    Submitted 16 January, 2025; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 22 pages, 6 figures

    Journal ref: Transactions on Machine Learning Research, 2025

  20. arXiv:2402.05961  [pdf, other

    q-bio.BM cs.LG cs.NE

    Genetic-guided GFlowNets for Sample Efficient Molecular Optimization

    Authors: Hyeonah Kim, Minsu Kim, Sanghyeok Choi, Jinkyoo Park

    Abstract: The challenge of discovering new molecules with desired properties is crucial in domains like drug discovery and material design. Recent advances in deep learning-based generative methods have shown promise but face the issue of sample efficiency due to the computational expense of evaluating the reward function. This paper proposes a novel algorithm for sample-efficient molecular optimization by… ▽ More

    Submitted 29 December, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: NeurIPS 2024

  21. arXiv:2402.05953  [pdf, other

    q-bio.QM cs.GR cs.HC cs.LG

    idMotif: An Interactive Motif Identification in Protein Sequences

    Authors: Ji Hwan Park, Vikash Prasad, Sydney Newsom, Fares Najar, Rakhi Rajan

    Abstract: This article introduces idMotif, a visual analytics framework designed to aid domain experts in the identification of motifs within protein sequences. Motifs, short sequences of amino acids, are critical for understanding the distinct functions of proteins. Identifying these motifs is pivotal for predicting diseases or infections. idMotif employs a deep learning-based method for the categorization… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: IEEE CGA

    Journal ref: idMotif: An Interactive Motif Identification in Protein Sequences," in IEEE Computer Graphics and Applications, 2023

  22. arXiv:2311.12527  [pdf, other

    cs.AR q-bio.GN q-bio.QM

    MetaStore: High-Performance Metagenomic Analysis via In-Storage Computing

    Authors: Nika Mansouri Ghiasi, Mohammad Sadrosadati, Harun Mustafa, Arvid Gollwitzer, Can Firtina, Julien Eudine, Haiyu Ma, Joël Lindegger, Meryem Banu Cavlak, Mohammed Alser, Jisung Park, Onur Mutlu

    Abstract: Metagenomics has led to significant advancements in many fields. Metagenomic analysis commonly involves the key tasks of determining the species present in a sample and their relative abundances. These tasks require searching large metagenomic databases containing information on different species' genomes. Metagenomic analysis suffers from significant data movement overhead due to moving large amo… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  23. arXiv:2311.04468  [pdf

    eess.IV q-bio.NC

    A human brain atlas of chi-separation for normative iron and myelin distributions

    Authors: Kyeongseon Min, Beomseok Sohn, Woo Jung Kim, Chae Jung Park, Soohwa Song, Dong Hoon Shin, Kyung Won Chang, Na-Young Shin, Minjun Kim, Hyeong-Geol Shin, Phil Hyu Lee, Jongho Lee

    Abstract: Iron and myelin are primary susceptibility sources in the human brain. These substances are essential for healthy brain, and their abnormalities are often related to various neurological disorders. Recently, an advanced susceptibility mapping technique, which is referred to as chi-separation, has been proposed, successfully disentangling paramagnetic iron from diamagnetic myelin. This method opene… ▽ More

    Submitted 2 April, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 19 pages, 9 figures

  24. arXiv:2309.11438  [pdf, other

    cond-mat.soft q-bio.NC

    Brain-inspired computing with fluidic iontronic nanochannels

    Authors: T. M. Kamsma, J. Kim, K. Kim, W. Q. Boon, C. Spitoni, J. Park, R. van Roij

    Abstract: The brain's remarkable and efficient information processing capability is driving research into brain-inspired (neuromorphic) computing paradigms. Artificial aqueous ion channels are emerging as an exciting platform for neuromorphic computing, representing a departure from conventional solid-state devices by directly mimicking the brain's fluidic ion transport. Supported by a quantitative theoreti… ▽ More

    Submitted 25 April, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

    Journal ref: Proceedings of the National Academy of Sciences (2024), Vol 121, Issue 18

  25. The Past, Present, and Future of the Brain Imaging Data Structure (BIDS)

    Authors: Russell A. Poldrack, Christopher J. Markiewicz, Stefan Appelhoff, Yoni K. Ashar, Tibor Auer, Sylvain Baillet, Shashank Bansal, Leandro Beltrachini, Christian G. Benar, Giacomo Bertazzoli, Suyash Bhogawar, Ross W. Blair, Marta Bortoletto, Mathieu Boudreau, Teon L. Brooks, Vince D. Calhoun, Filippo Maria Castelli, Patricia Clement, Alexander L Cohen, Julien Cohen-Adad, Sasha D'Ambrosio, Gilles de Hollander, María de la iglesia-Vayá, Alejandro de la Vega, Arnaud Delorme , et al. (89 additional authors not shown)

    Abstract: The Brain Imaging Data Structure (BIDS) is a community-driven standard for the organization of data and metadata from a growing range of neuroscience modalities. This paper is meant as a history of how the standard has developed and grown over time. We outline the principles behind the project, the mechanisms by which it has been extended, and some of the challenges being addressed as it evolves.… ▽ More

    Submitted 8 January, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

  26. Vis-SPLIT: Interactive Hierarchical Modeling for mRNA Expression Classification

    Authors: Braden Roper, James C. Mathews, Saad Nadeem, Ji Hwan Park

    Abstract: We propose an interactive visual analytics tool, Vis-SPLIT, for partitioning a population of individuals into groups with similar gene signatures. Vis-SPLIT allows users to interactively explore a dataset and exploit visual separations to build a classification model for specific cancers. The visualization components reveal gene expression and correlation to assist specific partitioning decisions,… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: To be published in IEEE Visualization and Visual Analytics (VIS), 2023

  27. arXiv:2309.01670  [pdf, other

    q-bio.GN cs.LG

    Blind Biological Sequence Denoising with Self-Supervised Set Learning

    Authors: Nathan Ng, Ji Won Park, Jae Hyeon Lee, Ryan Lewis Kelly, Stephen Ra, Kyunghyun Cho

    Abstract: Biological sequence analysis relies on the ability to denoise the imprecise output of sequencing platforms. We consider a common setting where a short sequence is read out repeatedly using a high-throughput long-read platform to generate multiple subreads, or noisy observations of the same sequence. Denoising these subreads with alignment-based approaches often fails when too few subreads are avai… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  28. arXiv:2306.16085  [pdf, other

    cs.LG physics.chem-ph q-bio.QM

    Mass Spectra Prediction with Structural Motif-based Graph Neural Networks

    Authors: Jiwon Park, Jeonghee Jo, Sungroh Yoon

    Abstract: Mass spectra, which are agglomerations of ionized fragments from targeted molecules, play a crucial role across various fields for the identification of molecular structures. A prevalent analysis method involves spectral library searches,where unknown spectra are cross-referenced with a database. The effectiveness of such search-based approaches, however, is restricted by the scope of the existing… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 19 pages, 3figures

  29. arXiv:2306.03111  [pdf, other

    q-bio.QM cs.LG stat.ML

    Bootstrapped Training of Score-Conditioned Generator for Offline Design of Biological Sequences

    Authors: Minsu Kim, Federico Berto, Sungsoo Ahn, Jinkyoo Park

    Abstract: We study the problem of optimizing biological sequences, e.g., proteins, DNA, and RNA, to maximize a black-box score function that is only evaluated in an offline dataset. We propose a novel solution, bootstrapped training of score-conditioned generator (BootGen) algorithm. Our algorithm repeats a two-stage process. In the first stage, our algorithm trains the biological sequence generator with ra… ▽ More

    Submitted 22 March, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023, 19 pages, 5 figures

  30. arXiv:2305.12341  [pdf, other

    q-bio.PE

    Enhancing biodiversity through intraspecific suppression in large ecosystems

    Authors: Seong-Gyu Yang, Hye Jin Park

    Abstract: The competitive exclusion principle (CEP) is a fundamental concept in the niche theory, which posits that the number of available resources constrains the coexistence of species. While the CEP offers an intuitive explanation on coexistence, it has been challenged by counterexamples observed in nature. One prominent counterexample is the phytoplankton community, known as the paradox of the plankton… ▽ More

    Submitted 1 April, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: 40 pages (including Appendix), 25 figures (5 figures in main, 20 figures in Appendix)

  31. arXiv:2304.10065  [pdf

    physics.bio-ph q-bio.CB

    Machine learning traction force maps of cell monolayers

    Authors: Changhao Li, Luyi Feng, Yang Jeong Park, Jian Yang, Ju Li, Sulin Zhang

    Abstract: Cellular force transmission across a hierarchy of molecular switchers is central to mechanobiological responses. However, current cellular force microscopies suffer from low throughput and resolution. Here we introduce and train a generative adversarial network (GAN) to paint out traction force maps of cell monolayers with high fidelity to the experimental traction force microscopy (TFM). The GAN… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  32. arXiv:2301.00556  [pdf, ps, other

    q-bio.PE cond-mat.stat-mech physics.soc-ph

    Competition of alliances in a cyclically dominant eight-species population

    Authors: Junpyo Park, Xiaojie Chen, Attila Szolnoki

    Abstract: In a diverse population, where many species are present, competitors can fight for surviving at individual and collective levels. In particular, species, which would beat each other individually, may form a specific alliance that ensures them stable coexistence against the invasion of an external species. Our principal goal is to identify those general features of a formation which determine its v… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

    Comments: 10 double-column pages, 11 figures

    Journal ref: Chaos, Solitons and Fractals 166 (2023) 113004

  33. Invasion and Interaction Determine Population Composition in an Open Evolving System

    Authors: Youngjai Park, Takashi Shimada, Seung-Woo Son, Hye Jin Park

    Abstract: It is well-known that interactions between species determine the population composition in an ecosystem. Conventional studies have focused on fixed population structures to reveal how interactions shape population compositions. However, interaction structures are not fixed, but change over time due to invasions. Thus, invasion and interaction play an important role in shaping communities. Despite… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: 15 pages (including supplementary material), 8 figures (4 figures in main, 4 figures in SI)

  34. arXiv:2210.04096  [pdf, other

    cs.LG q-bio.QM

    PropertyDAG: Multi-objective Bayesian optimization of partially ordered, mixed-variable properties for biological sequence design

    Authors: Ji Won Park, Samuel Stanton, Saeed Saremi, Andrew Watkins, Henri Dwyer, Vladimir Gligorijevic, Richard Bonneau, Stephen Ra, Kyunghyun Cho

    Abstract: Bayesian optimization offers a sample-efficient framework for navigating the exploration-exploitation trade-off in the vast design space of biological sequences. Whereas it is possible to optimize the various properties of interest jointly using a multi-objective acquisition function, such as the expected hypervolume improvement (EHVI), this approach does not account for objectives with a hierarch… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 9 pages, 7 figures. Submitted to NeurIPS 2022 AI4Science Workshop

  35. arXiv:2208.14959  [pdf

    stat.ME q-bio.QM

    Inference of Mixed Graphical Models for Dichotomous Phenotypes using Markov Random Field Model

    Authors: Jaehyun Park, Sungho Won

    Abstract: In this article, we propose a new method named fused mixed graphical model (FMGM), which can infer network structures for dichotomous phenotypes. We assumed that the interplay of different omics markers is associated with disease status and proposed an FMGM-based method to detect the associated omics marker network difference. The statistical models of the networks were based on a pairwise Markov… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

    Comments: 31 pages (excluding figures and tables), 4 figures, 3 tables, submitted to Biometrics

    MSC Class: 92B15 (Primary) 62P10 62H10 62-08 (Secondary)

  36. arXiv:2208.10661  [pdf, other

    q-bio.GN

    Therapeutic algebra of immunomodulatory drug responses at single-cell resolution

    Authors: Jialong Jiang, Sisi Chen, Tiffany Tsou, Christopher S. McGinnis, Tahmineh Khazaei, Qin Zhu, Jong H. Park, Paul Rivaud, Inna-Marie Strazhnik, Eric D. Chow, David A. Sivak, Zev J. Gartner, Matt Thomson

    Abstract: Therapeutic modulation of immune states is central to the treatment of human disease. However, how drugs and drug combinations impact the diverse cell types in the human immune system remains poorly understood at the transcriptome scale. Here, we apply single-cell mRNA-seq to profile the response of human immune cells to 502 immunomodulatory drugs alone and in combination. We develop a unified mat… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 19 pages, 5 figures

  37. arXiv:2205.04259  [pdf, other

    cs.LG q-bio.BM

    Multi-segment preserving sampling for deep manifold sampler

    Authors: Daniel Berenberg, Jae Hyeon Lee, Simon Kelow, Ji Won Park, Andrew Watkins, Vladimir Gligorijević, Richard Bonneau, Stephen Ra, Kyunghyun Cho

    Abstract: Deep generative modeling for biological sequences presents a unique challenge in reconciling the bias-variance trade-off between explicit biological insight and model flexibility. The deep manifold sampler was recently proposed as a means to iteratively sample variable-length protein sequences by exploiting the gradients from a function predictor. We introduce an alternative approach to this guide… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

  38. arXiv:2204.03742  [pdf, other

    eess.IV cs.CV physics.med-ph q-bio.QM

    Mitosis domain generalization in histopathology images -- The MIDOG challenge

    Authors: Marc Aubreville, Nikolas Stathonikos, Christof A. Bertram, Robert Klopleisch, Natalie ter Hoeve, Francesco Ciompi, Frauke Wilm, Christian Marzahl, Taryn A. Donovan, Andreas Maier, Jack Breen, Nishant Ravikumar, Youjin Chung, Jinah Park, Ramin Nateghi, Fattaneh Pourakpour, Rutger H. J. Fick, Saima Ben Hadj, Mostafa Jahanifar, Nasir Rajpoot, Jakob Dexl, Thomas Wittenberg, Satoshi Kondo, Maxime W. Lafarge, Viktor H. Koelzer , et al. (10 additional authors not shown)

    Abstract: The density of mitotic figures within tumor tissue is known to be highly correlated with tumor proliferation and thus is an important marker in tumor grading. Recognition of mitotic figures by pathologists is known to be subject to a strong inter-rater bias, which limits the prognostic value. State-of-the-art deep learning methods can support the expert in this assessment but are known to strongly… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: 19 pages, 9 figures, summary paper of the 2021 MICCAI MIDOG challenge

    Journal ref: Medical Image Analysis 84 (2023) 102699

  39. arXiv:2202.10400  [pdf, other

    cs.AR cs.DC cs.OS q-bio.GN

    GenStore: A High-Performance and Energy-Efficient In-Storage Computing System for Genome Sequence Analysis

    Authors: Nika Mansouri Ghiasi, Jisung Park, Harun Mustafa, Jeremie Kim, Ataberk Olgun, Arvid Gollwitzer, Damla Senol Cali, Can Firtina, Haiyu Mao, Nour Almadhoun Alserr, Rachata Ausavarungnirun, Nandita Vijaykumar, Mohammed Alser, Onur Mutlu

    Abstract: Read mapping is a fundamental, yet computationally-expensive step in many genomics applications. It is used to identify potential matches and differences between fragments (called reads) of a sequenced genome and an already known genome (called a reference genome). To address the computational challenges in genome analysis, many prior works propose various approaches such as filters that select th… ▽ More

    Submitted 6 April, 2023; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: Published at ASPLOS 2022

  40. BLEND: A Fast, Memory-Efficient, and Accurate Mechanism to Find Fuzzy Seed Matches in Genome Analysis

    Authors: Can Firtina, Jisung Park, Mohammed Alser, Jeremie S. Kim, Damla Senol Cali, Taha Shahroodi, Nika Mansouri Ghiasi, Gagandeep Singh, Konstantinos Kanellopoulos, Can Alkan, Onur Mutlu

    Abstract: Generating the hash values of short subsequences, called seeds, enables quickly identifying similarities between genomic sequences by matching seeds with a single lookup of their hash values. However, these hash values can be used only for finding exact-matching seeds as the conventional hashing methods assign distinct hash values for different seeds, including highly similar seeds. Finding only e… ▽ More

    Submitted 23 May, 2023; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: Published in NARGAB

    Journal ref: NAR Genomics and Bioinformatics, vol. 5, no. 1, p. lqad004, Mar. 2023

  41. arXiv:2112.05782  [pdf, ps, other

    physics.soc-ph q-bio.PE

    Dynamical clustering of U.S. states reveals four distinct infection patterns that predict SARS-CoV-2 pandemic behavior

    Authors: Joseph L. Natale, Varun Viswanath, Oscar Trujillo Acevedo, Sophia Pérez Giottonini, Sandy Ihuiyan Romero Hernández, Diana G. Cruz Millán, A. Montserrat Palacios-Puga, Ammar Mandvi, Brian M. Khan, Martin Lilik, Jay Park, Benjamin L. Smarr

    Abstract: The SARS-CoV-2 pandemic has so far unfolded diversely across the fifty United States of America, reflected both in different time progressions of infection "waves" and in magnitudes of local infection rates. Despite a marked diversity of presentations, most U.S. states experienced their single greatest surge in daily new cases during the transition from Fall 2020 to Winter 2021. Popular media also… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: 22 pages, 4 figures; submitted to PLOS ONE

  42. arXiv:2106.13202  [pdf, other

    q-bio.QM cs.LG

    SALT: Sea lice Adaptive Lattice Tracking -- An Unsupervised Approach to Generate an Improved Ocean Model

    Authors: Ju An Park, Vikram Voleti, Kathryn E. Thomas, Alexander Wong, Jason L. Deglint

    Abstract: Warming oceans due to climate change are leading to increased numbers of ectoparasitic copepods, also known as sea lice, which can cause significant ecological loss to wild salmon populations and major economic loss to aquaculture sites. The main transport mechanism driving the spread of sea lice populations are near-surface ocean currents. Present strategies to estimate the distribution of sea li… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: 5 pages, 3 figures, 3 tables

  43. arXiv:2106.10627  [pdf, other

    q-bio.NC math.DS

    Experimentally testable whole brain manifolds that recapitulate behavior

    Authors: Gerald M Pao, Cameron Smith, Joseph Park, Keichi Takahashi, Wassapon Watanakeesuntorn, Hiroaki Natsukawa, Sreekanth H Chalasani, Tom Lorimer, Ryousei Takano, Nuttida Rungratsameetaweemana, George Sugihara

    Abstract: We propose an algorithm grounded in dynamical systems theory that generalizes manifold learning from a global state representation, to a network of local interacting manifolds termed a Generative Manifold Network (GMN). Manifolds are discovered using the convergent cross mapping (CCM) causal inference algorithm which are then compressed into a reduced redundancy network. The representation is a ne… ▽ More

    Submitted 20 June, 2021; originally announced June 2021.

    Comments: 20 pages, 15 figures; corresponding author: Gerald Pao [email protected]

  44. arXiv:2011.13554  [pdf

    q-bio.CB

    Towards decoding the coupled decision-making of metabolism and epithelial-mesenchymal transition in cancer

    Authors: Dongya Jia, Jun Hyoung Park, Harsimran Kaur, Kwang Hwa Jung, Sukjin Yang, Shubham Tripathi, Madeline Galbraith, Youyuan Deng, Mohit Kumar Jolly, Benny Abraham Kaipparettu, Jose N. Onuchic, Herbert Levine

    Abstract: Cancer cells have the plasticity to adjust their metabolic phenotypes for survival and metastasis. During metastasis, a developmental program known as the epithelial-mesenchymal transition (EMT) plays a critical role. There is extensive cross-talk between metabolism and EMT, but how this leads to coordinated physiological changes is still uncertain. The elusive connection between metabolism and EM… ▽ More

    Submitted 26 November, 2020; originally announced November 2020.

    Comments: 31 pages, 3 figures

  45. arXiv:2011.11082  [pdf, other

    cs.DC math.DS q-bio.QM

    Massively Parallel Causal Inference of Whole Brain Dynamics at Single Neuron Resolution

    Authors: Wassapon Watanakeesuntorn, Keichi Takahashi, Kohei Ichikawa, Joseph Park, George Sugihara, Ryousei Takano, Jason Haga, Gerald M. Pao

    Abstract: Empirical Dynamic Modeling (EDM) is a nonlinear time series causal inference framework. The latest implementation of EDM, cppEDM, has only been used for small datasets due to computational cost. With the growth of data collection capabilities, there is a great need to identify causal relationships in large datasets. We present mpEDM, a parallel distributed implementation of EDM optimized for moder… ▽ More

    Submitted 22 November, 2020; originally announced November 2020.

    Comments: 10 pges, 10 figures, accepted at IEEE International Conference on Parallel and Distributed Systems (ICPADS)2020, corresponding authors: Keichi Takahashi, Gerald M Pao

    ACM Class: K.6.3; G.4; J.3

  46. arXiv:2008.05377  [pdf

    q-bio.QM q-bio.MN

    Network reinforcement driven drug repurposing for COVID-19 by exploiting disease-gene-drug associations

    Authors: Yonghyun Nam, Jae-Seung Yun, Seung Mi Lee, Ji Won Park, Ziqi Chen, Brian Lee, Anurag Verma, Xia Ning, Li Shen, Dokyoon Kim

    Abstract: Currently, the number of patients with COVID-19 has significantly increased. Thus, there is an urgent need for developing treatments for COVID-19. Drug repurposing, which is the process of reusing already-approved drugs for new medical conditions, can be a good way to solve this problem quickly and broadly. Many clinical trials for COVID-19 patients using treatments for other diseases have already… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: 4 figures

  47. arXiv:2006.00688  [pdf, other

    q-bio.QM math.AP q-bio.CB

    A Mathematical Description of Bacterial Chemotaxis in Response to Two Stimuli

    Authors: Jeungeun Park, Zahra Aminzare

    Abstract: Bacteria are often exposed to multiple stimuli in complex environments, and their efficient chemotactic decisions are critical to survive and grow in their native environments. Bacterial responses to the environmental stimuli depend on the ratio of their corresponding chemoreceptors. By incorporating the signaling machinery of individual cells, we analyze the collective motion of a population of E… ▽ More

    Submitted 8 June, 2021; v1 submitted 31 May, 2020; originally announced June 2020.

    MSC Class: 35Q92; 58J55; 60J75; 92B05; 92C17; 92D25

  48. Absolute ethanol intake drives ethanol preference in Drosophila

    Authors: Scarlet J. Park, William W. Ja

    Abstract: Factors that mediate ethanol preference in Drosophila melanogaster are not well understood. A major confound has been the use of diverse methods to estimate ethanol consumption. We measured fly consumptive ethanol preference on base diets varying in nutrients, taste, and ethanol concentration. Both sexes showed ethanol preference that was abolished on high nutrient concentration diets. Additionall… ▽ More

    Submitted 25 May, 2020; originally announced May 2020.

    Comments: 11 pages, 2 figures, 1 table. Complete raw data accessible from https://github.com/HungryFly/JaLab/raw/master/publications/ethanol_JEB/SI_dataset.xlsx This version of the manuscript is original submission before undergoing peer review process. Final accepted and published version of this manuscript is available from https://doi.org/10.1242/jeb.224121 J Exp Biol (2020)

  49. arXiv:2002.02601  [pdf, other

    stat.ML cs.LG q-bio.QM stat.AP stat.ME

    Bidimensional linked matrix factorization for pan-omics pan-cancer analysis

    Authors: Eric F. Lock, Jun Young Park, Katherine A. Hoadley

    Abstract: Several modern applications require the integration of multiple large data matrices that have shared rows and/or columns. For example, cancer studies that integrate multiple omics platforms across multiple types of cancer, pan-omics pan-cancer analysis, have extended our knowledge of molecular heterogenity beyond what was observed in single tumor and single platform studies. However, these studies… ▽ More

    Submitted 7 April, 2022; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: 26 pages, 5 figures

    Journal ref: Annals of Applied Statistics 2022, Vol. 16, No. 1, 193-215

  50. arXiv:1909.03992  [pdf

    q-bio.QM

    Acoustomicrofluidic separation of tardigrades from raw cultures for sample preparation

    Authors: Muhammad Afzal, Jinsoo Park, Ghulam Destgeer, Husnain Ahmed, Syed Atif Iqrar, Sanghee Kim, Sunghyun Kang, Anas Alazzam, Tae-Sung Yoon, Hyung Jin Sung

    Abstract: Tardigrades are microscopic animals widely known for their survival capabilities under extreme conditions. They are the focus of current research in the fields of taxonomy, biogeography, genomics, proteomics, development, space biology, evolution, and ecology. Tardigrades, such as Hypsibius exemplaris, are being advocated as a next-generation model organism for genomic and developmental studies. T… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.