
Language-specific Tonal Features Drive Speaker-Listener Neural Synchronization

Authors: Chen Hong, Xiangbin Teng, Yu Li, Shen-Mou Hsu, Feng-Ming Tsao, Patrick C. M. Wong, Gangyi Feng

Abstract: Verbal communication transmits information across diverse linguistic levels, with neural synchronization (NS) between speakers and listeners emerging as a putative mechanism underlying successful exchange. However, the specific speech features driving this synchronization and how language-specific versus universal characteristics facilitate information transfer remain poorly understood. We developed a novel content-based interbrain encoding model to disentangle the contributions of acoustic and linguistic features to speaker-listener NS during Mandarin storytelling and listening, as measured via magnetoencephalography (MEG). Results revealed robust NS throughout frontotemporal-parietal networks with systematic time lags between speech production and perception. Crucially, suprasegmental lexical tone features (tone categories, pitch height, and pitch contour), essential for lexical meaning in Mandarin, contributed more significantly to NS than either acoustic elements or universal segmental units (consonants and vowels). These tonal features generated distinctive spatiotemporal NS patterns, creating language-specific neural "communication channels" that facilitated efficient representation sharing between interlocutors. Furthermore, the strength and patterns of NS driven by these language-specific features predicted communication success. These findings demonstrate the neural mechanisms underlying shared representations during verbal exchange and highlight how language-specific features can shape neural coupling to optimize information transfer during human communication.

Submitted 7 March, 2025; originally announced March 2025.

arXiv:2402.04274 [pdf, other]

FPGA Deployment of LFADS for Real-time Neuroscience Experiments

Authors: Xiaohan Liu, ChiJui Chen, YanLun Huang, LingChi Yang, Elham E Khoda, Yihui Chen, Scott Hauck, Shih-Chieh Hsu, Bo-Cheng Lai

Abstract: Large-scale recordings of neural activity are providing new opportunities to study neural population dynamics. A powerful method for analyzing such high-dimensional measurements is to deploy an algorithm to learn the low-dimensional latent dynamics. LFADS (Latent Factor Analysis via Dynamical Systems) is a deep learning method for inferring latent dynamics from high-dimensional neural spiking data recorded simultaneously in single trials. This method has shown remarkable performance in modeling complex brain signals with an average inference latency in milliseconds. As our capacity for simultaneously recording many neurons is increasing exponentially, it is becoming crucial to build capacity for deploying low-latency inference of the computing algorithms. To improve the real-time processing ability of LFADS, we introduce an efficient implementation of the LFADS models onto Field Programmable Gate Arrays (FPGA). Our implementation shows an inference latency of 41.97 $\mu$s for processing the data in a single trial on a Xilinx U55C.

Submitted 2 February, 2024; originally announced February 2024.

Comments: 6 pages, 8 figures

Journal ref: Fast Machine Learning for Science, ICCAD 2023

arXiv:2212.07505 [pdf, other]

Animal Synchrony and agents' segregation

Authors: Laura P. Schaposnik, Sheryl Hsu, Robin I. M. Dunbar

Abstract: In recent years, the need to understand how failure of coordination imposes constraints on the size of stable groups that highly social mammals can live in has become evident. We examine here the forces that keep animals together as a herd and others that drive them apart. Different phenotypes (e.g. genders) have different rates of gut fill, causing them to spend different amounts of time performing activities. By modeling a group as a set of semi-coupled oscillators on a disc, we show that the members of the group may become less and less coupled until the group dissolves and breaks apart. We show that when social bonding creates a stickiness, or gravitational pull, between pairs of individuals, fragmentation is reduced.

Submitted 14 December, 2022; originally announced December 2022.

Comments: Dedicated to Prof. Fidel A. Schaposnik on the occasion of his 75th birthday

arXiv:2101.05870 [pdf, other]

From Genotype to Phenotype: polygenic prediction of complex human traits

Authors: Timothy G. Raben, Louis Lello, Erik Widen, Stephen D. H. Hsu

Abstract: Decoding the genome confers the capability to predict characteristics of the organism (phenotype) from DNA (genotype). We describe the present status and future prospects of genomic prediction of complex traits in humans. Some highly heritable complex phenotypes such as height and other quantitative traits can already be predicted with reasonable accuracy from DNA alone. For many diseases, including important common conditions such as coronary artery disease, breast cancer, type I and II diabetes, individuals with outlier polygenic scores (e.g., top few percent) have been shown to have 5 or even 10 times higher risk than average. Several psychiatric conditions such as schizophrenia and autism also fall into this category. We discuss related topics such as the genetic architecture of complex traits, sibling validation of polygenic scores, and applications to adult health, in vitro fertilization (embryo selection), and genetic engineering.

Submitted 14 January, 2021; originally announced January 2021.

Comments: 33 pages, 7 figures, 1 table, A version of this article was prepared for "Genomic Prediction of Complex Traits", Springer Nature book series "Methods in Molecular Biology"

arXiv:1709.06489 [pdf, ps, other]

Accurate Genomic Prediction Of Human Height

Authors: Louis Lello, Steven G. Avery, Laurent Tellier, Ana Vazquez, Gustavo de los Campos, Stephen D. H. Hsu

Abstract: We construct genomic predictors for heritable and extremely complex human quantitative traits (height, heel bone density, and educational attainment) using modern methods in high dimensional statistics (i.e., machine learning). Replication tests show that these predictors capture, respectively, $\sim$40, 20, and 9 percent of total variance for the three traits. For example, predicted heights correlate $\sim$0.65 with actual height; actual heights of most individuals in validation samples are within a few cm of the prediction. The variance captured for height is comparable to the estimated SNP heritability from GCTA (GREML) analysis, and seems to be close to its asymptotic value (i.e., as sample size goes to infinity), suggesting that we have captured most of the heritability for the SNPs used. Thus, our results resolve the common SNP portion of the "missing heritability" problem -- i.e., the gap between prediction R-squared and SNP heritability. The $\sim$20k activated SNPs in our height predictor reveal the genetic architecture of human height, at least for common SNPs. Our primary dataset is the UK Biobank cohort, comprised of almost 500k individual genotypes with multiple phenotypes. We also use other datasets and SNPs found in earlier GWAS for out-of-sample validation of our results.

Submitted 19 September, 2017; originally announced September 2017.

Comments: 17 pages, 10 figures

arXiv:1408.6583 [pdf, other]

Determination of Nonlinear Genetic Architecture using Compressed Sensing

Authors: Chiu Man Ho, Stephen D. H. Hsu

Abstract: We introduce a statistical method that can reconstruct nonlinear genetic models (i.e., including epistasis, or gene-gene interactions) from phenotype-genotype (GWAS) data. The computational and data resource requirements are similar to those necessary for reconstruction of linear genetic models (or identification of gene-trait associations), assuming a condition of generalized sparsity, which limits the total number of gene-gene interactions. An example of a sparse nonlinear model is one in which a typical locus interacts with several or even many others, but only a small subset of all possible interactions exists. It seems plausible that most genetic architectures fall in this category. Our method uses a generalization of compressed sensing (L1-penalized regression) applied to nonlinear functions of the sensing matrix. We give theoretical arguments suggesting that the method is nearly optimal in performance, and demonstrate its effectiveness on broad classes of nonlinear genetic models using both real and simulated human genomes.

Submitted 19 July, 2015; v1 submitted 27 August, 2014; originally announced August 2014.

Comments: 20 pages, 8 figures. arXiv admin note: text overlap with arXiv:1408.3421

Journal ref: GigaScience 4: 44 (2015)

arXiv:1408.3421 [pdf, other]

On the genetic architecture of intelligence and other quantitative traits

Authors: Stephen D. H. Hsu

Abstract: How do genes affect cognitive ability or other human quantitative traits such as height or disease risk? Progress on this challenging question is likely to be significant in the near future. I begin with a brief review of psychometric measurements of intelligence, introducing the idea of a "general factor" or g score. The main results concern the stability, validity (predictive power), and heritability of adult g. The largest component of genetic variance for both height and intelligence is additive (linear), leading to important simplifications in predictive modeling and statistical estimation. Due mainly to the rapidly decreasing cost of genotyping, it is possible that within the coming decade researchers will identify loci which account for a significant fraction of total g variation. In the case of height analogous efforts are well under way. I describe some unpublished results concerning the genetic architecture of height and cognitive ability, which suggest that roughly 10k moderately rare causal variants of mostly negative effect are responsible for normal population variation. Using results from Compressed Sensing (L1-penalized regression), I estimate the statistical power required to characterize both linear and nonlinear models for quantitative traits. The main unknown parameter s (sparsity) is the number of loci which account for the bulk of the genetic variation. The required sample size is of order $100s$, or roughly a million in the case of cognitive ability.

Submitted 30 August, 2014; v1 submitted 14 August, 2014; originally announced August 2014.

Comments: 30 pages, 13 figures; v2 minor edits

arXiv:1310.2264 [pdf, other]

Application of compressed sensing to genome wide association studies and genomic selection

Authors: Shashaank Vattikuti, James J. Lee, Christopher C. Chang, Stephen D. H. Hsu, Carson C. Chow

Abstract: We show that the signal-processing paradigm known as compressed sensing (CS) is applicable to genome-wide association studies (GWAS) and genomic selection (GS). The aim of GWAS is to isolate trait-associated loci, whereas GS attempts to predict the phenotypic values of new individuals on the basis of training data. CS addresses a problem common to both endeavors, namely that the number of genotyped markers often greatly exceeds the sample size. We show using CS methods and theory that all loci of nonzero effect can be identified (selected) using an efficient algorithm, provided that they are sufficiently few in number (sparse) relative to sample size. For heritability $h^2 = 1$, there is a sharp phase transition to complete selection as the sample size is increased. For heritability values less than one, complete selection can still occur although the transition is smoothed. The transition boundary is only weakly dependent on the total number of genotyped markers. The crossing of a transition boundary provides an objective means to determine when true effects are being recovered; we discuss practical methods for detecting the boundary. For $h^2 = 0.5$, we find that a sample size that is thirty times the number of nonzero loci is sufficient for good recovery.

Submitted 11 May, 2014; v1 submitted 8 October, 2013; originally announced October 2013.

Comments: 30 pages, 11 figures. Version to appear in journal GigaScience

arXiv:1006.3271 [pdf]

The probabilistic analysis of language acquisition: Theoretical, computational, and experimental analysis

Authors: Anne S. Hsu, Nick Chater, Paul M. B. Vitanyi

Abstract: There is much debate over the degree to which language learning is governed by innate language-specific biases, or acquired through cognition-general principles. Here we examine the probabilistic language acquisition hypothesis on three levels: We outline a novel theoretical result showing that it is possible to learn the exact generative model underlying a wide class of languages, purely from observing samples of the language. We then describe a recently proposed practical framework, which quantifies natural language learnability, allowing specific learnability predictions to be made for the first time. In previous work, this framework was used to make learnability predictions for a wide variety of linguistic constructions, for which learnability has been much debated. Here, we present a new experiment which tests these learnability predictions. We find that our experimental results support the possibility that these linguistic constructions are acquired probabilistically from cognition-general principles.

Submitted 16 June, 2010; originally announced June 2010.

Comments: 26 pages, pdf, 4 figures, Submitted to "Cognition"

MSC Class: 91E10; 97C30; 68T50

arXiv:cond-mat/0306628 [pdf, ps, other]

Global Spread of Infectious Diseases

Authors: S. Hsu, A. Zee

Abstract: We develop simple models for the global spread of infectious diseases, emphasizing human mobility via air travel and the variation of public health infrastructure from region to region. We derive formulas relating the total and peak number of infections in two countries to the rate of travel between them and their respective epidemiological parameters.

Submitted 25 June, 2003; originally announced June 2003.

Comments: 13 pages, 7 figures (eps), latex

Showing 1–10 of 10 results for author: Hsu, S