Showing 1–10 of 10 results for author: Rudzicz, F

Searching in archive stat.

  1. arXiv:2007.09028  [pdf, other]

    cs.LG cs.AI cs.HC stat.ML

    Sequential Explanations with Mental Model-Based Policies

    Authors: Arnold YS Yeung, Shalmali Joshi, Joseph Jay Williams, Frank Rudzicz

    Abstract: The act of explaining across two parties is a feedback loop, where one provides information on what needs to be explained and the other provides an explanation relevant to this information. We apply a reinforcement learning framework which emulates this format by providing explanations based on the explainee's current mental model. We conduct novel online human experiments where explanations gener…

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: Accepted into ICML 2020 Workshop on Human Interpretability in Machine Learning (Spotlight)

  2. arXiv:2005.11371  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Speaker diarization with session-level speaker embedding refinement using graph neural networks

    Authors: Jixuan Wang, Xiong Xiao, Jian Wu, Ranjani Ramamurthy, Frank Rudzicz, Michael Brudno

    Abstract: Deep speaker embedding models have been commonly used as a building block for speaker diarization systems; however, the speaker embedding model is usually trained according to a global loss defined on the training data, which could be sub-optimal for distinguishing speakers locally in a specific meeting session. In this work we present the first use of graph neural networks (GNNs) for the speaker…

    Submitted 22 May, 2020; originally announced May 2020.

    Comments: ICASSP 2020 (45th International Conference on Acoustics, Speech, and Signal Processing)

  3. arXiv:1906.10064  [pdf, other]

    cs.LG cs.AI stat.ML

    Variations on the Chebyshev-Lagrange Activation Function

    Authors: Yuchen Li, Frank Rudzicz, Jekaterina Novikova

    Abstract: We seek to improve the data efficiency of neural networks and present novel implementations of parameterized piece-wise polynomial activation functions. The parameters are the y-coordinates of n+1 Chebyshev nodes per hidden unit and Lagrangian interpolation between the nodes produces the polynomial on [-1, 1]. We show results for different methods of handling inputs outside [-1, 1] on synthetic da…

    Submitted 24 June, 2019; originally announced June 2019.

  4. arXiv:1902.02375  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Centroid-based deep metric learning for speaker recognition

    Authors: Jixuan Wang, Kuan-Chieh Wang, Marc Law, Frank Rudzicz, Michael Brudno

    Abstract: Speaker embedding models that utilize neural networks to map utterances to a space where distances reflect similarity between speakers have driven recent progress in the speaker recognition task. However, there is still a significant performance gap between recognizing speakers in the training set and unseen speakers. The latter case corresponds to the few-shot learning task, where a trained model…

    Submitted 6 February, 2019; originally announced February 2019.

    Comments: ICASSP 2019 (44th International Conference on Acoustics, Speech, and Signal Processing)

  5. arXiv:1811.12254  [pdf, other]

    cs.LG cs.CL cs.SD eess.AS stat.ML

    The Effect of Heterogeneous Data for Alzheimer's Disease Detection from Speech

    Authors: Aparna Balagopalan, Jekaterina Novikova, Frank Rudzicz, Marzyeh Ghassemi

    Abstract: Speech datasets for identifying Alzheimer's disease (AD) are generally restricted to participants performing a single task, e.g. describing an image shown to them. As a result, models trained on linguistic features derived from such datasets may not be generalizable across tasks. Building on prior work demonstrating that same-task data of healthy participants helps improve AD detection on a single…

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/147

  6. arXiv:1811.10376  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Robustness against the channel effect in pathological voice detection

    Authors: Yi-Te Hsu, Zining Zhu, Chi-Te Wang, Shih-Hau Fang, Frank Rudzicz, Yu Tsao

    Abstract: Many people are suffering from voice disorders, which can adversely affect the quality of their lives. In response, some researchers have proposed algorithms for automatic assessment of these disorders, based on voice signals. However, these signals can be sensitive to the recording devices. Indeed, the channel effect is a pervasive problem in machine learning for healthcare. In this study, we pro…

    Submitted 2 December, 2018; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/200

  7. arXiv:1811.08081  [pdf, other]

    cs.LG cs.CV stat.ML

    ChainGAN: A sequential approach to GANs

    Authors: Safwan Hossain, Kiarash Jamali, Yuchen Li, Frank Rudzicz

    Abstract: We propose a new architecture and training methodology for generative adversarial networks. Current approaches attempt to learn the transformation from a noise sample to a generated data sample in one shot. Our proposed generator architecture, called $\textit{ChainGAN}$, uses a two-step process. It first attempts to transform a noise vector into a crude sample, similar to a traditional generator.…

    Submitted 22 November, 2018; v1 submitted 20 November, 2018; originally announced November 2018.

  8. arXiv:1811.07266  [pdf, other]

    cs.CV stat.ML

    DeepConsensus: using the consensus of features from multiple layers to attain robust image classification

    Authors: Yuchen Li, Safwan Hossain, Kiarash Jamali, Frank Rudzicz

    Abstract: We consider a classifier whose test set is exposed to various perturbations that are not present in the training set. These test samples still contain enough features to map them to the same class as their unperturbed counterpart. Current architectures exhibit rapid degradation of accuracy when trained on standard datasets but then used to classify perturbed samples of that data. To address this,…

    Submitted 2 December, 2018; v1 submitted 17 November, 2018; originally announced November 2018.

  9. arXiv:1807.07217  [pdf, other]

    cs.LG stat.ML

    Deconfounding age effects with fair representation learning when assessing dementia

    Authors: Zining Zhu, Jekaterina Novikova, Frank Rudzicz

    Abstract: One of the most prevalent symptoms among the elderly population, dementia, can be detected by classifiers trained on linguistic features extracted from narrative transcripts. However, these linguistic features are impacted in a similar but different fashion by the normal aging process. Aging is therefore a confounding factor, whose effects have been hard for machine learning classifiers (especiall…

    Submitted 7 September, 2019; v1 submitted 18 July, 2018; originally announced July 2018.

    Comments: 9 pages, 2 figures

  10. arXiv:1805.09366  [pdf, other]

    cs.LG cs.MM cs.SD eess.AS eess.SP stat.ML

    Semi-supervised classification by reaching consensus among modalities

    Authors: Zining Zhu, Jekaterina Novikova, Frank Rudzicz

    Abstract: Deep learning has demonstrated abilities to learn complex structures, but they can be restricted by available data. Recently, Consensus Networks (CNs) were proposed to alleviate data sparsity by utilizing features from multiple modalities, but they too have been limited by the size of labeled data. In this paper, we extend CN to Transductive Consensus Networks (TCNs), suitable for semi-supervised…

    Submitted 19 November, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: NIPS IRASL Workshop 2018