Skip to main content

Showing 1–23 of 23 results for author: Haws, D

.
  1. arXiv:2505.08699  [pdf, other

    eess.AS

    Granite-speech: open-source speech-aware LLMs with strong English ASR capabilities

    Authors: George Saon, Avihu Dekel, Alexander Brooks, Tohru Nagano, Abraham Daniels, Aharon Satt, Ashish Mittal, Brian Kingsbury, David Haws, Edmilson Morais, Gakuto Kurata, Hagai Aronowitz, Ibrahim Ibrahim, Jeff Kuo, Kate Soule, Luis Lastras, Masayuki Suzuki, Ron Hoory, Samuel Thomas, Sashi Novitasari, Takashi Fukuda, Vishal Sunder, Xiaodong Cui, Zvi Kons

    Abstract: Granite-speech LLMs are compact and efficient speech language models specifically designed for English ASR and automatic speech translation (AST). The models were trained by modality aligning the 2B and 8B parameter variants of granite-3.3-instruct to speech on publicly available open-source corpora containing audio inputs and text targets consisting of either human transcripts for ASR or automati… ▽ More

    Submitted 13 May, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

    Comments: 7 pages, 9 figures

  2. arXiv:2410.16048  [pdf, other

    eess.AS

    Continuous Speech Synthesis using per-token Latent Diffusion

    Authors: Arnon Turetzky, Nimrod Shabtay, Slava Shechtman, Hagai Aronowitz, David Haws, Ron Hoory, Avihu Dekel

    Abstract: The success of autoregressive transformer models with discrete tokens has inspired quantization-based approaches for continuous modalities, though these often limit reconstruction quality. We therefore introduce SALAD, a per-token latent diffusion model for zero-shot text-to-speech, that operates on continuous representations. SALAD builds upon the recently proposed expressive diffusion head for i… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Preprint, Under review

  3. arXiv:2309.11210  [pdf, other

    eess.AS cs.CL cs.SD

    Speak While You Think: Streaming Speech Synthesis During Text Generation

    Authors: Avihu Dekel, Slava Shechtman, Raul Fernandez, David Haws, Zvi Kons, Ron Hoory

    Abstract: Large Language Models (LLMs) demonstrate impressive capabilities, yet interaction with these models is mostly facilitated through text. Using Text-To-Speech to synthesize LLM outputs typically results in notable latency, which is impractical for fluent voice conversations. We propose LLM2Speech, an architecture to synthesize speech while text is being generated by an LLM which yields significant l… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Under review for ICASSP 2024

  4. arXiv:2208.01818  [pdf, other

    cs.SD cs.CL eess.AS

    VQ-T: RNN Transducers using Vector-Quantized Prediction Network States

    Authors: Jiatong Shi, George Saon, David Haws, Shinji Watanabe, Brian Kingsbury

    Abstract: Beam search, which is the dominant ASR decoding algorithm for end-to-end models, generates tree-structured hypotheses. However, recent studies have shown that decoding with hypothesis merging can achieve a more efficient search with comparable or better performance. But, the full context in recurrent networks is not compatible with hypothesis merging. We propose to use vector-quantized long short-… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: Interspeech 2022 accepted paper

  5. arXiv:2207.12262  [pdf, other

    eess.AS cs.SD

    Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis

    Authors: Raul Fernandez, David Haws, Guy Lorberbom, Slava Shechtman, Alexander Sorin

    Abstract: Sequence-to-Sequence Text-to-Speech architectures that directly generate low level acoustic features from phonetic sequences are known to produce natural and expressive speech when provided with adequate amounts of training data. Such systems can learn and transfer desired speaking styles from one seen speaker to another (in multi-style multi-speaker settings), which is highly desirable for creati… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted for presentation at Interspeech 2022

  6. arXiv:2108.10803  [pdf, ps, other

    cs.CL cs.AI cs.SD eess.AS

    Reducing Exposure Bias in Training Recurrent Neural Network Transducers

    Authors: Xiaodong Cui, Brian Kingsbury, George Saon, David Haws, Zoltan Tuske

    Abstract: When recurrent neural network transducers (RNNTs) are trained using the typical maximum likelihood criterion, the prediction network is trained only on ground truth label sequences. This leads to a mismatch during inference, known as exposure bias, when the model must deal with label sequences containing errors. In this paper we investigate approaches to reducing exposure bias in training to impro… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

    Comments: accepted to Interspeech 2021

  7. Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis

    Authors: Slava Shechtman, Raul Fernandez, David Haws

    Abstract: Although Sequence-to-Sequence (S2S) architectures have become state-of-the-art in speech synthesis, capable of generating outputs that approach the perceptual quality of natural samples, they are limited by a lack of flexibility when it comes to controlling the output. In this work we present a framework capable of controlling the prosodic output via a set of concise, interpretable, disentangled p… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: IEEE Spoken Language Technology Workshop (SLT), 2021

  8. Polyhedral aspects of score equivalence in Bayesian network structure learning

    Authors: James Cussens, David Haws, Milan Studeny

    Abstract: This paper deals with faces and facets of the family-variable polytope and the characteristic-imset polytope, which are special polytopes used in integer linear programming approaches to statistically learn Bayesian network structure. A common form of linear objectives to be maximized in this area leads to the concept of score equivalence (SE), both for linear objectives and for faces of the famil… ▽ More

    Submitted 10 April, 2015; v1 submitted 3 March, 2015; originally announced March 2015.

    Comments: 37 pages

    MSC Class: 90-02 ACM Class: G.1.6

    Journal ref: Mathematical Programming A 164 (2017) n. 1-2, 285-324

  9. arXiv:1310.1659  [pdf, ps, other

    cs.LG cs.CE

    MINT: Mutual Information based Transductive Feature Selection for Genetic Trait Prediction

    Authors: Dan He, Irina Rish, David Haws, Simon Teyssedre, Zivan Karaman, Laxmi Parida

    Abstract: Whole genome prediction of complex phenotypic traits using high-density genotyping arrays has attracted a great deal of attention, as it is relevant to the fields of plant and animal breeding and genetic epidemiology. As the number of genotypes is generally much bigger than the number of samples, predictive models suffer from the curse-of-dimensionality. The curse-of-dimensionality problem not onl… ▽ More

    Submitted 6 October, 2013; originally announced October 2013.

  10. arXiv:1310.1649  [pdf, other

    cs.DS

    QuickLexSort: An efficient algorithm for lexicographically sorting nested restrictions of a database

    Authors: David Haws

    Abstract: Lexicographical sorting is a fundamental problem with applications to contingency tables, databases, Bayesian networks, and more. A standard method to lexicographically sort general data is to iteratively use a stable sort -- a sort which preserves existing orders. Here we present a new method of lexicographical sorting called QuickLexSort. Whereas a stable sort based lexicographical sorting algor… ▽ More

    Submitted 6 October, 2013; originally announced October 2013.

    Comments: 17, 1 figure

    MSC Class: 68Q25; 68P10; 62H17 ACM Class: F.2.2; G.3

  11. arXiv:1204.3070  [pdf, other

    math.ST math.CO

    Markov degree of the three-state toric homogeneous Markov chain model

    Authors: David Haws, Abraham Martín del Campo, Akimichi Takemura, Ruriko Yoshida

    Abstract: We consider the three-state toric homogeneous Markov chain model (THMC) without loops and initial parameters. At time $T$, the size of the design matrix is $6 \times 3\cdot 2^{T-1}$ and the convex hull of its columns is the model polytope. We study the behavior of this polytope for $T\geq 3$ and we show that it is defined by 24 facets for all $T\ge 5$. Moreover, we give a complete description of t… ▽ More

    Submitted 17 September, 2013; v1 submitted 13 April, 2012; originally announced April 2012.

    Comments: 26 pages, 1 figure

  12. arXiv:1111.6518   

    math.CO stat.CO

    Semigroups and sequential importance sampling for multiway tables

    Authors: Ruriko Yoshida, Jing Xi, Shaoceng Wei, Feng Zhou, David Haws

    Abstract: When an interval of integers between the lower bound $l_i$ and the upper bound $u_i$ is the support of the marginal distribution $n_i|(n_{i-1}, ...,n_1)$, Chen et al, 2005 noticed that sampling from the interval at each step, for $n_i$ during a sequential importance sampling (SIS) procedure, always produces a table which satisfies the marginal constraints. However, in general, the interval may not… ▽ More

    Submitted 18 January, 2018; v1 submitted 28 November, 2011; originally announced November 2011.

    Comments: There are some theoretical mistakes. Thus, we would like to withdraw the paper

  13. arXiv:1109.4453  [pdf, other

    math.CO

    Volumes and Tangent Cones of Matroid Polytopes

    Authors: David C. Haws

    Abstract: De Loera et al. 2009, showed that when the rank is fixed the Ehrhart polynomial of a matroid polytope can be computed in polynomial time when the number of elements varies. A key to proving this is the fact that the number of simplicial cones in any triangulation of a tangent cone is bounded polynomially in the number of elements when the rank is fixed. The authors speculated whether or not the Eh… ▽ More

    Submitted 20 September, 2011; originally announced September 2011.

    Comments: 10 pages, 5 figures

    MSC Class: 05; 52B

  14. arXiv:1108.5939  [pdf, other

    math.ST math.CO

    Estimating the number of zero-one multi-way tables via sequential importance sampling

    Authors: Jing Xi, Ruriko Yoshida, David Haws

    Abstract: In 2005, Chen et al introduced a sequential importance sampling (SIS) procedure to analyze zero-one two-way tables with given fixed marginal sums (row and column sums) via the conditional Poisson (CP) distribution. They showed that compared with Monte Carlo Markov chain (MCMC)-based approaches, their importance sampling method is more efficient in terms of running time and also provides an easy an… ▽ More

    Submitted 28 November, 2011; v1 submitted 30 August, 2011; originally announced August 2011.

    Comments: 1 figures, 16 pages

  15. arXiv:1108.2311   

    math.ST math.CO

    Semigroups and sequential importance sampling for multiway tables and beyond

    Authors: Jing Xi, Shaoceng Wei, Feng Zhou, Ruriko Yoshida, David Haws

    Abstract: When an interval of integers between the lower bound l_i and the upper bounds u_i is the support of the marginal distribution n_i|(n_{i-1}, ...,n_1), Chen et al. 2005 noticed that sampling from the interval at each step, for n_i during the sequential importance sampling (SIS) procedure, always produces a table which satisfies the marginal constraints. However, in general, the interval may not be e… ▽ More

    Submitted 15 November, 2011; v1 submitted 10 August, 2011; originally announced August 2011.

    Comments: Jing Xi, Shaoceng Wei and Feng Zhou are joint first authors. Withdrawn for theoretical revisions

    MSC Class: 62H17

  16. arXiv:1108.0481  [pdf, other

    math.CO math.ST

    Degree Bounds for a Minimal Markov Basis for the Three-State Toric Homogeneous Markov Chain Model

    Authors: David Haws, Abraham Martin Del Campo, Ruriko Yoshida

    Abstract: We study the three state toric homogeneous Markov chain model and three special cases of it, namely: (i) when the initial state parameters are constant, (ii) without self-loops, and (iii) when both cases are satisfied at the same time. Using as a key tool a directed multigraph associated to the model, the state-graph, we give a bound on the number of vertices of the polytope associated to the mode… ▽ More

    Submitted 3 August, 2011; v1 submitted 2 August, 2011; originally announced August 2011.

    MSC Class: 05cxx; 60J10; 52B20

  17. arXiv:1107.4708  [pdf, ps, other

    math.ST

    On polyhedral approximations of polytopes for learning Bayes nets

    Authors: Milan Studeny, David Haws

    Abstract: We review three vector encodings of Bayesian network structures. The first one has recently been applied by Jaakkola 2010, the other two use special integral vectors formerly introduced, called imsets [Studeny 2005, Studeny 2010]. The central topic is the comparison of outer polyhedral approximations of the corresponding polytopes. We show how to transform the inequalities suggested by Jaakkola et… ▽ More

    Submitted 3 August, 2011; v1 submitted 23 July, 2011; originally announced July 2011.

    MSC Class: 62H17 ACM Class: G.3

    Journal ref: 2013, Journal of Algebraic Statistics, 4:1, 59-92

  18. arXiv:1004.2101  [pdf, other

    q-bio.PE q-bio.GN

    Statistical Phylogenetic Tree Analysis Using Differences of Means

    Authors: Elissaveta Arnaoudova, David Haws, Peter Huggins, Jerzy W. Jaromczyk, Neil Moore, Chris Schardl, Ruriko Yoshida

    Abstract: We propose a statistical method to test whether two phylogenetic trees with given alignments are significantly incongruent. Our method compares the two distributions of phylogenetic trees given by the input alignments, instead of comparing point estimations of trees. This statistical approach can be applied to gene tree analysis for example, detecting unusual events in genome evolution such as ho… ▽ More

    Submitted 12 April, 2010; originally announced April 2010.

    Comments: 17 pages, 6 figures

  19. arXiv:1004.2073  [pdf, other

    q-bio.PE math.CO

    Optimality of the Neighbor Joining Algorithm and Faces of the Balanced Minimum Evolution Polytope

    Authors: David C. Haws, Terrell Hodge, Ruriko Yoshida

    Abstract: Balanced minimum evolution (BME) is a statistically consistent distance-based method to reconstruct a phylogenetic tree from an alignment of molecular data. In 2000, Pauplin showed that the BME method is equivalent to optimizing a linear functional over the BME polytope, the convex hull of the BME vectors obtained from Pauplin's formula applied to all binary trees. The BME method is related to the… ▽ More

    Submitted 3 February, 2011; v1 submitted 12 April, 2010; originally announced April 2010.

    Comments: 24 pages,4 figure

    MSC Class: 52B11; 92D15

  20. arXiv:0911.0645  [pdf, ps, other

    q-bio.PE cs.LG q-bio.QM

    Bayes estimators for phylogenetic reconstruction

    Authors: Peter Huggins, Wenbin Li, David Haws, Thomas Friedrich, Jinze Liu, Ruriko Yoshida

    Abstract: Tree reconstruction methods are often judged by their accuracy, measured by how close they get to the true tree. Yet most reconstruction methods like ML do not explicitly maximize this accuracy. To address this problem, we propose a Bayesian solution. Given tree samples, we propose finding the tree estimate which is closest on average to the samples. This ``median'' tree is known as the Bayes es… ▽ More

    Submitted 21 November, 2009; v1 submitted 3 November, 2009; originally announced November 2009.

    Comments: 31 pages, 4 figures, and 3 tables

  21. arXiv:0905.4405  [pdf, other

    math.CO

    Matroid Polytopes: Algorithms, Theory, and Applications

    Authors: David C. Haws

    Abstract: This dissertation presents new results on three different themes all related to matroid polytopes. First we investigate properties of Ehrhart polynomials of matroid polytopes, independence matroid polytopes, and polymatroids. We prove that for fixed rank their Ehrhart polynomials are computable in polynomial time. The proof relies on the geometry of these polytopes as well as a new refined analy… ▽ More

    Submitted 27 May, 2009; originally announced May 2009.

    MSC Class: 52B40; 90C30; 05B35; 90C27

  22. Ehrhart polynomials of matroid polytopes and polymatroids

    Authors: Jesús A. De Loera, David C. Haws, Matthias Köppe

    Abstract: We investigate properties of Ehrhart polynomials for matroid polytopes, independence matroid polytopes, and polymatroids. In the first half of the paper we prove that for fixed rank their Ehrhart polynomials are computable in polynomial time. The proof relies on the geometry of these polytopes as well as a new refined analysis of the evaluation of Todd polynomials. In the second half we discuss… ▽ More

    Submitted 23 October, 2007; originally announced October 2007.

    Comments: 28 pages, 6 figures, submitted to Discrete and Computational Geometry

    MSC Class: 05; 52B

    Journal ref: Discrete Comput. Geom. 42 (2009), no. 4, 670-702

  23. arXiv:math/0307350  [pdf, ps, other

    math.CO

    Short Rational Functions for Toric Algebra and Applications

    Authors: Jesus De Loera, David Haws, Raymond Hemmecke, Peter Huggins, Bernd Sturmfels, Ruriko Yoshida

    Abstract: We encode the binomials belonging to the toric ideal $I_A$ associated with an integral $d \times n$ matrix $A$ using a short sum of rational functions as introduced by Barvinok \cite{bar,newbar}. Under the assumption that $d,n$ are fixed, this representation allows us to compute the Graver basis and the reduced Gröbner basis of the ideal $I_A$, with respect to any term order, in time polynomial… ▽ More

    Submitted 26 July, 2003; originally announced July 2003.

    Comments: 13 pages, using elsart.sty and elsart.cls

    MSC Class: 05A15 (primary); 13P10 (secondary)