Skip to main content

Showing 1–16 of 16 results for author: Fernandez-Baca, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2104.02103  [pdf, ps, other

    cs.DS cs.DM

    Exact Algorithms for No-Rainbow Coloring and Phylogenetic Decisiveness

    Authors: Ghazaleh Parvini, David Fernández-Baca

    Abstract: The input to the no-rainbow hypergraph coloring problem is a hypergraph $H$ where every hyperedge has $r$ nodes. The question is whether there exists an $r$-coloring of the nodes of $H$ such that all $r$ colors are used and there is no rainbow hyperedge -- i.e., no hyperedge uses all $r$ colors. The no-rainbow hypergraph $r$-coloring problem is known to be NP-complete for $r \geq 3$. The special c… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

    MSC Class: 05C15; 68Q25; 68W20; 68W40; 92D15

  2. arXiv:2002.09725  [pdf, other

    cs.DS

    Testing the Agreement of Trees with Internal Labels

    Authors: David Fernández-Baca, Lei Liu

    Abstract: The input to the agreement problem is a collection $P = \{T_1, T_2, \dots , T_k\}$ of phylogenetic trees, called input trees, over partially overlapping sets of taxa. The question is whether there exists a tree $T$, called an agreement tree, whose taxon set is the union of the taxon sets of the input trees, such that for each $i \in \{1, 2, \dots , k\}$, the restriction of $T$ to the taxon set of… ▽ More

    Submitted 22 February, 2020; originally announced February 2020.

    ACM Class: F.2; J.3

  3. arXiv:2002.09722  [pdf, ps, other

    cs.DS

    Checking Phylogenetic Decisiveness in Theory and in Practice

    Authors: Ghazaleh Parvini, Katherine Braught, David Fernández-Baca

    Abstract: Suppose we have a set $X$ consisting of $n$ taxa and we are given information from $k$ loci from which to construct a phylogeny for $X$. Each locus offers information for only a fraction of the taxa. The question is whether this data suffices to construct a reliable phylogeny. The decisiveness problem expresses this question combinatorially. Although a precise characterization of decisiveness is k… ▽ More

    Submitted 22 February, 2020; originally announced February 2020.

    MSC Class: 05C15; 05C65 ACM Class: F.2; J.3

  4. arXiv:1910.07819  [pdf, other

    q-bio.PE cs.DS

    EvoZip: Efficient Compression of Large Collections of Evolutionary Trees

    Authors: Balanand Jha, David Fernández-Baca, Akshay Deepak, Kumar Abhishek

    Abstract: Phylogenetic trees represent evolutionary relationships among sets of organisms. Popular phylogenetic reconstruction approaches typically yield hundreds to thousands of trees on a common leafset. Storing and sharing such large collection of trees requires considerable amount of space and bandwidth. Furthermore, the huge size of phylogenetic tree databases can make search and retrieval operations t… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

  5. arXiv:1811.01338  [pdf, other

    cs.LG q-bio.BM stat.ML

    Deep Robust Framework for Protein Function Prediction using Variable-Length Protein Sequences

    Authors: Ashish Ranjan, Md Shah Fahad, David Fernandez-Baca, Akshay Deepak, Sudhakar Tripathi

    Abstract: Amino acid sequence portrays most intrinsic form of a protein and expresses primary structure of protein. The order of amino acids in a sequence enables a protein to acquire a particular stable conformation that is responsible for the functions of the protein. This relationship between a sequence and its function motivates the need to analyse the sequences for predicting protein functions. Early g… ▽ More

    Submitted 19 June, 2019; v1 submitted 4 November, 2018; originally announced November 2018.

    Journal ref: IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2019

  6. arXiv:1605.02045  [pdf, other

    cs.DS

    Fast Compatibility Testing for Phylogenies with Nested Taxa

    Authors: Yun Deng, David Fernández-Baca

    Abstract: Semi-labeled trees are phylogenies whose internal nodes may be labeled by higher-order taxa. Thus, a leaf labeled Mus musculus could nest within a subtree whose root node is labeled Rodentia, which itself could nest within a subtree whose root is labeled Mammalia. Suppose we are given collection $\mathcal P$ of semi-labeled trees over various subsets of a set of taxa. The ancestral compatibility p… ▽ More

    Submitted 6 May, 2016; originally announced May 2016.

    Comments: 3 figures

    MSC Class: 05C85; 68Q25; 68W40; 92D15 ACM Class: F.2.2; G.2.2; J.3

  7. arXiv:1510.07758  [pdf, other

    cs.DS

    Fast Compatibility Testing for Rooted Phylogenetic Trees

    Authors: Yun Deng, David Fernández-Baca

    Abstract: We consider the following basic problem in phylogenetic tree construction. Let $\mathcal{P} = \{T_1, \ldots, T_k\}$ be a collection of rooted phylogenetic trees over various subsets of a set of species. The tree compatibility problem asks whether there is a tree $T$ with the following property: for each $i \in \{1, \dots, k\}$, $T_i$ can be obtained from the restriction of $T$ to the species set o… ▽ More

    Submitted 26 October, 2015; originally announced October 2015.

    ACM Class: F.2.0

  8. arXiv:1503.03877  [pdf, ps, other

    cs.DS

    Constructing and Employing Tree Alignment Graphs for Phylogenetic Synthesis

    Authors: Ruchi Chaudhary, David Fernandez-Baca, J. Gordon Burleigh

    Abstract: Tree alignment graphs (TAGs) provide an intuitive data structure for storing phylogenetic trees that exhibits the relationships of the individual input trees and can potentially account for nested taxonomic relationships. This paper provides a theoretical foundation for the use of TAGs in phylogenetics. We provide a formal definition of TAG that - unlike previous definition - does not depend on th… ▽ More

    Submitted 12 March, 2015; originally announced March 2015.

  9. arXiv:1307.7828  [pdf, ps, other

    cs.DM cs.CE q-bio.QM

    Characterizing Compatibility and Agreement of Unrooted Trees via Cuts in Graphs

    Authors: Sudheer Vakati, David Fernández-Baca

    Abstract: Deciding whether there is a single tree -a supertree- that summarizes the evolutionary information in a collection of unrooted trees is a fundamental problem in phylogenetics. We consider two versions of this question: agreement and compatibility. In the first, the supertree is required to reflect precisely the relationships among the species exhibited by the input trees. In the second, the supert… ▽ More

    Submitted 30 July, 2013; originally announced July 2013.

    Comments: Peer-reviewed and presented as part of the 13th Workshop on Algorithms in Bioinformatics (WABI2013)

  10. arXiv:1210.3762  [pdf, ps, other

    cs.DM

    On Two Graph-Theoretic Characterizations of Tree Compatibility

    Authors: Sudheer Vakati, David Fernández-Baca

    Abstract: Deciding whether a collection of unrooted trees is compatible is a fundamental problem in phylogenetics. Two different graph-theoretic characterizations of tree compatibility have recently been proposed. In one of these, tree compatibility is characterized in terms of the existence of a specific kind of triangulation in a structure known as the display graph. An alternative characterization expres… ▽ More

    Submitted 14 October, 2012; originally announced October 2012.

    MSC Class: 68R10; 92B10 ACM Class: F.2.2; G.2.2; J.3

  11. arXiv:1210.2665  [pdf, other

    cs.DS q-bio.PE

    Inferring Species Trees from Incongruent Multi-Copy Gene Trees Using the Robinson-Foulds Distance

    Authors: Ruchi Chaudhary, J. Gordon Burleigh, David Fernández-Baca

    Abstract: We present a new method for inferring species trees from multi-copy gene trees. Our method is based on a generalization of the Robinson-Foulds (RF) distance to multi-labeled trees (mul-trees), i.e., gene trees in which multiple leaves can have the same label. Unlike most previous phylogenetic methods using gene trees, this method does not assume that gene tree incongruence is caused by a single, s… ▽ More

    Submitted 9 October, 2012; originally announced October 2012.

    Comments: 16 pages, 11 figures

  12. arXiv:1205.6359  [pdf, other

    cs.DS q-bio.PE

    Extracting Conflict-free Information from Multi-labeled Trees

    Authors: Akshay Deepak, David Fernández-Baca, Michelle M. McMahon

    Abstract: A multi-labeled tree, or MUL-tree, is a phylogenetic tree where two or more leaves share a label, e.g., a species name. A MUL-tree can imply multiple conflicting phylogenetic relationships for the same set of taxa, but can also contain conflict-free information that is of interest and yet is not obvious. We define the information content of a MUL-tree T as the set of all conflict-free quartet topo… ▽ More

    Submitted 28 June, 2012; v1 submitted 29 May, 2012; originally announced May 2012.

    Comments: Submitted in Workshop on Algorithms in Bioinformatics 2012 (http://algo12.fri.uni-lj.si/?file=wabi)

  13. arXiv:1205.5779  [pdf, ps, other

    math.CO cs.DM cs.DS

    Improved Lower Bounds on the Compatibility of Multi-State Characters

    Authors: Brad Shutters, Sudheer Vakati, David Fernández-Baca

    Abstract: We study a long standing conjecture on the necessary and sufficient conditions for the compatibility of multi-state characters: There exists a function $f(r)$ such that, for any set $C$ of $r$-state characters, $C$ is compatible if and only if every subset of $f(r)$ characters of $C$ is compatible. We show that for every $r \ge 2$, there exists an incompatible set $C$ of… ▽ More

    Submitted 25 May, 2012; originally announced May 2012.

  14. arXiv:1106.0874  [pdf, ps, other

    cs.DS cs.DM

    A Simple Characterization of the Minimal Obstruction Sets for Three-State Perfect Phylogenies

    Authors: Brad Shutters, David Fernández-Baca

    Abstract: Lam, Gusfield, and Sridhar (2009) showed that a set of three-state characters has a perfect phylogeny if and only if every subset of three characters has a perfect phylogeny. They also gave a complete characterization of the sets of three three-state characters that do not have a perfect phylogeny. However, it is not clear from their characterization how to find a subset of three characters that d… ▽ More

    Submitted 5 June, 2011; originally announced June 2011.

  15. arXiv:1004.4196  [pdf, ps, other

    cs.DM

    Graph Triangulations and the Compatibility of Unrooted Phylogenetic Trees

    Authors: Sudheer Vakati, David Fernández-Baca

    Abstract: We characterize the compatibility of a collection of unrooted phylogenetic trees as a question of determining whether a graph derived from these trees --- the display graph --- has a specific kind of triangulation, which we call legal. Our result is a counterpart to the well known triangulation-based characterization of the compatibility of undirected multi-state characters.

    Submitted 23 April, 2010; originally announced April 2010.

    MSC Class: 68R10; 92B10 ACM Class: F.2.2; G.2.2; J.3

  16. arXiv:0906.5089  [pdf, ps, other

    cs.DS cs.DM

    Comparing and Aggregating Partially Resolved Trees

    Authors: Mukul S. Bansal, Jianrong Dong, David Fernández-Baca

    Abstract: We define, analyze, and give efficient algorithms for two kinds of distance measures for rooted and unrooted phylogenies. For rooted trees, our measures are based on the topologies the input trees induce on triplets; that is, on three-element subsets of the set of species. For unrooted trees, the measures are based on quartets (four-element subsets). Triplet and quartet-based distances provide a… ▽ More

    Submitted 27 June, 2009; originally announced June 2009.

    Comments: 34 pages

    ACM Class: F.2.2; G.2; J.3