Quantitative Biology > Genomics

arXiv:2101.05012 (q-bio)
[Submitted on 13 Jan 2021]

Title: Computer Architecture-Aware Optimisation of DNA Analysis Systems

Authors: Hasindu Gamaarachchi
Abstract: DNA sequencing is revolutionising the field of medicine. DNA sequencers, the machines which perform DNA sequencing, have evolved from the size of a fridge to that of a mobile phone over the last two decades. The cost of sequencing a human genome has also fallen from billions of dollars to hundreds of dollars. Despite these improvements, DNA sequencers output hundreds or thousands of gigabytes of data that must be analysed on computers to discover meaningful information with biological implications. Unfortunately, analysis techniques have not kept pace with rapidly improving sequencing technologies. Consequently, even today, DNA analysis is performed on high-performance computers, just as it was a couple of decades ago. Such high-performance computers are not portable, so the full utility of an ultra-portable sequencer for sequencing in the field or at the point of care is limited by the lack of portable, lightweight analytic techniques. This thesis proposes computer architecture-aware optimisation of DNA analysis software. DNA analysis software is inevitably convoluted due to the complexity associated with biological data. Modern computer architectures are also complex. Performing architecture-aware optimisations requires the synergistic use of knowledge from both domains (i.e., DNA sequence analysis and computer architecture). This thesis aims to draw the two domains together. In this thesis, gold-standard DNA sequence analysis workflows are systematically examined for algorithmic components that cause performance bottlenecks. Identified bottlenecks are resolved through architecture-aware optimisations at different levels, i.e., memory, cache, register and processor. The optimised software tools are used in complete end-to-end analysis workflows, and their efficacy is demonstrated by running them on prototypical embedded systems.
Comments: Supervisors: Parameswaran, Sri, Computer Science & Engineering, Faculty of Engineering, UNSW; Ignjatovic, Aleksandar, Computer Science & Engineering, Faculty of Engineering, UNSW; Smith, Martin A., Garvan Institute of Medical Research, Faculty of Medicine, UNSW. unsworks: this http URL
Subjects: Genomics (q-bio.GN); Computational Engineering, Finance, and Science (cs.CE)
ACM classes: J.3; C.3; C.4
Cite as: arXiv:2101.05012 [q-bio.GN]
  (or arXiv:2101.05012v1 [q-bio.GN] for this version)
  https://doi.org/10.48550/arXiv.2101.05012
arXiv-issued DOI via DataCite

Submission history

From: Hasindu Gamaarachchi
[v1] Wed, 13 Jan 2021 11:29:12 UTC (35,575 KB)
