Hierarchical Overlap Graph

Bastien Cazaux 1, 2 Eric Rivals 1, 2
2 MAB - Méthodes et Algorithmes pour la Bioinformatique
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : Given a set of finite words, the Overlap Graph (OG) is a complete weighted digraph in which each word is a node and the weight of an arc equals the length of the longest overlap of one word onto the other (overlap is an asymmetric notion). The OG serves to assemble DNA fragments or to compute shortest superstrings, which are a compressed representation of the input. The OG requires space that is quadratic in the number of words, which limits its scalability. The Hierarchical Overlap Graph (HOG) is an alternative graph that also encodes all maximal overlaps, but uses space that is linear in the sum of the lengths of the input words. We propose the first algorithm to build the HOG in linear space for words of equal length.
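
To make the definition of the Overlap Graph concrete, here is a minimal Python sketch that builds it naively for a small set of words. It is only an illustration of the definition, not the algorithm proposed in the paper: the function names `longest_overlap` and `overlap_graph` are hypothetical, overlaps are assumed to be strictly shorter than both words, and self-loops are omitted for brevity.

```python
from itertools import permutations


def longest_overlap(u: str, v: str) -> int:
    """Length of the longest suffix of u that is also a prefix of v.

    Assumption: only overlaps strictly shorter than both words count.
    """
    for k in range(min(len(u), len(v)) - 1, 0, -1):
        if u[-k:] == v[:k]:
            return k
    return 0


def overlap_graph(words):
    """Naive Overlap Graph: one weighted arc per ordered pair of distinct words."""
    return {(u, v): longest_overlap(u, v) for u, v in permutations(words, 2)}


words = ["ACGT", "GTAC", "TACG"]
og = overlap_graph(words)
for (u, v), weight in sorted(og.items()):
    print(f"{u} -> {v}: {weight}")
```

With n words the dictionary holds n(n-1) arcs, which is the quadratic space cost the abstract refers to. The example also shows the asymmetry: the overlap of ACGT onto TACG has length 1, while the overlap of TACG onto ACGT has length 3.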

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01674319
Contributor : Eric Rivals
Submitted on : Monday, January 29, 2018 - 10:22:28 AM
Last modification on : Friday, April 19, 2019 - 4:55:29 PM
Long-term archiving on : Friday, May 25, 2018 - 10:02:30 AM

File

hog-art.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : lirmm-01674319, version 2

Citation

Bastien Cazaux, Eric Rivals. Hierarchical Overlap Graph. 2017. ⟨lirmm-01674319v2⟩
