SparkGC: Spark based genome compression for large collections of genomes

Abstract: Since the completion of the Human Genome Project at the turn of the century, there has been an unprecedented proliferation of sequencing data. One consequence is that it has become extremely difficult to store, back up, and migrate enormous amounts of genomic data, not to mention the...

Authors:

Yao, Haichang [author]

Hu, Guangyong

Liu, Shangdong

Fang, Houzhi

Ji, Yimu

Format:

E-article

Language:

English

Published:

2022

Keywords:

Genome compression

Reference-based compression

Spark

Distributed parallel

Note:

© The Author(s) 2022

Parent work:

Contained in: BMC Bioinformatics - London : BioMed Central, 2000, 23(2022), 1, 25 July

Parent work:

volume:23 ; year:2022 ; number:1 ; day:25 ; month:07

DOI / URN:

10.1186/s12859-022-04825-5

Catalog ID:

SPR050878700
