A crucial step in metagenomic data analysis is fragment recruitment, a process of aligning sequencing reads to reference genomes. FR-HIT offers high speed and high sensitivity in recruiting large-scale metagenomic reads.
Microbiome data are directly obtained from various environments and contain genomics information of many known and novel microorganisms. An important step to study these organisms’ identity and abundance is to align the sequencing reads against the available reference genomes. This process was called fragment recruitment in the Global Ocean Sampling (GOS) project that surveyed the world’s oceans (Rusch et al. 2007).
A metagenomic dataset may have many novel species without available reference genomes. Even if references are available, the microbial species may undergo large variations. So a fragment recruitment method needs to...
KeywordsReference Genome Mapping Program Metagenomic Sequence Candidate Match Metagenomic Dataset
- Burkhardt S, Cramer A, Ferragina P. q-gram based database searching using a suffix array (QUASAR). RECOMB ’99; 1999 Apr 11–14; Lyon; 1999, pp. 77–83.Google Scholar
- Jokinen P, Ukkonen E. 2 algorithms for approximate string matching in static texts. In: Tarlecki A, editor. Mathematical foundations of computer science. Lecture notes in computer science, vol 520. Berlin: Springer; 1991, pp. 240–248.Google Scholar