Skip to main content
Log in

Rapid Homology Search with Neighbor Seeds

  • Published:
Algorithmica Aims and scope Submit manuscript

Abstract

Using a seed to rapidly "hit" possible homologies for further scrutiny is a common practice to speed up homology search in molecular sequences. It was shown that a collection of higher weight seeds have better sensitivity than a single lower weight seed at the same speed. However, huge memory requirements diminish the advantages of high weight seeds. This paper describes a two-stage extension method, which simulates high weight seeds with modest memory requirements. By drawing upon the previously studied ideas of vector seed and multiple seeds, we introduce neighbor seeds, which are implemented using two-stage extension. Neighbor seeds provide the flexibility to maximize the independence between the seeds, which is a well-known criterion for optimizing sensitivity. A major advantage of neighbor seeds is that they all rely on the same pre-built database index. Based on considerations of sensitivity and biological adequacy, different neighbor seeds can be used for different queries without rebuilding the index. The paper also discusses some other practical techniques to reduce memory usage.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Miklos Csuros or Bin Ma.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Csuros, M., Ma, B. Rapid Homology Search with Neighbor Seeds. Algorithmica 48, 187–202 (2007). https://doi.org/10.1007/s00453-007-0062-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00453-007-0062-y

Keywords

Navigation