A Bit-Parallel, General Integer-Scoring Sequence Alignment Algorithm
 Gary Benson,
 Yozen Hernandez,
 Joshua Loving
Abstract
Mapping of next-generation sequencing data and other processor-intensive sequence comparison applications have motivated a continued search for high-efficiency sequence alignment algorithms. In one approach, which exploits the inherent parallelism in computer logic calculations, individual cells in an alignment scoring matrix are represented as bits in a computer word, and the calculation of scores is emulated by a series of bit operations composed of AND, OR, XOR, complement, shift, and addition. Bit-parallelism has been successfully applied to the Longest Common Subsequence (LCS) and edit-distance problems, producing solutions that are significantly faster than standard implementations. However, the intensive mental effort required to produce these solutions, which are closely tied to special properties of the problems, has limited efforts to extend bit-parallelism to more general scoring schemes. In this paper, we give the first bit-parallel solution for general, integer-scoring global alignment. Integer-scoring schemes, which are widely used, assign integer weights for match, mismatch, and insertion/deletion (indel). Our method depends on structural properties of the relationship between adjacent scores in the scoring matrix. We utilize these properties to construct a class of efficient algorithms, each designed for a particular set of weights, and we introduce a standard for characterizing the efficiency in terms of the average number of bit operations per cell of the original scoring matrix.
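To make the bit-parallel idea concrete, the following is a minimal sketch of the well-known bit-vector recurrence for LCS length, the special case mentioned above. It is not the general integer-scoring algorithm of this paper. The function name `lcs_length_bitparallel` and the variable names are illustrative assumptions, and a Python integer stands in for a (possibly multi-word) machine word.

```python
def lcs_length_bitparallel(a: str, b: str) -> int:
    """Length of the LCS of a and b via a bit-vector recurrence.

    Illustrative sketch only: one bit per character of `a`, one update
    per character of `b`. Names here are assumptions, not from the paper.
    """
    m = len(a)
    mask = (1 << m) - 1  # keep only the m low-order bits

    # Match vectors: bit i of match[c] is set when a[i] == c.
    match = {}
    for i, ch in enumerate(a):
        match[ch] = match.get(ch, 0) | (1 << i)

    # v holds one bit per position of `a`; a zero bit marks a position
    # where the LCS length increases by one along the current row.
    v = mask
    for ch in b:
        u = v & match.get(ch, 0)
        # Addition propagates carries across runs of set bits, which
        # replaces the cell-by-cell max() of the usual dynamic program.
        v = ((v + u) | (v - u)) & mask

    # The LCS length is the number of zero bits among the m low bits.
    return m - bin(v).count("1")
```

For example, `lcs_length_bitparallel("ABCBDAB", "BDCABA")` evaluates to 4. Each character of `b` costs a fixed handful of word operations (here AND, addition, subtraction, OR), regardless of how many cells of the scoring matrix that word covers, which is the kind of per-cell cost the efficiency measure above is meant to capture.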
