American Journal of Bioinformatics Research
p-ISSN: 2167-6992 e-ISSN: 2167-6976
2014; 4(1): 1-5
doi:10.5923/j.bioinformatics.20140401.01
1Department of Mathematical Science, University of Saint Joseph, West Hartford, 06117, USA
2Department of Chemical Biology and Center for Cancer Prevention Research, The State University of New Jersey, Rutgers, 08854, USA
Correspondence to: Hong Zhou, Department of Mathematical Science, University of Saint Joseph, West Hartford, 06117, USA.
Email: |
Copyright © 2012 Scientific & Academic Publishing. All Rights Reserved.
The Basic Local Alignment Search Tool (BLAST) is the most powerful and popular sequence alignment tool used in biological research, which includes the siRNA or shRNA design. Unfortunately, BLAST can overlook some significant homologies that may cause off-target effect for siRNA gene silence. Therefore, the Smith-Waterman alignment algorithm has been called upon, since it guarantees to find all possible mismatch alignments that may cause off-target effect. However, the Smith-Waterman alignment algorithm suffers from its inefficiency in searching through a large sequence database. This paper presents a two-phase homology search strategy that preserves the strength of Smith-Waterman alignment algorithm while shortening its running time. In the first phase of this algorithm, selected siRNA sequences are divided into multiple mutually disjointed substrings and each is used to scan the sequence database for perfect matches against other genes. Only the sequences that have a perfect match to substrings (of a given siRNA) are kept for the second phase. The second phase is the bona fide Smith-Waterman procedure. During this phase, the algorithm only checks the local vicinity sequences where a substring lands on a perfect match. This two-phased arrangement of the algorithm significantly improves the efficiency of the original Smith-Waterman algorithm by concentrating the search on localized regions instead of the whole genome database.
Keywords: siRNA, shRNA, Smith-Waterman, Sequence Alignment, Off-Target
Cite this paper: Hong Zhou, Hong Wang, Localized Smith-Waterman Algorithm for Fast and Complete Search of siRNA off-Target Sequences, American Journal of Bioinformatics Research, Vol. 4 No. 1, 2014, pp. 1-5. doi: 10.5923/j.bioinformatics.20140401.01.
(1) |
(2) |
(3) |
Figure 4. The value of x impacts the efficiency of the two-phase algorithm. sw: Smith-Waterman algorithm alone. tp: the proposed two-phase algorithm |