Computer Analysis of Sequence Data pp 101-116 | Cite as
GCG: Database Searching
- 525 Downloads
Abstract
Scormg histograms of typical database searches. The number of hits IS plotted vs the “score” this hit causes during the searching procedure. Subsequent alignment might change these scores because of gaps and homologies. A. Result of searching human calmodulin DNA in the EMBL database. The related protein, troponin C, is found in the steep descent of the statistical noise. B. Result of searching a randomized sequence (again, calmodulin) at precisely the same conditions. Note the random hits with low scores, and the change of scale in the X axis. C. Result of searching human calmodulin protein sequence with tfastu. Note the difference in scores relative to A. D. Result of searching an alignment of calmodulms using the profilesearch method. The reading frame of 10 calmodulins was extracted from the database and alignment as described in Chapter 9. Note the difference m the scores relative to A.
Keywords
Query Sequence Batch Mode Output File Command Line Virtual MemoryReferences
- 1.Pearson, W. R (1989) Rapid and sensitive sequence-comparison with FASTP and FASTA, in Methods in Enzymology (Dayhoff, M O, ed.), vol. 183, Academic, San Diego, pp. 146–159.Google Scholar
- 2.Devereux, J., Haeberli, P., and Smithies, 0. (1984) A comprehensive set of sequence analysis programs for the VAX. Nucleic Acids Res. 12, 387–395.PubMedCrossRefGoogle Scholar
- 3.Gribskov, M and Eisenberg, D. (1989) Detection of structural patterns with profile analysis, in Techniques in Protein Chemistry (Hugh, T E., ed.), Academic, San Diego, pp. 108–117.Google Scholar
