Pearson WR (2013) An introduction to sequence similarity (“homology”) searching. Curr Protoc Bioinform 42:1–3. https://doi.org/10.1002/0471250953.bi0301s42
Article
Google Scholar
Vialle RA, Pedrosa FO, Weiss VA et al (2016) RAFTS3: rapid alignment-free tool for sequence similarity search. bioRxiv. https://doi.org/10.1101/055269
Article
Google Scholar
Lambert C, Campenhout JM, DeBolle X, Depiereux E (2003) Review of common sequence alignment methods: clues to enhance reliability. Curr Genom 4:131–146. https://doi.org/10.2174/1389202033350038
CAS
Article
Google Scholar
Vinga S, Almeida J (2003) Alignment-free sequence comparison—a review. Bioinform Oxf Engl 19:513–523. https://doi.org/10.1093/bioinformatics/btg005
CAS
Article
Google Scholar
Zielezinski A, Vinga S, Almeida J, Karlowski WM (2017) Alignment-free sequence comparison: benefits, applications, and tools. Genome Biol 18:186. https://doi.org/10.1186/s13059-017-1319-7
Article
PubMed
PubMed Central
Google Scholar
Krasnogor N, Pelta DA (2004) Measuring the similarity of protein structures by means of the universal similarity metric. Bioinform Oxf Engl 20:1015–1021. https://doi.org/10.1093/bioinformatics/bth031
CAS
Article
Google Scholar
Mahmood K, Webb GI, Song J et al (2012) Efficient large-scale protein sequence comparison and gene matching to identify orthologs and co-orthologs. Nucleic Acids Res 40:e44. https://doi.org/10.1093/nar/gkr1261
CAS
Article
PubMed
Google Scholar
Steinegger M, Söding J (2017) MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol 35:1026–1028. https://doi.org/10.1038/nbt.3988
CAS
Article
PubMed
Google Scholar
Sheynkman GM, Shortreed MR, Cesnik AJ, Smith LM (2016) Proteogenomics: integrating next-generation sequencing and mass spectrometry to characterize human proteomic variation. Annu Rev Anal Chem Palo Alto Calif 9:521–545. https://doi.org/10.1146/annurev-anchem-071015-041722
Article
PubMed
PubMed Central
Google Scholar
Needleman SB, Wunsch CD (1970) A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 48:443–453
CAS
Article
Google Scholar
Smith TF, Waterman MS (1981) Identification of common molecular subsequences. J Mol Biol 147:195–197
CAS
Article
Google Scholar
Altschul SF, Gish W, Miller W et al (1990) Basic local alignment search tool. J Mol Biol 215:403–410
CAS
Article
Google Scholar
Pearson WR, Lipman DJ (1988) Improved tools for biological sequence comparison. Proc Natl Acad Sci USA 85:2444–2448. https://doi.org/10.1073/pnas.85.8.2444
CAS
Article
PubMed
Google Scholar
Dayhoff M, Schwartz R, Orcutt B (1978) A model of evolutionary change in proteins. In: Dayhoff MO (ed) Atlas of protein sequence and structure, vol 5. National Biomedical Research Foundation. Washington, DC, pp 345–352
Google Scholar
Henikoff S, Henikoff JG (1992) Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci USA 89:10915–10919. https://doi.org/10.1073/pnas.89.22.10915
CAS
Article
PubMed
Google Scholar
Yu Y-K, Altschul SF (2005) The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions. Bioinform Oxf Engl 21:902–911. https://doi.org/10.1093/bioinformatics/bti070
CAS
Article
Google Scholar
Altschul SF, Madden TL, Schäffer AA et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402. https://doi.org/10.1093/nar/25.17.3389
CAS
Article
PubMed
PubMed Central
Google Scholar
Panchenko AR, Bryant SH (2002) A comparison of position-specific score matrices based on sequence and structure alignments. Protein Sci Publ Protein Soc 11:361–370. https://doi.org/10.1110/ps.19902
CAS
Article
Google Scholar
Jaroszewski L, Rychlewski L, Li Z et al (2005) FFAS03: a server for profile–profile sequence alignments. Nucleic Acids Res 33:W284–W288. https://doi.org/10.1093/nar/gki418
CAS
Article
PubMed
PubMed Central
Google Scholar
Biegert A, Söding J (2009) Sequence context-specific profiles for homology searching. Proc Natl Acad Sci 106:3770. https://doi.org/10.1073/pnas.0810767106
Article
PubMed
Google Scholar
Kaushik S, Nair AG, Mutt E et al (2016) Rapid and enhanced remote homology detection by cascading hidden Markov model searches in sequence space. Bioinformatics 32:338–344. https://doi.org/10.1093/bioinformatics/btv538
CAS
Article
PubMed
Google Scholar
Kaznadzey A, Alexandrova N, Novichkov V, Kaznadzey D (2013) PSimScan: algorithm and utility for fast protein similarity search. PLoS ONE 8:e58505. https://doi.org/10.1371/journal.pone.0058505
CAS
Article
PubMed
PubMed Central
Google Scholar
Ge H, Sun L, Yu J (2017) Fast batch searching for protein homology based on compression and clustering. BMC Bioinform 18:508. https://doi.org/10.1186/s12859-017-1938-8
CAS
Article
Google Scholar
Nguyen VH, Lavenier D (2009) PLAST: parallel local alignment search tool for database comparison. BMC Bioinform 10:329. https://doi.org/10.1186/1471-2105-10-329
CAS
Article
Google Scholar
Qi Z-H, Jin M-Z, Li S-L, Feng J (2015) A protein mapping method based on physicochemical properties and dimension reduction. Comput Biol Med 57:1–7. https://doi.org/10.1016/j.compbiomed.2014.11.012
CAS
Article
PubMed
Google Scholar
Gupta MK, Niyogi R, Misra M (2013) An alignment-free method to find similarity among protein sequences via the general form of Chou’s pseudo amino acid composition. SAR QSAR Environ Res 24:597–609. https://doi.org/10.1080/1062936X.2013.773378
CAS
Article
PubMed
Google Scholar
Rost B (1999) Twilight zone of protein sequence alignments. Protein Eng 12:85–94. https://doi.org/10.1093/protein/12.2.85
CAS
Article
PubMed
Google Scholar
Kumar R, Mishra BK, Lahiri T et al (2017) PCV: an alignment free method for finding homologous nucleotide sequences and its application in phylogenetic study. Interdiscip Sci Comput Life Sci 9:173–183. https://doi.org/10.1007/s12539-015-0136-5
CAS
Article
Google Scholar
Vella F (1998) The cell. A molecular approach; Edited by G H Cooper. pp 673. ASM Press, Washington DC, Sinauer Associates, Sunderland, MA. 1997 ISBN 0-87893-119-8. Biochem Educ 26:98–99
Article
Google Scholar
Sneath PH (1966) Relations between chemical structure and biological activity in peptides. J Theor Biol 12:157–195
CAS
Article
Google Scholar
Kyte J, Doolittle RF (1982) A simple method for displaying the hydropathic character of a protein. J Mol Biol 157:105–132
CAS
Article
Google Scholar
Grantham R (1974) Amino acid difference formula to help explain protein evolution. Science 185:862–864
CAS
Article
Google Scholar
Rice P, Longden I, Bleasby A (2000) EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet TIG 16:276–277
CAS
Article
Google Scholar
Zielezinski A, Girgis HZ, Bernard G et al (2019) Benchmarking of alignment-free sequence comparison methods. Genome Biol 20:144. https://doi.org/10.1186/s13059-019-1755-7
Article
PubMed
PubMed Central
Google Scholar
Abhilash CB, Rohitaksha K (2014) A comparative study on global and local alignment algorithm methods. Int J Emerg Technol Adv Eng 4:34–43
Google Scholar
Kolekar P, Kale M, Kulkarni-Kale U (2012) Alignment-free distance measure based on return time distribution for sequence analysis: applications to clustering, molecular phylogeny and subtyping. Mol Phylogenet Evol 65:510–522. https://doi.org/10.1016/j.ympev.2012.07.003
Article
PubMed
Google Scholar
Dolatshah M, Hadian A, Minaei-Bidgoli B (2015) Ball*-tree: Efficient spatial indexing for constrained nearest-neighbor search in metric spaces. ArXiv:151100628 Cs
Rodgers JL, Nicewander WA (1988) Thirteen ways to look at the correlation coefficient. Am Stat 42:59–66. https://doi.org/10.1080/00031305.1988.10475524
Article
Google Scholar
Asamoah MK (2014) Re-examination of the limitations associated with correlational research. Educ Res Rev 2:45–52
Google Scholar