Bowie J U, Luthy R, Eisenberg D. A method to identify protein sequences that fold into a known three-dimensional structure. Science, 1991, 253: 164–170
PubMed
Article
CAS
Google Scholar
Jones D T, Taylor W R, Thornton J M. A new approach to protein fold recognition. Nature, 1992, 358: 86–89
PubMed
Article
CAS
Google Scholar
Regan L, Degrado W F. Characterization of a helical protein designed from first principles. Science, 1988, 241: 976–978
PubMed
Article
CAS
Google Scholar
Kamtekar S. Protein design by binary patterning of polar and nopolar amino acids. Science, 1993, 262: 1680–1685
PubMed
Article
CAS
Google Scholar
Plaxco K W. Simplified proteins: Minimalist solutions to the “protein folding problem”. Curr Opin Struct Biol, 1998, 8: 80–85
PubMed
Article
CAS
Google Scholar
Wang J, Wang W. A computational approach to simplifying the protein folding alphabet. Nature Struct Biol, 1999, 6: 1033–1038
PubMed
Article
CAS
Google Scholar
Henikoff S, Henikoff J G. Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci USA, 1992, 89: 10915–10919
PubMed
Article
CAS
Google Scholar
Ogata K, Ohya M, Umeyama H. Amino acid similarity matrix for homology derived from structural alignment and optimized by the Monte Carlo method. J Mol Graph Model, 1998, 16: 178–189
PubMed
CAS
Google Scholar
Zhou H, Zhou Y. Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments. Proteins, 2005, 58: 321–328
PubMed
Article
CAS
Google Scholar
Friedberg I, Kaplan T, Margalit H. Evaluation of PSI-BLAST alignment accuracy in comparison to structural alignments. Protein Sci, 2000, 9: 2278–2284
PubMed
CAS
Article
Google Scholar
Mallick P, Weiss R, Eisenberg D. The directional atomic solvation energy: An atombased potential for the assignment of protein sequences to known folds. Proc Natl Acad Sci USA, 2002, 99: 16041–16046
PubMed
Article
CAS
Google Scholar
Kleiger G. PFIT and PFRIT: Bioinformatic algorithms for detecting glycosidase function from structure and sequence. Protein Sci, 2004, 13: 221–229
PubMed
Article
CAS
Google Scholar
Karlin S, Altschul S F. Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc Natl Acad Sci USA, 1990, 87: 2264–2268
PubMed
Article
CAS
Google Scholar
Altschul S F. Amino acid substitution matrices from an information theoretic perspective. J Mol Biol, 1991, 219: 555–565
PubMed
Article
CAS
Google Scholar
Karlin S, Altschul S F. Applications and statistics for multiple high-scoring segments in molecular sequences. Proc Natl Acad Sci USA, 1993, 90: 5873–5877
PubMed
Article
CAS
Google Scholar
Higgins D G, Sharp P M. CLUSTAL: a package for performing multiple sequence alignment on a microcomputer. Gene, 1988, 73: 237–244
PubMed
Article
CAS
Google Scholar
Holm L, Sander C. Mapping the protein universe. Science, 1996, 273: 595–602
PubMed
Article
CAS
Google Scholar
Holm L, Sander C. Dictionary of recurrent domains in protein structures. Proteins, 1998, 33: 88–96
PubMed
Article
CAS
Google Scholar
Blake J D, Cohen F E. Pairwise sequence alignment below the twilight zone. J Mol Biol, 2001, 307: 721–735
PubMed
Article
CAS
Google Scholar
Dosztanyi Z, Torda A E. Amino acid identity matrices based on force fields. Bioinformatics, 2001, 17: 686–699
PubMed
Article
CAS
Google Scholar
Johnson M S, Overington J P. A structural basis for sequence comparisons an evaluation of scoring methodologies. J Mol Biol, 1993, 233: 716–738
PubMed
Article
CAS
Google Scholar
Li T. Reduction of protein sequence complexity by residue grouping Protein Eng, 2003, 16: 323–330
CAS
Google Scholar
Fan K, Wang W. What is the minimum number of letters required to fold a protein. J Mol Biol, 2003, 328: 921–926
PubMed
Article
CAS
Google Scholar
Koradi R, Billeter M, Whrich K. MOLMOL: A program for display and analysis of macromolecular structures. J Mol Graphics, 1996, 14: 51–55
Article
CAS
Google Scholar
Henikoff S. Automated construction and graphical presentation of protein blocks from unaligned sequences. Gene, 1995, 163: GC17–GC26
PubMed
Article
CAS
Google Scholar
Pietrokovski S, Henikoff J G, Henikoff S. The blocks database-A system for protein classification. Nucleic Acids Res, 1996, 24: 197–200
PubMed
Article
CAS
Google Scholar
Clarke N D. Sequence “minimization”: Exploring the sequence landscape with simplified sequences. Curr Opin Biotech, 1995, 6: 467–472
PubMed
Article
CAS
Google Scholar
Riddle D S. Functional rapidly folding proteins from simplified amino acid sequences. Nature Struct Biol, 1997, 4: 805–809
PubMed
Article
CAS
Google Scholar
Akanuma S, Kigawa T, Yokoyama S. Combinatorial mutagenesis to restricted amino acid usage in an enzyme to a reduced set. Proc Natl Acad Sci USA, 2002, 99: 13549–13553
PubMed
Article
CAS
Google Scholar
Felsenstein J. Confidence limits on phylogenies: An approach using the bootstrap. Evolution, 1985, 39: 783–791
Article
Google Scholar
Liu X. Simplified amino acid alphabets based on deviation of conditional probability from random background. Phys Rev E, 2002, 66: 021906-1–021906-4
Google Scholar