Skip to main content
Log in

Identification of repetitive units in protein structures with ReUPred

  • Original Article
  • Published:
Amino Acids Aims and scope Submit manuscript

Abstract

Over the last decade, numerous studies have demonstrated the fundamental importance of tandem repeat (TR) proteins in many biological processes. A plethora of new repeat structures have also been solved. The recently published RepeatsDB provides information on TR proteins. However, a detailed structural characterization of repetitive elements is largely missing, as repeat unit annotation is manually curated and currently covers only 3 % of the bona fide TR proteins. Repeat Protein Unit Predictor (ReUPred) is a novel method for the fast automatic prediction of repeat units and repeat classification using an extensive Structure Repeat Unit Library (SRUL) derived from RepeatsDB. ReUPred uses an iterative structural search against the SRUL to find repetitive units. On a test set of solenoid proteins, ReUPred is able to correctly detect 92 % of the proteins. Unlike previous methods, it is also able to correctly classify solenoid repeats in 89 % of cases. It also outperforms two recent state-of-the-art methods for the repeat unit identification problem. The accurate prediction of repeat units increases the number of annotated repeat units by an order of magnitude compared to the sequence-based Pfam classification. ReUPred is implemented in Python for Linux and freely available from the URL: http://protein.bio.unipd.it/reupred/.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

Download references

Acknowledgments

The authors are grateful to members of the BioComputing UP lab for insightful discussions. D.P. is funded by the FIRC project no. 16621.This project was partially supported by AIRC grant IG17753 and Elixir-Ita.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Silvio C. E. Tosatto.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Research involving human participants and/or animals

No.

Additional information

Layla Hirsh and Damiano Piovesan Contributed equally.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hirsh, L., Piovesan, D., Paladin, L. et al. Identification of repetitive units in protein structures with ReUPred. Amino Acids 48, 1391–1400 (2016). https://doi.org/10.1007/s00726-016-2187-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00726-016-2187-2

Keywords

Navigation