Skip to main content
Log in

OMArk, a tool for gene annotation quality control, reveals erroneous gene inference

  • Research Briefing
  • Published:

From Nature Biotechnology

View current issue Submit your manuscript

We developed the OMArk software package for evaluating protein-coding gene annotation quality. In addition to assessing the completeness of a proteome, OMArk estimates the overall quality of the gene set’s content, a feature that will help to improve public protein sequence data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1: OMArk concept and output graph.

References

  1. Blaxter, M. et al. Why sequence all eukaryotes? Proc. Natl Acad. Sci. USA 119, e2115636118 (2022). An article that presents the reasoning behind the ongoing massive sequencing of all eukaryotic genomes.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Salzberg, S. L. Next-generation genome annotation: we still struggle to get it right. Genome Biol. 20, 92 (2019). A piece summarizing the challenges of gene annotation.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Altenhoff, A. M. et al. OMA orthology in 2024: improved prokaryote coverage, ancestral and extant GO enrichment, a revamped synteny viewer and more in the OMA ecosystem. Nucleic Acids Res. https://doi.org/10.1093/nar/gkad1020 (2023). The latest paper presenting the OMA orthology database, which OMArk uses as reference.

    Article  PubMed Central  Google Scholar 

  4. Rossier, V., Vesztrocy, A. W., Robinson-Rechavi, M. & Dessimoz, C. OMAmer: tree-driven and alignment-free protein assignment to subfamilies outperforms closest sequence approaches. Bioinformatics https://doi.org/10.1093/bioinformatics/btab219 (2021). This paper describes OMAmer, the first step in the workflow of OMArk.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is a summary of: Nevers, Y. et al. Quality assessment of gene repertoire annotations with OMArk. Nat. Biotechnol. https://doi.org/10.1038/s41587-024-02147-w (2024).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

OMArk, a tool for gene annotation quality control, reveals erroneous gene inference. Nat Biotechnol (2024). https://doi.org/10.1038/s41587-024-02155-w

Download citation

  • Published:

  • DOI: https://doi.org/10.1038/s41587-024-02155-w

  • Springer Nature America, Inc.

Navigation