SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs)

Abrahams, Brett S; Arking, Dan E; Campbell, Daniel B; Mefford, Heather C; Morrow, Eric M; Weiss, Lauren A; Menashe, Idan; Wadkins, Tim; Banerjee-Basu, Sharmila; Packer, Alan

doi:10.1186/2040-2392-4-36

SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs)

Letter to the Editor
Open access
Published: 03 October 2013

Volume 4, article number 36, (2013)
Cite this article

Download PDF

You have full access to this open access article

Molecular Autism Aims and scope Submit manuscript

SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs)

Download PDF

Brett S Abrahams¹,
Dan E Arking²,
Daniel B Campbell³,
Heather C Mefford⁴,
Eric M Morrow⁵,
Lauren A Weiss⁶,
Idan Menashe^7,8,
Tim Wadkins⁷,
Sharmila Banerjee-Basu⁷ &
…
Alan Packer⁹

10k Accesses
460 Citations
19 Altmetric
4 Mentions
Explore all metrics

Abstract

New technologies enabling genome-wide interrogation have led to a large and rapidly growing number of autism spectrum disorder (ASD) candidate genes. Although encouraging, the volume and complexity of these data make it challenging for scientists, particularly non-geneticists, to comprehensively evaluate available evidence for individual genes. Described here is the Gene Scoring module within SFARI Gene 2.0 (https://gene.sfari.org/autdb/GS_Home.do), a platform developed to enable systematic community driven assessment of genetic evidence for individual genes with regard to ASD.

Letter to the Editor

Previous work has crafted curated lists of genes implicated in autism spectrum disorders (ASDs)[1–5], and in some cases developed loosely defined “evidence scores” for entries[6–8]. Such efforts are incomplete, non-systematic and static. Further, this process requires the integration of diverse lines of support that are unequal. What to conclude for a replicated association with P <10^-5? How does this compare to a common variant association not independently replicated, but with rare variants in the same gene? What to make of modest association accompanied by case-control gene expression differences? Given that downstream work involves considerable resources, a portal and framework enabling systematic assessments of all available evidence holds great value.

The Gene Scoring module within SFARI Gene 2.0 was built in an effort to assess the strength of evidence associated with candidate genes and address these concerns. Using a human genetics perspective, we developed a set of criteria to quantify evidence for involvement in the ASDs. These criteria, along with worked examples, are available on the SFARI website (https://gene.sfari.org/autdb/GS_Classification.do). We set out with three guiding principles. First, that relationship to ASD should be based on evaluation of genetic variation in human cohorts. Second, that we start with no assumptions about individual genes. And, finally, that a system to permit active involvement from the scientific community should be at the core of this endeavor. For genes already included in the portal, users with a SFARI login are able to add their own scores alongside curated calls (https://gene.sfari.org/autdb/search); these are viewable by the entire community in the form of “counts”. Registered individuals are also able to propose scores for genes not yet included in the database (https://gene.sfari.org/autdb/user/AddAGene.do), and also suggest modifications to the scoring criteria. Oversight by staff curators and ad hoc review by the SFARI Gene Advisory Board together with score histories and a versioning system for scoring criteria will ensure consistency and stability. Although the scoring system we have developed is itself not entirely immune to bias, evidence is examined and applied in a systemic fashion and is sensitive to community feedback.

Using these newly developed criteria, we scored an initial set of 196 genes. Scores and associated annotation were deposited into a newly developed gene-centric web interface. Beyond the gene score itself, a summary of the underlying rationale, links to PubMed and other external databases, functional annotation and a compendium of all identified variants are included. Video tutorials, outlining use of the community annotation interface, have also been developed to facilitate broad uptake (http://www.youtube.com/watch?v=x6PcOXVK0bY). Importantly, all of the underlying data are fully downloadable.

Analysis of this initial set of scored genes was revealing (Table1). A total of 58% of the scored genes, many having been highlighted elsewhere as top candidates, were assigned to the “Minimal Evidence” category. Although scores are not static and will change in the face of new data, these data suggest that there exists only modest support for the majority of autism-candidate genes proposed to date. We then looked at the relationship between category placement and attention from the field, as defined by the number of manually curated publications containing "(autism OR autistic)" in the title or abstract. Enormous variability was observed both within and between categories, along with marked skewing towards specific genes within a given category (Figure1). Within syndromic genes discovered more than four years ago (n = 17), for example, two genes accounted for almost 50% of the ASD-associated publications. At the other extreme, the eight least attended to from this group collectively accounted for only 8.4% of the publications. This “winner takes most” effect, where two genes attracted almost 50% of the community’s attention, is in direct conflict with a comprehensive understanding of autism genetics. Finally, almost half of the genes with no or relatively modest support (categories 4/5/6) have a greater number of ASD-associated publications than those with more evidence for involvement in disease (categories S/2/3). These data highlight a relatively poor relationship between genetic evidence and attention from the scientific community. It is our hope that SFARI Gene 2.0, together with larger cohorts and greater statistical rigor, will help to highlight those genes with the strongest underlying support.

Table 1 More than half of all scored genes were placed within the “Minimal Evidence Category”

Full size table

Although similar in concept to other efforts, SFARI Gene 2.0 differs in terms of philosophy, methodology and output. To minimize bias, we did not begin with any discussion or review of the literature, but rather the establishment of formalized criteria to quantify available support. Application of these criteria resulted in the identification of genes which despite substantial support had received little or no attention from the field. Also new and unique to SFARI Gene 2.0 is a mechanism that enables researchers to provide reasoned arguments for the introduction of new genes, offer alternate scores that sit alongside existing ones, and provide suggestions for the modification of the scoring criteria. Extensive safeguards - including staff and advisory board oversight - are in place to ensure the quality and utility of the resource.

In summary, key strengths of SFARI Gene 2.0 include explicitly defined scoring criteria, continuous updates and infrastructure to permit community based involvement. We see enormous potential for the resource, a blend of OMIM and Wikipedia, and suggest that it may be useful in helping to ensure the relevance of future computational and functional studies to disease.

Abbreviations

ASD:: Autism spectrum disorder.

References

Basu SN, Kollu R, Banerjee-Basu S: AutDB: a gene reference resource for autism research. Nucleic Acids Res. 2009, 37: 832-836. 10.1093/nar/gkn941.
Article Google Scholar
Matuszek G, Talebizadeh Z: Autism Genetic Database (AGD): a comprehensive database including autism susceptibility gene-CNVs integrated with known noncoding RNAs and fragile sites. BMC Med Genet. 2009, 10: 102-10.1186/1471-2350-10-102. Epub 2009/09/26
Article PubMed Central PubMed Google Scholar
Pinto D, Pagnamenta AT, Klei L, Anney R, Merico D, Regan R, Conroy J, Magalhaes TR, Correia C, Abrahams BS, Almeida J, Bacchelli E, Bader GD, Bailey AJ, Baird G, Battaglia A, Berney T, Bolshakova N, Bölte S, Bolton PF, Bourgeron T, Brennan S, Brian J, Bryson SE, Carson AR, Casallo G, Casey J, Chung BH, Cochrane L, Corsello C: Functional impact of global rare copy number variation in autism spectrum disorders. Nature. 2010, 466: 368-372. 10.1038/nature09146.
Article PubMed Central CAS PubMed Google Scholar
Betancur C: Etiological heterogeneity in autism spectrum disorders: more than 100 genetic and genomic disorders and still counting. Brain Res. 2011, 1380: 42-77.
Article CAS PubMed Google Scholar
Weiss LA, Arking DE, Daly MJ, Chakravarti A, Gene Discovery Project of Johns Hopkins & the Autism Consortium: A genome-wide linkage and association scan reveals novel loci for autism. Chakravarti A. Nature. 2009, 461 (7265): 802-808.
Article CAS PubMed Google Scholar
Abrahams BS, Geschwind DH: Advances in autism genetics: on the threshold of a new neurobiology. Nat Rev Genet. 2008, 9 (5): 341-355. 10.1038/nrg2346.
Article PubMed Central CAS PubMed Google Scholar
Abrahams BS, Geschwind DH: Genetics of autism. Vogel and Motulsky's Human Genetics: Problems & Approaches. Edited by: Speicher MR, Antonarakis SE, Motulsky AG. 2009, Berlin: Springer-Verlag, 699-714. 4
Google Scholar
Xu LM, Li JR, Huang Y, Zhao M, Tang X, Wei L: AutismKB: an evidence-based knowledgebase of autism genetics. Nucleic Acids Res. 2012, 40: D1016-D1022. 10.1093/nar/gkr1145.
Article PubMed Central CAS PubMed Google Scholar

Download references

Acknowledgements

SFARI Gene 2.0 is funded by the Simons Foundation. The portal developed for this work was created by Ravi Kollu at MindSpec. We thank members of the Simons Foundation Autism Research Initiative staff for helpful feedback and support, including: G. Fischbach, D. Choi, M. Carlson, J. Spiro, M. Benedetti, A. Mandavilli, A. Tauro-Greenberg and C. Fleisch.

Author information

Authors and Affiliations

Departments of Genetics and Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA
Brett S Abrahams
McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Dan E Arking
Zilkha Neurogenetic Institute and Department of Psychiatry and the Behavioral Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
Daniel B Campbell
Department of Pediatrics, Division of Genetic Medicine, University of Washington, Seattle, WA, USA
Heather C Mefford
Department of Molecular Biology, Cell Biology and Biochemistry; and Department of Psychiatry and Human Behavior, Brown University, Providence, RI, USA
Eric M Morrow
IMHRO/Staglin Assistant Professor, Department of Psychiatry, Institute for Human Genetics, Center for Neurobiology and Psychiatry, UCSF, San Francisco, CA, USA
Lauren A Weiss
MindSpec Inc., 8280 Greensboro Drive, Suite 150, McLean, VA, USA
Idan Menashe, Tim Wadkins & Sharmila Banerjee-Basu
Department of Public Health, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
Idan Menashe
Simons Foundation Autism Research Initiative, New York, NY, USA
Alan Packer

Authors

Brett S Abrahams
View author publications
You can also search for this author in PubMed Google Scholar
Dan E Arking
View author publications
You can also search for this author in PubMed Google Scholar
Daniel B Campbell
View author publications
You can also search for this author in PubMed Google Scholar
Heather C Mefford
View author publications
You can also search for this author in PubMed Google Scholar
Eric M Morrow
View author publications
You can also search for this author in PubMed Google Scholar
Lauren A Weiss
View author publications
You can also search for this author in PubMed Google Scholar
Idan Menashe
View author publications
You can also search for this author in PubMed Google Scholar
Tim Wadkins
View author publications
You can also search for this author in PubMed Google Scholar
Sharmila Banerjee-Basu
View author publications
You can also search for this author in PubMed Google Scholar
Alan Packer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Sharmila Banerjee-Basu or Alan Packer.

Additional information

Competing interests

SFARI Gene is licensed by the Simons Foundation from MindSpec. SFARI Gene 2.0 Advisory Board Members have received a yearly honorarium from the Simons Foundation. BSA has also received consulting fees from Integragen, LLC. BSA and DEA also hold patents managed by UCLA and Johns Hopkins University, respectively.

Authors’ contributions

SBB and AP conceived of and initiated the work. Gene scoring criteria were developed by BSA, DEA, DBC, HCM, EMM, LAW, SBB, and AP. The list of ASD candidate genes complied by IM, TW, and SBB were scoring by BSA, DEA, DBC, HCM, EMM, and LAW, SBB, and AP. Following analyses carried out by all authors, BSA drafted the manuscript, which all authors then edited and revised.

Dan E Arking, Daniel B Campbell, Heather C Mefford, Eric M Morrow, Lauren A Weiss contributed equally to this work.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Abrahams, B.S., Arking, D.E., Campbell, D.B. et al. SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs). Molecular Autism 4, 36 (2013). https://doi.org/10.1186/2040-2392-4-36

Download citation

Received: 10 June 2013
Accepted: 23 August 2013
Published: 03 October 2013
DOI: https://doi.org/10.1186/2040-2392-4-36

SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs)

Abstract

Letter to the Editor

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Additional information

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Authors’ original file for figure 1

Rights and permissions

About this article

Cite this article

Keywords

Navigation

SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs)

Abstract

Letter to the Editor

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Additional information

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Authors’ original file for figure 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation