Data Archive for the BRAIN Initiative (DABI)

Duncan, Dominique; Garner, Rachael; Brinkerhoff, Sarah; Walker, Harrison C.; Pouratian, Nader; Toga, Arthur W.

doi:10.1038/s41597-023-01972-z

Data Archive for the BRAIN Initiative (DABI)

Article
Open access
Published: 09 February 2023

Volume 10, article number 83, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Data

Data Archive for the BRAIN Initiative (DABI)

Download PDF

Dominique Duncan ORCID: orcid.org/0000-0002-6154-9262¹,
Rachael Garner¹,
Sarah Brinkerhoff²,
Harrison C. Walker²,
Nader Pouratian³ &
…
Arthur W. Toga¹

2617 Accesses
5 Citations
21 Altmetric
2 Mentions
Explore all metrics

Abstract

Data sharing is becoming ubiquitous and can be advantageous for most biomedical research. However, some data are inherently more amenable to sharing than others. For example, human intracranial neurophysiology recordings and associated multimodal data have unique features that warrant special considerations. The associated data are heterogeneous, difficult to compare, highly specific, and collected from small cohorts with treatment resistant conditions, posing additional complications when attempting to perform generalizable analyses across projects. We present the Data Archive for the BRAIN Initiative (DABI) and describe features of the platform that are designed to overcome these and other challenges. DABI is a data repository and portal for BRAIN Initiative projects that collect human and animal intracranial recordings, and it allows users to search, visualize, and analyze multimodal data from these projects. The data providers maintain full control of data sharing privileges and can organize and manage their data with a user-friendly and intuitive interface. We discuss data privacy and security concerns, example analyses from two DABI datasets, and future goals for DABI.

A comparison of neuroelectrophysiology databases

Article Open access 19 October 2023

The Human Connectome Project's neuroimaging approach

Article 26 August 2016

Standardization of electroencephalography for multi-site, multi-platform and multi-investigator studies: insights from the canadian biomarker integration network in depression

Article Open access 07 August 2017

Introduction

Human intracranial recordings are used to study a variety of neurological disorders, such as epilepsy, stroke, neuropsychiatric disorders, and Parkinson’s disease and other movement disorders. The Data Archive for the BRAIN Initiative (DABI) provides a platform of networked and centralized web-accessible data archives to capture, store, curate, and share data related to the Brain Research Through Advancing Innovative Neurotechnologies® (BRAIN) Initiative¹ proposals that collect human intracranial neurophysiological data for the broader scientific community (https://dabi.loni.usc.edu). DABI was created at the Laboratory of Neuro Imaging (LONI) at the University of Southern California by Drs. Arthur Toga, Dominique Duncan, and Nader Pouratian and is funded through the National Institutes of Health as part of the BRAIN Initiative.

Data sharing can be a valuable force to accelerate scientific discovery. Intracranial neurophysiology studies are often exclusive in their design and methodology, which presents unique challenges for data sharing. Given the invasive nature of these studies and the involvement of potentially vulnerable neurosurgical patients, costs and recruitment for such studies are uniquely challenging, as well. Small patient cohorts reduce the statistical power necessary to validate the safety and efficacy of invasive devices and to identify candidate patients, target brain regions, and recording/stimulation parameters. Given the relative rarity of these recordings compared to recordings in model systems, there is an even greater imperative and need to share across research institutions. For these reasons, interest surrounding data sharing to expand patient cohorts has grown in recent years.

There are clear advantages to facilitating the sharing of these valuable data collections, as evidenced by the significant successes of multi-site data sharing related to Alzheimer’s disease^2,3 and Parkinson’s disease^4,5, among other neuroscience efforts. Expanding available cohorts for multi-site analysis also expands the likelihood of identifying generalizable findings for exploratory investigations, as examining cognitive function within only one pathology or a narrow clinical phenotype may yield clinical confounds. DABI fills this scientific need by providing a centralized database for human intracranial neurophysiology recordings and related data – clinical, imaging, pathology, demographics, behavioral, and scalp electrophysiology (Fig. 1) as well as a wide variety of associated data formats. DABI ingests, harmonizes, aggregates, stores, visualizes, and disseminates a wide variety of data types. And importantly, provides for granular search as part of its interface.

While standardizing the format for organizing human intracranial neurophysiological data has gained some traction with the Brain Imaging Data Structure (iEEG-BIDS)⁶, previous efforts to create centralized data archives for human intracranial neurophysiology data have not been widely adopted due to many challenges, such as large file sizes, persistent varying formats, privacy constraints, and funding.

The European Union-funded EPILEPSIAE database was made publicly available in 2012 and provides long-term scalp and intracranial electroencephalography (iEEG) recordings with annotation and metadata of 275 patients⁷. Access is restricted to scientific groups that financially contribute to the maintenance of the database. The National Institute of Neurological Disorders and Stroke-funded cloud-based platform, IEEG.org, contains over 1200 human and animal (dog, mouse, rat, sheep, and primate) datasets that includes neuroimaging, EEG, electrocardiogram, and clinical data⁸. This platform uses Amazon cloud services for data storage. However, one of the major challenges is the need to secure enough resources, long term, to sustain the effort beyond the initial funding period so that the platform can truly bring value to the broader scientific research community⁹. Moreover, coupling such data archives with analytic tools can help researchers with their analyses and increase reproducibility. Existing data archives such as IEEG.org support code dissemination by linking users to public GIT repositories or hosting downloadable files within a stored dataset. However, this requires users to have code and data on local machines and have access to sufficient computational resources to perform the required analyses.

In developing DABI, we have taken these concerns into consideration and applied our experience in other types of data sharing platforms^10,11. We have built DABI, using our existing tools and unique resources, with the goal of maintaining the archive after funding ends without charging user fees for data access. LONI has dedicated itself to maintaining data archives and hosting software even after funding windows close: all former LONI projects from the last 30 + years have been maintained continuously. With this mindset, we hope to ensure the perpetuity of the archive so that it remains beneficial to the scientific community.

Methods: DABI Architecture

DABI is designed to address the full lifecycle of scientific inquiry for BRAIN partners, from secure data ingestion and storage through visualization, analytics, and dissemination. Specific functionalities, seen as an overview in Fig. 2, include (1) data de-identification, protocol detection, and data deposition, (2) data quality assessment, processing of quality assessment results to tag data, and analysis results data integration, (3) mapping of data attributes into a common schema for use in search and visualization interfaces, (4) interfaces to search, select, and download data, (5) integrated processing so that database and compute resources are coupled, (6) a visualization interface for inspecting and comparing data, and (7) a comprehensive website containing training materials, study-related information (e.g., protocols), announcements, and a knowledgebase wherein investigators post questions and receive answers from DABI and the community.

Data storage

DABI accommodates 2 different models for data archiving. Contributing investigators who prefer to deliver their data to a centralized database can transfer their data for secure storage at LONI (see section 2.2). Under centralized systems, researchers collect cohort data and store the data in a single remote system that allows for convenient storage and retrieval as well as user access controls for sharing data with other specified scientists. This allows researchers to utilize computational tools that they may not have sufficient resources to develop otherwise. The centralized mechanism can be chosen by using Aspera or the Web Uploader to upload data that is stored on DABI servers.

The second model accommodates those investigators who prefer to use cloud-based storage for their data. The cloud storage mechanism can be chosen on the Provider Controls page by linking a cloud storage account. The cloud storage option is achieved by implementing cloud specific linking protocols so that users can connect supported cloud storage service accounts with DABI. Once the cloud account is linked, cloud downloads are achieved by mediating data transfer between the storage provider and the user’s browser. The fees associated with cloud storage are arranged between the data owners and the cloud storage provider. If an institution chooses to end their cloud service or wish to convert to centralized storage for another reason, they can take a cloud snapshot, where a copy of the dataset is stored centrally on the DABI servers.

These two options are available for data providers to choose, and we are agnostic to where each project’s data are stored. Regardless of storage mechanism, data are accessed on the Request Data page and on the DOI (main page) for each project.

Data upload

Data providers who choose to use the centralized data storage model have 3 options to upload data to DABI. The Web Uploader allows users to upload comma-separated value (CSV) metadata, a blank version with standardized metadata fields is provided on the Web Uploader page for reference, and datasets that are stored according to iEEG-BIDS⁶, a community-driven specification funded by the BRAIN Initiative which has a tree organization and specific rules for naming. This method does not require any additional software to be installed on the providers’ local machines. The uploader validates the BIDS-compliance and anonymizes data to be sure there is no patient history information. Both zipped and unzipped files are accepted. When providers are ready to add new files to their existing DABI data they can simply upload the root folder and only new files will upload without uploading existing data again. DABI encourages use of iEEG-BIDS to ensure comprehensive documentation and streamlined ingestion to standard data processing pipelines. DABI also supports file formats stored in the Neurodata Without Borders (NWB) data format, a BRAIN Initiative funded project to standardize neurophysiology data and related metadata¹². These data formats and the analytics package linkages that we use are shown in Fig. 3.

Datasets that are not stored in iEEG-BIDS can also be uploaded using ASPERA, the latest Health Insurance Portability and Accountability Act (HIPAA)-compliant software utility from IBM. ASPERA is an extremely fast—10–100x faster than traditional file transfer protocol (FTP)—and lightweight file transfer client and is not subject to limitations of web browsers. This upload method requires no cost installation of the client ASPERA on a local machine, followed by a request for connection and host credentials from DABI, which include file paths and storage locations on the DABI servers. Once installed, providers simply log in and select files to transfer. No file structure or naming requirements are involved and data deletion from DABI can occur at any time. Data providers may also use their own secure shell (SSH) or secure FTP (SFTP) File Transfer Client. DABI will provide the necessary credentials for data providers to access their project’s home directories on the DABI server. This method does not have file structure or naming requirements, and providers the most flexibility for transferring data.

Security, data de-identification, access control, and data backup

Access control and encryption are provided to support a wide variety of data sharing policies. Each data provider that contributes data to DABI has consented their patients through permission of their Institutional Review Board, and data owners have the responsibility to remove protected health information (PHI) prior to data upload, though we have some built-in precautions to ensure that HIPAA regulations are followed using our integrated data de-identification components. Uploaded data are immediately available for viewing and download by approved users (if private) or the public, so it is critical that data are de-identified prior to upload. Specific guidelines to reduce the chance of PHI transmission include (1) video and audio uploads are not accepted formats at this time, (2) de-facing procedures are encouraged to reduce patient recognizability and combat automated reverse facial detection algorithms, and (3) use of external de-identification software to remove PHI from standard imaging formats such as NIFTI and DICOM. There is also a built-in tool through the Web Uploader that does allow users to de-identify MRI during upload as an optional step. If this is selected, users can review the changed/removed fields during upload. Imaging data are de-identified to exclude header fields that may contain PHI, including patient name, study date, referring physician, and institution name, among other fields. Any dates are also revised to include only month and year, excluding specific days of birth, intervention, surgery, etc. There is an efficient mechanism for tracking the status of all datasets and providing an audit trail so that investigators know who processed the data when and how. In addition, this audit trail helps to guarantee that all researchers can easily be acknowledged for their contributions. Detailed sharing capabilities are defined by each site with designation of which components are shared at which level. Sharing levels will include (1) site specific, (2) project specific, and (3) public.

We ensure an encrypted transfer to DABI servers using https and Aspera. We provide immediate and uninterrupted access to data as dictated by data use agreements from participating investigators. Furthermore, we have developed functionality to test the speed and reliability of investigators’ network connections and provide recommended download methods based on the test results.

Data harmonization

To make DABI data sharing most useful across different projects and sites, common data elements, data dictionaries consistent with other BRAIN Initiative efforts, and aliases have been adopted wherever possible. The Neuroimaging Data Model (NIDM)^13,14 aims to improve metadata precision, especially in the context of experimental design, data acquisition, and analytic workflows. NIDM has worked towards a standard ontology, hosted by SciCrunch with support by NeuroLex, with over 400,000 terms hosted^{15,16,17,18,19}. In association, OpenNeuro, a repository for hosting and sharing BIDS imaging datasets, has adopted language to define high-level concepts to annotate datasets²⁰.

Currently, we do also accept non-BIDS datasets to support institutions at which standard specification adoption is still ongoing. If data are not BIDS-compliant, we do not convert or modify the original datasets to adhere to BIDS, as conversion from potentially proprietary or custom data formats requires a highly detailed knowledge of the study and data acquisition. However, we do ask that users provide one additional file (that is not part of the uploaded dataset) that includes harmonized metadata fields to facilitate searchability of data on the Explore Page. These variables were proposed by data partners, are at the subject or project level, and allow users to search for datasets that meet specific criteria of interest. A template CSV with potential variables is available for download on the Web Uploader page. Variables include subject ID, gender, age, diagnosis, handedness, interventions, region of interest/electrode location, device(s) model, etc. These CSVs are then internally converted to JavaScript Object Notation (JSON), a highly adaptable data-interchange format that stores text as attribute-value pairs and arrays²¹, which is also the candidate file type due to its utilization by iEEG-BIDS⁶. We have built this together with the data providers to accommodate laboratory needs, and we continue to work closely with them to adapt these features.

Data access management

Investigators establish their own data use requirements and policies, which may vary for users at different laboratories and projects. We have created a flexible system with various levels of granularity so that access control rules and consequent authentication systems match the needs of the different projects. Data providers often choose to share data after a publication, so we have given data providers the ability to do this and to choose a date for making those data public if they know the date in advance.

DABI has also integrated automated digital object identifier (DOI) generation. Anytime a project or subproject (a subset of files from a project) is created, DABI automatically registers a unique link with DOI.org. This link grants view access to the dataset homepage, which includes a customizable dataset description and visualization of the dataset’s file tree. However, users must be logged in and have explicit data access permission (granted by the PI) to allow data download. Data providers may also be given a publicly available link to their dataset that does not require an account for access; this link includes an embedded token that allows the accessor to download data without logging in. This can be useful to include for anonymized peer review or publications.

Most importantly, the data providers always remain in full control of data access.

Querying data and cohort creation through the explore page

Although all data remain private until explicitly made public by a PI or delegate, all metadata are searchable through the Explore Page. We have linked this integrated graphically controlled processing system so that the results of queries can be explored further prior to data download. Patterns and trends that may be observable across projects can be visualized and plotted using tools that we provide that are coupled with the data portal. We have developed techniques and standards to import and interlink data and metadata from a variety of different modalities, support highly flexible search and browsing of these data (Fig. 4), and enable linking analyzed results to raw data along with their provenance.

Investigators can query data based off specific filters, including gender, diagnosis, age, recording location, and data modalities available. These filters can then be used to create cohorts that can be used in downstream analyses.

Integrated analytic tools

By centralizing an enduring data archive, we allow the broader neuroscience research community to access and thereby analyze the data from various BRAIN Initiative projects. All information pertaining to data acquisition, quality control, pre-processing, and analyses are captured and retained, providing a comprehensive history and provenance to the data. Data provenance includes timestamped raw data with timeline noting data upload revisions and versions, preprocessed data (provided by data collectors or produced by users within associated analytic tools), saved cohorts, and analysis workflows saved by users. We have pioneered innovative standardization/co-registration references, fully supported by novel image and electrophysiology processing methods, to extract candidate biomarkers from the diverse data to address the specific projects’ goals. Spatial descriptions and co-registrations of regions of interest are made according to detailed coordinate/imaging maps of the brain, co-registered to sensors, such as implanted or scalp electrodes, when possible. With the aid of the LONI Pipeline^22,23 that is integrated into DABI, much of this work is automated. Not only is a well-curated and standardized multi-modal data set facilitating the development of models of various diseases, but it is also ensuring that such models are statistically significant and validated.

Data trends and correlations can then be calculated in DABI, without downloading raw data. Integrated software and analytics include image visualization, quality control²⁴, LONI Pipeline^22,23, Jupyter²⁵, R Analysis and Visualization of intracranial EEG Data (RAVE)²⁶, and a variety of statistical tests. RAVE allows users to visualize intracranial EEG (iEEG) recordings and apply various dimensionality reduction and statistical methods to analyze these large iEEG datasets^27,28. Investigators maintain complete ownership and control of their data. Unaffiliated users must be granted access from PIs to download raw data or conduct analysis using DABI’s built-in analytics.

Results: Sample Analyses within DABI

Performing statistical analyses

DABI has integrated a statistical workflow to allow investigators to perform exploratory analyses without needing to download any data to their local machine. The pipeline supports both nominal (e.g. diagnosis) and numeric variables (e.g. spectral analysis). The analysis framework eases identification of correlations between multiple variables and a target parameter via batch processing. Depending on the data distributions, and whether the independent and dependent variables are numeric or nominal, appropriate statistical tests are applied. For example, to evaluate the relationship between movement disorder diagnoses and band power, Welch’s T-test would be performed between the diagnosis and average band power for each frequency range. Comprehensive documentation of the statistical tests available within DABI can be accessed via the DABI site²⁹.

In one analysis we utilize data collected by Dr. Harrison Walker at the University of Alabama at Birmingham³⁰. His BRAIN Initiative study investigates directional lead technology with the goal to determine electrophysiology biomarkers that best predict the optimal combination of active contacts with directional DBS electrode technology. This dataset for 31 patients includes intraoperative electrophysiology, imaging, and longitudinal motor and neuropsychological testing. Within the DABI analysis workflow, users can explore various methods to assess symptom improvement over the course of the study. In this example, we perform a linear mixed effects regression to model the patients’ Parkinson’s Disease Questionnaire-8 (PDQ8) score across the session timeline (e.g. preoperative baseline, 2 month follow-up, 4 month follow-up, and 6 month follow-up). This analysis (Fig. 5) illustrates that PDQ8 scores decrease under current treatment (p-value: 0.005).

Machine learning ecosystem within DABI

DABI also supports advanced supervised data exploration through its innovative machine learning (ML) ecosystem. The ML framework includes supervised learning to perform classification and regression with standard algorithms in the H2O AutoML library³¹, an open-source, R and Python-supported infrastructure for scalable ML. Supported models include General Linear Model, Random Forest, XGBOOST, Gradient Boosting, Deep Learning, and Stacked Ensemble. DABI’s automated ML ecosystem has been designed in a way such that users can build models within minutes even without knowledge of programming by inputting specific parameters, such as the number of folds for k-fold cross-validation or whether to perform oversampling to balance an unbalanced dataset. Analysis results such as variable importance and area under the curve measures are automatically visualized to allow users to evaluate model performance.

The ML ecosystem is built to allow custom electrophysiology analyses conducted in RAVE to be easily and automatically introduced as variables in ML. In this example, we performed task-based iEEG analyses in RAVE for 26 Parkinson’s disease and Dystonia patients from Dr. Nader Pouratian’s study conducted at University of California, Los Angeles and UT Southwestern³². In this study, Dr. Pouratian records invasive neurophysiology during deep brain stimulation surgery to study network level control of motor control. The team obtains multi-focal cortical and basal ganglia recordings across three action suppression tasks (self-paced movement, Eriksen Flanker task, and stop signal). Electrophysiology data were first preprocessed to remove false trials and perform notch filtering and wavelet decomposition. Then, RAVE power signal outputs and harmonized brain recording regions were introduced into several ML models (Random Forest, Gradient Boosting Machine) to explore whether there were correlations between these variables and the task type (Fig. 6).

Future goals

By working closely with BRAIN Initiative data providers, we aim to adapt DABI and develop more features that would be beneficial for the data providers. User feedback is critical to this process, and we hope that close involvement by BRAIN Initiative partners will help us iteratively design a useful site for data storage, exploration, and analysis. As more projects are onboarded and more data are shared, we expect the platform will be even more useful for both data providers and other researchers who request access to data. As reach has grown, we have added a Frequently Asked Questions page to the site to support new users. We have also added a feature that allows users to submit questions and requests for features on the contact page to encourage user feedback. We plan to continue to request feedback from users regularly so that we can adapt the platform to their specific needs.

Discussion

DABI is a unique and innovative data repository designed for the needs of the BRAIN Initiative projects that collect human invasive recordings. It is a platform of networked and centralized web-accessible data archives to capture, store, and curate invasive human neurophysiological data and make them broadly available and accessible to the scientific community for furthering neuroscience research. Moreover, DABI supports investigators by providing a centralized platform to organize, compare, analyze, and share vast arrays of data collected in intracranial human recording studies in one centralized location.

There is significant need for a centralized data archive for human invasive recordings and research involving such data on various neurological and neuropsychiatric disorders. The process of analyzing such data is often multifactorial and crosses multiple modalities, and investigators require access to a large number of high quality, well-curated data points and study subjects. Since data generating and collecting sites are spread among different laboratories, clinical sites, heterogeneous data types, and formats, before the data can even be analyzed, they must be standardized, and tools for searching, viewing, annotating, and analyzing them must be coupled to those data. We have described how we have addressed these challenges while building DABI to include data de-identification, data quality assessment, analysis results data integration, mapping of data attributes into a common schema for use in search and visualization interfaces, interfaces to search, select, and download data, integrated processing so that database and compute resources are coupled, a visualization interface for inspecting and comparing data, and a comprehensive website containing training materials, study-related information (e.g., protocols), announcements, and a feedback knowledgebase wherein investigators post questions and receive answers from DABI and the community. We have also shown the power of integrated analyses with an example from representative DABI datasets.

Data availability

The datasets supporting the results of this article are available in the Data Archive for the BRAIN Initiative (DABI), https://doi.org/10.18120/sr2n-gz34³⁰ (PI: Walker) and https://doi.org/10.18120/x7mj-am06³² (PI: Pouratian).

Code availability

Open-source software was used for analysis, including the open-source H2O library for machine learning frameworks, Jupyter notebooks for python analysis, and R Analysis and Visualization of intracranial EEG (RAVE).

This project is supported by the NIH/NINDS under award number R24MH114796. Data used in the sample analyses presented were supported by the NIH/NINDS under award numbers UH3NS100553, R01NS119520 and U01NS098961 and the Michael J. Fox Foundation under grant number 15098.

References

Insel, T. R., Landis, S. C. & Collins, F. S. The NIH BRAIN Initiative. Science. 10, 687–688 (2013).
Article ADS Google Scholar
Petersen, R. C. et al. Alzheimer’s Disease Neuroimaging Initiative (ADNI): clinical characterization. Neurology. 74, 201 (2010).
Article PubMed PubMed Central Google Scholar
Ashish, N., Bhatt, P. & Toga, A. W. Global data sharing in Alzheimer’s disease research. Alzheimer Dis. Assoc. Disord. 30, 160 (2016).
Article PubMed PubMed Central Google Scholar
Marek, K. et al. The Parkinson’s progression markers initiative (PPMI) – establishing a PD biomarker cohort. Ann. Clin. Transl. Neurol. 5, 1460–1477 (2018).
Article CAS PubMed PubMed Central Google Scholar
Smolensky, L. et al. Fox Insight collects online, longitudinal patient-reported outcomes and genetic data on Parkinson’s disease. Sci. Data 2020 71 7, 1–9 (2020).
Google Scholar
Holdgraf, C. et al. iEEG-BIDS, extending the Brain Imaging Data Structure specification to human intracranial electrophysiology. Sci. Data 2019 61 6, 1–6 (2019).
Google Scholar
Klatt, J. et al. The EPILEPSIAE database: an extensive electroencephalography database of epilepsy patients. Epilepsia 53, 1669–1676 (2012).
Article PubMed Google Scholar
Kini, L. G., Davis, K. A. & Wagenaar, J. B. Data integration: combined imaging and electrophysiology data in the cloud. Neuroimage 124, 1175 (2016).
Article PubMed Google Scholar
Wagenaar, J. B. et al. Collaborating and sharing data in epilepsy research. J. Clin. Neurophysiol. 32, 235 (2015).
Article PubMed PubMed Central Google Scholar
Amari, S. I. et al. Neuroinformatics: the integration of shared databases and tools towards integrative neuroscience. J. Integr. Neurosci. 1, 117–128 (2002).
Article PubMed Google Scholar
Toga, A. W. Neuroimage databases: the good, the bad and the ugly. Nat. Rev. Neurosci. 3, 302–309 (2002).
Article CAS PubMed Google Scholar
Teeters, J. L. et al. Neurodata Without Borders: creating a common data format for neurophysiology. Neuron 88, 629–634 (2015).
Article CAS PubMed Google Scholar
Keator, D. B. et al. Towards structured sharing of raw and derived neuroimaging data across existing resources. Neuroimage 82, 647–661 (2013).
Article CAS PubMed Google Scholar
Sochat, V. & Nichols, B. N. The Neuroimaging Data Model (NIDM) API. Gigascience 5 (2016).
Maumet, C. et al. Sharing brain mapping statistical results with the neuroimaging data model. Sci. Data 2016 31 3, 1–15 (2016).
Google Scholar
Kennedy, D. N. et al. Everything matters: the repronim perspective on reproducible neuroimaging. Front. Neuroinform. 13, 1 (2019).
Article PubMed PubMed Central Google Scholar
Keator, D. et al. Tools for FAIR neuroimaging experiment metadata annotation with NIDM experiment. OHBM 2019-25th Annu. Meet. Organ. Hum. Brain Mapp. 1–5 (2019).
Helmer, K. et al. Constructing an ontology of neuroscience experiments for the Neuroimaging Data Model (NIDM). OHBM 2019-25th Annu. Meet. Organ. Hum. Brain Mapp. 1–4 (2019).
Keator, D. et al. NIDM terminology: terminologies for the neuroimaging community. Available at: https://scicrunch.org/nidm-terms. (Accessed: 1st July 2022).
Markiewicz, C. J. et al. The openneuro resource for sharing of neuroscience data. Elife 10 (2021).
Pezoa, F., Reutter, J. L., Suarez, F., Ugarte, M. & Vrgoč, D. Foundations of JSON schema. 25th Int. World Wide Web Conf. WWW 2016 263–273 https://doi.org/10.1145/2872427.2883029 (2016).
Rex, D. E., Ma, J. Q. & Toga, A. W. The LONI Pipeline processing environment. Neuroimage 19, 1033–1048 (2003).
Article PubMed Google Scholar
Dinov, I. D. et al. Efficient, distributed and interactive neuroimaging data analysis using the LONI Pipeline. Front. Neuroinform. 3 (2009).
Kim, H. et al. The LONI QC system: a semi-automated, web-based and freely-available environment for the comprehensive quality control of neuroimaging data. Front. Neuroinform. 13, 60 (2019).
Article PubMed PubMed Central Google Scholar
Kluyver, T. et al. Jupyter Notebooks - a publishing format for reproducible computational workflows. ELPUB 87–90, https://doi.org/10.3233/978-1-61499-649-1-87 (2016).
Magnotti, J. F., Wang, Z. & Beauchamp, M. S. RAVE: comprehensive open-source software for reproducible analysis and visualization of intracranial EEG data. Neuroimage 223 (2020).
Karas, P. J. et al. The visual speech head start improves perception and reduces superior temporal cortex responses to auditory speech. Elife 8 (2019).
Metzger, B. A. et al. Responses to visual speech in human posterior superior temporal gyrus examined with iEEG deconvolution. J. Neurosci. 40, 6938–6948 (2020).
Article CAS PubMed PubMed Central Google Scholar
Duncan, D., Toga, A. W. & Pouratian, N. Data Archive for the BRAIN Initiative (DABI) analysis documentation. Available at: https://dabi.loni.usc.edu/assets/files/dabi-analysis.pdf (Accessed: 7th January 2022).
Walker, H. C. Dataset: Noninvasive biomarkers to advance emerging DBS electrode technologies in Parkinson’s disease. USC Mark and Mary Stevens Neuroimaging and Informatics Institute https://doi.org/10.18120/sr2n-gz34 (2021)
LeDell, E. H2O AutoML: Scalable automatic machine learning. 7th ICML Work. Autom. Mach. Learn. (2020).
Pouratian, N. Dataset: Invasive approach to model human cortex-basal ganglia action-regulating networks. USC Mark and Mary Stevens Neuroimaging and Informatics Institute https://doi.org/10.18120/x7mj-am06 (2021).

Download references

Acknowledgements

The authors would like to thank the rest of the DABI development team (Samuel Hobel, Faraz Rabbani, Kalpana Sundaram, Caroline O’Driscoll, Priyanka Subash, Alex Gray, Tom Picton, Samantha Cohen, and Sana Salehi, University of Southern California).

Author information

Authors and Affiliations

Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, Los Angeles, CA, USA
Dominique Duncan, Rachael Garner & Arthur W. Toga
Department of Neurology, University of Alabama at Birmingham, Birmingham, AL, USA
Sarah Brinkerhoff & Harrison C. Walker
Department of Neurological Surgery, UT Southwestern Medical Center, Dallas, TX, USA
Nader Pouratian

Authors

Dominique Duncan
View author publications
You can also search for this author in PubMed Google Scholar
Rachael Garner
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Brinkerhoff
View author publications
You can also search for this author in PubMed Google Scholar
Harrison C. Walker
View author publications
You can also search for this author in PubMed Google Scholar
Nader Pouratian
View author publications
You can also search for this author in PubMed Google Scholar
Arthur W. Toga
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Dominique Duncan: Conceptualization, Validation, Writing – Original Draft, Visualization, Supervision, Project Administration, Funding Acquisition. Rachael Garner: Methodology, Software, Validation, Writing – Original Draft, Visualization. Sarah Brinkerhoff: Data Curation, Formal Analysis, Writing – Review & Editing. Harrison C. Walker: Data Curation, Investigation, Writing – Review & Editing.Nader Pouratian: Conceptualization, Data Curation, Investigation, Writing – Review & Editing, Supervision, Funding Acquisition. Arthur W. Toga: Conceptualization, Resources, Writing – Review & Editing, Supervision, Project Administration, Funding Acquisition.

Corresponding author

Correspondence to Dominique Duncan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Duncan, D., Garner, R., Brinkerhoff, S. et al. Data Archive for the BRAIN Initiative (DABI). Sci Data 10, 83 (2023). https://doi.org/10.1038/s41597-023-01972-z

Download citation

Received: 27 January 2022
Accepted: 17 January 2023
Published: 09 February 2023
DOI: https://doi.org/10.1038/s41597-023-01972-z
Springer Nature Limited

This article is cited by

A comparison of neuroelectrophysiology databases
- Priyanka Subash
- Alex Gray
- Dominique Duncan
Scientific Data (2023)
Data Archive for the BRAIN Initiative (DABI)
- Dominique Duncan
- Rachael Garner
- Arthur W. Toga
Scientific Data (2023)
Modified Neuropixels probes for recording human neurophysiology in the operating room
- Brian Coughlin
- William Muñoz
- Angelique C. Paulk
Nature Protocols (2023)

Data Archive for the BRAIN Initiative (DABI)

Abstract

Similar content being viewed by others

A comparison of neuroelectrophysiology databases

The Human Connectome Project's neuroimaging approach

Standardization of electroencephalography for multi-site, multi-platform and multi-investigator studies: insights from the canadian biomarker integration network in depression

Introduction