The braingraph.org database with more than 1000 robust human connectomes in five resolutions

Varga, Bálint; Grolmusz, Vince

doi:10.1007/s11571-021-09670-5

The braingraph.org database with more than 1000 robust human connectomes in five resolutions

Brief Communication
Open access
Published: 12 March 2021

Volume 15, pages 915–919, (2021)
Cite this article

Download PDF

You have full access to this open access article

Cognitive Neurodynamics Aims and scope Submit manuscript

The braingraph.org database with more than 1000 robust human connectomes in five resolutions

Download PDF

1311 Accesses
4 Citations
Explore all metrics

Abstract

The human brain is the most complex object of study we encounter today. Mapping the neuronal-level connections between the more than 80 billion neurons in the brain is a hopeless task for science. By the recent advancement of magnetic resonance imaging (MRI), we are able to map the macroscopic connections between about 1000 brain areas. The MRI data acquisition and the subsequent algorithmic workflow contain several complex steps, where errors can occur. In the present contribution we describe and publish 1064 human connectomes, computed from the public release of the Human Connectome Project. Each connectome is available in 5 resolutions, with 83, 129, 234, 463 and 1015 anatomically labeled nodes. For error correction we follow an averaging and extreme value deleting strategy for each edge and for each connectome. The resulting 5320 braingraphs can be downloaded from the https://braingraph.org site. This dataset makes possible the access to this graphs for scientists unfamiliar with neuroimaging- and connectome-related tools: mathematicians, physicists and engineers can use their expertize and ideas in the analysis of the connections of the human brain. Brain scientists and computational neuroscientists also have a robust and large, multi-resolution set for connectomical studies.

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Connectomes or braingraphs are compact and focused derivatives of the diffusion magnetic resonance images (MRIs) of the brain: their vertices are labeled by the anatomical areas, and two such vertices are connected by a weighted graph-edge, if a tractography workflow Besson et al. (2014) finds neural tracks between the areas, corresponded to the vertices. By focusing on the connections between cerebral areas instead of analyzing the whole MR image, we can make use of the rich and refined resources of graph theory, born with the famous article of Leonhard Euler on the problem of the Königsberg Bridges Euler (1741) in 1741.

Our research group earlier has prepared several undirected and directed braingraph sets (Kerepesi et al. 2016, 2017; Szalkai et al. 2015a, 2017a, 2019a) from the 500 Subjects Data Release McNab et al. (2013) of the Human Connectome Project (HCP). The resulting graphs were made available at the site https://braingraph.org, and were applied in several structural studies of the human brain (Szalkai et al. 2015b; Kerepesi et al. 2018a; Szalkai et al. 2019a; Kerepesi et al. March 2018b; Szalkai et al. Feb 2019b, 2018; Szalkai et al. 2017b; Szalkai et al. 2016; Fellner et al. 2019, 2020a, 2020b).

In the present contribution we describe a new braingraph set, computed from the 1200 Subjects Data Release of the Human Connectome Project McNab et al. (2013). The set contains 1064 connectomes, each in five resolutions, and each edge is weighted by three different weight functions. Our dataset may serve as a robust resource for the computational neuroscience community in the coming years.

Methods

The data source of the workflow is the 1200 Subjects Data Release of the Human Connectome Project (HCP) McNab et al. (2013), documented at the site https://www.humanconnectome.org/study/hcp-young-adult/document/1200-subjects-data-release. For the present study the “re-preprocessed” 3T diffusion data was applied, as was detailed at the HCP site.

The Connectome Mapper Tool Kit (CMTK) workflow Daducci et al. (2012) was utilized in the graph computation on the HCP data. For each subject, we have applied the segmentation and the parcellation steps only once, but the probabilistic tractography part of the workflow 10 times. The parcellation scheme was the Lausanne2008 atlas, the labels applied are listed in https://github.com/LTS5/cmp_nipype/blob/master/cmtklib/data/parcellation/lausanne2008/ParcellationLausanne2008.xls.

The graph construction was performed in the following steps:

1.
For each subject the MRtrix 0.3 tractography algorithm Tournier et al. (2012) was run, with probabilistic seeding and probabilistic tractography. The number of streamlines was set to 1 million. For defining the graph edges, let us consider two distinct, anatomically labeled areas of the cortical- or sub-cortical gray areas of the brain, denoted by A and B. If the tractography algorithm found at least one streamline between the area A and B, then vertex a, representing area A was connected to vertex b, representing area B, by a graph edge. The three weights of $\{a,b\}$ give the number of streamlines or fibers found between areas A and B, the average length of the streamlines, and the mean fractional anisotropy of the streamlines.
2.
Step 1 was repeated 10 times for each subject. We accepted $\{a,b\}$ to be an edge of the connectome of the subject, if it was present in all ten graphs computed in the repetitions. Next, for each edge we computed the maximum and the minimum number of the fibers, defining that edge, and deleted those two extremal values. Consequently, there remained 8 fiber numbers for each edge. We computed the mean value of those fiber numbers, the mean value of the lengths of the streamlines and the fractional anisotropies for the three weights of the edge.

In other words, the probabilistic tractography was performed 10 times, the graphs were constructed after each run, (i.e., 10 graphs were constructed for each subject), next the extremal fiber number values were deleted, the remaining 8 values were averaged, and the edges, which were present in all 10 graphs were allowed to be included in the resulting graph.

Steps 1 and 2 were performed only in the highest (i.e., the finest) resolution with 1015 vertices. For lower resolutions, the graphs were computed from the 1015-vertex graph by contracting vertices, summing the fiber numbers of the multiple edges between the two contracted vertices and contracting the multiple edges.

On the choice of 10 as the repetition number of the probabilistic tractography we refer to the detailed analysis in the “Discussion and results” section below.

From the dataset of the HCP website we were able to finish the graph computations for 1064 subjects.

The computation was done on our 24-member Intel i7 cluster (each with 6 physical and 12 virtual CPU cores and 16 GB of RAM) within 3 weeks running time.

Data records

The data source of this work was published at the Human Connectome Project’s website at http://www.humanconnectome.org/McNab et al. (2013) as the 1200 Subjects Public Release. The parcellation data, containing the anatomically labeled ROIs, is listed in the CMTK nypipe GitHub repository https://github.com/LTS5/cmp_nipype/blob/master/cmtklib/data/parcellation/lausanne2008/ParcellationLausanne2008.xls.

The braingraphs, computed by us, can be accessed at the https://braingraph.org/cms/download-pit-group-connectomes/ site, by selecting one of the download options, denoted by “X nodes set, 1064 brains, 1 000 000 streamlines, 10x repeated”, where $X=86, 129, 234, 463, 1015$.

The graphs are given in GraphML format, described in https://cmtk.org Daducci et al. (2012). Each file begins with an attribute definition section, then the nodes are described with their coordinates and anatomical labels, corresponding to the parcellation at https://github.com/LTS5/cmp_nipype/blob/master/cmtklib/data/parcellation/lausanne2008/ParcellationLausanne2008.xls.

Next the (un-directed) edges are listed. The edges carry three weights:

The number of fibers;
The mean value of the fiber lengths in the edge;
And the mean fractional anisotropy of the fibers

Note that the edge weights are averages from the eight of the ten tractography-runs, therefore, even the fiber number is—typically —a non-integer.

Discussion and results

Here we describe the workflow, which implied the choice of the 10 repetitions of step 1 in the graph construction above. We note that the present section describes only the process, resulting the specific choice of the repetition number 10, and not the actual graph construction (which was already duly described in the “Methods” section).

The implementations of the deterministic tractography algorithms also contain a probabilistic seeding step; i.e., two runs of these tractography computations almost always yield different results. When we use probabilistic tractography Girard et al. Sep (2014); Buchanan et al. Feb (2014), it is evident that distinct runs yield different results.

For generating reproducible results in the graph construction with a probabilistic tractography phase, it is a natural idea to repeat the probabilistic tractography algorithm for the very same input several times, and to average the results of the tractography in a careful way.

Let us fix two vertices, and let the random variable X denote the number of fibers discovered between then, then, clearly, for any X: $E(X-E(X))=E(X)-E(X)=0$, that is, the expectation of the difference of X from its expected value E(X) is 0. This fact implies that the repetitions and the averaging will increase the reliability of the tractography results.

For the determination of the number of repetitions k, with the trade-off with practical computability and robustness, we have followed the strategy, described as follows. In short, we determined the number of necessary repetitions by comparing deviations for 10 average values, each for k repetitions, for $k=1,2,\ldots ,50$.

More exactly, we have chosen 9 subjects: for each non-zero leading digits of the ID numbers, one was chosen randomly (the choices were: 136631, 200008, 300618, 401422, 500222, 601127,700634, 800941, 901038). For a given subject, and a given positive integer value k, we have generated the following ten braingraphs:

$$\begin{aligned} {G_k}_1, {G_k}_2, \ldots {G_k}_{10}, \end{aligned}$$

where ${G_k}_i$ was calculated by k repetitions of the tractography phase, and averaging the numbers of fibers for each edge on the k runs.

For $i=1,2,\ldots ,10$, we have generated independent k instances, and averaged these k fiber numbers for each edge. Next, we have thrown out those edges, which were not present in all the ten copies of the averaged graphs. Now, for each remaining edge $\{u,v\}$ of the graph G, we computed the average fiber number values over k repetitions: one average value $w^{(k)}_i(u,v)$ for each i in ${G_k}_i$, for $i=1,2,\ldots ,10$. For readability, we omit (u, v) from $w^{(k)}_i(u,v)$ in what follows.

For these ten $w^{(k)}_i$ values we computed the relative standard deviation (also called coefficient of variation) of the ten $w^{(k)}_i$ values:

$$\begin{aligned} c_v(w^{(k)})={\sigma (w^{(k)})\over \mu (w^{(k)})}, \end{aligned}$$

(1)

where

$$\begin{aligned} \mu (w^{(k)})={ 1\over 10}\sum _{i=1}^{10}w^{(k)}_i, \ \ \sigma (w^{(k)})=\sqrt{{1\over 9}\sum _{i=1}^{10} (w^{(k)}_i-\mu (w^{(k)}))^2} \end{aligned}$$

(2)

Figure 1 displays the change of the relative standard deviation of the fiber number of a given edge (the edge, connecting vertex 19 and vertex 21 in the 463-vertex resolution in the case of subject No. 901038) for $k=1,2,\ldots ,50$.

Figure 2 shows the change of the relative standard deviations, averaged for all edges as a function of k, in the case of a given braingraph, in 234-vertex resolution. Supporting Figures 1, 2, 3 and 4 show the same in graphs of different resolutions.

Based on the visual examination of Figure 2 (and the related figures for other resolutions and subjects, cf. Supporting Figs. 1, 2, 3 and 4), we have chosen the $k=10$ value for repetitions as a good trade-off between deviation and practical computability: for repetitions $k>10$ the decrease of the red horizontal lines, showing the median relative standard deviations, is very small on Fig. 2 and Supporting Figs. 1 and 2, and still small on Supporting Figs. 3 and 4.

References

Besson P, Dinkelacker V, Valabregue R, Thivard L, Leclerc X, Baulac M, Sammler D, Colliot O, Lehéricy S, Samson S, Dupont S (2014) Structural connectivity differences in left and right temporal lobe epilepsy. Neuroimage 100C:135–144. https://doi.org/10.1016/j.neuroimage.2014.04.071
Article Google Scholar
Buchanan CR, Pernet CR, Gorgolewski KJ, Storkey AJ, Bastin ME (2014) Test-retest reliability of structural brain networks from diffusion MRI. Neuroimage 86:231–243. https://doi.org/10.1016/j.neuroimage.2013.09.054
Article PubMed Google Scholar
Daducci A, Gerhard S, Griffa A, Lemkaddem A, Cammoun L, Gigandet X, Meuli R, Hagmann P, Thiran JP (2012) The connectome mapper: an open-source processing pipeline to map connectomes with MRI. PLoS One 7(12):e48121. https://doi.org/10.1371/journal.pone.0048121
Article CAS PubMed PubMed Central Google Scholar
Euler L. Solutio problematis ad geometriam situs pertinentis. Commentarii Academiae Scientarum Imperialis Petropolitanae 8 (1): 128–140, 1741. http://eulerarchive.maa.org//docs/originals/E053.pdf
Fellner M, Varga B, Grolmusz V (2019) The frequent subgraphs of the connectome of the human brain. Cognit Neurodynam 13(5):453–460. https://doi.org/10.1007/s11571-019-09535-y
Article Google Scholar
Fellner M, Varga B, Grolmusz V (2020a) The frequent complete subgraphs in the human connectome. PloS One 15(8):e0236883. https://doi.org/10.1371/journal.pone.0236883
Article CAS PubMed PubMed Central Google Scholar
Fellner M, Varga B, Grolmusz V (2020b) The frequent network neighborhood mapping of the human hippocampus shows much more frequent neighbor sets in males than in females. PLOS One 15(1):e0227910. https://doi.org/10.1371/journal.pone.0227910
Article CAS PubMed PubMed Central Google Scholar
Girard G, Whittingstall K, Deriche R, Descoteaux M (2014) Towards quantitative connectivity analysis: reducing tractography biases. Neuroimage 98:266–278. https://doi.org/10.1016/j.neuroimage.2014.04.074
Article PubMed Google Scholar
Kerepesi C, Szalkai B, Varga B, Grolmusz V (2016) How to direct the edges of the connectomes: dynamics of the consensus connectomes and the development of the connections in the human brain. PLOS One 11(6):e0158680. https://doi.org/10.1371/journal.pone.0158680
Article CAS PubMed PubMed Central Google Scholar
Kerepesi C, Szalkai B, Varga B, Grolmusz V (2017) The braingraph. org database of high resolution structural connectomes and the brain graph tools. Cognit Neurodynam 11(5):483–486
Article Google Scholar
Kerepesi C, Szalkai B, Varga B, Grolmusz V (2018a) Comparative connectomics: mapping the inter-individual variability of connections within the regions of the human brain. Neurosci Lett 662(1):17–21. https://doi.org/10.1016/j.neulet.2017.10.003
Article CAS PubMed Google Scholar
Kerepesi C, Varga B, Szalkai B, Grolmusz V (2018b) The dorsal striatum and the dynamics of the consensus connectomes in the frontal lobe of the human brain. Neurosci Lett 673:51–55. https://doi.org/10.1016/j.neulet.2018.02.052
Article CAS PubMed Google Scholar
McNab JA, Edlow BL, Witzel T, Huang SY, Bhat H, Heberlein K, Feiweier T, Liu K, Keil B, Cohen-Adad J, Tisdall D, Folkerth RD, Kinney HC, Wald LL (2013) The Human Connectome Project and beyond: initial applications of 300 mT/m gradients. Neuroimage 80:234–245. https://doi.org/10.1016/j.neuroimage.2013.05.074
Article PubMed Google Scholar
Szalkai B, Kerepesi C, Varga B, Grolmusz V (2015a) The Budapest reference connectome server v2. 0. Neurosci Lett 595:60–62
Article CAS Google Scholar
Szalkai B, Varga B, Grolmusz V (2015b) Graph theoretical analysis reveals: Women’s brains are better connected than men’s. PLoS One 10(7):e0130045. https://doi.org/10.1371/journal.pone.0130045
Article CAS PubMed PubMed Central Google Scholar
Szalkai B, Kerepesi C, Varga B, Grolmusz V (2017a) Parameterizable consensus connectomes from the human connectome project: the budapest reference connectome server v3.0. Cognit Neurodynam 11(1):113–116. https://doi.org/10.1007/s11571-016-9407-z
Article Google Scholar
Szalkai B, Varga B, Grolmusz V (2017b) The robustness and the doubly-preferential attachment simulation of the consensus connectome dynamics of the human brain. Sci Rep. https://doi.org/10.1038/s41598-017-16326-0
Article PubMed PubMed Central Google Scholar
Szalkai B, Varga B, Grolmusz V (2018) Comparing advanced graph-theoretical parameters of the connectomes of the lobes of the human brain. Cognit Neurodynam 12(6):549–559
Article Google Scholar
Szalkai B, Kerepesi C, Varga B, Grolmusz V (2019a) High-resolution directed human connectomes and the consensus connectome dynamics. PLoS ONE 14(4):e0215473. https://doi.org/10.1371/journal.pone.0215473
Article CAS PubMed PubMed Central Google Scholar
Szalkai B, Varga B, Grolmusz V (2019b) Mapping correlations of psychological and connectomical properties of the dataset of the human connectome project with the maximum spanning tree method. Brain Imag Behav 13(5):1185–1192. https://doi.org/10.1007/s11682-018-9937-6
Article Google Scholar
Szalkai B, Varga B, and Grolmusz V (2021) The graph of our mind. Brain Sci 11(3):342. https://doi.org/10.3390/brainsci11030342
Tournier J, Calamante F, Connelly A et al (2012) Mrtrix: diffusion tractography in crossing fiber regions. Int J Imag Syst Technol 22(1):53–66
Article Google Scholar

Download references

Acknowledgements

Data were provided in part by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University. VG and BV were partially supported by the VEKOP-2.3.2-16-2017-00014 program, supported by the European Union and the State of Hungary, co-financed by the European Regional Development Fund, and the NKFI-127909 grant of the National Research, Development and Innovation Office of Hungary. VG and BV was supported in part by the EFOP-3.6.3-VEKOP-16-2017-00002 grant, supported by the European Union, co-financed by the European Social Fund.

Funding

Open access funding provided by Eötvös Loránd University.

Author information

Authors and Affiliations

PIT Bioinformatics Group, Eötvös University, 1117, Budapest, Hungary
Bálint Varga & Vince Grolmusz
Uratim Ltd., 1118, Budapest, Hungary
Vince Grolmusz

Authors

Bálint Varga
View author publications
You can also search for this author in PubMed Google Scholar
Vince Grolmusz
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

BV constructed the image processing system, computed the braingraphs, and prepared the figure, VG has secured funding, initiated the study, analyzed data and wrote the paper.

Corresponding author

Correspondence to Vince Grolmusz.

Ethics declarations

Conflicts of interest

The authors declare no conflicts of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary material 1 (png 39 KB)

Supplementary material 2 (png 39 KB)

Supplementary material 3 (png 39 KB)

Supplementary material 4 (png 40 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Varga, B., Grolmusz, V. The braingraph.org database with more than 1000 robust human connectomes in five resolutions. Cogn Neurodyn 15, 915–919 (2021). https://doi.org/10.1007/s11571-021-09670-5

Download citation

Received: 02 September 2020
Revised: 03 February 2021
Accepted: 13 February 2021
Published: 12 March 2021
Issue Date: October 2021
DOI: https://doi.org/10.1007/s11571-021-09670-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The braingraph.org database with more than 1000 robust human connectomes in five resolutions

Abstract

Explore related subjects

Introduction

Methods

Data records

Discussion and results

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary material 1 (png 39 KB)

Supplementary material 2 (png 39 KB)

Supplementary material 3 (png 39 KB)

Supplementary material 4 (png 40 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation