Here we describe the Dementias Platform UK (DPUK) Data Portal (https://portal.dementiasplatform.uk/) [1]. The Data Portal is a collaboration between DPUK and a growing number of cohort research teams who wish to make their data globally accessible. DPUK was established by the Medical Research Council (MRC) to accelerate the development of new treatments for dementia. The Data Portal is a component infrastructure of DPUK, designed to exploit the opportunities provided by cohort data to inform experimental medicine and to improve access to cohort data more widely. It is a data repository facilitating access to data for 3 370 929 individuals in 42 cohorts. The Data Portal provides a secure, fully auditable, remote access environment for the analysis of cohort data. The Data Portal supports the FAIR principles (Findability, Accessibility, Interoperability and Reusability) to improve the infrastructure supporting the re-use of data [2].
Arguments for multi-cohort-focused data repositories include: (1) as research questions focus on smaller effect sizes, access is required to data at-scale to achieve statistical purchase, (2) as emerging research questions become more complex, access to diverse multi-modal data is needed for rigorous hypothesis testing, (3) as scientific rigour increases there is growing recognition of the value of triangulation and replication using independent datasets, (4) as cohort datasets increase in size the transfer of large datasets is decreasingly feasible, (5) as cohort datasets become more complex the mastering of bespoke data models for survey, omics (genomics, proteomics and metabolomics), imaging and device data becomes burdensome, (6) as cohort datasets become more sensitive the non-auditable use of data is decreasingly acceptable. Whilst these issues can be addressed individually, the Data Portal provides an integrated solution.
The Data Portal is an end-to-end data management solution designed to support cohort data sharing. All projects utilising the data are by default collaborations with the cohort research teams generating the data. Although it has the three core utilities of data discovery, access, and analysis, to achieve these it operates across seven layers (Fig. 1).
Layer 1: Data ingestion
The data journey begins with upload to the Data Portal. Datasets and data dictionaries are received from cohorts on an ‘as-is’ basis along with other supporting documentation. The Data Portal operates within the UK Secure eResearch Platform (UKSeRP) environment according to ISO 27001 [3] and operates exclusively as a data processor according to the UK Data Protection Act 2018 [4] and EU General Data Protection Regulation 2016 [5]. DPUK facilitates the legal engagements necessary for data transfer into the Data Portal on behalf of data controllers, by ensuring robust contractual arrangements are in place as an overarching mechanism for data governance and use [6]. In practice the data controller is considered to be the principal investigator for the dataset. A dataset is removed from the Data Portal upon receipt of a wet-ink signature request from the data controller.
Layer 2: Data curation
Upon receipt, ‘native’ data are curated to a common data model (C-Surv). The C-Surv ontology is designed to simplify the analytic challenge of working across multiple datasets and multiple modalities by providing standard conventions for structure, variable naming and value labelling. Other data models, such as CDISC [7], OMOP [8] and HPO [9], involve structural complexity that is rarely relevant to cohort-based analyses. An example variable name using the C-Surv model is given below:
GEN05_PAINCHESTEVR_0_1
The cohort is identified by a three-letter alphabetic code (GEN for Generation Scotland), and the category level by a two-digit numeric code (05 for physical health status). The measure is described by an alphanumeric acronym (PAINCHESTEVR for: Do you ever get pain or discomfort in your chest?). This is followed by an integer giving the number of repeat measurements within a study wave (_0 indicates there were no repeat measurements). Finally, an integer suffix indicates the study wave (_1 for recruitment, _2 for the first follow-up, etc.). This data model is a contribution to a wider debate on how global access to cohort data can be achieved.
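The naming convention above can be sketched as a short parser. This is an illustrative reconstruction of the convention as described, not DPUK tooling; the pattern simply splits a name into the cohort, category, measure, repeat and wave components.

```python
import re

# Hypothetical parser for C-Surv variable names, based on the convention
# described above: <COHORT><CATEGORY>_<MEASURE>_<REPEAT>_<WAVE>
CSURV_PATTERN = re.compile(
    r"^(?P<cohort>[A-Z]{3})"    # three-letter cohort code, e.g. GEN
    r"(?P<category>\d{2})"      # two-digit category code, e.g. 05
    r"_(?P<measure>[A-Z0-9]+)"  # alphanumeric measure acronym
    r"_(?P<repeat>\d+)"         # repeats within a wave (0 = none)
    r"_(?P<wave>\d+)$"          # study wave (1 = recruitment)
)

def parse_csurv(name: str) -> dict:
    """Split a C-Surv variable name into its components."""
    m = CSURV_PATTERN.match(name)
    if m is None:
        raise ValueError(f"not a valid C-Surv variable name: {name!r}")
    return m.groupdict()

print(parse_csurv("GEN05_PAINCHESTEVR_0_1"))
# {'cohort': 'GEN', 'category': '05', 'measure': 'PAINCHESTEVR',
#  'repeat': '0', 'wave': '1'}
```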
C-Surv is optimised for the analysis of ‘flat-file’ data. Higher-order data must be pre-processed prior to curation. The XNAT [10] imaging platform is used to receive and process DICOM and NIfTI files. For genetics, variant call format and allele frequency data may be uploaded. C-Surv is also designed to improve the efficiency of data selection and management from the perspectives of applicants and administrators. There are 22 categories and 132 sub-categories, which may be used for data selection as an alternative to individual variable selection. The ontology is machine-readable for mapping to other ontologies. Researchers may request access to either native or curated data. Standardisation does not imply harmonisation, i.e. comparability (equivalence) of values and/or distributions across variables. Data harmonisation is implicitly purpose-specific. DPUK undertakes a very limited harmonisation programme solely to support data discovery (see Layer 4). Data curation is resource intensive and ongoing. To accelerate the process we are developing machine-learning approaches, which currently achieve 80% accuracy.
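Because category codes are embedded in each variable name, category-level selection reduces to a prefix match. The variable names below (other than the chest-pain example) are invented for illustration; the point is the selection mechanism, not the specific variables.

```python
# Illustrative sketch: selecting every variable in a C-Surv category by
# its two-digit code, rather than listing variables individually.
# All names except GEN05_PAINCHESTEVR_0_1 are hypothetical.
variables = [
    "GEN05_PAINCHESTEVR_0_1",
    "GEN05_BLOODPRESSSYS_0_1",
    "GEN12_SMOKEEVR_0_1",
]

def select_by_category(names, cohort, category):
    """Return all variables in a given cohort and category."""
    prefix = f"{cohort}{category}_"
    return [n for n in names if n.startswith(prefix)]

print(select_by_category(variables, "GEN", "05"))
# ['GEN05_PAINCHESTEVR_0_1', 'GEN05_BLOODPRESSSYS_0_1']
```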
Layer 3: Interoperability
We anticipate a global mixed-model data access environment that respects sovereign boundaries and requires the highest levels of security. In practice, this involves the integration of individual participant data across datasets (‘pooled’ analyses) and the integration of summary data across datasets (‘federated’ analyses). Both models require interoperability between data platforms across national boundaries. DPUK supports the development of interoperability in several ways. In partnership with UKSeRP, we provide access to our software solutions to other data platforms to facilitate architectural compatibility. We also engage with other data platforms to develop interoperability across architectures. In collaboration with Gates Ventures and other data platforms, we are working to develop a high-level data-gateway for cross-platform data access.
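The distinction between pooled and federated analyses can be made concrete with a toy example. The cohort names and values below are invented; the sketch uses a mean as the summary statistic to show that a federated analysis combines only per-cohort summaries, while a pooled analysis combines individual participant records.

```python
# Toy data: per-cohort individual-level values (illustrative only).
cohorts = {
    "cohort_a": [62.0, 71.5, 58.2],
    "cohort_b": [66.1, 69.3],
}

# Pooled analysis: individual participant data are integrated across
# datasets, then analysed centrally.
pooled = [v for values in cohorts.values() for v in values]
pooled_mean = sum(pooled) / len(pooled)

# Federated analysis: each cohort releases only summary data (sum, n);
# the summaries are integrated without individual records crossing
# platform or national boundaries.
summaries = [(sum(v), len(v)) for v in cohorts.values()]
total, n = map(sum, zip(*summaries))
federated_mean = total / n

# For a linear statistic such as the mean, the two models agree.
assert abs(pooled_mean - federated_mean) < 1e-9
```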
Layer 4: Data discovery
A tiered data discovery pathway begins with the Cohort Matrix [11] which provides a high-level comparison of data availability for each cohort. The Cohort Directory [12] enables detailed exploration across cohorts using a range of metadata categories. The Cohort Explorer [13] provides access to a set of 30 variables harmonised across cohorts to enable feasibility analysis. Adding cohorts to the Cohort Explorer is ongoing. As the Cohort Explorer provides access to harmonised data, login to the Data Portal is required. Data discovery is a dynamic area and the development of increasingly ergonomic and intuitive tools is anticipated. This is likely to include the identification of high priority variable sets and variable selection algorithms through the analysis of user-activity and the development of application programming interfaces (APIs) to rapidly reproduce data selection (see Layer 7). DPUK welcomes user feedback and collaboration to inform an ongoing programme of discovery tool development.
Layer 5: Data access brokerage
Automated third-party brokerage, allowing submission of a single data access request to multiple data controllers, reduces the administrative burden on researchers. Standard cross-cohort access agreements and streamlined decision-making procedures simplify the application and approvals process for all stakeholders. Access requests can be specified at the level of detail required by the data controller. The application form is a synthesis of the key issues addressed by most, if not all, individual cohort access management procedures. Fail-fast criteria have been developed to facilitate rapid and transparent decision-making by data controllers; these comprise the proposal not being in the public interest, potential identifiability of participants, no clear scientific rationale, no appropriate analysis plan, or a conflict of scientific interest. Upon approval by a cohort data access committee (DAC), the applicant must complete a standard data access agreement before access is granted. Upon receipt of a completed agreement, access is granted within several days for any one cohort [14]. Data access is normally free at the point of use, with a small number of cohorts levying a data access fee (Table 1).
Table 1 Collaborating cohorts
Layer 6: Data analysis
The analysis environment is VMware based, with each researcher allocated a private personal ‘lab’ (desktop) into which approved data are moved, and from which they can access a range of pre-loaded generic and specialist software packages. Bespoke software may also be uploaded upon approval. VMware clients are available for a range of operating systems including Windows, Linux, macOS, iOS and Android. A standard desktop is provided for researchers seeking to access standard phenotypic data on the Portal, with the following specification: Windows 7/10, 8 GB RAM and 4 CPUs. It is pre-loaded with R, RStudio, SPSS, SAS, Stata, Python, Eclipse, MATLAB, SQL Server Management Studio and Microsoft Office. Statistical software such as R and Python can connect to their official package repositories to enable configuration of software on a per-user basis. Larger desktops (32 GB RAM with 8 CPUs, or 128 GB RAM with 16 CPUs) are available on request and are more suitable for large-scale multi-modal analyses such as omics and imaging analyses, and for machine learning.
Storage is scalable according to study requirements, with basic access to data stored on our systems free to all users. Studies needing unusually large amounts of storage to ingress their own at-scale data may incur charges, which will be discussed with researchers as part of desktop set-up. High Performance Computing and non-Windows operating systems are available upon request. The Data Portal also offers consortia-based workspaces, providing an independent and transparent storage, access and analysis solution for use by multiple institutions.
Two-factor authentication is required to access approved datasets: a username and password, plus an authentication code generated by an app on a mobile device of the applicant’s choosing. Data may not be removed from the Data Portal; tables, graphs and scripts intended for export must be submitted to the data export panel for approval. Manuscripts may be prepared within the Data Portal so that collaborators who are registered users can contribute without the need for manuscript download. An import facility is also available, enabling researchers to upload scripts and additional datasets from outside the Data Portal to reside within their approved DPUK datasets.
Layer 7: Knowledge environment
Cohort data continue to grow in quantity and complexity, but ultimately there is a need to move from data to knowledge and insight. In response to this challenge, the Data Portal is developing a Knowledge Hub to organise, and make readily accessible, information on the key activities of the Data Portal. The immediate purpose of the Knowledge Hub is to enable collaborative knowledge building, helping researchers understand what is available, what is known, and where there are knowledge gaps. It also serves as a knowledge preservation repository, enabling the storage of linked datasets and analysis code with persistent identifiers for rapid access and replication. DPUK welcomes collaboration in the development of this hub.
Application for access can be made through the Data Portal: https://portal.dementiasplatform.uk/