Introduction

Preclinical imaging deals with the visualization of small animals, such as mice and rats, for research purposes, in order to assay biological structures and activities in vivo, thus providing quantifiable, spatial and temporal information on healthy and pathological tissues down to both cellular and molecular level [1]. Importantly, because of its non-invasiveness, imaging is suitable for longitudinal studies of animal models in the fields of diagnostics, epidemiology and drug development [2].

Imaging research generates large amounts of data and information, mostly produced by computers and laboratory instruments. Consequently, it is of utmost importance to provide appropriate data storage to ensure adequate management and organization to the research labs. Data loss not only implies losing processed images and the know-how gained, but also a waste of time and resources. Data sharing among research groups is another critical factor, particularly in scientific collaborations that involve many partners. In the last decade, several image repositories have emerged enabling the discovery of datasets from peer-reviewed publications or research studies in the life science domain, from biological imaging such as electron/fluorescence microscopy and high content screening, to biomedical imaging, such as magnetic resonance imaging (MRI), positron electron tomography (PET), computed tomography (CT) and ultrasound (US) [3,4,5]. These resources can now be retrieved online by users through a browser user interface or programmatically by using an application programming interface (API).

For human population studies on biomedical imaging, large data repositories are routinely used by researchers and physicians all over the world. Among them, the Human Connectome Project [6] is a compilation of neural data, The Cancer Imaging Archive (TCIA) is a broad collection of cancer image data [7], the Alzheimer’s Disease Neuroimaging Initiative (ADNI) is a shared catalogue of image data related to Alzheimer’s disease [8] and the Open Access Series of Imaging Studies (OASIS) is a repository of magnetic resonance images [9].

The processing of large volumes of biomedical image datasets requires dedicated platforms and proper infrastructures. The Longitudinal Online Research and Imaging System (LORIS) is a flexible online system for data management devoted to multicenter studies that covers all the aspects from data acquisition from multiple sources to storage, processing and dissemination [10]. Other examples of extensible data management platforms are The Human Imaging Database (HID) and the Collaborative Informatics and Neuroimaging Suite (COINS) for clinical neuroimaging studies [11, 12]. To tackle the needs of heterogeneous studies, the Medical Imaging Research Management and Associated Information Database (MIRMAID), a web-accessible content management system for medical images, was proposed [13]. More recently, Anastasopoulos et al. introduced Nora Imaging, an in-house web platform, to process medical imaging data derived from brain imaging studies [14].

The Extensible Neuroimaging Archive Toolkit (XNAT) is an imaging informatics system designed by the Neuroinformatics Research Group at the Washington University to manage images from several sources, to save data in a safe database and to share data among authorized users [15, 16]. It was originally conceived to deal with data management of neuroimaging laboratories, but its increasing success has promoted its use in many other medical imaging fields. Over the years, XNAT has become a large project sustained by active user and developer communities. Indeed, many academic institutions and hospitals build their own data management and process infrastructure upon the XNAT system with the aim to extend its core features and provide new functionalities to meet the needs of researchers and physicians. An XNAT-based framework for managing large-scale clinical studies was developed by Doran et al. along with in-house applications for data selection and uploading [17]. GIFT-Cloud is a medical image sharing platform built on top of XNAT 1.6 dedicated to GIFT-Surg, an international scientific project that develops novel imaging techniques for prenatal surgery [18]. Moreover, XNAT has been customized for automated quality assessment of retinal optical coherence tomography (OCT) [19] and for collaborative research in human sleep medicine [20]. XNAT has also been used to promote multicenter reproducibility studies for radiomics [21]. The European Population Imaging Infrastructure (EPI2) provides an XNAT-based environment for the implementation of large, prospective epidemiological imaging studies, allowing for permanent and/or temporary storage of medical images. Furthermore, EPI2 develops state-of-the-art image analysis pipelines for high-volume image processing [22, 23]. Lastly, Health-RI offers an XNAT-based service for research projects related to archive, view and process clinical imaging data [24, 25].

Animal population imaging allows researchers to test novel therapies and drugs on small animal models that will be eventually translated into the clinic. On the other hand, it suffers from the lack of standard tools to store, process and share imaging data produced by scientists. Kain et al. have recently proposed the Small Animal Shanoir (SAS), an expansion of the Shanoir platform that was developed for data management dedicated to human neuroimaging repositories [26, 27]. SAS offers a cloud-based solution for exchanging data and analysis tools for small animal imaging studies in the framework of France Life Imaging (FLI).

Small animal imaging facilities are highly specialized centers that provide the research community, both academic institutions and industrial enterprises, with access to cutting-edge imaging technologies. Therefore, animal imaging centers have to deal with the complexity and the variety of preclinical trial datasets. The time needed to perform these studies and the relative costs are prompting imaging scientists to share image data in public repositories. Sharing data from independent research investigations, beyond saving resources, ensures the reproducibility of the experiments, aids the scientific community to get noticed and boosts collaborations among research institutions. The main difficulties arise from the complexity of the analysis and the scarcity of standard applications for sharing and processing preclinical images.

Vendors of preclinical imaging machinery use proprietary formats to store the acquired images. This makes data handling difficult for users with limited experience in data management and image processing. These drawbacks arise significant questions for multimodal image investigations especially in recording, curation and processing of imaging data. Commercial softwares distributed by imaging device manufacturers provide the possibility to export proprietary raw image datasets to internationally recognized data standards, being Digital Imaging and COmmunications in Medicine (DICOM) [28] and Neuroimaging Informatics Technology Initiative (NIfTI) [29] the most popular ones. Notably, the storage of the parameters related to the image acquisition is poorly supported by NIfTI, as opposed to DICOM [30]. Despite DICOM being the internationally acknowledged standard for the interchange and management of medical images, some sections still require coherent semantics to guarantee consistent sharing of image data across several third-party products. In addition, the DICOM standard needs to be updated and expanded to support novel applications and incorporate emerging imaging technologies. In particular, at preclinical level new techniques have been developed such as chemical exchange saturation transfer (CEST) MRI, photoacoustic imaging (PAI) and optical imaging (OI), so that standardization as well as ad hoc DICOM attributes are desirable.

This work aims to develop an XNAT-based platform that meets the needs of the preclinical imaging community. This system is named XNAT-PIC that stands for XNAT for Preclinical Imaging Centers. Our goal is two-fold: we intend to provide the preclinical imaging researchers with tools to easily extract, import and archive biomedical image data and with reusable processes for high-throughput extraction of quantitative features from raw image data, all within an XNAT environment. All the developments are free and open-source. We believe that XNAT-PIC will allow a streamlined exchange and reuse of image data among preclinical imaging facilities.

Materials and Methods

The Molecular Imaging Center (CIM) hosted at the Department of Molecular Biotechnology and Health Sciences of the University of Torino has deployed an XNAT instance devoted to preclinical imaging available at http://cim-xnat.unito.it. XNAT is a free and open-source Java-based web application, exploiting the PostgreSQL database system. Users can personalize an instance and broaden its basic features to support their data and project management needs. The deployment presented in this paper is running on XNAT 1.7.6, using Apache Tomcat 7, Oracle JDK 8, PostgreSQL 11 and Ubuntu 20.04 LTS Operative System.

XNAT natively supports multiple imaging modalities, such as MRI, PET, CT and US with the possibility to extend XNAT datatypes to other imaging methods. XNAT comes with a built-in DICOM image management application that also allows to store images in NIfTI format. It offers key functionalities, for instance importing and downloading images in multiple formats, archiving, distributing data and setting data protection and accessibility. In XNAT, users can either store the raw or processed images on local disks or transfer them through the network to a DICOM C-STORE Service Class Provider (SCP), for instance a picture archive and communications systems (PACS) or another XNAT deployment. Moreover, XNAT comes with a Java-based viewer to access and inspect the images in the archive. This viewer can be customized with plugins to integrate supplementary functionalities specific to the image type of interest.

XNAT for Preclinical Imaging Centers (XNAT-PIC) has been developed to meet the needs of the preclinical imaging community. In particular, XNAT-PIC consists of a suite of tools aimed at converting raw MR image series to DICOM standard, uploading to XNAT and processing large image datasets. This workflow is based on the steps schematically depicted in Fig. 1:

Fig. 1
figure 1

Schematic workflow of image archiving and processing. XNAT-PIC is a suite of tools aimed at facilitating the management and the analysis of preclinical image datasets

  1. 1.

    MRI2DICOM, a MR image converter from ParaVision® (Bruker, Inc. Billerica, MA) file format to DICOM standard.

  2. 2.

    XNAT-PIC Uploader to upload large, multimodal image datasets in DICOM standard to XNAT.

  3. 3.

    XNAT-PIC Pipelines for processing single or multiple subjects in an XNAT project.

MRI2DICOM is a free and open-source tool built in Python 3.7.6 downloadable at https://github.com/szullino/XNAT-PIC [31]. The application uses the numpy 1.15.4 and Pydicom 1.2.1 libraries to handle DICOM files [32,33,34,35]. MRI2DICOM has been designed for and tested on several, different versions of ParaVision®, such as 5.1, 6.0.1 and 360.1.1. It accesses the ParaVision® data structure, deciphers the binary file (2dseq) containing the image and parses the acquisition parameters stored in different files (visupars, method, reco, and acqp) into Python dictionaries [36]. Lastly, it saves all the relevant information into the DICOM header and image set field, according to the MRI acquisition protocol.

XNAT-PIC Uploader is built in Python 3.7.6. The communication with XNAT is possible through xnatpy 0.3.22, a new and open-source XNAT Python client, and pyAesCrypt 0.4.3, a Python library to encrypt the files containing the XNAT login credentials [37, 38]. PyInstaller 3.5 has been used to bundle the Python applications and all its dependencies into a single package to run MRI2DICOM and XNAT-PIC Uploader as a stand-alone executable, in both Windows and Linux environments [39].

XNAT-PIC Pipelines have been designed to process DICOM images stored in XNAT, extract quantitative information and produce parametric maps. These pipelines have been built on top of pre-existing image analysis scripts developed in MATLAB R2020b (The MathWorks, Inc., US) by our group. XNAT-PIC Pipelines can be of two types: subject level or project level pipeline. Subject level pipelines consist of a bash script that invokes MATLAB to perform the computation. A different approach has been used for project level pipelines, i.e. to process multiple subjects within the same project. A Python 2.7 virtual environment has been created and the following packages have been installed: (i) the MATLAB Engine API for Python to run MATLAB scripts within a Python session [40], (ii) pyxnat-1.2.1.0.post3 that facilitates scripting interactions with the XNAT database [41, 42], (iii) Requests 2.23.0 that makes Hypertext Transfer Protocol (HTTP) requests to communicate with XNAT extremely simple [43]. Finally, the Mask Average Pipeline has been developed to calculate the mean value in a region of interest (ROI) of a parametric map generated from processing. This pipeline runs on Python 3.8.3 and uses the following libraries: numpy 1.18.5 [44] dcmrtstruct2nii 1.0.19 to export DICOM Radiotherapy Structure Set (RTSTRUCT) to NIfTI mask [45], the image processing library opencv-python 4.4.0.40 [46], and nibabel 3.1.1 for manipulating NIfTI images [47].

Results

MRI2DICOM Converter

Upon launching, XNAT-PIC offers multiple functions as shown in Fig. 2A: users can use MRI2DICOM to convert the binary data to DICOM standard or, given the data already in DICOM, users can use XNAT-PIC Uploader to import the MR image sessions to XNAT. If MRI2DICOM has been selected by the user, the converter needs to know the directory of the project in ParaVision® format to start the conversion (Fig. 2B). Once the process is completed, a new folder containing the project in DICOM standard will be created at the same location as the original one in raw format.

Fig. 2
figure 2

Snapshots of the XNAT-PIC application. A Upon launch, users can choose between converting raw image data to DICOM standard or upload pre-existing DICOM images to XNAT. B If the converter has been selected, MRI2DICOM allows the user to browse to the directory of the project in ParaVision® format

Commercial software in preclinical imaging poorly conforms to standards regarding data storage and each vendor uses a proprietary format for its data. In order to deal with emerging MRI techniques, a novel set of vocabularies can be introduced as private DICOM attributes specifically devoted to these Information Object Definition (IOD) [48]. For example, new dictionary items have been dynamically added to the ‘standard’ DICOM dictionary to describe CEST-MRI datasets. The full list of DICOM attributes is available in DICOM Part 6: Data Dictionary [49]. To avoid conflicts of private attributes from different implementers, a block in the (0061, 0010) space has been reserved for this specific use. The complete description of Private Data Elements encoded in the DICOM standard can be found in DICOM Part 5: Data Structure and Encoding [50]. Table 1 shows the private DICOM dictionary that has been developed for CEST-MRI acquisitions along with the corresponding entry gathered from the method file of ParaVision® 360.1.1. The value multiplicity (VM) of an attribute defines the number of values contained in that attribute, while the value representation (VR) describes the data type and format of each DICOM attribute.

Table 1 Private DICOM attributes for CEST-MRI modality gathered from the ParaVision® method file. To avoid conflicts of private tags from different vendors, a block in the (0061, 0010) space has been reserved for this specific imaging IOD. VM, value multiplicity; VR, value representation; LO, long string; DS, decimal string

Preclinical investigations may cover a wide variability of studies addressing specific scientific questions. Therefore, the data organization is usually tailored to the study of interest and needs to be matched to the XNAT data hierarchy. The capability of XNAT to manage different imaging data structures has then been extended, including longitudinal studies (several timepoints), treatments protocols (control and treated groups) and drug administrations (doses). The flexibility in managing several experimental conditions has been achieved by developing specific custom variables. Besides the default variables, XNAT users can specify an unlimited number of personalized variables. These custom variables can be reused within other projects, allowing for inter-project standardization. For longitudinal studies, three sets of custom variables have been created: the term ‘group’ refers to the treatment protocol and can have two possible values, ‘treated’ or ‘untreated’ (i.e., control group), ‘timepoint’ is related to the timing of the acquisitions (i.e. before (t0) and after (t1, t2) a treatment, or simply different timepoints) and ‘dose’ refers to a specific drug dosage in therapeutic treatment investigations (Fig. 3). Notably, XNAT-PIC users can adjust the custom variables accordingly to their data structure, provided that the custom variables are at most three and their naming conventions in XNAT are respected.

Fig. 3
figure 3

Schematic representation of the custom variables implemented in XNAT to match a typical data organization in preclinical longitudinal studies: ‘group’ refers to treated and untreated mice; ‘timepoints’ is related to the timing of the acquisition: t0 = pre, before treatment; t1 = post1w, 1 week after; t2 = post2w, 2 weeks after; ‘dose’ refers to amount of drug (dose1 and dose2, respectively) used in this specific experiment

XNAT-PIC Uploader

Upon conversion to DICOM format, the image dataset can be uploaded to XNAT. XNAT-PIC Uploader is designed to perform single subject uploading or batch uploading. It needs the XNAT webpage address and the login details (Fig. 4A); then, users can generate a new project or choose an existing one in the list (Fig. 4B), navigate to the directory of interest and enter the number of custom variables (Fig. 4C). A pop-up message notifies the user once the dataset is successfully imported to XNAT. The original raw images can be eventually uploaded to the database as project level resources.

Fig. 4
figure 4

Snapshots of XNAT-PIC Uploader. A Accessing XNAT requires the login details, such as the XNAT web address, username, and password; B users can create a new project or click the drop-down menu to select an existing project in the list; C custom variables can be entered by typing the number of variables in the 1 to 3 range that corresponds to number of folders in the project directory

Preclinical studies may imply the use of several imaging modalities that further increase the complexity of data management, since the same subject can typically undergo several examinations with different imaging instrumentation. For instance, studies that investigate tumor metabolism and acidosis may require the combined use of 18-fluorodeoxyglucose (FDG) PET technique for measuring tumor glucose uptake and CEST-MRI pH imaging for assessing tumor acidosis [48, 51]. The XNAT-PIC Uploader has been designed to upload image datasets of different modalities to XNAT such as MRI, PET, CT and US, allowing users to efficiently manage large, multimodal imaging studies.

The XNAT-PIC Uploader has been tested and validated on several studies, for instance including data gathered from CEST-MRI experiments, in particular GlucoCEST imaging for assessing tumor metabolism following glucose injection [52, 53]. Figure 5 shows a snapshot of the project and subject webpage in XNAT where the custom variables ‘group’ and ‘timepoints’ have been used to describe the GLINT project.

Fig. 5
figure 5

Snapshots of the project (panel A) and subject (panel B) webpage in XNAT with multiple image sessions (MR, CT and PET, respectively). The custom variable ‘group’ describes the patient status (treated or untreated) and is shown at each level of the XNAT data hierarchy, while the custom variable ‘timepoint’ referring to the timing of the treatment administration is displayed at the subject level (panel B) in the Label field

XNAT-PIC Pipelines

One core activity in XNAT is the execution of pipelines. A pipeline is defined as a sequence of steps. Each step of the pipeline invokes an external application, a shell script or an executable for processing the data. The pipeline is executed by the XNAT Pipeline Engine, a Java-based framework efficiently integrated within the XNAT platform [15]. The pipeline engine works with simple data flows on a step-by-step basis and can perform computational tasks on project data. The workflow is defined in an eXtensible Markup Language (XML) document named pipeline descriptor and the executables are defined in another XML document named resource descriptors.

Pipelines need to be enabled through the Pipelines tab in the project webpage. By clicking the Add More Pipelines button, the Pipelines tab will display a table of pipelines currently installed in XNAT. Users can add and configure a pipeline to be registered for the project.

XNAT is currently designed to run pipelines at subject level only. This is not feasible if one needs to process large-scale image datasets. The urgency was therefore to scale up this approach and create pipelines to process an entire project. Our aim was to ensure the same user experience as originally provided by XNAT when launching a pipeline for all the subjects in a project. The main idea was to build a project level pipeline that can be initiated from any subject within the selected project and iterated over all the subjects in that project. MATLAB R2020b scripts previously developed by our group have been used to build pipelines in XNAT and process a variety of MRI acquisitions, such as T1/T2 mapping and diffusion-weighted imaging (DWI). Two approaches have been designed and tested to process these MR images at subject and project level, respectively.

Subject Level Pipeline

An XNAT pipeline can be launched via the Run Pipeline tool in the Actions Menu in the MR Session webpage. A pop-up window lists all the pipelines available for that project. This list contains both the pipeline processing only the current subject and the pipeline processing all the subjects in that project. Figure 6 shows a screenshot of the pop-up windows where users can select between two pipelines: process_T2 processes the T2 map of the current subject, while process_T2_project processes the T2 maps of all the subjects in the same project. Upon selection of the subject level pipeline, users are then asked via web browser to tick the scan number to be analyzed. The pipeline descriptor contains XML instructions to take this user input, create the working directory, download the DICOM files corresponding to the scan of choice, upload the results of the analysis back to XNAT in a dedicated resource folder corresponding to the scan of interest and call the resource descriptor. The resource descriptor invokes a bash script that passes the DICOM files directory to MATLAB running in background to process those images.

Fig. 6
figure 6

Screenshot of the pop-up windows with the list of pipelines available for this project. Users can select between two pipelines: process_T2 processes the T2 map of the current subject, while process_T2_project processes the T2 maps of all the subjects in the same project

Project Level Pipeline

Users may need to process all the subjects of a project in a single run. Therefore, they can select a project level pipeline from the Run Pipeline pop-up menu.

In this case, the pipeline descriptor contains XML instructions to create the working directories needed for the processing and call the resource descriptor. The resource descriptor invokes a bash script running a Python wrapper consisting of a sequence of Python scripts in order to:

  1. 1.

    Search for the images within the project that have a specific DICOM attribute describing the acquisition protocol of the scans to be processed.

  2. 2.

    For each subject, create a local folder and then download the corresponding DICOM images.

  3. 3.

    Loop over the subjects and locally execute the MATLAB scripts.

  4. 4.

    Upload the results back to XNAT in a resource folder created by the pipeline at subject level.

In order to run XNAT-PIC pipelines, two scenarios are possible: (i) generic users can register to our CIM-XNAT instance, upload their image datasets and add the pipelines to their own projects (see: Adding Pipelines To Your Project: https://wiki.xnat.org/documentation/how-to-use-xnat/adding-pipelines-to-your-project); (ii) XNAT Admins can download XNAT-PIC pipelines from https://github.com/szullino/XNAT-PIC-Pipelines, install and register the pipelines in their own XNAT instance (see: Installing Pipelines in XNAT: https://wiki.xnat.org/documentation/xnat-administration/configuring-the-pipeline-engine/installing-pipelines-in-xnat).

Table 2 lists the processing pipelines currently installed on our XNAT deployment and available for download.

Table 2 Processing pipelines currently installed on our CIM-XNAT https://doi.org/10.1007/s10278-022-00612-z instance (http://cim-xnat.unito.it) and available for download at https://github.com/szullino/XNAT-PIC-Pipelines

In Fig. 7, the workflow for processing DWI acquisitions is schematically presented. Post-processed image data and other files are uploaded back to XNAT in the resource folder created at subject level or scan level, according to the full project or standard subject pipeline respectively, and accessible by the user through a Manage Files console. The resource folder can contain several subfolders, each of them populated with several files, such as parametric images in several file formats (i.e. NIfTI), MATLAB workspace and log files.

Fig. 7
figure 7

Schematic workflow of the process_DWI pipeline which retrieves, downloads, and processes the MRI diffusion scan. The output files (text file, log file, NIfTI images and MATLAB workspace) are then uploaded back to XNAT under the corresponding subject, experiment, and scan. This workflow is iterated when the project level process_DWI_project pipeline is launched to process all the MRI diffusion scans contained in the project; then, the resource folder is created at subject level

Mask Average Pipeline

Following the processing, preclinical imaging users are usually interested in performing some simple statistics on the processed images. We have therefore developed a pipeline that applies a mask to a parametric map and compute the mean value in the ROI. Therefore, the XNAT OHIF Viewer 2.0 Plugin has been installed on our XNAT instance [54]. The plugin is an integration of the Open Health Imaging Foundation (OHIF) viewer into XNAT [55]. Users can create contour-based, such as DICOM RTSTRUCT and Annotation Image Markup (AIM) as well as mask-based DICOM Segmentation (DICOM SEG) ROI Collections, and import/export them to a ROI Collection Assessors in XNAT (Fig. 8). Once the ROI has been drawn on the T2-weighted image and stored to XNAT, the Mask_Average pipeline can be launched from the Run Pipeline tool. The pipeline descriptor contains XML instructions to take the scans corresponding both to the T2-weighted acquisition and the parametric map in the resource folder, download the morphological image in DICOM standard and the parametric image in NIfTI format corresponding to the user selection, download the ROI Collection Assessors, create the working directory, upload the results of the analysis back to XNAT in a dedicated resource folder corresponding to the scan of interest and call the resource descriptor. The resource descriptor invokes a bash script passing both the T2-weighted DICOM image and the DICOM RT-STRUCT directories to a Python code to be converted into a NIfTI mask by the Python package dcmrtstruct2nii [56]. The same bash script passes the resulting mask to a Python script that computes statistical calculations in the ROI and uploads the results back to the database as XNAT resources (Fig. 9). A typical output of the Mask_Average pipeline is shown in Fig. 10. The XNAT-OHIF plugin is used to draw the ROIs corresponding to the mouse kidneys on the T2-weighted l image (Fig. 10A). The full T2 map obtained from the process_T2 pipeline along with the relative T2 map after masking are shown in Fig. 10B, C, respectively. A text file with the result of the calculation is depicted in Fig. 10D.

Fig. 8
figure 8

Snapshot of the ROI Collection Assessor webpage in XNAT. The mask represented here is composed by two ROIs, tumor1 and tumor2 respectively, and is saved in the DICOM RT-STRUCT file that needs to be converted in NIfTI format before usage

Fig. 9
figure 9

Schematic workflow of the Mask_Average pipeline that applies a mask to a parametric map in NIfTI format, computes the mean value in the ROI and uploads the masked parametric map and other output files back to XNAT

Fig. 10
figure 10

Outputs generated from the Mask_Average pipeline. A Reference anatomic T2-weighted axial image showing kidneys and corresponding ROIs. B T2 map obtained from process_T2 pipeline. C) The mask is applied to the obtained T2 map showing only the kidneys. D) Output text file reporting statistics for each ROI

Discussion

Preclinical imaging facilities are usually equipped with a variety of imaging devices, each of them yielding large amounts of data usually stored in several workstations. Storing and sharing preclinical data in a safe, fast and reliable manner is therefore imperative. Furthermore, image analysis is of utmost importance in preclinical applications to unravel the physiological mechanisms of the disease process or investigate the response to a new therapeutic treatment. Despite the urgent needs, few efforts have been done so far to manage preclinical imaging studies and ensure reproducibility through highly standardized image processing methods.

In this work, we have developed XNAT-PIC, a free and open-source application consisting of a MR image converter from proprietary file format to DICOM standard, an uploader to import large and multi-modal imaging studies to XNAT and a catalogue of pipelines for processing preclinical image datasets. MRI2DICOM is a MR image converter from ParaVision® file format to DICOM that takes into account emerging MRI methods such as CEST imaging. A controlled vocabulary has been created for the first time to describe CEST-MRI acquisitions. The entries are extracted from the raw image files and used to feed a new, private DICOM dictionary specifically created for this IOD. These DICOM attributes can be then easily accessed and reused by external applications for post-processing. XNAT-PIC Uploader has been designed to overcome an XNAT limitation regarding data upload. There are several options for uploading image sessions in XNAT: the XNAT Desktop Client and the XNAT Upload Assistant are stand-alone applications that can be installed for uploading and downloading data, while the XNAT Compressed Uploader is a tool that runs in the XNAT browser and does not require installation. Other options are the DICOM C-STORE Service Class Provider that can send data directly to the XNAT server and the representational state transfer (REST) API import service, the latter not applicable by non-expert users. To our knowledge, none of these methods is capable of uploading multiple subjects of different imaging modalities. The XNAT-PIC Uploader offers the possibility to upload several subjects screened with different modalities (i.e. MRI, PET, CT and US) related to the same project in a single run through a user-friendly graphical interface. The imaging modalities that can be uploaded using the XNAT-PIC Uploader are the ones currently supported by XNAT, such as MRI, CT, PET and US, for which the DICOM standard fully applies. The use of DICOM for preclinical research applications has been inhibited by the complexity of the standard, the limited awareness of its potential, and the demanding resources for its maintenance. Future work will be needed to include new data types into XNAT related to emerging imaging technologies that are popular in preclinical research, such as PAI and OI. This effort needs to resonate with the development of the DICOM standard and its adoption by the imaging device manufacturers. Indeed, some vendors currently use the US IOD to store PAI objects, while OI is still referred to other (OT). Nevertheless, OI in DICOM falls into the Visible Light IOD as defined in DICOM Part 3: Information Object Definitions, section A.32 ‘Visible Light Information Object Definitions’ [57]. The DICOM Standard Committee has established more than thirty Working Groups to expand the Standard for a particular imaging modality, clinical domain or technical field. Importantly, DICOM Working Group 34 ‘Photoacoustic’ is extending the DICOM standard to support PAI [57]. Similarly, the DICOM Working Group 30 “Small Animal Imaging” aims at extending and promoting the use of DICOM in preclinical imaging research [59]. At present, the list of DICOM IODs is available in DICOM Part 3: Information Object Definitions, Section C.7.3.1.1.1 ‘Modality’ [58]. Lastly, XNAT-PIC includes a collection of processing methods routinely used in preclinical imaging such as T1 and T2 mapping and DWI, with the aim to extend the offer of XNAT pipelines to other MRI techniques, firstly the emerging CEST-MRI imaging. XNAT-PIC Pipelines are built on top of the XNAT Pipeline Engine which reads an XML document and launches a sequence of steps defined in the XML document. The drawback of this approach is that all the executables and their dependencies have to be pre-installed on the local XNAT instance where the pipeline will be executed. This process requires the intervention of a system administrator. The XNAT Container Service Plugin was released in 2017 and leverages Docker. The operating system with all the executables needs to be packaged and the resulting image can be registered as container to the XNAT site. Unlike an XML pipeline, the Container Service requires a command json file with instruction on which container to use, what is the input and how to handle the output. The advantage of containerization technology relies on the flexibility to distribute and manage such kind of workflow. Since XNAT Container Service Plugin is becoming popular, containerized versions of the XNAT-PIC Pipelines will be developed in the future by our group. Likewise, our intent is to improve the reproducibility of preclinical research and guarantee standardized image processing procedures by offering a free service to integrate custom-built analysis pipeline in the XNAT-PIC workflow upon user request.

The Small Animal Shanoir is another preclinical research solution developed to manage and process imaging datasets. It offers a platform to store and distribute the data, manage the metadata associated to the study and process the images on high performance systems. In addition, the possibility to integrate custom-made processing pipelines and algorithms into SAS provides the reproducibility and accessibility of the research outputs. However, these services are subject to fees for both storage and processing. Interestingly, SAS architecture is based on micro-services, allowing SAS to function as a combination of independent structures that can be easily updated with new features or scaled up. Current, available micro-services in SAS comprise Dicomifier, a generic and open-source Bruker to DICOM or NIfTI converter, and applications to extract T1 and T2 relaxation times from MR image data. Recently, other commercial data management systems have been also introduced in preclinical imaging. The Small Animal Big-data warehouse Environment for Research (SABER) supports preclinical workflow and promotes data sharing, although this platform is unavailable on the internet. Flywheel is a commercial data management system to store research data of different imaging modalities in a centralized archive, thus improving productivity and collaboration in life science research [60, 61].

The urgent need to develop the infrastructure and services to support sharing and reusing of scholarly data motivated the establishment of the Findable, Accessible, Interoperable and Reusable (FAIR) Data Principle [62]. These rules are necessary to govern the scientific data management and stewardship and are applicable to several entities, such as industry, academia, scientific publishers and funding agencies. The principles may serve as guidance for stakeholders willing to strengthen their data reusability. Unlike similar initiatives committed to human scholarly work, the FAIR Principles are intended to make the data discoverable and readable by both individuals and machines, therefore supporting their reuse [62, 63]. XNAT-PIC was developed in the framework of the demonstrator projects under the auspices of EOSC-Life, a European Union’s Horizon 2020 research and innovation programme. XNAT-PIC is supported by the Multi-Modal Molecular Imaging Italian Node (MMMI) of Euro-BioImaging ERIC, a research infrastructure that offers open access to the most advanced imaging technologies, training and data services in biological and biomedical imaging [64]. In this scenario, XNAT-PIC acts as a first step towards data FAIRification in terms of Accessibility and Reusability. Generally, the data stored in an XNAT-based system is accessible upon authentication and authorization procedure: only trusted users have rights to access, manipulate and work with images. Data, tools and processing pipelines developed in XNAT-PIC are fully reusable as they are distributed under GNU General Public License v3 or any later version and available on GitHub. Some work needs still to be done regarding Findability and Interoperability. To date, the possibility to associate the dataset stored in XNAT with a digital object identifier (DOI) or Persistent Identifier is missing, preventing the data to be found by both humans and computers. In addition, machine-readable metadata that are necessary for data discovery are still needed.

The European Open Science Cloud (EOSC) is an EU-funded project based on FAIR principles whose goal is to provide a public data repository compliant to open science principles. EOSC aims at providing ‘all researchers in Europe with seamless access to an open-by-default, efficient and cross-disciplinary environment for storing, accessing, reusing, and processing research data supported by FAIR data principles’ as stated in The Vienna Declaration on the European Open Science Cloud [65]. Therefore, all the research materials that relate to scholarly data must be turned into FAIR. This includes the raw material, such as imaging dataset, as well as the tools, workflows and pipelines needed to process the data, allowing to extract and quantify the information. The FAIR revolution also involves the standards, metadata, and ontologies that are necessary to provide significance to both the data itself and any complementary material.

The portfolio of tools devoted to preclinical imaging in XNAT-PIC will be expanded by developing open-source converters for proprietary MR image data formats, such as Aspect Imaging, to DICOM standard and novel MR image processing scripts. Our future plan is to deploy a federated XNAT portal to collect preclinical imaging data from local XNAT installations and make them available to a broader community. The imaging community will largely benefit from this free, cloudified service, since it will enable users to discover image datasets normally not accessible, promoting the free exchange and reuse of data and ensuring higher standards of reproducibility of the experiments.

Conclusion

While the basic XNAT deployment serves as a system for safely accessing, archiving, and processing clinical imaging studies, XNAT-PIC widens its core features in several ways to support preclinical imaging facilities. Our aim is to overcome the current limitations that arise from the management and the storage of preclinical imagery, thereby facilitating the analysis of biomedical image data.

The advantage of this approach relies in the capability to interface with several imaging modalities, including emerging imaging techniques such as CEST-MRI, manage different preclinical investigation protocols and easily process preclinical image data. We believe that such a workflow may be of interest for preclinical imaging centers, thus allowing the scientific community to efficiently store, process and share biomedical imaging data.