Deep learning segmentation of the choroid plexus from structural magnetic resonance imaging (MRI): validation and normative ranges across the adult lifespan

Eisma, Jarrod J.; McKnight, Colin D.; Hett, Kilian; Elenberger, Jason; Han, Caleb J.; Song, Alexander K.; Considine, Ciaran; Claassen, Daniel O.; Donahue, Manus J.

doi:10.1186/s12987-024-00525-9

Research
Open access
Published: 29 February 2024

Deep learning segmentation of the choroid plexus from structural magnetic resonance imaging (MRI): validation and normative ranges across the adult lifespan

Jarrod J. Eisma¹,
Colin D. McKnight²,
Kilian Hett¹,
Jason Elenberger¹,
Caleb J. Han¹,
Alexander K. Song¹,
Ciaran Considine¹,
Daniel O. Claassen¹ &
…
Manus J. Donahue^1,3,4

Fluids and Barriers of the CNS volume 21, Article number: 21 (2024) Cite this article

1266 Accesses
1 Citations
2 Altmetric
Metrics details

Abstract

Background

The choroid plexus functions as the blood-cerebrospinal fluid (CSF) barrier, plays an important role in CSF production and circulation, and has gained increased attention in light of the recent elucidation of CSF circulation dysfunction in neurodegenerative conditions. However, methods for routinely quantifying choroid plexus volume are suboptimal and require technical improvements and validation. Here, we propose three deep learning models that can segment the choroid plexus from commonly-acquired anatomical MRI data and report performance metrics and changes across the adult lifespan.

Methods

Fully convolutional neural networks were trained from 3D T₁-weighted, 3D T₂-weighted, and 2D T₂-weighted FLAIR MRI using gold-standard manual segmentations in control and neurodegenerative participants across the lifespan (n = 50; age = 21–85 years). Dice coefficients, 95% Hausdorff distances, and area-under-curve (AUCs) were calculated for each model and compared to segmentations from FreeSurfer using two-tailed Wilcoxon tests (significance criteria: p < 0.05 after false discovery rate multiple comparisons correction). Metrics were regressed against lateral ventricular volume using generalized linear models to assess model performance for varying levels of atrophy. Finally, models were applied to an expanded cohort of adult controls (n = 98; age = 21–89 years) to provide an exemplar of choroid plexus volumetry values across the lifespan.

Results

Deep learning results yielded Dice coefficient = 0.72, Hausdorff distance = 1.97 mm, AUC = 0.87 for T₁-weighted MRI, Dice coefficient = 0.72, Hausdorff distance = 2.22 mm, AUC = 0.87 for T₂-weighted MRI, and Dice coefficient = 0.74, Hausdorff distance = 1.69 mm, AUC = 0.87 for T₂-weighted FLAIR MRI; values did not differ significantly between MRI sequences and were statistically improved compared to current commercially-available algorithms (p < 0.001). The intraclass coefficients were 0.95, 0.95, and 0.96 between T₁-weighted and T₂-weighted FLAIR, T₁-weighted and T₂-weighted, and T₂-weighted and T₂-weighted FLAIR models, respectively. Mean lateral ventricle choroid plexus volume across all participants was 3.20 ± 1.4 cm³; a significant, positive relationship (R² = 0.54-0.60) was observed between participant age and choroid plexus volume for all MRI sequences (p < 0.001).

Conclusions

Findings support comparable performance in choroid plexus delineation between standard, clinically available, non-contrasted anatomical MRI sequences. The software embedding the evaluated models is freely available online and should provide a useful tool for the growing number of studies that desire to quantitatively evaluate choroid plexus structure and function (https://github.com/hettk/chp_seg).

Background

The choroid plexus consists of a collection of fenestrated capillaries and epithelial cells that filter blood plasma down an osmotic gradient to secrete cerebrospinal fluid (CSF) in each of the brain’s four ventricles, with the majority of the choroid plexus volume residing in the atria of the lateral ventricles. The choroid plexus is widely believed to be the primary source of CSF production in the brain, producing CSF at a rate of 430–530 mL/day [1], and the choroid plexus has gained additional recent attention owing to its role as one of the most proximal components of the brain’s waste clearance system [2].

The choroid plexus structure and function has been well characterized from animal and post-mortem studies, but how choroid plexus structure and function change in the context of disease [3,4,5] and aging [6, 7] in humans is an area of active and emerging interest. In the context of aging, it has been shown that choroid plexus volume increases, and perfusion decreases, with advanced age [7, 8]. Diffusion-weighted magnetic resonance imaging (MRI) has also revealed that choroid plexus mean diffusivity increases, and fractional anisotropy decreases, with advanced age [8]. Increasing choroid plexus volume may relate to increasing severity of cognitive impairment in the spectrum of Alzheimer’s disease related disorders [3] and in support of this possibility, reduced choroid plexus metabolism from ¹⁸F-Fluorodeoxyglucose positron emission tomography (PET) has been reported in patients with Alzheimer’s disease compared to patients with amnestic mild cognitive impairment and healthy controls [9]. Perfusion-weighted arterial spin labelling MRI has been utilized further to characterize choroid plexus response to various pharmacological stimuli [10], which may aid in evaluating novel therapeutic delivery pathways or mechanisms.

However, one limitation to the advancement of neuroimaging studies of the choroid plexus is the lack of an accurate, automatic tool to segment the structure from anatomical images. Manual segmentations, as with other tissues, are impractical in large cohort studies, and the choroid plexus has varying appearances on standard MRI sequences due to its heterogeneous relaxometry characteristics [8], making manual segmentations an even more difficult process. Alisch et al. found that both the T₁ and T₂ relaxation times of the choroid plexus increase with advancing age [8], which affects the contrast of the choroid plexus on standard MRI sequences. For instance, this finding might suggest that the choroid plexus is more visible on T₁-weighted and T₂-weighted images for younger people, since the relaxometry of the choroid plexus contrasts with the surrounding CSF more in younger people.

Many neuroimaging software packages do not include segmentation options for the choroid plexus, and those that do include choroid plexus segmentation tools have been reported to have suboptimal performance in many applications [11]. Fully convolutional neural networks (FCNN) have shown state-of-the-art performance for segmentation of other brain structures [12], and recent work has proposed deep learning-based methods to segment the choroid plexus [11, 13, 14]. These methods rely on 3D magnetization-prepared-rapid-gradient-echo (MPRAGE) T₁-weighted MRI to learn choroid plexus anatomical patterns, however, this approach may provide suboptimal contrast for choroid plexus visualization and quantification given limited contrast between hypointense CSF signal and normo-to-mildly hypointense choroid plexus signal. In addition to T₁-weighted MRI, T₂-weighted and T₂-weighted FLuid Attenuated Inversion Recovery (FLAIR) MRI also are commonly acquired in both clinical and research neuroimaging environments and may yield differing segmentation accuracy, although this possibility has not been investigated rigorously.

In this study, we aim to develop and evaluate automated tools for segmenting the choroid plexus from three types of commonly acquired MRI sequences: T₁-weighted, T₂-weighted, and T₂-weighted FLAIR; and to compare the results from these methods to gold-standard manual tracings and to commonly used neuroimaging analysis software, FreeSurfer [15, 16]. We also evaluate performance of these methods in an additional cohort of adult controls to report how the choroid plexus evolves across the adult lifespan, which will provide an exemplar for future clinical studies which may implicate the choroid plexus, such as Alzheimer’s disease, Parkinson’s disease, multiple sclerosis, and traumatic brain injury. The processing code is also made publicly available for free academic use.

Methods

Demographics

This study had two components. First, we developed and evaluated a deep learning algorithm using separate standard MRI sequences in a diverse cohort of adults (deliberately selected to span a range of ages and conditions) with the intent of providing a generalizable segmentation algorithm. Second, we applied the method to adult controls across the lifespan to provide an exemplar for how choroid plexus volume changes with age in a cross-sectional analysis.

Adult participants (n = 50 for model training; n = 98 for subsequent adult control lifespan analysis) provided informed, written consent in accordance with the Vanderbilt University Institutional Review Board (IRB) and the Declaration of Helsinki and its amendments. All participants were enrolled between February 2020 and July 2023. It is well-known that the brain atrophies with advancing age and in various neurological disorders, and with this atrophy comes ventricular enlargement and possibly choroid plexus hypertrophy [7, 8]. In order to make the proposed method as generalizable as possible, for algorithm training and development we deliberately enrolled a heterogeneous cohort of persons across the adult lifespan. Participants included controls and patients with mild cognitive impairment (MCI), Alzheimer’s disease, Parkinson’s disease, and Huntington’s disease. Inclusion criteria for control participants consisted of no history of cerebrovascular disease, anemia, psychosis, or neurological disorder including but not limited to prior overt stroke, sickle cell anemia, schizophrenia, bipolar disorder, Alzheimer’s disease, Parkinson’s disease, or multiple sclerosis. The presence of non-specific white matter lesions was not an exclusion criterion for controls, as these lesions become more prevalent with aging, and we sought our cohort to be generalizable and representative. Diagnosis of Alzheimer’s disease, mild cognitive impairment, Parkinson’s disease, or Huntington’s disease was made by a board-certified neurologist (DOC; experience = 15 years) using clinical criteria.

Image acquisition

All participants underwent non-contrasted MRI at 3 Tesla with body coil radiofrequency transmission and 32-channel SENSE phased-array reception on a Philips Ingenia system (Philips Healthcare, Best, The Netherlands). Anatomical images consisted of: (i) 3D T₁-weighted MPRAGE (TR = 8.1 ms; TE = 3.7 ms; field of view = 256 × 180 × 150 mm³; number of slices = 150; spatial resolution = 1.0 × 1.0 × 1.0 mm³; duration = 4 min 32 s), (ii) 2D T₂-weighted FLAIR turbo-spin-echo (TR = 11,000 ms; TE = 120 ms; TI = 2800 ms; field of view = 230 × 184 × 144 mm³; number of slices = 29; spatial resolution = 0.57 × 0.57 × 4.0 mm³; duration = 1 min 39 s), and (iii) 3D T₂-weighted turbo-spin-echo (TR = 2500 ms; TE = 331 ms; field of view = 250 × 250 × 189 mm³; number of slices = 242; spatial resolution = 0.78 × 0.78 × 0.78 mm³; duration = 4 min 8 s).

Manual segmentation of the choroid plexus

Data utilized for manual segmentation of the choroid plexus consisted of 3D T₁-weighted, 2D axial T₂-weighted FLAIR, and 3D T₂-weighted MRI from 50 participants. Ground truth choroid plexus segmentation was performed manually with final approval from a board-certified neuroradiologist (CDM; experience = 9 years). Additionally, for assessment of the inter-rater reliability of manual delineations of the choroid plexus, two additional raters manually segmented the choroid plexus following the same protocol as the primary rater in 10 participants from the machine learning training sample (see Supplementary Materials). In all cases, the manual delineation protocol was defined as follows: first, 2D axial T₂-weighted FLAIR and 3D T₂-weighted images were co-registered to 3D T₁-weighted images using linear registration tools from the Advanced Normalization Tools (ANTs) software package [17]. Next, the contrast from all three co-registered images was utilized by the primary rater to generate a single choroid plexus segmentation for each participant, using the FMRIB Software Library (FSL) tool fsleyes for segmentation and to visualize all three images in the same space simultaneously [18]. This approach was chosen to make efficient use of the higher spatial resolution T₁-weighted and T₂-weighted scans as well as the intraventricular contrast afforded on the T₂-weighted FLAIR. Given this process, a single ground truth segmentation was produced for each training subject. Manual segmentations focused on the choroid plexus in the atria of the lateral ventricles. We chose to focus on this region of the choroid plexus for several reasons. First, to limit biasing of the deep learning method, delineations were careful not to include partial voluming from nearby subcortical structures or periventricular white matter. The choroid plexus of the lateral ventricles minimizes the partial volumes effect, as the trigonum ventriculi contain the largest portion of the choroid plexus distinct from surrounding tissue [19]. It has been reported previously that across all four brain ventricles, more than half of the choroid plexus mass is located within the lateral ventricles [20].

Automatic choroid plexus segmentation

Automatic choroid plexus segmentations were generated via a fully convolutional neural network model.

The machine learning model was designed following a U-NET architecture [21]. This architecture was chosen because of its proven success in medical image segmentation algorithms and consisted of an encoding and a decoding step. The encoding step was composed of three blocks each composed of two layers. The number of filters was set to 64 for the first block and doubled at each block thereafter. Each layer consised of a 3D convolution (kernel size = 3 × 3 × 3 voxels, stride = 1, and padding = 1), a batch normalization, followed by a rectified linear unit. Feature maps from each block were downsampled using a maximum pooling operation (kernel size = 2 × 2 × 2 voxels). The decoding step followed the same architecture, with each block dividing the number of filters by 2. Up-sampling between each decoding block was performed with a 3D transposed convolution (kernel size = 2 × 2 × 2 voxels, stride = 2 × 2 × 2 voxels, and no padding). The final segmentation map was obtained using a block composed of a 3D convolution operator (kernel size = 1 × 1 × 1 voxels, stride = 1 × 1 × x1 voxels, and no padding) followed by a hyperbolic tangent as the activation function. The model was trained on three separate datasets to compare performance across different MRI sequences (i.e., T₁-weighted, T₂-weighted, and T₂-weighted FLAIR images). All images were registered non-linearly with ANTs software to the International Consortium for Brain Mapping-Montreal Neurological Institute (ICBM-MNI) 152 T₁-weighted template [22]. Non-linear registration was utilized in this step in order to reduce morphological variability of the lateral ventricles and thus increase the inter-subject similarity of the choroid plexus appearance.

Implementation details

A patch-based approach for training of the machine learning model was employed. Patches of 64 × 64 × 64 voxels were extracted from the MNI-registered images, and these patches were centered on random voxels from the choroid plexus probabilistic atlas that was generated from the average of the manual choroid plexus segmentations included in the training data set. In total, 41 overlapping patches from each participant were used to train the model. During the training phase, random flipping along the longitudinal fissure was implemented to increase the training sample size further. The network was trained using an ADAM [23] optimizer with a learning rate set to 10^–4. A generalized Dice loss function was used to train the network [24]. Lastly, the segmentation mask patches were pieced back together in MNI space and transformed back to the native T₁-weighted space using the inverse transformation and nearest-neighbor interpolation. An overview of the processing pipeline with a diagram of the 3D U-NET architecture is shown in Fig. 1.

For comparison to available algorithms, choroid plexus segmentation masks were generated using FreeSurfer’s standard segmentation procedure from T₁-weighted MRI in all training subjects [15, 16]. Briefly, input images were skull-stripped and intensity corrected, and FreeSurfer’s aseg atlas was used to generate left and right choroid plexus labels. These labels were inverse transformed back to each subject’s native T₁ space and combined to form one choroid plexus mask per subject. These masks were then compared to ground truth manual segmentations for statistical analysis. As a secondary and exploratory analysis, to assess test–retest reliability of the proposed machine learning methods, we analyzed consecutive T₁-weighted MRI collected in 10 participants during different scan sessions within a 2 month time frame (see Aditional file 1).

Lastly, for the lifespan volumetry analysis, T₁, T₂, and T₂-weighted FLAIR images were separately preprocessed as described previously, and the deep learning model for each corresponding MRI sequence was utilized to generate choroid plexus segmentations for all enrolled participants. From these segmentations in each participant’s native imaging space, the choroid plexus volume was calculated in cm³ .

Statistical analyses

To evaluate the accuracy of the choroid plexus segmentation, we investigated how each model performed when trained with different sets of MRI sequences (i.e., T₁-weighted, T₂-weighted, and T₂-weighted FLAIR images) compared to ground truth manual delineations. For each MRI sequence, a fivefold cross-validation scheme was implemented with 30 participants utilized in model training, 10 participants utilized in model validation, and 10 participants utilized in model evaluation. Pseudo-randomization was used to ensure the same participant groups across each modality-based model.

To verify the accuracy of the machine learning and FreeSurfer outputs, standard comparison metrics between the ground truth segmentation and machine learning output were calculated. The Dice-Sørensen coefficient, 95% Hausdorff distance, and area under curve (AUC) were calculated for each iteration of cross-validation and averaged across the iterations to produce representative metrics for each modality-based model. These metrics were then compared to FreeSurfer using two-tailed Wilcoxon tests.

As an exploratory analysis, we evaluated these performance metrics as a function of participant lateral ventricular volume to gain more understanding on how the machine learning models perform in different anatomical environments. Lateral ventricular volume was calculated from each participant’s T₁-weighted MRI using the AssemblyNet software package [25]. Generalized linear models were utilized to separately regress performance metrics against model-testing participants’ lateral ventricular volume.

The intraclass correlation coefficients between the choroid plexus volume for each control participant from the three deep learning models and between each of the choroid plexus volumes for the training participants and their ground truth choroid plexus volumes, were calculated and descriptive statistics presented as Bland–Altman plots.

For the lifespan component of this study, a generalized linear model was utilized to regress the choroid plexus volume from each modality model against participant age. Sex was included as a covariate as well in this regression to account for previously found sex-dependence on choroid plexus volume [4], and total intracranial volume calculated from AssemblyNet was included as a covariate as well. The McFadden R² values were calculated for each regression model.

The machine learning algorithm was implemented using the PyTorch Python library [26], and pre-processing, post-processing, and statistical analyses were implemented in Matlab [27]. All statistical analyses were implemented using the R software package [28]. All p-values were corrected with false discovery rate for multiple comparisons correction [29]. Significance criteria was defined as p < 0.05.

Results

Demographics: algorithm development

Participants (n = 50) included in the training, validation, and testing of the machine learning models ranged in age from 21 to 85 years, included 27 males and 23 females, and 29 control participants and 21 participants with neurodegeneration (Additional file 1: Table S1).

Algorithm performance metrics

Performance metrics for each proposed machine learning model, and a previously available FreeSurfer algorithm, are reported in Table 1. The average Dice coefficients were 0.72, 0.72, and 0.74 for the T₁-weighted, T₂-weighted, and T₂-weighted FLAIR models, respectively, while the average Dice coefficient for the FreeSurfer output applied to the T₁-weighted image was 0.19. Two-tailed Wilcoxon tests revealed a significant difference in the Dice coefficient between the T₁-weighted machine learning method and FreeSurfer, the T₂-weighted machine learning method and FreeSurfer, and the T₂-weighted FLAIR machine learning method and FreeSurfer (all p-values < 0.001).

Table 1 Performance metrics for each machine learning method and FreeSurfer using manual segmentations as the ground truth

Full size table

The average 95% Hausdorff distances were 1.97, 2.22, and 1.69 mm for the T₁-weighted, T₂-weighted, and T₂-weighted FLAIR models, respectively, and the average 95% Hausdorff distance for the FreeSurfer output was 10.4 mm. Two-tailed Wilcoxon tests revealed a significant difference in the 95% Hausdorff distance between the T₁-weighted machine learning method and FreeSurfer, the T₂-weighted machine learning method and FreeSurfer, and the T₂-weighted FLAIR machine learning method and FreeSurfer (all p-values < 0.001).

The average AUCs were 0.87 for each of the models and the average AUC for the FreeSurfer output was 0.56. Two-tailed Wilcoxon tests revealed a significant difference in the AUC between the T₁-weighted machine learning method and FreeSurfer, the T₂-weighted machine learning method and FreeSurfer, and the T₂-weighted FLAIR machine learning method and FreeSurfer (all p-values < 0.001).

An example of each machine learning model output compared to ground truth and FreeSurfer choroid plexus segmentations from a 53-year-old male with MCI are shown in Figs. 2 and 3.

Finally, supplementary sub-analyses on inter-rater manual segmentation (ICC between all raters = 0.73) and inter-scan machine learning segmentation performance (ICC between consecutive segmentations = 0.99) are summarized in the Supplementary Materials.

We also investigated the relationship between model performance and lateral ventricular volume. Numerical results and graphical representations of these relationships are shown in Additional file 1: Table S2 and Fig. 4, respectively. For each MRI sequence, the lateral ventricular volume of the testing participant was not significantly related to the Dice coefficient (T₁-weighted p-value: 0.44; T₂-weighted p-value: 0.99; T₂-weighted FLAIR p-value: 0.40). For the T₂-weighted and T₂-weighted FLAIR models, the lateral ventricular volume was not significantly related to the 95% Hausdorff Distance (T₂-weighted p-value: 0.92; T₂-weighted FLAIR p-value: 0.094); however, for the T₁-weighted model, the lateral ventricular volume was positively related to the 95% Hausdorff Distance (p-value: 0.050). For each MRI sequence, the lateral ventricular volume was not significantly related to the AUC (T₁-weighted p-value: 0.20; T₂-weighted p-value: 0.35; T₂-weighted FLAIR p-value: 0.69). For the FreeSurfer outputs, none of the metrics related to ventricular volume (Dice p-value: 0.99; 95% Hausdorff distance p-value: 0.44; AUC p-value: 0.69).

Intraclass correlation coefficients were 0.83, 0.82, and 0.82 between T₁-weighted deep learning choroid plexus volumes and ground truth choroid plexus volumes (Fig. 5a), T₂-weighted deep learning choroid plexus volumes and ground truth choroid plexus volumes (Fig. 5b), and T₂-weighted FLAIR deep learning choroid plexus volumes and ground truth choroid plexus volumes (Fig. 5c), respectively.

Choroid plexus volume and age

Participants (n = 98) included in the assessment of choroid plexus volume across the adult lifespan ranged from 21 to 89 years of age and included 46 males and 52 females (Additional file 1: Table S3).

Numerical and graphical results from these regression analyses are shown in Additional file 1: Table S4 and Fig. 6, respectively. For each MRI sequence, participant age was positively related to choroid plexus volume (all p-values < 0.001). Additionally, for each MRI sequence, participant sex was significantly related to choroid plexus volume, with males having a larger choroid plexus volume than females (T₁-weighted p-value: 0.0012; T₂-weighted and T₂-weighted FLAIR p-values < 0.001). For each MRI sequence, total intracranial volume was not significantly related to choroid plexus volume (T₁-weighted p-value: 0.094; T₂-weighted p-value: 0.094; T₂-weighted FLAIR p-value: 0.11). The McFadden’s R² values for the T₁-weighted, T₂-weighted, and T₂-weighted FLAIR regression models were 0.54, 0.60, and 0.57, respectively. Intraclass correlation coefficients between choroid plexus volumes were 0.95, 0.95, and 0.96 for T₁-weighted and T₂-weighted FLAIR deep learning methods (Fig. 7a), T₁-weighted and T₂-weighted deep learning methods (Fig. 7b), and T₂-weighted and T₂-weighted FLAIR deep learning methods (Fig. 7c). Representative choroid plexus volumes across the adult lifespan are included in Additional file 1: Table S3.

Discussion

A deep learning method with 3D U-NET architecture was trained for automatic segmentation of the choroid plexus from standard anatomical MRI. Models were trained separately on three types of commonly-acquired images: T₁-weighted, T₂-weighted, and T₂-weighted FLAIR MRI from a cohort of 50 participants across the adult lifespan and with differing levels of tissue atrophy. The findings of the study support improved automated segmentation of the choroid plexus using the proposed method compared to currently-available software, and also provides an exemplar of choroid plexus volumes, as a function of age, in controls that may provide a reference for studies in neurodegeneration. The software is also made freely available for academic use.

The three proposed deep learning methods were able to segment the choroid plexus with Dice coefficients, 95% Hausdorff distances, and AUC values comparable to those found in literature for choroid plexus segmentation [11, 13, 14, 30]. Zhao and colleagues report a mean dice score of 0.73 and a mean 95% Hausdorff distance of 1.87 utilizing a similar deep learning method with 3D U-NET architecture and 3D T₁-weighted MRI from 10 healthy subjects for training data [11]. Yazdan-Panah and colleagues also propose a 3D U-NET method for automatic choroid plexus segmentation using 3D T₁-weighted MRI from patients with multiple sclerosis (n = 97) and heatlhy controls (n = 44) for model training and report a mean dice score of 0.73 [14]. Lastly, Storelli and colleagues developed an automatic choroid plexus segmentation method utilizing a Gaussian Mixture Model from 3D T₂-weighted FLAIR and 3D T₁-weighted MRI in patients with multiple sclerosis (n = 55) and healthy controls (n = 60) and report mean dice scores of 0.63 in multiple sclerosis patients and 0.66 in healthy controls [30]. All training data in these studies were collected at 3.0 T. We expand on these methods by including additional anatomical MRI contrasts that are commonly acquired in clinical settings, specifically 3D T₂-weighted and 2D T₂-weighted FLAIR MRI, and a training dataset with diverse demographics with the goal of increasing the generalizability of the proposed methods. The proposed methods also showed improved performance compared to automatic segmentations from FreeSurfer across all calculated metrics, an important finding as many previous and ongoing studies utilize FreeSurfer for choroid plexus volumetric analyses [3, 8, 31]. FreeSurfer utilizes an atlas-based segmentation approach, whereby a manually labeled training set provided by the software is used to estimate probabilistic neuroanatomical labels for each voxel in the MRI volume registered to this atlas [15]. While this software has shown robust sensitivity for segmentation of many noncortical structures [15, 16], the results from this study and from Zhao et al. suggest that it may not be the most accurate for choroid plexus segmentation, possibly due to the inter-subject variation in choroid plexus structure [11]. Further, we found that most of the proposed models perform accurately independent of lateral ventricular volume. All model performance metrics had no significant relationship to testing subject lateral ventricular volume except the T₁ model’s 95% Hausdorff distance. Observing the central plot in Fig. 4, it is possible that this relationship was driven by a statistical outlier. The observation that the other performance metrics were relatively stable in the presence of a variety of lateral ventricular volumes provides further support for the robustness of these models. In our test–retest analysis, the intraclass correlation coefficient between choroid plexus volumes in consecutively acquired T₁-weighted MRI also was high (ICC = 0.99), again suggesting that the proposed method performs robustly in the context of repeated applications of the same algorithm to separate T₁-weighted scans from the same subject (Additional file 1: Fig. S2).

Additionally, we applied these deep learning methods in a cohort of 98 adult controls across the adult lifespan and found a significant positive relationship between subject age and choroid plexus volume with all three methods. Intraclass correlation coefficients between these volumes were high, suggesting consistently accurate calculations of choroid plexus volumes between models in this cohort. Intraclass coefficients between training subjects’ ground truth choroid plexus volumes and deep learning choroid plexus volumes were also high, suggesting accurate segmentation performance from the proposed methods. We also reported normative ranges of choroid plexus volume across the adult lifespan and found an approximate 15% increase in choroid plexus volume with each decade of life on average across all MRI sequences, which agrees with previous reports from literature [7, 8]. We previously reported age-related increases in choroid plexus volume using similar methods as described in this study and age-related decreases in choroid plexus perfusion detected from arterial spin labeling MRI [7]. Sun and colleagues recently reported age-related increases in choroid plexus volume using manual delineations from T₁-weighted MRI and enlarged stromal tissue in the choroid plexus of older subjects using ultrasmall superparamagnetic iron oxide (USPIO)-enhanced high resolution 2D gradient echo MRI at 7 Tesla [32]. Previous histopathological studies using hematoxylin–eosin staining have shown a thickened vascular wall and fibrotic stroma in the choroid plexus of elderly subjects as well [33], which could explain the enlarged volume on anatomical MRI.

While these methods showed robust results and provided findings that aligned with previous reports from literature, several factors should be considered when interpreting the results. The training data sample size included 50 participants. However, the chroid plexus was segmented using gold-standard manual segmentation by a radiologist and we chose a 3D U-NET architecture which has shown robust accuracy with limited data set samples [11, 21]. We also adopted a data augmentation strategy and utilized a patch-based approach and random flipping, which increased the training dataset from 50 to 4100 samples. Furthernore, we included participants in the training dataset with and without clinically diagnosed neurodegenerative diseases to increase generalizability. Additionally, the lifespan study reports on trends in choroid plexus volume with age, and participants are approximately equally distributed across the adult lifespan. However, this study was cross-sectional and not longitudinal (e.g., following the same participant over time) and also may be underpowered to infer small changes in choroid plexus volume over limited age epochs (e.g., a decade of life or less). Future work could expand on this cohort, using large data sets, to address these issues more rigorously. Lastly, when observing Table 1, Figs. 5, and 7 there is some variability in the calculated choroid plexus volumes compared to the ground truth volumes in all the proposed methods and when compared across the proposed methods. While this variability needs to be reduced in order to ensure accurate volume estimates, it is promising that the variability in Figs. 5a–c is considerably less than the variability produced from currently available automated segmentation approaches (Fig. 5d). It is also relevant to contextualize this variability with the variability that arises from manual segmentation procedures. In a separate analysis, we had two additional raters segment the choroid plexus in a subsample of the same 10 subjects using the same protocol described in the Methods. More details for the methods of this analysis can be found in the Supplementary Materials. The main finding from this analysis was that the variability produced from manual segmentations performed by separate raters was larger (overall ICC between 3 raters = 0.73) than the variability produced from the proposed automatic methods (ICC = 0.82). These analyses also stress the need to develop a more rigorous manual delineation protocol to assess the presence of ChP tissues in the lateral ventricles.

Conclusion

We propose a deep learning segmentation method for automatic segmentation of the choroid plexus from the following standard anatomical MRI: T₁-weighted, T₂-weighted, and T₂-weighted FLAIR. The proposed method performs similarly across these three commonly-acquired MRI sequences and improves segmentation accuracy compared to commercially available algorithms. Finally, we provide ranges for lateral ventricle choroid plexus volume across the adult lifespan, which should provide a useful exemplar for future work that aims to identify pathological aberrations in choroid plexus volume and function. The proposed method is also made freely available for academic use.

Availability of data and materials

The data that support the findings of this study are available from the corresponding author, MJD, upon reasonable request. Choroid plexus segmentation software is freely available for public use at: https://github.com/hettk/chp_seg

Abbreviations

ANTs:: Advanced Normalization Tools
AUC:: Area under curve
CSF:: Cerebrospinal fluid
FCNN:: Fully convolutional neural network
FLAIR:: Fluid attenuated inversion recovery
FSL:: FMRIB Software Library
ICBM-MNI:: International Consortium for Brain Mapping-Montreal Neurological Institute
IRB:: Institutional Review Board
MCI:: Mild cognitive impairment
MPRAGE:: Magnetization-prepared-rapid-gradient-echo
MRI:: Magnetic resonance imaging/images
PET:: Positron emission tomography
TE:: Echo time
TR:: Repetition time
USPIO:: Ultrasmall superparamagnetic iron oxide

References

Khasawneh AH, Garling RJ, Harris CA. Cerebrospinal fluid circulation: what do we know and how do we know it? Brain Circ. 2018;4:14.
Article PubMed PubMed Central Google Scholar
Iliff JJ, Wang M, Liao Y, et al. A paravascular pathway facilitates CSF flow through the brain parenchyma and the clearance of interstitial solutes, including amyloid β. Sci Transl Med. 2012;4:147ra111.
Article PubMed PubMed Central Google Scholar
Choi JD, Moon Y, Kim HJ, Yim Y, Lee S, Moon WJ. Choroid plexus volume and permeability at brain MRI within the Alzheimer disease clinical spectrum. Radiology. 2022;304(3):635–45.
Article PubMed Google Scholar
Ricigliano VAG, Morena E, Colombi A, Tonietto M, Hamzaoui M, Poirion E, Bottlaender M, Gervais P, Louapre C, Bodini B, Stankoff B. Choroid plexus enlargement in inflammatory multiple sclerosis: 3.0-T MRI and translocator protein PET evaluation. Radiology. 2021;301(1):166–77.
Article PubMed Google Scholar
Yasmin A, Pitkänen A, Andrade P, Paananen T, et al. Post-injury ventricular enlargement associates with iron in choroid plexus but not with seizure susceptibility nor lesion atrophy-6-month MRI follow-up after experimental traumatic brain injury. Brain Struct Funct. 2022;227:145–58.
Article PubMed Google Scholar
Maxwell DS, Pease DC. The electron microscopy of the choroid plexus. J Biophys Biochem Cytol. 1956;2(4):467–74.
Article CAS PubMed PubMed Central Google Scholar
Eisma JJ, McKnight CD, Hett K, Elenberger J, Song AK, Stark AJ, Claassen DO, Donahue MJ. Choroid plexus perfusion and bulk cerebrospinal fluid flow across the adult lifespan. J Cereb Blood Flow Metab. 2023;43(2):269–80.
Article CAS PubMed Google Scholar
Alisch JSR, Kiely M, Triebswetter C, Alsameen MH, Gong Z, Khattar N, Egan JM, Bouhrara M. Characterization of age-related differences in the human choroid plexus volume, microstructural integrity, and blood perfusion using multiparameter magnetic resonance imaging. Front Aging Neurosci. 2021;13:613.
Article Google Scholar
Daouk J, Bouzerar R, Chaarani B, Zmudka J, Meyer ME, Balédent O. Use of dynamic 18F-fluorodeoxyglucose positron emission tomography to investigate choroid plexus function in Alzheimer’s disease. Exp Gerontol. 2016;77:62–8.
Article CAS PubMed Google Scholar
Perera C, Harrison IF, Lythgoe MF, et al. Pharmacological MRI with simultaneous measurement of cerebral perfusion and blood-cerebrospinal fluid barrier function using Interleaved Echo-Time arterial spin labelling. Neuroimage. 2021;238: 118270.
Article CAS PubMed Google Scholar
Zhao, L., Feng, X., Meyer, C.H., et al. Choroid plexus segmentation using optimized 3D U-Net. In: IEEE 17th International Symposium on Biomedical Imaging, Iowa City, Iowa, USA, 3 April-7 April 2020, pp. 381–384. New Jersey: IEEE.
Akkus Z, Galimzianova A, Hoogi A, Rubin DL, Erickson BJ. Deep learning for brain MRI segmentation: state of the art and future directions. J Digit Imaging. 2017;30:449–59.
Article PubMed PubMed Central Google Scholar
Schmidt-Mengin, M., Ricigliano, V.A.G., Bodini, B., et al. Axial multi-layer perceptron architecture for automatic segmentation of choroid plexus in multiple sclerosis. Proc. SPIE 12032, Medical Imaging 2022: Image Processing, 1203208.
Yazdan-Panah, A., Schmidt-Mengin, M., Ricigliano, VAG., et al. Automatic segmentation of the choroid plexuses: method and validation in controls and patients with multiple sclerosis. NeuroImage 2023; 38:103368.
Fischl B, et al. Whole brain segmentation: automated labeling of neuroanatomical structures in the human brain. Neuron. 2022;33(3):341–55.
Article Google Scholar
Fischl B, et al. Automatically Parcellating the Human Cerebral Cortex. Cereb Cortex. 2004;14(1):11–22.
Article PubMed Google Scholar
Avants BB, Tustison NJ, Song G, et al. A reproducible evaluation of ANTs similarity metric performance in brain image registration. Neuroimage. 2011;54:2033–44.
Article PubMed Google Scholar
McCarthy, P. (2023). FSLeyes (1.8.1). Zenodo.
Senay O, Seethaler M, Makris N, et al. A preliminary choroid plexus volumetric study in individuals with psychosis. Hum Brain Mapp. 2023;44:2465–78.
Article PubMed PubMed Central Google Scholar
Spector R, Keep RF, Snodgrass SR, et al. A balanced view of choroid plexus structure and function: focus on adult humans. Exp Neurol. 2015;267:78–86.
Article PubMed Google Scholar
Çiçek, O., Abdulkadir, A., Lienkamp, S.S., et al. 3D U-Net: Learning dense volumetric segmentation from sparse annotation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, Athens, Greece, 17 October-21 October 2016, pp.424–432. Minnesota: MICCAI.
Fonov, V.S., Evans, A.C., McKinstry, R.C., et al. Unbiased nonlinear average age-appropriate brain templates from birth to adulthood. In: Organization for Human Brain Mapping Annual Meeting, San Francisco, California, USA, July 2009, pp. S102. Minnesota: OHBM.
Kingma, D. P., & Lei Ba, J. Adam: A Method for Stochastic Optimization. In: International Conference on Learning Representations, San Diego, California, USA, May 2015.
Sudre, C.H., Li, W., Vercauteren, T., et al. Generalised Dice Overlap as a Deep Learning Loss Function for Highly Unbalanced Segmentations. Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 10553 LNCS, pp. 240–248, 2017.
Coupé P, Mansencal B, Clément M, et al. AssemblyNet: A large ensemble of CNNs for 3D whole brain MRI segmentation. Neuroimage. 2020;2020(219): 117026.
Article Google Scholar
Paszke, A., Gross, S., Massa, F., et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library. Advances in Neural Information Processing Systems 2019; 32:8024. Curran Associates, Inc.
The MathWorks Inc. (2021). MATLAB version: 9.13.0 (R2021a), Natick, Massachusetts: The MathWorks Inc. https://www.mathworks.com
R Core Team (2014). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc. 1995;57:289–300.
MathSciNet Google Scholar
Storelli, L., Pagani, E., Rubin, M., et al. A Fully Automatic Method to Segment Choroid Plexuses in Multiple Sclerosis Using Conventional MRI Sequences. Journal of Magnetic Resonance Imaging. 2023. Epub.
Egorova N, Gottlieb E, Khlif MS, et al. Choroid plexus volume after stroke. Int J Stroke. 2019;14(9):923–30.
Article PubMed Google Scholar
Sun, Z., Li, C., Muccio, M., Jiang, L., & Ge, Y. Age-related Vascular Changes in Choroid Plexus Evaluated Using High-resolution USPIO-Enhanced 7T MRI. In: International Society for Magnetic Resonance in Medicine, Toronto, Canada, 3 June-8 June 2023. California: ISMRM.
Prineas JW, Parratt JDE, Kirwan PD. Fibrosis of the choroid plexus filtration membrane. J Neuropathol Exp Neurol. 2016;75:855–67.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors are grateful to Chuck Nockowski and Ryan Robison for experimental support. This study has been carried out with the financial support of NIH/NIA Grant 5R01AG062574 and NIH/NCCIH Grant 1R01AT011456. Vanderbilt University Medical Center Institutional Review Board has approved this study.

Funding

National Institute on Aging, 5R01AG062574, National Center for Complementary and Integrative Health, 1R01AT011456.

Author information

Authors and Affiliations

Department of Neurology, Behavioral and Cognitive Neurology, Vanderbilt University Medical Center, 1500 21 stAve South, Village at Vanderbilt, Suite 2600, Nashville, TN, 37212, USA
Jarrod J. Eisma, Kilian Hett, Jason Elenberger, Caleb J. Han, Alexander K. Song, Ciaran Considine, Daniel O. Claassen & Manus J. Donahue
Department of Radiology and Radiological Sciences, Vanderbilt University Medical Center, Nashville, TN, USA
Colin D. McKnight
Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, TN, USA
Manus J. Donahue
Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, TN, USA
Manus J. Donahue

Authors

Jarrod J. Eisma
View author publications
You can also search for this author in PubMed Google Scholar
Colin D. McKnight
View author publications
You can also search for this author in PubMed Google Scholar
Kilian Hett
View author publications
You can also search for this author in PubMed Google Scholar
Jason Elenberger
View author publications
You can also search for this author in PubMed Google Scholar
Caleb J. Han
View author publications
You can also search for this author in PubMed Google Scholar
Alexander K. Song
View author publications
You can also search for this author in PubMed Google Scholar
Ciaran Considine
View author publications
You can also search for this author in PubMed Google Scholar
Daniel O. Claassen
View author publications
You can also search for this author in PubMed Google Scholar
Manus J. Donahue
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

JJE, CDM, KH, CC, DOC, and MJD contributed to study design and interpretation of data. JJE, MJD, AKS, and JE contributed to data acquisitions. CDM contributed to the review of the data. JJE, KH, AKS, CJH and MJD contributed to analysis and development of new methods. All authors contributed to writing the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Manus J. Donahue.

Ethics declarations

Ethics approval and consent to participate

All participants provided informed consent in accordance with the local institutional review board and consistent with the Declaration of Helsinki and its amendments.

Consent for publication

Available upon request.

Competing interests

No competing interests to declare.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Additional figures and tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Eisma, J.J., McKnight, C.D., Hett, K. et al. Deep learning segmentation of the choroid plexus from structural magnetic resonance imaging (MRI): validation and normative ranges across the adult lifespan. Fluids Barriers CNS 21, 21 (2024). https://doi.org/10.1186/s12987-024-00525-9

Download citation

Received: 08 September 2023
Accepted: 21 February 2024
Published: 29 February 2024
DOI: https://doi.org/10.1186/s12987-024-00525-9

Deep learning segmentation of the choroid plexus from structural magnetic resonance imaging (MRI): validation and normative ranges across the adult lifespan

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Demographics

Image acquisition

Manual segmentation of the choroid plexus

Automatic choroid plexus segmentation

Implementation details

Statistical analyses

Results

Demographics: algorithm development

Algorithm performance metrics

Choroid plexus volume and age

Discussion

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Fluids and Barriers of the CNS

Contact us