TY - JOUR
T1 - Longitudinal reproducibility of automatically segmented hippocampal subfields: A multisite European 3T study on healthy elderly
AU - Marizzoni, Moira
AU - Antelmi, Luigi
AU - Bosch, Beatriz
AU - Bartrés-Faz, David
AU - Müller, Bernhard W.
AU - Wiltfang, Jens
AU - Fiedler, Ute
AU - Roccatagliata, Luca
AU - Picco, Agnese
AU - Nobili, Flavio
AU - Blin, Olivier
AU - Bombois, Stephanie
AU - Lopes, Renaud
AU - Sein, Julien
AU - Ranjeva, Jean-Philippe
AU - Didic, Mira
AU - Gros-Dagnac, Hélène
AU - Payoux, Pierre
AU - Zoccatelli, Giada
AU - Alessandrini, Franco
AU - Beltramello, Alberto
AU - Bargalló, Núria
AU - Ferretti, Antonio
AU - Caulo, Massimo
AU - Aiello, Marco
AU - Cavaliere, Carlo
AU - Soricelli, Andrea
AU - Salvadori, Nicola
AU - Parnetti, Lucilla
AU - Tarducci, Roberto
AU - Floridi, Piero
AU - Tsolaki, Magda
AU - Constantinidis, Manos
AU - Drevelegas, Antonios
AU - Rossini, Paolo Maria
AU - Marra, Camillo
AU - Hoffmann, Karl-Titus
AU - Hensch, Tilman
AU - Schönknecht, Peter
AU - Kuijer, Joost P.
AU - Visser, Pieter Jelle
AU - Barkhof, Frederik
AU - Bordet, Régis
AU - Frisoni, Giovanni B.
AU - Jovicich, Jorge
PY - 2015
Y1 - 2015
N2 - Recently, there has been an increased interest in the use of automatically segmented subfields of the human hippocampal formation derived from magnetic resonance imaging (MRI). However, little is known about the test-retest reproducibility of such measures, particularly in the context of multisite studies. Here, we report the reproducibility of automated Freesurfer hippocampal subfields segmentations in 65 healthy elderly enrolled in a consortium of 13 3T MRI sites (five subjects per site). Participants were scanned in two sessions (test and retest) at least one week apart. Each session included two anatomical 3D T1 MRI acquisitions harmonized in the consortium. We evaluated the test-retest reproducibility of subfields segmentation (i) to assess the effects of averaging two within-session T1 images and (ii) to compare subfields with whole hippocampus volume and spatial reliability. We found that within-session averaging of two T1 images significantly improved the reproducibility of all hippocampal subfields but not that of the whole hippocampus. Volumetric and spatial reproducibility across MRI sites were very good for the whole hippocampus, CA2-3, CA4-dentate gyrus (DG), subiculum (reproducibility error∼2% and DICE > 0.90), good for CA1 and presubiculum (reproducibility error ∼ 5% and DICE ∼ 0.90), and poorer for fimbria and hippocampal fissure (reproducibility error ∼ 15% and DICE < 0.80). Spearman's correlations confirmed that test-retest reproducibility improved with volume size. Despite considerable differences of MRI scanner configurations, we found consistent hippocampal subfields volumes estimation. CA2-3, CA4-DG, and sub-CA1 (subiculum, presubiculum, and CA1 pooled together) gave test-retest reproducibility similar to the whole hippocampus. Our findings suggest that the larger hippocampal subfields volume may be reliable longitudinal markers in multisite studies.
AB - Recently, there has been an increased interest in the use of automatically segmented subfields of the human hippocampal formation derived from magnetic resonance imaging (MRI). However, little is known about the test-retest reproducibility of such measures, particularly in the context of multisite studies. Here, we report the reproducibility of automated Freesurfer hippocampal subfields segmentations in 65 healthy elderly enrolled in a consortium of 13 3T MRI sites (five subjects per site). Participants were scanned in two sessions (test and retest) at least one week apart. Each session included two anatomical 3D T1 MRI acquisitions harmonized in the consortium. We evaluated the test-retest reproducibility of subfields segmentation (i) to assess the effects of averaging two within-session T1 images and (ii) to compare subfields with whole hippocampus volume and spatial reliability. We found that within-session averaging of two T1 images significantly improved the reproducibility of all hippocampal subfields but not that of the whole hippocampus. Volumetric and spatial reproducibility across MRI sites were very good for the whole hippocampus, CA2-3, CA4-dentate gyrus (DG), subiculum (reproducibility error∼2% and DICE > 0.90), good for CA1 and presubiculum (reproducibility error ∼ 5% and DICE ∼ 0.90), and poorer for fimbria and hippocampal fissure (reproducibility error ∼ 15% and DICE < 0.80). Spearman's correlations confirmed that test-retest reproducibility improved with volume size. Despite considerable differences of MRI scanner configurations, we found consistent hippocampal subfields volumes estimation. CA2-3, CA4-DG, and sub-CA1 (subiculum, presubiculum, and CA1 pooled together) gave test-retest reproducibility similar to the whole hippocampus. Our findings suggest that the larger hippocampal subfields volume may be reliable longitudinal markers in multisite studies.
KW - Freesurfer
KW - hippocampus
KW - test-retest reproducibility
KW - within session T1 averaging
KW - Freesurfer
KW - hippocampus
KW - test-retest reproducibility
KW - within session T1 averaging
UR - http://hdl.handle.net/10807/69808
U2 - 10.1002/hbm.22859
DO - 10.1002/hbm.22859
M3 - Article
SN - 1097-0193
VL - 36
SP - 3516
EP - 3527
JO - Human Brain Mapping
JF - Human Brain Mapping
ER -