Keywords
fMRI, human, cognition, preprocessed,
This article is included in the INCF gateway.
The recently published Consortium for Neuropsychiatric Phenomics (CNP) dataset [1] is large (272 participants), diverse (healthy controls as well as individuals diagnosed with schizophrenia, bipolar disorder, and attention deficit/hyperactivity disorder), and rich in phenotypic information (each participant completed 42 questionnaires). It is undoubtedly a valuable resource for the academic community. However, before any brain-behaviour relationships can be investigated, computationally expensive preprocessing steps need to be performed [2]. In addition to requiring substantial resources, a certain level of expertise in MRI data processing and fMRI task modelling is needed before the data can be used to test scientific hypotheses.
To facilitate answering scientific questions using the CNP dataset, we have performed standard preprocessing as well as statistical modelling on the data, and are making the results of these analyses openly available. The preprocessing was designed to facilitate a wide range of analyses, and includes outputs in native (aligned with the participant's T1-weighted scan), MNI (volumetric) and fsaverage5 (surface) spaces. The data have not been denoised, but potential confound regressors have been calculated for each run, giving researchers the freedom to fit many different models that incorporate different denoising schemes. In addition, we include group and single-subject statistical maps for all tasks available in the original dataset. This preprocessed dataset joins the ranks of similar initiatives for other openly shared datasets [3-5], and we hope it will be equally useful to the scientific community.
For scanning parameters and details of the task fMRI paradigms, see [1]. The input dataset was obtained from OpenfMRI.org [6], accession number ds000030, revision 1.0.3.
Results included in this manuscript come from preprocessing performed using FMRIPREP version 0.4.4 (http://fmriprep.readthedocs.io), a Nipype-based tool [7]. FMRIPREP was run with the following command line arguments:
--participant_label {sid} -w $LOCAL_SCRATCH --output-space T1w fsaverage5 template --nthreads 8 --mem_mb 20000
where {sid} was the participant label and $LOCAL_SCRATCH was a temporary folder for storing intermediate results.
Within the pipeline, each T1-weighted volume was corrected for bias field using N4BiasFieldCorrection v2.1.0 [8], skullstripped using antsBrainExtraction.sh v2.1.0 (with the OASIS template), and coregistered to the skullstripped ICBM 152 Nonlinear Asymmetrical template, version 2009c [9], using the nonlinear transformation implemented in ANTs v2.1.0 [10]. Cortical surfaces were estimated using FreeSurfer v6.0.0 [11].
Functional data for each run were motion corrected using MCFLIRT v5.0.9 [12]. Functional data were skullstripped using a combination of the BET and 3dAutomask tools, and coregistered to the corresponding T1-weighted volume using boundary-based registration with 9 degrees of freedom, as implemented in FreeSurfer v6.0.0 [13]. Motion-correcting transformations, the transformation to T1-weighted space, and the MNI template warp were applied in a single step using antsApplyTransforms v2.1.0 with Lanczos interpolation.
Three tissue classes were extracted from the T1-weighted images using FSL FAST v5.0.9 [14]. Voxels from cerebrospinal fluid and white matter were used to create a mask, in turn used to extract physiological noise regressors using aCompCor [15]. The mask was eroded and limited to subcortical regions to minimise overlap with grey matter, and six principal components were estimated. Framewise displacement and DVARS [16] were calculated for each functional run using their Nipype implementations. In addition to these regressors, the global signal and the mean white matter signal were also calculated.
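The core of the aCompCor step can be sketched as follows. This is a simplified illustration on synthetic data using a plain NumPy SVD, not the exact Nipype implementation; the function name and array shapes are our own choices:

```python
import numpy as np

def acompcor(data, noise_mask, n_components=6):
    """Extract principal component time courses from voxels inside a
    noise mask (e.g. eroded CSF+WM), in the spirit of aCompCor.
    data: 4D (x, y, z, t) array; noise_mask: 3D boolean array."""
    # Time series of the noise-mask voxels, shape (t, n_voxels)
    ts = data[noise_mask].T.astype(float)
    # Remove each voxel's temporal mean before the decomposition
    ts -= ts.mean(axis=0)
    # Columns of U are orthonormal time courses, ordered by variance
    u, s, _ = np.linalg.svd(ts, full_matrices=False)
    return u[:, :n_components]

# Synthetic example: 200 timepoints, a small volume, a random mask
rng = np.random.default_rng(0)
vol = rng.standard_normal((8, 8, 8, 200))
mask = rng.random((8, 8, 8)) < 0.2
comps = acompcor(vol, mask)
print(comps.shape)  # (200, 6)
```

The six returned columns correspond to the six principal components mentioned above and would be appended to the confounds file, one column per component.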
The whole dataset was preprocessed three times in total. After each iteration, the decision to modify the preprocessing was based purely on visual evaluation of the preprocessed data, not on the results of model fitting. The first iteration (using FMRIPREP 0.4.2) uncovered an inconsistent output image field of view and issues with EPI skullstripping; the second iteration (using FMRIPREP 0.4.3) uncovered two cases of failed normalization due to poor initialization. In the final iteration all of these issues were resolved. In total, the preprocessing consumed ~22,556 single-CPU hours.
For more details of the pipeline see http://fmriprep.readthedocs.io/en/0.4.4/workflows.html.
For a full description of the paradigms for each task, please refer to [1]. We analysed the task data using FSL [17] and AFNI [18], orchestrated with Nipype [7]. Spatial smoothing was applied using AFNI's 3dBlurInMask with a Gaussian kernel of FWHM = 5 mm. Activity was estimated using a general linear model (GLM) with FEAT [17]. Predictors were convolved with a double-gamma canonical haemodynamic response function [19]. Temporal derivatives were added to all task regressors to compensate for variability in the haemodynamic response function. Furthermore, the following regressors were added to avoid confounding due to motion: standardised DVARS, absolute DVARS, the voxelwise standard deviation of DVARS, framewise displacement, and the six motion parameters (translation in three directions, rotation in three directions).
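The convolution step above can be sketched as follows. This is an illustrative reimplementation, not FEAT's internal code; we assume the widely used double-gamma parameterisation (response peaking near 6 s, undershoot near 16 s, undershoot ratio 1/6):

```python
import numpy as np
from scipy.stats import gamma

def double_gamma_hrf(tr=2.0, duration=32.0):
    """Canonical double-gamma HRF sampled once per TR.
    Shape parameters 6 and 16 are the common defaults (assumed here)."""
    t = np.arange(0, duration, tr)
    peak = gamma.pdf(t, 6)           # main response, peaks ~5 s
    undershoot = gamma.pdf(t, 16)    # delayed undershoot, peaks ~15 s
    hrf = peak - undershoot / 6.0
    return hrf / hrf.max()

def convolve_regressor(boxcar, tr=2.0):
    """Convolve a stimulus boxcar with the HRF, trimmed to run length."""
    hrf = double_gamma_hrf(tr)
    return np.convolve(boxcar, hrf)[: len(boxcar)]

# A single 10 s event starting at scan 5 (TR = 2 s, 40 volumes)
box = np.zeros(40)
box[5:10] = 1.0
pred = convolve_regressor(box)

# Temporal derivative regressor, added (as in the text) to absorb
# subject-to-subject variability in HRF timing
deriv = np.gradient(pred)
```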
For the Balloon Analog Risk Task (BART), we included 9 task regressors. For each condition (accept, explode, reject), we added a regressor with equal amplitude and a duration of 1 second on each trial. Furthermore, we included the same regressors with the amplitude modulated by the number of trials preceding an explosion (perceived as the probability of explosion). The modulators were mean-centered to avoid estimation problems due to collinearity. For the conditions that require a response (accept, reject), a regressor was added with equal amplitude and a duration equal to the reaction time. These regressors were orthogonalised with respect to their fixed-duration counterparts to separate the fixed effect of the trial from the effect covarying with reaction time. A final regressor was added for the control condition.
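The mean-centering and orthogonalisation used here (and in the tasks below) can be sketched with synthetic vectors. FEAT performs these operations internally; the helper functions below are our own illustrative names:

```python
import numpy as np

def mean_center(modulator):
    """Mean-center a parametric modulator to reduce collinearity
    with its fixed-amplitude counterpart."""
    return modulator - modulator.mean()

def orthogonalize(x, wrt):
    """Remove from x its projection onto wrt, so the returned
    regressor carries only variance not explained by wrt."""
    beta = np.dot(wrt, x) / np.dot(wrt, wrt)
    return x - beta * wrt

rng = np.random.default_rng(1)
fixed = rng.random(100)               # fixed-duration regressor
rt = 0.5 * fixed + rng.random(100)    # correlated RT-duration regressor
rt_orth = orthogonalize(rt, fixed)
print(np.dot(rt_orth, fixed))  # ~0: shared variance removed
```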
In the retrieval phase of the Paired-Associate Memory Task (PAMRET), we modelled 4 conditions: true positives, false positives, true negatives, and false negatives. For each condition, a regressor was modelled first with fixed durations (3 s) and second with reaction-time durations, the latter orthogonalised with respect to the former. With an extra regressor for control trials, there are 9 task regressors in total.
In the Spatial Capacity Task (SCAP), 25 task regressors were included. For each cognitive load (1, 3, 5, or 7) and each delay (1.5, 3, or 4.5 s) with a correct response, two regressors were added: one with fixed durations of 5 seconds and one with the duration equal to the reaction time, the second orthogonalised with respect to the first. For both regressors, the onset was placed after the delay. The last regressor summarises all incorrect trials.
For the Stop-Signal Task (STOPSIGNAL), one task regressor with a fixed duration of 1.5 s was included for each condition (go, stop-successful, stop-unsuccessful). For the conditions requiring a response (go and stop-unsuccessful), an extra regressor was added with equal amplitude but a duration equal to the reaction time. Again, these regressors were orthogonalised with respect to the fixed-duration regressor of the same condition. A sixth regressor was added for erroneous trials.
In the Task Switching Task (TASKSWITCH), all manipulations were crossed (switch/no switch, congruent/incongruent, CSI delay short/long), resulting in 8 task conditions. As in the SCAP task, for each condition we added two regressors: one with fixed durations of 1 second, and one with the duration equal to the reaction time, the second orthogonalised with respect to the first. There are 16 task regressors in total.
For subjects missing at least one regressor used in the contrasts, the task data were discarded. This is the case, for example, when no correct answers were registered for a certain condition in the SCAP task. We discarded 16 subjects for SCAP, 14 for TASKSWITCH, 2 for STOPSIGNAL, 2 for BART, and 12 for PAMRET.
All modelled contrasts are listed in the Supplementary material. As is shown, all contrasts are estimated and tested for both a positive and a negative effect.
The total number of subjects modelled in the BART task is 259, while 244 subjects were modelled for the SCAP task. 254 subjects were included in the TASKSWITCH task analysis, 197 subjects in the PAMRET task and 255 subjects in the STOPSIGNAL task.
Subsequent to the single-subject analyses, all subjects were entered into a one-sample group-level analysis for each task. Three second-level analysis strategies were followed: (A) ordinary least squares (OLS) mixed modelling using FLAME [17], (B) generalized least squares (GLS) with a local estimate of random-effects variance, using FSL [17], and (C) non-parametric modelling (NP) using RANDOMISE [20], with the whole-brain first-level parameter estimates for each subject as input, and 10,000 permutations. The first two analyses used a group brain mask restricted to voxels present in 100% of the subjects. For the permutation tests, a group mask was created in which voxels were discarded from further analysis if less than 80% of subjects had data in them.
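The two group-mask rules can be sketched as follows, using synthetic binary masks in place of the real subject brain masks; the function and variable names are illustrative:

```python
import numpy as np

def group_mask(masks, threshold=1.0):
    """Keep voxels present in at least `threshold` proportion of
    subjects. masks: (n_subjects, x, y, z) boolean array."""
    coverage = masks.mean(axis=0)   # per-voxel proportion of subjects
    return coverage >= threshold

rng = np.random.default_rng(2)
masks = rng.random((50, 10, 10, 10)) > 0.1    # 50 subjects' brain masks
strict = group_mask(masks, 1.0)    # intersection mask (FLAME/GLS)
lenient = group_mask(masks, 0.8)   # >=80% coverage (randomise)
print(strict.sum() <= lenient.sum())  # True: intersection is a subset
```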
In addition to group-level statistical maps, activation count maps (ACMs) were generated to show the proportion of participants showing activation, rather than the average activation over subjects [21]. These maps indicate whether the effects discovered in the group analyses are consistent across subjects. As in [21], the statistical map for each subject was binarized at z = ±1.65. For each contrast, the average of these maps was computed over subjects. The average negative map (the percentage of subjects showing a negative effect with z < -1.65) was subtracted from the average positive map to indicate the direction of effects.
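The ACM computation described above amounts to a few array operations; a sketch on synthetic z-maps (flattened to one row of voxels per subject for simplicity):

```python
import numpy as np

def activation_count_map(z_maps, z_thresh=1.65):
    """Per voxel: proportion of subjects with z > +thresh minus the
    proportion with z < -thresh. Values lie in [-1, 1].
    z_maps: (n_subjects, n_voxels) array of first-level z statistics."""
    pos = (z_maps > z_thresh).mean(axis=0)
    neg = (z_maps < -z_thresh).mean(axis=0)
    return pos - neg

# Synthetic z-maps for 100 subjects with a slight positive effect
rng = np.random.default_rng(3)
z = rng.standard_normal((100, 1000)) + 0.5
acm = activation_count_map(z)
```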
To validate the quality of the volumetric spatial normalization, we examined the overlap of the EPI-derived brain masks in MNI space (across all participants and runs; 1,969 masks in total; see Figure 1). Within-subject coregistration and normalization worked well for the vast majority of participants, producing very good overlap. All issues observed while processing the dataset are listed in Table 1.
A selection of the tested contrasts in the task analyses is shown in Figures 2 to 6. Figures were generated using nilearn [22].
Table 1. List of problems with the raw data that we were aware of at the time of writing and that impacted preprocessing.
Figure captions (Figures 2 to 6): in each left plot, the statistical map of the one-sample group test, computed with randomise; each right plot shows the difference between the positive and the negative activation count maps.
The preprocessed images were deposited alongside the original dataset in the OpenfMRI repository (accession number ds000030 [6], revision 1.0.4). The preprocessed data are organized according to the draft BIDS derivatives specification. All FMRIPREP derivatives are organised under fmriprep/sub-<participant_label>/
Derivatives related to T1 weighted files are in the anat subfolder:
*T1w_preproc.nii.gz - bias-field-corrected T1-weighted file, using ANTs' N4BiasFieldCorrection.
*T1w_brainmask.nii.gz - brain mask derived using ANTs.
*T1w_dtissue.nii.gz - tissue class map derived using FAST.
*T1w_class-CSF_probtissue.nii.gz, *T1w_class-GM_probtissue.nii.gz, *T1w_class-WM_probtissue.nii.gz - probability tissue maps.
All of the above are available in native and MNI space.
*T1w_smoothwm.[LR].surf.gii - smoothed GrayWhite surfaces.
*T1w_pial.[LR].surf.gii - pial surface.
*T1w_midthickness.[LR].surf.gii - MidThickness surfaces.
*T1w_inflated.[LR].surf.gii - FreeSurfer inflated surfaces for visualization.
*T1w_space-MNI152NLin2009cAsym_class-CSF_probtissue.nii.gz, *T1w_space-MNI152NLin2009cAsym_class-GM_probtissue.nii.gz, *T1w_space-MNI152NLin2009cAsym_class-WM_probtissue.nii.gz - probability tissue maps, transformed into MNI space.
*T1w_target-MNI152NLin2009cAsym_warp.h5 - composite (warp and affine) transform for mapping the participant's T1-weighted image into MNI space.
Derivatives related to EPI files are in the func subfolder:
*bold_space-<space>_brainmask.nii.gz Brain mask for EPI files.
*bold_space-<space>_preproc.nii.gz Motion-corrected (using MCFLIRT for estimation and ANTs for interpolation) EPI file.
All of the above are available in the native T1 weighted space as well as the MNI space.
*bold_space-fsaverage5.[LR].func.gii Motion-corrected EPI file sampled to surface.
*bold_confounds.tsv A tab-separated value file with one column per calculated confound (see Methods) and one row per timepoint/volume.
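A typical way to use the confounds file is to load it and select columns for a nuisance design matrix. The sketch below uses a miniature inline stand-in for a real *bold_confounds.tsv; the column names shown are illustrative of this FMRIPREP version's output and should be checked against the actual file header:

```python
import io
import pandas as pd

# Miniature stand-in for a *bold_confounds.tsv file (real files have one
# row per volume and more columns, e.g. the six aCompCor components)
tsv = (
    "FramewiseDisplacement\tGlobalSignal\tX\tY\tZ\n"
    "0.00\t101.2\t0.01\t-0.02\t0.00\n"
    "0.12\t100.9\t0.02\t-0.01\t0.01\n"
    "0.05\t101.0\t0.01\t-0.03\t0.02\n"
)
confounds = pd.read_csv(io.StringIO(tsv), sep="\t")

# Select motion-related columns to regress out of the EPI time series
nuisance = confounds[["FramewiseDisplacement", "X", "Y", "Z"]].to_numpy()
print(nuisance.shape)  # (3, 4)
```

In practice one would pass `sub-<participant_label>_..._bold_confounds.tsv` to `pd.read_csv` directly and hand the selected columns to the denoising model of choice.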
The results of the single subject task modeling are available in task/sub-<participant_label>/ and the group level results can be found in task_group/. Each subject-specific folder holds 5 folders - bart.feat, scap.feat, pamret.feat, stopsignal.feat, taskswitch.feat - with the results from the respective task modeling, organised as standard FEAT output. The group-level folder contains a folder for every task, in turn containing a folder for each contrast (see Supplementary material for naming conventions) and below those folders are the results of the three modeling strategies.
In addition, the dataset includes visual quality HTML reports (one per participant).
The results for each contrast in the one-sample group task analyses are deposited and can be interactively viewed in NeuroVault [23]: http://neurovault.org/collections/2606/.
Latest source code used to produce the task analyses: https://github.com/poldracklab/CNP_task_analysis
Archived source code as at the time of publication: http://doi.org/10.5281/zenodo.832319 [24].
License: MIT license.
All code was run in a Singularity container [25], created from the Docker container poldracklab/cnp_task_analysis:1.0 available on Docker Hub (https://hub.docker.com/r/poldracklab/cnp_task_analysis/).
This work has been funded by the Laura and John Arnold Foundation. JD has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 706561. The acquisition of the original dataset was supported by the Consortium for Neuropsychiatric Phenomics (NIH Roadmap for Medical Research grants UL1-DE019580, RL1MH083268, RL1MH083269, RL1DA024853, RL1MH083270, RL1LM009833, PL1MH083271, and PL1NS062410).
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
We would like to thank all of the developers and beta testers of the FMRIPREP package - especially Oscar Esteban, Chris Markiewicz, and Ross Blair.
Is the rationale for creating the dataset(s) clearly described?
Yes
Are the protocols appropriate and is the work technically sound?
Yes
Are sufficient details of methods and materials provided to allow replication by others?
Yes
Are the datasets clearly presented in a useable and accessible format?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Medicine, statistics, medical imaging
Is the rationale for creating the dataset(s) clearly described?
Partly
Are the protocols appropriate and is the work technically sound?
Yes
Are sufficient details of methods and materials provided to allow replication by others?
Partly
Are the datasets clearly presented in a useable and accessible format?
Yes
Competing Interests: No competing interests were disclosed.
Version history: Version 1 published 28 Jul 17; Version 2 (revision) published 22 Sep 17.