INDI Retrospective Studies
Curated, ready-to-use subsets of INDI (International Neuroimaging Data-sharing Initiative) retrospective releases, prepared for the Pittsburgh Fiber Data Hub. This repository consolidates links, essential metadata, licensing, and selected derived artifacts (e.g., QC tables, tractography-ready files) to streamline reuse and replication.
Access note: Many datasets are hosted on external portals or S3 (e.g., NITRC, fcp-indi). Always review each dataset’s license and acknowledgments before use.
What’s included
For each dataset mirrored under data-indi/retro, this repository provides:
- Concise release pages (scope, modalities, phenotypes, funding).
- Pointers to upstream downloads (NITRC / S3 / project pages).
-
When present, helper assets such as:
participants.tsvqc.tsv- Tractography-ready derivatives (
*.gqi.fz,*.qsdr.fz,.sz,.dti.dz,.gqi.dz)
All raw imaging data remain hosted by INDI / project owners and are subject to each dataset’s data use terms.
SIMON — Single Individual Volunteer (73 sessions, multi-site)
Dataset: SIMON — Single Individual Volunteer for Multiple Observations across Networks Design: Longitudinal single healthy ambidextrous male (age 29–46) scanned in 73 sessions across multiple sites and scanner models, including partial acquisition under the Canadian Dementia Imaging Protocol (CDIP). New sessions continue to be added.
Modalities (vary by session):
- T1-, T2-, and PD-weighted images
- T2*-weighted images
- Diffusion-weighted imaging (DTI)
- Resting-state fMRI
- Susceptibility-weighted / QSM-capable sequences
- Arterial spin labeling (ASL)
Phenotypes: Age, session number and date (SIMON_pheno.csv).
Use cases:
- Cross-site and cross-scanner harmonization
- Within-subject longitudinal variability and reliability
- Evaluation of protocol differences and CDIP-style harmonization
Access (AWS S3 via HTTPS tools):
- Raw data:
s3://fcp-indi/data/Projects/INDI/SIMON/rawdata - Preprocessed fMRI:
s3://fcp-indi/data/Projects/INDI/SIMON/derivatives
Cyberduck configuration (S3 anonymous):
- Protocol: S3 (Amazon Simple Storage Service)
- Server:
s3.amazonaws.com -
Path:
fcp-indi/data/Projects/INDI/SIMON/rawdatafcp-indi/data/Projects/INDI/SIMON/derivatives
Personnel:
- PI: Simon Duchesne, Ph.D., CERVO Brain Research Centre, Québec
- Senior personnel (abbr.): AmanPreet Badhwar, Desiree Lussier-Levesque
- Contact:
info@medics.ulaval.ca
License: CC BY-SA (Attribution–ShareAlike)
Typical assets in this repo include subject/session-wise listings plus any tractography-ready files generated by the Fiber Data Hub.
Download (Linux / macOS — bash)
curl -s https://api.github.com/repos/data-indi/retro/releases/tags/simon | jq -r '.assets[].browser_download_url' | xargs -n1 -P4 curl -LO
Download (Windows PowerShell 5.x)
(Invoke-RestMethod "https://api.github.com/repos/data-indi/retro/releases/tags/simon").assets | ForEach-Object { Invoke-WebRequest $_.browser_download_url -OutFile (Split-Path $_.browser_download_url -Leaf) }
MPI-Leipzig Mind-Brain-Body (LEMON & N&C)
Dataset: MPI-Leipzig Mind-Brain-Body (LEMON + Neuroanatomy & Connectivity) Participants: 318 total
Data are organized into two protocols:
LEMON (ses-01) — Structural & qT1 focus
- Participants: 228
-
Imaging:
- Quantitative T1-weighted (MP2RAGE; n≈227)
- T2-weighted (225)
- DWI (60 directions; 228)
- 15-min eyes-open resting-state fMRI (227), with geometric distortion correction for DWI / rs-fMRI
- 2D FLAIR (111, earlier part) → later replaced by high-res 3D FLAIR (SPACE) with FLAIR prep (116)
- SWI sequence updated to an in-house SWI allowing magnitude + phase and QSM
Neuroanatomy & Connectivity (N&C, ses-02) — Resting-state focus
- Participants: 199 (109 overlapping with LEMON)
-
Imaging:
- Four × 15-min eyes-open rs-fMRI runs (complete for 194; some with 1–3 runs)
- T1-weighted + low-res FLAIR for unique N&C-only participants
- Note: A multiband sequence bug caused TE = 39.4 ms for N&C rs-fMRI (vs 30 ms in LEMON)
Deep phenotyping & physiology
- Extensive cognitive, personality, and emotion questionnaires
- Behavioral tasks (e.g., CVLT, TAP, TMT, WST, LPS-2, RWT, etc.)
- Physiological measures: resting-state EEG, ECG, pulse, respiration, beat-to-beat blood pressure, anthropometry, blood / hair samples
- Clinical screening (SCID-I, HAM-D, BSL-23, AUDIT, etc.)
Complete imaging subset: 45 participants have full LEMON+N&C multimodal data (qT1, T2, 3D FLAIR, DWI, SWI/QSM, 75 min rs-fMRI, field maps).
Included assets (this repo, when present):
mpi-lemon.dti.dz,mpi-lemon.gqi.dzparticipants.tsv,qc.tsv- Subject-level
*_dwi.*.fz,*_dwi.sz, and structural NIfTIs
Upstream source: MPI-LEMON / Mind-Brain-Body INDI project pages.
Download (Linux / macOS — bash)
curl -s https://api.github.com/repos/data-indi/retro/releases/tags/mpi-lemon | jq -r '.assets[].browser_download_url' | xargs -n1 -P4 curl -LO
Download (Windows PowerShell 5.x)
(Invoke-RestMethod "https://api.github.com/repos/data-indi/retro/releases/tags/mpi-lemon").assets | ForEach-Object { Invoke-WebRequest $_.browser_download_url -OutFile (Split-Path $_.browser_download_url -Leaf) }
Beijing Normal University — Eyes Open / Eyes Closed (EO/EC)
Dataset: Beijing Normal University – Eyes Open / Eyes Closed Sample Participants: 48 healthy controls (student population, BNU)
Per participant:
-
Three 6-min resting-state fMRI scans:
- Run 1: Eyes closed
- Runs 2–3: Condition randomized between eyes open and eyes closed (counterbalanced; details in demographics)
- T1-weighted MPRAGE (defaced)
- 64-direction DTI (2 mm isotropic)
- Demographic information, including EO/EC counterbalancing
Use cases:
- State-dependent changes in resting-state networks (EO vs EC)
- Test–retest reliability across runs with varying visual conditions
- Combined structure–function connectivity using DTI + rs-fMRI
License: CC BY-NC (Attribution–NonCommercial)
Funding acknowledgment (to include in publications):
“Financial support for the data used in this project was provided by the National Natural Science Foundation of China (Grant No. 30770594) and the National High Technology Program of China (863 Program, Grant No. 2008AA02Z405).”
Upstream source: INDI / BNU EO/EC project page.
Download (Linux / macOS — bash)
curl -s https://api.github.com/repos/data-indi/retro/releases/tags/bnu-eoec | jq -r '.assets[].browser_download_url' | xargs -n1 -P4 curl -LO
Download (Windows PowerShell 5.x)
(Invoke-RestMethod "https://api.github.com/repos/data-indi/retro/releases/tags/bnu-eoec").assets | ForEach-Object { Invoke-WebRequest $_.browser_download_url -OutFile (Split-Path $_.browser_download_url -Leaf) }
Beijing Normal University — Enhanced Sample
Dataset: Beijing Normal University – Enhanced Sample Participants: 180 healthy controls (community/student population, BNU)
Compared to the earlier Beijing_Zang dataset (1000 Functional Connectomes classic), this enhanced release adds:
- IQ scores (Verbal, Performance, Full IQ; WAIS-R) for a subset of 55 participants
- 64-direction DTI for all participants
Per participant:
- 8-min resting-state fMRI (R-fMRI)
- T1-weighted MPRAGE (defaced)
- 64-direction DTI
- Basic demographics
- IQ measures (for n=55)
Important notice (overlap):
Some participants in this release also appear in the Beijing_Zang collection (FCP classic). Do not combine the two datasets without subject-level deduplication.
License: CC BY-NC (Attribution–NonCommercial)
References (abbrev.):
- Yan & Zang (2010), DPARSF toolbox, Frontiers in Systems Neuroscience 4:13.
- Tian et al. (2010), small-world networks, NeuroImage 54:191–202.
- Yan et al. (2011), DTI tractography networks, Cerebral Cortex 21(2):449–458.
Funding acknowledgment:
“Financial support for the data used in this project was provided by the National Natural Science Foundation of China (Grant No. 30770594) and the National High Technology Program of China (863 Program, Grant No. 2008AA02Z405).”
Upstream source: INDI / BNU Enhanced project page.
Download (Linux / macOS — bash)
curl -s https://api.github.com/repos/data-indi/retro/releases/tags/bnu-en | jq -r '.assets[].browser_download_url' | xargs -n1 -P4 curl -LO
Download (Windows PowerShell 5.x)
(Invoke-RestMethod "https://api.github.com/repos/data-indi/retro/releases/tags/bnu-en").assets | ForEach-Object { Invoke-WebRequest $_.browser_download_url -OutFile (Split-Path $_.browser_download_url -Leaf) }
How to cite
For all analyses using these datasets:
- Cite the original datasets / DOIs and associated primary manuscripts (e.g., CDIP, Mind-Brain-Body, BNU EO/EC, BNU Enhanced).
- Include the required funding acknowledgments from the upstream project pages.
- When using derivatives or helper artifacts from this repository, add:
“Processed artifacts and curation were prepared by the Pittsburgh Fiber Data Hub (
data-indi/retro).”
Licensing
Unless otherwise specified on the upstream pages:
- SIMON: CC BY-SA (Attribution–ShareAlike).
- BNU EO/EC & BNU Enhanced: CC BY-NC (Attribution–NonCommercial).
- MPI-Leipzig Mind-Brain-Body: per INDI project license (see project page).
You may share and adapt under the specified terms for non-commercial research, with proper attribution. Always confirm the current license on the dataset’s official page.
Known overlaps / deduplication
- BNU Enhanced ↔ Beijing_Zang (FCP Classic): participant overlap exists; perform subject-level deduplication before pooling.
- SIMON: multiple scanners and sites, but same individual; sessions must be treated as repeated measures, not independent subjects.
- MPI-Leipzig LEMON vs N&C: 109 participants overlap; account for protocol/session when modeling.
Disclaimer
- All data originate from independent sites and are redistributed as-is.
- Acquisition parameters, scanner models, and instructions (eyes open/closed, fixation, etc.) differ across datasets.
-
Users are responsible for:
- Performing adequate quality control (motion, artifacts, protocol differences).
- Choosing preprocessing pipelines suitable for multiband, QSM, and TE differences (e.g., LEMON vs N&C).
- Complying with all data use agreements and licensing terms.