Child Mind Institute Healthy Brain Network (HBN)
From the About page of the HBN website:
An ongoing initiative focused on creating and sharing a biobank comprised of data from 10,000 New York City area children and adolescents (ages 5-21). The Healthy Brain Network has adopted a community-referred recruitment model. Specifically, study advertisements seek the participation of families who have concerns about one or more psychiatric symptoms in their child. The Healthy Brain Network Biobank houses data about psychiatric, behavioral, cognitive, and lifestyle (e.g., fitness, diet) phenotypes, as well as multimodal brain imaging, electroencephalography, digital voice and video recordings, genetics, and actigraphy. Beyond accelerating transdiagnostic research, we discuss the potential of the Healthy Brain Network Biobank to advance related areas, such as biophysical modeling, voice and speech analysis, natural viewing fMRI and EEG, and methods optimization.
Steps to produce this study's data dictionaries
Note: Some of the following COINS access instructions were copied from the HBN Phenotypic Data Access webpage.
- Go to the COINS Data Exchange website.
- Log in using your COINS user ID and password. If you do not have an account, select the Get Account option.
- From the main screen of the COINS Data Exchange, click on Study Information.
- Click on the drop-down for Select a study and choose CMI_HBN.
- Under Study Docs, download `all_data_dicts_Aug_2018.zip` into the `HBN/` subfolder without renaming the ZIP file.
- Unzip the ZIP file in place. The `HBN/` subfolder hierarchy should now look like this:

      HBN/
      ├── all_data_dicts_Aug_2018.zip
      ├── Data Dictionaries/
      ├── dictionary.py
      └── README.md

- Install the required Python 3 libraries with the following line of code:

      python3 -m pip install --user pandas openpyxl

- Run the following line of code within the `HBN/` subfolder:

      python3 dictionary.py
Notes about this data dictionary
- Some questionnaires (listed here) have an entry on the last line that states "Continue to", which is ignored when creating the corresponding `.json` files for those questionnaires.
- On some questionnaires the ShortName is entered as `Variable Name`, on others as `Variable`.
- Similarly, the `Description` header, which contains the description for every question on the questionnaire, was entered as `Question`, `Question`, or `Item`.
- Some questionnaires used `Value Labels` instead of `Value Label` to describe the range of possible values.
- Many of the levels from different questionnaires needed adjustments that were provided as notes in the `.xlsx` file. The code has many different `if` statements to handle the correct behaviour for each questionnaire.
- The `SWAN` questionnaire is provided twice with the same data, under the names `SWAN.xlsx` and `SWAN .xlsx`.
- Some of the levels on `SCARED_P` and `SCARED_SR` are defined as values that are `>=` a specific threshold.
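
The inconsistencies listed above could be handled with a few small normalization helpers. The sketch below is illustrative only and is not the code in `dictionary.py`: every function name, the `HEADER_ALIASES` mapping, and the canonical name `Levels` are assumptions made for this example, and it works on plain Python lists rather than on the spreadsheets themselves.

```python
import re

# Assumed mapping from the header variants described above to one canonical name.
# "Levels" is a hypothetical canonical name chosen for this sketch.
HEADER_ALIASES = {
    "variable name": "ShortName",
    "variable": "ShortName",
    "question": "Description",
    "item": "Description",
    "value labels": "Levels",
    "value label": "Levels",
}


def canonical_header(header):
    """Map a raw header to its canonical name; unknown headers pass through stripped."""
    key = " ".join(str(header).split()).lower()
    return HEADER_ALIASES.get(key, str(header).strip())


def drop_continue_row(rows):
    """Drop the last row if any of its cells starts with 'Continue to'."""
    if rows and any(
        str(cell).strip().lower().startswith("continue to") for cell in rows[-1]
    ):
        return rows[:-1]
    return rows


def dedupe_filenames(names):
    """Keep the first of any file names that differ only by whitespace or case.

    Assumption: no two distinct questionnaires differ only in whitespace.
    """
    seen = set()
    unique = []
    for name in names:
        key = re.sub(r"\s+", "", name).lower()
        if key not in seen:
            seen.add(key)
            unique.append(name)
    return unique


def parse_threshold(level):
    """Return the number in a '>= N' level as a float, or None if it is not a threshold."""
    match = re.fullmatch(r"\s*>=\s*(-?\d+(?:\.\d+)?)\s*", str(level))
    return float(match.group(1)) if match else None
```

With these helpers, `dedupe_filenames(["SWAN.xlsx", "SWAN .xlsx"])` keeps only `SWAN.xlsx`, and `parse_threshold(">= 25")` yields `25.0`, while a plain level such as `"0"` yields `None`.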