Data
Multi-modal data at all spatial scales
Neuroscience data are heterogeneous and multi-dimensional, and it is evident that the volume, speed, and complexity of data is growing beyond the current capacity of existing analytical systems.
High-throughput molecular analyses, as well as neuroimaging and sensor technologies, are generating petabyte-sized datasets daily, and high compute speeds are required for rapid analytics at this scale. Existing environments cannot contend with the scale and diversity of data that will be required for breakthrough discovery at petabyte scale. At this scale, transferring data becomes cost- and time-prohibitive, therefore computation, tools, analysis and researchers have to be moved to the data.
The BRAIN Commons is leveraging what has been learned from the United States National Cancer Institute’s Genomic Data Commons and builds upon those data standards for managing data. This infrastructure allows for a data commons that will scale beyond the petabyte level; is interoperable; and offers co-location of data, storage and computing infrastructure facilitating data harmonization, data analysis and data sharing in the cloud.
The Graph Data Model is the central method of organization of all data ingested by the BRAIN Commons. This is designed to maintain data and metadata consistency and accommodates multiple types of heterogenous data found in brain research. This is essential so that we can enable data integration and analyses within and across cohorts, allowing our analyses to work across all projects within and across clouds. This also allows for computation to happen in a co-located place so that one can compute over the data in a safe and complaint way, taking computation to the data for multiple users simultaneously.
We no longer have to move data around; now, we move to the data.
BEAT-PD Data Release
The Michael J. Fox Foundation (MJFF) in coordination with Sage Bionetworks and the BRAIN Commons launched the The Biomarker and Endpoint Assessment to Track Parkinson’s Disease (BEAT-PD) Challenge in January 2020. Data is now available to researchers once they have completed a request process and have been approved.
Learn more and request access: BEAT-PD Data Release
These data will include:
- Preclinical Data
- Clinical Data
- Imaging Data
- Electrophysiology Data
- Genomic and other Molecular *.omic Data
- Digital Health Data
- Social Media and other Unstructured Data
Data will be stored in three zones:
Zone 1 – Public Access
Data designated as Public Access will be accessible to any user once they are a member of the BRAIN Commons.
Zone 2 – Qualified Access
Data designated as Controlled Access will be accessible to any Authorized user of the BRAIN Commons once the Data Access Committee has received, authenticated and approved a Zone Access Application form.
Zone 3 – Private Access
Data designated as Exclusive Access will be accessible only to the data contributor and authorized users specifically authorized by the data contributor.
Are you interested in contributing data to the BRAIN Commons? Learn more about How to Participate.
{ Exaptive visualization and explanation goes here }