Hello!
First, I want to say thank you for putting together this really invaluable resource. It's been great to read the publications that have come form this consortium so far! I feel like I am missing something, but is there information in the Data Dictionary about the Files? From what I can tell, the dictionary only describes the files in the Tables tab. It would be really helpful to have descriptions of each of the files and their row/columns that are posted in the "current release" files.
Thank you!
Mollie
Created by Mollie Harrison molliejharrison Hi Kevin,
Thanks very much for this explanation and for adding the descriptions for other files! It's really helpful.
Best,
Mollie Dear Mollie,
Thank you for highlighting the value of the resource and we hope that the data is useful in your own research. You are correct that there were not explicit definitions for the items located in Files. We've added brief descriptions of these Files to the bottom of the Data Dictionary. Further details for deriving the methylation and transcription matrices are provided in their accompanying paper/preprints. For the variant Files, the intention was to provide a way to more efficiently download the larger files. For example, `variants_gene_copy_number.csv.gz` under Files should be the same as `variants_gene_copy_number` under Tables and therefore the data dictionary can be used from the Tables.
Hope this helps,
Kevin