It looks like the training manifest (_/data/imputation_challenge/training_data/metadata_training_data.tsv_) is missing one file that's actually in the folder: C25M22. This is also listed in _Imputation Challenge Metadata_ (https://docs.google.com/spreadsheets/d/111mzOWe1h3NSYv001xK3xBMrj9zO5dZw940qSTDhLos/edit#gid=1876022763) as training data . Another view: 267 training files in _Imputation Challenge Metadata_, only 266 files in_ metadata_training_data.tsv_: ``` >>> impute_meta.groupby('data_split')[['filename']].count() data_split filename B 56 T 267 V 45 >> training_data_manifest.shape (266, 7) >>> validation_data_manifest.shape (45, 7) ```

Created by Keenan Graham keenangraham
There was a mistake in the manifest where the row with C25 C22 had the wrong file name. This looked like: training_data C25 M22 KMS-11 H3K4me3 ENCSR539TKY C25M32.bigwig This has been fixed to the following line: training_data C25 M22 KMS-11 H3K4me3 ENCSR539TKY C25M22.bigwig We've uploaded an updated manifest file to Synapse.

Training manifest missing one file page is loading…