It looks like the training manifest (_/data/imputation_challenge/training_data/metadata_training_data.tsv_) is missing one file that's actually in the folder: C25M22. This is also listed in _Imputation Challenge Metadata_ (https://docs.google.com/spreadsheets/d/111mzOWe1h3NSYv001xK3xBMrj9zO5dZw940qSTDhLos/edit#gid=1876022763) as training data .
Another view: 267 training files in _Imputation Challenge Metadata_, only 266 files in_ metadata_training_data.tsv_:
```
>>> impute_meta.groupby('data_split')[['filename']].count()
data_split filename
B 56
T 267
V 45
>> training_data_manifest.shape
(266, 7)
>>> validation_data_manifest.shape
(45, 7)
```
Created by Keenan Graham keenangraham There was a mistake in the manifest where the row with C25 C22 had the wrong file name.
This looked like:
training_data C25 M22 KMS-11 H3K4me3 ENCSR539TKY C25M32.bigwig
This has been fixed to the following line:
training_data C25 M22 KMS-11 H3K4me3 ENCSR539TKY C25M22.bigwig
We've uploaded an updated manifest file to Synapse.
Drop files to upload
Training manifest missing one file page is loading…