Hi?
Thanks for releasing the GENIE v16.0-public dataset.
The `Genomic Profile Sample Counts` chart indicates the following numbers:
* Structural Variants: 172,8**76** samples
* Copy-number alterations: 148,035 samples
However, upon reviewing the `Case lists` chart (also the cases_XXXX.txt files):
* samples with Structural Variants: 172,8**67**
* samples with CNA: 178,046
While `data_cna.txt` contains 140106 columns and 140105 samples.
Based on the information provided, I would expect these figures to be consistent.
Is there a specific reason for the discrepancy?
Best
Wan
Created by Wan Shi Wanda Hi @Wanda,
So it seems that:
1. The number of samples with structural variants (SVs) in the case lists and the genomic profile sample counts are inconsistent.
2. The number of samples in the data_cna.txt file, the case list, and the genomic profile sample counts are inconsistent.
The sample counts are derived from different sources, which is why they vary. While we're currently focused on other priorities, we?re aware of the issue and are keeping it on our radar.
Best,
Chelsea HI @Wanda ,
Thanks for raising this discrepancy, we are currently working through other project priorities but will try to get to this soon.
Drop files to upload
Discrepancy between Case lists and Genomic Profile Sample Counts on cbioportal (GENIE v16.0) page is loading…