The IDs in the training data are Ensembl IDs, while the IDs in the test data are Entrez IDs. Does that mean a different reference genome was used? Thanks, Yuanfang

Created by Yuanfang Guan (yuanfang.guan)
Dear Yuanfang Guan, All microarray expression data in the challenge are at the probe level, while the RNA-seq data have both Ensembl and Entrez IDs. Ensembl was used to stay consistent with how MMRF processes their RNA-seq data using Salmon. For microarray data, standard processing is to use the probe sets. The [webinar](syn10517990) addressed this briefly. The resources site also contains a link to org.Hs.eg.db, which is very useful for ID mapping (for R users). Regards, Mike
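
For anyone not using R, the same kind of ID mapping can be sketched in Python. This is only an illustration, not the challenge's own pipeline: the names `ENSEMBL_TO_ENTREZ`, `strip_version`, and `map_to_entrez` are hypothetical, and the two-gene table is a stand-in for a full mapping exported from an annotation resource such as org.Hs.eg.db. It also assumes that Ensembl IDs may carry a version suffix (e.g. `.16`) that should be dropped before lookup.

```python
# Hypothetical lookup table (assumption): in practice this would be a full
# Ensembl-to-Entrez mapping exported from an annotation resource.
ENSEMBL_TO_ENTREZ = {
    "ENSG00000141510": "7157",  # TP53
    "ENSG00000012048": "672",   # BRCA1
}


def strip_version(ensembl_id):
    """Drop a trailing version suffix, e.g. 'ENSG00000141510.16' -> 'ENSG00000141510'."""
    return ensembl_id.split(".", 1)[0]


def map_to_entrez(ensembl_ids, table=ENSEMBL_TO_ENTREZ):
    """Return (mapped, unmapped): a dict of input ID -> Entrez ID, and a list of IDs with no match."""
    mapped = {}
    unmapped = []
    for raw_id in ensembl_ids:
        entrez = table.get(strip_version(raw_id))
        if entrez is None:
            unmapped.append(raw_id)  # keep track of IDs that could not be mapped
        else:
            mapped[raw_id] = entrez
    return mapped, unmapped


mapped, unmapped = map_to_entrez(
    ["ENSG00000141510.16", "ENSG00000012048", "ENSG00000000000"]
)
```

Keeping the unmapped IDs in a separate list makes it easy to check how many features are lost when moving between the two ID systems.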
