Hi :) In the clean data set, a subset of entries does not follow the gene name | protein accession number format, e.g. blank | CON_P01966. The accession number still corresponds to a protein. Is it correct to include these variables and insert the missing gene name, or is there a reason there is no gene name? Sorry if this information is somewhere and I have missed it. Francesca

Created by Francesca Alves fralves27
These are entries in MaxQuant's potential contaminant database. Not all of the 200+ entries in this common repository of adventitious proteins are human, so we declined to annotate entries that begin with "CON_" followed by a UniProt accession with human gene symbols. The accessions that are human, however, are valid to reference with human symbols, and you may do so.
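If it helps, here is a minimal sketch of one way to fill in the missing symbols, assuming the identifiers are strings of the form `gene|accession` with an empty gene part for contaminant entries. The function name `fill_gene_symbol`, the `human_symbols` lookup table, and the `P00000`/`P11111`/`GENE1`/`GENE2` values are placeholders made up for illustration; you would need to build the real accession-to-symbol table yourself (for example from UniProt) and include only the accessions that are human.

```python
import re

# Matches the contaminant prefix ("CON_" or "CON__") at the start of an accession.
CON_PREFIX = re.compile(r"^CON_+")


def fill_gene_symbol(identifier, human_symbols):
    """Return 'gene|accession', filling a blank gene for human contaminant entries.

    human_symbols maps bare UniProt accessions to human gene symbols.
    Entries that are not in the table (e.g. non-human contaminants) stay blank.
    """
    gene, _, accession = identifier.partition("|")
    gene, accession = gene.strip(), accession.strip()

    # Leave entries alone if they already have a gene name
    # or are not contaminant entries.
    if gene or not CON_PREFIX.match(accession):
        return f"{gene}|{accession}"

    # Strip the prefix to get the bare accession, then look it up.
    uniprot = CON_PREFIX.sub("", accession)
    symbol = human_symbols.get(uniprot, "")
    return f"{symbol}|{accession}"


# Placeholder lookup table: replace with real human accessions and symbols.
human_symbols = {"P00000": "GENE1"}

ids = ["|CON__P00000", "|CON__P01966", "GENE2|P11111"]
filled = [fill_gene_symbol(i, human_symbols) for i in ids]
print(filled)
```

Entries whose accession is missing from the table keep a blank gene name, which matches the reasoning above: only the human accessions get human symbols.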

|CON__P01966 |CON__P15636 |CON__Q3ZBD7