Open question

When looking at the plots of the score distributions for positive and negative cases, I noticed that most approaches tend to produce a peak around one for positive cases and something relatively flat for negative cases. Can anyone figure out why this is happening? Why don't we see a peak for negative cases and a flat distribution for positive cases? My intuition is that most of us, in order to maximise AUC, have used the same (or a very similar) number of positive and negative examples in each batch. This may have prevented the models from learning the _normality_ of negative cases (too much variability). What do you think would be the right strategy to get a peaked distribution for negative examples?
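To make the question concrete, here is a minimal sketch of how the two per-class score histograms and the AUC could be computed. Everything in it is an assumption for illustration: the scores are synthetic (Beta-distributed positives, uniform negatives), not results from any submission, and the helper names `class_histograms` and `auc` are hypothetical. It also checks one thing that may be relevant: AUC depends only on the ranking of the scores, so a strictly increasing rescaling (here `s**5`, chosen arbitrarily) changes the shape of both histograms without changing the AUC.

```python
import numpy as np

def class_histograms(scores, labels, bins=10):
    """Return normalized score histograms (positives, negatives) on [0, 1]."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    edges = np.linspace(0.0, 1.0, bins + 1)
    pos, _ = np.histogram(scores[labels], bins=edges)
    neg, _ = np.histogram(scores[~labels], bins=edges)
    return pos / pos.sum(), neg / neg.sum()

def auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    pos = scores[labels][:, None]
    neg = scores[~labels][None, :]
    return float(((pos > neg) + 0.5 * (pos == neg)).mean())

# Synthetic scores, purely illustrative (assumption, not challenge data).
rng = np.random.default_rng(0)
pos_scores = rng.beta(5.0, 1.0, size=500)     # peaked near one
neg_scores = rng.uniform(0.0, 1.0, size=500)  # roughly flat
scores = np.concatenate([pos_scores, neg_scores])
labels = np.concatenate([np.ones(500, dtype=bool), np.zeros(500, dtype=bool)])

pos_hist, neg_hist = class_histograms(scores, labels)
auc_raw = auc(scores, labels)

# A strictly increasing rescaling keeps the ranking, hence the AUC,
# but moves the mass of both histograms towards zero.
rescaled = scores ** 5
pos_hist_t, neg_hist_t = class_histograms(rescaled, labels)
auc_rescaled = auc(rescaled, labels)

print("positives (raw):     ", np.round(pos_hist, 2))
print("negatives (raw):     ", np.round(neg_hist, 2))
print("positives (rescaled):", np.round(pos_hist_t, 2))
print("negatives (rescaled):", np.round(neg_hist_t, 2))
print("AUC raw / rescaled:  ", round(auc_raw, 4), round(auc_rescaled, 4))
```

In this toy setting the rescaled negatives pile up near zero while the positives flatten out, with the same AUC. If that carries over to real models, the shape of the histograms may say more about how the scores are calibrated than about how well the classes are separated.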

Created by Antonio Albiol aalbiol
Let's continue to discuss this thread here: https://www.synapse.org/#!Synapse:syn9935146/discussion/threadId=2130
