(no title)
nightcracker | 4 years ago
The paper (https://journals.plos.org/plosone/article?id=10.1371/journal...) [does not mention the sensitivity or specificity of the model at all, only mentions a '91% accuracy' number]* on a biased dataset (where the number of suicidal cases is oversampled and non-suicidal cases are undersampled), without even mentioning exactly how much they over/undersampled.
* I missed the ROC curve on page 7. However it's not clear if this ROC curve was computed on the under/oversampled dataset or the original.
CrazyStat|4 years ago
There's a full ROC curve in Figure 2. Just eyeballing it, it looks like they get both sensitivity and specificity in excess of .9 in the top left corner (I didn't try to measure it precisely).
It would certainly be helpful to have more information about the over/undersampling.
nightcracker|4 years ago
civilized|4 years ago
If I hear this in an interview, I'm going to assume you do data science by blindly copying random blog posts.
aabaker99|4 years ago
[0] https://www.researchgate.net/profile/Jake-Lever/publication/...
cweill|4 years ago
Animats|4 years ago