Category Archives: classification

Enjoy R: Compare ROC curves of different multinomial classification models

When we think of a ROC curve, we usually refer it to a binary classification problem. For a multiclass case, it comes to be less used, also because it loses most of its explicative power. However, it would be good to use it also in this scenario in order to own one more weapon for diagnostics. …

Continue reading

Enjoy R: Stratified sampling and its application using dplyr

author: Davide Passaretti Simple random sampling is the most common practise when dealing with data sets which are large enough to be split into training and test set for predictive purposes. Think of classification models. You randomly extract, say, 75% of the rows, and that’s a fair technique, at least until you are quite sure that …

Continue reading