Be the first to like this
The video presenting the content for these slides and all the related materials including source code and sample data can be downloaded from this link: http://amsantac.co/blog/en/2016/09/20/balanced-image-classification-r.html.
When conducting a supervised classification with machine learning algorithms such as RandomForests, one recommended practice is to work with a balanced classification dataset. However, this recommendation is sometimes overlooked due to unawareness of its relevance or lack of knowledge about how to deal with it.
In this presentation I initially examine some of the consequences of working with an imbalanced dataset, using an image classification problem. Later I test and suggest some techniques to solve this problem.