Xgboost: A Scalable Tree Boosting System - Explained
XGBoost is a widely-used scalable tree boosting system that excels in classification, regression, and learning-to-rank tasks, often outperforming competitors. It employs decision tree boosting with optimizations like column subsampling and approximate split finding to enhance performance and reduce overfitting. Further resources and tutorials are available to aid in understanding and implementation.
Motivation
Used bymajority of winning solutions on
Kaggle, 2nd most popular method after DNN.
Also used by 10 best teams in KDDCup’15.
Applies to classification, regression and
learning-to-rank tasks.
Usually outperforms alternatives in an
out-of-the-box setting.
Combines a good theoretical foundation and
a highly efficient implementation.
So, how does it work?
Regularized Learning Objective
Firstorder gradient
of the loss function
Second order gradient
of the loss function
By additive definition
Where:
However, for example: