- A method is presented for refining a pre-trained random forest by optimizing the leaf weights while keeping the tree structures fixed.
- This reformulates the random forest as a linear classification/regression problem where samples are represented by sparse indicator vectors.
- The optimization can be performed efficiently and the refined forest has comparable or better accuracy than the original forest, but with significantly fewer trees/nodes.
- Experiments on classification and regression datasets demonstrate the proposed method outperforms other random forest techniques while accelerating training and testing.