This document summarizes the WSDM Cup 2017 competition on vandalism detection in Wikidata. The task is to predict whether a given Wikidata revision should be reverted. The competition provides a Wikidata dump of 72 million revisions from 2012 to 2016 for training and validation, and evaluates submissions on unseen, later revisions. The key challenges are severe class imbalance, with positives making up 0.0025% of revisions, and a 4 GB memory limit for models on the competition's evaluation platform. The document describes the features in the provided data and the models that were tried; the best-performing model was an SVM trained on hashed bag-of-words features built from revision metadata and text.
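
To make the "hashed bag-of-words plus SVM" idea concrete, here is a minimal sketch using only the Python standard library. It is not the author's implementation: the linear kernel, the Pegasos-style stochastic subgradient training loop, the `pos_weight` knob for class imbalance, the bucket count, and the `user:`/`comment:`/`text:` token names are all assumptions made for illustration. In practice a library such as scikit-learn would likely be a better fit. The point of hashing is that the feature space has a fixed size chosen up front, which keeps memory bounded regardless of vocabulary size.

```python
import random
import zlib

N_BUCKETS = 2 ** 18  # assumed hash-space size, not taken from the competition


def hash_features(text, n_buckets=N_BUCKETS):
    """Hashing trick: map whitespace tokens to a sparse {bucket: count} dict."""
    features = {}
    for token in text.lower().split():
        idx = zlib.crc32(token.encode("utf-8")) % n_buckets
        features[idx] = features.get(idx, 0.0) + 1.0
    return features


def score(weights, features):
    """Sparse dot product between the weight vector and one sample."""
    return sum(weights.get(i, 0.0) * v for i, v in features.items())


def train_linear_svm(samples, labels, epochs=25, lam=0.01, pos_weight=1.0, seed=0):
    """Pegasos-style SGD on the L2-regularized hinge loss; labels are +1 / -1.

    pos_weight scales updates for positive samples (a hypothetical way to
    counter class imbalance).
    """
    rng = random.Random(seed)
    weights = {}
    order = list(range(len(samples)))
    t = 0
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            x, y = samples[i], labels[i]
            violated = y * score(weights, x) < 1.0  # inside the margin?
            shrink = 1.0 - 1.0 / t  # equals 1 - eta * lam (regularization step)
            for k in weights:
                weights[k] *= shrink
            if violated:
                step = eta * y * (pos_weight if y > 0 else 1.0)
                for k, v in x.items():
                    weights[k] = weights.get(k, 0.0) + step * v
    return weights


def predict(weights, text):
    """Return +1 (revert) or -1 (keep) for a raw revision string."""
    return 1 if score(weights, hash_features(text)) > 0 else -1


# Toy revisions with made-up metadata and text tokens; +1 means "should be reverted".
revisions = [
    ("user:anon comment:lol text:junk text:junk", 1),
    ("user:anon comment:asdf text:junk", 1),
    ("user:registered comment:label text:berlin", -1),
    ("user:bot comment:claim text:population", -1),
]
X = [hash_features(text) for text, _ in revisions]
y = [label for _, label in revisions]
weights = train_linear_svm(X, y, pos_weight=2.0)
```

Calling `predict(weights, "user:anon text:junk")` then classifies a new revision string. Because the weights live in a sparse dictionary indexed by hash bucket, the model size is capped by `N_BUCKETS`, which is the property that matters under a fixed memory budget.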