The Anti-Entropy process used by nodetool repair is Cassandra's way of ensuring that data on disk is consistent across replicas. Over the many years of the Apache Cassandra project it has also been the biggest pain point for teams running Cassandra. With a solid repair process in place, you can be confident that deleted data will not come back to life and that data remains fully replicated when nodes fail.
In this talk, Alexander Dejanovski, Consultant at The Last Pickle, will explain how Anti-Entropy works and why it should be run on your cluster. He will discuss the different options such as "primary range" repair, sub-range repairs, and incremental repair introduced in version 2.1.
He will also introduce additional tools such as the Spotify Reaper and the range repair script, and future optimisations incremental repair could bring to the read path.
About the Speaker
Alexander DEJANOVSKI, Consultant, The Last Pickle
Alexander has been working as a software developer for the last 18 years, mainly for the French leader in express shipping. There he has led the effort to build a Cassandra-based architecture and to migrate services to it from traditional RDBMSs. He is involved in the Cassandra community through the development of a JDBC wrapper for the DataStax Java Driver. He recently joined The Last Pickle as a Cassandra consultant and now helps customers get the best out of Cassandra.