- The document describes a multiple sorting method called SketchSort for efficiently finding all pairs of similar items in large-scale datasets.
- SketchSort maps high-dimensional vector data to binary sketches while preserving distances. It then performs multiple sorting on the sketches to enumerate similar item pairs.
- Experiments show SketchSort can efficiently find neighbor pairs in large image and genetic datasets, outperforming other state-of-the-art methods. It enables applications like clustering and information retrieval in big data domains.