Our approach uses the minutia cylinder-code (MCC) representation with a global consolidation stage and is carefully designed to suit the GPU architecture. Tested on a GTX 680 device, the proposed algorithm performs 1.8 million matches per second, making it applicable to real-time identification systems with databases of millions of fingerprints.
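
As a rough illustration of what this throughput implies, the search time for a single query can be estimated under two assumptions that are not stated above: that a query is matched once against every enrolled fingerprint, and that data transfer and other overheads are negligible. The database size $N = 10^{6}$ is a hypothetical value chosen only for the example; $R$ is the reported matching rate.

```latex
\[
  t \approx \frac{N}{R}
    = \frac{10^{6}\ \text{matches}}{1.8 \times 10^{6}\ \text{matches/s}}
    \approx 0.56\ \text{s}
\]
```

Under the same assumptions the estimate scales linearly with $N$, so a database of a few million fingerprints would correspond to a search time on the order of a few seconds per query.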