The document presents a survey on using genetic algorithms and hash-based techniques for record deduplication in databases faced with increasing data volumes and duplicates. It highlights the problems caused by duplicate records, such as performance degradation, quality loss, and increased costs, while proposing a genetic programming approach that leverages hash algorithms (MD5 and SHA-1) to identify and eliminate these duplicates. The findings emphasize that this methodology not only enhances data integrity but also optimizes storage efficiency in digital environments.