This document contains code and calculations for measuring the overlap between the published literature on two genes, CDK2 and CDK6, using Simpson's index (also known as the overlap coefficient). It searches the PubMed database to retrieve the number of publications associated with each gene individually and with both genes together. It then calculates Simpson's index as the number of shared publications divided by the publication count of the smaller of the two sets. The resulting index value is 0.4326, indicating moderate overlap between the literature on CDK2 and CDK6.
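
The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the document's own code: it assumes the counts come from the NCBI E-utilities `esearch` endpoint and that the combined search uses a simple `AND` query. The function names `pubmed_count`, `simpson_index`, and `literature_overlap` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# NCBI E-utilities search endpoint (assumed here as the way PubMed is queried).
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def pubmed_count(term: str) -> int:
    """Return the number of PubMed records matching `term` (needs network access)."""
    query = urllib.parse.urlencode(
        {"db": "pubmed", "term": term, "retmax": 0, "retmode": "json"}
    )
    with urllib.request.urlopen(f"{ESEARCH_URL}?{query}") as response:
        payload = json.load(response)
    # The count is returned as a string inside "esearchresult".
    return int(payload["esearchresult"]["count"])


def simpson_index(n_a: int, n_b: int, n_shared: int) -> float:
    """Overlap coefficient: shared items divided by the size of the smaller set."""
    smaller = min(n_a, n_b)
    if smaller <= 0:
        raise ValueError("both sets must contain at least one publication")
    if not 0 <= n_shared <= smaller:
        raise ValueError("shared count must lie between 0 and the smaller set size")
    return n_shared / smaller


def literature_overlap(gene_a: str, gene_b: str) -> float:
    """Query PubMed for each gene and for both together, then compute the index."""
    n_a = pubmed_count(gene_a)
    n_b = pubmed_count(gene_b)
    # Assumption: the combined search is a plain boolean AND of the two terms.
    n_shared = pubmed_count(f"{gene_a} AND {gene_b}")
    return simpson_index(n_a, n_b, n_shared)
```

Calling `literature_overlap("CDK2", "CDK6")` would run the three searches and return the index. Because PubMed grows over time and the exact search terms used in the original analysis are not stated, the value returned today need not match 0.4326.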