The document describes a process for identifying patient cohorts for precision medicine research using a centralized clinical data research network (CDRN) in New York City. The CDRN merges patient records across multiple providers into a centralized database containing de-identified records for 7 million patients. When a researcher requests a specific cohort, the CDRN obtains approval from a central IRB. It then works to re-contact patients in the identified cohort to obtain consent and provides de-identified data to the researcher. This process allows researchers to more easily identify cohorts for studies, particularly rare conditions, by accessing merged records across many clinical sites.