This document discusses high-throughput screening (HTS) workflows for identifying biologically active small molecules. It describes how robots are used to rapidly screen large libraries of compounds in assays and generate large datasets. Statistical and machine learning methods in R can then be used to build predictive models from these datasets to identify promising leads and guide the screening of additional compounds. Caveats regarding the applicability of models to new chemical spaces are also discussed.