- The document describes a machine learning pipeline developed by Insight Data Science that analyzes textual features in content URLs to predict user engagement and guide advertisement placement.
- Keywords were extracted from the URLs and used as features for a logistic regression classifier that predicts whether users will click on advertisements.
- The model was validated on a held-out test set, with the data randomly split 50/50 into training and test sets. Precision ranged from 0.55 to 0.85 and recall from 0.4 to 0.85.
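
The steps summarized above (keyword extraction from URLs, a logistic regression classifier, a random 50/50 split, and precision/recall scoring) can be sketched in plain Python. This is a minimal illustration, not the original pipeline: the tokenization rule, the `STOPWORDS` list, the gradient-descent training loop, the 0.5 decision threshold, and all function names and example URLs are assumptions made for this sketch.

```python
import math
import random
import re
from urllib.parse import urlparse

# Hypothetical stop list; the source does not say which tokens were dropped.
STOPWORDS = {"www", "com", "html", "htm", "php", "index"}


def extract_keywords(url):
    """Split the URL path into lowercase alphabetic tokens (assumed rule)."""
    path = urlparse(url).path.lower()
    tokens = re.findall(r"[a-z]+", path)
    return [t for t in tokens if t not in STOPWORDS and len(t) > 2]


def predict_proba(weights, bias, tokens):
    """Logistic function applied to a binary bag-of-words score."""
    z = bias + sum(weights.get(t, 0.0) for t in set(tokens))
    z = max(-30.0, min(30.0, z))  # clamp to keep math.exp in range
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic_regression(samples, labels, epochs=200, lr=0.5):
    """Fit weights with stochastic gradient ascent on the log-likelihood."""
    weights, bias = {}, 0.0
    for _ in range(epochs):
        for tokens, y in zip(samples, labels):
            err = y - predict_proba(weights, bias, tokens)
            bias += lr * err
            for t in set(tokens):
                weights[t] = weights.get(t, 0.0) + lr * err
    return weights, bias


def split_half(items, seed=0):
    """Randomly split items 50/50 into training and test lists."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    mid = len(shuffled) // 2
    return shuffled[:mid], shuffled[mid:]


def precision_recall(y_true, y_pred):
    """Precision = TP / (TP + FP); recall = TP / (TP + FN)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


if __name__ == "__main__":
    # Made-up URLs and click labels, for illustration only.
    data = [
        ("https://example.com/sports/nba-finals-recap.html", 1),
        ("https://example.com/sports/soccer-transfer-news", 1),
        ("https://example.com/sports/tennis-open-results", 1),
        ("https://example.com/sports/nba-draft-preview", 1),
        ("https://example.com/finance/tax-filing-deadline", 0),
        ("https://example.com/finance/bond-market-update", 0),
        ("https://example.com/finance/tax-rate-changes", 0),
        ("https://example.com/finance/pension-fund-report", 0),
    ]
    train, test = split_half(data, seed=0)
    weights, bias = train_logistic_regression(
        [extract_keywords(u) for u, _ in train], [y for _, y in train]
    )
    preds = [
        int(predict_proba(weights, bias, extract_keywords(u)) >= 0.5)
        for u, _ in test
    ]
    print(precision_recall([y for _, y in test], preds))
```

Because the split is random, repeating the evaluation with different `seed` values gives a spread of precision and recall values rather than a single number, which is one plausible reading of why the summary reports ranges.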