This document presents a methodology for semi-unsupervised main entity extraction from news articles using latent variables. It trains a semi-supervised model using only semantic and lexical information from raw text to automatically extract main entities from articles. The extracted entities are evaluated based on word sequence matches between the entities and news article titles, with the evaluation metric for this task needing improvement.