The study investigates crop yield prediction in India from 1997 to 2020 using machine learning techniques such as random forest, logistic regression, and naive bayes, focusing on environmental factors and agricultural data. It concludes that the random forest model offers high accuracy for predictions, while naive bayes shows good precision in positive predictions. The dataset utilized is comprehensive, covering a variety of crops and factors including area, production, rainfall, and fertilizer use across different states.