This document presents a mini project on using AI to detect different objects within an image. The project uses YOLO and RCNN algorithms for object detection. YOLO allows for faster detection than other algorithms while still providing good accuracy. The proposed system uses a Caffe model dataset, deep learning classification, and blob detection for real-time object identification. Detected objects can then be converted to speech. The results discussion shows that YOLO with RCNN can accurately detect objects within images quickly. The conclusion states that combining YOLO and other techniques allows for fast and robust object detection ideal for applications requiring real-time performance.