The document presents a semi-automatic method for converting 2D images and videos into 3D using energy minimization techniques, focusing on the frameworks of random walks and graph cuts. The approach seeks to reduce user effort in labeling while maintaining accuracy in depth mapping, which is crucial for successful 2D to 3D conversion. Results demonstrate the effectiveness of this method in improving conversion accuracy while addressing challenges in video depth tracking and labeling.