This document describes the development of a visual-based and audio-based human-computer interaction system using MATLAB. The researchers created a graphical user interface with buttons for color input and speech input. For color recognition, the system could detect primary colors like red, green, and blue in real-time video frames or offline images. For speech recognition, audio input was compared to pre-recorded audio files to perform operations like opening a web browser. The algorithms and results of the color detection and speech recognition are explained in detail with figures and flowcharts. The system achieved 98% accuracy in color detection and could open programs based on the detected speech input.