The document is a master's thesis focused on sound classification and detection using deep learning, specifically exploring acoustic scene classification (ASC) and sound event detection (SED) through convolutional neural networks (CNN) and recurrent neural networks (RNN). The proposed models outperform baseline results from the DCASE 2017 challenge across various datasets, with notable improvements in accuracy and error rates for ASC and SED. The research was conducted at the National Central University in Taiwan and includes detailed methodologies, experiments, and results that contribute to advancements in audio processing technology.