This document presents a framework for scene recognition that uses convolutional neural networks (CNNs) as feature extractors and machine learning kernels as classifiers. The framework uses a VGG dataset of 678 images in 3 categories (highway, open country, and streets). The CNN extracts features through convolution and max pooling operations, which reduce the dimensionality by a factor of 10. The extracted features are then classified with perceptrons and support vector machines (SVMs) in a parallel implementation. Results show that SVMs achieve higher accuracy than perceptrons and that accuracy increases with more training data. Future work includes task-level parallelism, a larger dataset with more categories, and a comparison of CNN features against PCA.
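To make the feature-extraction step concrete, the following is a minimal NumPy sketch of one convolution followed by non-overlapping max pooling. It is an illustration under stated assumptions, not the framework's actual network: the function names (`conv2d_valid`, `max_pool`, `extract_features`), the 32x32 input, the 3x3 averaging kernel, and the pooling size of 3 are all hypothetical choices made here so that the output is roughly one tenth the size of the input.

```python
import numpy as np


def conv2d_valid(image, kernel):
    """Slide the kernel over the image without padding ("valid" mode).

    As in most CNN code, this is a cross-correlation: the kernel is not flipped.
    """
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.empty((out_h, out_w), dtype=float)
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out


def max_pool(feature_map, size):
    """Non-overlapping max pooling with a size x size window.

    Trailing rows and columns that do not fill a whole window are dropped.
    """
    h = (feature_map.shape[0] // size) * size
    w = (feature_map.shape[1] // size) * size
    trimmed = feature_map[:h, :w]
    blocks = trimmed.reshape(h // size, size, w // size, size)
    return blocks.max(axis=(1, 3))


def extract_features(image, kernel, pool_size=3):
    """Convolve, pool, and flatten into a feature vector for a classifier."""
    return max_pool(conv2d_valid(image, kernel), pool_size).ravel()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = rng.random((32, 32))        # hypothetical grayscale input
    kernel = np.full((3, 3), 1.0 / 9)   # hypothetical averaging filter
    features = extract_features(image, kernel)
    print(image.size, features.size)    # prints "1024 100"
```

In this toy setup, 1024 input pixels become 100 features, close to a tenfold reduction. The resulting vector is the kind of input that a perceptron or SVM would then classify; the layer sizes and filters of the real network are not specified above and would differ.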