Kernels are functions that run on GPUs in parallel across multiple threads and blocks. Kernels are launched with an execution configuration that defines the number of blocks in the grid and the number of threads in each block, allowing thousands of threads to run simultaneously and take advantage of the GPU's parallel architecture.