The document discusses how to optimize split computing on IoT devices by dynamically selecting the splitting layer based on prediction confidence and computational cost. It highlights the early exit technique in deep neural networks as a way to reduce inference time and computational load, particularly in resource-constrained environments. The key variables are total inference time, compression ratio, and the costs of offloading and of computation at the splitting layer.
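To make the trade-off concrete, here is a minimal sketch of how a splitting layer might be chosen from these variables. The summary does not give the document's actual cost formula, so the additive latency model below (device compute + compressed transfer + server compute) is an assumption, as are all function names, parameter names, and the 0.9 confidence threshold. The compression ratio is assumed to mean original size divided by compressed size.

```python
from typing import Sequence


def total_inference_time(
    split: int,
    device_ms: Sequence[float],
    server_ms: Sequence[float],
    feature_bytes: Sequence[float],
    compression_ratio: float,
    bandwidth_bytes_per_ms: float,
) -> float:
    """Estimated latency (ms) when layers 0..split run on the IoT device.

    Hypothetical additive model, not the document's own formula.
    """
    # Computation cost on the device, up to and including the splitting layer.
    on_device = sum(device_ms[: split + 1])
    # Offloading cost: the split layer's output is compressed, then transmitted.
    transfer = feature_bytes[split] / compression_ratio / bandwidth_bytes_per_ms
    # Remaining layers run on the server.
    on_server = sum(server_ms[split + 1 :])
    return on_device + transfer + on_server


def choose_split_layer(
    device_ms: Sequence[float],
    server_ms: Sequence[float],
    feature_bytes: Sequence[float],
    compression_ratio: float,
    bandwidth_bytes_per_ms: float,
) -> int:
    """Return the layer index that minimizes the estimated total time."""
    return min(
        range(len(device_ms)),
        key=lambda k: total_inference_time(
            k, device_ms, server_ms, feature_bytes,
            compression_ratio, bandwidth_bytes_per_ms,
        ),
    )


def should_exit_early(confidence: float, threshold: float = 0.9) -> bool:
    """Early exit: skip offloading when the on-device prediction is confident.

    The threshold value is an illustrative assumption.
    """
    return confidence >= threshold


# Made-up per-layer profile for a four-layer network.
device_ms = [2.0, 4.0, 8.0, 16.0]
server_ms = [0.2, 0.4, 0.8, 1.6]
feature_bytes = [8000.0, 2000.0, 1000.0, 500.0]

best = choose_split_layer(device_ms, server_ms, feature_bytes,
                          compression_ratio=2.0, bandwidth_bytes_per_ms=100.0)
```

With these made-up numbers, splitting early is penalized by the large feature map that must be transmitted, and splitting late is penalized by slow on-device computation, so the minimum falls at an intermediate layer. In a dynamic setting, the selection would be re-run whenever the measured bandwidth or device load changes, and `should_exit_early` would be checked at the exit branch before any offloading takes place.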







