The document discusses optimizing deep neural networks (DNNs) for deployment on ultra-low power RISC-V cores. It describes the PULP-NN library which optimizes the computational backend for int8 arithmetic. PULP-NN maximizes data reuse, improves kernel regularity, and exploits parallelism to achieve high utilization. It also introduces DORY, a tool for tiling and code generation that formulates tiling as a constraint programming problem to maximize tile sizes while fitting memory constraints.