The document discusses NUMA-optimized parallel breadth-first search (BFS) on multicore single-node systems. It presents a NUMA-aware hybrid BFS algorithm that partitions the graph column-wise and binds each partition to local memory, reducing both unnecessary edge traversals and remote memory accesses. Numerical results show that the NUMA-aware algorithm scales well up to 64 threads and achieves 11.15 GTEPS (giga-traversed edges per second), a 2.2x speedup over the original hybrid algorithm.
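
As a rough illustration of the idea, the sketch below combines a hybrid (top-down / bottom-up) BFS with a contiguous split of the vertex ids into partitions. It is a minimal single-threaded sketch under stated assumptions, not the algorithm evaluated in the document: the names `partition_columns`, `hybrid_bfs`, and `alpha` are hypothetical, the graph is assumed to be undirected and stored as adjacency lists, the direction switch is a simplified edge-count heuristic, and actual NUMA memory binding (which Python cannot express portably) is only indicated by a comment.

```python
from typing import List


def partition_columns(num_vertices: int, num_parts: int) -> List[range]:
    """Split vertex ids 0..n-1 into contiguous, nearly equal ranges.

    Each range stands in for one column block of the adjacency matrix.
    """
    base, extra = divmod(num_vertices, num_parts)
    parts, start = [], 0
    for p in range(num_parts):
        size = base + (1 if p < extra else 0)
        parts.append(range(start, start + size))
        start += size
    return parts


def hybrid_bfs(adj: List[List[int]], source: int,
               num_parts: int = 2, alpha: float = 4.0) -> List[int]:
    """Return a BFS parent array (-1 means unreached).

    Assumes an undirected graph, so adj[v] also lists v's in-neighbours.
    """
    n = len(adj)
    parts = partition_columns(n, num_parts)
    parent = [-1] * n
    parent[source] = source
    frontier = {source}

    while frontier:
        # Simplified switch (an assumption): go bottom-up once the frontier's
        # edges outweigh the edges of still-unvisited vertices by 1/alpha.
        frontier_edges = sum(len(adj[v]) for v in frontier)
        unvisited_edges = sum(len(adj[v]) for v in range(n) if parent[v] == -1)
        bottom_up = frontier_edges * alpha > unvisited_edges

        next_frontier = set()
        # In a NUMA-aware version, each partition would be processed by
        # threads pinned to the node whose local memory holds that partition.
        for part in parts:
            if bottom_up:
                # Each unvisited vertex looks for any parent in the frontier
                # and stops at the first hit, skipping its remaining edges.
                for v in part:
                    if parent[v] == -1:
                        for u in adj[v]:
                            if u in frontier:
                                parent[v] = u
                                next_frontier.add(v)
                                break
            else:
                # Frontier vertices claim only the neighbours this partition owns.
                for u in frontier:
                    for v in adj[u]:
                        if v in part and parent[v] == -1:
                            parent[v] = u
                            next_frontier.add(v)
        frontier = next_frontier
    return parent
```

For example, `hybrid_bfs([[1, 2], [0, 3], [0, 3], [1, 2, 4], [3]], 0)` explores a five-vertex graph with two partitions, owning vertices 0 to 2 and 3 to 4. Because every write to `parent[v]` happens while processing the partition that owns `v`, updates stay within one partition's data, which is the property a NUMA binding would exploit.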