The document proposes a distributed adaptive opportunistic routing scheme for wireless ad hoc networks. It uses a reinforcement learning framework to route packets opportunistically even without knowledge of channel statistics or network models. This approach jointly addresses learning and routing opportunistically by exploiting transmission successes. Nodes learn to optimally explore and exploit opportunities in the network to minimize the expected average per packet cost of routing from source to destination.