The document proposes a self-organizing uplink-downlink optimization scheme for time-division duplex (TDD) based small cell networks. It formulates the optimization problem as a non-cooperative game where each small cell base station aims to minimize its total uplink and downlink flow delays by selecting an optimal uplink-downlink configuration. A decentralized learning algorithm is presented that allows each base station to autonomously estimate traffic loads and optimize its configuration accordingly. Simulation results show the proposed algorithm achieves significant gains in packet throughput compared to random or fixed configuration schemes, especially under asymmetric traffic loads.