This document proposes a QoS-Aware Self-Adaptive RAN Overload Control (QoS-Dracon) mechanism to reduce RAN overload in LTE/LTE-A networks while considering users' QoS requirements. It prioritizes delay-sensitive users over delay-tolerant ones during the random access procedure. The mechanism uses an Access Class Barring scheme in the eNodeB to monitor RAN load and block access of delay-tolerant devices when needed. It also employs a QCI-dependent backoff scheme to spread access attempts over time for congested networks. Simulation results show the mechanism maintains low access delays for delay-sensitive users regardless of device type attempting access.