Trinity is a distributed graph engine that stores graphs entirely in memory across a cluster. It allows for both online query processing with low latency and offline graph analytics with high throughput. Trinity partitions graphs across memory trunks in its memory cloud for scalability, fault tolerance, and efficient random data access for online queries. It introduces new computation paradigms beyond traditional vertex-centric models to optimize offline analytics.