This document describes a system for clustering users' points of interest (POI) visit trajectories and using the clusters to generate personalized next-POI recommendations. It clusters trajectories into topics to learn generalized tourist behavior patterns. The system models user behavior as a Markov decision process and uses inverse reinforcement learning to estimate reward functions from observed behavior. It generates two types of recommendations: cluster behavior-based recommendations that leverage cluster policies when individual data is sparse, and a hybrid approach that combines cluster and individual user models. An evaluation shows it outperforms baseline methods on metrics like reward and novelty.