This document discusses Orange's use of Hadoop and related big data technologies. It gives an overview of core Hadoop concepts such as MapReduce, HDFS, and YARN, and describes how Orange initially struggled with large-scale PageRank calculations before successfully adopting Hadoop. Orange now runs Hadoop in production across multiple clusters totaling thousands of nodes and exabytes of data. Key applications discussed include search engine ranking, customer profiling from logs, and the combination of Hadoop with NoSQL technologies such as Cassandra. Reported benefits include significantly lower costs, better scalability and robustness, and the opening of new development areas.