Embed presentation
Downloaded 56 times















Jeff Hammerbacher manages Facebook's data team of 22 data engineers and scientists. He discusses Facebook's data infrastructure including software used like Linux, Apache, MySQL, PHP, and Memcached. Facebook serves over 10 million requests per second from over 10,000 web servers and stores session data in cookies. It uses the distributed computing framework Thrift to build most internal services. Facebook is moving from its in-house distributed log file processing system Cheetah to the open source Hadoop framework for more flexible offline batch processing and ad hoc querying of its growing trove of user data.













