This document summarizes Joshua Hoffman's talk on scalable system operations at Tumblr. The talk outlines Tumblr's management stack for automating server provisioning, which includes iPXE, Invisible Touch, Collins, Phil, Kickstart, and Puppet. It describes how the tools work together in workflows for server intake and provisioning, and how they address challenges such as configuring networking and storage during installation. The talk emphasizes principles like modularity, simplicity, and avoiding breaking the operating system.
Hadoop con 2015 hadoop enables enterprise data lake, by James Chen
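The growth of the mobile Internet, social media, and smart devices has driven an explosion of information, producing large volumes of unstructured and semi-structured data. The data comes in many formats and is generated at great speed, which poses unprecedented challenges for enterprise information architecture. Faced with diverse data structures and diverse analysis tools, what architecture should we adopt to integrate them so that we can manage the data life cycle effectively and extract value from the data? The Hadoop ecosystem will undoubtedly serve as the foundational data platform in this larger architecture, realizing the enterprise Data Lake.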
How do we manage more than one thousand of Pegasus clusters - backend part, by acelyc1112009
A presentation by Wang Dan at the Apache Pegasus meetup in 2021.
Learn more about Pegasus: https://pegasus.apache.org, https://github.com/apache/incubator-pegasus
Digital China (神州数码) Jason pan future_clouddatacenterv2, by colderboy17
This document discusses trends in cloud computing and cloud data center design. It traces the evolution of cloud computing from early concepts to widespread adoption today. Key points include the growth of private clouds within large enterprises, the rise of big data and NoSQL databases, and the global expansion of cloud providers. The document also provides guidelines for designing highly scalable, efficient cloud data centers, focusing on high-density server configurations, modular designs, efficient cooling and power distribution, and reliability.
This document discusses Puppet, an open-source configuration management tool. It can be used to automate system configuration, deployment, and administration tasks across Linux, Unix, and Windows systems. Puppet uses a master-agent (client-server) architecture in which Puppet masters distribute configuration to Puppet agents. It provides a declarative language for defining system configuration and resources.
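The declarative idea in the Puppet summary above can be illustrated with a minimal sketch. This is plain Python, not Puppet's own language or API. The names `FileResource`, `Node`, and `apply` are hypothetical and exist only for this illustration. The sketch shows a catalog stating the desired state, and an apply step that changes only what differs, so that a second run makes no changes.

```python
from dataclasses import dataclass, field


@dataclass
class FileResource:
    """Hypothetical declared resource: a file and the content it should have."""
    path: str
    content: str


@dataclass
class Node:
    """Hypothetical in-memory stand-in for a managed system (no real disk access)."""
    files: dict = field(default_factory=dict)


def apply(catalog, node):
    """Converge the node toward the declared state; return the paths that changed."""
    changed = []
    for res in catalog:
        # Only act when the current state differs from the desired state.
        if node.files.get(res.path) != res.content:
            node.files[res.path] = res.content
            changed.append(res.path)
    return changed


# Desired state is declared as data; the path and content are illustrative only.
catalog = [FileResource("/etc/motd", "Managed by configuration management\n")]
node = Node()

first = apply(catalog, node)   # first run brings the node into the declared state
second = apply(catalog, node)  # second run finds nothing left to change
print(first, second)
```

The point of the design is that the catalog describes what the system should look like, not the steps to get there, which is why repeated runs are safe.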