"Scaling Storage with Ceph", Ross Turk, VP of Community, Inktank
Ceph is an open source distributed object store, network block device, and file system designed for reliability, performance, and scalability. It runs on commodity hardware, has no single point of failure, and is supported by the Linux kernel. This talk will describe the Ceph architecture, share its design principles, and discuss how it can be part of a cost-effective, reliable cloud stack.
Storage tiering and erasure coding in Ceph (SCaLE13x)Sage Weil
Ceph is designed around the assumption that all components of the system (disks, hosts, networks) can fail, and has traditionally leveraged replication to provide data durability and reliability. The CRUSH placement algorithm is used to allow failure domains to be defined across hosts, racks, rows, or datacenters, depending on the deployment scale and requirements.
Recent releases have added support for erasure coding, which can provide much higher data durability and lower storage overheads. However, in practice erasure codes have different performance characteristics than traditional replication and, under some workloads, come at some expense. At the same time, we have introduced a storage tiering infrastructure and cache pools that allow alternate hardware backends (like high-end flash) to be leveraged for active data sets while cold data are transparently migrated to slower backends. The combination of these two features enables a surprisingly broad range of new applications and deployment configurations.
This talk will cover a few Ceph fundamentals, discuss the new tiering and erasure coding features, and then discuss a variety of ways that the new capabilities can be leveraged.
BACD LA 2013 - Scaling Storage with Ceph buildacloud
"Scaling Storage with Ceph", Ross Turk, VP of Community, Inktank
Ceph is an open source distributed object store, network block device, and file system designed for reliability, performance, and scalability. It runs on commodity hardware, has no single point of failure, and is supported by the Linux kernel. This talk will describe the Ceph architecture, share its design principles, and discuss how it can be part of a cost-effective, reliable cloud stack.
"Scaling Storage with Ceph", Ross Turk, VP of Community, Inktank
Ceph is an open source distributed object store, network block device, and file system designed for reliability, performance, and scalability. It runs on commodity hardware, has no single point of failure, and is supported by the Linux kernel. This talk will describe the Ceph architecture, share its design principles, and discuss how it can be part of a cost-effective, reliable cloud stack.
Storage tiering and erasure coding in Ceph (SCaLE13x)Sage Weil
Ceph is designed around the assumption that all components of the system (disks, hosts, networks) can fail, and has traditionally leveraged replication to provide data durability and reliability. The CRUSH placement algorithm is used to allow failure domains to be defined across hosts, racks, rows, or datacenters, depending on the deployment scale and requirements.
Recent releases have added support for erasure coding, which can provide much higher data durability and lower storage overheads. However, in practice erasure codes have different performance characteristics than traditional replication and, under some workloads, come at some expense. At the same time, we have introduced a storage tiering infrastructure and cache pools that allow alternate hardware backends (like high-end flash) to be leveraged for active data sets while cold data are transparently migrated to slower backends. The combination of these two features enables a surprisingly broad range of new applications and deployment configurations.
This talk will cover a few Ceph fundamentals, discuss the new tiering and erasure coding features, and then discuss a variety of ways that the new capabilities can be leveraged.
BACD LA 2013 - Scaling Storage with Ceph buildacloud
"Scaling Storage with Ceph", Ross Turk, VP of Community, Inktank
Ceph is an open source distributed object store, network block device, and file system designed for reliability, performance, and scalability. It runs on commodity hardware, has no single point of failure, and is supported by the Linux kernel. This talk will describe the Ceph architecture, share its design principles, and discuss how it can be part of a cost-effective, reliable cloud stack.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Generating a custom Ruby SDK for your web service or Rails API using Smithyg2nightmarescribd
Have you ever wanted a Ruby client API to communicate with your web service? Smithy is a protocol-agnostic language for defining services and SDKs. Smithy Ruby is an implementation of Smithy that generates a Ruby SDK using a Smithy model. In this talk, we will explore Smithy and Smithy Ruby to learn how to generate custom feature-rich SDKs that can communicate with any web service, such as a Rails JSON API.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
2024 State of Marketing Report – by HubspotMarius Sescu
https://www.hubspot.com/state-of-marketing
· Scaling relationships and proving ROI
· Social media is the place for search, sales, and service
· Authentic influencer partnerships fuel brand growth
· The strongest connections happen via call, click, chat, and camera.
· Time saved with AI leads to more creative work
· Seeking: A single source of truth
· TLDR; Get on social, try AI, and align your systems.
· More human marketing, powered by robots
ChatGPT is a revolutionary addition to the world since its introduction in 2022. A big shift in the sector of information gathering and processing happened because of this chatbot. What is the story of ChatGPT? How is the bot responding to prompts and generating contents? Swipe through these slides prepared by Expeed Software, a web development company regarding the development and technical intricacies of ChatGPT!
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Generating a custom Ruby SDK for your web service or Rails API using Smithyg2nightmarescribd
Have you ever wanted a Ruby client API to communicate with your web service? Smithy is a protocol-agnostic language for defining services and SDKs. Smithy Ruby is an implementation of Smithy that generates a Ruby SDK using a Smithy model. In this talk, we will explore Smithy and Smithy Ruby to learn how to generate custom feature-rich SDKs that can communicate with any web service, such as a Rails JSON API.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
2024 State of Marketing Report – by HubspotMarius Sescu
https://www.hubspot.com/state-of-marketing
· Scaling relationships and proving ROI
· Social media is the place for search, sales, and service
· Authentic influencer partnerships fuel brand growth
· The strongest connections happen via call, click, chat, and camera.
· Time saved with AI leads to more creative work
· Seeking: A single source of truth
· TLDR; Get on social, try AI, and align your systems.
· More human marketing, powered by robots
ChatGPT is a revolutionary addition to the world since its introduction in 2022. A big shift in the sector of information gathering and processing happened because of this chatbot. What is the story of ChatGPT? How is the bot responding to prompts and generating contents? Swipe through these slides prepared by Expeed Software, a web development company regarding the development and technical intricacies of ChatGPT!
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
The realm of product design is a constantly changing environment where technology and style intersect. Every year introduces fresh challenges and exciting trends that mold the future of this captivating art form. In this piece, we delve into the significant trends set to influence the look and functionality of product design in the year 2024.
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
Mental health has been in the news quite a bit lately. Dozens of U.S. states are currently suing Meta for contributing to the youth mental health crisis by inserting addictive features into their products, while the U.S. Surgeon General is touring the nation to bring awareness to the growing epidemic of loneliness and isolation. The country has endured periods of low national morale, such as in the 1970s when high inflation and the energy crisis worsened public sentiment following the Vietnam War. The current mood, however, feels different. Gallup recently reported that national mental health is at an all-time low, with few bright spots to lift spirits.
To better understand how Americans are feeling and their attitudes towards mental health in general, ThinkNow conducted a nationally representative quantitative survey of 1,500 respondents and found some interesting differences among ethnic, age and gender groups.
Technology
For example, 52% agree that technology and social media have a negative impact on mental health, but when broken out by race, 61% of Whites felt technology had a negative effect, and only 48% of Hispanics thought it did.
While technology has helped us keep in touch with friends and family in faraway places, it appears to have degraded our ability to connect in person. Staying connected online is a double-edged sword since the same news feed that brings us pictures of the grandkids and fluffy kittens also feeds us news about the wars in Israel and Ukraine, the dysfunction in Washington, the latest mass shooting and the climate crisis.
Hispanics may have a built-in defense against the isolation technology breeds, owing to their large, multigenerational households, strong social support systems, and tendency to use social media to stay connected with relatives abroad.
Age and Gender
When asked how individuals rate their mental health, men rate it higher than women by 11 percentage points, and Baby Boomers rank it highest at 83%, saying it’s good or excellent vs. 57% of Gen Z saying the same.
Gen Z spends the most amount of time on social media, so the notion that social media negatively affects mental health appears to be correlated. Unfortunately, Gen Z is also the generation that’s least comfortable discussing mental health concerns with healthcare professionals. Only 40% of them state they’re comfortable discussing their issues with a professional compared to 60% of Millennials and 65% of Boomers.
Race Affects Attitudes
As seen in previous research conducted by ThinkNow, Asian Americans lag other groups when it comes to awareness of mental health issues. Twenty-four percent of Asian Americans believe that having a mental health issue is a sign of weakness compared to the 16% average for all groups. Asians are also considerably less likely to be aware of mental health services in their communities (42% vs. 55%) and most likely to seek out information on social media (51% vs. 35%).
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
This article is all about what AI trends will emerge in the field of creative operations in 2024. All the marketers and brand builders should be aware of these trends for their further use and save themselves some time!
A report by thenetworkone and Kurio.
The contributing experts and agencies are (in an alphabetical order): Sylwia Rytel, Social Media Supervisor, 180heartbeats + JUNG v MATT (PL), Sharlene Jenner, Vice President - Director of Engagement Strategy, Abelson Taylor (USA), Alex Casanovas, Digital Director, Atrevia (ES), Dora Beilin, Senior Social Strategist, Barrett Hoffher (USA), Min Seo, Campaign Director, Brand New Agency (KR), Deshé M. Gully, Associate Strategist, Day One Agency (USA), Francesca Trevisan, Strategist, Different (IT), Trevor Crossman, CX and Digital Transformation Director; Olivia Hussey, Strategic Planner; Simi Srinarula, Social Media Manager, The Hallway (AUS), James Hebbert, Managing Director, Hylink (CN / UK), Mundy Álvarez, Planning Director; Pedro Rojas, Social Media Manager; Pancho González, CCO, Inbrax (CH), Oana Oprea, Head of Digital Planning, Jam Session Agency (RO), Amy Bottrill, Social Account Director, Launch (UK), Gaby Arriaga, Founder, Leonardo1452 (MX), Shantesh S Row, Creative Director, Liwa (UAE), Rajesh Mehta, Chief Strategy Officer; Dhruv Gaur, Digital Planning Lead; Leonie Mergulhao, Account Supervisor - Social Media & PR, Medulla (IN), Aurelija Plioplytė, Head of Digital & Social, Not Perfect (LI), Daiana Khaidargaliyeva, Account Manager, Osaka Labs (UK / USA), Stefanie Söhnchen, Vice President Digital, PIABO Communications (DE), Elisabeth Winiartati, Managing Consultant, Head of Global Integrated Communications; Lydia Aprina, Account Manager, Integrated Marketing and Communications; Nita Prabowo, Account Manager, Integrated Marketing and Communications; Okhi, Web Developer, PNTR Group (ID), Kei Obusan, Insights Director; Daffi Ranandi, Insights Manager, Radarr (SG), Gautam Reghunath, Co-founder & CEO, Talented (IN), Donagh Humphreys, Head of Social and Digital Innovation, THINKHOUSE (IRE), Sarah Yim, Strategy Director, Zulu Alpha Kilo (CA).
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
The search marketing landscape is evolving rapidly with new technologies, and professionals, like you, rely on innovative paid search strategies to meet changing demands.
It’s important that you’re ready to implement new strategies in 2024.
Check this out and learn the top trends in paid search advertising that are expected to gain traction, so you can drive higher ROI more efficiently in 2024.
You’ll learn:
- The latest trends in AI and automation, and what this means for an evolving paid search ecosystem.
- New developments in privacy and data regulation.
- Emerging ad formats that are expected to make an impact next year.
Watch Sreekant Lanka from iQuanti and Irina Klein from OneMain Financial as they dive into the future of paid search and explore the trends, strategies, and technologies that will shape the search marketing landscape.
If you’re looking to assess your paid search strategy and design an industry-aligned plan for 2024, then this webinar is for you.
5 Public speaking tips from TED - Visualized summarySpeakerHub
From their humble beginnings in 1984, TED has grown into the world’s most powerful amplifier for speakers and thought-leaders to share their ideas. They have over 2,400 filmed talks (not including the 30,000+ TEDx videos) freely available online, and have hosted over 17,500 events around the world.
With over one billion views in a year, it’s no wonder that so many speakers are looking to TED for ideas on how to share their message more effectively.
The article “5 Public-Speaking Tips TED Gives Its Speakers”, by Carmine Gallo for Forbes, gives speakers five practical ways to connect with their audience, and effectively share their ideas on stage.
Whether you are gearing up to get on a TED stage yourself, or just want to master the skills that so many of their speakers possess, these tips and quotes from Chris Anderson, the TED Talks Curator, will encourage you to make the most impactful impression on your audience.
See the full article and more summaries like this on SpeakerHub here: https://speakerhub.com/blog/5-presentation-tips-ted-gives-its-speakers
See the original article on Forbes here:
http://www.forbes.com/forbes/welcome/?toURL=http://www.forbes.com/sites/carminegallo/2016/05/06/5-public-speaking-tips-ted-gives-its-speakers/&refURL=&referrer=#5c07a8221d9b
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
Everyone is in agreement that ChatGPT (and other generative AI tools) will shape the future of work. Yet there is little consensus on exactly how, when, and to what extent this technology will change our world.
Businesses that extract maximum value from ChatGPT will use it as a collaborative tool for everything from brainstorming to technical maintenance.
For individuals, now is the time to pinpoint the skills the future professional will need to thrive in the AI age.
Check out this presentation to understand what ChatGPT is, how it will shape the future of work, and how you can prepare to take advantage.
A brief introduction to DataScience with explaining of the concepts, algorithms, machine learning, supervised and unsupervised learning, clustering, statistics, data preprocessing, real-world applications etc.
It's part of a Data Science Corner Campaign where I will be discussing the fundamentals of DataScience, AIML, Statistics etc.
Time Management & Productivity - Best PracticesVit Horky
Here's my presentation on by proven best practices how to manage your work time effectively and how to improve your productivity. It includes practical tips and how to use tools such as Slack, Google Apps, Hubspot, Google Calendar, Gmail and others.
The six step guide to practical project managementMindGenius
The six step guide to practical project management
If you think managing projects is too difficult, think again.
We’ve stripped back project management processes to the
basics – to make it quicker and easier, without sacrificing
the vital ingredients for success.
“If you’re looking for some real-world guidance, then The Six Step Guide to Practical Project Management will help.”
Dr Andrew Makar, Tactical Project Management
2. outline
why you should care
what is it, what it's for
how it works
architecture
how you can use it
librados
radosgw
RBD, the ceph block device
distributed file system
roadmap
why we do this, who we are
3. • company that provides • distributed unified object,
professional services and block and file storage
support for Ceph
platform
• founded in 2011
• created by storage experts
• funded by DreamHost
• open source
• Mark Shuttleworth
invested $1M • in the Linux Kernel
• Sage Weil, CTO and • integrated into Cloud
creator of Ceph Platforms
5. requirements
diverse storage needs
object storage
block devices (for VMs) with snapshots, cloning
shared file system with POSIX, coherent caches
structured data... files, block devices, or objects?
scale
terabytes, petabytes, exabytes
heterogeneous hardware
reliability and fault tolerance
6. time
ease of administration
no manual data migration, load balancing
painless scaling
expansion and contraction
seamless migration
7. cost
linear function of size or performance
incremental expansion
no fork-lift upgrades
no vendor lock-in
choice of hardware
choice of software
open
10. distributed storage system
data center scale
10s to 10,000s of machines
terabytes to exabytes
fault tolerant
no single point of failure
commodity hardware
self-managing, self-healing
11. how do you design a storage system that
(really) scales?
11
12. DISK
DISK
DISK
DISK
DISK
DISK
HUMAN
HUMAN COMPUTER
COMPUTER DISK
DISK
DISK
DISK
DISK
DISK
DISK
DISK
13. DISK
DISK
DISK
DISK
HUMAN
HUMAN
DISK
DISK
HUMAN
HUMAN COMPUTER
COMPUTER DISK
DISK
DISK
DISK
HUMAN
HUMAN
DISK
DISK
DISK
DISK
14. HUMAN
HUMAN HUMAN
HUMAN
HUMAN
HUMAN
HUMAN
HUMAN DISK
DISK
HUMAN
HUMAN
HUMAN
HUMAN DISK
DISK
HUMAN
HUMAN DISK
DISK
HUMAN
HUMAN
DISK
DISK
HUMAN
HUMAN
DISK
DISK
HUMAN
HUMAN
HUMAN DISK
DISK
HUMAN (COMPUTER) )
(COMPUTER
HUMAN
HUMAN
DISK
DISK
HUMAN
HUMAN HUMAN
HUMAN DISK
DISK
HUMAN
HUMAN
HUMAN
HUMAN DISK
DISK
HUMAN
HUMAN DISK
DISK
HUMAN
HUMAN DISK
DISK
HUMAN
HUMAN
HUMAN
HUMAN DISK
DISK
HUMAN
HUMAN
HUMAN
HUMAN (actually more like this…)
15. COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
HUMAN
HUMAN
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
HUMAN
HUMAN
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
HUMAN
HUMAN
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
16. APP
APP APP
APP HOST/VM
HOST/VM CLIENT
CLIENT
RADOSGW
RADOSGW RBD
RBD CEPH FS
CEPH FS
LIBRADOS
LIBRADOS
AA bucket-based REST AA reliable and fully-
bucket-based REST reliable and fully- AA POSIX-compliant
POSIX-compliant
AA library allowing
library allowing gateway, compatible
gateway, compatible distributed block
distributed block distributed file system,
distributed file system,
apps to directly
apps to directly with S3 and Swift
with S3 and Swift device, with a a Linux
device, with Linux with a a Linux kernel
with Linux kernel
access RADOS,
access RADOS, kernel client and a a
kernel client and client and support for
client and support for
with support for
with support for QEMU/KVM driver
QEMU/KVM driver FUSE
FUSE
C, C++, Java,
C, C++, Java,
Python, Ruby,
Python, Ruby,
and PHP
and PHP
RADOS
RADOS
AA reliable, autonomous, distributed object store comprised of self-healing, self-managing,
reliable, autonomous, distributed object store comprised of self-healing, self-managing,
intelligent storage nodes
intelligent storage nodes
17. why start with objects?
more useful than (disk) blocks
names in a single flat namespace
variable size
simple API with rich semantics
more scalable than files
no hard-to-distribute hierarchy
update semantics do not span objects
workload is trivially parallel
18. ceph object model
pools
1s to 100s
independent namespaces or object collections
replication level, placement policy
objects
bazillions
blob of data (bytes to gigabytes)
attributes (e.g., “version=12”; bytes to kilobytes)
key/value bundle (bytes to gigabytes)
19. OSD OSD OSD OSD OSD
btrfs
FS FS FS FS FS xfs
ext4
DISK DISK DISK DISK DISK
M M M
20. Object Storage Daemons (OSDs)
10s to 10000s in a cluster
one per disk, SSD, or RAID group, or ...
hardware agnostic
serve stored objects to clients
intelligently peer to perform replication
and recovery tasks
Monitors
maintain cluster membership and state
M
provide consensus for distributed
decision-making
small, odd number
these do not served stored objects to
clients
22. data distribution
all objects are replicated N times
objects are automatically placed, balanced, migrated in a
dynamic cluster
must consider physical infrastructure
ceph-osds on hosts in racks in rows in data centers
three approaches
pick a spot; remember where you put it
pick a spot; write down where you put it
calculate where to put it, where to find it
23. CRUSH
pseudo-random placement algorithm
fast calculation, no lookup
repeatable, deterministic
statistically uniform distribution
stable mapping
limited data migration on change
rule-based configuration
infrastructure topology aware
adjustable replication
allows weighting
30. APP
APP APP
APP HOST/VM
HOST/VM CLIENT
CLIENT
RADOSGW
RADOSGW RBD
RBD CEPH FS
CEPH FS
LIBRADOS
AA bucket-based REST AA reliable and fully-
bucket-based REST reliable and fully- AA POSIX-compliant
POSIX-compliant
A library allowing gateway, compatible
gateway, compatible distributed block
distributed block distributed file system,
distributed file system,
apps to directly with S3 and Swift
with S3 and Swift device, with a a Linux
device, with Linux with a a Linux kernel
with Linux kernel
access RADOS, kernel client and a a
kernel client and client and support for
client and support for
with support for QEMU/KVM driver
QEMU/KVM driver FUSE
FUSE
C, C++, Java,
Python, Ruby,
and PHP
RADOS
RADOS
AA reliable, autonomous, distributed object store comprised of self-healing, self-managing,
reliable, autonomous, distributed object store comprised of self-healing, self-managing,
intelligent storage nodes
intelligent storage nodes
32. librados
direct access to RADOS from
L
applications
C, C++, Python, PHP, Java, Erlang
direct access to storage nodes
no HTTP overhead
33. rich librados API
atomic single-object transactions
update data, attr together
compare-and-swap
efficient key/value storage inside an object
object-granularity snapshot primitives
embed code in ceph-osd daemon via plugin API
inter-client communication via object
34. APP
APP APP
APP HOST/VM
HOST/VM CLIENT
CLIENT
RADOSGW RBD
RBD CEPH FS
CEPH FS
LIBRADOS
LIBRADOS
A bucket-based REST AA reliable and fully-
reliable and fully- AA POSIX-compliant
POSIX-compliant
AA library allowing
library allowing gateway, compatible distributed block
distributed block distributed file system,
distributed file system,
apps to directly
apps to directly with S3 and Swift device, with a a Linux
device, with Linux with a a Linux kernel
with Linux kernel
access RADOS,
access RADOS, kernel client and a a
kernel client and client and support for
client and support for
with support for
with support for QEMU/KVM driver
QEMU/KVM driver FUSE
FUSE
C, C++, Java,
C, C++, Java,
Python, Ruby,
Python, Ruby,
and PHP
and PHP
RADOS
RADOS
AA reliable, autonomous, distributed object store comprised of self-healing, self-managing,
reliable, autonomous, distributed object store comprised of self-healing, self-managing,
intelligent storage nodes
intelligent storage nodes
35. APP
APP APP
APP
REST
RADOSGW
RADOSGW RADOSGW
RADOSGW
LIBRADOS
LIBRADOS LIBRADOS
LIBRADOS
native
MM
MM MM
36. RADOS Gateway
REST-based object storage proxy
uses RADOS to store objects
API supports buckets, accounting
usage accounting for billing purposes
compatible with S3, Swift APIs
37. APP
APP APP
APP HOST/VM
HOST/VM CLIENT
CLIENT
RADOSGW
RADOSGW RBD CEPH FS
CEPH FS
LIBRADOS
LIBRADOS
AA bucket-based REST A reliable and fully-
bucket-based REST AA POSIX-compliant
POSIX-compliant
AA library allowing
library allowing gateway, compatible
gateway, compatible distributed block distributed file system,
distributed file system,
apps to directly
apps to directly with S3 and Swift
with S3 and Swift device, with a Linux with a a Linux kernel
with Linux kernel
access RADOS,
access RADOS, kernel client and a client and support for
client and support for
with support for
with support for QEMU/KVM driver FUSE
FUSE
C, C++, Java,
C, C++, Java,
Python, Ruby,
Python, Ruby,
and PHP
and PHP
RADOS
RADOS
AA reliable, autonomous, distributed object store comprised of self-healing, self-managing,
reliable, autonomous, distributed object store comprised of self-healing, self-managing,
intelligent storage nodes
intelligent storage nodes
38. COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER COMPUTER
COMPUTER DISK
DISK
DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
39. COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
VM
VM COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
VM
VM COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
VM
VM
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
COMPUTER
COMPUTER DISK
DISK
40. VM
VM
VIRTUALIZATION CONTAINER
VIRTUALIZATION CONTAINER
LIBRBD
LIBRBD
LIBRADOS
LIBRADOS
MM
MM MM
41. CONTAINER
CONTAINER VM
VM CONTAINER
CONTAINER
LIBRBD
LIBRBD LIBRBD
LIBRBD
LIBRADOS
LIBRADOS LIBRADOS
LIBRADOS
MM
MM MM
42. HOST
HOST
KRBD (KERNEL MODULE)
KRBD (KERNEL MODULE)
LIBRADOS
LIBRADOS
MM
MM MM
43. RADOS Block Device
storage of disk images in RADOS
decouple VM from host
images striped across entire cluster
(pool)
snapshots
copy-on-write clones
support in
Qemu/KVM
mainline Linux kernel (2.6.39+)
OpenStack, CloudStack
47. APP
APP APP
APP HOST/VM
HOST/VM CLIENT
CLIENT
RADOSGW
RADOSGW RBD
RBD CEPH FS
LIBRADOS
LIBRADOS
AA bucket-based REST AA reliable and fully-
bucket-based REST reliable and fully- A POSIX-compliant
AA library allowing
library allowing gateway, compatible
gateway, compatible distributed block
distributed block distributed file system,
apps to directly
apps to directly with S3 and Swift
with S3 and Swift device, with a a Linux
device, with Linux with a Linux kernel
access RADOS,
access RADOS, kernel client and a a
kernel client and client and support for
with support for
with support for QEMU/KVM driver
QEMU/KVM driver FUSE
C, C++, Java,
C, C++, Java,
Python, Ruby,
Python, Ruby,
and PHP
and PHP
RADOS
RADOS
AA reliable, autonomous, distributed object store comprised of self-healing, self-managing,
reliable, autonomous, distributed object store comprised of self-healing, self-managing,
intelligent storage nodes
intelligent storage nodes
48. CLIENT
CLIENT
metadata
01
01 data
10
10
MM
MM MM
50. Metadata Server (MDS)
manages metadata for POSIX shared file
system
directory hierarchy
file metadata (size, owner,
timestamps)
stores metadata in RADOS
does not serve file data to clients
only required for the shared file system
65. Inktank’s Professional Services
Consulting Services:
• Technical Overview
• Infrastructure Assessment
• Proof of Concept
• Implementation Support
• Performance Tuning
Support Subscriptions:
• Pre-Production Support
• Production Support
A full description of our services can be found at the following:
Consulting Services: http://www.inktank.com/consulting-services/
Support Subscriptions: http://www.inktank.com/support-services/
66. Check out our on-demand webinars
• Getting Started with Ceph
• Introduction to Ceph with OpenStack
• DreamHost Case Study: DreamObjects with Ceph
• Advanced Features of Ceph Distributed Storage
(delivered by Sage Weil, creator of Ceph)
All can be watched at:
http://www.inktank.com/news-events/webinars/
It was at this point that computers had to learn how to consolidate the storage available in multiple hard drives and present it in a unified way to their human patrons.
This causes a whole new set of problems. How do you prevent two people from writing the same file at the same time? How do you make sure that one person can’t see data intended for someone else?
And, as you might imagine, these solutions led to other problems. Notice a bottleneck in this picture?
Blam!! The invention of distributed storage was another very important point in the history of information storage. For the first time, every part of the system could be scaled.
RADOS is a distributed object store, and it’s the foundation for Ceph. On top of RADOS, the Ceph team has built three applications that allow you to store data and do fantastic things. But before we get into all of that, let’s start at the beginning of the story.
Let’s start with RADOS, Reliable Autonomic Distributed Object Storage. In this example, you’ve got five disks in a computer. You have initialized each disk with a filesystem (btrfs is the right filesystem to use someday, but until it’s stable we recommend XFS). On each filesystem, you deploy a Ceph OSD (Object Storage Daemon). That computer, with its five disks and five object storage daemons, becomes a single node in a RADOS cluster. Alongside these nodes are monitor nodes, which keep track of the current state of the cluster and provide users with an entry point into the cluster (although they do not serve any data themselves).
Applications wanting to store objects into RADOS interact with the cluster as a single entity.
The way CRUSH is configured is somewhat unique. Instead of defining pools for different data types, workgroups, subnets, or applications, CRUSH is configured with the physical topology of your storage network. You tell it how many buildings, rooms, shelves, racks, and nodes you have, and you tell it how you want data placed. For example, you could tell CRUSH that it’s okay to have two replicas in the same building, but not on the same power circuit. You also tell it how many copies to keep.
With CRUSH, the data is first split into a certain number of sections. These are called “placement groups”. The number of placement groups is configurable. Then, the CRUSH algorithm runs, having received the latest cluster map and a set of placement rules, and it determines where the placement group belongs in the cluster. This is a pseudo-random calculation, but it’s also repeatable; given the same cluster state and rule set, it will always return the same results.
Each placement group is run through CRUSH and stored in the cluster. Notice how no node has received more than one copy of a placement group, and no two nodes contain the same information? That’s important.
When it comes time to store an object in the cluster (or retrieve one), the client calculates where it belongs.
What happens, though, when a node goes down? The OSDs are always talking to each other (and the monitors), and they know when something is amiss. The third and fifth node on the top row have noticed that the second node on the bottom row is gone, and they are also aware that they have replicas of the missing data.
The OSDs collectively use the CRUSH algorithm to determine how the cluster should look based on its new state, and move the data to where clients running CRUSH expect it to be.
Because of the way placement is calculated instead of centrally controlled, node failures are transparent to clients.
Next, let’s talk about librados. Librados is a native C library that allows applications to work with RADOS. There are similar libraries available for C++, Java, Python, Ruby, and PHP.
So applications link with librados, allowing them to interact with RADOS through a native protocol.
The radosgw component is a REST-based interface to RADOS. It allows developers to build applications that work with Ceph through standard web services.
So, for example, an application can use a REST-based API to work with radosgw, and radosgw talks to RADOS using a native protocol. You can deploy as many gateways as you need, and you can use standard HTTP load balancers. User authentication and S3-style buckets are also supported, and applications written to work with Amazon S3 or OpenStack Swift will automatically work with radosgw by just changing their endpoint.
The RADOS Block Device (RBD) allows users to store virtual disks inside RADOS.
OK. This gets a little bit abstract now. Sometimes, people don’t want to store files or objects, they want to store entire disks. Virtual disks. Collections of data that can be assembled together and presented to a computer, which would see it as a real hard drive with platters and sectors.
… or the more modern way, which is to boot multiple virtual machines using disks from a storage cluster.
For example, you can use a virtualization container like KVM or QEMU to boot virtual machines from images that have been stored in RADOS. Images are striped across the entire cluster, which allows for simultaneous read access from different cluster nodes.
Separating a virtual computer from its storage also lets you do really neat things, like migrate a virtual machine from one server to another without rebooting it.
As an alternative, machines (even those running on bare metal) can mount an RBD image using native Linux kernel drivers.
With Ceph, copying an RBD image four times gives you five total copies…but only takes the space of one. It also happens instantly.
When they read, though, they read through to the original copy if there’s no newer data.
Finally, let’s talk about Ceph FS. Ceph FS is a parallel filesystem that provides a massively scalable, single-hierarchy, shared disk. If you use a shared drive at work, this is the same thing except that the same drive could be shared by everyone you’ve ever met (and everyone they’ve ever met).
Remember all that meta-data we talked about in the beginning? Feels so long ago. It has to be stored somewhere! Something has to keep track of who created files, when they were created, and who has the right to access them. And something has to remember where they live within a tree. Enter MDS, the Ceph Metadata Server. Clients accessing Ceph FS data first make a request to an MDS, which provides what they need to get files from the right OSDs.
There are multiple MDSs!
If you aren’t running Ceph FS, you don’t need to deploy metadata servers.
So how do you have one tree and multiple servers?
If there’s just one MDS (which is a terrible idea), it manages metadata for the entire tree.
When the second one comes along, it will intelligently partition the work by taking a subtree.
When the third MDS arrives, it will attempt to split the tree again.
Same with the fourth.
A MDS can actually even just take a single directory or file, if the load is high enough. This all happens dynamically based on load and the structure of the data, and it’s called “dynamic subtree partitioning”.
Which components of Ceph are you using today? Which components of Ceph are you interested in using in the future? What additional functionality would you like to see in Ceph? Free form
Technical Overview We will explain the architecture, functionality, best practices, and typical use cases. We’ll take you through code review, the technology road map and review your business goals. Infrastructure Assessment Our team will conduct an in-depth, on-site assessment of your current storage environment to fully understand your architecture, application, and workload. Inktank engineers will customize a solution for your business needs. Proof of Concept The best way to determine whether Ceph is the right solution for you is to set up your own test cluster. With our configuration, benchmarking tools and services, we can guide you to the right solution for your particular application. Implementation Support Reduce project risk by adding Ceph experts to your team. We will work with you to build the most reliable, secure, and robust storage system for your application. We can help you adapt your applications to fully leverage Ceph or extend the Ceph core to provide the features you need. Performance Tuning Through detailed measurements, we can tune the components in a production Ceph cluster to achieve the optimum configuration for your actual workload. Inktank's Pre-Production Subscriptio n is ideal for architects and administrators setting up a Ceph environment and needing support during the installation and configuration process. Features Email support channel One named company contact SLA-backed response Access to support ticketing system Unlimited support requests Inktank Production Support Silver & Gold Subscriptions