This document discusses Flipkart's use of Ceph object storage to provide a petabyte-scale object storage service. Key points:
- Flipkart runs two large data centers hosting over 20,000 servers and 60,000 VMs to power its e-commerce marketplace.
- It developed a highly scalable object storage service using Ceph to store over 1.5 billion objects totaling around 2PB of data. This service provides APIs compatible with AWS S3.
- The Ceph clusters are deployed across SSDs and HDDs to provide different performance and cost tiers for shared active, backup, and archival workloads with different SLAs around latency, throughput, and durability
Redis Day Keynote Salvatore Sanfillipo Redis LabsRedis Labs
Redis' seventh birthday was recently celebrated with the community, several contributors and users. This is Salvatore's keynote as he kicked off Redis Day in Tel Aviv.
RADOS improvements and roadmap - Greg Farnum, Josh Durgin, Kefu ChaiCeph Community
Cephalocon APAC 2018
March 22-23, 2018 - Beijing, China
Greg Farnum,Red Hat RADOS Core Developer
Josh Durgin, Red Hat RADOS Lead
Kefu Chai, Red Hat Senior Software Engineer
HIgh Performance Redis- Tague Griffith, GoProRedis Labs
High Performance Redis looks at a wide range of techniques - from programming to system tuning - to deploy and maintain an extremely high performing Redis cluster. From the operational
perspective, the talk lays out multiple techniques for clustering (sharding) Redis systems and examines how the different
approaches impact performance time. The talk further examines system settings (Linux network parameters, Redis
system) and how they impact performance (both good and bad). Finally, for the developer, we look at how different ways of structuring data actually demonstrate different performance characteristics
Redis Developers Day 2014 - Redis Labs TalksRedis Labs
These are the slides that the Redis Labs team had used to accompany the session that we gave during the first ever Redis Developers Day on October 2nd, 2014, London. It includes some of the ideas we've come up with to tackle operational challenges in the hyper-dense, multi-tenants Redis deployments that our service - Redis Cloud - consists of.
Turning object storage into vm storagewim_provoost
Object Storage is today the standard to build scale-out storage. But due to technical hurdles it is impossible to run Virtual Machines directly from an Object Store. Open vStorage is the layer between the hypervisor and Object Store and turns the Object Store into a high performance, distributed, VM-centric storage platform.
My talk at ScaleConf 2017 in Cape Town on some tips and tactics for scaling WordPress, with reference to WordPress.com and the container-based VIP Go platform.
Video of my talk is here: https://www.youtube.com/watch?v=cs0DcY80spw
Redis Day Keynote Salvatore Sanfillipo Redis LabsRedis Labs
Redis' seventh birthday was recently celebrated with the community, several contributors and users. This is Salvatore's keynote as he kicked off Redis Day in Tel Aviv.
RADOS improvements and roadmap - Greg Farnum, Josh Durgin, Kefu ChaiCeph Community
Cephalocon APAC 2018
March 22-23, 2018 - Beijing, China
Greg Farnum,Red Hat RADOS Core Developer
Josh Durgin, Red Hat RADOS Lead
Kefu Chai, Red Hat Senior Software Engineer
HIgh Performance Redis- Tague Griffith, GoProRedis Labs
High Performance Redis looks at a wide range of techniques - from programming to system tuning - to deploy and maintain an extremely high performing Redis cluster. From the operational
perspective, the talk lays out multiple techniques for clustering (sharding) Redis systems and examines how the different
approaches impact performance time. The talk further examines system settings (Linux network parameters, Redis
system) and how they impact performance (both good and bad). Finally, for the developer, we look at how different ways of structuring data actually demonstrate different performance characteristics
Redis Developers Day 2014 - Redis Labs TalksRedis Labs
These are the slides that the Redis Labs team had used to accompany the session that we gave during the first ever Redis Developers Day on October 2nd, 2014, London. It includes some of the ideas we've come up with to tackle operational challenges in the hyper-dense, multi-tenants Redis deployments that our service - Redis Cloud - consists of.
Turning object storage into vm storagewim_provoost
Object Storage is today the standard to build scale-out storage. But due to technical hurdles it is impossible to run Virtual Machines directly from an Object Store. Open vStorage is the layer between the hypervisor and Object Store and turns the Object Store into a high performance, distributed, VM-centric storage platform.
My talk at ScaleConf 2017 in Cape Town on some tips and tactics for scaling WordPress, with reference to WordPress.com and the container-based VIP Go platform.
Video of my talk is here: https://www.youtube.com/watch?v=cs0DcY80spw
TechTalk: Connext DDS 5.2 - Faster and Easier Development of Industrial Internet Systems and Applications
Watch on-demand: https://youtu.be/j1G0MHC0Vwc
Introduction to memcached, a caching service designed for optimizing performance and scaling in the web stack, seen from perspective of MySQL/PHP users. Given for 2nd year students of professional bachelor in ICT at Kaho St. Lieven, Gent.
Companies with batch and stream processing pipelines need to serve the insights they glean back to their users, an often-overlooked problem that can be hard to achieve reliably and at scale. Felix GV and Yan Yan offer an overview of Venice, a new data store capable of ingesting data from Hadoop and Kafka, merging it together, replicating it globally, and serving it online at low latency.
Venice was designed to be the next-generation replacement of the Voldemort Read-Only system, with the intent to provide a broader feature set, better availability characteristics, and a more efficient architecture. Venice is designed for high-throughput ingestion from Hadoop and Kafka, and these data sources can be merged at ingestion time in order to provide semantics similar to those of a lambda architecture but with a simpler, faster, and more available read path. Robustness is a primary architectural concern and, as such, Venice provides highly available reads and writes, self-healing, stringent data validation guarantees, and the ability to roll back entire datasets in cases where bad data is pushed.
Drinking our own Champagne: How Woot, an Amazon subsidiary, uses AWS (ARC212)...Amazon Web Services
Woot, an Amazon subsidiary, specializes in offering great new product deals every day. Woot's deeply discounted deals; and signature events like the 'Woot Off 'and 'Bag of Crap' sales launch at specific times throughout the day, and the resulting spiky traffic patterns are highly correlated to revenue.
In this session, we offer an unvarnished perspective into how Woot uses services such as Amazon DynamoDB, EC2, ELB, CloudSearch, CloudFront, and SES. Learn how to architect for security and PCI for a retail website running on AWS. Dig into the technical details of a data-store comparison between DynamoDB, Mongo, Oracle, and SQLServer, to find the right solution for unique workloads. Join us as we share our musings and real-lessons learned from using a cocktail of AWS services. We encourage you to attend even if none of this makes sense or is interesting. Don't miss the opportunity to hang out with Mortimer the Woot monkey and his crew and to walk away with one of our legendary flying monkeys.
Apache Hive is a rapidly evolving project, many people are loved by the big data ecosystem. Hive continues to expand support for analytics, reporting, and bilateral queries, and the community is striving to improve support along with many other aspects and use cases. In this lecture, we introduce the latest and greatest features and optimization that appeared in this project last year. This includes benchmarks covering LLAP, Apache Druid's materialized views and integration, workload management, ACID improvements, using Hive in the cloud, and performance improvements. I will also tell you a little about what you can expect in the future.
Apache Hive is a rapidly evolving project, many people are loved by the big data ecosystem. Hive continues to expand support for analytics, reporting, and bilateral queries, and the community is striving to improve support along with many other aspects and use cases. In this lecture, we introduce the latest and greatest features and optimization that appeared in this project last year. This includes benchmarks covering LLAP, Apache Druid's materialized views and integration, workload management, ACID improvements, using Hive in the cloud, and performance improvements. I will also tell you a little about what you can expect in the future.
We're talking about serious log crunching and intelligence gathering with Elastic, Logstash, and Kibana.
ELK is an end-to-end stack for gathering structured and unstructured data from servers. It delivers insights in real time using the Kibana dashboard giving unprecedented horizontal visibility. The visualization and search tools will make your day-to-day hunting a breeze.
During this brief walkthrough of the setup, configuration, and use of the toolset, we will show you how to find the trees from the forest in today's modern cloud environments and beyond.
Ceph Object Storage at Spreadshirt (July 2015, Ceph Berlin Meetup)Jens Hadlich
At Spreadshirt we are building up our own Ceph cluster(s) to spread customer images around the earth. This presentation gives an overview on the setup, numbers as well as issues we are currently facing.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Generating a custom Ruby SDK for your web service or Rails API using Smithyg2nightmarescribd
Have you ever wanted a Ruby client API to communicate with your web service? Smithy is a protocol-agnostic language for defining services and SDKs. Smithy Ruby is an implementation of Smithy that generates a Ruby SDK using a Smithy model. In this talk, we will explore Smithy and Smithy Ruby to learn how to generate custom feature-rich SDKs that can communicate with any web service, such as a Rails JSON API.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Petabyte Scale Object Storage Service Using Ceph in A Private Cloud - Varada Kari
1. Petabyte Scale Object Storage Service using Ceph
in a Private Cloud
Flipkart Internet Pvt Ltd, Bangalore, India
1
2. • About Flipkart
• Flipkart Cloud Platform(FCP)
• Virtualized Infrastructure
• Object Store Services
• FCP Storage Team - Tech Stack
• Ceph Configuration and Deployment
• QoS(Server Side, RPS based)
• Cluster Administration and Health Monitoring
Agenda
2
3. About Flipkart
•Flipkart is India’s leading e-commerce marketplace offering
• Over 80 million products across 70+ product categories
• 100 million registered users
• 100,000 Sellers
• 8 million shipments/month
• 13 million daily visitors
•Developed a highly scalable and reliable storage infrastructure
•Object storage service backed by Ceph for ~3 years
•Runs two data centers having more than 20,000 hosts
•Completely virtualized infrastructure with over 60,000 VMs
3
4. Object Storage as a Service
• Scale-out object storage service built on CEPH for Flipkart
• Simple HTTP API - A subset of AWS S3 API
• Multiple clusters behind a single endpoint
• Key features
• Focus on durability
• Elastic for performance and capacity
• Multi-tenant with user SLAs
• Five 9's of reliability
4
5. FCP Storage - SLAs Offered
Shared-Active Workloads
● Latency sensitive e.g.
Debian packages, product
image uploads &
near-term invoices.
● Optimized for small to
medium sized objects
Backup Workloads
● Bandwidth optimized e.g.
MySQL DB backup
● Optimized for very large
objects.
● Best suited for taking
backups with limited
retention period.
Archival Workloads
● Long-term (multi-year)
retention optimized e.g.
legal compliance data
● Moderate throughput and
high latency
● Optimized for medium to
very large objects
● Explicit verification targets
for durability
5
Hot Cold
6. Shared Active Service
•Stores ~1.5 billion objects
•~750 OSDs
•Provides ~1 GB/s bandwidth utilization
•Handles ~ 500 RGW requests/sec
•~2PB of raw storage
•Hybrid deployment with SSDs and HDDs
•200+ internal customers
•Growing ~50 million objects/month
6
10. Shared Active Tech Stack
10
Clients(Invoice Service)
Elastic Load
Balancer
ELB
(Read)
ELB
(Write)
RGW
(Write)
RGW
(Write)
RGW
(Read)
RGW
(Read)
Nginx Nginx
Mon
OSD (SSD) OSD HybridRADOS Cluster
Bare Metal H/W (Debian)
Debian VMs
Clients(CDN) Clients(Repo)
Clients(Backup)
Clients(Catalogue)
Cache/Index Base Tier
VIP VIP
VIP
Boto
AWS SDK
for Java
Time Series
Based Health
Monitoring
Grafana
Open
TSDB
Alerts
11. Ceph Deployment: HW Instance
• HW VM instances for Ceph components
• Monitors
• Higher CPU cores and RAM
• RGWs
• Higher Bandwidth and CPUs
• An instance group of N RGWs
• Base Tier OSDs
• Higher Capacity, Hybrid(SSD+HDD)
• Cache Tier OSDs(SSDs)
• Index OSDs (SSDs+More RAM)
• Low latency, large capacity SSDs
• An instance has the following:
• Host OS (Debian)
• Network
• CPU (Hyper Thread cores)
• Disks (Direct attached)
• RAM
11
12. 12
QoS
QoS
RGW Rate Limit
● Per user quota policy
● Requests are rate limited per
GET, PUT, CREATE, DELETE
● Deployed per RGW instance
Elastic Load Balancer
Spread load across RGW/Nginx
Nginx Rd/Wr Separation
● Independent sets of RGW
instances
● Handle GET and PUT in
isolation
● Maintained in Nginx conf
CRUSH
Spread load across physically
separated tiers across different PDU
16. 16
Challenges faced so far...
Issue Timeouts due to lack of iops
Reason 1. Increased client and rebalancing load
2. Split and merges in filestore at the same time
3. Unable to serve I/O requests
Mitigation 1. Migrate journals to SSDs
2. Introduce SSD writeback cache tier in front of HDD OSDs to
absorb iops
3. Introduce rate limits for clients
4. Two set of RGWs to serve read and write operations
17. 17
Challenges faced so far...
Issue Service outage due to rebalancing in large metadata pools (index, log
etc)
Reason 1. Buckets with more than 100 million objects
2. RGW usage pool had high number of entries and was enabled
3. Unable to write when rebalancing happens on large metadata pools
Mitigation 1. Manual sharding of large bucket indexes
2. Disable RGW log pool
3. Introduce bucket quota
18. 18
Challenges faced so far...
Issue Write timeouts due to high and concurrent bucket create calls
Reason Contention in Leveldb to due high number of concurrent updates to a
bucket’s metadata
Mitigation Enable quota on bucket creation calls
19. 19
Challenges faced so far...
Issue Data leak
Reason Leaks with incomplete multipart uploads
Mitigation 1. Fixed : https://github.com/ceph/ceph/pull/15630
2. radosgw-admin orphan find did not scale with millions of objects
3. Developed an offline tool to delete leaked objects
20. 20
Challenges faced so far...
Issue Data loss
Reason Race in RGWCompleteMultipart
Mitigation 1. Fixed : https://github.com/ceph/ceph/pull/16732
2. Accounted loss by scanning all objects for missing parts.
21. 21
Challenges that still persist...
• Upgrades are still a pain
• Upgrade from Hammer to Jewel, issue with chown.
• Upgrading to Luminous:
• Filestore has split and merges problem.
• BlueStore solves them but upgrade path is not smooth
• Adding new osds with BlueStore backend.
• Occasional PG incomplete issue and resulting data loss to
bring PGs back online.
• http://tracker.ceph.com/issues/17002
•End-to-End QOS
25. Build Service Alert ServiceCosmos Monitoring Service
Repo Service Authorization AuthN/ AuthZ Service
Elastic Load Balancer Object Storage Service
Infrastructure as a Service (IAAS)
25
Flipkart Cloud Platform: Building Blocks
26. Object Storage As a Service
• Policy based services
• Shared Active
• Low latency, replicated, high throughput
• Hybrid Deployment of HDDs and SSDs
• Data Backup and Disaster Recovery Service
• Replicated on HDDs
• Data Archival Service
• Erasure Coded on HDDs
• Multi-tenant
• Five 9's reliable
• 200+ internal customers
26
27. Cluster Administration and Monitoring
•Nagios alerts
•Ceph Periodic stats
• Has more than 100 metrics
• Integrates to Cosmos service
• Stores data on Cosmos's OpenTS DB
• Complements Ceph manager
•Grafana
• Visual representation of Ceph Periodic Stats
• Historic and near real-time status of the service
27