Ceph is unstable, vSAN got extremely poor performance. Data center need real high end distributed storage to replace traditional disk array support mission critical applications. PhegData X here raise up to answer...
This presentation provides a basic overview of Ceph, upon which SUSE Storage is based. It discusses the various factors and trade-offs that affect the performance and other functional and non-functional properties of a software-defined storage (SDS) environment.
CEPH DAY BERLIN - DISK HEALTH PREDICTION AND RESOURCE ALLOCATION FOR CEPH BY ...Ceph Community
Ceph is intelligent. However, users usually make resource request with no guarantee because of no visibility of underlying disk health, no idea of resource availability and no prediction of future demands. Now machine learning can make it happen. We'll present how machine learning technologies help predict Ceph OSD health, predictive impact on clusters and resolutions. We'll take Kubernetes working with Ceph as an example.
This presentation provides a basic overview of Ceph, upon which SUSE Storage is based. It discusses the various factors and trade-offs that affect the performance and other functional and non-functional properties of a software-defined storage (SDS) environment.
CEPH DAY BERLIN - DISK HEALTH PREDICTION AND RESOURCE ALLOCATION FOR CEPH BY ...Ceph Community
Ceph is intelligent. However, users usually make resource request with no guarantee because of no visibility of underlying disk health, no idea of resource availability and no prediction of future demands. Now machine learning can make it happen. We'll present how machine learning technologies help predict Ceph OSD health, predictive impact on clusters and resolutions. We'll take Kubernetes working with Ceph as an example.
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...Odinot Stanislas
Après la petite intro sur le stockage distribué et la description de Ceph, Jian Zhang réalise dans cette présentation quelques benchmarks intéressants : tests séquentiels, tests random et surtout comparaison des résultats avant et après optimisations. Les paramètres de configuration touchés et optimisations (Large page numbers, Omap data sur un disque séparé, ...) apportent au minimum 2x de perf en plus.
Disaggregated Container Attached Storage - Yet Another Topology with What Pur...DoKC
The storage topology in vogue seems to cycle every few years. Internal storage is followed by centralized Storage Area Networks only to be superseded by one-size-fits-all Hyperconverged models - until scalability constraints led to distributed storage. Then comes NVMe, offering blistering speeds that all of these storage stacks struggle with. Kubernetes inspires Container Attached Storage aspiring to be the perfect model, so why is disaggregated storage now making an appearance?
This talk considers the motivations behind yet another storage topology and examines a modern, flexible architecture for delivering high-performance storage under Kubernetes.
This talk was given by Nick Connolly for DoK Day Europe @ KubeCon 2022.
Disaggregated Container Attached Storage - Yet Another Topology with What Pur...DoKC
Link: https://youtu.be/YhktX1W0geM
https://go.dok.community/slack
https://dok.community/
From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE)
The storage topology in vogue seems to cycle every few years. Internal storage is followed by centralized Storage Area Networks only to be superseded by one-size-fits-all Hyperconverged models - until scalability constraints led to distributed storage. Then comes NVMe, offering blistering speeds that all of these storage stacks struggle with. Kubernetes inspires Container Attached Storage aspiring to be the perfect model, so why is disaggregated storage now making an appearance?
This talk considers the motivations behind yet another storage topology and examines a modern, flexible architecture for delivering high-performance storage under Kubernetes.
-----
Nick Connolly is a pioneer of storage virtualisation and the Chief Scientist at DataCore, where his background in real-time computing and multiprocessing led to the creation of a world-class high-performance storage stack on Windows. He holds patents ranging from highly scalable algorithms through to data protection techniques. Recently he has been working with OpenEBS to bring the power and performance of NVMe to Kubernetes.
Percona Live 4/14/15: Leveraging open stack cinder for peak application perfo...Tesora
In this session, speakers Amrith Kumar (Tesora), Steven Walchek (SolidFire), and Chris Merz (SolidFire) discuss Cinder, the OpenStack block storage service, and OpenStack Trove.
VMworld 2013: IBM Solutions for VMware Virtual SAN VMworld
VMworld 2013
Eric Deadwyler, IBM
Joseph Russell, VMware
Learn more about VMworld and register at http://www.vmworld.com/index.jspa?src=socmed-vmworld-slideshare
First Steps with Globus Compute Multi-User EndpointsGlobus
In this presentation we will share our experiences around getting started with the Globus Compute multi-user endpoint. Working with the Pharmacology group at the University of Auckland, we have previously written an application using Globus Compute that can offload computationally expensive steps in the researcher's workflows, which they wish to manage from their familiar Windows environments, onto the NeSI (New Zealand eScience Infrastructure) cluster. Some of the challenges we have encountered were that each researcher had to set up and manage their own single-user globus compute endpoint and that the workloads had varying resource requirements (CPUs, memory and wall time) between different runs. We hope that the multi-user endpoint will help to address these challenges and share an update on our progress here.
Software Engineering, Software Consulting, Tech Lead.
Spring Boot, Spring Cloud, Spring Core, Spring JDBC, Spring Security,
Spring Transaction, Spring MVC,
Log4j, REST/SOAP WEB-SERVICES.
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Globus
The U.S. Geological Survey (USGS) has made substantial investments in meeting evolving scientific, technical, and policy driven demands on storing, managing, and delivering data. As these demands continue to grow in complexity and scale, the USGS must continue to explore innovative solutions to improve its management, curation, sharing, delivering, and preservation approaches for large-scale research data. Supporting these needs, the USGS has partnered with the University of Chicago-Globus to research and develop advanced repository components and workflows leveraging its current investment in Globus. The primary outcome of this partnership includes the development of a prototype enterprise repository, driven by USGS Data Release requirements, through exploration and implementation of the entire suite of the Globus platform offerings, including Globus Flow, Globus Auth, Globus Transfer, and Globus Search. This presentation will provide insights into this research partnership, introduce the unique requirements and challenges being addressed and provide relevant project progress.
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...Juraj Vysvader
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I didn't get rich from it but it did have 63K downloads (powered possible tens of thousands of websites).
How Does XfilesPro Ensure Security While Sharing Documents in Salesforce?XfilesPro
Worried about document security while sharing them in Salesforce? Fret no more! Here are the top-notch security standards XfilesPro upholds to ensure strong security for your Salesforce documents while sharing with internal or external people.
To learn more, read the blog: https://www.xfilespro.com/how-does-xfilespro-make-document-sharing-secure-and-seamless-in-salesforce/
Modern design is crucial in today's digital environment, and this is especially true for SharePoint intranets. The design of these digital hubs is critical to user engagement and productivity enhancement. They are the cornerstone of internal collaboration and interaction within enterprises.
SOCRadar Research Team: Latest Activities of IntelBrokerSOCRadar
The European Union Agency for Law Enforcement Cooperation (Europol) has suffered an alleged data breach after a notorious threat actor claimed to have exfiltrated data from its systems. Infamous data leaker IntelBroker posted on the even more infamous BreachForums hacking forum, saying that Europol suffered a data breach this month.
The alleged breach affected Europol agencies CCSE, EC3, Europol Platform for Experts, Law Enforcement Forum, and SIRIUS. Infiltration of these entities can disrupt ongoing investigations and compromise sensitive intelligence shared among international law enforcement agencies.
However, this is neither the first nor the last activity of IntekBroker. We have compiled for you what happened in the last few days. To track such hacker activities on dark web sources like hacker forums, private Telegram channels, and other hidden platforms where cyber threats often originate, you can check SOCRadar’s Dark Web News.
Stay Informed on Threat Actors’ Activity on the Dark Web with SOCRadar!
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Globus
The Earth System Grid Federation (ESGF) is a global network of data servers that archives and distributes the planet’s largest collection of Earth system model output for thousands of climate and environmental scientists worldwide. Many of these petabyte-scale data archives are located in proximity to large high-performance computing (HPC) or cloud computing resources, but the primary workflow for data users consists of transferring data, and applying computations on a different system. As a part of the ESGF 2.0 US project (funded by the United States Department of Energy Office of Science), we developed pre-defined data workflows, which can be run on-demand, capable of applying many data reduction and data analysis to the large ESGF data archives, transferring only the resultant analysis (ex. visualizations, smaller data files). In this talk, we will showcase a few of these workflows, highlighting how Globus Flows can be used for petabyte-scale climate analysis.
Developing Distributed High-performance Computing Capabilities of an Open Sci...Globus
COVID-19 had an unprecedented impact on scientific collaboration. The pandemic and its broad response from the scientific community has forged new relationships among public health practitioners, mathematical modelers, and scientific computing specialists, while revealing critical gaps in exploiting advanced computing systems to support urgent decision making. Informed by our team’s work in applying high-performance computing in support of public health decision makers during the COVID-19 pandemic, we present how Globus technologies are enabling the development of an open science platform for robust epidemic analysis, with the goal of collaborative, secure, distributed, on-demand, and fast time-to-solution analyses to support public health.
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisGlobus
JASMIN is the UK’s high-performance data analysis platform for environmental science, operated by STFC on behalf of the UK Natural Environment Research Council (NERC). In addition to its role in hosting the CEDA Archive (NERC’s long-term repository for climate, atmospheric science & Earth observation data in the UK), JASMIN provides a collaborative platform to a community of around 2,000 scientists in the UK and beyond, providing nearly 400 environmental science projects with working space, compute resources and tools to facilitate their work. High-performance data transfer into and out of JASMIN has always been a key feature, with many scientists bringing model outputs from supercomputers elsewhere in the UK, to analyse against observational or other model data in the CEDA Archive. A growing number of JASMIN users are now realising the benefits of using the Globus service to provide reliable and efficient data movement and other tasks in this and other contexts. Further use cases involve long-distance (intercontinental) transfers to and from JASMIN, and collecting results from a mobile atmospheric radar system, pushing data to JASMIN via a lightweight Globus deployment. We provide details of how Globus fits into our current infrastructure, our experience of the recent migration to GCSv5.4, and of our interest in developing use of the wider ecosystem of Globus services for the benefit of our user community.
Cyaniclab : Software Development Agency Portfolio.pdfCyanic lab
CyanicLab, an offshore custom software development company based in Sweden,India, Finland, is your go-to partner for startup development and innovative web design solutions. Our expert team specializes in crafting cutting-edge software tailored to meet the unique needs of startups and established enterprises alike. From conceptualization to execution, we offer comprehensive services including web and mobile app development, UI/UX design, and ongoing software maintenance. Ready to elevate your business? Contact CyanicLab today and let us propel your vision to success with our top-notch IT solutions.
TROUBLESHOOTING 9 TYPES OF OUTOFMEMORYERRORTier1 app
Even though at surface level ‘java.lang.OutOfMemoryError’ appears as one single error; underlyingly there are 9 types of OutOfMemoryError. Each type of OutOfMemoryError has different causes, diagnosis approaches and solutions. This session equips you with the knowledge, tools, and techniques needed to troubleshoot and conquer OutOfMemoryError in all its forms, ensuring smoother, more efficient Java applications.
Designing for Privacy in Amazon Web ServicesKrzysztofKkol1
Data privacy is one of the most critical issues that businesses face. This presentation shares insights on the principles and best practices for ensuring the resilience and security of your workload.
Drawing on a real-life project from the HR industry, the various challenges will be demonstrated: data protection, self-healing, business continuity, security, and transparency of data processing. This systematized approach allowed to create a secure AWS cloud infrastructure that not only met strict compliance rules but also exceeded the client's expectations.
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...Anthony Dahanne
Les Buildpacks existent depuis plus de 10 ans ! D’abord, ils étaient utilisés pour détecter et construire une application avant de la déployer sur certains PaaS. Ensuite, nous avons pu créer des images Docker (OCI) avec leur dernière génération, les Cloud Native Buildpacks (CNCF en incubation). Sont-ils une bonne alternative au Dockerfile ? Que sont les buildpacks Paketo ? Quelles communautés les soutiennent et comment ?
Venez le découvrir lors de cette session ignite
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Shahin Sheidaei
Games are powerful teaching tools, fostering hands-on engagement and fun. But they require careful consideration to succeed. Join me to explore factors in running and selecting games, ensuring they serve as effective teaching tools. Learn to maintain focus on learning objectives while playing, and how to measure the ROI of gaming in education. Discover strategies for pitching gaming to leadership. This session offers insights, tips, and examples for coaches, team leads, and enterprise leaders seeking to teach from simple to complex concepts.
Check out the webinar slides to learn more about how XfilesPro transforms Salesforce document management by leveraging its world-class applications. For more details, please connect with sales@xfilespro.com
If you want to watch the on-demand webinar, please click here: https://www.xfilespro.com/webinars/salesforce-document-management-2-0-smarter-faster-better/
2. • The most successful database machine vendor in China
• Market share ~20%, lower than Oracle Exadata, higher than Huawei FusionCube
• Focusing on performance optimization for real applications
• PhDX (PhegData X) inherit the core of database machine
• Resources pooling, strong consistency, low latency I/O, etc.
• High efficient cache engine for mix media environment
• Adding more for virtualization and container systems
• RESTful API, support OpenStack Cinder
• VMware VAAI/vVol, Docker graph driver reday
History & Backgound
3. • Replacing high-end disk array to support mission critical applications
• Scale out architecture
• Proven data center level reliability, serviceability and performance
• Traditional as well as new applications
• Oracle RAC, DB2 PureScale/DPF, Sybase, MySQL, PostgreSQL …
• Hadoop, Spark, Storm, Kafka, Druid …
• VMware, KVM, XEN, Docker, rkt ...
Targeting on …
4. • PhDX = Generic x86 hardware + S2EBS (SmartScaleEBS) software
• Hardware, nothing special, just commodity metal box
• CPU: Intel Xeon E5/E7 series, v2/v3/v4
• Flash: SATA/NVMe/PCIe SSD, NVDIMM releasing soon
• Network: GbE/10GbE/InfiniBand, Intel Omni-Path ready
• S2EBS (SmartScaleEBS)Software
• DHT based distributed system, no centric meta data node
• Block level interfaces, iSCSI, SRP, iSER and S2EBS native protocol
• RESTful API to support objective interfaces (Cinder, S3 compatible, etc.)
Inside PhDX (PhegData X)
6. BAC maintain logical volumes
Chunk
Chunk
Chunk
OSD
Chunk
Chunk
Chunk
OSD
Chunk
Chunk
Chunk
OSD
Disk PoolLogical
Volume
A logical volume is a set of chunks. The mappings are maintained by BAC module.
8. • Metadata Area
• Super Block——64KB
• Space Bitmap——2MB
• Key Space(Mapping B+ Tree)——512MB
• Data Area
OSD maintain physical disks
Key Space Data Area
Space
Bitmap
Super
Block
9. Keep different disks same usage ratio
Pool
All vOSDs
are equally
used.
Cut physical disk into vOSD (4GB by default) and it’s the actual unit of DHT ring.
3TB
OSD
8TB
OSD6TB
OSD
vOSD
vOSD
vOSD
vOSD
vOSD
vOSD
vOSD
10. Router in the middle of I/O process
APP Router
Router
Router
Router
vOSD
vOSD
vOSD
vOSD
Chunk
Chunk
Chunk
Chunk
Chunk
Chunk
Chunk
Chunk
Chunk
Chunk
Chunk
Chunk
DHT
/dev/sd*
BAC
Driver
10GbE/IB
S2EBS Native
Protocol
13. Agile redundancy control
Pool-a: 2-rep 2-rep
Pool-b: 3-rep
3-rep
2-rep 3-rep 4-rep 5-rep
Pool
Common ServerSAN control
redundancy per pool
S2EBS control
redundancy per volume
14. Benefit of volume redundancy control
Get me just 500GB
with 3-rep protected,
the rest will be good
with 2-rep protected
Get all 2-rep
protected data
up to 3-rep
Ctl per pool Ctl per volume
Preserve capacity for
each protecting level
15. Concept of safe boundary
…
Multiple disks
failure could
cause data lost
The more disks
there the more
multiple disks
failure happens
Fact A:
Fact B:
Replicas spread on
too many disks will
impact reliability
Data center requires
99.999% availability
By calculation:
2-rep protection should spread
replicas on less than 100 disks;
3-rep protection should spread
replicas on less than 500 disks
16. Safe boundary related with performance
…
Vol BVol A Vol C
…
Vol BVol A Vol C
Pool safe
boundary
Vol A safe
boundary
Vol B safe
boundary
Vol C safe
boundary
With pool redundancy ctl, safe
boundary limit simultaneous process
With volume redundancy ctl, simultaneous
process range is bigger than safe boundary
Simultaneous range
of all volumes
Simultaneous range
of all volumes
17. • EMC ScaleIO
• Still need centric meta data server, scalability is questionable.
• Ceph
• Poor performance and poor stability.
• VMware vSAN
• Extremely poor performance
• Only work with VMware vSphere
• Nutanix NDFS
• Poor performance, especially high latency
• Not block level storage
Comparison with Equivalents
18. • Performance! Performance! Performance!
• Low latency - 2ms via 10GbE or 0.2ms via InfiniBand
• Parallel processing - Up to 128 nodes serving one volume, IOps & MBps easily heat
physical limitation on host side
• Tiny overhead - 24 bits per I/O, over 99.4% physical bandwidth capable for real data
• Small footprint on host side - 8MB would be enough in most cases
• Little CPU consumption – one core can stably provide 4k~5k IOps
• Agile redundancy control per volume
• Volumes request different redundant level could be created from same pool
• No data migration nor down time, when changing protection level
• Erasing Code being support the same way in next release
PhDX key features and differences