This document proposes a flash-based caching scheme called Flash as Cache Extension (FaCE) to improve database performance and recovery time. FaCE caches both clean and dirty database pages in SSDs on database transaction commit. It uses a write-optimized design with sequential writes to SSDs and a write-back synchronization policy. FaCE also leverages the non-volatility of flash caches to support faster database recovery by reading cached pages from SSDs instead of disks. Evaluation shows FaCE achieves over 3x higher throughput than existing schemes and 4x faster recovery time.
Demystifying Storage - Building large SANsDirecti Group
From http://wiki.directi.com/x/hQAa - This is a fairly detailed presentation I made at BarCamp Mumbai on building large storage networks and different SAN topologies. It covers fundamentals of selecting harddrives, RAID levels and performance of various storage architectures. This is Part I of a 3-part series
CaSSanDra: An SSD Boosted Key-Value StoreTilmann Rabl
This presentation was held by Prashanth Menon at ICDE '14 on April 3, 2014 in Chicago, IL, USA.
The full paper and additional information is available at:
http://msrg.org/papers/Menon2013
Abstract:
With the ever growing size and complexity of enterprise systems there is a pressing need for more detailed application performance management. Due to the high data rates, traditional database technology cannot sustain the required performance. Alternatives are the more lightweight and, thus, more performant key-value stores. However, these systems tend to sacrifice read performance in order to obtain the desired write throughput by avoiding random disk access in favor of fast sequential accesses.
With the advent of SSDs, built upon the philosophy of no moving parts, the boundary between sequential vs. random access is now becoming blurred. This provides a unique opportunity to extend the storage memory hierarchy using SSDs in key-value stores. In this paper, we extensively evaluate the benefits of using SSDs in commercialized key-value stores. In particular, we
investigate the performance of hybrid SSD-HDD systems and demonstrate the benefits of our SSD caching and our novel dynamic schema model.
Cassandra Day SV 2014: Designing Commodity Storage in Apache CassandraDataStax Academy
As we move into the world of Big Data and the Internet of Things, the systems architectures and data models we've relied on for decades are becoming a hindrance. At the core of the problem is the read-modify-write cycle. In this session, Al will talk about how to build systems that don't rely on RMW, with a focus on Cassandra. Finally, for those times when RMW is unavoidable, he will cover how and when to use Cassandra's lightweight transactions and collections.
Demystifying Storage - Building large SANsDirecti Group
From http://wiki.directi.com/x/hQAa - This is a fairly detailed presentation I made at BarCamp Mumbai on building large storage networks and different SAN topologies. It covers fundamentals of selecting harddrives, RAID levels and performance of various storage architectures. This is Part I of a 3-part series
CaSSanDra: An SSD Boosted Key-Value StoreTilmann Rabl
This presentation was held by Prashanth Menon at ICDE '14 on April 3, 2014 in Chicago, IL, USA.
The full paper and additional information is available at:
http://msrg.org/papers/Menon2013
Abstract:
With the ever growing size and complexity of enterprise systems there is a pressing need for more detailed application performance management. Due to the high data rates, traditional database technology cannot sustain the required performance. Alternatives are the more lightweight and, thus, more performant key-value stores. However, these systems tend to sacrifice read performance in order to obtain the desired write throughput by avoiding random disk access in favor of fast sequential accesses.
With the advent of SSDs, built upon the philosophy of no moving parts, the boundary between sequential vs. random access is now becoming blurred. This provides a unique opportunity to extend the storage memory hierarchy using SSDs in key-value stores. In this paper, we extensively evaluate the benefits of using SSDs in commercialized key-value stores. In particular, we
investigate the performance of hybrid SSD-HDD systems and demonstrate the benefits of our SSD caching and our novel dynamic schema model.
Cassandra Day SV 2014: Designing Commodity Storage in Apache CassandraDataStax Academy
As we move into the world of Big Data and the Internet of Things, the systems architectures and data models we've relied on for decades are becoming a hindrance. At the core of the problem is the read-modify-write cycle. In this session, Al will talk about how to build systems that don't rely on RMW, with a focus on Cassandra. Finally, for those times when RMW is unavoidable, he will cover how and when to use Cassandra's lightweight transactions and collections.
JetStor NAS 724UXD Dual Controller Active-Active ZFS BasedGene Leyzarovich
The JetStor NAS 724UXD is a unified / hybrid NAS storage system that consolidates NAS and IP-based iSCSI SAN in one chassis. Featuring the newest Intel Haswell platform to lower power consumption and 7x 1Gb Ethernet host ports per controller, all encompassed in a small 4U enclosure. The JetStor NAS 724UXD offers SSD Caching to boost random I/O intensive application, Snapshot, Thin Provisioning, Online Capacity Expansion and Controller-based cable-less design for excellent manageability.
Webinar Sept 22: Gluster Partners with Redapt to Deliver Scale-Out NAS StorageGlusterFS
Gluster has partnered with Redapt, Inc., an innovative data center architecture and infrastructure solutions provider, to integrate GlusterFS with hardware providing customers with highly-scalable NAS storage technology for on-premise, virtual and cloud environments. Gluster's storage technology enables Redapt to offer a comprehensive, cost-effective storage solution delivering the scalability, performance and reliability that companies need to effectively run their data centers.
This webinar will provide an overview of the partnership, benefits of the joint solution, and include use cases of how customers today are deploying the joint solution. .
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMsMarco Obinu
Come dimensionare una VM per SQL Server in Azure IaaS, alla luce delle ultime novità della piattaforma.Sessione erogata il 24 Aprile 2020, nell'ambito del Global Azure Virtual 2020.
Video sessione: https://youtu.be/7o80CJUtnh4
Demo: https://github.com/OmegaMadLab/SqlIaasVmPlayground
ARM Template ottimizzato per SQL Server: https://github.com/OmegaMadLab/OptimizedSqlVm-v2
Gluster Webinar: Introduction to GlusterFSGlusterFS
This webinar provides an introduction to GlusterFS, the leading open source, scale-out NAS file system. Learn how GlusterFS is deployed in the datacenter, in the cloud, or enables you to create a global namespace between the two.
In this Introduction to GlusterFS webinar, introduction and review of the GlusterFS architecture and key functionalities. Learn how GlusterFS is deployed in the datacenter, in the cloud, or between the two. We’ll also cover a brief update on GlusterFS v3.3 which is currently in beta.
Award winning scale-up and scale-out storage for XenGlusterFS
This webinar discusses the Gluster Virtual Storage Appliance for Xen which packages GlusterFS in a virtual machine container optimized for ease of use with little to no configuration required. The Virtual Appliance seamlessly integrates with existing virtualization environments such as Citrix Xen, allowing you to deploy virtual storage the same way you deploy virtual machines. Deploy on premise to create a private cloud using any certified Xen server hardware platforms and certified storage: JBOD, DAS, or SAN.
VMworld 2013: Extreme Performance Series: Storage in a Flash VMworld
VMworld 2013
Sankaran Sivathanu, VMware
Mark Achtemichuk, VMware
Learn more about VMworld and register at http://www.vmworld.com/index.jspa?src=socmed-vmworld-slideshare
This is a summary of my talk at Microsoft Digital Crimes Consortium 2017 Panel: "Going Offline - The Role of Network Separation in Government Civilian Networks" (March 14, 2017)
(Moderator: Kaja Ciglic, Microsoft, Panelists: Seungjoo Kim, Korea University; Ron Winward, Radware; and Erick Stephens, Microsoft)
JetStor NAS 724UXD Dual Controller Active-Active ZFS BasedGene Leyzarovich
The JetStor NAS 724UXD is a unified / hybrid NAS storage system that consolidates NAS and IP-based iSCSI SAN in one chassis. Featuring the newest Intel Haswell platform to lower power consumption and 7x 1Gb Ethernet host ports per controller, all encompassed in a small 4U enclosure. The JetStor NAS 724UXD offers SSD Caching to boost random I/O intensive application, Snapshot, Thin Provisioning, Online Capacity Expansion and Controller-based cable-less design for excellent manageability.
Webinar Sept 22: Gluster Partners with Redapt to Deliver Scale-Out NAS StorageGlusterFS
Gluster has partnered with Redapt, Inc., an innovative data center architecture and infrastructure solutions provider, to integrate GlusterFS with hardware providing customers with highly-scalable NAS storage technology for on-premise, virtual and cloud environments. Gluster's storage technology enables Redapt to offer a comprehensive, cost-effective storage solution delivering the scalability, performance and reliability that companies need to effectively run their data centers.
This webinar will provide an overview of the partnership, benefits of the joint solution, and include use cases of how customers today are deploying the joint solution. .
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMsMarco Obinu
Come dimensionare una VM per SQL Server in Azure IaaS, alla luce delle ultime novità della piattaforma.Sessione erogata il 24 Aprile 2020, nell'ambito del Global Azure Virtual 2020.
Video sessione: https://youtu.be/7o80CJUtnh4
Demo: https://github.com/OmegaMadLab/SqlIaasVmPlayground
ARM Template ottimizzato per SQL Server: https://github.com/OmegaMadLab/OptimizedSqlVm-v2
Gluster Webinar: Introduction to GlusterFSGlusterFS
This webinar provides an introduction to GlusterFS, the leading open source, scale-out NAS file system. Learn how GlusterFS is deployed in the datacenter, in the cloud, or enables you to create a global namespace between the two.
In this Introduction to GlusterFS webinar, introduction and review of the GlusterFS architecture and key functionalities. Learn how GlusterFS is deployed in the datacenter, in the cloud, or between the two. We’ll also cover a brief update on GlusterFS v3.3 which is currently in beta.
Award winning scale-up and scale-out storage for XenGlusterFS
This webinar discusses the Gluster Virtual Storage Appliance for Xen which packages GlusterFS in a virtual machine container optimized for ease of use with little to no configuration required. The Virtual Appliance seamlessly integrates with existing virtualization environments such as Citrix Xen, allowing you to deploy virtual storage the same way you deploy virtual machines. Deploy on premise to create a private cloud using any certified Xen server hardware platforms and certified storage: JBOD, DAS, or SAN.
VMworld 2013: Extreme Performance Series: Storage in a Flash VMworld
VMworld 2013
Sankaran Sivathanu, VMware
Mark Achtemichuk, VMware
Learn more about VMworld and register at http://www.vmworld.com/index.jspa?src=socmed-vmworld-slideshare
This is a summary of my talk at Microsoft Digital Crimes Consortium 2017 Panel: "Going Offline - The Role of Network Separation in Government Civilian Networks" (March 14, 2017)
(Moderator: Kaja Ciglic, Microsoft, Panelists: Seungjoo Kim, Korea University; Ron Winward, Radware; and Erick Stephens, Microsoft)
The Future of Mobile - Presented at SMX MunichEric Enge
A look at many of the important aspects of the future of mobile, including: mobile vs. desktop usage, AMP, Page Speed, Google's Mobile First initiative, Progressive Web Apps, and the continuing rise of Voice
Cassandra Day Chicago 2015: DataStax Enterprise & Apache Cassandra Hardware B...DataStax Academy
Speaker(s): Kathryn Erickson, Engineering at DataStax
During this session we will discuss varying recommended hardware configurations for DSE. We’ll get right to the point and provide quick and solid recommendations up front. After we get the main points down take a brief tour of the history of database storage and then focus on designing a storage subsystem that won't let you down.
P99CONF — What We Need to Unlearn About Persistent StorageScyllaDB
System software engineers have long been taught that disks are slow and sequential I/O is key to performance. With SSD drives I/O really got much faster but not simpler. In this brave new world of rocket-speed throughputs an engineer has to distinguish sustained workload from bursts, (still) take care about I/O buffer sizes, account for disks’ internal parallelism and study mixed I/O characteristics in advance. In this talk we will share some key performance measurements of the modern hardware we’re taking at ScyllaDB and our opinion about the implications for the database and system software design.
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)Lars Marowsky-Brée
A presentation discussing various aspects that affect performance of Ceph clusters, and how to map, model, and predict their performance.
This lays the groundwork for building a Ceph cluster measurement and benchmark suite that eventually will build up a data corpus on performance characteristics that can be used to answer these key questions:
- How to build a storage system that meets my requirements?
- If I build a system like this, what will its characteristics be?
- If I change XY in my existing system, how will its characteristics change?
The Hive Think Tank: Rocking the Database World with RocksDBThe Hive
Dhruba Borthakur, Facebook
Dhruba Borthakur is an engineer at Facebook. He has been one of the founding engineer of RocksDB, an open-source key-value store optimized for storing data in flash and main-memory storage. He has been one of the founding architects of the Apache Hadoop Distributed File System and has been instrumental in scaling Facebook's Hadoop cluster to multiples of petabytes. Dhruba has contributed code to the Apache HBase project. Earlier, he contributed to the development of the Andrew File System (AFS). He has an M.S. in Computer Science from the University of Wisconsin, Madison and a B.S. in Computer Science BITS, Pilani, India.
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Welcome to the first live UiPath Community Day Dubai! Join us for this unique occasion to meet our local and global UiPath Community and leaders. You will get a full view of the MEA region's automation landscape and the AI Powered automation technology capabilities of UiPath. Also, hosted by our local partners Marc Ellis, you will enjoy a half-day packed with industry insights and automation peers networking.
📕 Curious on our agenda? Wait no more!
10:00 Welcome note - UiPath Community in Dubai
Lovely Sinha, UiPath Community Chapter Leader, UiPath MVPx3, Hyper-automation Consultant, First Abu Dhabi Bank
10:20 A UiPath cross-region MEA overview
Ashraf El Zarka, VP and Managing Director MEA, UiPath
10:35: Customer Success Journey
Deepthi Deepak, Head of Intelligent Automation CoE, First Abu Dhabi Bank
11:15 The UiPath approach to GenAI with our three principles: improve accuracy, supercharge productivity, and automate more
Boris Krumrey, Global VP, Automation Innovation, UiPath
12:15 To discover how Marc Ellis leverages tech-driven solutions in recruitment and managed services.
Brendan Lingam, Director of Sales and Business Development, Marc Ellis
The Metaverse and AI: how can decision-makers harness the Metaverse for their...Jen Stirrup
The Metaverse is popularized in science fiction, and now it is becoming closer to being a part of our daily lives through the use of social media and shopping companies. How can businesses survive in a world where Artificial Intelligence is becoming the present as well as the future of technology, and how does the Metaverse fit into business strategy when futurist ideas are developing into reality at accelerated rates? How do we do this when our data isn't up to scratch? How can we move towards success with our data so we are set up for the Metaverse when it arrives?
How can you help your company evolve, adapt, and succeed using Artificial Intelligence and the Metaverse to stay ahead of the competition? What are the potential issues, complications, and benefits that these technologies could bring to us and our organizations? In this session, Jen Stirrup will explain how to start thinking about these technologies as an organisation.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™UiPathCommunity
In questo evento online gratuito, organizzato dalla Community Italiana di UiPath, potrai esplorare le nuove funzionalità di Autopilot, il tool che integra l'Intelligenza Artificiale nei processi di sviluppo e utilizzo delle Automazioni.
📕 Vedremo insieme alcuni esempi dell'utilizzo di Autopilot in diversi tool della Suite UiPath:
Autopilot per Studio Web
Autopilot per Studio
Autopilot per Apps
Clipboard AI
GenAI applicata alla Document Understanding
👨🏫👨💻 Speakers:
Stefano Negro, UiPath MVPx3, RPA Tech Lead @ BSP Consultant
Flavio Martinelli, UiPath MVP 2023, Technical Account Manager @UiPath
Andrei Tasca, RPA Solutions Team Lead @NTT Data
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionAggregage
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
Quantum Computing: Current Landscape and the Future Role of APIs
[G2]fa ce deview_2012
1. Flash-‐Based
Extended
Cache
for
Higher
Throughput
and
Faster
Recovery
Woon-‐hak
Kang,
Sang-‐won
Lee,
and
Bongki
Moon
12.
9.
19. 1
2. Outline
• IntroducIon
• Related
work
• Flash
as
Cache
Extension
(FaCE)
– Design
choice
– Two
opImizaIons
• Recovery
in
FaCE
• Performance
EvaluaIon
• Conclusion
2
3. Outline
• IntroducIon
• Related
work
• Flash
as
Cache
Extension
(FaCE)
– Design
choice
– Two
opImizaIons
• Recovery
in
FaCE
• Performance
EvaluaIon
• Conclusion
3
4. IntroducIon
• Flash
Memory
Solid
State
Drive(SSD
)
– NAND
flash
memory
based
non-‐volaI
le
storage
• CharacterisIcs
– No
mechanical
parts
• Low
access
latency
and
High
random
IOP
S
– MulI-‐channel
and
mulI-‐plane
• Intrinsic
parallelism,
high
concurrency
– No
overwriIng
• Erase-‐before-‐overwriIng
• Read
cost
<<
Write
cost
– Limited
life
span
• #
of
erasures
of
the
flash
block
4
Image
from
:
hXp://www.legitreviews.com/arIcle/1197/2/
5. IntroducIon(2)
• IOPS
(IOs
Per
Second)
maXers
in
OLTP
• IOPS/$:
SSDs
>>
HDDs
– e.g.
SSD
63
(=
28,495
IOPS
/
450$)
vs.
HDD
1.7
(=
409
IOPS
/
240$)
• GB/$:
HDDs
>>
SSDs
– e.g.
SSD
0.073
(=
32GB
/
440$)
vs.
HDD
0.617
(=
146.8GB
/
240$)
• Therefore,
it
is
more
sensible
to
use
SSDs
to
su
pplement
HDDs,
rather
than
to
replace
them
– SSDs
as
cache
between
RAM
and
HDDs
– To
provide
both
the
performance
of
SSDs
and
the
c
apacity
of
HDDs
as
liXle
cost
as
possible
5
6. IntroducIon(3)
• A
few
exisIng
flash-‐based
cache
schemes
– e.g.
Oracle
Exadata,
IBM,
MS
– Pages
cached
in
SSDs
are
overwriXen;
the
write
paXern
in
SS
Ds
is
random
• Write
bandwidth
disparity
in
SSDs
– e.g.
random
write
(25MB/s
=
6,314
x
4KBs/s
)
vs.
sequenIal
w
rite
(243MB/s)
vs.
4KB
Random
Throughput
( Ra=o
Sequen=al/Random
Sequen=al
Bandwidth
(MBPS)
IOPS)
write
Read Write Read Write
SSD
mid
A 28,495 6,314 251 243 9.85
SSD
mid
B 35,601 2,547 259 80 8.04
HDD
Single
409 343 156 154 114.94
HDD
Single
(x8
) 2,598 2,502 848 843 86.25 6
7. IntroducIon(4)
• FaCE
(Flash
as
Cache
Extension)
–
main
contribuIons
– Write-‐opImized
flash
cache
scheme:
e.g.
3x
higher
throughput
t
han
the
exisIng
ones
– Faster
database
recovery
support
by
exploiIng
the
non-‐volaIle
c
ache
pages
in
SSDs
for
recovery:
e.g.
4x
faster
recovery
Ime
DRAM
Random
Read
Sequen=al
Write
(Low
cost) (à
High
throughput)
Random
Read
Non-‐vola=lity
of
flash
SSD cache
for
recovery
Random
(faster
recovery)
Write
HDD
7
8. Contents
• IntroducIon
• Related
work
• Flash
as
Cache
Extension
(FaCE)
– Design
choice
– Two
opImizaIons
• Recovery
in
FaCE
• Performance
EvaluaIon
• Conclusion
8
9. Related
work
• How
to
adopt
SSDs
in
the
DBMS
area?
1. SSD
as
faster
disk
– VLDB
‘08,
Koltsidas
et
al.,
“Flashing
up
the
Storage
Layer”
– VLDB
’09,
Canim
et
al.
“An
Object
Placement
Advisor
for
DB2
Usin
g
Solid
State
Storage”
– SIGMOD
‘08,
Lee
et
al.,
"A
Case
for
Flash
Memory
SSD
in
Enterpris
e
Database
ApplicaIons"
2. SSD
as
DRAM
buffer
extension
– VLDB
’10,
Canim
et
al.,
“SSD
Bufferpool
extensions
for
Database
s
ystems”
– SIGMOD
’11,
Do
et
al.,
“Turbocharging
DBMS
Buffer
Pool
Using
SS
Ds”
9
10. Lazy
Cleaning
(LC)
[SIGMOD’11]
• Cache
on
exit
• Write-‐back
policy
• LRU-‐based
SSD
cache
replacement
policy
– To
incur
almost
random
writes
against
SSD
• No
efficient
recovery
mechanism
provided
Flash
hit
Random
writes
Evict
RAM Buffer (LRU) Flash memory SSD
Fetch
on
miss Stage
out
dirty
pages
HDD
10
11. Contents
• IntroducIon
• Related
work
• Flash
as
Cache
Extension
(FaCE)
– Design
choices
– Two
opImizaIons
• Recovery
in
FaCE
• Performance
EvaluaIon
• Conclusion
11
12. FaCE:
Design
Choices
1. When
to
cache
pages
in
SSD?
2. What
pages
to
cache
in
SSD?
3. Sync
policy
b/w
SSD
and
HDD
4. SSD
Cache
Replacement
Policy
12
13. Design
Choices:
When/What/Sync
Policy
• When
:
on
entry
vs.
on
exit
• What
:
clean
vs.
dirty
vs.
both
• Sync
policy
:
write-‐thru
vs.
write-‐back
On
exit
:
dirty
pages
athe
ell
as
Sync
policy
:
for
s
w
performance,
write-‐back
sync
clean
pages
Evict
RAM Buffer
Flash as Cache Extension
(LRU)
Fetch
on
miss Stage
out
dirty
pages
HDD
13
14. Design
Choices:
SSD
Cache
Replacement
Policy
• What
to
do
when
a
page
is
evicted
from
DRAM
buffe
r
and
SSD
cache
is
full
• LRU
vs.
FIFO
(First-‐In-‐First-‐Out)
– Write
miss:
LRU-‐based
vicIm
selecIon,
write-‐back
if
dirt
y
vicIm,
and
overwrite
the
old
vicIm
page
with
the
new
page
being
evicted
– Write
hit:
overwrite
the
old
copy
in
flash
cache
with
the
updated
page
being
evicted
Random
writes
Evict
RAM Buffer against
SSD
Flash as Cache Extension
(LRU)
HDD
14
15. Design
Choices:
SSD
Cache
Replacement
Policy
• LRU
vs.
FIFO
(First-‐In-‐First-‐Out)
– VicIms
are
chosen
from
the
rear
end
of
flash
cache
:
“sequenIal
writes”
against
SSD
– Write
hit
:
no
addiIonal
acIon
is
taken
in
order
not
to
incur
random
writes.
• mulIple
versions
in
SSD
cache
Evict
RAM Buffer
Flash as Cache Extension
(LRU)
Multi-Version FIFO
(mvFIFO)
HDD
15
16. Write
ReducIon
in
mvFIFO
• Example
– Reduce
three
writes
to
HDD
to
one
Versions
of
Page
P
Mul=ple
Choose
Invalidated
Invalidated
Write-‐back
version Discard
Vic=m
version
to
HDD
Page
P-‐v2 Page
P-‐v1
Page
P-‐v3
Evict
RAM Buffer
Flash as Cache Extension
(LRU)
HDD
16
17. Design
Choices:
SSD
Cache
Replacement
Policy
• LRU
vs.
FIFO
LRU FIFO
Write
paXern Random Sequen=al
Write
performance Low High
#
of
copy
pages Single MulIple
Space
uIlizaIon High Low
Hit
raIo
&
write
reducIon High Low
• Trade-‐off
:
hit-‐raIo
<>
write
performance
– Write
performance
benefit
by
FIFO
>>
Performance
gain
from
higher
hit
raIo
by
LRU
17
18. mvFIFO:
Two
OpImizaIons
• Group
Replacement
(GR)
– MulIple
pages
are
replaced
in
a
group
in
order
to
exploi
t
the
internal
parallelism
in
modern
SSDs
– Replacement
depth
is
limited
by
parallelism
size
(chann
el
*
plane)
– GR
can
improve
SSD
I/O
throughput
• Group
Second
Chance
(GSC)
– GR
+
Second
chance
– if
a
vicIm
candidate
page
is
valid
and
referenced,
will
re
-‐enque
the
vicIm
to
SSD
cache
• A
variant
of
“clock”
replacement
algorithm
for
the
FaCE
– GSC
can
achieve
higher
hit
raIo
and
more
write
reducI
ons
18
19. Group
Replacement
(GR)
• Single
group
read
from
SSD
(64/128
pages)
• Batch
random
writes
to
HD RAM Check
valid
and
dirty
flag
D
Flash
Cache
• Single
group
write
to
SSD
becomes
FULL
2.
Evict
RAM Buffer Flash as Cache Extension
(LRU)
1.
Fetch
on
miss
HDD
19
20. Group
Second
Chance
(GSC)
• GR
+
Second
Chance
reference
bit
is
ON
Check
reference
bit,
RAM if
true
galid
them
Check
vave
and
dirty
flag
2nd
chance
Flash
Caches
become
FULL
2.
Evict
RAM Buffer Flash as Cache Extension
(LRU)
1.
Fetch
on
miss
HDD
20
21. Contents
• IntroducIon
• Related
work
• Flash
as
Cache
Extension
(FaCE)
– Design
choice
– Two
opImizaIons
• Recovery
in
FaCE
• Performance
EvaluaIon
• Conclusion
21
22. Recovery
Issues
in
SSD
Cache
• With
write-‐back
sync
policy,
many
recent
copies
of
data
pages
ar
e
kept
in
SSD,
not
in
HDD.
• Therefore,
database
in
HDD
is
in
an
inconsistent
state
ayer
syste
m
failure
New
version
of
page
P
RAM SSD Mapping Information
(Metadata)
Crash
Inconsistent
as Cache Extension
Flash
state
Old
version
of
page
P
HDD
22
23. Recovery
Issues
in
SSD
Cache
• With
write-‐back
sync
policy,
many
recent
copies
of
data
pages
are
kept
in
SSD,
not
in
HDD.
• Therefore,
database
in
HDD
is
in
an
inconsistent
state
ayer
system
failu
re
• In
this
situa=on,
one
recovery
approach
with
flash
cache
is
to
view
da
tabase
in
harddisk
as
the
only
persistent
DB
[SIGMOD
11]
– Periodically
checkpoint
updated
pages
from
SSD
cache
as
well
as
DRAM
bu
ffer
to
HDD
New
version
of
page
P
RAM SSD Mapping Information Excessive
Checkpoint
Checkpoint
Checkpoint
Flash as Cache Extension
Cost
Persistent
DB
HDD Old
version
of
page
P 23
24. Recovery
Issues
in
SSD
Cache(2)
• Fortunately,
because
SSDs
are
non-‐vola=le,
pages
cached
in
SSD
are
al
ive
even
ayer
system
failure.
• SSD
mapping
informaIon
has
gone
• Two
approaches
for
recovering
metadata.
1. Rebuild
lost
metadata
by
scanning
the
whole
pages
cached
in
SSD
(Naïve
approach)
–
Time-‐consuming
scanning
2. Write
metadata
persistently
whenever
metadata
is
changed
[DaMon
11]
–
Run-‐Ime
overhead
for
managing
metadata
persistently
New
version
of
page
P
RAM SSD Mapping Information
Full
Scanning
Flash as Cache Extension
Flush
every
update
Persistent
DB
HDD Old
version
of
page
P 24
25. Recovery
in
FaCE
• Metadata
checkpoinIng
– Because
a
data
page
entering
SSD
cache
is
wriXen
t
o
the
rear
in
chronological
order,
metadata
can
be
wriXen
regularly
in
a
single
large
segment
64K
RAM Recovery
:
SSD Metadata page
info.
Mapping
Segment
Scanning
segment
Periodically
checkpoint
Flash as Cache Extension Crash metadata
HDD
25
26. Contents
• IntroducIon
• Related
work
• Flash
as
Cache
Extension
(FaCE)
– Design
choice
– Two
opImizaIons
• Recovery
in
FaCE
• Performance
EvaluaIon
• Conclusion
26
27. Experimental
Set-‐Up
• FaCE
ImplementaIon
in
PostgreSQL
– 3
funcIons
in
buffer
mgr.
:
bufferAlloc(),
getFreeBuffer(),
buff
erSync()
– 2
funcIons
in
bootstrap
for
recovery
:
startupXLOG(),
initBuff
erPool()
• Experiment
Setup
– Centos
Linux
– Intel
Core
i7-‐860
2.8
GHz
(quad
core)
and
4G
DRAM
– Disks
:
8
RAIDed
15k
rpm
Seagate
SAS
HDDs
(146.8GB)
– SSD
:
Samsung
MLC
(256GB)
• Workloads
– TPC-‐C
with
500
warehouses
(50GB)
and
50
concurrent
clients
– BenchmarkSQL
27
28. TransacIon
Throughput
HDD
only
SSD
only
LC
FaCE
FaCE+GR
FaCE+GSC
7000
3.9x
6000
FaCE+GSC
FaCE+GR
3.1x
5000
Transac=ons
Per
minute
2.6x
4000
FaCE-‐basic
2.6x
2.4x
2.1x
3000
1.5x
LC
2000
SSD
only
1000
HDD
only
0
4
8
12
16
20
24
28
|Flash
cache|/|Database|
(%)
28
29. Hit
RaIo,
Write
ReducIon,
and
I/O
Throughput
Flash
Cache
Hit
Ra=o
100
95
90
LC
85
FaCE+GSC
Hit
ra=o
(%)
80
75
FaCE-‐basic
70
FaCE+GR
65
60
2GB
4GB
6GB
8GB
10GB
Flash
cache
size
LC
FaCE
FaCE+GR
FaCE+GSC
29
32. Recovery
Performance
• 4.4x
faster
recovery
than
HDD
only
approac
h
Metadata
recovery
:
2
redo
Ime
:me
:
823
redo
I
186
32
33. Contents
• IntroducIon
• Related
work
• Flash
as
Cache
Extension
(FaCE)
– Design
choice
– Two
opImizaIons
• Recovery
in
FaCE
• Performance
EvaluaIon
• Conclusion
33
34. Conclusion
• We
presented
a
low-‐overhead
caching
method
called
FaCE
that
uIlizes
flash
memory
as
an
ext
ension
to
a
DRAM
buffer
for
a
recoverable
data
base.
• FaCE
can
maximized
the
I/O
throughput
of
a
fla
sh
caching
device
by
turning
small
random
writ
es
to
large
sequenIal
ones
• Also,
FaCE
takes
advantage
of
the
non-‐volaIlity
of
flash
memory
to
accelerate
the
system
resta
rt
from
a
failure.
34