SlideShare a Scribd company logo
1 of 23
IN-MEMORY DATABASE SYSTEMS
FOR BIG DATA MANAGEMENT.
SAP HANA DATABASE
PRESENTED BY : GEORGE JOSEPH
S7 CS ALPHA
ROLL NO-39
RSET , KERALA.
AGENDA
.ball
•Revisiting Traditional RDBMS
•Defining IMDB
•A look at a few IMDB products in the market
•SAP HANA database in detail
What is a database ?
.ball
•An organised collection of information
•Allows reading and writing .
•Provides authorisation and authentication.
•Provides some level of data safety.
Traditional RDBMS
.ball
•Developed by E F Codd in early 1970s
•This model is based on tables rows and columns
and the manipulation of data stored within.
•A Relational DB is the collection of all these table
•Example: Oracle, mysql & microsoft access
What is a database ?
.ball
•An organised collection of information
•Allows reading and writing .
•Provides authorisation and authentication.
•Provides some level of data safety.
Data store for typical RDBMS
.ball
•Data resides on disk.
•Data maybe cached into memory for access.
PROBLEM
.ball
• Existing disk-based systems can no longer offer
timely response due to the high access latency to
hard disks
•The unacceptable performance an obstacle for a
meaningful real-time service.
•Eg :Real-time bidding, advertising, social gaming,
Stock market .
“Memory is the new disk, disk is the new tape”
Jim Gray
Data scientist
Creator IBM system R
.ball
© 2013 SAP AG. All rights reserved. 9Public
Hardware Advances: Moore’s Law - DRAM Pricing
1980: Memory $10,000/MB
2000: Memory $1/MB
2013: Memory $0.004/MB
Time
Memory
Cost /
Speed
gdfgfgfgh ss
© 2013 SAP AG. All rights reserved. 10Public
Hardware Advances: Moore‘s Law - CPUs
2002
1 core
32 bits
4MB
2007
2 cores
2 CPUs per server
External Controllers
8 cores -16 threads / CPU
4 CPUs per server
On-chip memory control
Quick interconnect
VM and vector support
64 bits; 256 GB - 1 TB
2010
More cores, bigger caches
16 ... 64 CPUs per server
Greater on-chip integration
(PCIe, network, ...)
Data-direct I/O
Tens of TBs
2013
Images: Intel, Danilo Rizzuti / FreeDigitalPhotos.net
ball cold
IN-MEMORY DATABASE SYSTEMS
.ball
•For in-memory DB ,Data resides permanently on main memory.
•Source data is loaded into system memory in a compressed,
non-relational format
•Only backup copy on disk.
•Memory optimised data structures are used
Disk VS Memory
.ball
•Order of magnitude of access time is less for main memory.
•Main memory is normally volatile while disk storage is not.
•The layout of disk is much more critical than layout of main
memory
MMDB PRODUCTS AVAILABLE
.ball
.ball
.ball
•SAP HANA is the market leader in IMDB systems. It is also a platform
for big data processing analysis and prediction.
•SAP HANA can help business for building real-time applications and
analytics for accelerating the process
© 2013 SAP AG. All rights reserved. 16Public
In-Memory
Column
Database
Massively
Parallel
Processing
Optimized
Calculation
Engine
Columnar storage increases the
amount of data that can be
stored in limited memory
(compared to disk)
Column databases enable
easier parallelization of
queries
Row buffer fast
transactional processing
In-memory
processing gives
more time for
relatively slow
updates to column
data
In-memory allows
sophisticated
calculations in real-time
MPP optimized software
enables linear performance
scaling making sophisticated
calculations like allocations
possible
Each technology works well on its own, but combining them all is the real
opportunity — provides all of the upside benefits while mitigating the downsides
SAP in-memory innovations
make the “New Way” a reality
s
© 2013 SAP AG. All rights reserved. 17Public
Order Country Product Sales
456 France corn 1000
457 Italy wheat 900
458 Italy corn 600
459 Spain rice 800
SAP HANA: Column Store
456 France corn 1000
457 Italy wheat 900
458 Italy corn 600
459 Spain rice 800
456
457
458
459
France
Italy
Italy
Spain
corn
wheat
corn
rice
1000
900
600
800
Typical Database
SAP HANA: column order
SELECT Country, SUM(sales) FROM SalesOrders
WHERE Product = ‘corn’
GROUP BY Country

s
© 2013 SAP AG. All rights reserved. 18Public
SAP HANA: Data Compression
 Efficient compression methods (dictionary, run length, cluster, prefix, etc.)
 Compression works well with columns and can speedup operations on
columns (~ factor 10)
 Because of compression, write changes into less compressed delta storage
 Needs to be merged into columns from time to time or when a certain size is exceeded
 Delta merge can be done in background
 Trade-off between compression ratio and delta merge runtime
 Updates into delta data storage and periodically merged into main data storage
 High write performance not affected by compression
 Data is written to delta storage with less compression which is optimized for write access. This is
merged into the main area of the column store later on.
© 2013 SAP AG. All rights reserved. 19Public
SAP HANA: Dictionary Compression
Jones
Miller
Millman
Zsuwalski
Baker
Miller
John
Miller
Johnson
Jones
Column „Name“
(uncompressed)
Value-ID sequence
One element for each row in column
4
1
5
N
0
4
2
4
3
1
ValueIDs
Johnson
Miller
John
Jones
0
1
2
3
4
Millman
ZsuwalskiN
Dictionary
sorted
Value ID implicitly given
by sequence in which
values are stored
Value
Baker
5
Column „Name“ (dictionary compressed)
point into
dictionary
s
© 2013 SAP AG. All rights reserved. 20Public
SAP HANA: Scalability
Scales from very small servers to very large clusters
Single Server
• 2 CPU 128GB to 8 CPU 1TB
Scale Out Cluster
• 2 to n servers per cluster
• Largest certified configuration: 16 servers
• Largest tested configuration: 100+
servers
• Support for high availability
and disaster tolerance
Cloud Deployment
s
© 2013 SAP AG. All rights reserved. 21Public
What is inside HANA?
ACID Compliant
Database
- In-Memory
- Column Store
Out
In
SQL
BICS
MDX
JSON /
XML
Data
Services
HANA
Studio
Parallel
Execution
Scripting
Engine
Business
Function
Library
Unstructured
(Text)
Predictive
Analysis
Library
OLAP
XS App
Server
“R” HS
Integration
1. Batch Transfer
2. SAP & Non-SAP
3. Extensive Transformations
4. Structured & Unstructured
5. Hadoop Integration
1. ODBC / JDBC
2. 3rd Party Apps
3. 3rd Party Tools
1. BICS
2. NetWeaver BW
3. SAP BOBJ
1. ODBO
2. MS Excel
3. 3rd Party OLAP Tools
1. HTTP
2. RESTful services
3. OData Compliant
“R”
ESP
Spatial /
Geospatial
Query
Federation
1. IQ / ASE
2. Teradata / Oracle
3. Hadoop
Replication
Services 1. Near Real Time
2. Non-SAP
s
.ball
•In-Memory Big Data Management and Processing:
By Hao Zhang, Gang Chen, Member, IEEE, Beng Chin Ooi, Fellow, IEEE,
Kian-Lee Tan, Member, IEEE, and Meihui Zhang, Member, IEEE
•SAP HANA Distributed In-Memory Database
System: Transaction, Session, and Metadata Management
Juchang Lee#1, Yong Sik Kwon#2, Franz Färber*3, Michael Muehle*4, Chulwon
SAP Labs, Korea
•In-memory database
www.wikipedia.org
REFERENCES
.ball

More Related Content

What's hot

SAP IQ 16 Product Annoucement
SAP IQ 16 Product AnnoucementSAP IQ 16 Product Annoucement
SAP IQ 16 Product Annoucement
Dobler Consulting
 
Choosing the Right Big Data Tools for the Job - A Polyglot Approach
Choosing the Right Big Data Tools for the Job - A Polyglot ApproachChoosing the Right Big Data Tools for the Job - A Polyglot Approach
Choosing the Right Big Data Tools for the Job - A Polyglot Approach
DATAVERSITY
 
The thinking persons guide to data warehouse design
The thinking persons guide to data warehouse designThe thinking persons guide to data warehouse design
The thinking persons guide to data warehouse design
Calpont
 
MySQL conference 2010 ignite talk on InfiniDB
MySQL conference 2010 ignite talk on InfiniDBMySQL conference 2010 ignite talk on InfiniDB
MySQL conference 2010 ignite talk on InfiniDB
Calpont
 

What's hot (20)

SAP IQ 16 Product Annoucement
SAP IQ 16 Product AnnoucementSAP IQ 16 Product Annoucement
SAP IQ 16 Product Annoucement
 
Making MySQL Great For Business Intelligence
Making MySQL Great For Business IntelligenceMaking MySQL Great For Business Intelligence
Making MySQL Great For Business Intelligence
 
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAININGDATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
 
SAP HANA Overview
SAP HANA OverviewSAP HANA Overview
SAP HANA Overview
 
Star schema my sql
Star schema   my sqlStar schema   my sql
Star schema my sql
 
Datawarehouse
DatawarehouseDatawarehouse
Datawarehouse
 
Autodesk Technical Webinar: SAP HANA in-memory database
Autodesk Technical Webinar: SAP HANA in-memory databaseAutodesk Technical Webinar: SAP HANA in-memory database
Autodesk Technical Webinar: SAP HANA in-memory database
 
2017 DB Trends for Powering Real-Time Systems of Engagement
2017 DB Trends for Powering Real-Time Systems of Engagement2017 DB Trends for Powering Real-Time Systems of Engagement
2017 DB Trends for Powering Real-Time Systems of Engagement
 
Choosing the Right Big Data Tools for the Job - A Polyglot Approach
Choosing the Right Big Data Tools for the Job - A Polyglot ApproachChoosing the Right Big Data Tools for the Job - A Polyglot Approach
Choosing the Right Big Data Tools for the Job - A Polyglot Approach
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in Datawarehousing
 
Big Data .. Are you ready for the next wave?
Big Data .. Are you ready for the next wave?Big Data .. Are you ready for the next wave?
Big Data .. Are you ready for the next wave?
 
The thinking persons guide to data warehouse design
The thinking persons guide to data warehouse designThe thinking persons guide to data warehouse design
The thinking persons guide to data warehouse design
 
MariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UN
MariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UNMariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UN
MariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UN
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
Ibm pure data system for analytics n3001
Ibm pure data system for analytics n3001Ibm pure data system for analytics n3001
Ibm pure data system for analytics n3001
 
OLTP vs OLAP
OLTP vs OLAPOLTP vs OLAP
OLTP vs OLAP
 
MySQL conference 2010 ignite talk on InfiniDB
MySQL conference 2010 ignite talk on InfiniDBMySQL conference 2010 ignite talk on InfiniDB
MySQL conference 2010 ignite talk on InfiniDB
 
Massive parallel processing database systems mpp
Massive parallel processing database systems mppMassive parallel processing database systems mpp
Massive parallel processing database systems mpp
 
Postgres Integrates Effectively in the "Enterprise Sandbox"
Postgres Integrates Effectively in the "Enterprise Sandbox"Postgres Integrates Effectively in the "Enterprise Sandbox"
Postgres Integrates Effectively in the "Enterprise Sandbox"
 
Building High Performance MySQL Query Systems and Analytic Applications
Building High Performance MySQL Query Systems and Analytic ApplicationsBuilding High Performance MySQL Query Systems and Analytic Applications
Building High Performance MySQL Query Systems and Analytic Applications
 

Similar to IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.

Maaz Anjum - IOUG Collaborate 2013 - An Insight into Space Realization on ODA...
Maaz Anjum - IOUG Collaborate 2013 - An Insight into Space Realization on ODA...Maaz Anjum - IOUG Collaborate 2013 - An Insight into Space Realization on ODA...
Maaz Anjum - IOUG Collaborate 2013 - An Insight into Space Realization on ODA...
Maaz Anjum
 
Handling Massive Writes
Handling Massive WritesHandling Massive Writes
Handling Massive Writes
Liran Zelkha
 
A5 oracle exadata-the game changer for online transaction processing data w...
A5   oracle exadata-the game changer for online transaction processing data w...A5   oracle exadata-the game changer for online transaction processing data w...
A5 oracle exadata-the game changer for online transaction processing data w...
Dr. Wilfred Lin (Ph.D.)
 

Similar to IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE. (20)

Oracle Database in-Memory Overivew
Oracle Database in-Memory OverivewOracle Database in-Memory Overivew
Oracle Database in-Memory Overivew
 
Using SAS GRID v 9 with Isilon F810
Using SAS GRID v 9 with Isilon F810Using SAS GRID v 9 with Isilon F810
Using SAS GRID v 9 with Isilon F810
 
Transforming Business with Intel and SAP HANA 2
Transforming Business with Intel and SAP HANA 2 Transforming Business with Intel and SAP HANA 2
Transforming Business with Intel and SAP HANA 2
 
Trusted advisory on technology comparison --exadata, hana, db2
Trusted advisory on technology comparison --exadata, hana, db2Trusted advisory on technology comparison --exadata, hana, db2
Trusted advisory on technology comparison --exadata, hana, db2
 
Maaz Anjum - IOUG Collaborate 2013 - An Insight into Space Realization on ODA...
Maaz Anjum - IOUG Collaborate 2013 - An Insight into Space Realization on ODA...Maaz Anjum - IOUG Collaborate 2013 - An Insight into Space Realization on ODA...
Maaz Anjum - IOUG Collaborate 2013 - An Insight into Space Realization on ODA...
 
SAP HANA – A Technical Snapshot
SAP HANA – A Technical SnapshotSAP HANA – A Technical Snapshot
SAP HANA – A Technical Snapshot
 
IBM Power & SUSE at SAPPHIRE 2016
IBM Power & SUSE at SAPPHIRE 2016IBM Power & SUSE at SAPPHIRE 2016
IBM Power & SUSE at SAPPHIRE 2016
 
Severalnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IXSeveralnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IX
 
Oracle real application_cluster
Oracle real application_clusterOracle real application_cluster
Oracle real application_cluster
 
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
 
Has Your Data Gone Rogue?
Has Your Data Gone Rogue?Has Your Data Gone Rogue?
Has Your Data Gone Rogue?
 
SAP HANA on IBM Power Systems by John Hedge
SAP HANA on IBM Power Systems by John HedgeSAP HANA on IBM Power Systems by John Hedge
SAP HANA on IBM Power Systems by John Hedge
 
Flash for big data
Flash for big data Flash for big data
Flash for big data
 
Exadata x3 workshop
Exadata x3 workshopExadata x3 workshop
Exadata x3 workshop
 
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptx7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
 
S016828 storage-tiering-nola-v1710b
S016828 storage-tiering-nola-v1710bS016828 storage-tiering-nola-v1710b
S016828 storage-tiering-nola-v1710b
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics Platform
 
Getting Started with Managed Database Services on AWS - September 2016 Webina...
Getting Started with Managed Database Services on AWS - September 2016 Webina...Getting Started with Managed Database Services on AWS - September 2016 Webina...
Getting Started with Managed Database Services on AWS - September 2016 Webina...
 
Handling Massive Writes
Handling Massive WritesHandling Massive Writes
Handling Massive Writes
 
A5 oracle exadata-the game changer for online transaction processing data w...
A5   oracle exadata-the game changer for online transaction processing data w...A5   oracle exadata-the game changer for online transaction processing data w...
A5 oracle exadata-the game changer for online transaction processing data w...
 

Recently uploaded

Recently uploaded (20)

8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 

IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.

  • 1. IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT. SAP HANA DATABASE PRESENTED BY : GEORGE JOSEPH S7 CS ALPHA ROLL NO-39 RSET , KERALA.
  • 2. AGENDA .ball •Revisiting Traditional RDBMS •Defining IMDB •A look at a few IMDB products in the market •SAP HANA database in detail
  • 3. What is a database ? .ball •An organised collection of information •Allows reading and writing . •Provides authorisation and authentication. •Provides some level of data safety.
  • 4. Traditional RDBMS .ball •Developed by E F Codd in early 1970s •This model is based on tables rows and columns and the manipulation of data stored within. •A Relational DB is the collection of all these table •Example: Oracle, mysql & microsoft access
  • 5. What is a database ? .ball •An organised collection of information •Allows reading and writing . •Provides authorisation and authentication. •Provides some level of data safety.
  • 6. Data store for typical RDBMS .ball •Data resides on disk. •Data maybe cached into memory for access.
  • 7. PROBLEM .ball • Existing disk-based systems can no longer offer timely response due to the high access latency to hard disks •The unacceptable performance an obstacle for a meaningful real-time service. •Eg :Real-time bidding, advertising, social gaming, Stock market .
  • 8. “Memory is the new disk, disk is the new tape” Jim Gray Data scientist Creator IBM system R .ball
  • 9. © 2013 SAP AG. All rights reserved. 9Public Hardware Advances: Moore’s Law - DRAM Pricing 1980: Memory $10,000/MB 2000: Memory $1/MB 2013: Memory $0.004/MB Time Memory Cost / Speed gdfgfgfgh ss
  • 10. © 2013 SAP AG. All rights reserved. 10Public Hardware Advances: Moore‘s Law - CPUs 2002 1 core 32 bits 4MB 2007 2 cores 2 CPUs per server External Controllers 8 cores -16 threads / CPU 4 CPUs per server On-chip memory control Quick interconnect VM and vector support 64 bits; 256 GB - 1 TB 2010 More cores, bigger caches 16 ... 64 CPUs per server Greater on-chip integration (PCIe, network, ...) Data-direct I/O Tens of TBs 2013 Images: Intel, Danilo Rizzuti / FreeDigitalPhotos.net ball cold
  • 11. IN-MEMORY DATABASE SYSTEMS .ball •For in-memory DB ,Data resides permanently on main memory. •Source data is loaded into system memory in a compressed, non-relational format •Only backup copy on disk. •Memory optimised data structures are used
  • 12. Disk VS Memory .ball •Order of magnitude of access time is less for main memory. •Main memory is normally volatile while disk storage is not. •The layout of disk is much more critical than layout of main memory
  • 14. .ball
  • 15. .ball •SAP HANA is the market leader in IMDB systems. It is also a platform for big data processing analysis and prediction. •SAP HANA can help business for building real-time applications and analytics for accelerating the process
  • 16. © 2013 SAP AG. All rights reserved. 16Public In-Memory Column Database Massively Parallel Processing Optimized Calculation Engine Columnar storage increases the amount of data that can be stored in limited memory (compared to disk) Column databases enable easier parallelization of queries Row buffer fast transactional processing In-memory processing gives more time for relatively slow updates to column data In-memory allows sophisticated calculations in real-time MPP optimized software enables linear performance scaling making sophisticated calculations like allocations possible Each technology works well on its own, but combining them all is the real opportunity — provides all of the upside benefits while mitigating the downsides SAP in-memory innovations make the “New Way” a reality s
  • 17. © 2013 SAP AG. All rights reserved. 17Public Order Country Product Sales 456 France corn 1000 457 Italy wheat 900 458 Italy corn 600 459 Spain rice 800 SAP HANA: Column Store 456 France corn 1000 457 Italy wheat 900 458 Italy corn 600 459 Spain rice 800 456 457 458 459 France Italy Italy Spain corn wheat corn rice 1000 900 600 800 Typical Database SAP HANA: column order SELECT Country, SUM(sales) FROM SalesOrders WHERE Product = ‘corn’ GROUP BY Country  s
  • 18. © 2013 SAP AG. All rights reserved. 18Public SAP HANA: Data Compression  Efficient compression methods (dictionary, run length, cluster, prefix, etc.)  Compression works well with columns and can speedup operations on columns (~ factor 10)  Because of compression, write changes into less compressed delta storage  Needs to be merged into columns from time to time or when a certain size is exceeded  Delta merge can be done in background  Trade-off between compression ratio and delta merge runtime  Updates into delta data storage and periodically merged into main data storage  High write performance not affected by compression  Data is written to delta storage with less compression which is optimized for write access. This is merged into the main area of the column store later on.
  • 19. © 2013 SAP AG. All rights reserved. 19Public SAP HANA: Dictionary Compression Jones Miller Millman Zsuwalski Baker Miller John Miller Johnson Jones Column „Name“ (uncompressed) Value-ID sequence One element for each row in column 4 1 5 N 0 4 2 4 3 1 ValueIDs Johnson Miller John Jones 0 1 2 3 4 Millman ZsuwalskiN Dictionary sorted Value ID implicitly given by sequence in which values are stored Value Baker 5 Column „Name“ (dictionary compressed) point into dictionary s
  • 20. © 2013 SAP AG. All rights reserved. 20Public SAP HANA: Scalability Scales from very small servers to very large clusters Single Server • 2 CPU 128GB to 8 CPU 1TB Scale Out Cluster • 2 to n servers per cluster • Largest certified configuration: 16 servers • Largest tested configuration: 100+ servers • Support for high availability and disaster tolerance Cloud Deployment s
  • 21. © 2013 SAP AG. All rights reserved. 21Public What is inside HANA? ACID Compliant Database - In-Memory - Column Store Out In SQL BICS MDX JSON / XML Data Services HANA Studio Parallel Execution Scripting Engine Business Function Library Unstructured (Text) Predictive Analysis Library OLAP XS App Server “R” HS Integration 1. Batch Transfer 2. SAP & Non-SAP 3. Extensive Transformations 4. Structured & Unstructured 5. Hadoop Integration 1. ODBC / JDBC 2. 3rd Party Apps 3. 3rd Party Tools 1. BICS 2. NetWeaver BW 3. SAP BOBJ 1. ODBO 2. MS Excel 3. 3rd Party OLAP Tools 1. HTTP 2. RESTful services 3. OData Compliant “R” ESP Spatial / Geospatial Query Federation 1. IQ / ASE 2. Teradata / Oracle 3. Hadoop Replication Services 1. Near Real Time 2. Non-SAP s
  • 22. .ball •In-Memory Big Data Management and Processing: By Hao Zhang, Gang Chen, Member, IEEE, Beng Chin Ooi, Fellow, IEEE, Kian-Lee Tan, Member, IEEE, and Meihui Zhang, Member, IEEE •SAP HANA Distributed In-Memory Database System: Transaction, Session, and Metadata Management Juchang Lee#1, Yong Sik Kwon#2, Franz Färber*3, Michael Muehle*4, Chulwon SAP Labs, Korea •In-memory database www.wikipedia.org REFERENCES
  • 23. .ball

Editor's Notes

  1. From one core to multi-core, to multiple processors per servers, to multi-threaded cores, where we now have servers with up to 8 CPUs (with 24Mb caches each) and 160 threads! Relentless technology progress by Intel, AMD, ARM and others, will lead to even bigger caches and cores. The name of the game is data-locality and parallelization. Just released “Sandy Bridge” generation for servers.
  2. By accessing data in column-store order, you benefit immensely from simplified table-scan and data pre-caching. This can make all the difference in performance.
  3. Big building 1910 Basketball hoop – 10 feet Ratio of 106M to 4.9k Memory access is 1M – 10M times faster than disk. In the past memory was so expensive that database vendors optimized for disk. However, with memory costs dropping so dramatically over last 20 years, it’s not possible to harness the power of in-memory computing.