SlideShare a Scribd company logo
1 of 71
Download to read offline
This document is intended for only AVEA İletişim Hizmetleri A.Ş.("AVEA"), its dealers, employees and/or others specifically authorised. The contents of this document are
confidential and any disclosure, copying, distribution and/or taking any action in reliance with the content of this document is prohibited. AVEA is not liable for the transmission
of this document in any manner to any third parties that are not authorised to receive.
Hadoop & NoSQL
New Generation Database Systems
Ramazan FIRIN
22.04.2014
2
AGENDA
• Big Data
• Hadoop
• NoSQL
• Graph DB and Neoj
• Possible Usage in Tellco
• Demo
3
Executive Summary
AVEA
• Big Data is a new IT trend
• Hadoop and NoSQL can used to process Big Data
• Possible usage area in Tellco :
- Prevent Churn
- to offer customer spesific campaign
- to get more customer
4
Big Bang = Big Data
Big Bang Big Data
42008-07-01_Presentation Template MBT / CEOMercedes-Benz Türk A.Ş.
5
What is Big Data?
Datasets that are too awkward to work with using traditional,
hands-ondatabase management tools.
6
Big Data- 3V Concept
7
Big Data To Smart Data
Cover of The Economist
8
Big Data Sources
1. Social network profiles -Facebook, LinkedIn, Yahoo, Google
2. Social influencers - blog comments, user forums, review sites,
3. Activity-generated data - application logs, sensor data
4. Public—Wikipedia, IMDb, etc
5. Data warehouse appliances - transactional data
6. Network and in-stream monitoring
7. Legacy documents—
9
Big Data Approach
10
Sample Usage - 360°Degree View of the
Customers
11
Big Data Solutions – Oracle Big Data Appliance
12
Big Data Solutions – IBM Pure Data
13
Storage for Big Data
13
İf we cant use relational Database, how can we
store it?
1)Hadoop
2)NoSQL
14
What is HADOOP?
The Apache Hadoop software library is a framework that
allows for the distributed processing of large data sets
across clusters of computers using simple programming models
15
History
16
Hadoop Components
17
HADOOP ARCHITECTURE
18
Hadoop Ecosystem
Pig - simplifies hadoop programming, data processing language
Hive - SQL like queries
HBase - Random read/write, billions of row and millions of colums
(NoSQL)
19
NoSQL
20
RDBMS PERFORMANCE
20
21
Join is killer...
21
22
What is NoSQL?
• Stands for Not Only SQL
• Non relational
• Cheap, Easy to implement
• Scalability
– Vertically - Add more data
– Horizontally - Add more storage
• No pre-defined schema
• No join operations
• Not ACID, support CAP threom
23
Key-Value Stores
- Redis, Voldemort
24
Redis Features
• Data Types
• Publish / Subscribe
• Transactions
• Replication
• Persistence
• Partition
24
25
Redis Datatypes
• String
• List
• Sets
• Sorted Sets
• Hashes
25
26
Redis persistance
• RDB - Take snapshot in an interval
Fast
may loss several minutes data if kill -9
•
• AOF – Log for all operations
Still fast enough
may loss 1 second data if kill -9
26
27
Redis Commands
$ redis-cli set counter 100 OK
$ redis-cli incr counter (integer) 101
$ redis-cli incr counter (integer) 102
$ redis-cli incrby counter 10 (integer) 112
SET : SADD,
GET : SPOP, SRANDMEMBER, SMEMBERS
DEL : SREM
ETC : SINTER, SUNION, SCARD, SDIFF, SMOVE, SISMEMBER
27
28
Redis Commands – Lists
$redis-cli rpush messages "Hello how are you?" OK
$ redis-cli rpush messages "Fine thanks. I'm having fun with Redis"
OK
$ redis-cli rpush messages "I should look into this NOSQL thing
ASAP" OK
$ redis-cli lrange messages 0 2
1. Hello how are you?
2. 2. Fine thanks. I'm having fun with Redis
3. 3. I should look into this NOSQL thing ASAP
• Chat systems
• Paginations...
28
29
Redis – Publish/Subscribe
redis 127.0.0.1:6379> PUBLISH myradioshow "Good morning
everyone!" (integer) 0
redis 127.0.0.1:6379> PUBLISH myradioshow "How ya'll doin
tonight?" (integer) 0
redis 127.0.0.1:6379> PUBLISH myradioshow "Hello? Is anyone
listening? I'm not wearing pants."
(integer) 0
redis 127.0.0.1:6379> SUBSCRIBE myradioshow
Reading messages... (press Ctrl-C to quit)
1) "subscribe"
2) 2) "myradioshow"
29
30
Document Database
- CouchDB, MongoDB
31
MongoDB Features
• JSON / BSON support
• RestFul support
• CRUD operations
• Queries like SQL
• İndexing
• Auto sharding
• Built in replication and high availabity
• Aggregation framework
31
32
Terminology
32
33
Sharding
33
34
MondoDB vs SQL
34
SQL MongoDB
SELECT * FROM users db.users.find()
SELECT id, user_id, status FROM users
db.users.find( { }, { user_id: 1, status:
1 } )
SELECT * FROM users WHERE status
= "A"
db.users.find( { status: "A" } )
SELECT user_id, status FROM users
WHERE status = "A"
db.users.find( { status: "A" }, {
user_id: 1, status: 1, _id: 0 } )
SELECT * FROM users WHERE
user_id like "%bc%"
db.users.find( { user_id: /bc/ } )
SELECT * FROM users WHERE status
= "A" ORDER BY user_id ASC
db.users.find( { status: "A" } ).sort( {
user_id: 1 } )
SELECT * FROM users LIMIT 5 SKIP
10
db.users.find().limit(5).skip(10)
35
Column Family Stores
-Cassandra, HBase
36
Cassandra Features
• Proven
• Rich Data Model
• Scalable
• Distributed & Decentralized
• High Performance read/write
• Fault Tolerance
• No SPOF
• Schema free
36
37
Cassandra Cluster
37
38
Benhmark
38
39
Architecture
39
40
Consistency Level
• ANY
• ONE
• TWO
• THREE
• QUORUM
• LOCAL_QUORUM
• EACH_QUORUM
• ALL
40
41
RMDBS Support ACID
• Atomicity - a transaction is all or nothing
• Consistency - only valid data is written to the database
• Isolation - pretend all transactions are happening serially and the data
is correct
• Durability - what you write is what you get
42
NoSQL Support CAP Threom
Consistency : all nodes give the same
answer
Avaibility : nodes always give answer and
accept updates
Partitioning: system continuos working if
some nodes go quite
43
Visual Guide to NoSQL Systems
43
44
Graph Database
- Neo4J, InfoGrid, Infinite Graph
45
Graph DB
Graph database uses graph structures with nodes, edges, and properties
to represent and store data.
46
NoSQL Performance
47
Graph DB Usage Area
• Recommendations
• Business Inteligence
• Social networking
• MDM
• System Management
• Time Series data
• Product Catalogue
• Web Analitics
• Scientific Computing
• Indexing your slow
RMDBS
48
Neo4j
49
Neo4j
• Leading Graph
Database
• Transaction support
(ACID)
• Indexing
• Querying
• REST support
• Disk Based
• Opensource
• Traversal framework
• High Performance
(traverse 1.000.000 +
relationship/seconds)
• Robust (in 7/24 operation
since 2003)
• Massive scalability
50
Neo4j Data Model
Neo4j has Nodes and Relationship.
Nodes and realtionships have properties.
Node1 Node2
Property:name
Property:surname
Property:name
Property:surname
Relationship
Relationship type : knows
Property : Date of meeting
51
Relational Databases are Graphs!
52
Cypher For Query
53
Ne4j Performance
http://www.neotechnology.com/2012/10/20-billion-relationships-imported-
into-neo4j-on-ec2/
54
Who use Neo4j?
• Cisco - Master Data Management
• Telenor Group : Customer organization scructure (203 million
subscribers )
• Deutsche Telekom: Social football site (150 million subscribers )
55
Orient DB
• The Document-Graph
database
• ACID support
• SQL and Native Queries,
• schema-less, schema-full
and schema-mixed modes
• Roles + Security
• Functions
• HTTP / Restfull / Json /
Binary supports
• Hooks
• Fetch plans
• Inheritance
• 200.000 insert per
second(6 M node travels
with cache)
56
FluxGraph
• Temporal Graph Database
• Has checkpoint
• Compatible with Neo4j
562008-07-01_Presentation Template MBT / CEOMercedes-Benz Türk A.Ş.
57
Graphs of Telecommunications
57
58
CDR Analysis by Graph
58
59
Spring Data
59
60
Spring Data Neo4j
61
NoSQL Usage
• Cisco is building a master data management system based on Neo4j, and this is
actually our first Fortune 500 customer. They found us about two years ago when they
tried to build this big, complex hierarchy inside of Oracle RAC. In Oracle RAC, they had
response time in minutes, and then when they replaced it [with] Neo4j, they had
response times in milliseconds.
Emil Eifrem – Neo4j
CEO
• NHS tears out its Oracle Spine in favour of open source
http://www.theregister.co.uk/2013/10/10/nhs_drops_oracle_for_riak/
• AMD: Why we had to evacuate 276TB from Oracle DB to Hadoop
http://www.theregister.co.uk/2014/03/24/amd_hadoop_migration/
61
62 62
Statistics
63
Magic Quadrant for Operational Database
Management Systems
63
64
NoSQL Market Size
64
65
NoSQL Engine Ranking
65
66
NoSQL in Enterprise App
66
67
Use of NoSQL products
67
68
Database market share
68
69
Web Application Arcitecture
69
70
Polyglot Persistance
70
71
Thanks

More Related Content

What's hot

An Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDBAn Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDBMongoDB
 
Prepare for Peak Holiday Season with MongoDB
Prepare for Peak Holiday Season with MongoDBPrepare for Peak Holiday Season with MongoDB
Prepare for Peak Holiday Season with MongoDBMongoDB
 
NoSQL databases and managing big data
NoSQL databases and managing big dataNoSQL databases and managing big data
NoSQL databases and managing big dataSteven Francia
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQLbalwinders
 
Webinar: High Performance MongoDB Applications with IBM POWER8
Webinar: High Performance MongoDB Applications with IBM POWER8Webinar: High Performance MongoDB Applications with IBM POWER8
Webinar: High Performance MongoDB Applications with IBM POWER8MongoDB
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational DatabasesUdi Bauman
 
The Hadoop Ecosystem for Developers
The Hadoop Ecosystem for DevelopersThe Hadoop Ecosystem for Developers
The Hadoop Ecosystem for DevelopersZohar Elkayam
 
Music recommendations API with Neo4j
Music recommendations API with Neo4jMusic recommendations API with Neo4j
Music recommendations API with Neo4jBoris Guarisma
 
MongoDB Operations for Developers
MongoDB Operations for DevelopersMongoDB Operations for Developers
MongoDB Operations for DevelopersMongoDB
 
Facing enterprise specific challenges – utility programming in hadoop
Facing enterprise specific challenges – utility programming in hadoopFacing enterprise specific challenges – utility programming in hadoop
Facing enterprise specific challenges – utility programming in hadoopfann wu
 
Analytics with Spark and Cassandra
Analytics with Spark and CassandraAnalytics with Spark and Cassandra
Analytics with Spark and CassandraDataStax Academy
 
Relational and non relational database 7
Relational and non relational database 7Relational and non relational database 7
Relational and non relational database 7abdulrahmanhelan
 
Big data for cio 2015
Big data for cio 2015Big data for cio 2015
Big data for cio 2015Zohar Elkayam
 
NoSQL and The Big Data Hullabaloo
NoSQL and The Big Data HullabalooNoSQL and The Big Data Hullabaloo
NoSQL and The Big Data HullabalooAndrew Brust
 
Connector/J Beyond JDBC: the X DevAPI for Java and MySQL as a Document Store
Connector/J Beyond JDBC: the X DevAPI for Java and MySQL as a Document StoreConnector/J Beyond JDBC: the X DevAPI for Java and MySQL as a Document Store
Connector/J Beyond JDBC: the X DevAPI for Java and MySQL as a Document StoreFilipe Silva
 
Big Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI ProsBig Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI ProsAndrew Brust
 
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News!
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News! ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News!
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News! Embarcadero Technologies
 
Architecture et modèle de données Cassandra
Architecture et modèle de données CassandraArchitecture et modèle de données Cassandra
Architecture et modèle de données CassandraClaude-Alain Glauser
 
Big Data Strategy for the Relational World
Big Data Strategy for the Relational World Big Data Strategy for the Relational World
Big Data Strategy for the Relational World Andrew Brust
 

What's hot (20)

An Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDBAn Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDB
 
Prepare for Peak Holiday Season with MongoDB
Prepare for Peak Holiday Season with MongoDBPrepare for Peak Holiday Season with MongoDB
Prepare for Peak Holiday Season with MongoDB
 
NoSQL databases and managing big data
NoSQL databases and managing big dataNoSQL databases and managing big data
NoSQL databases and managing big data
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
Webinar: High Performance MongoDB Applications with IBM POWER8
Webinar: High Performance MongoDB Applications with IBM POWER8Webinar: High Performance MongoDB Applications with IBM POWER8
Webinar: High Performance MongoDB Applications with IBM POWER8
 
NOSQL Overview
NOSQL OverviewNOSQL Overview
NOSQL Overview
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational Databases
 
The Hadoop Ecosystem for Developers
The Hadoop Ecosystem for DevelopersThe Hadoop Ecosystem for Developers
The Hadoop Ecosystem for Developers
 
Music recommendations API with Neo4j
Music recommendations API with Neo4jMusic recommendations API with Neo4j
Music recommendations API with Neo4j
 
MongoDB Operations for Developers
MongoDB Operations for DevelopersMongoDB Operations for Developers
MongoDB Operations for Developers
 
Facing enterprise specific challenges – utility programming in hadoop
Facing enterprise specific challenges – utility programming in hadoopFacing enterprise specific challenges – utility programming in hadoop
Facing enterprise specific challenges – utility programming in hadoop
 
Analytics with Spark and Cassandra
Analytics with Spark and CassandraAnalytics with Spark and Cassandra
Analytics with Spark and Cassandra
 
Relational and non relational database 7
Relational and non relational database 7Relational and non relational database 7
Relational and non relational database 7
 
Big data for cio 2015
Big data for cio 2015Big data for cio 2015
Big data for cio 2015
 
NoSQL and The Big Data Hullabaloo
NoSQL and The Big Data HullabalooNoSQL and The Big Data Hullabaloo
NoSQL and The Big Data Hullabaloo
 
Connector/J Beyond JDBC: the X DevAPI for Java and MySQL as a Document Store
Connector/J Beyond JDBC: the X DevAPI for Java and MySQL as a Document StoreConnector/J Beyond JDBC: the X DevAPI for Java and MySQL as a Document Store
Connector/J Beyond JDBC: the X DevAPI for Java and MySQL as a Document Store
 
Big Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI ProsBig Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI Pros
 
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News!
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News! ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News!
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News!
 
Architecture et modèle de données Cassandra
Architecture et modèle de données CassandraArchitecture et modèle de données Cassandra
Architecture et modèle de données Cassandra
 
Big Data Strategy for the Relational World
Big Data Strategy for the Relational World Big Data Strategy for the Relational World
Big Data Strategy for the Relational World
 

Viewers also liked

Simple Way for Neo4j Visualization
Simple Way for Neo4j VisualizationSimple Way for Neo4j Visualization
Simple Way for Neo4j Visualizationramazan fırın
 
Big data hadoop-no sql and graph db-final
Big data hadoop-no sql and graph db-finalBig data hadoop-no sql and graph db-final
Big data hadoop-no sql and graph db-finalramazan fırın
 
Oracle vs NoSQL – The good, the bad and the ugly
Oracle vs NoSQL – The good, the bad and the uglyOracle vs NoSQL – The good, the bad and the ugly
Oracle vs NoSQL – The good, the bad and the uglyJohn Kanagaraj
 
Transitioning from SQL to MongoDB
Transitioning from SQL to MongoDBTransitioning from SQL to MongoDB
Transitioning from SQL to MongoDBMongoDB
 
MongoDB Pros and Cons
MongoDB Pros and ConsMongoDB Pros and Cons
MongoDB Pros and Consjohnrjenson
 
NoSQL 간단한 소개
NoSQL 간단한 소개NoSQL 간단한 소개
NoSQL 간단한 소개Wonchang Song
 

Viewers also liked (8)

Cloud computig systems
Cloud computig systemsCloud computig systems
Cloud computig systems
 
Simple Way for Neo4j Visualization
Simple Way for Neo4j VisualizationSimple Way for Neo4j Visualization
Simple Way for Neo4j Visualization
 
Big data hadoop-no sql and graph db-final
Big data hadoop-no sql and graph db-finalBig data hadoop-no sql and graph db-final
Big data hadoop-no sql and graph db-final
 
Oracle vs NoSQL – The good, the bad and the ugly
Oracle vs NoSQL – The good, the bad and the uglyOracle vs NoSQL – The good, the bad and the ugly
Oracle vs NoSQL – The good, the bad and the ugly
 
Hadoop and other animals
Hadoop and other animalsHadoop and other animals
Hadoop and other animals
 
Transitioning from SQL to MongoDB
Transitioning from SQL to MongoDBTransitioning from SQL to MongoDB
Transitioning from SQL to MongoDB
 
MongoDB Pros and Cons
MongoDB Pros and ConsMongoDB Pros and Cons
MongoDB Pros and Cons
 
NoSQL 간단한 소개
NoSQL 간단한 소개NoSQL 간단한 소개
NoSQL 간단한 소개
 

Similar to Hadoop & no sql new generation database systems

Presto @ Zalando - Big Data Tech Warsaw 2020
Presto @ Zalando - Big Data Tech Warsaw 2020Presto @ Zalando - Big Data Tech Warsaw 2020
Presto @ Zalando - Big Data Tech Warsaw 2020Piotr Findeisen
 
Whats new in Oracle Database 12c release 12.1.0.2
Whats new in Oracle Database 12c release 12.1.0.2Whats new in Oracle Database 12c release 12.1.0.2
Whats new in Oracle Database 12c release 12.1.0.2Connor McDonald
 
Presentation big dataappliance-overview_oow_v3
Presentation   big dataappliance-overview_oow_v3Presentation   big dataappliance-overview_oow_v3
Presentation big dataappliance-overview_oow_v3xKinAnx
 
Oracle big data appliance and solutions
Oracle big data appliance and solutionsOracle big data appliance and solutions
Oracle big data appliance and solutionssolarisyougood
 
44spotkaniePLSSUGWRO_CoNowegowKrainieChmur
44spotkaniePLSSUGWRO_CoNowegowKrainieChmur44spotkaniePLSSUGWRO_CoNowegowKrainieChmur
44spotkaniePLSSUGWRO_CoNowegowKrainieChmurTobias Koprowski
 
Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Kent Graziano
 
Big Data Developers Moscow Meetup 1 - sql on hadoop
Big Data Developers Moscow Meetup 1  - sql on hadoopBig Data Developers Moscow Meetup 1  - sql on hadoop
Big Data Developers Moscow Meetup 1 - sql on hadoopbddmoscow
 
MongoDB Administration 101
MongoDB Administration 101MongoDB Administration 101
MongoDB Administration 101MongoDB
 
Paradigmas de procesamiento en Big Data: estado actual, tendencias y oportu...
Paradigmas de procesamiento en  Big Data: estado actual,  tendencias y oportu...Paradigmas de procesamiento en  Big Data: estado actual,  tendencias y oportu...
Paradigmas de procesamiento en Big Data: estado actual, tendencias y oportu...Facultad de Informática UCM
 
Big Data Paris : Hadoop and NoSQL
Big Data Paris : Hadoop and NoSQLBig Data Paris : Hadoop and NoSQL
Big Data Paris : Hadoop and NoSQLTugdual Grall
 
Denodo Partner Connect: Technical Webinar - Ask Me Anything
Denodo Partner Connect: Technical Webinar - Ask Me AnythingDenodo Partner Connect: Technical Webinar - Ask Me Anything
Denodo Partner Connect: Technical Webinar - Ask Me AnythingDenodo
 
NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013Facundo Farias
 
Node.js and the MySQL Document Store
Node.js and the MySQL Document StoreNode.js and the MySQL Document Store
Node.js and the MySQL Document StoreRui Quelhas
 
Survey of the Microsoft Azure Data Landscape
Survey of the Microsoft Azure Data LandscapeSurvey of the Microsoft Azure Data Landscape
Survey of the Microsoft Azure Data LandscapeIke Ellis
 
SQL To NoSQL - Top 6 Questions Before Making The Move
SQL To NoSQL - Top 6 Questions Before Making The MoveSQL To NoSQL - Top 6 Questions Before Making The Move
SQL To NoSQL - Top 6 Questions Before Making The MoveIBM Cloud Data Services
 
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio JourneyModernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio JourneyAlluxio, Inc.
 
What's New in Apache Hive 3.0?
What's New in Apache Hive 3.0?What's New in Apache Hive 3.0?
What's New in Apache Hive 3.0?DataWorks Summit
 
What's New in Apache Hive 3.0 - Tokyo
What's New in Apache Hive 3.0 - TokyoWhat's New in Apache Hive 3.0 - Tokyo
What's New in Apache Hive 3.0 - TokyoDataWorks Summit
 

Similar to Hadoop & no sql new generation database systems (20)

Presto @ Zalando - Big Data Tech Warsaw 2020
Presto @ Zalando - Big Data Tech Warsaw 2020Presto @ Zalando - Big Data Tech Warsaw 2020
Presto @ Zalando - Big Data Tech Warsaw 2020
 
Whats new in Oracle Database 12c release 12.1.0.2
Whats new in Oracle Database 12c release 12.1.0.2Whats new in Oracle Database 12c release 12.1.0.2
Whats new in Oracle Database 12c release 12.1.0.2
 
Presentation big dataappliance-overview_oow_v3
Presentation   big dataappliance-overview_oow_v3Presentation   big dataappliance-overview_oow_v3
Presentation big dataappliance-overview_oow_v3
 
Oracle big data appliance and solutions
Oracle big data appliance and solutionsOracle big data appliance and solutions
Oracle big data appliance and solutions
 
44spotkaniePLSSUGWRO_CoNowegowKrainieChmur
44spotkaniePLSSUGWRO_CoNowegowKrainieChmur44spotkaniePLSSUGWRO_CoNowegowKrainieChmur
44spotkaniePLSSUGWRO_CoNowegowKrainieChmur
 
Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)
 
Big Data Developers Moscow Meetup 1 - sql on hadoop
Big Data Developers Moscow Meetup 1  - sql on hadoopBig Data Developers Moscow Meetup 1  - sql on hadoop
Big Data Developers Moscow Meetup 1 - sql on hadoop
 
MongoDB Administration 101
MongoDB Administration 101MongoDB Administration 101
MongoDB Administration 101
 
Paradigmas de procesamiento en Big Data: estado actual, tendencias y oportu...
Paradigmas de procesamiento en  Big Data: estado actual,  tendencias y oportu...Paradigmas de procesamiento en  Big Data: estado actual,  tendencias y oportu...
Paradigmas de procesamiento en Big Data: estado actual, tendencias y oportu...
 
Big Data Paris : Hadoop and NoSQL
Big Data Paris : Hadoop and NoSQLBig Data Paris : Hadoop and NoSQL
Big Data Paris : Hadoop and NoSQL
 
Oracle OpenWo2014 review part 03 three_paa_s_database
Oracle OpenWo2014 review part 03 three_paa_s_databaseOracle OpenWo2014 review part 03 three_paa_s_database
Oracle OpenWo2014 review part 03 three_paa_s_database
 
Denodo Partner Connect: Technical Webinar - Ask Me Anything
Denodo Partner Connect: Technical Webinar - Ask Me AnythingDenodo Partner Connect: Technical Webinar - Ask Me Anything
Denodo Partner Connect: Technical Webinar - Ask Me Anything
 
NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013
 
Node.js and the MySQL Document Store
Node.js and the MySQL Document StoreNode.js and the MySQL Document Store
Node.js and the MySQL Document Store
 
Survey of the Microsoft Azure Data Landscape
Survey of the Microsoft Azure Data LandscapeSurvey of the Microsoft Azure Data Landscape
Survey of the Microsoft Azure Data Landscape
 
SQL To NoSQL - Top 6 Questions Before Making The Move
SQL To NoSQL - Top 6 Questions Before Making The MoveSQL To NoSQL - Top 6 Questions Before Making The Move
SQL To NoSQL - Top 6 Questions Before Making The Move
 
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio JourneyModernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
 
What's New in Apache Hive 3.0?
What's New in Apache Hive 3.0?What's New in Apache Hive 3.0?
What's New in Apache Hive 3.0?
 
What's New in Apache Hive 3.0 - Tokyo
What's New in Apache Hive 3.0 - TokyoWhat's New in Apache Hive 3.0 - Tokyo
What's New in Apache Hive 3.0 - Tokyo
 
Data
DataData
Data
 

Recently uploaded

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 

Recently uploaded (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 

Hadoop & no sql new generation database systems

  • 1. This document is intended for only AVEA İletişim Hizmetleri A.Ş.("AVEA"), its dealers, employees and/or others specifically authorised. The contents of this document are confidential and any disclosure, copying, distribution and/or taking any action in reliance with the content of this document is prohibited. AVEA is not liable for the transmission of this document in any manner to any third parties that are not authorised to receive. Hadoop & NoSQL New Generation Database Systems Ramazan FIRIN 22.04.2014
  • 2. 2 AGENDA • Big Data • Hadoop • NoSQL • Graph DB and Neoj • Possible Usage in Tellco • Demo
  • 3. 3 Executive Summary AVEA • Big Data is a new IT trend • Hadoop and NoSQL can used to process Big Data • Possible usage area in Tellco : - Prevent Churn - to offer customer spesific campaign - to get more customer
  • 4. 4 Big Bang = Big Data Big Bang Big Data 42008-07-01_Presentation Template MBT / CEOMercedes-Benz Türk A.Ş.
  • 5. 5 What is Big Data? Datasets that are too awkward to work with using traditional, hands-ondatabase management tools.
  • 6. 6 Big Data- 3V Concept
  • 7. 7 Big Data To Smart Data Cover of The Economist
  • 8. 8 Big Data Sources 1. Social network profiles -Facebook, LinkedIn, Yahoo, Google 2. Social influencers - blog comments, user forums, review sites, 3. Activity-generated data - application logs, sensor data 4. Public—Wikipedia, IMDb, etc 5. Data warehouse appliances - transactional data 6. Network and in-stream monitoring 7. Legacy documents—
  • 10. 10 Sample Usage - 360°Degree View of the Customers
  • 11. 11 Big Data Solutions – Oracle Big Data Appliance
  • 12. 12 Big Data Solutions – IBM Pure Data
  • 13. 13 Storage for Big Data 13 İf we cant use relational Database, how can we store it? 1)Hadoop 2)NoSQL
  • 14. 14 What is HADOOP? The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models
  • 18. 18 Hadoop Ecosystem Pig - simplifies hadoop programming, data processing language Hive - SQL like queries HBase - Random read/write, billions of row and millions of colums (NoSQL)
  • 22. 22 What is NoSQL? • Stands for Not Only SQL • Non relational • Cheap, Easy to implement • Scalability – Vertically - Add more data – Horizontally - Add more storage • No pre-defined schema • No join operations • Not ACID, support CAP threom
  • 24. 24 Redis Features • Data Types • Publish / Subscribe • Transactions • Replication • Persistence • Partition 24
  • 25. 25 Redis Datatypes • String • List • Sets • Sorted Sets • Hashes 25
  • 26. 26 Redis persistance • RDB - Take snapshot in an interval Fast may loss several minutes data if kill -9 • • AOF – Log for all operations Still fast enough may loss 1 second data if kill -9 26
  • 27. 27 Redis Commands $ redis-cli set counter 100 OK $ redis-cli incr counter (integer) 101 $ redis-cli incr counter (integer) 102 $ redis-cli incrby counter 10 (integer) 112 SET : SADD, GET : SPOP, SRANDMEMBER, SMEMBERS DEL : SREM ETC : SINTER, SUNION, SCARD, SDIFF, SMOVE, SISMEMBER 27
  • 28. 28 Redis Commands – Lists $redis-cli rpush messages "Hello how are you?" OK $ redis-cli rpush messages "Fine thanks. I'm having fun with Redis" OK $ redis-cli rpush messages "I should look into this NOSQL thing ASAP" OK $ redis-cli lrange messages 0 2 1. Hello how are you? 2. 2. Fine thanks. I'm having fun with Redis 3. 3. I should look into this NOSQL thing ASAP • Chat systems • Paginations... 28
  • 29. 29 Redis – Publish/Subscribe redis 127.0.0.1:6379> PUBLISH myradioshow "Good morning everyone!" (integer) 0 redis 127.0.0.1:6379> PUBLISH myradioshow "How ya'll doin tonight?" (integer) 0 redis 127.0.0.1:6379> PUBLISH myradioshow "Hello? Is anyone listening? I'm not wearing pants." (integer) 0 redis 127.0.0.1:6379> SUBSCRIBE myradioshow Reading messages... (press Ctrl-C to quit) 1) "subscribe" 2) 2) "myradioshow" 29
  • 31. 31 MongoDB Features • JSON / BSON support • RestFul support • CRUD operations • Queries like SQL • İndexing • Auto sharding • Built in replication and high availabity • Aggregation framework 31
  • 34. 34 MondoDB vs SQL 34 SQL MongoDB SELECT * FROM users db.users.find() SELECT id, user_id, status FROM users db.users.find( { }, { user_id: 1, status: 1 } ) SELECT * FROM users WHERE status = "A" db.users.find( { status: "A" } ) SELECT user_id, status FROM users WHERE status = "A" db.users.find( { status: "A" }, { user_id: 1, status: 1, _id: 0 } ) SELECT * FROM users WHERE user_id like "%bc%" db.users.find( { user_id: /bc/ } ) SELECT * FROM users WHERE status = "A" ORDER BY user_id ASC db.users.find( { status: "A" } ).sort( { user_id: 1 } ) SELECT * FROM users LIMIT 5 SKIP 10 db.users.find().limit(5).skip(10)
  • 36. 36 Cassandra Features • Proven • Rich Data Model • Scalable • Distributed & Decentralized • High Performance read/write • Fault Tolerance • No SPOF • Schema free 36
  • 40. 40 Consistency Level • ANY • ONE • TWO • THREE • QUORUM • LOCAL_QUORUM • EACH_QUORUM • ALL 40
  • 41. 41 RMDBS Support ACID • Atomicity - a transaction is all or nothing • Consistency - only valid data is written to the database • Isolation - pretend all transactions are happening serially and the data is correct • Durability - what you write is what you get
  • 42. 42 NoSQL Support CAP Threom Consistency : all nodes give the same answer Avaibility : nodes always give answer and accept updates Partitioning: system continuos working if some nodes go quite
  • 43. 43 Visual Guide to NoSQL Systems 43
  • 44. 44 Graph Database - Neo4J, InfoGrid, Infinite Graph
  • 45. 45 Graph DB Graph database uses graph structures with nodes, edges, and properties to represent and store data.
  • 47. 47 Graph DB Usage Area • Recommendations • Business Inteligence • Social networking • MDM • System Management • Time Series data • Product Catalogue • Web Analitics • Scientific Computing • Indexing your slow RMDBS
  • 49. 49 Neo4j • Leading Graph Database • Transaction support (ACID) • Indexing • Querying • REST support • Disk Based • Opensource • Traversal framework • High Performance (traverse 1.000.000 + relationship/seconds) • Robust (in 7/24 operation since 2003) • Massive scalability
  • 50. 50 Neo4j Data Model Neo4j has Nodes and Relationship. Nodes and realtionships have properties. Node1 Node2 Property:name Property:surname Property:name Property:surname Relationship Relationship type : knows Property : Date of meeting
  • 54. 54 Who use Neo4j? • Cisco - Master Data Management • Telenor Group : Customer organization scructure (203 million subscribers ) • Deutsche Telekom: Social football site (150 million subscribers )
  • 55. 55 Orient DB • The Document-Graph database • ACID support • SQL and Native Queries, • schema-less, schema-full and schema-mixed modes • Roles + Security • Functions • HTTP / Restfull / Json / Binary supports • Hooks • Fetch plans • Inheritance • 200.000 insert per second(6 M node travels with cache)
  • 56. 56 FluxGraph • Temporal Graph Database • Has checkpoint • Compatible with Neo4j 562008-07-01_Presentation Template MBT / CEOMercedes-Benz Türk A.Ş.
  • 58. 58 CDR Analysis by Graph 58
  • 61. 61 NoSQL Usage • Cisco is building a master data management system based on Neo4j, and this is actually our first Fortune 500 customer. They found us about two years ago when they tried to build this big, complex hierarchy inside of Oracle RAC. In Oracle RAC, they had response time in minutes, and then when they replaced it [with] Neo4j, they had response times in milliseconds. Emil Eifrem – Neo4j CEO • NHS tears out its Oracle Spine in favour of open source http://www.theregister.co.uk/2013/10/10/nhs_drops_oracle_for_riak/ • AMD: Why we had to evacuate 276TB from Oracle DB to Hadoop http://www.theregister.co.uk/2014/03/24/amd_hadoop_migration/ 61
  • 63. 63 Magic Quadrant for Operational Database Management Systems 63
  • 67. 67 Use of NoSQL products 67