SlideShare a Scribd company logo
1 of 42
Oracle Database 12c
Features for Big Data
Disclaimer : The information presented here is based on my views, and information gathered from online sources, the
presentation is only to create an awareness about the features and does not describe a real solution.
Presented by Abishek V S
Agenda
• What is Big Data
• Big Data Versus RDBMS
• Oracle In-Memory Column Store
• JSON support in Oracle Database
• Oracle Database And Hadoop
What is Big Data
What is Big Data
Big data is simply data that breaks traditional
architectures due to its sheer volume, speed and
variety.
Structured
Unstructured
Semi-Structured
Multiple Sources
Large Volumes
Characterization of Big Data
 Volume
 Variety
 Velocity
From “Understanding Big Data” by IBM
Veracity, Validity, Volatility
Characterization of Big Data
From the dawn of civilization until
2003, humankind generated five
exabytes of data. Now we produce
five exabytes every two days…and
the pace is accelerating.
Eric Schmidt,
Executive Chairman, Google
Characterization of Big Data
Characterization of Big Data
Big Data: Driving Factors & Motivation
• Exponential growth of the internet
• Widespread acceptance of E-Commerce
• Growth of the Social Network
• Commoditization of the computing resources
• Per GB cost of storage is more affordable now than 10
years back.
• Commodity computers have become more powerful.
• Popularity of clusters based on commodity computers
• IoT (Internet of Things)
– Day by day the devices we own are getting smarter
and are learning about us.
• Distributed computing
– Distributed Servers and Storage (Cloud based)
– Distributed processing Eg : MapReduce with Hadoop
• Schema Free Databases
– NoSQL Database
• In-memory
• Semi Structures
– JSON
– Key, Value pairs
• Columnar databases
• Big Data Operations
• Analytic / Semantic Processing (e.g. R, OWLIM)
Big Data: Technologies and Tools
Big Data Versus RDBMS
Big Data versus RDBMS
• RDBMS
– Data is stored in defined structures (tables)
– Transactional in nature
– Data consistency is a primary consideration
– Drives operational systems
– Response time is crucial
• Big Data
– Data comes in all shapes and sizes
– Behavioral Data
– Prone to rapid change
– Useful in VAS, identifying patterns not exposed by Operational
systems
– The value derived is of prime importance.
Big Data versus RDBMS
RDBMS
Captures Business Transactions
Ensures Operational Efficiency
Operational Decision support
Analytics is very limited
Integrating external data is expensive
ERP, BI, ETL, Data warehouse
Big Data
Captures User behavioral data
System logs, social data
Acts as Feedback to business
New opportunity exploration
Analytics is the key focus
Technology aims at integration.
User activity log, Web Analytics,
Social Media Streaming API, Hadoop
Map Reduce, NoSQL data store
optimized for Analytics
Big Data versus RDBMS
Big Data
RDBMS
Oracle In-Memory Column Store
Oracle In-Memory Column Store
• A column format database stores each of the attributes
about a transaction or record in a separate column
structure
• A column format is ideal for analytics, as it allows for
faster data retrieval when only a few columns are
selected but the query accesses a large portion of the
data set.
• A column format is not so efficient at processing row
wise DML: In order to insert or delete a single record in
a column format all of the columnar structures in the
table must be changed.
• Up until now you have been forced to pick just one
format and suffer the tradeoff of either suboptimal OLTP
or sub-optimal analytics performance.
Oracle In-Memory Column Store
Oracle Database In-Memory provides best of both worlds
The in-memory column format store cache should be sized to fit the objects that
must be stored in memory.
Less than 20% overhead in terms of total memory requirements.
Database In-Memory uses an In-Memory column store (IM column store), which is
a new component of the Oracle Database System Global Area (SGA), called the In-
Memory Area (INMEMORY_SIZE).
Oracle In-Memory Column Store
• Tablespace Level
– ALTER TABLESPACE ts_data INMEMORY;
• Table Level
– ALTER TABLE sales INMEMORY NO INMEMORY(prod_id);
• Partition Level
– ALTER TABLE sales MODIFY PARTITION SALES_Q1_1998 NO INMEMORY;
• Objects are populated into the IM column store either in a prioritized list immediately after the
database is opened or after they are scanned (queried) for the first time.
– ALTER TABLE customers INMEMORY PRIORITY CRITICAL;
Oracle In-Memory Column Store
• In-Memory Compression
• Typically compression is considered only as a space-saving mechanism.
However, data populated into the IM column store is compressed using a
new set of compression algorithms that not only help save space but also
improve query performance
Oracle In-Memory Column Store
• In-Memory Scans
– Analytic queries typically reference only a small subset of the columns in a table.
– Oracle Database InMemory scans only the columns needed by a SQL, and applies any
WHERE clause filter predicates to these columns directly without decompressing them.
• In-Memory Storage Index
– A further reduction in the amount of data accessed
– Automatically created and maintained on each of the columns in the IM column store.
– Storage Indexes allow data pruning based on the filter predicates in a SQL statement.
• SIMD Vector Processing
– Database In-Memory uses SIMD (Single Instruction processing Multiple Data values) vector
processing
– SIMD vector processing allows a set of column values to be evaluated together in a single
CPU instruction.
• In-Memory Joins
– SQL statements that join multiple tables can also be processed very efficiently in the IM
column store as they can take advantage of Bloom Filters.
• A Bloom filter transforms a join into a filter that can be applied as part of the scan of the larger table.
• In-Memory Aggregation
– Analytic style queries often require complex aggregations and summaries.
– A new optimizer transformation, called Vector Group By, has been introduced with Oracle
Database 12.1.0.2 to ensure more complex analytic queries can be processed using new
CPU-efficient algorithms.
Oracle In-Memory Column Store
JSON support in Oracle Database
JSON support in Oracle Database
• JSON (Java Script Object Notation) is a fast-
growing data type often used in web and mobile
applications.
• JSON is also used as a data interchange format
– More lightweight
– Bandwidth-non-intensive
• JSON integrates into web pages as javascript can
directly inherit a JSON
JSON support in Oracle Database
• JSON is gaining popularity
– APIs (application programming interfaces)
• Most Social network providers provide JSON based data services
API.
• Webservices : RESTful (Representative state transfer)
– Big Data
• Many NoSQL databases use JSON as the storage format
– MongoDB, CouchDB, and Riak
– Internet of Things (IoT)
• With more personal devices and appliances getting smart and
hooking up to the internet, JSON is becoming the choice of use as it
is lightweight and better adaptable to these devices.
JSON support in Oracle Database
• JSON in Oracle Database 12c R1 (12.1.0.2)
– Creating Tables to Hold JSON
– Querying JSON Data
• Dot Notation
• IS JSON
• JSON_EXISTS
• JSON_VALUE
• JSON_QUERY
• JSON_TABLE
• JSON_TEXTCONTAINS
– Identifying Columns Containing JSON
– Loading JSON Files Using External Tables
JSON support in Oracle Database
• Creating Tables to Hold JSON
– No new data type has been added to support JSON. Instead, it is stored
in regular VARCHAR2 or CLOB columns.
– The IS JSON constraint indicates the column contains valid JSON data.
CREATE TABLE json_documents (
id RAW(16) NOT NULL,
data CLOB,
CONSTRAINT json_documents_pk PRIMARY KEY (id),
CONSTRAINT json_documents_json_chk CHECK (data IS JSON)
);
Lax or Strict checking “(data is JSON(Strict))”
– The [USER|ALL|DBA]_JSON_COLUMNS views can be used to identify
tables and columns containing JSON data.
INSERT INTO json_documents (id, data)
VALUES (SYS_GUID(),
'{
"FirstName" : "John",
"LastName" : "Doe",
"Job" : "Clerk",
"Address" : {
"Street" : "99 My Street",
"City" : "My City",
"Country" : "UK",
"Postcode" : "A12 34B"
},
"ContactDetails" : {
"Email" : "john.doe@example.com",
"Phone" : "44 123 123456",
"Twitter" : "@johndoe"
},
"DateOfBirth" : "01-JAN-1980",
"Active" : true
}');
COLUMN FirstName FORMAT A15
COLUMN LastName FORMAT A15
COLUMN Postcode FORMAT A10
COLUMN Email FORMAT A25
SELECT a.data.FirstName,
a.data.LastName,
a.data.Address.Postcode AS Postcode,
a.data.ContactDetails.Email AS Email
FROM json_documents a
ORDER BY a.data.FirstName,
a.data.LastName;
FIRSTNAME LASTNAME POSTCODE EMAIL
--------------- --------------- ---------- -------------------------
Jayne Doe A12 34B jayne.doe@example.com
John Doe A12 34B john.doe@example.com
• IS JSON
– The IS JSON condition can be used to test if a column contains JSON data.
• SELECT JSON_VALUE(a.data, '$.FirstName') AS first_name FROM json_documents_no_constraint a WHERE a.data IS JSON;
• JSON_EXISTS
– Similar to IS NULL, checks if an element has a value
• JSON_VALUE
– Returns an element from the JSON document, based on the specified JSON
path.
• JSON_QUERY
– The JSON_QUERY function returns a JSON fragment representing one or more
values.
• JSON_TABLE
– The JSON_TABLE function incorporates all the functionality of JSON_VALUE,
JSON_EXISTS and JSON_QUERY.
– JSON_TABLE is used for making JSON data look like relational data, which is
especially useful when creating relational views over JSON data,
• JSON_TEXTCONTAINS
– Works with JSON indexes and enables faster text searching through the JSON
data.
JSON support in Oracle Database
Loading JSON Files Using External Tables
• Create the directory objects for use with the external table.
CREATE OR REPLACE DIRECTORY order_entry_dir
AS '/u01/app/oracle/product/12.1.0.2/db_1/demo/schema/order_entry';
GRANT READ, WRITE ON DIRECTORY order_entry_dir TO test;
CREATE OR REPLACE DIRECTORY loader_output_dir AS '/tmp';
GRANT READ, WRITE ON DIRECTORY loader_output_dir TO test;
• Create the external table and query it to check if it is working.
CREATE TABLE json_dump_file_contents (json_document CLOB)
ORGANIZATION EXTERNAL (TYPE ORACLE_LOADER DEFAULT DIRECTORY order_entry_dir
ACCESS PARAMETERS (RECORDS DELIMITED BY 0x'0A'
DISABLE_DIRECTORY_LINK_CHECK
BADFILE loader_output_dir: 'JSONDumpFile.bad'
LOGFILE order_entry_dir: 'JSONDumpFile.log'
FIELDS (json_document CHAR(5000)))
LOCATION (order_entry_dir:'PurchaseOrders.dmp'))
PARALLEL
REJECT LIMIT UNLIMITED;
JSON support in Oracle Database
SELECT COUNT(*) FROM json_dump_file_contents;
COUNT(*)
----------
10000
• You can now load the database table with the contents of the external table.
TRUNCATE TABLE json_documents;
INSERT /*+ APPEND */ INTO json_documents
SELECT SYS_GUID(), json_document
FROM json_dump_file_contents
WHERE json_document IS JSON;
COMMIT;
Oracle Database And Hadoop
Oracle Database And Hadoop
• Big Data Discussion is incomplete without the mention of Hadoop
• Hadoop is a distributed computing framework
• Runs Batch operations(MapReduce) on distributed clusters made of
commodity computers.
• Stores data in a distributed clustered filesystem
• Hadoop clusters are a shared nothing paradigm
Oracle Database And Hadoop
• MapReduce Paradigm
Oracle Database And Hadoop
• In-Database MapReduce
• Avoid Shipping of data residing in RDBMS to an external
infrastructure
• Database security can be applied to the processed data.
• Shorter learning curve for both Developers and DBAs
• Mix SQL with MapReduce processing for flexibility and
efficiency
• Uses PL/SQL or Java Pipe-Lined Functions
INSERT INTO OUTTABLE
SELECT * FROM TABLE
(Word_Count_Reduce (:ConfKey,
CURSOR(SELECT * FROM TABLE
(Word_Cursor_Map(:ConfKey,
CURSOR(SELECT * FROM InTable)))))) ;
Oracle Database And Hadoop
• Pipelined Functions : Can either return a stream of rows or take it
as input too.
• Can be Parallelized with a partition key
• Implemented using PL/SQL, Java or C
• Contains 2 Pipelined Functions, one for mapper the other for
reducer.
• Further the mapper input source could be an external table, and the
reducer output may be placed in a DB table or further sent out to
filesystem file.
• Can leverage external tables, DBFS, use Java or C to write to files.
• The opportunities are endless when coupled with other DB features
and options.
• DB Scheduler can be used to schedule the mapreduce
• Clustered with distributed databases using DBLinks
• Add fault tolerance and scalability with RAC.
Oracle Database And Hadoop
• Oracle In-Database Hadoop
• We will look at this in a future discussion …
Oracle Database And Hadoop
The Road Ahead
• Big Data/NoSQL databases WILL NOT replace
RDBMS databases.
• Oracle’s Roadmap has been Single Vendor
Solutions.
• Reusing available resources : Both technology
and human resource.
• Oracle is building more Appliance based
solutions.
The Road Ahead
• Oracle Big Data Products.
– Oracle Big Data Management
• Oracle Big Data Appliance
• Oracle Big Data SQL
• Oracle NoSQL Database
– Oracle Big Data Integration
• Oracle GoldenGate
• Oracle Data Integration
• Oracle Event Processing
– Big Data Analytics
• Oracle Big Data Discovery
• Oracle Advanced Analytics
• Oracle Business Intelligence Foundation
Please mail me at abishek.vidyashanker@in.unisys.com
Oracle Database 12c - Features for Big Data

More Related Content

What's hot

SQL in the Hybrid World
SQL in the Hybrid WorldSQL in the Hybrid World
SQL in the Hybrid WorldTanel Poder
 
Oracle performance tuning_sfsf
Oracle performance tuning_sfsfOracle performance tuning_sfsf
Oracle performance tuning_sfsfMao Geng
 
Connecting Hadoop and Oracle
Connecting Hadoop and OracleConnecting Hadoop and Oracle
Connecting Hadoop and OracleTanel Poder
 
Oracle db performance tuning
Oracle db performance tuningOracle db performance tuning
Oracle db performance tuningSimon Huang
 
SQL Server Tuning to Improve Database Performance
SQL Server Tuning to Improve Database PerformanceSQL Server Tuning to Improve Database Performance
SQL Server Tuning to Improve Database PerformanceMark Ginnebaugh
 
Analyzing and Interpreting AWR
Analyzing and Interpreting AWRAnalyzing and Interpreting AWR
Analyzing and Interpreting AWRpasalapudi
 
Collaborate 2019 - How to Understand an AWR Report
Collaborate 2019 - How to Understand an AWR ReportCollaborate 2019 - How to Understand an AWR Report
Collaborate 2019 - How to Understand an AWR ReportAlfredo Krieg
 
Crating a Robust Performance Strategy
Crating a Robust Performance StrategyCrating a Robust Performance Strategy
Crating a Robust Performance StrategyGuatemala User Group
 
Database Automation with MySQL Triggers and Event Schedulers
Database Automation with MySQL Triggers and Event SchedulersDatabase Automation with MySQL Triggers and Event Schedulers
Database Automation with MySQL Triggers and Event SchedulersAbdul Rahman Sherzad
 
Oracle Database 12c Release 2 - New Features On Oracle Database Exadata Expr...
Oracle Database 12c Release 2 - New Features On Oracle Database Exadata  Expr...Oracle Database 12c Release 2 - New Features On Oracle Database Exadata  Expr...
Oracle Database 12c Release 2 - New Features On Oracle Database Exadata Expr...Alex Zaballa
 
Oracle Database Performance Tuning: The Not SQL Option
Oracle Database Performance Tuning: The Not SQL OptionOracle Database Performance Tuning: The Not SQL Option
Oracle Database Performance Tuning: The Not SQL OptionGuatemala User Group
 
Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...
Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...
Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...Leighton Nelson
 
PDB Provisioning with Oracle Multitenant Self Service Application
PDB Provisioning with Oracle Multitenant Self Service ApplicationPDB Provisioning with Oracle Multitenant Self Service Application
PDB Provisioning with Oracle Multitenant Self Service ApplicationLeighton Nelson
 
Ten query tuning techniques every SQL Server programmer should know
Ten query tuning techniques every SQL Server programmer should knowTen query tuning techniques every SQL Server programmer should know
Ten query tuning techniques every SQL Server programmer should knowKevin Kline
 
Awr + 12c performance tuning
Awr + 12c performance tuningAwr + 12c performance tuning
Awr + 12c performance tuningAiougVizagChapter
 
Oracle Database Performance Tuning Concept
Oracle Database Performance Tuning ConceptOracle Database Performance Tuning Concept
Oracle Database Performance Tuning ConceptChien Chung Shen
 
Simplifying EBS 12.2 ADOP - Collaborate 2019
Simplifying EBS 12.2 ADOP - Collaborate 2019   Simplifying EBS 12.2 ADOP - Collaborate 2019
Simplifying EBS 12.2 ADOP - Collaborate 2019 Alfredo Krieg
 
Remote DBA Experts 11g Features
Remote DBA Experts 11g FeaturesRemote DBA Experts 11g Features
Remote DBA Experts 11g FeaturesRemote DBA Experts
 

What's hot (20)

SQL in the Hybrid World
SQL in the Hybrid WorldSQL in the Hybrid World
SQL in the Hybrid World
 
Oracle performance tuning_sfsf
Oracle performance tuning_sfsfOracle performance tuning_sfsf
Oracle performance tuning_sfsf
 
Connecting Hadoop and Oracle
Connecting Hadoop and OracleConnecting Hadoop and Oracle
Connecting Hadoop and Oracle
 
Oracle db performance tuning
Oracle db performance tuningOracle db performance tuning
Oracle db performance tuning
 
SQL Server Tuning to Improve Database Performance
SQL Server Tuning to Improve Database PerformanceSQL Server Tuning to Improve Database Performance
SQL Server Tuning to Improve Database Performance
 
Analyzing and Interpreting AWR
Analyzing and Interpreting AWRAnalyzing and Interpreting AWR
Analyzing and Interpreting AWR
 
Collaborate 2019 - How to Understand an AWR Report
Collaborate 2019 - How to Understand an AWR ReportCollaborate 2019 - How to Understand an AWR Report
Collaborate 2019 - How to Understand an AWR Report
 
Crating a Robust Performance Strategy
Crating a Robust Performance StrategyCrating a Robust Performance Strategy
Crating a Robust Performance Strategy
 
Database Automation with MySQL Triggers and Event Schedulers
Database Automation with MySQL Triggers and Event SchedulersDatabase Automation with MySQL Triggers and Event Schedulers
Database Automation with MySQL Triggers and Event Schedulers
 
Oracle Database 12c Release 2 - New Features On Oracle Database Exadata Expr...
Oracle Database 12c Release 2 - New Features On Oracle Database Exadata  Expr...Oracle Database 12c Release 2 - New Features On Oracle Database Exadata  Expr...
Oracle Database 12c Release 2 - New Features On Oracle Database Exadata Expr...
 
Performance tuning in sql server
Performance tuning in sql serverPerformance tuning in sql server
Performance tuning in sql server
 
Oracle Database Performance Tuning: The Not SQL Option
Oracle Database Performance Tuning: The Not SQL OptionOracle Database Performance Tuning: The Not SQL Option
Oracle Database Performance Tuning: The Not SQL Option
 
Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...
Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...
Oracle Enteprise Manager Cloud Control 12c - Setting Up Metrics and Monitorin...
 
PDB Provisioning with Oracle Multitenant Self Service Application
PDB Provisioning with Oracle Multitenant Self Service ApplicationPDB Provisioning with Oracle Multitenant Self Service Application
PDB Provisioning with Oracle Multitenant Self Service Application
 
Ten query tuning techniques every SQL Server programmer should know
Ten query tuning techniques every SQL Server programmer should knowTen query tuning techniques every SQL Server programmer should know
Ten query tuning techniques every SQL Server programmer should know
 
Awr + 12c performance tuning
Awr + 12c performance tuningAwr + 12c performance tuning
Awr + 12c performance tuning
 
Oracle Database Performance Tuning Concept
Oracle Database Performance Tuning ConceptOracle Database Performance Tuning Concept
Oracle Database Performance Tuning Concept
 
Simplifying EBS 12.2 ADOP - Collaborate 2019
Simplifying EBS 12.2 ADOP - Collaborate 2019   Simplifying EBS 12.2 ADOP - Collaborate 2019
Simplifying EBS 12.2 ADOP - Collaborate 2019
 
Avoid boring work_v2
Avoid boring work_v2Avoid boring work_v2
Avoid boring work_v2
 
Remote DBA Experts 11g Features
Remote DBA Experts 11g FeaturesRemote DBA Experts 11g Features
Remote DBA Experts 11g Features
 

Viewers also liked

Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonCapgemini
 
2016 VLDB - Messing Up with Bart: Error Generation for Evaluating Data-Cleani...
2016 VLDB - Messing Up with Bart: Error Generation for Evaluating Data-Cleani...2016 VLDB - Messing Up with Bart: Error Generation for Evaluating Data-Cleani...
2016 VLDB - Messing Up with Bart: Error Generation for Evaluating Data-Cleani...Boris Glavic
 
VLDB Administration Strategies
VLDB Administration StrategiesVLDB Administration Strategies
VLDB Administration StrategiesMurilo Miranda
 
Leveraging Big Data with Hadoop, NoSQL and RDBMS
Leveraging Big Data with Hadoop, NoSQL and RDBMSLeveraging Big Data with Hadoop, NoSQL and RDBMS
Leveraging Big Data with Hadoop, NoSQL and RDBMSAerospike, Inc.
 
Paradigmas de Procesamiento en Big Data: Arquitecturas y Tecnologías aplicadas
Paradigmas de Procesamiento en Big Data: Arquitecturas y Tecnologías aplicadasParadigmas de Procesamiento en Big Data: Arquitecturas y Tecnologías aplicadas
Paradigmas de Procesamiento en Big Data: Arquitecturas y Tecnologías aplicadasBig-Data-Summit
 
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15Dave Segleau
 
Oracle NoSQL Database release 3.0 overview
Oracle NoSQL Database release 3.0 overviewOracle NoSQL Database release 3.0 overview
Oracle NoSQL Database release 3.0 overviewDave Segleau
 
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop systemToby Woolfe
 
Best New Features of Oracle Database 12c
Best New Features of Oracle Database 12cBest New Features of Oracle Database 12c
Best New Features of Oracle Database 12cPini Dibask
 
Engineering practices in big data storage and processing
Engineering practices in big data storage and processingEngineering practices in big data storage and processing
Engineering practices in big data storage and processingSchubert Zhang
 

Viewers also liked (12)

Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
2016 VLDB - Messing Up with Bart: Error Generation for Evaluating Data-Cleani...
2016 VLDB - Messing Up with Bart: Error Generation for Evaluating Data-Cleani...2016 VLDB - Messing Up with Bart: Error Generation for Evaluating Data-Cleani...
2016 VLDB - Messing Up with Bart: Error Generation for Evaluating Data-Cleani...
 
VLDB Administration Strategies
VLDB Administration StrategiesVLDB Administration Strategies
VLDB Administration Strategies
 
Leveraging Big Data with Hadoop, NoSQL and RDBMS
Leveraging Big Data with Hadoop, NoSQL and RDBMSLeveraging Big Data with Hadoop, NoSQL and RDBMS
Leveraging Big Data with Hadoop, NoSQL and RDBMS
 
Paradigmas de Procesamiento en Big Data: Arquitecturas y Tecnologías aplicadas
Paradigmas de Procesamiento en Big Data: Arquitecturas y Tecnologías aplicadasParadigmas de Procesamiento en Big Data: Arquitecturas y Tecnologías aplicadas
Paradigmas de Procesamiento en Big Data: Arquitecturas y Tecnologías aplicadas
 
Oracle's BigData solutions
Oracle's BigData solutionsOracle's BigData solutions
Oracle's BigData solutions
 
BIG DATA and USE CASES
BIG DATA and USE CASESBIG DATA and USE CASES
BIG DATA and USE CASES
 
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
 
Oracle NoSQL Database release 3.0 overview
Oracle NoSQL Database release 3.0 overviewOracle NoSQL Database release 3.0 overview
Oracle NoSQL Database release 3.0 overview
 
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system
 
Best New Features of Oracle Database 12c
Best New Features of Oracle Database 12cBest New Features of Oracle Database 12c
Best New Features of Oracle Database 12c
 
Engineering practices in big data storage and processing
Engineering practices in big data storage and processingEngineering practices in big data storage and processing
Engineering practices in big data storage and processing
 

Similar to Oracle Database 12c - Features for Big Data

Transform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big DataTransform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big DataAshnikbiz
 
Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)Ivo Andreev
 
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...Amazon Web Services
 
Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27Martin Bém
 
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftData warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftAmazon Web Services
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
professional informatica trainer
professional informatica trainerprofessional informatica trainer
professional informatica trainervibrantuser
 
Changing the game with cloud dw
Changing the game with cloud dwChanging the game with cloud dw
Changing the game with cloud dwelephantscale
 
MongoDB: What, why, when
MongoDB: What, why, whenMongoDB: What, why, when
MongoDB: What, why, whenEugenio Minardi
 
Building better SQL Server Databases
Building better SQL Server DatabasesBuilding better SQL Server Databases
Building better SQL Server DatabasesColdFusionConference
 
Storage Systems For Scalable systems
Storage Systems For Scalable systemsStorage Systems For Scalable systems
Storage Systems For Scalable systemselliando dias
 
From lots of reports (with some data Analysis) 
to Massive Data Analysis (Wit...
From lots of reports (with some data Analysis) 
to Massive Data Analysis (Wit...From lots of reports (with some data Analysis) 
to Massive Data Analysis (Wit...
From lots of reports (with some data Analysis) 
to Massive Data Analysis (Wit...Mark Rittman
 
L'architettura di classe enterprise di nuova generazione - Massimo Brignoli
L'architettura di classe enterprise di nuova generazione - Massimo BrignoliL'architettura di classe enterprise di nuova generazione - Massimo Brignoli
L'architettura di classe enterprise di nuova generazione - Massimo BrignoliData Driven Innovation
 
Scaling db infra_pay_pal
Scaling db infra_pay_palScaling db infra_pay_pal
Scaling db infra_pay_palpramod garre
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureJames Serra
 

Similar to Oracle Database 12c - Features for Big Data (20)

DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Transform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big DataTransform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big Data
 
Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)
 
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
 
Serverless SQL
Serverless SQLServerless SQL
Serverless SQL
 
Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27
 
Using Data Lakes
Using Data LakesUsing Data Lakes
Using Data Lakes
 
Nosql data models
Nosql data modelsNosql data models
Nosql data models
 
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftData warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
professional informatica trainer
professional informatica trainerprofessional informatica trainer
professional informatica trainer
 
Changing the game with cloud dw
Changing the game with cloud dwChanging the game with cloud dw
Changing the game with cloud dw
 
No SQL and MongoDB - Hyderabad Scalability Meetup
No SQL and MongoDB - Hyderabad Scalability MeetupNo SQL and MongoDB - Hyderabad Scalability Meetup
No SQL and MongoDB - Hyderabad Scalability Meetup
 
MongoDB: What, why, when
MongoDB: What, why, whenMongoDB: What, why, when
MongoDB: What, why, when
 
Building better SQL Server Databases
Building better SQL Server DatabasesBuilding better SQL Server Databases
Building better SQL Server Databases
 
Storage Systems For Scalable systems
Storage Systems For Scalable systemsStorage Systems For Scalable systems
Storage Systems For Scalable systems
 
From lots of reports (with some data Analysis) 
to Massive Data Analysis (Wit...
From lots of reports (with some data Analysis) 
to Massive Data Analysis (Wit...From lots of reports (with some data Analysis) 
to Massive Data Analysis (Wit...
From lots of reports (with some data Analysis) 
to Massive Data Analysis (Wit...
 
L'architettura di classe enterprise di nuova generazione - Massimo Brignoli
L'architettura di classe enterprise di nuova generazione - Massimo BrignoliL'architettura di classe enterprise di nuova generazione - Massimo Brignoli
L'architettura di classe enterprise di nuova generazione - Massimo Brignoli
 
Scaling db infra_pay_pal
Scaling db infra_pay_palScaling db infra_pay_pal
Scaling db infra_pay_pal
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
 

Recently uploaded

Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknowmakika9823
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxTanveerAhmed817946
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationBoston Institute of Analytics
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 

Recently uploaded (20)

Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project Presentation
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 

Oracle Database 12c - Features for Big Data

  • 1. Oracle Database 12c Features for Big Data Disclaimer : The information presented here is based on my views, and information gathered from online sources, the presentation is only to create an awareness about the features and does not describe a real solution. Presented by Abishek V S
  • 2. Agenda • What is Big Data • Big Data Versus RDBMS • Oracle In-Memory Column Store • JSON support in Oracle Database • Oracle Database And Hadoop
  • 3. What is Big Data
  • 4. What is Big Data Big data is simply data that breaks traditional architectures due to its sheer volume, speed and variety. Structured Unstructured Semi-Structured Multiple Sources Large Volumes
  • 5. Characterization of Big Data  Volume  Variety  Velocity From “Understanding Big Data” by IBM Veracity, Validity, Volatility
  • 6. Characterization of Big Data From the dawn of civilization until 2003, humankind generated five exabytes of data. Now we produce five exabytes every two days…and the pace is accelerating. Eric Schmidt, Executive Chairman, Google
  • 9. Big Data: Driving Factors & Motivation • Exponential growth of the internet • Widespread acceptance of E-Commerce • Growth of the Social Network • Commoditization of the computing resources • Per GB cost of storage is more affordable now than 10 years back. • Commodity computers have become more powerful. • Popularity of clusters based on commodity computers • IoT (Internet of Things) – Day by day the devices we own are getting smarter and are learning about us.
  • 10. • Distributed computing – Distributed Servers and Storage (Cloud based) – Distributed processing Eg : MapReduce with Hadoop • Schema Free Databases – NoSQL Database • In-memory • Semi Structures – JSON – Key, Value pairs • Columnar databases • Big Data Operations • Analytic / Semantic Processing (e.g. R, OWLIM) Big Data: Technologies and Tools
  • 12. Big Data versus RDBMS • RDBMS – Data is stored in defined structures (tables) – Transactional in nature – Data consistency is a primary consideration – Drives operational systems – Response time is crucial • Big Data – Data comes in all shapes and sizes – Behavioral Data – Prone to rapid change – Useful in VAS, identifying patterns not exposed by Operational systems – The value derived is of prime importance.
  • 13. Big Data versus RDBMS RDBMS Captures Business Transactions Ensures Operational Efficiency Operational Decision support Analytics is very limited Integrating external data is expensive ERP, BI, ETL, Data warehouse Big Data Captures User behavioral data System logs, social data Acts as Feedback to business New opportunity exploration Analytics is the key focus Technology aims at integration. User activity log, Web Analytics, Social Media Streaming API, Hadoop Map Reduce, NoSQL data store optimized for Analytics
  • 14. Big Data versus RDBMS Big Data RDBMS
  • 16. Oracle In-Memory Column Store • A column format database stores each of the attributes about a transaction or record in a separate column structure • A column format is ideal for analytics, as it allows for faster data retrieval when only a few columns are selected but the query accesses a large portion of the data set. • A column format is not so efficient at processing row wise DML: In order to insert or delete a single record in a column format all of the columnar structures in the table must be changed. • Up until now you have been forced to pick just one format and suffer the tradeoff of either suboptimal OLTP or sub-optimal analytics performance.
  • 17. Oracle In-Memory Column Store Oracle Database In-Memory provides best of both worlds The in-memory column format store cache should be sized to fit the objects that must be stored in memory. Less than 20% overhead in terms of total memory requirements. Database In-Memory uses an In-Memory column store (IM column store), which is a new component of the Oracle Database System Global Area (SGA), called the In- Memory Area (INMEMORY_SIZE).
  • 18. Oracle In-Memory Column Store • Tablespace Level – ALTER TABLESPACE ts_data INMEMORY; • Table Level – ALTER TABLE sales INMEMORY NO INMEMORY(prod_id); • Partition Level – ALTER TABLE sales MODIFY PARTITION SALES_Q1_1998 NO INMEMORY; • Objects are populated into the IM column store either in a prioritized list immediately after the database is opened or after they are scanned (queried) for the first time. – ALTER TABLE customers INMEMORY PRIORITY CRITICAL;
  • 19. Oracle In-Memory Column Store • In-Memory Compression • Typically compression is considered only as a space-saving mechanism. However, data populated into the IM column store is compressed using a new set of compression algorithms that not only help save space but also improve query performance
  • 20. Oracle In-Memory Column Store • In-Memory Scans – Analytic queries typically reference only a small subset of the columns in a table. – Oracle Database InMemory scans only the columns needed by a SQL, and applies any WHERE clause filter predicates to these columns directly without decompressing them. • In-Memory Storage Index – A further reduction in the amount of data accessed – Automatically created and maintained on each of the columns in the IM column store. – Storage Indexes allow data pruning based on the filter predicates in a SQL statement.
  • 21. • SIMD Vector Processing – Database In-Memory uses SIMD (Single Instruction processing Multiple Data values) vector processing – SIMD vector processing allows a set of column values to be evaluated together in a single CPU instruction. • In-Memory Joins – SQL statements that join multiple tables can also be processed very efficiently in the IM column store as they can take advantage of Bloom Filters. • A Bloom filter transforms a join into a filter that can be applied as part of the scan of the larger table. • In-Memory Aggregation – Analytic style queries often require complex aggregations and summaries. – A new optimizer transformation, called Vector Group By, has been introduced with Oracle Database 12.1.0.2 to ensure more complex analytic queries can be processed using new CPU-efficient algorithms. Oracle In-Memory Column Store
  • 22. JSON support in Oracle Database
  • 23. JSON support in Oracle Database • JSON (Java Script Object Notation) is a fast- growing data type often used in web and mobile applications. • JSON is also used as a data interchange format – More lightweight – Bandwidth-non-intensive • JSON integrates into web pages as javascript can directly inherit a JSON
  • 24. JSON support in Oracle Database • JSON is gaining popularity – APIs (application programming interfaces) • Most Social network providers provide JSON based data services API. • Webservices : RESTful (Representative state transfer) – Big Data • Many NoSQL databases use JSON as the storage format – MongoDB, CouchDB, and Riak – Internet of Things (IoT) • With more personal devices and appliances getting smart and hooking up to the internet, JSON is becoming the choice of use as it is lightweight and better adaptable to these devices.
  • 25. JSON support in Oracle Database • JSON in Oracle Database 12c R1 (12.1.0.2) – Creating Tables to Hold JSON – Querying JSON Data • Dot Notation • IS JSON • JSON_EXISTS • JSON_VALUE • JSON_QUERY • JSON_TABLE • JSON_TEXTCONTAINS – Identifying Columns Containing JSON – Loading JSON Files Using External Tables
  • 26. JSON support in Oracle Database • Creating Tables to Hold JSON – No new data type has been added to support JSON. Instead, it is stored in regular VARCHAR2 or CLOB columns. – The IS JSON constraint indicates the column contains valid JSON data. CREATE TABLE json_documents ( id RAW(16) NOT NULL, data CLOB, CONSTRAINT json_documents_pk PRIMARY KEY (id), CONSTRAINT json_documents_json_chk CHECK (data IS JSON) ); Lax or Strict checking “(data is JSON(Strict))” – The [USER|ALL|DBA]_JSON_COLUMNS views can be used to identify tables and columns containing JSON data.
  • 27. INSERT INTO json_documents (id, data) VALUES (SYS_GUID(), '{ "FirstName" : "John", "LastName" : "Doe", "Job" : "Clerk", "Address" : { "Street" : "99 My Street", "City" : "My City", "Country" : "UK", "Postcode" : "A12 34B" }, "ContactDetails" : { "Email" : "john.doe@example.com", "Phone" : "44 123 123456", "Twitter" : "@johndoe" }, "DateOfBirth" : "01-JAN-1980", "Active" : true }');
  • 28. COLUMN FirstName FORMAT A15 COLUMN LastName FORMAT A15 COLUMN Postcode FORMAT A10 COLUMN Email FORMAT A25 SELECT a.data.FirstName, a.data.LastName, a.data.Address.Postcode AS Postcode, a.data.ContactDetails.Email AS Email FROM json_documents a ORDER BY a.data.FirstName, a.data.LastName; FIRSTNAME LASTNAME POSTCODE EMAIL --------------- --------------- ---------- ------------------------- Jayne Doe A12 34B jayne.doe@example.com John Doe A12 34B john.doe@example.com
  • 29. • IS JSON – The IS JSON condition can be used to test if a column contains JSON data. • SELECT JSON_VALUE(a.data, '$.FirstName') AS first_name FROM json_documents_no_constraint a WHERE a.data IS JSON; • JSON_EXISTS – Similar to IS NULL, checks if an element has a value • JSON_VALUE – Returns an element from the JSON document, based on the specified JSON path. • JSON_QUERY – The JSON_QUERY function returns a JSON fragment representing one or more values. • JSON_TABLE – The JSON_TABLE function incorporates all the functionality of JSON_VALUE, JSON_EXISTS and JSON_QUERY. – JSON_TABLE is used for making JSON data look like relational data, which is especially useful when creating relational views over JSON data, • JSON_TEXTCONTAINS – Works with JSON indexes and enables faster text searching through the JSON data.
  • 30. JSON support in Oracle Database Loading JSON Files Using External Tables • Create the directory objects for use with the external table. CREATE OR REPLACE DIRECTORY order_entry_dir AS '/u01/app/oracle/product/12.1.0.2/db_1/demo/schema/order_entry'; GRANT READ, WRITE ON DIRECTORY order_entry_dir TO test; CREATE OR REPLACE DIRECTORY loader_output_dir AS '/tmp'; GRANT READ, WRITE ON DIRECTORY loader_output_dir TO test; • Create the external table and query it to check if it is working. CREATE TABLE json_dump_file_contents (json_document CLOB) ORGANIZATION EXTERNAL (TYPE ORACLE_LOADER DEFAULT DIRECTORY order_entry_dir ACCESS PARAMETERS (RECORDS DELIMITED BY 0x'0A' DISABLE_DIRECTORY_LINK_CHECK BADFILE loader_output_dir: 'JSONDumpFile.bad' LOGFILE order_entry_dir: 'JSONDumpFile.log' FIELDS (json_document CHAR(5000))) LOCATION (order_entry_dir:'PurchaseOrders.dmp')) PARALLEL REJECT LIMIT UNLIMITED;
  • 31. JSON support in Oracle Database SELECT COUNT(*) FROM json_dump_file_contents; COUNT(*) ---------- 10000 • You can now load the database table with the contents of the external table. TRUNCATE TABLE json_documents; INSERT /*+ APPEND */ INTO json_documents SELECT SYS_GUID(), json_document FROM json_dump_file_contents WHERE json_document IS JSON; COMMIT;
  • 33. Oracle Database And Hadoop • Big Data Discussion is incomplete without the mention of Hadoop • Hadoop is a distributed computing framework • Runs Batch operations(MapReduce) on distributed clusters made of commodity computers. • Stores data in a distributed clustered filesystem • Hadoop clusters are a shared nothing paradigm
  • 34. Oracle Database And Hadoop • MapReduce Paradigm
  • 35. Oracle Database And Hadoop • In-Database MapReduce • Avoid Shipping of data residing in RDBMS to an external infrastructure • Database security can be applied to the processed data. • Shorter learning curve for both Developers and DBAs • Mix SQL with MapReduce processing for flexibility and efficiency • Uses PL/SQL or Java Pipe-Lined Functions INSERT INTO OUTTABLE SELECT * FROM TABLE (Word_Count_Reduce (:ConfKey, CURSOR(SELECT * FROM TABLE (Word_Cursor_Map(:ConfKey, CURSOR(SELECT * FROM InTable)))))) ;
  • 36. Oracle Database And Hadoop • Pipelined Functions : Can either return a stream of rows or take it as input too. • Can be Parallelized with a partition key • Implemented using PL/SQL, Java or C • Contains 2 Pipelined Functions, one for mapper the other for reducer. • Further the mapper input source could be an external table, and the reducer output may be placed in a DB table or further sent out to filesystem file. • Can leverage external tables, DBFS, use Java or C to write to files. • The opportunities are endless when coupled with other DB features and options. • DB Scheduler can be used to schedule the mapreduce • Clustered with distributed databases using DBLinks • Add fault tolerance and scalability with RAC.
  • 37. Oracle Database And Hadoop • Oracle In-Database Hadoop • We will look at this in a future discussion …
  • 39. The Road Ahead • Big Data/NoSQL databases WILL NOT replace RDBMS databases. • Oracle’s Roadmap has been Single Vendor Solutions. • Reusing available resources : Both technology and human resource. • Oracle is building more Appliance based solutions.
  • 40. The Road Ahead • Oracle Big Data Products. – Oracle Big Data Management • Oracle Big Data Appliance • Oracle Big Data SQL • Oracle NoSQL Database – Oracle Big Data Integration • Oracle GoldenGate • Oracle Data Integration • Oracle Event Processing – Big Data Analytics • Oracle Big Data Discovery • Oracle Advanced Analytics • Oracle Business Intelligence Foundation
  • 41. Please mail me at abishek.vidyashanker@in.unisys.com