Next-generation analytics platform "Data Lake", powered by
Pivotal's in-memory + SQL on Hadoop technology
November 13, 2014
Pivotal Japan K.K.
市村 友寛
宵 勇樹
© Copyright 2014 Pivotal. All rights reserved. 1
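Pivotal company overview
Develops and provides software that realizes the 3rd Platform for enterprises
Core businesses: cloud (PaaS) and big data platform technology, plus next-generation application development services
• CEO: Paul Maritz
• Founded: April 2013 (Japanese subsidiary: July)
• Employees: approx. 2,000
• Enterprise customers: more than 1,200 companies
• Investors: EMC federation, in partnership with GE
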
Pivotal’s Opportunity 
• Big data: Pivotal HD, Pivotal Greenplum DB
• Fast data: Pivotal GemFire
• Enterprise PaaS: Pivotal CF
• Agile development support services: Pivotal Labs
• Data scientists and data scientist training: Pivotal Data Science Labs

Key technologies required in the big data era
(Chart axes: performance and data volume. Each category shows the Pivotal product, followed by comparable products.)
• SQL on Hadoop: HAWQ (Impala, Drill, Presto, ...)
• Hadoop: Pivotal HD (CDH, MapR, Horton)
• Distributed RDB: Greenplum DB (PureData, Teradata, ExaData)
• In-memory: GemFire/XD (TimesTen, SAP HANA)
• RDB: Oracle, DB2, MSSQL Server, MySQL, PostgreSQL

Pivotal Data Lake architecture
• Data is accumulated in Hadoop (HDFS), the foundation of the data processing platform
• Processing engines are chosen according to the data and the requirements
Diagram labels: Pivotal Data Lake; analytic data mart; SQL processing; operational intelligence; in-memory database; run-time applications; HDFS; data staging; data management; stream ingestion; streaming processing; New Data-fabrics; Software-Defined Datacenter; GemFire XD, ...etc.; in-memory grid (GemFire XD)

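Pivotal Data Lake reference architecture
3. High development productivity, with SQL support
Diagram labels:
• Access layer: SQL
• Engines and stores: in-memory engine; data warehouse (central DWH, data marts); data lake (Hadoop, raw data); ETL processing
• Consumers: business applications; BI; business analytics
• Users: executives; managers; business departments; analysts and IT departments
• Fast data
• Structured data: core banking systems; information systems; peripheral systems
• Unstructured data: access logs; mail and Web; M2M; images and video; audio; SNS
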
GE's innovation: Industrial Internet
A data analytics platform spanning all business units (Industrial Data Lake), built with Pivotal technology
Source: Nikkei Sangyo Shimbun, August 15, 2014
• 25 airlines
• 3.4 million flights
• 340 TB of data
• 2,000x faster analysis
• Cost cut to one tenth
• 7 days to application rollout

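Optimizing wind power generation systems (US)
Continuous real-time analysis
• Fast Data: data ingestion from thousands of data points per second
• Big Data
• Uses: tuning, optimization, emergency stop, monitoring and control, more efficient maintenance work, billing
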
Software that supports the next-generation big data platform
• Real-time analytics: Pivotal GemFire, Pivotal GemFire XD (in-memory KVS)
• High-performance analytics: Pivotal Greenplum DB, HAWQ (massively parallel RDB)
• Regular-performance / batch analytics: Pivotal HD (Hadoop)

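Pivotal HD
Core technology of the Data Lake platform
• Adds enterprise features to Apache-based Hadoop, giving the benefits of both open source and commercial software
• Protects existing assets (programs and skills) through a fast, ANSI SQL-compliant SQL engine
Pivotal's Hadoop distribution
• Based on Apache Hadoop 2.2
• Additional features as a commercial product
  – Command Center, HVE
  – HAWQ, GemFire XD
• Integration with open source
  – Spark, Parquet, GraphLab, etc.
Built-in fast SQL engine
• DB engine that supports standard SQL
• Tens to hundreds of times faster than Hive
Integration with VMware and EMC technology
• HVE, an optimization feature for running on VMware
• Integration with "Isilon", EMC's scale-out NAS
Diagram labels:
• Apache components: HDFS; HBase; Pig, Hive, Mahout; MapReduce; Sqoop; Flume; resource management and workflow (YARN, ZooKeeper, Oozie); Spark (*1); Ambari (*2)
• Pivotal HD additions: HAWQ advanced database services (ANSI SQL + analytics, MADlib, catalog service, dynamic pipelining, Xtension framework, optimizer (Orca)); Hadoop Virtualization (HVE); Spring; Parquet (*3); GraphLab, OpenMPI (*3); Pivotal Command Center (configuration, deployment, monitoring, management)
*1. Pivotal HD is registered as a "Certified Spark Distribution", which certifies that it runs with Apache Spark. (2014/5)
*2. Pivotal has announced that it will join the Apache Ambari development project. (2014/7)
*3. GraphLab, OpenMPI, and Parquet are provided as beta features in Pivotal HD 2.0.
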
Pivotal GemFire XD
Low latency and scale-out
• Consumers: enterprise applications that use the data
• Interfaces: reliable event notification, SQL (JDBC/ODBC), parallel processing
• GemFire XD data grid: high throughput, low latency, excellent scalability, continuous availability
• Data persistence and distribution over WAN
• Backing stores: HDFS and other data stores (file system, database, external data store)

Overview of the in-memory data grid

Basic concepts of the in-memory data grid
Diagram labels:
• Handle thousands of concurrent connections
• Low latency for thousands of clients
• Replicated Region: synchronous replication for slow changing data
• Partitioned Region: partition for large data or highly transactional data, with a redundant copy
Features:
• Distributed, memory-oriented data store
• Multiple storage models
• Key-value/object access and SQL access
• Queries, indexes, and transactions
• In-memory replication and partitioning
• Redundant copies of data across the cluster
• Data can be written out to disk or to an RDBMS
• Parallelization of application logic
• Multiple failure detection mechanisms
• Dynamic addition of members (elastic)
• Vendor differentiators
  – SQL support, WAN support, event processing, etc.
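The partitioning and redundant-copy idea in the list above can be sketched in a few lines. This is a minimal illustration only, not GemFire's actual bucket placement algorithm: the function name `place`, the member names, and the "next member holds the copy" rule are all assumptions made for the example.

```python
import zlib


def place(key: str, members: list[str], redundant_copies: int = 1) -> list[str]:
    """Return the members that hold a key: the primary first, then the copies."""
    if redundant_copies >= len(members):
        raise ValueError("need more members than redundant copies")
    # zlib.crc32 is stable across runs, unlike Python's built-in hash() for str.
    primary = zlib.crc32(key.encode("utf-8")) % len(members)
    # Hypothetical rule: copies go to the members that follow the primary.
    return [members[(primary + i) % len(members)]
            for i in range(redundant_copies + 1)]


members = ["server1", "server2", "server3"]
owners = place("AA1234", members)
print(owners)  # primary and one redundant copy, always on different members
```

Because the primary and its copy never share a member, losing any single member still leaves one copy of every key available, which is the property the "Redundant copy" label refers to.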
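Challenges of traditional RDBMSs
Write path:
• First, write to the LOG
• Next, write to the data files
• Buffers are optimized mainly for I/O
Resulting issues:
• Heavy I/O
• Gap from design-time assumptions
  – Focus on ACID
  – Disk synchronization becomes the bottleneck
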
GemFire XD/SQLFire benchmark results
• 1,200+ concurrent accesses
• Throughput of 700,000+

Integration with an RDBMS: DBSynchronizer
• Effective when the table definitions are the same as in the RDBMS
Diagram: the same tables exist on both sides (Flights, FlightsR, FlightAvailability, FlightAvailabilityR, Airlines)
java.sql.Connection conn = getConnection();
// Register DBSynchronizer as an asynchronous event listener.
// Parameters 4 through 10 are not shown on the slide and are left unset here.
CallableStatement cs = conn.prepareCall(
    "CALL SYS.ADD_ASYNC_EVENT_LISTENER(?,?,?,?,?,?,?,?,?,?,?)");
cs.setString(1, "SG1");
cs.setString(2, "MyID");
cs.setString(3, "com.vmware.sqlfire.callbacks.DBSynchronizer");
cs.setString(11, "jdbc:oracle:thin:@localhost:1521:XE");
cs.execute();

Data replication between data centers
Diagram: three sites, each with four GemFire members including a Gateway and a Standby Gateway, connected by Primary Gateway Paths and Standby Gateway Paths
• New York Site: GemFire 1 to 4
• Tokyo Site: GemFire 5 to 8
• London Site: GemFire 9 to 12
Benefits:
• Global data distribution
• Active-active disaster recovery (DR)
• Provided as a standard feature

Advantages and challenges of the 'shared nothing persistence' architecture
Advantages:
• Append-only operation logs
• Fully parallel processing
• No disk seeks
Challenges:
• Scanning the logs requires restarting the cluster set
• Tuning is required for large data volumes
Diagram (shown for each of two members): Memory Tables, Compressor, OS Buffers, append-only operation logs (LOG: Record1, Record2, Record3)
Even GemFire, as an in-memory data grid, has challenges

Hadoop HDFS
Hadoop core (HDFS) for scalable, parallel storage
• Maturing, with a widening range of applications
• Can handle large data sets even on commodity servers
• Flexible handling of failures
• Simple consistency model

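Advantages delivered by the Hadoop ecosystem
• Large data volumes (TB to PB)
• High availability and compression
• Maturity as a platform for parallel computation and data analysis, and a growing ecosystem
• HDFS support is becoming common in storage systems as well
• Support for virtualized environments is gradually spreading
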
GemFire XD
A new direction growing out of the in-memory data grid

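GemFire XD: real-time data analytics engine
• SQLFire (SQL): SQL engine with an optimizer, in-memory index creation, distributed transactions, RDB integration, ...
• GemFire (objects, JSON): clustering, in-memory data store, HA, replication, WAN support, event processing, distributed queues, ...
• Pivotal HD: integrated installation, configuration, and management; monitoring; optimization for Hadoop
• The working set is held in memory and can also be distributed geographically
• Historical data and time-series data are stored in HDFS
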
GemFire XD architecture overview
Diagram labels:
• Clients: JDBC, ODBC
• GemFire XD cluster
• Hadoop cluster: five HDFS data nodes, each with its own HDD
• Access from Hadoop: Map/Reduce, Hive, HBase; GemFire XD PXF Plugin
• Databases
• Data center

SQL + IMDG (Objects) + HDFS
A low-latency, distributed data store for big data, built on main memory
• Only the data actually being operated on is loaded into memory
• Persisted or archived data goes to HDFS
• Replication / partitioning
• Storage models:
  – In-memory
  – In-memory and local disk
  – In-memory and HDFS
• SQL engine: also supports OLTP and transactions
• IMDG caching features: readThru, writeBehind, and so on
• Tight integration with HDFS: streaming and R/W access
• Data analysis uses HDFS rather than memory, with sequential access or incremental processing
• Parallelized data ingestion also makes detailed, near-real-time analysis possible
• Results are reflected back into memory in the reduce phase of MapReduce
• This closes the loop between real-time processing and analysis

Data management in GemFire XD
CREATE TABLE FLIGHTS (
    FLIGHT_ID CHAR(6) NOT NULL,
    SEGMENT_NUMBER INTEGER NOT NULL,
    …..
)
PARTITION BY COLUMN (FLIGHT_ID)
PERSISTENT
HDFSSTORE RWStore;

CREATE HDFSSTORE RWStore
    NAMENODE hdfs://PHD1:8020
    DIR /indexed-tables
    BATCHSIZE 10
    BATCHTIMEINTERVAL 2000
    QUEUEPERSISTENT true;
Diagram labels: Replicated Table (a copy on every member); Partitioned Table; Colocated Partition; Redundant Partition

Scope of application from a latency perspective
Use                            Latency
Machine response               Milliseconds
Interactive human operation    Seconds
Reports                        Seconds, Minutes
Batch processing               Minutes, Hours
• Low-latency end: Online/OLTP/Operational DBs (GemFire XD)
• High-latency end: Analytics, Data Warehousing (Pivotal HD, HAWQ)

Options for real-time processing on Hadoop
Most of them focus on interactive query processing suited to data analysis. Many more...

GemFire XD + Pivotal HD
Realizing the Data Lake architecture with real time + big data
Three stages: real-time data ingestion, real-time analytics, advanced data analytics
Diagram labels:
• Inputs: Online Apps; Sensor Data / Feeds (real-time or semi-real-time data ingestion)
• GemFire XD: data model, updates
• HDFS: Shared Data - HFiles; I/P and O/P Formatter; Map-Reduce
• HAWQ: PXF; import of analytic data; processing of analytic data; Analytic Apps
• PCC

Offered as a service on PaaS
Pivotal GemFire XD for Pivotal CF
• Provides GemFire XD as a service to applications deployed on Pivotal CF
  – Pivotal CF: the commercial distribution of Cloud Foundry, the open source PaaS
• Can also be added as a custom service
  – By implementing a Service Broker
  – As a user-defined service

SQL on Hadoop engine for the enterprise

Key points
1. What is HAWQ?
2. HAWQ is fast
3. HAWQ is SQL compatible

Batch processing
http://www.anishsneh.com/2014/07/hadoop-mapreduce-api.html

Ad hoc queries
• SELECT id, name, address FROM foo LIMIT 100;
• SELECT a, b, count(c) FROM bar GROUP BY a, b;
…

Hive, a query language for Hadoop
Hive: runs MapReduce jobs from HiveQL, a language similar to SQL
Makes it possible to process large volumes of data interactively

Problems with Hive
• Even small queries take a long time to run
  – Because it uses MapReduce, which starts and stops a Java VM on every run
• Often does not work with BI tools and the like
  – Because it is not fully SQL compatible

What is needed is an engine that is faster than Hive, SQL compatible, and runs on Hadoop: SQL on Hadoop

SQL on Hadoop: "HAWQ"
HAdoop With Query
• Query engine for the enterprise
• HAWQ, a fast database engine, runs on top of HDFS
  – Fast query processing for big data
  – Compliant with standard SQL
  – Supports the MADlib statistical analysis functions
Diagram labels: Pivotal HD; MapReduce; Pig; HDFS

HAWQ is fast
Chart 1 (unit: seconds)
Workload            Fast     Slow     Ratio
User intelligence   4.2      198      47X
Sales analysis      8.7      161      19X
Click analysis      2.0      415      208X
Data exploration    2.7      1,285    476X
BI drill down       2.8      1,815    648X
Chart 2 (unit: seconds)
Workload            Fast     Slow     Ratio
User intelligence   4.2      37       9X
Sales analysis      8.7      596      69X
Click analysis      2.0      50       25X
Data exploration    2.7      55       20X
BI drill down       2.8      59       21X

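The multipliers on the two charts above appear to be the ratio of the longer time to the shorter time, rounded to the nearest integer. The short check below recomputes them from the numbers on the slide; the variable names are illustrative, and which system each series belongs to is not stated in the extracted text.

```python
workloads = ["User intelligence", "Sales analysis", "Click analysis",
             "Data exploration", "BI drill down"]
fast_s = [4.2, 8.7, 2.0, 2.7, 2.8]          # shorter times, in seconds
chart1_slow_s = [198, 161, 415, 1285, 1815]  # longer times on the first chart
chart2_slow_s = [37, 596, 50, 55, 59]        # longer times on the second chart


def speedups(slow: list[float], fast: list[float]) -> list[int]:
    """Ratio of slow to fast time for each workload, rounded to an integer."""
    return [round(s / f) for s, f in zip(slow, fast)]


for name, r1, r2 in zip(workloads,
                        speedups(chart1_slow_s, fast_s),
                        speedups(chart2_slow_s, fast_s)):
    print(f"{name}: {r1}X, {r2}X")
```

The recomputed ratios match the multipliers printed on the slide for all ten bars.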
HAWQ vs. Impala performance comparison (using the industry-standard TPC-DS benchmark)
• Supported queries (out of 111): HAWQ 100%, Impala 28%; the remainder are unsupported queries
• Queries supported and completed (out of 111): HAWQ 100%, Impala 18%; the remainder are unsupported queries or queries that terminated abnormally during processing
• Performance ratio (Impala processing time with HAWQ processing time taken as 1): on the queries that completed, HAWQ is on average 6 times faster than Impala
*. TPC-DS is an official performance benchmark for analytical (decision support) systems provided by the TPC (Transaction Processing Performance Council), the industry body that defines industry-standard performance benchmarks. TPC-DS, available since 2012, reproduces a typical retail data model and runs a variety of queries against 25 tables and 429 columns.
*. The test used PHD 1.1 / HAWQ 1.1 and CDH 4.4 / Impala 1.1.1.
Reference URL: Orca: A Modular Query Optimizer Architecture for Big Data  http://www.gopivotal.com/sites/default/files/SIGMODMay2014HAWQAdvantages.pdf
Reference URL: [2014/6/25] Pivotal HAWQ Benchmark Demonstrates Up To 21x Faster Performance on Hadoop Queries Than SQL-like Solutions  http://blog.gopivotal.com/pivotal/products/pivotal-hawq-benchmark-demonstrates-up-to-21x-faster-performance-on-hadoop-queries-than-sql-like-solutions

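Comparison of data processing flows
Stages: query submission, plan creation, query execution, return of results
MapReduce-based flow:
• Rule-based optimizer
  – Builds the execution plan from the query alone, regardless of the contents of the table data
  – Generates MapReduce scripts
• MapReduce processing
  – Java processes are started and stopped
• Intermediate data is processed on disk
  – Disk I/O occurs at every step
Flow with Orca:
• Cost-based optimizer Orca
  – Builds the optimal execution plan based on the contents of the table data (data volume, cardinality, etc.)
• Processing by C processes
  – Immediate processing by resident processes
• Intermediate data is processed in memory
  – Fast in-memory processing through pipelining
  – Eliminates disk I/O
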
Fast data processing enabled by the query optimizer
Data is pipelined without going through MapReduce
• The cost-based optimizer selects the optimal execution plan
  – Computes a cost for each DB operation (scan, join, sort, aggregation, etc.)
  – The optimizer also directs communication between segments ("motion")
• Dynamic pipelining
  – Intermediate data is processed in memory
PHYSICAL EXECUTION PLAN FROM SQL (diagram nodes, top to bottom):
• Gather Motion 4:1 (Slice 3)
• Sort
• HashAggregate
• HashJoin
• Redistribute Motion 4:4 (Slice 1)
• HashJoin
• Hash
• HashJoin
• Seq Scan on customer
• Hash
• Hash
• Broadcast Motion 4:4 (Slice 2)
• Seq Scan on motion
• Seq Scan on lineitem
• Seq Scan on orders

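Major performance gains from Orca, the latest query optimizer
On average 5 times faster than the previous query optimizer
• Join ordering
  – Selects a table join order that avoids large data transfers over the interconnect
• Correlated subquery processing
  – Avoids iterative subquery processing
• Dynamic partition elimination scan
  – Dynamically selects and eliminates the partitions to scan according to the intermediate data produced during query processing
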
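Join ordering example
Goal: aggregate per user ID
• Table A: 1 million rows, distributed by user id
• Table B: 1 million rows, distributed by line item id
• Option 1: distribute one table's data to each node, then join
• Option 2: distribute the other table's data to each node, then join
Which is better?
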
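Build histograms of the distribution key counts
• Number of user ids per node for A (distributed by user id), analyzed from table information
• Number of user ids per node for B (distributed by line item id), analyzed from table information
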
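Estimate execution time from the histograms
• Inputs: number of user ids per node for A, and for B
• Execution time when A is kept in place and B's data is distributed to each node
• Execution time when B is kept in place and A's data is distributed to each node
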
The plan that takes less time is adopted
Diagram: the two candidate plans (distribute, then join A and B) are compared

HAWQ performs join ordering automatically
Diagram: the two candidate plans (distribute, then join A and B) are compared and the cheaper one is adopted

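The comparison in the join ordering example can be sketched with a very rough cost model. This is not Orca's cost model: it assumes uniformly distributed keys instead of histograms, counts only rows sent over the interconnect, and assumes four nodes. The `Table` class, `rows_moved`, and `choose_plan` are hypothetical names for the illustration.

```python
from dataclasses import dataclass


@dataclass
class Table:
    name: str
    rows: int
    distributed_by_join_key: bool  # True if already distributed by user id


def rows_moved(fixed: Table, moved: Table, nodes: int) -> int:
    """Rows sent over the interconnect when `fixed` stays put and `moved` is shipped."""
    if fixed.distributed_by_join_key:
        # Redistribute: each row of `moved` is hashed on the join key and sent
        # to one node; about 1/nodes of the rows are already in the right place.
        return moved.rows * (nodes - 1) // nodes
    # Broadcast: `fixed` is not aligned with the join key, so every other node
    # needs a full copy of `moved`.
    return moved.rows * (nodes - 1)


def choose_plan(a: Table, b: Table, nodes: int) -> tuple[str, int]:
    """Pick the cheaper of the two candidate plans under this toy model."""
    cost_fix_a = rows_moved(a, b, nodes)
    cost_fix_b = rows_moved(b, a, nodes)
    if cost_fix_a <= cost_fix_b:
        return ("keep A, move B", cost_fix_a)
    return ("keep B, move A", cost_fix_b)


a = Table("A", 1_000_000, distributed_by_join_key=True)   # distributed by user id
b = Table("B", 1_000_000, distributed_by_join_key=False)  # distributed by line item id
plan, cost = choose_plan(a, b, nodes=4)
print(plan, cost)
```

Under these assumptions, keeping A in place moves a quarter as many rows as keeping B in place, so that plan is chosen. A real optimizer would refine the estimate with per-node histograms, as the slides describe.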
HAWQ is SQL compliant
Supports SQL '92, '99, and 2003 OLAP
• Traditional stack: hardware/OS, RDBMS, BI tools
• Hadoop stack: hardware/OS, HDFS, BI tools

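HAWQ is backed by Greenplum DB's 10-year track record
HAWQ adopts the main technologies of Greenplum DB
• Standard SQL support
• Cost-based optimizer
• Dynamic pipelining
• Support for both row store and column store
• Compression (QuickLZ, ZLIB, RLE)
• Distributed storage
• Multi-level partitioning
• Parallel load and unload
• Fast data redistribution
• Statistical analysis functions (MADlib)
• SELECT
• INSERT
• JOIN
• Views
• External tables
• Resource management
• Security
• Authentication
• Management and monitoring
• ODBC/JDBC support
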
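MADlib, a distributed numerical computation and machine learning library, is bundled
Predictive modeling library
Latest release: MADlib v1.6, URL: madlib.net
Machine learning algorithms
• Principal component analysis (PCA)
• Association rule analysis (affinity analysis, market basket analysis)
• Topic modeling (parallel LDA)
• Decision trees
• Ensemble learning (random forest)
• Support vector machines
• Conditional random fields (CRF)
• Clustering (k-means)
• Cross validation
Linear systems
• Sparse matrix solver
• Dense matrix solver
Generalized linear models
• Linear regression
• Logistic regression
• Multinomial logistic regression
• Cox proportional hazards
• Regression analysis
• Elastic net regularization
• Sandwich estimators
Matrix factorization
• Singular value decomposition (SVD)
• Low-rank approximation
Descriptive statistics
Sketch-based estimators
• CountMin sketch
• Flajolet-Martin sketch
• Most frequent values sketch
Correlation
Summary statistics
Support modules
Array operations
Sparse vectors
Random sampling
Probability functions
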
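Example of a MADlib-supported function
Before applying K-means (scatter plot, axes x and y)
• Goal: split a cloud of points roughly into 3 groups
After applying K-means (scatter plot, axes x and y, showing Cluster A, Cluster B, Cluster C)
• Centroids (●) are created from the cloud of points so that the 3 clusters are spread evenly
• Each point belongs to the cluster whose centroid (●) is nearest
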
Running K-means in MADlib
Example: finding the centroids
SELECT * FROM madlib.kmeanspp(
    'km_sample',                  -- input table
    'points',
    3,                            -- number of clusters
    'madlib.squared_dist_norm2',  -- madlib function
    'madlib.avg',                 -- madlib function
    20,
    0.001
);
Output:
centroids | {{13.24,2.59, … ,735},{13.856,…,1078},{14.255,…,1378.75}}
…
The output gives the positions of the 3 centroids
Details: http://doc.madlib.net/latest/group__grp__kmeans.html

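The assign-then-recompute loop behind the query above can be sketched in plain Python. This is a simplified illustration, not MADlib's implementation: it uses caller-supplied starting centroids rather than k-means++ seeding, and the sample points are made up. The defaults of 20 and 0.001 mirror the last two arguments of the query, read here as the maximum number of iterations and the minimum fraction of reassigned points.

```python
def squared_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def mean(points):
    """Component-wise average of a non-empty list of points."""
    n = len(points)
    return tuple(sum(c) / n for c in zip(*points))


def kmeans(points, initial_centroids, max_iter=20, min_frac_reassigned=0.001):
    centroids = list(initial_centroids)
    assignment = [None] * len(points)
    for _ in range(max_iter):
        # Assignment step: each point joins the cluster with the nearest centroid.
        new_assignment = [
            min(range(len(centroids)), key=lambda j: squared_dist(p, centroids[j]))
            for p in points
        ]
        changed = sum(1 for old, new in zip(assignment, new_assignment) if old != new)
        assignment = new_assignment
        # Update step: move each centroid to the average of its members.
        for j in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == j]
            if members:
                centroids[j] = mean(members)
        # Stop once almost no point changes cluster.
        if changed / len(points) < min_frac_reassigned:
            break
    return centroids, assignment


points = [(0, 0), (0, 1), (1, 0),
          (10, 10), (10, 11), (11, 10),
          (20, 0), (20, 1), (21, 0)]
centroids, assignment = kmeans(points, [(0, 0), (10, 10), (20, 0)])
print(assignment)
```

With these three well-separated groups, the loop settles after two passes and every point ends up with the centroid of its own group.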
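MicroAd
Pivotal HD + HAWQ case study

With Pivotal HD + HAWQ, the customer expanded the data available for analysis at low cost without changing its existing SPSS assets at all. This contributed to better data analysis accuracy, which is the source of the customer's competitiveness.
Existing environment
• Built an analytics platform on IBM PureData/SPSS
• As analysis requests grew more sophisticated, stored data grew and PureData ran out of capacity
• Used Hadoop (Cloudera free edition) alongside it to hold down costs
Problem
• With Hive, the standard Hadoop tool, SPSS queries "do not run", "fail partway through", or are "extremely slow", so it was not fit for practical use
Diagram labels: (DWH) IBM PureData; (Hadoop) Pivotal; (BA) IBM SPSS
Results of introducing Pivotal HD + HAWQ
• 100% of SPSS queries can be used without changes
• Recorded performance up to about 70 times faster than Hive
• Implemented on six commodity IA servers
A small investment made it possible to run analyses over a wide range of data that could not be done before
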
Conclusions on HAWQ
1. HAWQ is one of the SQL on Hadoop engines
   It is Greenplum's query engine made to run on Hadoop
2. HAWQ is tens to hundreds of times faster than Hive, and several times faster than Impala
3. HAWQ is SQL compatible, so it is easy to use from existing tools

Pivotal: enabling enterprise transformation in the big data era
• The cycle of data accumulation → analysis → application
• The "Data Lake" concept: a big data platform that captures all kinds of data

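Consulting services provided by Pivotal
• "We want to introduce a big data platform"
  – "But what should we do about things like log transfer?"
• "We want to do data analysis and machine learning"
Diagram labels: applications; open source cloud (PaaS) platform; core system integration; fast data (M2M / real time); new business models; agile development; analytics; data science; big data (DWH/Hadoop)
Pivotal Japan provides implementation support and training
Japanese data mining engineers and data scientists are on staff
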
A NEW PLATFORM FOR A NEW ERA

Pivotal HAWQ 소개
 
Track B-3 解構大數據架構 - 大數據系統的伺服器與網路資源規劃
Track B-3 解構大數據架構 - 大數據系統的伺服器與網路資源規劃Track B-3 解構大數據架構 - 大數據系統的伺服器與網路資源規劃
Track B-3 解構大數據架構 - 大數據系統的伺服器與網路資源規劃
 
Big data processing engines, Atlanta Meetup 4/30
Big data processing engines, Atlanta Meetup 4/30Big data processing engines, Atlanta Meetup 4/30
Big data processing engines, Atlanta Meetup 4/30
 

[db tech showcase Tokyo 2014] D36: Pivotal's In-Memory + SQL on Hadoop Technology Supporting the Next-Generation Analytics Platform "Data Lake", by 市村友寛 & 宵勇樹 (Pivotalジャパン株式会社)

  • 8. 㢼ຊⓎ㟁䝅䝇䝔䝮䛾᭱㐺໬(⡿ᅜ䠅 ⥅⥆ⓗ䛺䝸䜰䝹䝍䜲䝮ศᯒ Fast Data Big Data ⛊㛫䛒䛯䜚ᩘ༓䝕䞊䝍䝫䜲䞁䝖䛛䜙䛾 䝕䞊䝍ᢞධ 䝏䝳䞊䝙䞁䜾 ᭱㐺໬䚸 ⥭ᛴ೵Ṇ ┘ど䛸ไᚚ ಖᏲసᴗ䛾ຠ⋡໬ ㄢ㔠 © Copyright 2014 Pivotal. All rights reserved. 8
  • 9. ḟୡ௦䝡䝑䜽䝕䞊䝍ᇶ┙䜢ᨭ䛘䜛䝋䝣䝖䜴䜵䜰⩌ 䝸䜰䝹䝍䜲䝮ศᯒ Pivotal GemFire Pivotal GemFire XD 䝝䜲䝟䝣䜷䞊䝬䞁䝇 ศᯒ 䝺䜼䝳䝷䞊䝟䝣䜷䞊 䝬䞁䝇䠃䝞䝑䝏 ศᯒ Pivotal Greenplum DB HAWQ Pivotal HD 䜲䞁䝯䝰䝸䞊 KVS ㉸୪ิฎ⌮ RDB Hadoop © Copyright 2014 Pivotal. All rights reserved. 9
  • 10. Pivotal HD Data Lake 䝥䝷䝑䝖䝣䜷䞊䝮䛾䝁䜰䝔䜽䝜䝻䝆䞊 • Apache䝧䞊䝇䛾Hadoop䛻䜶䞁䝍䞊䝥䝷䜲䝈ᶵ⬟䜢㏣ຍ䞉䜸䞊䝥䞁䛸ၟ⏝୧᪉䛾䝯䝸䝑䝖䜢ாཷ • ANSI SQL‽ᣐ䛻䜘䜛㧗㏿SQL䜶䞁䝆䞁䛻䜘䜛᪤Ꮡ㈨⏘㻌(䝥䝻䜾䝷䝮䜔䝇䜻䝹) 䛾ಖㆤ Pivotal䛾Hadoop䝕䜱䝇䝖䝸䝡䝳䞊䝅䝵䞁 • Apache Hadoop2.2䝧䞊䝇 • ၟ⏝〇ရ䛸䛧䛶䛾㏣ຍᶵ⬟ • Command Center, HVE • HAWQ, GemFireXD • 䜸䞊䝥䞁䝋䞊䝇䛸䛾㐃ᦠ䞉⤫ྜ • Spark, Parquet, GraphLab➼ 㧗㏿䛺SQL䜶䞁䝆䞁䛾ᦚ㍕ • ᶆ‽SQLᑐᛂ䛾DB䜶䞁䝆䞁 • HIVEẚᩘ༑ಸࠥᩘⓒಸ䛾㧗ᛶ⬟ VM䞉EMC䝔䜽䝜䝻䝆䞊䛸䛾㐃ᦠ • VMwareୖ䛷䛾᭱㐺໬ᶵ⬟ HVE • EMC䛾䝇䜿䞊䝹䜰䜴䝖NAS 䚷㻌”Isilon”䛸䛾㐃ᦠ HAWQ 䜰䝗䝞䞁䝇䝗䝕䞊䝍䝧䞊䝇䝃䞊䝡䝇 ANSI SQL + 䜰䝘䝸䝔䜱䜽䝇 MADlib 䜹䝍䝻䜾 䝃䞊䝡䝇 䝎䜲䝘䝭䝑䜽䞉䝟䜲䝥䝷䜲䝙䞁䜾 HDFS HBase Pig, Hive, Mahout Map Reduce Xtension 䝣䝺䞊䝮䝽䞊䜽 Hadoop 䝞䞊䝏䝱䝷䜲䝊䞊䝅䝵䞁 (HVE) 䜸䝥䝔䜱䝬䜲 䝄 (Orca) Spring Parquet 䈜3 GraphLab, OpenMPI 䈜3 Sqoop Flume 䝸䝋䞊䝇⟶⌮ 䝽䞊䜽䝣䝻 䞊 YARN ZooKeeper Oozie Apache Pivotal HD ㏣ຍᶵ⬟ Pivotal Command Center ᵓᡂ/䝕䝥䝻䜲/ ┘ど/⟶⌮ Spark 䈜1 Ambari 䈜2 䈜1. Pivotal HD䛿 Apache Spark 䛸✌ാ䛩䜛䛣䛸䜢ㄆᐃ䛩䜛 “Certified Spark Distribution”䛻Ⓩ㘓䛥䜜䛶䛔䜎䛩䚹(2014/5) 䈜2. Pivotal䛿 Apache Ambari䛾㛤Ⓨ䝥䝻䝆䜵䜽䝖䛻ཧ⏬䛩䜛䛣䛸䜢⾲᫂䛧䛶䛔䜎䛩䚹(2014/7) 䈜3. PivotalHD2.0 䛻䛶GraphLab, OpenMPI, Parquet䛿䝧䞊䝍ᥦ౪䛥䜜䛶䛔䜎䛩䚹 © Copyright 2014 Pivotal. All rights reserved. 10
  • 11. Pivotal GemFire XD ప㐜ᘏ䚸䝇䜿䞊䝹䜰䜴䝖䜢ᐇ⌧ 䝕䞊䝍䜢฼⏝䛩䜛䜶䞁䝍䞊䝥䝷䜲䝈 䜰䝥䝸䜿䞊䝅䝵䞁 ಙ㢗ᛶ䛾㧗䛔䜲䝧䞁䝖㏻▱ᶵ⬟ SQL(JDBC/ODBC) ୪ิฎ⌮ 㧗䛔䝇䝹䞊䝥䝑䝖 ప㐜ᘏ ඃ䜜䛯䝇䜿䞊䝷䝡䝸䝔䜱 ⥅⥆ⓗ䛺ྍ⏝ᛶ GemFire XD 䝕䞊䝍䜾䝸䝑䝗 䝕䞊䝍䛾ᣢ⥆ᛶ WAN ⤒⏤䛾ศᩓ HDFS 䛭䛾௚䝕䞊䝍䝇䝖䜰 䝣䜯䜲䝹 䝅䝇䝔䝮 䝕䞊䝍䝧䞊䝇 እ㒊䝕䞊䝍䝇䝖䜰 © Copyright 2014 Pivotal. All rights reserved. 11
  • 12. GemFire XD - ప㐜ᘏ䚸䝇䜿䞊䝹䜰䜴䝖䜢ᐇ⌧ © Copyright 2014 Pivotal. All rights reserved. 12
  • 13. 䜲䞁䝯䝰䝸䝕䞊䝍䜾䝸䝑䝗䛾ᴫせ © Copyright 2014 Pivotal. All rights reserved. 13
  • 14. 䜲䞁䝯䝰䝸䝕䞊䝍䜾䝸䝑䝗䛾ᇶᮏⓗ䛺ᴫᛕ Handle thousands of concurrent connections Replicated Region Synchronous replication for slow changing data Partition for large data or highly transactional data • 䜻䞊䝞䝸䝳䞊/䜸䝤䝆䜵䜽䝖䚸SQL䜰䜽䝉䝇 • 䜽䜶䝸䞊䚸䜲䞁䝕䝑䜽䝇䚸䝖䝷䞁䝄䜽䝅䝵䞁ᑐᛂ • 䝯䝰䝸ୖ䛷䛾䝺䝥䝸䜿䞊䝅䝵䞁䚸䝟䞊䝔䜱䝅䝵䞁 • 䜽䝷䝇䝍㛫䛷䛾෕㛗䝕䞊䝍䛾䝁䝢䞊 • 䝕䜱䝇䜽䜒䛧䛟䛿RDBMS䜈䛾䝕䞊䝍ᒎ㛤 Redundant copy Partitioned Region • ศᩓᆺ䝯䝰䝸ᣦྥ䛾䝕䞊䝍䝇䝖䜰 • 」ᩘ䛾䝇䝖䝺䞊䝆䝰䝕䝹 © Copyright 2014 Pivotal. All rights reserved. 14 14 Low latency for thousands of clients • 䜰䝥䝸䜿䞊䝅䝵䞁䝻䝆䝑䜽䛾୪ิ໬ • 」ᩘ䛾㞀ᐖ᳨▱ฎ⌮ • ືⓗ䛺䝯䞁䝞䞊䛾㏣ຍ (elastic) • 䝧䞁䝎䞊䛾ᕪู໬せ⣲ • SQLᑐᛂ䚸WANᑐᛂ, 䜲䝧䞁䝖ฎ⌮, etc
  • 15. ᚑ᮶䛾㻾㻰㻮㻹㻿䛻䛚䛡䜛ㄢ㢟 䝞䝑䝣䜯䛿୺䛻䈊 㻵㻛㻻ྥ䛡䛻᭱㐺໬ ḟ䛻䝕䞊䝍䞉䝣䜯䜲䝹䛻䈊 ᭩㎸䜐 § ኱㔞䛾I/O § タィ᫬䛾᝿ᐃ䛸䛾䜼䝱䝑䝥 • ACID䛻䝣䜷䞊䜹䝇 • 䝕䜱䝇䜽ྠᮇ䛾䝪䝖䝹䝛䝑䜽 ึ䜑䛻㻸㻻㻳䜢䈊 ᭩㎸䜐 © Copyright 2014 Pivotal. All rights reserved. 15
  • 16. GemFire XD/SQLFire䛾䝧䞁䝏䝬䞊䜽⤖ᯝ 㻝㻘㻞㻜㻜㻗ྠ᫬䜰䜽䝉䝇 㻣㻜୓㻗䛾䝇䝹䞊䝥䝑䝖 © Copyright 2014 Pivotal. All rights reserved. 16
  • 17. RDBMS䛸䛾㐃ᦠ㻌䠖㻌DBSynchronizer • RDBMS䛸䝔䞊䝤䝹ᐃ⩏䛜ྠ䛨ሙྜ䛻᭷ຠ Flights Flights FlightsR FlightsR FlightAvailability FlightAvailability FlightAvailabilityR FlightAvailabilityR Airlines Airlines java.sql.Connection conn = getConnection(); CallableStatement cs = conn.prepareCall(“CALL SYS.ADD_ASYNC_EVENT_LISTENER(?,?,?,?,?,?,?,?,?,?,?)”); cs.setString(1, “SG1”); cs.setString(2, “MyID”); cs.setString(3, “com.vmware.sqlfire.callbacks.DBSynchronizer”); cs.setString(11,“jdbc:oracle:thin:@localhost:1521:XE”); cs.execute(); © Copyright 2014 Pivotal. All rights reserved. 17
  • 18. 䝕䞊䝍䝉䞁䝍䞊㛫䛾䝕䞊䝍䞉䝺䝥䝸䜿䞊䝅䝵䞁 GemFire 1 GemFire2 Standby Gateway GemFire4 Gateway GemFire 3 New York Site GemFire5 Standby Gateway GemFire6 GemFire 7 GemFire8 Gateway Tokyo Site Standby Gateway Paths GemFire9 Gateway GemFire12 Standby Gateway GemFire GemFire 10 11 London Site Primary Gateway Paths 䜾䝻䞊䝞䝹䛺䝕䞊䝍㓄ಙ 䜰䜽䝔䜱䝤-䜰䜽䝔䜱䝤䛺㻌DR ᑐ⟇ ᶆ‽ᶵ⬟䛸䛧䛶ᥦ౪ © Copyright 2014 Pivotal. All rights reserved. 18
  • 19. ‘Shared nothing persistence’䛺䜰䞊䜻䝔䜽䝏䝱䛾฼Ⅼ䛸ㄢ㢟 þ ㏣グ䛾䜏䛾䜸䝨䝺䞊䝅䝵䞁䝻䜾 þ ᏶඲䛺୪ิฎ⌮ þ 䝕䜱䝇䜽䝅䞊䜽䛺䛧 ¨ 䝻䜾䛾䝇䜻䝱䞁䛻䛿䜽䝷䝇䝍䝉䝑䝖䛾 ෌㉳ື䛜ᚲせ ¨ ኱ᐜ㔞䛾䝪䝸䝳䞊䝮䛻ᑐ䛧䛶䛿ㄪᩚ䛜 ᚲせ Memory Tables Compressor OS Buffers Append only Operation logs LOG Record1 Record2 Record3 Record1 Record2 Record3 Memory Tables Compressor OS Buffers Append only Operation logs LOG Record1 Record2 Record3 Record1 Record2 Record3 䜲䞁䝯䝰䝸䝕䞊䝍䜾䝸䝑䝗䛷䛒䜛 GemFire䛻䛚䛔䛶䜒ㄢ㢟䛜Ꮡᅾ © Copyright 2014 Pivotal. All rights reserved. 19
  • 20. Hadoop HDFS Hadoop core(HDFS) for scalable, parallel storage • ᡂ⇍䛧䛴䛴䛒䜚䚸㐺⏝⠊ᅖ䜒ᣑ኱ • ỗ⏝䝃䞊䝞䛷䜒኱つᶍ䛺䝕䞊䝍䝉䝑䝖䜢ྲྀ䜚ᢅ䛔䛜ྍ⬟ • 㞀ᐖ䜈䛾ᰂ㌾䛺ᑐᛂ • 䝅䞁䝥䝹䛺୍㈏ᛶ䝰䝕䝹 © Copyright 2014 Pivotal. All rights reserved. 20
  • 21. Hadoop 䜶䝁䝅䝇䝔䝮䛜ᐇ⌧䛩䜛฼Ⅼ Ÿ ኱つᶍ䛺䝪䝸䝳䞊䝮䝉䝑䝖 ( TB to PB) Ÿ 㧗ྍ⏝ᛶ, ᅽ⦰ᶵ⬟ Ÿ ୪ิィ⟬䛸䝕䞊䝍ศᯒᇶ┙䛸䛧䛶䛾ᡂ⇍ᗘ䛸䜶䝁䝅䝇䝔䝮䛾 ᒎ㛤 Ÿ 䝇䝖䝺䞊䝆䝅䝇䝔䝮䛻䛚䛔䛶䜒HDFSᑐᛂ䛜ᬑཬ Ÿ ௬᝿໬⎔ቃ䜈䛾ᑐᛂ䜒ᚎ䚻䛻ᾐ㏱ © Copyright 2014 Pivotal. All rights reserved. 21
  • 22. GemFire XD 䜲䞁䝯䝰䝸䝕䞊䝍䜾䝸䝑䝗䛛䜙䛾᪂䛯䛺ᒎ㛤 © Copyright 2014 Pivotal. All rights reserved. 22
  • 23. GemFire XD – 䝸䜰䝹䝍䜲䝮䝕䞊䝍ศᯒ䜶䞁䝆䞁 䜲䞁䝯䝰䝸䚸䛥䜙䛻䛿ᆅ⌮ⓗ䛻 ศᩓ䛥䜜䛯䝽䞊䜻䞁䜾䝉䝑䝖 SQLFire 㐣ཤ䝕䞊䝍䚸᫬⣔ิ䝕䞊䝍䜢 HDFS䛻᱁⣡ Pivotal HD GemFire + 䜽䝷䝇䝍䝸䞁䜾䚸䜲䞁䝯䝰䝸 䝕䞊䝍䝇䝖䜰䚸HA, 䝺䝥䝸䜿䞊 䝅䝵䞁䚸WANᑐᛂ䚸䜲䝧䞁䝖ฎ ⌮䚸ศᩓ䜻䝳䞊… SQL Objects, JSON SQL 䜶䞁䝆䞁 - 䜸䝥䝔䜱䝬䜲䝄䚸䜲䞁䝯䝰䝸 ୖ䛾䜲䞁䝕䝑䜽䝇సᡂ䚸ศᩓ 䝖䝷䞁䝄䜽䝅䝵䞁䚸RDB䜲䞁䝔 䜾䝺䞊䝅䝵䞁.. 䜲䞁䝇䝖䞊䝹䚸ᵓᡂ䚸⟶⌮䛾⤫ ྜ໬䚸┘どᶵ⬟䚸Hadoop䜈䛾 ᭱㐺໬ © Copyright 2014 Pivotal. All rights reserved. 23
  • 24. GemFire XD 䜰䞊䜻䝔䜽䝏䝱ᴫせ JDBC ODBC HDD HDD HDD HDD HDD 䝕䞊䝍䝜䞊䝗 HDFS 䝕䞊䝍䝜䞊䝗 HDFS 䝕䞊䝍䝜䞊䝗 HDFS 䝕䞊䝍䝜䞊䝗 HDFS 䝕䞊䝍䝜䞊䝗 HDFS GemFire XD 䜽䝷䝇䝍 Map/Reduce, Hive, HBase GemFire XD PXF Plugin Hadoop 䜽䝷䝇䝍 䝕䞊䝍䝧䞊䝇 䝕䞊䝍䝉䞁䝍䞊 © Copyright 2014 Pivotal. All rights reserved. 24
  • 25. SQL + IMDG(Objects) + HDFS 䝯䜲䞁䝯䝰䝸䞊䜢䝧䞊䝇䛻㻌ప㐜ᘏ䛷ศᩓ䛧䛯䝡䝑䜾䝕䞊䝍ྥ䛡䛾䝕䞊䝍䝇䝖䜰 ᐇ㝿䛻᧯స䛩䜛 䝕䞊䝍䛾䜏䚸䝯䝰䝸 ୖ䛻ᒎ㛤 Ọ⥆໬䚸䛒䜛䛔䛿 䜰䞊䜹䜲䝤䝕䞊䝍䛿 HDFS䜈 © Copyright 2014 Pivotal. All rights reserved. 25
  • 26. SQL + IMDG(Objects) + HDFS 䝺䝥䝸䜿䞊䝅䝵䞁 / 䝟䞊䝔䜱䝅䝵䞁 䝇䝖䝺䞊䝆䝰䝕䝹: - 䜲䞁䝯䝰䝸 - 䜲䞁䝯䝰䝸䛸䝻䞊䜹䝹䝕䜱䝇䜽 - 䜲䞁䝯䝰䝸䛸HDFS © Copyright 2014 Pivotal. All rights reserved. 26
  • 27. SQL + IMDG(Objects) + HDFS SQL Engine – OLTP䚸䝖䝷䞁䝄䜽䝅䝵䞁 䛻䜒ᑐᛂ IMDG 䜻䝱䝑䝅䝳ᶵ ⬟ - readThru, writeBehind, ➼䚻 © Copyright 2014 Pivotal. All rights reserved. 27
  • 28. SQL + IMDG(Objects) + HDFS HDFS䛸䛾ᐦ䛺⤫ྜ – 䝇䝖䝸䞊䝭䞁䜾䚸R/W䜰䜽䝉䝇 䝕䞊䝍ศᯒ䛿䜲䞁䝯䝰䝸䛷䛿䛺䛟HDFS䜢฼⏝䚷 – 䝅䞊䜿䞁䝅䝱䝹䛺䜰䜽䝉䝇䚸䜒䛧䛟䛿䜲䞁䜽 䝸䝯䞁䝍䝹䛺ฎ⌮䜢ᐇ⌧ 䝕䞊䝍ᢞධ䛾୪ิ໬䛻䜘䜚䚸䝸䜰䝹䝍䜲䝮 䛻㏆䛔ヲ⣽䛺ศᯒ䜒ྍ⬟ © Copyright 2014 Pivotal. All rights reserved. 28
  • 29. SQL + IMDG(Objects) + HDFS MapReduce䛻䛚䛡 䜛reduceฎ⌮䛻䛶 䜲䞁䝯䝰䝸䜈䛾 䝕䞊䝍཯ᫎ 䝸䜰䝹䝍䜲䝮ฎ⌮䛸ศᯒ䛾㛫䛷䛾䜽䝻䞊䝈䝗䛺䝹䞊䝥䜢ᐇ⌧ © Copyright 2014 Pivotal. All rights reserved. 29
  • 30. GemFire XD䛻䛚䛡䜛䝕䞊䝍⟶⌮ CREATE TABLE FLIGHTS ( FLIGHT_ID CHAR(6) NOT NULL , SEGMENT_NUMBER INTEGER NOT NULL , ….. PARTITION BY COLUMN (FLIGHT_ID) PERSISTENT HDFSSTORE RWStore; CREATE HDFSSTORE RWStore NAMENODE hdfs://PHD1:8020 DIR /indexed-tables BATCHSIZE 10 BATCHTIMEINTERVAL 2000 QUEUEPERSISTENT true; Replicated Table Table Replicated Table Replicated Table Partitioned Table Partitioned Table Colocated Partition Colocated Partition Colocated Partition Redundant Partition Partitioned Table Redundant Partition Redundant Partition © Copyright 2014 Pivotal. All rights reserved. 30
  • 31. 䝺䜲䝔䞁䝅䛾ほⅬ䛷䜏䛯㐺⏝⠊ᅖ 䝬䝅䞁䛻䜘䜛 ᛂ⟅ ே㛫䛻䜘䜛 䜲䞁䝍䝷䜽䝔䜱䝤 䝺䝫䞊䝖 䝞䝑䝏ฎ⌮ ᧯స Milliseconds Seconds Seconds, Minutes Minutes, Hours Online/OLTP/Operational DBs Analytics, Data Warehousing GemFire XD PivotalHD HAWQ © Copyright 2014 Pivotal. All rights reserved. 31
  • 33. GemFire XD + Pivotal HD 䝸䜰䝹䝍䜲䝮+䝡䝑䜽䝕䞊䝍䛻䜘䜛Data Lake䜰䞊䜻䝔䜽䝏䝱䛾ᐇ⌧ Online Apps 䝸䜰䝹䝍䜲䝮䝕䞊䝍䛾ᢞධ 䝸䜰䝹䝍䜲䝮ศᯒ 㧗ᗘ䛺䝕䞊䝍ศᯒ 䝕䞊䝍䝰䝕䝹 Sensor Data / Feeds 䝕䞊䝍䝰䝕䝹 ᭦᪂ Map-Reduce Analytic Apps GemFire XD HAWQ HDFS PXF I/P O/P Formatter 䝸䜰䝹䝍䜲䝮/䝉䝭䞉䝸䜰䝹䝍䜲䝮䛷䛾 䝕䞊䝍ᢞධ Shared Data - HFiles PCC ᭦᪂ ศᯒ䝕䞊䝍䛾ྲྀ䜚㎸䜏 ศᯒ䝕䞊䝍䛾ฎ⌮ © Copyright 2014 Pivotal. All rights reserved. 33
  • 34. PaaS䛻䛚䛡䜛䝃䞊䝡䝇䛸䛧䛶䛾ᥦ౪ Pivotal GemFire XD for Pivotal CF Ÿ Pivotal CFୖ䛻ᒎ㛤䛥䜜䜛䜰䝥䝸䜿䞊䝅䝵䞁 䛻ᑐ䛧䛶GemFireXD䜢䝃䞊䝡䝇䛸䛧䛶ᥦ౪ – Pivotal CF: 䜸䞊䝥䞁䝋䞊䝇PaaS䛷䛒䜛Cloud Foundry䛾ၟ⏝䝕䜱䝇䝖䝸䝡䝳䞊䝅䝵䞁 Ÿ ⊂⮬䛾䝃䞊䝡䝇䛸䛧䛶䜒㏣ຍྍ⬟ – Service Broker䛾ᐇ⿦ – 䝴䞊䝄ᐃ⩏䝃䞊䝡䝇 © Copyright 2014 Pivotal. All rights reserved. 34
  • 35. 䜶䞁䝍䞊䝥䝷䜲䝈ྥ䛡 SQL on Hadoop䜶䞁䝆䞁 © Copyright 2014 Pivotal. All rights reserved. 35
  • 36. せⅬ 1. HAWQ䛸䛿ఱ䛛䠛 2. HAWQ䛿㏿䛔 3. HAWQ䛿SQL஫᥮ © Copyright 2014 Pivotal. All rights reserved. 36
  • 37. 䝞䝑䝏ฎ⌮ http://www.anishsneh.com/2014/07/hadoop-mapreduce-api.html © Copyright 2014 Pivotal. All rights reserved. 37
  • 38. 䜰䝗䝩䝑䜽䜽䜶䝸 Ø SELECT id, name, address FROM foo LIMIT 100; Ø SELECT a, b, count(c) FROM bar GROUP BY a, b; … © Copyright 2014 Pivotal. All rights reserved. 38
  • 39. Hadoopྥ䛡䜽䜶䝸ゝㄒ㻌Hive Hive :SQL 䛻ఝ䛯HiveQL䛷MapReduce䜢ᐇ⾜ ᑐヰᘧ䛻኱㔞䝕䞊䝍䜢ฎ⌮䛜ྍ⬟䛻 © Copyright 2014 Pivotal. All rights reserved. 39
  • 40. Hive 䛾ၥ㢟Ⅼ Ÿ ᑠつᶍ䛺䜽䜶䝸䜢ᐇ⾜䛩䜛䛻䜒᫬㛫䛜䛛䛛䜛 – MapReduce䜢౑⏝䛧䛶䛚䜚䚸ẖᅇJavaVM䛾㉳ື-⤊஢ฎ⌮䜢ᐇ⾜䛩䜛Ⅽ Ÿ BI䝒䞊䝹➼䛷ື䛛䛺䛔ሙྜ䛜ከ䛔 – SQL䛾᏶඲஫᥮䛷䛿↓䛔Ⅽ © Copyright 2014 Pivotal. All rights reserved. 40
  • 41. Hive䜘䜚䜒㏿䛟䚸SQL஫᥮䛷䚸 Hadoopୖ䛷ື䛟䜶䞁䝆䞁䛜ᚲせ SQL on Hadoop © Copyright 2014 Pivotal. All rights reserved. 41
  • 42. SQL on Hadoop “HAWQ” HAdoop With Query Ÿ 䜶䞁䝍䞊䝥䝷䜲䝈ྥ䛡䜽䜶䝸䜶䞁䝆䞁 Ÿ HDFSୖ䛻㧗㏿䝕䞊䝍䝧䞊䝇䜶䞁䝆䞁 HAWQ䜢ᦚ㍕ – 䝡䝑䜾䝕䞊䝍䛻ᑐ䛩䜛 㧗㏿䜽䜶䝸ฎ⌮ – ᶆ‽SQL‽ᣐ – ⤫ィゎᯒ㛵ᩘ㻌MADlib 䛻ᑐᛂ PivotalHD MapReduce Pig HDFS © Copyright 2014 Pivotal. All rights reserved. 42
  • 43. HAWQ䛿㏿䛔 User intelligence 4.2 198 Sales analysis 8.7 161 Click analysis 2.0 415 Data exploration 2.7 1,285 BI drill down 2.8 1,815 47X 19X 208X 476X 648X ༢఩䠖⛊ User intelligence 4.2 37 Sales analysis 8.7 596 Click analysis 2.0 50 Data exploration 2.7 55 BI drill down 2.8 59 ༢఩䠖⛊ 9X 69X 25X 20X 21X © Copyright 2014 Pivotal. All rights reserved. 43
  • 44. HAWQ/Impalaẚ㍑ᛶ⬟᳨ド 㻌㻔ᴗ⏺ᶆ‽ᣦᶆ㼀㻼㻯㻙㻰㻿䜢౑⏝㻕 䝃䝫䞊䝖䜽䜶䝸ᩘẚ㍑ 100% (ẕᩘ111䜽䜶䝸) 䝃䝫䞊䝖䛛䛴᏶஢䛧䛯䜽䜶䝸ᩘẚ㍑ 100% (ẕᩘ111䜽䜶䝸) 㠀䝃䝫䞊䝖䜽䜶䝸 㠀䝃䝫䞊䝖䜽䜶䝸 䜒䛧䛟䛿ฎ⌮୰䛻 ␗ᖖ⤊஢䛧䛯䜽䜶䝸 HAWQฎ⌮᫬㛫䜢䠍䛸䛧䛯ሙྜ䛾 Impalaฎ⌮᫬㛫 Impala䛸᏶ẚ஢䛧㍑䛯䛧䜽䛶䜶䝸ᖹ ᆒ6ಸ䛾㧗㏿ 28% 18% ᛶ⬟ẚ 䈜㻚㻌㼀㻼㻯㻙㻰㻿䛸䛿ᴗ⏺ᶆ‽䛾ᛶ⬟ᣦᶆ䜢ᐃ⩏䛩䜛ᴗ⏺ᅋయ㻌㼀㻼㻯㻌㻔㼀㼞㼍㼚㼟㼍㼏㼠㼕㼛㼚㻌㻼㼞㼛㼏㼑㼟㼟㼕㼚㼓㻌㻼㼑㼞㼒㼛㼞㼙㼍㼚㼏㼑㻌㻯㼛㼡㼚㼏㼕㼘㻕㻌䛜ᥦ౪䛩䜛᝟ሗ⣔䝅䝇䝔䝮䛾䛯䜑䛾බᘧ䛺ᛶ⬟ᣦᶆ䛷䛩䚹㻞㻜㻝㻞ᖺ䛛䜙 ᥦ౪䛥䜜䛶䛔䜛㻌㼀㻼㻯㻙㻰㻿㻌䛷䛿䚸ὶ㏻ᴗ䛻䛚䛡䜛඾ᆺⓗ䛺䝕䞊䝍䝰䝕䝹䜢෌⌧䛧䛶䛚䜚䚸㻞㻡䝔䞊䝤䝹䚸㻠㻞㻥䜹䝷䝮䛻ᑐ䛧䛶ᵝ䚻䛺䜽䜶䝸䜢ᐇ⾜䛧䜎䛩䚹 䈜㻚㻌᳨ド䛷䛿㻌㻼㻴㻰㻝㻚㻝㻛㻴㻭㼃㻽㻝㻚㻝㻌䛸㻌㻯㻰㻴㻠㻚㻠㻛㻵㼙㼜㼍㼘㼍㻌㻝㻚㻝㻚㻝㻚䜢౑⏝䛧䜎䛧䛯䚹 ཧ⪃㼁㻾㻸㻦㻌㻻㼞㼏㼍㻦㻌㻭㻌㻹㼛㼐㼡㼘㼍㼞㻌㻽㼡㼑㼞㼥㻌㻻㼜㼠㼕㼙㼕㼦㼑㼞㻌㻭㼞㼏㼔㼕㼠㼑㼏㼠㼡㼞㼑㻌㼒㼛㼞㻌㻮㼕㼓㻌㻰㼍㼠㼍㻌㻌㼔㼠㼠㼜㻦㻛㻛㼣㼣㼣㻚㼓㼛㼜㼕㼢㼛㼠㼍㼘㻚㼏㼛㼙㻛㼟㼕㼠㼑㼟㻛㼐㼑㼒㼍㼡㼘㼠㻛㼒㼕㼘㼑㼟㻛㻿㻵㻳㻹㻻㻰㻹㼍㼥㻞㻜㻝㻠㻴㻭㼃㻽㻭㼐㼢㼍㼚㼠㼍㼓㼑㼟㻚㼜㼐㼒 ཧ⪃㼁㻾㻸㻦㻌㼇㻞㻜㻝㻠㻛㻢㻛㻞㻡㼉㻌㻼㼕㼢㼛㼠㼍㼘㻌㻴㻭㼃㻽㻌㻮㼑㼚㼏㼔㼙㼍㼞㼗㻌㻰㼑㼙㼛㼚㼟㼠㼞㼍㼠㼑㼟㻌㼁㼜㻌㼀㼛㻌㻞㻝㼤㻌㻲㼍㼟㼠㼑㼞㻌㻼㼑㼞㼒㼛㼞㼙㼍㼚㼏㼑㻌㼛㼚㻌㻴㼍㼐㼛㼛㼜㻌㻽㼡㼑㼞㼕㼑㼟㻌㼀㼔㼍㼚㻌㻿㻽㻸㻙㼘㼕㼗㼑㻌㻿㼛㼘㼡㼠㼕㼛㼚㼟㻌㼔㼠㼠㼜㻦㻛㻛㼎㼘㼛㼓㻚㼓㼛㼜㼕㼢㼛㼠㼍㼘㻚㼏㼛㼙㻛㼜㼕㼢㼛㼠㼍㼘㻛㼜㼞㼛㼐㼡㼏㼠㼟㻛㼜㼕㼢㼛㼠㼍㼘㻙㼔㼍㼣㼝㻙 㼎㼑㼚㼏㼔㼙㼍㼞㼗㻙㼐㼑㼙㼛㼚㼟㼠㼞㼍㼠㼑㼟㻙㼡㼜㻙㼠㼛㻙㻞㻝㼤㻙㼒㼍㼟㼠㼑㼞㻙㼜㼑㼞㼒㼛㼞㼙㼍㼚㼏㼑㻙㼛㼚㻙㼔㼍㼐㼛㼛㼜㻙㼝㼡㼑㼞㼕㼑㼟㻙㼠㼔㼍㼚㻙㼟㼝㼘㻙㼘㼕㼗㼑㻙㼟㼛㼘㼡㼠㼕㼛㼚㼟 © Copyright 2014 Pivotal. All rights reserved. 44
  • 45. 䝕䞊䝍ฎ⌮䝣䝻䞊ẚ㍑ 䜽䜶䝸䛾ᢞධ 䝥䝷䞁సᡂ 䜽䜶䝸ᐇ⾜ ⤖ᯝ䛾㏉ಙ 䝹䞊䝹䝧䞊䝇䛾䜸䝥䝔䜱䝬䜲䝄 • 䝔䞊䝤䝹䝕䞊䝍䛾ෆᐜ䛻㛵䜟䜙䛪䜽䜶 䝸䛻䜘䛳䛶ᐇ⾜䝥䝷䞁䜢సᡂ • MapReduce䝇䜽䝸䝥䝖䜢సᡂ MapReduceฎ⌮ • Java䝥䝻䝉䝇䛾㉳ື䞉೵Ṇ ୰㛫䝕䞊䝍䛾䝕䜱䝇䜽ฎ⌮ • ㏲ḟ䝕䜱䝇䜽IO䛾Ⓨ⏕ 䝁䝇䝖䝧䞊䝇䜸䝥䝔䜱䝬䜲䝄 Orca • 䝔䞊䝤䝹䝕䞊䝍䛾ෆᐜ(䝕䞊䝍㔞䚸䜹䞊 䝕䜱䝘䝸䝔䜱➼)䜢㋃䜎䛘᭱㐺䛺ᐇ⾜䝥䝷 䞁䜢సᡂ C䝥䝻䝉䝇ฎ⌮ • ᖖ㥔䝥䝻䝉䝇䛻䜘䜛༶᫬ฎ⌮ ୰㛫䝕䞊䝍䛾䜸䞁䝯䝰䝸ฎ⌮ • 䝟䜲䝥䝷䜲䞁ฎ⌮䛻䜘䜛䜸䞁䝯䝰䝸䛾㧗 ㏿ฎ⌮ • 䝕䜱䝇䜽IO䜢᤼㝖 © Copyright 2014 Pivotal. All rights reserved. 45
  • 46. 䜽䜶䝸䜸䝥䝔䜱䝬䜲䝄䛜ᐇ⌧䛩䜛㧗㏿䝕䞊䝍ฎ⌮ MapReduce䜢௓䛥䛪䝕䞊䝍䜢䝟䜲䝥䝷䜲䞁ฎ⌮ Ÿ 䝁䝇䝖䝧䞊䝇䛾䜸䝥䝔䜱䝬䜲䝄䛜᭱ 㐺䛺ᐇ⾜䝥䝷䞁䜢㑅ᢥ – DBฎ⌮(䝇䜻䝱䞁䚸䝆䝵䜲䞁䚸䝋䞊䝖䚸㞟 ィ➼)䛻ᑐ䛧䛶䝁䝇䝖䜢⟬ฟ – 䝉䜾䝯䞁䝖㛫㏻ಙ(“䝰䞊䝅䝵䞁”)䜒䜸䝥 䝔䜱䝬䜲䝄䛜ᣦ♧ Ÿ 䝎䜲䝘䝭䝑䜽䝟䜲䝥䝷䜲䞁ฎ⌮ – ୰㛫䝕䞊䝍䛾䜸䞁䝯䝰䝸ฎ⌮ PHYSICAL EXECUTION PLAN FROM SQL Gather Motion 4:1(Slice 3) Sort HashAggregate HashJoin Redistribute Motion 4:4(Slice 1) HashJoin Hash HashJoin Seq Scan on customer Hash Hash Broadcast Motion 4:4(Slice 2) Seq Scan on motion Seq Scan on lineitem Seq Scan on orders © Copyright 2014 Pivotal. All rights reserved. 46
  • 47. ᭱᪂䜽䜶䝸䜸䝥䝔䜱䝬䜲䝄 Orca䛻䜘䜛ᅽಽⓗᛶ⬟ྥୖ ᚑ᮶䜽䜶䝸䜸䝥䝔䜱䝬䜲䝄ẚᖹᆒ䠑ಸ䛾ᛶ⬟ྥୖ䜢ᐇ⌧ Ÿ 䝆䝵䜲䞁䜸䞊䝎䞊䝸䞁䜾 – 䜲䞁䝍䝁䝛䜽䝖䜈䛾኱つᶍ䝕䞊䝍㌿㏦䜢㜵䛠䝔䞊䝤䝹⤖ྜ㡰ᗎ䜢㑅ᐃ Ÿ ┦㛵䝃䝤䜽䜶䝸ฎ⌮ – ཯᚟ⓗ䝃䝤䜽䜶䝸ฎ⌮䜢ᅇ㑊 Ÿ ືⓗ㻌䝟䞊䝔䜱䝅䝵䞁᤼㝖䝇䜻䝱䞁 – 䜽䜶䝸ฎ⌮୰䛾୰㛫䝕䞊䝍䛻䛒䜟䛫䛶䝇䜻䝱䞁ᑐ㇟䝟䞊䝔䜱䝅䝵䞁䜢ື ⓗ䛻㑅ᐃ䞉᤼㝖 © Copyright 2014 Pivotal. All rights reserved. 47
  • 48. 䝆䝵䜲䞁䜸䞊䝎䝸䞁䜾䛾౛ 䝴䞊䝄IDẖ䛻㞟ィ䛧䛯䛔ሙྜ A B join ྛ䝜䞊䝗䛻 䝕䞊䝍䜢ศ㓄 A (100୓௳䚸 䝴䞊䝄id䛷ศᩓ) B (100୓௳䚸 ᫂⣽id䛷ศᩓ) ྛ䝜䞊䝗䛻 䝕䞊䝍䜢ศ㓄 join or 䛹䛱䜙䛜䜘䛔䛛䠛 © Copyright 2014 Pivotal. All rights reserved. 48
  • 49. ศᩓ䜻䞊䛾௳ᩘ䜢䝠䝇䝖䜾䝷䝮໬ A䛾䝜䞊䝗ẖ䛾䝴䞊䝄idᩘ B䛾䝜䞊䝗ẖ䛾䝴䞊䝄idᩘ A (䝴䞊䝄id䛷ศᩓ) B (᫂⣽id䛷ศᩓ) 䝔䞊䝤䝹᝟ሗ䛛䜙ศᯒ 䝔䞊䝤䝹᝟ሗ䛛䜙ศᯒ © Copyright 2014 Pivotal. All rights reserved. 49
  • 50. 䝠䝇䝖䜾䝷䝮䛛䜙ᐇ⾜᫬㛫䛾ぢ✚䜒䜚 A䛾䝜䞊䝗ẖ䛾䝴䞊䝄idᩘ B䛾䝜䞊䝗ẖ䛾䝴䞊䝄idᩘ A䜢ᅛᐃ䛧䛶B䛾䝕䞊䝍䜢ྛ䝜䞊䝗䛻ศ㓄䛧䛯ሙྜ䛾ᐇ⾜᫬㛫 B䜢ᅛᐃ䛧䛶A䛾䝕䞊䝍䜢ྛ䝜䞊䝗䛻ศ㓄䛧䛯ሙྜ䛾ᐇ⾜᫬㛫 © Copyright 2014 Pivotal. All rights reserved. 50
  • 51. 䛣䛱䜙䛾᪉䛜᫬㛫䛜䛛䛛䜙䛺䛔䛾䛷᥇⏝ A B ศ㓄 ศ㓄 join A B join ẚ㍑ © Copyright 2014 Pivotal. All rights reserved. 51
  • 52. HAWQ䛷䛿⮬ື䛷䝆䝵䜲䞁䜸䞊䝎䝸䞁䜾䜢ᐇ᪋ ᥇⏝ A B ศ㓄 ศ㓄 join A B join ẚ㍑ © Copyright 2014 Pivotal. All rights reserved. 52
  • 53. HAWQ䛿SQL‽ᣐ SQL ‘92 ’93 2003 OLAPᑐᛂ 䝝䞊䝗/OS RDBMS BI䝒䞊䝹 䝝䞊䝗/OS HDFS BI䝒䞊䝹 © Copyright 2014 Pivotal. All rights reserved. 53
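  • 54. HAWQ is backed by GreenplumDB's 10-year track record
    HAWQ adopts the core technologies of GreenplumDB:
      • Standard SQL support
      • Cost-based optimizer
      • Dynamic pipelining
      • Support for both row store and column store
      • Compression (QuickLZ, ZLIB, RLE)
      • Distributed storage
      • Multi-level partitioning
      • Parallel load and unload
      • Fast data redistribution
      • Statistical analysis functions (MADlib)
      • SELECT
      • INSERT
      • JOIN
      • Views
      • External tables
      • Resource management
      • Security
      • Authentication
      • Administration and monitoring
      • ODBC/JDBC support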
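  • 55. MADlib, a distributed numerical computing and machine learning library, is bundled
    Predictive modeling library
    Latest release: MADlib v1.6, URL: madlib.net
    Machine learning algorithms
      • Principal component analysis (PCA)
      • Association rule analysis (affinity analysis, market basket analysis)
      • Topic modeling (parallel LDA)
      • Decision trees
      • Ensemble learning (random forests)
      • Support vector machines
      • Conditional random fields (CRF)
      • Clustering (k-means)
      • Cross-validation
    Linear systems
      • Sparse matrix solvers
      • Dense matrix solvers
    Generalized linear models
      • Linear regression
      • Logistic regression
      • Multinomial logistic regression
      • Cox proportional hazards regression
      • Elastic net regularization
      • Sandwich estimators
    Matrix factorization
      • Singular value decomposition (SVD)
      • Low-rank approximation
    Descriptive statistics
      Sketch-based estimators
        • CountMin sketch
        • Flajolet-Martin sketch
        • Most-frequent-values sketch
      Correlation
      Summary statistics
    Support modules
      Array operations
      Sparse vectors
      Random sampling
      Probability functions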
  • 56. Example of a MADlib-supported function: K-means (before)
    Diagram: a scatter of points on x and y axes.
    Goal: roughly split the cloud of points into three groups.
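  • 57. Example of a MADlib-supported function: K-means (after)
    Diagram: the same points on x and y axes, now grouped into cluster A, cluster B, and cluster C.
    Centroids (●) are created so that the cloud of points is spread evenly across three clusters.
    Each point belongs to the cluster whose centroid (●) is nearest.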
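  • 58. Running K-means in MADlib
    Example: computing the centroids
    SELECT * FROM madlib.kmeanspp(   -- MADlib function
        'km_sample',                 -- input table
        'points',                    -- column holding the point coordinates
        3,                           -- number of clusters
        'madlib.squared_dist_norm2', -- distance function
        'madlib.avg',                -- aggregate used to recompute each centroid
        20,                          -- maximum number of iterations
        0.001                        -- minimum fraction of reassigned points needed to keep iterating
    );
    Output:
    centroids | {{13.24,2.59, … ,735},{13.856,…,1078},{14.255,…,1378.75}} …
    (the positions of the three centroids)
    Details: http://doc.madlib.net/latest/group__grp__kmeans.html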
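To make the parameters of the call on slide 58 concrete, here is a minimal pure-Python sketch of the k-means loop. It is not MADlib's implementation. The seeding is a deterministic farthest-point stand-in for the randomized k-means++ seeding that `kmeanspp` is named after, and the sample points and function names are hypothetical. The arguments `k`, `max_iter`, and `min_frac_reassigned` play the same roles as the `3`, `20`, and `0.001` in the SQL call.

```python
def squared_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def mean(points):
    """Component-wise average of a non-empty list of points."""
    return tuple(sum(coords) / len(points) for coords in zip(*points))

def farthest_point_seeds(points, k):
    """Deterministic seeding: repeatedly pick the point farthest from the seeds so far."""
    seeds = [points[0]]
    while len(seeds) < k:
        seeds.append(max(points, key=lambda p: min(squared_dist(p, s) for s in seeds)))
    return seeds

def kmeans(points, k, max_iter=20, min_frac_reassigned=0.001):
    centroids = farthest_point_seeds(points, k)
    assignment = [None] * len(points)
    for _ in range(max_iter):
        reassigned = 0
        # Assignment step: each point joins the cluster of its nearest centroid.
        for i, p in enumerate(points):
            nearest = min(range(k), key=lambda c: squared_dist(p, centroids[c]))
            if nearest != assignment[i]:
                assignment[i] = nearest
                reassigned += 1
        # Update step: move each centroid to the average of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = mean(members)
        # Stop once almost no point changed cluster.
        if reassigned / len(points) < min_frac_reassigned:
            break
    return centroids, assignment

# Hypothetical sample: three well-separated groups of 2-D points.
points = [
    (1.0, 1.0), (1.5, 2.0), (1.0, 1.5),
    (8.0, 8.0), (8.5, 9.0), (9.0, 8.0),
    (1.0, 9.0), (1.5, 8.5), (2.0, 9.0),
]
centroids, assignment = kmeans(points, k=3)
print(assignment)
print(centroids)
```

In MADlib the same loop runs inside the database, in parallel across segments, so the data does not have to be exported to a client first.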
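  • 59. MicroAd: Pivotal HD + HAWQ case study
    With Pivotal HD + HAWQ, the amount of data available for analysis was expanded at low cost without touching any existing SPSS assets. This contributed to better data analysis accuracy, which is the source of the customer's competitiveness.
    Existing environment
      • An analytics platform built on IBM PureData/SPSS
      • Stored data grew as analysis requirements became more sophisticated, exhausting PureData capacity
      • Hadoop (the free edition of Cloudera) was used alongside it to contain costs
    Problem
      • With HIVE, the standard Hadoop tool, SPSS queries "would not run", "failed with errors partway through", or "were extremely slow", so it was not fit for practical use
    Results of introducing Pivotal HD + HAWQ
      • 100% of SPSS queries usable without modification
      • Up to about 70 times faster than HIVE
      • Implemented on 6 commodity IA servers
    Diagram labels: (DWH) IBM PureData; (Hadoop) Pivotal; (BA) IBM SPSS
    With a small investment, analyses that were previously impossible can now be run over a wide range of data.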
  • 60. HAWQ: conclusions
    1. HAWQ is one of the SQL-on-Hadoop engines: it is the Greenplum query engine made to run on Hadoop.
    2. HAWQ is tens to hundreds of times faster than Hive, and several times faster than Impala.
    3. HAWQ is SQL compatible, so it is easy to use from existing tools.
  • 61. Pivotal: realizing enterprise transformation in the big data era
    • The cycle of data accumulation → analysis → application
    • The "data lake" concept: a big data platform that captures all kinds of data
  • 62. Pivotal: realizing enterprise transformation in the big data era
    • The cycle of data accumulation → analysis → application
    • The "data lake" concept: a big data platform that captures all kinds of data
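  • 63. Consulting services offered by Pivotal
    • "We want to introduce a big data platform"
      – "...but what should we do about log transfer and the like?"
    • "We want to do data analysis and machine learning"
    Diagram labels: applications; open-source cloud (PaaS) platform; new business models; core system integration; fast data (M2M/real time); agile development; analytics; data; data science; big data (DWH/Hadoop)
    Pivotal Japan provides deployment support and training.
    Japanese data mining engineers and data scientists are on staff.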
  • 64. A NEW PLATFORM FOR A NEW ERA