In this session, Satoru Goto, Solutions Engineer at MariaDB, shows how the Pentaho connector for MariaDB ColumnStore can be used both for BI/reporting on MariaDB ColumnStore and for loading data into it.
Using Pentaho with MariaDB ColumnStore
1. Using Pentaho & Tableau
with MariaDB ColumnStore
GOTO Satoru
Customer Solutions Engineer
MariaDB Corporation
2. Note
● Q&A is limited to ≈ 5 minutes at the end of
the session
○ so that we can get through all slides
3. Agenda
● Motivation
● What is MariaDB ColumnStore
● What is Tableau
● Analytics with Tableau & ColumnStore
● What is Pentaho
● MariaDB ColumnStore bulk load plugin for Pentaho Data
Integration (PDI)
● Using Pentaho EE Analyzer with MariaDB ColumnStore
5. Motivation to use Tableau/Pentaho Analyzer
with MariaDB ColumnStore
● MariaDB Corp. does not provide a GUI tool
● For interactive analytics, a GUI front end is needed
6. Motivation of MariaDB ColumnStore bulk
load plugin for PDI on Windows
● ColumnStore Server runs only on Linux
● BI end users are often not familiar with Linux; enterprise
end users use Windows
● A GUI bulk loader is needed
8. MariaDB ColumnStore architecture
[Diagram: applications (BI Tool, SQL Client, Custom Big Data App) connect to the MariaDB SQL Front End (UM), which sits above the Distributed Query Engine (PM) and the Columnar Distributed Data Storage (Local Storage | SAN | NAS | EBS | GlusterFS)]
9. MariaDB ColumnStore architecture
• Massively parallel
architecture (MPP)
– Linear scalability
• Horizontal scaling
– Add new data nodes
as your data grows
– Read queries continue
while new nodes are added
[Diagram: User Module (UM) and Performance Module (PM) layers above Data Storage; the data storage is shared-nothing, distributed, and highly compressed by default]
11. Test environment
Software                                          | OS
MariaDB ColumnStore 1.2.2                         | CentOS 7
MariaDB Connector/J 2.3.0                         | OS independent
MariaDB Connector/ODBC 3.0.8                      | Microsoft Windows 10 Pro
Tableau Desktop 2019.1                            | Microsoft Windows 10 Pro
MariaDB Pentaho Kettle bulk loader plugin 1.2.2   | Microsoft Windows 10 Pro
Pentaho EE 8.2 (PDI included)                     | Microsoft Windows 10 Pro
13. What is Tableau Desktop
● Widely used interactive data visualization tool for
BI (Business Intelligence) / Analytics
● Tableau Desktop 2019.1 now officially supports
MariaDB Connector/ODBC
15. Sample data set
● https://github.com/mariadb-corporation/mariadb-columnstore-samples
● ‘Flights’ sample
○ on-time performance dataset for US-based flights from the US
Department of Transportation
16. Connect to MariaDB ColumnStore w/
Tableau 2019.1
● The first time, you
need to download
and install MariaDB
Connector/ODBC
19. Connect to ColumnStore via Connector/ODBC
● Tableau must be
restarted
● After restarting Tableau
Desktop 2019.1, you can
select the MariaDB data
source
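Outside Tableau, the same Connector/ODBC driver can be addressed with a plain ODBC connection string, which is handy for checking the installation. The sketch below only assembles such a string. The driver name "MariaDB ODBC 3.0 Driver", the database name, and the credentials are assumptions for illustration; check the name registered in the Windows ODBC Data Source Administrator.

```python
def odbc_connection_string(server, database, user, password, port=3306,
                           driver="MariaDB ODBC 3.0 Driver"):
    """Assemble a DSN-less ODBC connection string.

    The default driver name is an assumption; use whatever name the
    installed Connector/ODBC registered on your machine.
    """
    parts = [
        ("DRIVER", "{" + driver + "}"),
        ("SERVER", server),
        ("PORT", str(port)),      # 3306 is the default MariaDB port
        ("DATABASE", database),
        ("UID", user),
        ("PWD", password),
    ]
    return ";".join(f"{key}={value}" for key, value in parts)


# Hypothetical values; replace with your own server and account.
conn_str = odbc_connection_string("192.168.2.12", "flights",
                                  "tableau_user", "secret")
print(conn_str)
```

A string like this can be handed to any ODBC client library to confirm that the driver loads and the server is reachable before starting Tableau.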
26. What is Pentaho
● Developed by Hitachi Vantara
(Pentaho Corporation was acquired by Hitachi in 2015)
● Pentaho Server : Business Analytics Platform
● PDI (Pentaho Data Integration) : ETL engine
○ a.k.a. Kettle
https://www.hitachivantara.com/go/pentaho.html
33. Windows 10 dependencies
● Visual C++ Redistributable for Visual Studio 2015 (x64)
is required to use the Bulk Write SDK
https://github.com/mariadb-corporation/mariadb-columnstore-data-adapters/tree/master/kettle-columnstore-bulk-exporter-plugin
34. CSV bulk load into ColumnStore
with PDI 8.2 Kettle plugin
36. Bulk load Kettle plugin : CSV file input step
● Drag the “CSV file input”
step onto the canvas
● Click “CSV file input”
https://help.pentaho.com/Documentation/8.2/Products/Data_Integration/Transformation_Step_Reference/CSV_File_Input
37. Bulk load Kettle plugin : CSV file input step
● A “header row” in the
CSV is recommended
(for field mapping)
● Otherwise you need
to map all fields by
hand
● Click “Get Fields”
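Why the header row helps can be illustrated with a small sketch: with column names available, CSV fields can be paired with table columns by name instead of by hand. This is not the plugin's own logic, only an illustration; the column names and the helper functions are made up for the example.

```python
import csv
import io


def header_fields(csv_text, delimiter=","):
    """Return the column names found in the first row of the CSV text."""
    reader = csv.reader(io.StringIO(csv_text), delimiter=delimiter)
    return next(reader)


def map_fields(csv_columns, table_columns):
    """Pair each table column with the CSV column of the same name.

    Matching is case-insensitive; table columns without a CSV
    counterpart map to None and would need manual mapping.
    """
    by_lower = {name.lower(): name for name in csv_columns}
    return {col: by_lower.get(col.lower()) for col in table_columns}


# Hypothetical data, not the real layout of the 'Flights' sample.
sample = "year,month,carrier,dep_delay\n2018,1,AA,5\n"
fields = header_fields(sample)
mapping = map_fields(fields, ["year", "month", "carrier", "arr_delay"])
print(fields)
print(mapping)
```

Here `arr_delay` maps to `None`, the case where a field still has to be mapped manually.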
42. Bulk load Kettle plugin : Columnstore.xml
Replace every occurrence of 127.0.0.1 with the IP address of the ColumnStore server in
Columnstore.xml :
<?xml version="1.0" encoding="UTF-8"?>
<!-- $Id: Columnstore.xml 3713 2013-08-16 20:04:47Z wweeks $ -->
<!-- WARNING: Do not make changes to this file unless directed to do so by
MariaDB service engineers. Incorrect settings can render your system
unusable and will require a service call to correct.
-->
<Columnstore Version="V1.0.0">
<ExeMgr1>
<IPAddr>192.168.2.12</IPAddr>
<Port>8601</Port>
<Module>pm1</Module>
</ExeMgr1>
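The replacement can also be scripted rather than done by hand in an editor. The following is a minimal sketch, assuming the file content has already been read into a string; the function name and the shortened sample XML are illustrative only. The pattern matches 127.0.0.1 only as a whole address, so a value such as 127.0.0.10 is left alone.

```python
import re

# Match 127.0.0.1 only when it is not part of a longer dotted number.
LOOPBACK = re.compile(r"(?<![\d.])127\.0\.0\.1(?![\d.])")


def point_to_server(xml_text, server_ip):
    """Replace every loopback address and return (new_text, replacements)."""
    return LOOPBACK.subn(lambda match: server_ip, xml_text)


# Shortened, hypothetical stand-in for a real Columnstore.xml.
sample = """<Columnstore Version="V1.0.0">
<ExeMgr1>
<IPAddr>127.0.0.1</IPAddr>
<Port>8601</Port>
<Module>pm1</Module>
</ExeMgr1>
</Columnstore>"""

updated, count = point_to_server(sample, "192.168.2.12")
print(count)
print(updated)
```

Keeping a backup copy of the original Columnstore.xml before writing the result back is a sensible precaution, given the warning in the file header.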
58. Sample data set
● https://github.com/mariadb-corporation/mariadb-columnstore-samples
● ‘Flights’ sample
○ US Department of Transportation on-time performance
dataset for US-based flights
76. Summary
● Interactive Analytics using Tableau Desktop
connecting to ColumnStore via MariaDB
Connector/ODBC
● Data bulk load to MariaDB ColumnStore using PDI
Kettle plugin
● Interactive Analytics using Pentaho EE connecting
to ColumnStore via MariaDB Connector/J