Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
© Hitachi America, Ltd. 2017. All rights reserved.
Overview of webSpoon
@Pentaho Bay Area Meetup
4/27/2017
Researcher at H...
© Hitachi America, Ltd. 2017. All rights reserved.
Contents
1
• What is webSpoon?
• Use cases
• Hands-on demo
• Technical ...
© Hitachi America, Ltd. 2017. All rights reserved.
Spoon: a graphical designer for PDI
2
• Intuitive user interface (graph...
© Hitachi America, Ltd. 2017. All rights reserved.
webSpoon: a browser-based Spoon
3
• webSpoon works on any latest browse...
© Hitachi America, Ltd. 2017. All rights reserved.
Use cases
4
© Hitachi America, Ltd. 2017. All rights reserved.
Data Security: Keep data where they should be
5
Spoon webSpoon
• Data e...
© Hitachi America, Ltd. 2017. All rights reserved.
Data integration of sensor data in remote sites
6
Thai factory
Tokyo of...
© Hitachi America, Ltd. 2017. All rights reserved.
Data integration of sensor data in remote sites
7
Tokyo office
*HTTPS: ...
© Hitachi America, Ltd. 2017. All rights reserved.
Managed Pentaho development environment
8
• Different version, plugin, ...
© Hitachi America, Ltd. 2017. All rights reserved.
Unified environment for everyone
9
• Dev envs are moving to the cloud, ...
© Hitachi America, Ltd. 2017. All rights reserved.
Hands-on demo
10
© Hitachi America, Ltd. 2017. All rights reserved.
Technical overview
11
© Hitachi America, Ltd. 2017. All rights reserved.
webSpoon = Spoon - SWT + RWT
12
• Spoon relies on SWT for UI widgets (e...
© Hitachi America, Ltd. 2017. All rights reserved.
1. Local files
– Spoon: local files of the laptop/desktop
– webSpoon: l...
© Hitachi America, Ltd. 2017. All rights reserved.
Secure webSpoon
14
• Keep in mind that webSpoon is shared by multiple u...
© Hitachi America, Ltd. 2017. All rights reserved.
Resources
15
• GitHub
– https://github.com/HiromuHota/pentaho-kettle
• ...
© Hitachi America, Ltd. 2016. All rights reserved.
Trademarks and copyrights
17
• Pentaho is a trademark registered by Pen...
Overview of webSpoon @ Pentaho Bay Area Meetup
Upcoming SlideShare
Loading in …5
×

Overview of webSpoon @ Pentaho Bay Area Meetup

2,398 views

Published on

webSpoon is a web-based Spoon, an graphical designer for Pentaho Data Integration.
This is a slide deck used in a Pentaho Bay Area Meetup on 4/27/17 (http://meetu.ps/e/CFNKy/cb4hM/f)

Published in: Software
  • Be the first to comment

Overview of webSpoon @ Pentaho Bay Area Meetup

  1. 1. © Hitachi America, Ltd. 2017. All rights reserved. Overview of webSpoon @Pentaho Bay Area Meetup 4/27/2017 Researcher at Hitachi America, Ltd. Hiromu Hota, PhD @HiromuHota, hiromu.hota@hal.hitachi.com
  2. 2. © Hitachi America, Ltd. 2017. All rights reserved. Contents 1 • What is webSpoon? • Use cases • Hands-on demo • Technical overview
  3. 3. © Hitachi America, Ltd. 2017. All rights reserved. Spoon: a graphical designer for PDI 2 • Intuitive user interface (graphical, drag and drop) • OS-agnostic, but locked-in a desktop Image from www.pentaho.com
  4. 4. © Hitachi America, Ltd. 2017. All rights reserved. webSpoon: a browser-based Spoon 3 • webSpoon works on any latest browser, accessible over a network. • webSpoon has advantages: • webSpoon is NOT a product (supported by neither Pentaho nor Hitachi). Smartphone/tablet CloudData security Ease of mgmt.Remote use Desktop/laptop
  5. 5. © Hitachi America, Ltd. 2017. All rights reserved. Use cases 4
  6. 6. © Hitachi America, Ltd. 2017. All rights reserved. Data Security: Keep data where they should be 5 Spoon webSpoon • Data engineers should physically be near data. • They might be tempted to download data to work in their office. • They can work from office, home, or wherever comfortable. Hospital/Government/Bank Data When data cannot leave facility/country due to some regulations,
  7. 7. © Hitachi America, Ltd. 2017. All rights reserved. Data integration of sensor data in remote sites 6 Thai factory Tokyo office Skilled engineer 1. Kettle files need updating frequently for many reasons: • New machine, new sensor, new analytics, etc. 2. But, remote desktop (RDP) is prohibited and travel costs. Data copy/move User interaction Travel costs RDP Remote desktop prohibited Sensor Database Spoon *Kettle file: Transformation or Job written in PDI (Cropped) Asia - Single Color by FreeVectorMaps.com
  8. 8. © Hitachi America, Ltd. 2017. All rights reserved. Data integration of sensor data in remote sites 7 Tokyo office *HTTPS: HTTP Secure Skilled engineer Data copy/move User interaction No travel General protocol HTTPS Thai factory Sensor Database webSpoon (Cropped) Asia - Single Color by FreeVectorMaps.com 1. Kettle files need updating frequently for many reasons: • New machine, new sensor, new analytics, etc. 2. But, remote desktop (RDP) is prohibited and travel costs.
  9. 9. © Hitachi America, Ltd. 2017. All rights reserved. Managed Pentaho development environment 8 • Different version, plugin, etc. slows down collaboration. • Could possibly be – Outdated. – Malicious plugins & drivers. Spoon webSpoon Bob Alice Ver. X.X Ver. Y.Y Your Kettle file does not run in my environment! • All Kettle files run in coworker’s screen. • No installation/upgrade/update required (by end-users). • Only desired plugins & drivers. Your Kettle file runs in my environment! *Kettle file: Transformation or Job written in PDI Plugin A Driver B Plugin A Driver C Bob Alice
  10. 10. © Hitachi America, Ltd. 2017. All rights reserved. Unified environment for everyone 9 • Dev envs are moving to the cloud, Spoon shouldn’t be left behind. (e.g., Cloud9 IDE, Eclipse Che, etc.) • IT admins should only take care of a single unified environment. – Easier management of IT – Better data governance • Collaborations between different roles become seamless. Data scientists Data engineers BI developers webSpoon Databases
  11. 11. © Hitachi America, Ltd. 2017. All rights reserved. Hands-on demo 10
  12. 12. © Hitachi America, Ltd. 2017. All rights reserved. Technical overview 11
  13. 13. © Hitachi America, Ltd. 2017. All rights reserved. webSpoon = Spoon - SWT + RWT 12 • Spoon relies on SWT for UI widgets (e.g., button, dialog, canvas). • RWT is a web alternative to SWT and “largely” implements SWT APIs, meaning Spoon can become a web app with most codes intact. • There are – Unimplemented SWT APIs (e.g., a part of GC, some Mouse events) – RWT-specific additional APIs (e.g., Multi-user, File Up/Download). Image adapted from https://angelozerr.wordpress.com/2011/05/24/rap_step5/ Operating System Servlet Container Web Browser SWT RWT (Server) RWT (Client) JFace JFace HTTP Spoon webSpoon
  14. 14. © Hitachi America, Ltd. 2017. All rights reserved. 1. Local files – Spoon: local files of the laptop/desktop – webSpoon: local files of the (remote) server 2. Clipboard – Spoon and webSpoon do not share the clipboard. – In other words, no copy & paste between Spoon and webSpoon. How is webSpoon different from Spoon? 13 File A is local File B is local File A is local File B is remote File A File B webSpoon Spoon
  15. 15. © Hitachi America, Ltd. 2017. All rights reserved. Secure webSpoon 14 • Keep in mind that webSpoon is shared by multiple users. • Secure webSpoon by using secured communication and user authentication. • Also important to use secure communication and user authentication for Kettle files/data sources. HTTPS webSpoon Browser SFTP Files in a file server Kettle files in Pentaho Repository : User authentication Local files HTTPS
  16. 16. © Hitachi America, Ltd. 2017. All rights reserved. Resources 15 • GitHub – https://github.com/HiromuHota/pentaho-kettle • Docker Hub – https://hub.docker.com/r/hiromuhota/webspoon
  17. 17. © Hitachi America, Ltd. 2016. All rights reserved. Trademarks and copyrights 17 • Pentaho is a trademark registered by Pentaho Corporation. • Cloud9 IDE is a trademark of Cloud9 IDE, Inc. • Eclipse Che is a trademark registered by The Eclipse Foundation. • Apache Hadoop and its logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. No endorsement by The Apache Software Foundation is implied by the use of these marks. • The Git Logo by Jason Long is licensed under the Creative Commons Attribution 3.0 Unported License. • The R logo is © 2016 The R Foundation. • The Python logo is a trademark of the Python Software Foundation. • GitHub is a trademark registered in the United States by GitHub, Inc. • Docker and the Docker logo are trademarks or registered trademarks of Docker, Inc. in the United States and/or other countries. Docker, Inc. and other parties may also have trademark rights in other terms used herein. • Other company and product names mentioned in this document may be the trademarks of their respective owners.

×