1 Big Data – Is it a hype or for real? 22.12.2015
Software
Solutions for
Manufacturing
Introduction into Big Data and 1.5 applications
Dr. Dirk Ortloff
Big Data – Is it a hype
or for real?
2 Big Data – Is it a hype or for real? 22.12.2015
 Company overview
 Introduction
 Status of Big Data
 Big Data ≠ Hadoop
 1.5 example applications
 camLine eTrace SaaS
 The research project PRO-OPT
 Conclusions
Agenda
Picture from: https://commons.wikimedia.org/wiki/File:BigData_2267x1146_white.png
3 Big Data – Is it a hype or for real? 22.12.2015
Company overview
4 Big Data – Is it a hype or for real? 22.12.2015
Service Offering Portfolio
5 Big Data – Is it a hype or for real? 22.12.2015
Product areas
6 Big Data – Is it a hype or for real? 22.12.2015
Products for your Manufacturing Excellence
PRODUCTS
LineWorks
•WIP, MaMa,
CarMa, mDICE
•iGate
•Pulse, MaiMa
•SPACE, SQM
•Recipe
Management
•…
Cornerstone
•Explorative
Data Analysis
(EDA)
•Design of
Experiments
(DoE)
•Six Sigma
InFrame
Synapse
•SME’s MES
•Complete
tracking and
tracing
XperiDesk
•Technology
Development
Management
platform
•Virtual
manufacturing
enablement
•Electronic Lab
notebook
(ELN)
Solutions
•Manufacturing
Execution
System (MES)
•Quality
Management
•Equipment
Efficiency (OEE)
7 Big Data – Is it a hype or for real? 22.12.2015
Introduction – Big Data
8 Big Data – Is it a hype or for real? 22.12.2015
The first documented use of the term “big data” appeared in a 1997 paper by
scientists at NASA, describing the problem they had with visualization (i.e.
computer graphics) which “provides an interesting challenge for computer
systems: data sets are generally quite large, taxing the capacities of main memory,
local disk, and even remote disk. We call this the problem of big data. When data
sets do not fit in main memory (in core), or when they do not fit even on local
disk, the most common solution is to acquire more resources.”
Gartner: Big data is high-volume, high-velocity and/or high-variety information
assets that demand cost-effective, innovative forms of information processing that
enable enhanced insight, decision making, and process automation.
What is Big Data?
9 Big Data – Is it a hype or for real? 22.12.2015
What is Big Data?
Picture from: http://www.ibmbigdatahub.com/sites/default/files/infographic_file/4-Vs-of-big-data.jpg
10 Big Data – Is it a hype or for real? 22.12.2015
Data: symbols
Data is raw. It simply exists and has no
significance beyond its existence (in and
of itself). It can exist in any form, usable
or not. It does not have meaning of itself.
In computer parlance, a spreadsheet
generally starts out by holding data.
Information: data that are processed to be useful; provides answers to "who", "what",
"where", and "when" questions. Information is Data that has been given meaning by
way of relational connection.
Knowledge: application of Data and Information; answers "how" questions. Knowledge
is the appropriate collection of information, such that it's intent is to be useful.
Wisdom: is the ability to increase effectiveness. Wisdom adds value, which requires the
mental function that we call judgment. The ethical and aesthetic values that this implies
are inherent to the actor and are unique and personal.
Following the DIKW model: http://www.systems-thinking.org/dikw/dikw.htm
Definitions
11 Big Data – Is it a hype or for real? 22.12.2015
“Fact: 80 percent of the
digitized information in
a typical company is in
the form of unstructured
data such as documents, e-mail,
and images. “1
“Fact: The amount of unstructured
content in a typical business grows
by 50 percent every year. “1
What it boils down to:
1: Oracle: Information Management – Get control of your Information
Picture is property of: www.yakidoo.com
12 Big Data – Is it a hype or for real? 22.12.2015
Status of Big Data
13 Big Data – Is it a hype or for real? 22.12.2015
Is Big Data still a hype to go away?
Picture from: http://na2.www.gartner.com/imagesrv/newsroom/images/hype-cycle-pr.png;wada20fd4bd7891509
14 Big Data – Is it a hype or for real? 22.12.2015
Is Big Data still a hype to go away?
Picture from: http://na1.www.gartner.com/imagesrv/newsroom/images/HC_ET_2014.jpg;wadf79d1c8397a49a2
15 Big Data – Is it a hype or for real? 22.12.2015
Is Big Data still a hype to go away?
Picture from: http://na2.www.gartner.com/imagesrv/newsroom/images/emerging-tech-hc.png;wa0131df2b233dcd17
16 Big Data – Is it a hype or for real? 22.12.2015
Big Data
Almost anybody does something
with Big Data…
Picture from: https://commons.wikimedia.org/wiki/File:Peak_hour_traffic_in_melbourne.jpg
17 Big Data – Is it a hype or for real? 22.12.2015
But everthing is pretty company
specific and mostly isolated
applications
Picture from: https://commons.wikimedia.org/wiki/File:Palau_archipelago.jpg
18 Big Data – Is it a hype or for real? 22.12.2015
Big Data ≠ Hadoop
19 Big Data – Is it a hype or for real? 22.12.2015
Hadoop – the myths
Data Lake
42
20 Big Data – Is it a hype or for real? 22.12.2015
“The Apache™ Hadoop® project develops open-source software for reliable,
scalable, distributed computing.
The Apache Hadoop software library is a framework that allows for the
distributed processing of large data sets across clusters of computers ….”
Hadoop itself consist only out of :
• Hadoop Common
• Hadoop Distributed File System (HDFS™)
• Hadoop YARN
• Hadoop MapReduce
Hadoop – the reality
21 Big Data – Is it a hype or for real? 22.12.2015
Big Data Landscape
22 Big Data – Is it a hype or for real? 22.12.2015
Big Data Landscape – Example Usage
23 Big Data – Is it a hype or for real? 22.12.2015
Paradigm shift – Technically
Picture from: http://zcom.ro/wp-content/uploads/docs/pdf/3.2.-Eric-Dong-Huawei-Big-Data-How-to-Face-the-Data-Wave-in-Cloud-Era.pdf
24 Big Data – Is it a hype or for real? 22.12.2015
In theoretical computer science,
the CAP theorem, also known
as Brewer’s theorem, states that
it is impossible for a distributed
computer system to
simultaneously provide all three
of the following guarantees:
• Consistency (all nodes see the
same data at the same time)
• Availability (a guarantee that
every request receives a
response about whether it
was successful or failed)
• Partition tolerance (the system
continues to operate despite
arbitrary message loss or
failure of part of the system)
Principle differences in data storage approaches
Picture from: http://www.abramsimon.com/
26 Big Data – Is it a hype or for real? 22.12.2015
Important – Keep history and evolvement in mind
Picture from: http://www.digitale-technologien.de/DT/Redaktion/DE/Downloads/Publikation/smart-data-boden-introduction-flink.pdf
27 Big Data – Is it a hype or for real? 22.12.2015
Paradigm shift – Culturally
28 Big Data – Is it a hype or for real? 22.12.2015
What is the future business flow …???
29 Big Data – Is it a hype or for real? 22.12.2015
1.5 solution approaches
camLine eTrace
30 Big Data – Is it a hype or for real? 22.12.2015
Cloud eTrace SaaS Edition
ETrace
eTrace SaaS Edition
• Real time process by each upload wafer test result files
• Real time test result analysis and notification per file
• Unlimited user
• Various Analysis plugins are provided by camLine
Analysis Storage
Real Time
Eval.
Source Parser
Foundries
Test
House
Foundries
31 Big Data – Is it a hype or for real? 22.12.2015
The Challenges
Reduce
integration and
maintenance
effort for tester
raw data
Online and long
term data
storage and
analysis
Reduce cost of
ownership
34 Big Data – Is it a hype or for real? 22.12.2015
Search by Lot and View summary for each parameter online
Analysis – Wafer Map
35 Big Data – Is it a hype or for real? 22.12.2015
Online Wafer Map View for each test result on
given lot
Ease pick list of
tested result set
for given wafer
Drill down
36 Big Data – Is it a hype or for real? 22.12.2015
SPACE Offline Chart Plugin Viewer - DMP
Full Wafer Map
with interpolated
values.
37 Big Data – Is it a hype or for real? 22.12.2015
Reduce cost of
ownership
eTrace Solution Values
Reduce
integration and
maintenance
effort for tester
raw data
Online and long
term data
storage and
analysis
Saving in effort in integration and
test programing by engineers
Potential of 30% reduction
for data storage capacity
Reduce customer
resources to maintain
the system
38 Big Data – Is it a hype or for real? 22.12.2015
1.5 solution approaches
Overview research project PRO_OPT
39 Big Data – Is it a hype or for real? 22.12.2015
The PRO-OPT Research Project
Project start 1.1.2015,
3 years of funding,
www.pro-opt.org
40 Big Data – Is it a hype or for real? 22.12.2015
The PRO-OPT Mission and Research Group
Evaluation Partner and Data Supplier
for Automotive Diagnostics
Technology and Evaluation Partner for
Production Systems
Research Partner for Data Mining and
Integration of System Components
Project Lead, Technology and
Evaluation Partner for Automotive
Diagnostics
Technical Project Lead,
Research Partner for
Access Restriction, SW Architecture,
Simulation, Data Generation
Multiplicator
Data Supplier and
Evaluation Partner
PRO-OPT aims at identifying valuable data and making it available for
creating additional benefit for all members of a Smart Ecosystem.
Associated Partners
41 Big Data – Is it a hype or for real? 22.12.2015
Product, Delivery Notes, …
Motivation
Module Supplier OEM Production OEM After-Sales
Electronic
Component Supplier
OEM Engineering
Design and Engineering Data, Configuration Data, …
Is there more
information available
here?
Is there even more
Information
we ignore?
Why is there so little
feedback here?
42 Big Data – Is it a hype or for real? 22.12.2015
Is this how we Look at the Diagnostic Data of a
Car?
3rd-Party After-Sales,
Automobile
Associations
43 Big Data – Is it a hype or for real? 22.12.2015
Shouldn‘t it be like this?
44 Big Data – Is it a hype or for real? 22.12.2015
The PRO-OPT Mission
• Profitable exchange of data for all involved parties
• Protection of „private“ data and intellectual
property
New
Business Models
• Identify and describe available data sources
• Consider data quality and usage restriction
Modeling
• Platform for integrative big data applications
• Data analysis and visualization methods and tools
• Support data ownership and usage restriction
Platform and Methods
for Collaborative
Analytics
45 Big Data – Is it a hype or for real? 22.12.2015
Benefits
Transparency improves overall
product quality
Lower warranty costs
among the supply chain
Better reputation wrt. quality
Getting ready for
growing product complexity
47 Big Data – Is it a hype or for real? 22.12.2015
Big Data Landscape – Technology@PRO-OPT
48 Big Data – Is it a hype or for real? 22.12.2015
camLine – Use Case Overview
camLine
Conti
camLine
SDC
WIP
EvaProd
CarMa
MaMa
…
…
camLine mDice camLine
Cornerston
e
Trace KPI
camLine
SPACE
Data Storage
Data ProcessingData Ingestion
Cluster
Zookeeper
RDBMS
Bulk
Uploader
(TBD)
store read
store
store readstore read
Spark
Streamin
g
MLLib GraphX SparkR
Blink
DB
Spark
SQL
Cluster
ETL
UC9
UC1
UC2+3UC4
UC5
UC5+6UC5+6UC5+7UC8
49 Big Data – Is it a hype or for real? 22.12.2015
Conclusions
50 Big Data – Is it a hype or for real? 22.12.2015
• Big Data is here to stay – Gartner recons it is not hype anymore it is in the
market
• Big data ≠ Hadoop
• Big data is not a technical – it is also a cultural change
• New "Ideology": Collect first, Collect everything, think later what to do with it...
storage is cheap, lost data is expensive
• Example applications already exist – there is a longer way to the data lake
Conculsions
51 Big Data – Is it a hype or for real? 22.12.2015
Thank you for your attention

Big Data – Is it a hype or for real?

  • 1.
    1 Big Data– Is it a hype or for real? 22.12.2015 Software Solutions for Manufacturing Introduction into Big Data and 1.5 applications Dr. Dirk Ortloff Big Data – Is it a hype or for real?
  • 2.
    2 Big Data– Is it a hype or for real? 22.12.2015  Company overview  Introduction  Status of Big Data  Big Data ≠ Hadoop  1.5 example applications  camLine eTrace SaaS  The research project PRO-OPT  Conclusions Agenda Picture from: https://commons.wikimedia.org/wiki/File:BigData_2267x1146_white.png
  • 3.
    3 Big Data– Is it a hype or for real? 22.12.2015 Company overview
  • 4.
    4 Big Data– Is it a hype or for real? 22.12.2015 Service Offering Portfolio
  • 5.
    5 Big Data– Is it a hype or for real? 22.12.2015 Product areas
  • 6.
    6 Big Data– Is it a hype or for real? 22.12.2015 Products for your Manufacturing Excellence PRODUCTS LineWorks •WIP, MaMa, CarMa, mDICE •iGate •Pulse, MaiMa •SPACE, SQM •Recipe Management •… Cornerstone •Explorative Data Analysis (EDA) •Design of Experiments (DoE) •Six Sigma InFrame Synapse •SME’s MES •Complete tracking and tracing XperiDesk •Technology Development Management platform •Virtual manufacturing enablement •Electronic Lab notebook (ELN) Solutions •Manufacturing Execution System (MES) •Quality Management •Equipment Efficiency (OEE)
  • 7.
    7 Big Data– Is it a hype or for real? 22.12.2015 Introduction – Big Data
  • 8.
    8 Big Data– Is it a hype or for real? 22.12.2015 The first documented use of the term “big data” appeared in a 1997 paper by scientists at NASA, describing the problem they had with visualization (i.e. computer graphics) which “provides an interesting challenge for computer systems: data sets are generally quite large, taxing the capacities of main memory, local disk, and even remote disk. We call this the problem of big data. When data sets do not fit in main memory (in core), or when they do not fit even on local disk, the most common solution is to acquire more resources.” Gartner: Big data is high-volume, high-velocity and/or high-variety information assets that demand cost-effective, innovative forms of information processing that enable enhanced insight, decision making, and process automation. What is Big Data?
  • 9.
    9 Big Data– Is it a hype or for real? 22.12.2015 What is Big Data? Picture from: http://www.ibmbigdatahub.com/sites/default/files/infographic_file/4-Vs-of-big-data.jpg
  • 10.
    10 Big Data– Is it a hype or for real? 22.12.2015 Data: symbols Data is raw. It simply exists and has no significance beyond its existence (in and of itself). It can exist in any form, usable or not. It does not have meaning of itself. In computer parlance, a spreadsheet generally starts out by holding data. Information: data that are processed to be useful; provides answers to "who", "what", "where", and "when" questions. Information is Data that has been given meaning by way of relational connection. Knowledge: application of Data and Information; answers "how" questions. Knowledge is the appropriate collection of information, such that it's intent is to be useful. Wisdom: is the ability to increase effectiveness. Wisdom adds value, which requires the mental function that we call judgment. The ethical and aesthetic values that this implies are inherent to the actor and are unique and personal. Following the DIKW model: http://www.systems-thinking.org/dikw/dikw.htm Definitions
  • 11.
    11 Big Data– Is it a hype or for real? 22.12.2015 “Fact: 80 percent of the digitized information in a typical company is in the form of unstructured data such as documents, e-mail, and images. “1 “Fact: The amount of unstructured content in a typical business grows by 50 percent every year. “1 What it boils down to: 1: Oracle: Information Management – Get control of your Information Picture is property of: www.yakidoo.com
  • 12.
    12 Big Data– Is it a hype or for real? 22.12.2015 Status of Big Data
  • 13.
    13 Big Data– Is it a hype or for real? 22.12.2015 Is Big Data still a hype to go away? Picture from: http://na2.www.gartner.com/imagesrv/newsroom/images/hype-cycle-pr.png;wada20fd4bd7891509
  • 14.
    14 Big Data– Is it a hype or for real? 22.12.2015 Is Big Data still a hype to go away? Picture from: http://na1.www.gartner.com/imagesrv/newsroom/images/HC_ET_2014.jpg;wadf79d1c8397a49a2
  • 15.
    15 Big Data– Is it a hype or for real? 22.12.2015 Is Big Data still a hype to go away? Picture from: http://na2.www.gartner.com/imagesrv/newsroom/images/emerging-tech-hc.png;wa0131df2b233dcd17
  • 16.
    16 Big Data– Is it a hype or for real? 22.12.2015 Big Data Almost anybody does something with Big Data… Picture from: https://commons.wikimedia.org/wiki/File:Peak_hour_traffic_in_melbourne.jpg
  • 17.
    17 Big Data– Is it a hype or for real? 22.12.2015 But everthing is pretty company specific and mostly isolated applications Picture from: https://commons.wikimedia.org/wiki/File:Palau_archipelago.jpg
  • 18.
    18 Big Data– Is it a hype or for real? 22.12.2015 Big Data ≠ Hadoop
  • 19.
    19 Big Data– Is it a hype or for real? 22.12.2015 Hadoop – the myths Data Lake 42
  • 20.
    20 Big Data– Is it a hype or for real? 22.12.2015 “The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers ….” Hadoop itself consist only out of : • Hadoop Common • Hadoop Distributed File System (HDFS™) • Hadoop YARN • Hadoop MapReduce Hadoop – the reality
  • 21.
    21 Big Data– Is it a hype or for real? 22.12.2015 Big Data Landscape
  • 22.
    22 Big Data– Is it a hype or for real? 22.12.2015 Big Data Landscape – Example Usage
  • 23.
    23 Big Data– Is it a hype or for real? 22.12.2015 Paradigm shift – Technically Picture from: http://zcom.ro/wp-content/uploads/docs/pdf/3.2.-Eric-Dong-Huawei-Big-Data-How-to-Face-the-Data-Wave-in-Cloud-Era.pdf
  • 24.
    24 Big Data– Is it a hype or for real? 22.12.2015 In theoretical computer science, the CAP theorem, also known as Brewer’s theorem, states that it is impossible for a distributed computer system to simultaneously provide all three of the following guarantees: • Consistency (all nodes see the same data at the same time) • Availability (a guarantee that every request receives a response about whether it was successful or failed) • Partition tolerance (the system continues to operate despite arbitrary message loss or failure of part of the system) Principle differences in data storage approaches Picture from: http://www.abramsimon.com/
  • 25.
    26 Big Data– Is it a hype or for real? 22.12.2015 Important – Keep history and evolvement in mind Picture from: http://www.digitale-technologien.de/DT/Redaktion/DE/Downloads/Publikation/smart-data-boden-introduction-flink.pdf
  • 26.
    27 Big Data– Is it a hype or for real? 22.12.2015 Paradigm shift – Culturally
  • 27.
    28 Big Data– Is it a hype or for real? 22.12.2015 What is the future business flow …???
  • 28.
    29 Big Data– Is it a hype or for real? 22.12.2015 1.5 solution approaches camLine eTrace
  • 29.
    30 Big Data– Is it a hype or for real? 22.12.2015 Cloud eTrace SaaS Edition ETrace eTrace SaaS Edition • Real time process by each upload wafer test result files • Real time test result analysis and notification per file • Unlimited user • Various Analysis plugins are provided by camLine Analysis Storage Real Time Eval. Source Parser Foundries Test House Foundries
  • 30.
    31 Big Data– Is it a hype or for real? 22.12.2015 The Challenges Reduce integration and maintenance effort for tester raw data Online and long term data storage and analysis Reduce cost of ownership
  • 31.
    34 Big Data– Is it a hype or for real? 22.12.2015 Search by Lot and View summary for each parameter online Analysis – Wafer Map
  • 32.
    35 Big Data– Is it a hype or for real? 22.12.2015 Online Wafer Map View for each test result on given lot Ease pick list of tested result set for given wafer Drill down
  • 33.
    36 Big Data– Is it a hype or for real? 22.12.2015 SPACE Offline Chart Plugin Viewer - DMP Full Wafer Map with interpolated values.
  • 34.
    37 Big Data– Is it a hype or for real? 22.12.2015 Reduce cost of ownership eTrace Solution Values Reduce integration and maintenance effort for tester raw data Online and long term data storage and analysis Saving in effort in integration and test programing by engineers Potential of 30% reduction for data storage capacity Reduce customer resources to maintain the system
  • 35.
    38 Big Data– Is it a hype or for real? 22.12.2015 1.5 solution approaches Overview research project PRO_OPT
  • 36.
    39 Big Data– Is it a hype or for real? 22.12.2015 The PRO-OPT Research Project Project start 1.1.2015, 3 years of funding, www.pro-opt.org
  • 37.
    40 Big Data– Is it a hype or for real? 22.12.2015 The PRO-OPT Mission and Research Group Evaluation Partner and Data Supplier for Automotive Diagnostics Technology and Evaluation Partner for Production Systems Research Partner for Data Mining and Integration of System Components Project Lead, Technology and Evaluation Partner for Automotive Diagnostics Technical Project Lead, Research Partner for Access Restriction, SW Architecture, Simulation, Data Generation Multiplicator Data Supplier and Evaluation Partner PRO-OPT aims at identifying valuable data and making it available for creating additional benefit for all members of a Smart Ecosystem. Associated Partners
  • 38.
    41 Big Data– Is it a hype or for real? 22.12.2015 Product, Delivery Notes, … Motivation Module Supplier OEM Production OEM After-Sales Electronic Component Supplier OEM Engineering Design and Engineering Data, Configuration Data, … Is there more information available here? Is there even more Information we ignore? Why is there so little feedback here?
  • 39.
    42 Big Data– Is it a hype or for real? 22.12.2015 Is this how we Look at the Diagnostic Data of a Car? 3rd-Party After-Sales, Automobile Associations
  • 40.
    43 Big Data– Is it a hype or for real? 22.12.2015 Shouldn‘t it be like this?
  • 41.
    44 Big Data– Is it a hype or for real? 22.12.2015 The PRO-OPT Mission • Profitable exchange of data for all involved parties • Protection of „private“ data and intellectual property New Business Models • Identify and describe available data sources • Consider data quality and usage restriction Modeling • Platform for integrative big data applications • Data analysis and visualization methods and tools • Support data ownership and usage restriction Platform and Methods for Collaborative Analytics
  • 42.
    45 Big Data– Is it a hype or for real? 22.12.2015 Benefits Transparency improves overall product quality Lower warranty costs among the supply chain Better reputation wrt. quality Getting ready for growing product complexity
  • 43.
    47 Big Data– Is it a hype or for real? 22.12.2015 Big Data Landscape – Technology@PRO-OPT
  • 44.
    48 Big Data– Is it a hype or for real? 22.12.2015 camLine – Use Case Overview camLine Conti camLine SDC WIP EvaProd CarMa MaMa … … camLine mDice camLine Cornerston e Trace KPI camLine SPACE Data Storage Data ProcessingData Ingestion Cluster Zookeeper RDBMS Bulk Uploader (TBD) store read store store readstore read Spark Streamin g MLLib GraphX SparkR Blink DB Spark SQL Cluster ETL UC9 UC1 UC2+3UC4 UC5 UC5+6UC5+6UC5+7UC8
  • 45.
    49 Big Data– Is it a hype or for real? 22.12.2015 Conclusions
  • 46.
    50 Big Data– Is it a hype or for real? 22.12.2015 • Big Data is here to stay – Gartner recons it is not hype anymore it is in the market • Big data ≠ Hadoop • Big data is not a technical – it is also a cultural change • New "Ideology": Collect first, Collect everything, think later what to do with it... storage is cheap, lost data is expensive • Example applications already exist – there is a longer way to the data lake Conculsions
  • 47.
    51 Big Data– Is it a hype or for real? 22.12.2015 Thank you for your attention