SlideShare a Scribd company logo
the REAL face of big data
myths and facts
Data is never
clean!
2
Most of time will be spending on data
cleansing and preparing data for analytics.
80% of a typical data science project is
sourcing cleaning and preparing the data,
while the remaining 20% is actual data
analysis.
“It’s an absolute myth that you can send an
algorithm over raw data and have insights
pop up.”
AWS has a great
support services.
Amazon will not be transparent about the
underlying infrastructure, but also will not give
diagrams, machine details, etc., so it is difficult
to get performance and measurement reports.
Even though some EC2 managed services are
available through third-parties, only paid
support is available for “the most common
third-party software running on AWS,”
excluding managed support services.
2
No one cares
how you did it
4
Most technical presentations in industry are 1)
far too long and 2) focus on the portions that
don’t matter (the details of the method applied)
to hide the parts that do (what does that mean
for this prospect).
However, I think a more constructive way to
phrase this is “know your audience”. So makes
awesome presentations. Sounds strange, but
usually is most important than the algorithm
itself.
Data Lakes Will
Replace The Data
Warehouse
5
It's "misleading" for vendors to position data
lakes as replacements for data warehouses.
A data lake's foundational technologies lack the
maturity and breadth of the features found in
established data warehouse technologies.
Many organizations get stuck at the pilot stage
because they don't tie the technology to
business processes or concrete use cases.
Big data will give a
B&W concrete
answer
6
The real analytics is combining, weighting and
judging multiple sources of information. The
more data you have, the more analysis you
must run.
Big data has a strong capacity to brig other
marketing questions, and other insights and
makes leaders think better.
Machine-learning
will be the key!
7
In 90% of cases generalized linear regression
will do the trick.
The most basic and commonly used predictive
analysis will do the job.
Regression estimates are used to describe data
and to explain the relationship between one
dependent variable and one or more
independent variables
Get data scientists
senior team
8
Doctorate in math, a background in computer
science, and what amounts to an MBA, not to
mention actual work experience in all of those
fields. "How old is this person, 90?
Almost impossible find this data-scientist
unicorn, it had to create a working group with a
cross-section of expertise. This is in fact what
you have to do.
95% of tasks do
not require deep
learning.
9
It’s a hands-on job.
Super-intelligent artificial automated algorithm
that will solve all problem with magic-touch
does not exists.
This role requires a lot of dirty data model ing,
coding, patience and focus.

More Related Content

What's hot

How to program your way into data science?
How to program your way into data science?How to program your way into data science?
How to program your way into data science?
DeZyre
 
Best practices in building machine learning models in Azure ML
Best practices in building machine learning models in Azure MLBest practices in building machine learning models in Azure ML
Best practices in building machine learning models in Azure ML
Zeydy Ortiz, Ph. D.
 
Simplify your analytics strategy- Palash badjatya
Simplify your analytics strategy- Palash badjatyaSimplify your analytics strategy- Palash badjatya
Simplify your analytics strategy- Palash badjatya
Acropolis Technical Campus
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
IMC Institute
 
Big data analytics and large-scale computers
Big data analytics and large-scale computersBig data analytics and large-scale computers
Big data analytics and large-scale computers
ShubhamKhurana20
 
The Other 99% of a Data Science Project
The Other 99% of a Data Science ProjectThe Other 99% of a Data Science Project
The Other 99% of a Data Science Project
Eugene Mandel
 
KEY CHALLENGES FOR MONETIZING BIG DATA POWERED AI: AN OVERVIEW
KEY CHALLENGES FOR MONETIZING BIG DATA POWERED AI: AN OVERVIEWKEY CHALLENGES FOR MONETIZING BIG DATA POWERED AI: AN OVERVIEW
KEY CHALLENGES FOR MONETIZING BIG DATA POWERED AI: AN OVERVIEW
Tyrone Systems
 
Collaborative design for data driven projects
Collaborative design for data driven projectsCollaborative design for data driven projects
Collaborative design for data driven projects
Carlos Gerardo Prieto Alvarez
 
Gary Hope - Machine Learning: It's Not as Hard as you Think
Gary Hope - Machine Learning: It's Not as Hard as you ThinkGary Hope - Machine Learning: It's Not as Hard as you Think
Gary Hope - Machine Learning: It's Not as Hard as you Think
Saratoga
 
Challenges in business analytics
Challenges in business analyticsChallenges in business analytics
Challenges in business analytics
Miklos Koren
 
Things you need to know about big data
Things you need to know about big dataThings you need to know about big data
Things you need to know about big data
Lantern Institute
 
TCS Point of View Session - Analyze by Dr. Gautam Shroff, VP and Chief Scient...
TCS Point of View Session - Analyze by Dr. Gautam Shroff, VP and Chief Scient...TCS Point of View Session - Analyze by Dr. Gautam Shroff, VP and Chief Scient...
TCS Point of View Session - Analyze by Dr. Gautam Shroff, VP and Chief Scient...
Tata Consultancy Services
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
LivePerson
 
Banking infographic v7
Banking infographic v7Banking infographic v7
Banking infographic v7
Rani Goel
 
Predictive analysis
Predictive analysisPredictive analysis
Predictive analysis
Vedprakash Srivastava
 
Predictive Analytics - Big Data & Artificial Intelligence
Predictive Analytics - Big Data & Artificial IntelligencePredictive Analytics - Big Data & Artificial Intelligence
Predictive Analytics - Big Data & Artificial Intelligence
Manish Jain
 
Ai presentatie
Ai presentatieAi presentatie
Ai presentatie
LunaDuFour
 
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Edureka!
 
Machine Learning in Big Data
Machine Learning in Big DataMachine Learning in Big Data
Machine Learning in Big Data
DataWorks Summit/Hadoop Summit
 
Data dont have to be boring 11ntcdata
Data dont have to be boring  11ntcdataData dont have to be boring  11ntcdata
Data dont have to be boring 11ntcdata
kidsdata
 

What's hot (20)

How to program your way into data science?
How to program your way into data science?How to program your way into data science?
How to program your way into data science?
 
Best practices in building machine learning models in Azure ML
Best practices in building machine learning models in Azure MLBest practices in building machine learning models in Azure ML
Best practices in building machine learning models in Azure ML
 
Simplify your analytics strategy- Palash badjatya
Simplify your analytics strategy- Palash badjatyaSimplify your analytics strategy- Palash badjatya
Simplify your analytics strategy- Palash badjatya
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big data analytics and large-scale computers
Big data analytics and large-scale computersBig data analytics and large-scale computers
Big data analytics and large-scale computers
 
The Other 99% of a Data Science Project
The Other 99% of a Data Science ProjectThe Other 99% of a Data Science Project
The Other 99% of a Data Science Project
 
KEY CHALLENGES FOR MONETIZING BIG DATA POWERED AI: AN OVERVIEW
KEY CHALLENGES FOR MONETIZING BIG DATA POWERED AI: AN OVERVIEWKEY CHALLENGES FOR MONETIZING BIG DATA POWERED AI: AN OVERVIEW
KEY CHALLENGES FOR MONETIZING BIG DATA POWERED AI: AN OVERVIEW
 
Collaborative design for data driven projects
Collaborative design for data driven projectsCollaborative design for data driven projects
Collaborative design for data driven projects
 
Gary Hope - Machine Learning: It's Not as Hard as you Think
Gary Hope - Machine Learning: It's Not as Hard as you ThinkGary Hope - Machine Learning: It's Not as Hard as you Think
Gary Hope - Machine Learning: It's Not as Hard as you Think
 
Challenges in business analytics
Challenges in business analyticsChallenges in business analytics
Challenges in business analytics
 
Things you need to know about big data
Things you need to know about big dataThings you need to know about big data
Things you need to know about big data
 
TCS Point of View Session - Analyze by Dr. Gautam Shroff, VP and Chief Scient...
TCS Point of View Session - Analyze by Dr. Gautam Shroff, VP and Chief Scient...TCS Point of View Session - Analyze by Dr. Gautam Shroff, VP and Chief Scient...
TCS Point of View Session - Analyze by Dr. Gautam Shroff, VP and Chief Scient...
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Banking infographic v7
Banking infographic v7Banking infographic v7
Banking infographic v7
 
Predictive analysis
Predictive analysisPredictive analysis
Predictive analysis
 
Predictive Analytics - Big Data & Artificial Intelligence
Predictive Analytics - Big Data & Artificial IntelligencePredictive Analytics - Big Data & Artificial Intelligence
Predictive Analytics - Big Data & Artificial Intelligence
 
Ai presentatie
Ai presentatieAi presentatie
Ai presentatie
 
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
 
Machine Learning in Big Data
Machine Learning in Big DataMachine Learning in Big Data
Machine Learning in Big Data
 
Data dont have to be boring 11ntcdata
Data dont have to be boring  11ntcdataData dont have to be boring  11ntcdata
Data dont have to be boring 11ntcdata
 

Viewers also liked

Herramientas Digitales
Herramientas DigitalesHerramientas Digitales
Herramientas Digitales
Alejandra Chacón
 
Studying languages at_dal_2008
Studying languages at_dal_2008Studying languages at_dal_2008
Studying languages at_dal_2008sdspasova
 
Цифровая взрывозащищённая аппаратура контроля вибрации «ЦВА»
Цифровая взрывозащищённая аппаратура контроля вибрации «ЦВА»Цифровая взрывозащищённая аппаратура контроля вибрации «ЦВА»
Цифровая взрывозащищённая аппаратура контроля вибрации «ЦВА»
ООО "Прософт-Системы"
 
-Trebuchet-Vinoos by AMS checked
-Trebuchet-Vinoos by AMS checked-Trebuchet-Vinoos by AMS checked
-Trebuchet-Vinoos by AMS checkedMireille Reuling
 
перевернув страницы старых газет
перевернув страницы старых газетперевернув страницы старых газет
перевернув страницы старых газет
foryana
 
例外處理與單元測試
例外處理與單元測試例外處理與單元測試
例外處理與單元測試
國昭 張
 
The State
The StateThe State
Los años para los masones
Los años para los masonesLos años para los masones
Los años para los masones
Hijosdelaviuda
 
AP Psych Photo Contest
AP Psych Photo Contest AP Psych Photo Contest
AP Psych Photo Contest
kbolinsky
 
Psychology Snowmen
Psychology Snowmen Psychology Snowmen
Psychology Snowmen
kbolinsky
 
2016年逢甲大學資訊系:ASP.NET MVC 4 教育訓練2
2016年逢甲大學資訊系:ASP.NET MVC 4 教育訓練22016年逢甲大學資訊系:ASP.NET MVC 4 教育訓練2
2016年逢甲大學資訊系:ASP.NET MVC 4 教育訓練2
Duran Hsieh
 
Aix en Provence
Aix en Provence Aix en Provence
Aix en Provence
Jean-Philippe Alfonsi
 
RDBMS to NoSQL. An overview.
RDBMS to NoSQL. An overview.RDBMS to NoSQL. An overview.
RDBMS to NoSQL. An overview.
Girish. N. Raghavan
 
Plan Operativo Anual
Plan Operativo AnualPlan Operativo Anual
Plan Operativo Anual
LuisMariaOlaso
 

Viewers also liked (19)

Herramientas Digitales
Herramientas DigitalesHerramientas Digitales
Herramientas Digitales
 
Studying languages at_dal_2008
Studying languages at_dal_2008Studying languages at_dal_2008
Studying languages at_dal_2008
 
Цифровая взрывозащищённая аппаратура контроля вибрации «ЦВА»
Цифровая взрывозащищённая аппаратура контроля вибрации «ЦВА»Цифровая взрывозащищённая аппаратура контроля вибрации «ЦВА»
Цифровая взрывозащищённая аппаратура контроля вибрации «ЦВА»
 
-Trebuchet-Vinoos by AMS checked
-Trebuchet-Vinoos by AMS checked-Trebuchet-Vinoos by AMS checked
-Trebuchet-Vinoos by AMS checked
 
перевернув страницы старых газет
перевернув страницы старых газетперевернув страницы старых газет
перевернув страницы старых газет
 
Min vida
Min vidaMin vida
Min vida
 
Blog
BlogBlog
Blog
 
I forum aleksey_pilipchuk
I forum aleksey_pilipchukI forum aleksey_pilipchuk
I forum aleksey_pilipchuk
 
例外處理與單元測試
例外處理與單元測試例外處理與單元測試
例外處理與單元測試
 
The State
The StateThe State
The State
 
TechComp Capability Statement
TechComp Capability StatementTechComp Capability Statement
TechComp Capability Statement
 
Los años para los masones
Los años para los masonesLos años para los masones
Los años para los masones
 
AP Psych Photo Contest
AP Psych Photo Contest AP Psych Photo Contest
AP Psych Photo Contest
 
Ms. Guyer's 2nd Period Terrorism/9-11 Group
Ms. Guyer's 2nd Period Terrorism/9-11 GroupMs. Guyer's 2nd Period Terrorism/9-11 Group
Ms. Guyer's 2nd Period Terrorism/9-11 Group
 
Psychology Snowmen
Psychology Snowmen Psychology Snowmen
Psychology Snowmen
 
2016年逢甲大學資訊系:ASP.NET MVC 4 教育訓練2
2016年逢甲大學資訊系:ASP.NET MVC 4 教育訓練22016年逢甲大學資訊系:ASP.NET MVC 4 教育訓練2
2016年逢甲大學資訊系:ASP.NET MVC 4 教育訓練2
 
Aix en Provence
Aix en Provence Aix en Provence
Aix en Provence
 
RDBMS to NoSQL. An overview.
RDBMS to NoSQL. An overview.RDBMS to NoSQL. An overview.
RDBMS to NoSQL. An overview.
 
Plan Operativo Anual
Plan Operativo AnualPlan Operativo Anual
Plan Operativo Anual
 

Similar to The REAL face of Big Data

Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
mark madsen
 
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
Edgar Alejandro Villegas
 
Top ten data and analysis technology trends in 2021
Top ten data and analysis technology trends in 2021Top ten data and analysis technology trends in 2021
Top ten data and analysis technology trends in 2021
Ruchi Jain
 
Big Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraBig Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraVin Malhotra
 
Putting data science in your business a first utility feedback
Putting data science in your business a first utility feedbackPutting data science in your business a first utility feedback
Putting data science in your business a first utility feedback
Peculium Crypto
 
Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018
mark madsen
 
Odgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White PaperOdgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White Paper
Robertson Executive Search
 
From Volume to Value - A Guide to Data Engineering
From Volume to Value - A Guide to Data EngineeringFrom Volume to Value - A Guide to Data Engineering
From Volume to Value - A Guide to Data Engineering
Ry Walker
 
Cloud Analytics Playbook
Cloud Analytics PlaybookCloud Analytics Playbook
Cloud Analytics Playbook
Booz Allen Hamilton
 
Big datarevealed hadoop catalog
Big datarevealed hadoop catalogBig datarevealed hadoop catalog
Big datarevealed hadoop catalog
Steven Meister
 
Top 10 areas of expertise in data science
Top 10 areas of expertise in data scienceTop 10 areas of expertise in data science
Top 10 areas of expertise in data science
GlobalTechCouncil
 
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...
Dana Gardner
 
Big Data : a 360° Overview
Big Data : a 360° Overview Big Data : a 360° Overview
Big Data : a 360° Overview
Juvénal CHOKOGOUE
 
Challenges Of A Junior Data Scientist_ Best Tips To Help You Along The Way.pdf
Challenges Of A Junior Data Scientist_ Best Tips To Help You Along The Way.pdfChallenges Of A Junior Data Scientist_ Best Tips To Help You Along The Way.pdf
Challenges Of A Junior Data Scientist_ Best Tips To Help You Along The Way.pdf
venkatakeerthi3
 
Who is a data scientist
Who is a data scientist  Who is a data scientist
Who is a data scientist
prateek kumar
 
SDD2017 - 03 Abed Ajraou - putting data science in your business a first uti...
SDD2017 - 03 Abed Ajraou  - putting data science in your business a first uti...SDD2017 - 03 Abed Ajraou  - putting data science in your business a first uti...
SDD2017 - 03 Abed Ajraou - putting data science in your business a first uti...
Dario Mangano
 
Top reasons why big data projects are still a failure
Top reasons why big data projects are still a failureTop reasons why big data projects are still a failure
Top reasons why big data projects are still a failure
Arun Kapoor
 
Expert Big Data Tips
Expert Big Data TipsExpert Big Data Tips
Expert Big Data Tips
Qubole
 
Data Science for Finance Interview.
Data Science for Finance Interview. Data Science for Finance Interview.
Data Science for Finance Interview.
James LoBuono, CAPM, ITILv4
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
Vipul Kalamkar
 

Similar to The REAL face of Big Data (20)

Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
 
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
 
Top ten data and analysis technology trends in 2021
Top ten data and analysis technology trends in 2021Top ten data and analysis technology trends in 2021
Top ten data and analysis technology trends in 2021
 
Big Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraBig Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin Malhotra
 
Putting data science in your business a first utility feedback
Putting data science in your business a first utility feedbackPutting data science in your business a first utility feedback
Putting data science in your business a first utility feedback
 
Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018
 
Odgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White PaperOdgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White Paper
 
From Volume to Value - A Guide to Data Engineering
From Volume to Value - A Guide to Data EngineeringFrom Volume to Value - A Guide to Data Engineering
From Volume to Value - A Guide to Data Engineering
 
Cloud Analytics Playbook
Cloud Analytics PlaybookCloud Analytics Playbook
Cloud Analytics Playbook
 
Big datarevealed hadoop catalog
Big datarevealed hadoop catalogBig datarevealed hadoop catalog
Big datarevealed hadoop catalog
 
Top 10 areas of expertise in data science
Top 10 areas of expertise in data scienceTop 10 areas of expertise in data science
Top 10 areas of expertise in data science
 
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...
Mission Critical Use Cases Show How Analytics Architectures Usher in an Artif...
 
Big Data : a 360° Overview
Big Data : a 360° Overview Big Data : a 360° Overview
Big Data : a 360° Overview
 
Challenges Of A Junior Data Scientist_ Best Tips To Help You Along The Way.pdf
Challenges Of A Junior Data Scientist_ Best Tips To Help You Along The Way.pdfChallenges Of A Junior Data Scientist_ Best Tips To Help You Along The Way.pdf
Challenges Of A Junior Data Scientist_ Best Tips To Help You Along The Way.pdf
 
Who is a data scientist
Who is a data scientist  Who is a data scientist
Who is a data scientist
 
SDD2017 - 03 Abed Ajraou - putting data science in your business a first uti...
SDD2017 - 03 Abed Ajraou  - putting data science in your business a first uti...SDD2017 - 03 Abed Ajraou  - putting data science in your business a first uti...
SDD2017 - 03 Abed Ajraou - putting data science in your business a first uti...
 
Top reasons why big data projects are still a failure
Top reasons why big data projects are still a failureTop reasons why big data projects are still a failure
Top reasons why big data projects are still a failure
 
Expert Big Data Tips
Expert Big Data TipsExpert Big Data Tips
Expert Big Data Tips
 
Data Science for Finance Interview.
Data Science for Finance Interview. Data Science for Finance Interview.
Data Science for Finance Interview.
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
 

More from Douglas Bernardini

Top reasons to choose SAP hana
Top reasons to choose SAP hanaTop reasons to choose SAP hana
Top reasons to choose SAP hana
Douglas Bernardini
 
How can Hadoop & SAP be integrated
How can Hadoop & SAP be integratedHow can Hadoop & SAP be integrated
How can Hadoop & SAP be integrated
Douglas Bernardini
 
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapRHadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
Douglas Bernardini
 
SAP HORTONWORKS
SAP HORTONWORKSSAP HORTONWORKS
SAP HORTONWORKS
Douglas Bernardini
 
R-language
R-languageR-language
R-language
Douglas Bernardini
 
REDSHIFT - Amazon
REDSHIFT - AmazonREDSHIFT - Amazon
REDSHIFT - Amazon
Douglas Bernardini
 
Splunk
SplunkSplunk
Finance month closing with HANA
Finance month closing with HANAFinance month closing with HANA
Finance month closing with HANA
Douglas Bernardini
 
RDBMS x NoSQL
RDBMS x NoSQLRDBMS x NoSQL
RDBMS x NoSQL
Douglas Bernardini
 
SAP - SOLUTION MANAGER
SAP - SOLUTION MANAGER SAP - SOLUTION MANAGER
SAP - SOLUTION MANAGER
Douglas Bernardini
 
MS-SQL SERVER ARCHITECTURE
MS-SQL SERVER ARCHITECTUREMS-SQL SERVER ARCHITECTURE
MS-SQL SERVER ARCHITECTURE
Douglas Bernardini
 
DBA oracle
DBA oracleDBA oracle
DBA oracle
Douglas Bernardini
 
Hortonworks.Cluster Config Guide
Hortonworks.Cluster Config GuideHortonworks.Cluster Config Guide
Hortonworks.Cluster Config Guide
Douglas Bernardini
 
SAP Business Objects - Lopes Supermarket
SAP   Business Objects - Lopes SupermarketSAP   Business Objects - Lopes Supermarket
SAP Business Objects - Lopes Supermarket
Douglas Bernardini
 
SAP - Business Objects - Ri happy
SAP - Business Objects - Ri happySAP - Business Objects - Ri happy
SAP - Business Objects - Ri happy
Douglas Bernardini
 
Hadoop on retail
Hadoop on retailHadoop on retail
Hadoop on retail
Douglas Bernardini
 
Retail: Big data e Omni-Channel
Retail: Big data e Omni-ChannelRetail: Big data e Omni-Channel
Retail: Big data e Omni-Channel
Douglas Bernardini
 
Granular Access Control Using Cell Level Security In Accumulo
Granular Access Control  Using Cell Level Security  In Accumulo             Granular Access Control  Using Cell Level Security  In Accumulo
Granular Access Control Using Cell Level Security In Accumulo
Douglas Bernardini
 
Proposta aderencia drogaria onofre
Proposta aderencia   drogaria onofreProposta aderencia   drogaria onofre
Proposta aderencia drogaria onofreDouglas Bernardini
 

More from Douglas Bernardini (20)

Top reasons to choose SAP hana
Top reasons to choose SAP hanaTop reasons to choose SAP hana
Top reasons to choose SAP hana
 
How can Hadoop & SAP be integrated
How can Hadoop & SAP be integratedHow can Hadoop & SAP be integrated
How can Hadoop & SAP be integrated
 
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapRHadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
 
SAP HORTONWORKS
SAP HORTONWORKSSAP HORTONWORKS
SAP HORTONWORKS
 
R-language
R-languageR-language
R-language
 
REDSHIFT - Amazon
REDSHIFT - AmazonREDSHIFT - Amazon
REDSHIFT - Amazon
 
Splunk
SplunkSplunk
Splunk
 
Finance month closing with HANA
Finance month closing with HANAFinance month closing with HANA
Finance month closing with HANA
 
RDBMS x NoSQL
RDBMS x NoSQLRDBMS x NoSQL
RDBMS x NoSQL
 
SAP - SOLUTION MANAGER
SAP - SOLUTION MANAGER SAP - SOLUTION MANAGER
SAP - SOLUTION MANAGER
 
MS-SQL SERVER ARCHITECTURE
MS-SQL SERVER ARCHITECTUREMS-SQL SERVER ARCHITECTURE
MS-SQL SERVER ARCHITECTURE
 
DBA oracle
DBA oracleDBA oracle
DBA oracle
 
Hortonworks.Cluster Config Guide
Hortonworks.Cluster Config GuideHortonworks.Cluster Config Guide
Hortonworks.Cluster Config Guide
 
SAP Business Objects - Lopes Supermarket
SAP   Business Objects - Lopes SupermarketSAP   Business Objects - Lopes Supermarket
SAP Business Objects - Lopes Supermarket
 
SAP - Business Objects - Ri happy
SAP - Business Objects - Ri happySAP - Business Objects - Ri happy
SAP - Business Objects - Ri happy
 
Hadoop on retail
Hadoop on retailHadoop on retail
Hadoop on retail
 
Retail: Big data e Omni-Channel
Retail: Big data e Omni-ChannelRetail: Big data e Omni-Channel
Retail: Big data e Omni-Channel
 
Granular Access Control Using Cell Level Security In Accumulo
Granular Access Control  Using Cell Level Security  In Accumulo             Granular Access Control  Using Cell Level Security  In Accumulo
Granular Access Control Using Cell Level Security In Accumulo
 
Proposta aderencia drogaria onofre
Proposta aderencia   drogaria onofreProposta aderencia   drogaria onofre
Proposta aderencia drogaria onofre
 
SAP-Solution-Manager
SAP-Solution-ManagerSAP-Solution-Manager
SAP-Solution-Manager
 

Recently uploaded

Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Enterprise Wired
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 

Recently uploaded (20)

Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 

The REAL face of Big Data

  • 1. the REAL face of big data myths and facts
  • 2. Data is never clean! 2 Most of time will be spending on data cleansing and preparing data for analytics. 80% of a typical data science project is sourcing cleaning and preparing the data, while the remaining 20% is actual data analysis. “It’s an absolute myth that you can send an algorithm over raw data and have insights pop up.”
  • 3. AWS has a great support services. Amazon will not be transparent about the underlying infrastructure, but also will not give diagrams, machine details, etc., so it is difficult to get performance and measurement reports. Even though some EC2 managed services are available through third-parties, only paid support is available for “the most common third-party software running on AWS,” excluding managed support services. 2
  • 4. No one cares how you did it 4 Most technical presentations in industry are 1) far too long and 2) focus on the portions that don’t matter (the details of the method applied) to hide the parts that do (what does that mean for this prospect). However, I think a more constructive way to phrase this is “know your audience”. So makes awesome presentations. Sounds strange, but usually is most important than the algorithm itself.
  • 5. Data Lakes Will Replace The Data Warehouse 5 It's "misleading" for vendors to position data lakes as replacements for data warehouses. A data lake's foundational technologies lack the maturity and breadth of the features found in established data warehouse technologies. Many organizations get stuck at the pilot stage because they don't tie the technology to business processes or concrete use cases.
  • 6. Big data will give a B&W concrete answer 6 The real analytics is combining, weighting and judging multiple sources of information. The more data you have, the more analysis you must run. Big data has a strong capacity to brig other marketing questions, and other insights and makes leaders think better.
  • 7. Machine-learning will be the key! 7 In 90% of cases generalized linear regression will do the trick. The most basic and commonly used predictive analysis will do the job. Regression estimates are used to describe data and to explain the relationship between one dependent variable and one or more independent variables
  • 8. Get data scientists senior team 8 Doctorate in math, a background in computer science, and what amounts to an MBA, not to mention actual work experience in all of those fields. "How old is this person, 90? Almost impossible find this data-scientist unicorn, it had to create a working group with a cross-section of expertise. This is in fact what you have to do.
  • 9. 95% of tasks do not require deep learning. 9 It’s a hands-on job. Super-intelligent artificial automated algorithm that will solve all problem with magic-touch does not exists. This role requires a lot of dirty data model ing, coding, patience and focus.