SlideShare a Scribd company logo
copyright Earthsongs Holistic LLC
2014
Big Data:Big Data:
Strategies and SynergiesStrategies and Synergies
Melinda H. ConnorMelinda H. Connor
D.D., Ph.D., AMP, FAMD.D., Ph.D., AMP, FAM
Adjunct Professor, Akamai UniversityAdjunct Professor, Akamai University
copyright Earthsongs Holistic LLC
2014
Melinda H. Connor, D.D., Ph.D., AMP, FAMMelinda H. Connor, D.D., Ph.D., AMP, FAM
Adjunct Professor, Akamai University, Hilo, HawaiiAdjunct Professor, Akamai University, Hilo, Hawaii
Science Advisor, Spirituals for the 21st Century, GeorgiaScience Advisor, Spirituals for the 21st Century, Georgia
and Nolan Payton Archive of Sacred Music, Californiaand Nolan Payton Archive of Sacred Music, California
State University Dominguez HillsState University Dominguez Hills
CEO, National Foundation for Energy HealingCEO, National Foundation for Energy Healing
Dr. Connor is the former team lead level 3 support forDr. Connor is the former team lead level 3 support for
IBM’s Business Intelligence Technical Support Group.IBM’s Business Intelligence Technical Support Group.
Melinda_Connor@mindspring.comMelinda_Connor@mindspring.com
copyright Earthsongs Holistic LLC
2014
What are the “Big Issues”What are the “Big Issues”
around “Big Data”?around “Big Data”?
copyright Earthsongs Holistic LLC
2014
Challenges:Challenges:
• Quality of programming skills of theQuality of programming skills of the
computer programmers.computer programmers.
• Level of problem definition.Level of problem definition.
• Level of actual problemLevel of actual problem
understanding in the specific area.understanding in the specific area.
• Correct hardware to solve the issue.Correct hardware to solve the issue.
• Correct software to solve the issue.Correct software to solve the issue.
copyright Earthsongs Holistic LLC
2014
Challenges con’t:Challenges con’t:
• Intersection and compatibility of theIntersection and compatibility of the
hardware and software.hardware and software.
• Intersection and compatibility of theIntersection and compatibility of the
software on multiple platforms.software on multiple platforms.
• Understanding of the end user needs.Understanding of the end user needs.
• Production of the reports in a formatProduction of the reports in a format
that the end user can understand.that the end user can understand.
copyright Earthsongs Holistic LLC
2014
Client QuoteClient Quote
I don’t care how your softwareI don’t care how your software
works. I don’t want to spendworks. I don’t want to spend
time with your software. I justtime with your software. I just
want the data I need to run mywant the data I need to run my
business!business!
copyright Earthsongs Holistic LLC
2014
Flip Side:Flip Side:
• Poorly trained user community wantingPoorly trained user community wanting
turn key solutions.turn key solutions.
• The incorrect people making theThe incorrect people making the
purchasing decisions.purchasing decisions.
• Poorly defined understanding of what thePoorly defined understanding of what the
“real” problem is that they are trying to“real” problem is that they are trying to
solve.solve.
• Poor quality problem reports.Poor quality problem reports.
copyright Earthsongs Holistic LLC
2014
Where to start...Where to start...
copyright Earthsongs Holistic LLC
2014
How can utilize the terabytes per hourHow can utilize the terabytes per hour
that you are receiving?that you are receiving?
• Define the needs closely as possible to matchDefine the needs closely as possible to match
the needs of the business or situationthe needs of the business or situation
• Do data mining! There will be more that youDo data mining! There will be more that you
can usecan use
• Select the correct platform to do theSelect the correct platform to do the
processing at speedprocessing at speed
• Understand all of the tools that are available –Understand all of the tools that are available –
do not limit yourself to one companies toolsdo not limit yourself to one companies tools
but do write in clauses that the software mustbut do write in clauses that the software must
work together or no one gets paid.work together or no one gets paid.
copyright Earthsongs Holistic LLC
2014
What is the most effectiveWhat is the most effective
management of this “big data”?management of this “big data”?
• Play both ends against the middle!Play both ends against the middle!
–One end is the problem you are trying to solve.One end is the problem you are trying to solve.
–The other end is the report the end user needs.The other end is the report the end user needs.
• Build fast platforms that are correctly sized for theBuild fast platforms that are correctly sized for the
load.load.
• Limit the bottlenecks in the hardware.Limit the bottlenecks in the hardware.
• Have the correct people do the purchasing and useHave the correct people do the purchasing and use
industry specialists.industry specialists.
copyright Earthsongs Holistic LLC
2014
SPEED,SPEED,
CORRECT PLATFORM,CORRECT PLATFORM,
CORRECT FORM OF DATA BASE,CORRECT FORM OF DATA BASE,
CORRECT TOOLS for ANALYSISCORRECT TOOLS for ANALYSIS
and theand the
CORRECT FORM OF THE REPORTCORRECT FORM OF THE REPORT
copyright Earthsongs Holistic LLC
2014
What are the most effective ways ofWhat are the most effective ways of
understanding the ecologicalunderstanding the ecological
landscape of the data you arelandscape of the data you are
receiving?receiving?
• Start by understanding the types of data you areStart by understanding the types of data you are
collecting.collecting.
• Then understand the tools available.Then understand the tools available.
• For example: Object oriented vs relationalFor example: Object oriented vs relational
databases which do you use and when do youdatabases which do you use and when do you
use one or the other?use one or the other?
copyright Earthsongs Holistic LLC
2014
How do you determine new corporateHow do you determine new corporate
strategic direction based on the datastrategic direction based on the data
when the shape of the data itself is notwhen the shape of the data itself is not
clear?clear?
By defining the problem that youBy defining the problem that you
areare
trying to solve very tightly. Thentrying to solve very tightly. Then
you get the data which answers theyou get the data which answers the
copyright Earthsongs Holistic LLC
2014
How long do you keep the raw data?How long do you keep the raw data?
• How much storage space do you have available and how fastHow much storage space do you have available and how fast
are you getting the data?are you getting the data?
• What are your storage processing speeds and how fast can youWhat are your storage processing speeds and how fast can you
process the data that is available.process the data that is available.
• Know where the bottlenecks are in the physical limitations ofKnow where the bottlenecks are in the physical limitations of
your hardware:your hardware:
• For example: if you have a slow IO handler?For example: if you have a slow IO handler?
• Know the limitations in the way your database is designed:Know the limitations in the way your database is designed:
• File vs table vs row/column locking!File vs table vs row/column locking!
• What about threading?What about threading?
• When is the OS software going to start thrashing?When is the OS software going to start thrashing?
• What about speed of allocation of memory space?What about speed of allocation of memory space?
• What are the legal requirements?What are the legal requirements?
copyright Earthsongs Holistic LLC
2014
Real World Example:Real World Example:
• Internet broadcast of a scienceInternet broadcast of a science
experiment:experiment:
• 8k users logged on a system designed8k users logged on a system designed
for 2400 users with differentfor 2400 users with different
businesses.businesses.
• RESULTRESULT
• Crashed every server in the system.Crashed every server in the system.
copyright Earthsongs Holistic LLC
2014
And what data will you dump?And what data will you dump?
• Everything you can! You will be gettingEverything you can! You will be getting
more!more!
• Life/data runs in cycles. You will not hear orLife/data runs in cycles. You will not hear or
see the information only once. There are wayssee the information only once. There are ways
to back up the raw data and keep it for ato back up the raw data and keep it for a
number of years but do you REALLY neednumber of years but do you REALLY need
that data?that data?
copyright Earthsongs Holistic LLC
2014
What about the limitations of theWhat about the limitations of the
hardware of the various platforms andhardware of the various platforms and
the network structure itself?the network structure itself?
• Problem definition skills of decision makers.Problem definition skills of decision makers.
• They do not define the needs of the business closelyThey do not define the needs of the business closely
enough because they are not using the actual data.enough because they are not using the actual data.
• Do not understand sizing the volume of data properlyDo not understand sizing the volume of data properly
so that the correct processing platform is selected.so that the correct processing platform is selected.
• Do not understand what shape the final product needsDo not understand what shape the final product needs
to be in to be useful to the team.to be in to be useful to the team.
copyright Earthsongs Holistic LLC
2014
Real World Example:Real World Example:
• Hospital System (50 hospitals)Hospital System (50 hospitals)
– Wanted to have end users on PC’s so selected a PCWanted to have end users on PC’s so selected a PC
based system which could not handle thebased system which could not handle the
processing load.processing load.
– Decided on centralized servers without tieredDecided on centralized servers without tiered
support.support.
– Did not purchase enough servers.Did not purchase enough servers.
– Did not distribute network load effectively.Did not distribute network load effectively.
– Did not provide enough training on the software toDid not provide enough training on the software to
medical personnel.medical personnel.
copyright Earthsongs Holistic LLC
2014
Programmer TrainingProgrammer Training
• Issues with the training of the programmers:Issues with the training of the programmers:
– Many do not understand how to write theMany do not understand how to write the
software to use the hardware mostsoftware to use the hardware most
effectively.effectively.
– AND they do not understand the stacking.AND they do not understand the stacking.
– AND they do not understand how toAND they do not understand how to
optimize the code to make the best use ofoptimize the code to make the best use of
the compilers.the compilers.
copyright Earthsongs Holistic LLC
2014
Use an industry specialist!Use an industry specialist!
copyright Earthsongs Holistic LLC
2014
What are the most effective ways ofWhat are the most effective ways of
data-mining?data-mining?
• Specialized software for the platform.Specialized software for the platform.
• Build the algorithms to determine if there areBuild the algorithms to determine if there are
any random correspondences.any random correspondences.
• Know what data you what to review.Know what data you what to review.
• Build meta-data platforms whenever possible.Build meta-data platforms whenever possible.
• Have the people doing the design and buildsHave the people doing the design and builds
understand the shape of the data before theyunderstand the shape of the data before they
start!start!
copyright Earthsongs Holistic LLC
2014
Real World Example:Real World Example:
• Soft Drink Company in 122 countries:Soft Drink Company in 122 countries:
• Need to understand peek load days forNeed to understand peek load days for
manufacture and distribution.manufacture and distribution.
• Problem trying to address was concurrenceProblem trying to address was concurrence
when one country would have to support thewhen one country would have to support the
overload of another.overload of another.
• Meta-data critical to understanding andMeta-data critical to understanding and
defining the shape of the data.defining the shape of the data.
copyright Earthsongs Holistic LLC
2014
What about cross platform portabilityWhat about cross platform portability
of the final product?of the final product?
Wolf Geiger (1992) - Data is only asWolf Geiger (1992) - Data is only as
good as the format in which it isgood as the format in which it is
presented to the person who has to usepresented to the person who has to use
it. If it is not in a format that they canit. If it is not in a format that they can
use there is no point in spending theuse there is no point in spending the
time to do any of the processing.time to do any of the processing.
copyright Earthsongs Holistic LLC
2014
Real World Example:Real World Example:
• Asked the end user to write down exactly whatAsked the end user to write down exactly what
they wanted in the report.they wanted in the report.
• Asked the manager to write down exactly whatAsked the manager to write down exactly what
they wanted in the report.they wanted in the report.
• Asked the computer programmer to writeAsked the computer programmer to write
down exactly what the clients wanted in thedown exactly what the clients wanted in the
report.report.
• Two of three matched. Which one did not?Two of three matched. Which one did not?
copyright Earthsongs Holistic LLC
2014
Cell Phone Data: How should it beCell Phone Data: How should it be
parsed?parsed?
• Has to be done on super computers to start based on the volume of the dataHas to be done on super computers to start based on the volume of the data
but it has to end in PC formats!but it has to end in PC formats!
• Object oriented db with full variable length fields.Object oriented db with full variable length fields.
• Needs Multi-dimensional processing:Needs Multi-dimensional processing:
– Computational linguistics.Computational linguistics.
• Analysis of word stressors.Analysis of word stressors.
• Analysis of grammatical syntax.Analysis of grammatical syntax.
– Cognitive focus (topic basis).Cognitive focus (topic basis).
– Recognized vocal stress vs topic.Recognized vocal stress vs topic.
– Risk factor assignment.Risk factor assignment.
– Background noise assessment.Background noise assessment.
– Probability analysis of each of the factors to determine further review.Probability analysis of each of the factors to determine further review.
• Data presentation tools have to be in a format that is currently used thatData presentation tools have to be in a format that is currently used that
everyone understands where to look to find the important information.everyone understands where to look to find the important information.
• Cross platform portability!!!!Cross platform portability!!!!
copyright Earthsongs Holistic LLC
2014
Questions?Questions?
copyright Earthsongs Holistic LLC
2014
Thank you!Thank you!

More Related Content

Viewers also liked

Xarxes Socials,DiferèNcies I Funcionalitats
Xarxes Socials,DiferèNcies I FuncionalitatsXarxes Socials,DiferèNcies I Funcionalitats
Xarxes Socials,DiferèNcies I Funcionalitats
Oscar Cumí
 
Logiadis1
Logiadis1Logiadis1
Logiadis1
gymnasio
 
Educa Presentation
Educa PresentationEduca Presentation
Educa PresentationAndy Black
 
Networking Como Herramienta Profesional
Networking Como Herramienta ProfesionalNetworking Como Herramienta Profesional
Networking Como Herramienta Profesional
Oscar Cumí
 
团队建设2
团队建设2团队建设2
团队建设220004
 
Ccna
CcnaCcna
Ccna
20004
 
心态调整及认同
心态调整及认同心态调整及认同
心态调整及认同20004
 
积极的心态4
积极的心态4积极的心态4
积极的心态420004
 
6 telphone and online crisis counseling
6 telphone and online crisis counseling6 telphone and online crisis counseling
6 telphone and online crisis counselingDon Thompson
 
Prespes 2011
Prespes 2011Prespes 2011
Prespes 2011
gymnasio
 
Networking Com A Eina Professional
Networking Com A Eina ProfessionalNetworking Com A Eina Professional
Networking Com A Eina ProfessionalOscar Cumí
 
Reading Grade 2 A2
Reading Grade 2 A2Reading Grade 2 A2
Reading Grade 2 A2missmarsh
 
24 Hours In The Life Of Muslim
24 Hours In The Life Of Muslim24 Hours In The Life Of Muslim
24 Hours In The Life Of Muslim1911213114
 
Danseaza lent1
Danseaza lent1Danseaza lent1
Danseaza lent1Nicky Nic
 
Music On The Mound
Music On The MoundMusic On The Mound
Music On The MoundRobbyBarbaro
 
胜任力模型
胜任力模型胜任力模型
胜任力模型20004
 
Ferice de cei
Ferice de ceiFerice de cei
Ferice de ceiNicky Nic
 
Snowy owl nikki parker
Snowy owl nikki parkerSnowy owl nikki parker
Snowy owl nikki parkervermigle
 
Pretty In Pink Fashion Show
Pretty In Pink Fashion ShowPretty In Pink Fashion Show
Pretty In Pink Fashion ShowRobbyBarbaro
 

Viewers also liked (20)

Xarxes Socials,DiferèNcies I Funcionalitats
Xarxes Socials,DiferèNcies I FuncionalitatsXarxes Socials,DiferèNcies I Funcionalitats
Xarxes Socials,DiferèNcies I Funcionalitats
 
Logiadis1
Logiadis1Logiadis1
Logiadis1
 
Educa Presentation
Educa PresentationEduca Presentation
Educa Presentation
 
Networking Como Herramienta Profesional
Networking Como Herramienta ProfesionalNetworking Como Herramienta Profesional
Networking Como Herramienta Profesional
 
团队建设2
团队建设2团队建设2
团队建设2
 
Ccna
CcnaCcna
Ccna
 
心态调整及认同
心态调整及认同心态调整及认同
心态调整及认同
 
积极的心态4
积极的心态4积极的心态4
积极的心态4
 
Web Presen
Web PresenWeb Presen
Web Presen
 
6 telphone and online crisis counseling
6 telphone and online crisis counseling6 telphone and online crisis counseling
6 telphone and online crisis counseling
 
Prespes 2011
Prespes 2011Prespes 2011
Prespes 2011
 
Networking Com A Eina Professional
Networking Com A Eina ProfessionalNetworking Com A Eina Professional
Networking Com A Eina Professional
 
Reading Grade 2 A2
Reading Grade 2 A2Reading Grade 2 A2
Reading Grade 2 A2
 
24 Hours In The Life Of Muslim
24 Hours In The Life Of Muslim24 Hours In The Life Of Muslim
24 Hours In The Life Of Muslim
 
Danseaza lent1
Danseaza lent1Danseaza lent1
Danseaza lent1
 
Music On The Mound
Music On The MoundMusic On The Mound
Music On The Mound
 
胜任力模型
胜任力模型胜任力模型
胜任力模型
 
Ferice de cei
Ferice de ceiFerice de cei
Ferice de cei
 
Snowy owl nikki parker
Snowy owl nikki parkerSnowy owl nikki parker
Snowy owl nikki parker
 
Pretty In Pink Fashion Show
Pretty In Pink Fashion ShowPretty In Pink Fashion Show
Pretty In Pink Fashion Show
 

Similar to Connor big data

Data Mining & Engineering
Data Mining & EngineeringData Mining & Engineering
Data Mining & Engineering
Visible Technologies
 
Trusting a Distributed Data Pipeline | Masters of Conversion
Trusting a Distributed Data Pipeline | Masters of ConversionTrusting a Distributed Data Pipeline | Masters of Conversion
Trusting a Distributed Data Pipeline | Masters of Conversion
VWO
 
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
Lynne Thomas
 
OSMC 2015 | Testing in Production by Devdas Bhagat
OSMC 2015 | Testing in Production by Devdas BhagatOSMC 2015 | Testing in Production by Devdas Bhagat
OSMC 2015 | Testing in Production by Devdas Bhagat
NETWAYS
 
OSMC 2015: Testing in Production by Devdas Bhagat
OSMC 2015: Testing in Production by Devdas BhagatOSMC 2015: Testing in Production by Devdas Bhagat
OSMC 2015: Testing in Production by Devdas Bhagat
NETWAYS
 
Data mining & warehousing
Data mining & warehousingData mining & warehousing
Data mining & warehousingSamoneh Dashti
 
IW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester ResearchIW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester Research
Software AG
 
2018 10 igneous
2018 10 igneous2018 10 igneous
2018 10 igneous
Chris Dwan
 
Customer Discovery in the DOD/IC Workshop H4D Stanford 2016
Customer Discovery in the DOD/IC Workshop H4D Stanford 2016Customer Discovery in the DOD/IC Workshop H4D Stanford 2016
Customer Discovery in the DOD/IC Workshop H4D Stanford 2016
Stanford University
 
Infrastructure is development
Infrastructure is developmentInfrastructure is development
Infrastructure is development
stahnma
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Owais Ashraf
 
How To Keep Your Job
How To Keep Your JobHow To Keep Your Job
How To Keep Your Job
pragdave
 
How to get what you really want from Testing' with Michael Bolton
How to get what you really want from Testing' with Michael BoltonHow to get what you really want from Testing' with Michael Bolton
How to get what you really want from Testing' with Michael Bolton
TEST Huddle
 
The Salesforce Playbook- 6 Steps to Better Deployments
The Salesforce Playbook- 6 Steps to Better DeploymentsThe Salesforce Playbook- 6 Steps to Better Deployments
The Salesforce Playbook- 6 Steps to Better Deployments
Alex Cowan
 
Programming agents without a programming language
Programming agents without a programming languageProgramming agents without a programming language
Programming agents without a programming language
Aryan Rathore
 
Analyzing organization e-mails in near real time using hadoop ecosystem tools...
Analyzing organization e-mails in near real time using hadoop ecosystem tools...Analyzing organization e-mails in near real time using hadoop ecosystem tools...
Analyzing organization e-mails in near real time using hadoop ecosystem tools...
Big Data Spain
 
Accounting, Reporting, Auditing, and Analysis in a Digital Environmentintro
Accounting, Reporting, Auditing, and Analysis in a Digital EnvironmentintroAccounting, Reporting, Auditing, and Analysis in a Digital Environmentintro
Accounting, Reporting, Auditing, and Analysis in a Digital Environmentintro
Charles Hoffman
 
Customer Insights Workshop - Consumer Text Analytics Conference
Customer Insights Workshop - Consumer Text Analytics ConferenceCustomer Insights Workshop - Consumer Text Analytics Conference
Customer Insights Workshop - Consumer Text Analytics Conference
Mekkin Bjarnadottir
 
Designing for the Prime Interface
Designing for the Prime InterfaceDesigning for the Prime Interface
Designing for the Prime Interface
Ben Taylor
 
(Webinar Slides) Choosing Legal Tech for Your Office
(Webinar Slides) Choosing Legal Tech for Your Office(Webinar Slides) Choosing Legal Tech for Your Office
(Webinar Slides) Choosing Legal Tech for Your Office
MyCase Legal Case and Practice Management Software
 

Similar to Connor big data (20)

Data Mining & Engineering
Data Mining & EngineeringData Mining & Engineering
Data Mining & Engineering
 
Trusting a Distributed Data Pipeline | Masters of Conversion
Trusting a Distributed Data Pipeline | Masters of ConversionTrusting a Distributed Data Pipeline | Masters of Conversion
Trusting a Distributed Data Pipeline | Masters of Conversion
 
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
 
OSMC 2015 | Testing in Production by Devdas Bhagat
OSMC 2015 | Testing in Production by Devdas BhagatOSMC 2015 | Testing in Production by Devdas Bhagat
OSMC 2015 | Testing in Production by Devdas Bhagat
 
OSMC 2015: Testing in Production by Devdas Bhagat
OSMC 2015: Testing in Production by Devdas BhagatOSMC 2015: Testing in Production by Devdas Bhagat
OSMC 2015: Testing in Production by Devdas Bhagat
 
Data mining & warehousing
Data mining & warehousingData mining & warehousing
Data mining & warehousing
 
IW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester ResearchIW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester Research
 
2018 10 igneous
2018 10 igneous2018 10 igneous
2018 10 igneous
 
Customer Discovery in the DOD/IC Workshop H4D Stanford 2016
Customer Discovery in the DOD/IC Workshop H4D Stanford 2016Customer Discovery in the DOD/IC Workshop H4D Stanford 2016
Customer Discovery in the DOD/IC Workshop H4D Stanford 2016
 
Infrastructure is development
Infrastructure is developmentInfrastructure is development
Infrastructure is development
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
How To Keep Your Job
How To Keep Your JobHow To Keep Your Job
How To Keep Your Job
 
How to get what you really want from Testing' with Michael Bolton
How to get what you really want from Testing' with Michael BoltonHow to get what you really want from Testing' with Michael Bolton
How to get what you really want from Testing' with Michael Bolton
 
The Salesforce Playbook- 6 Steps to Better Deployments
The Salesforce Playbook- 6 Steps to Better DeploymentsThe Salesforce Playbook- 6 Steps to Better Deployments
The Salesforce Playbook- 6 Steps to Better Deployments
 
Programming agents without a programming language
Programming agents without a programming languageProgramming agents without a programming language
Programming agents without a programming language
 
Analyzing organization e-mails in near real time using hadoop ecosystem tools...
Analyzing organization e-mails in near real time using hadoop ecosystem tools...Analyzing organization e-mails in near real time using hadoop ecosystem tools...
Analyzing organization e-mails in near real time using hadoop ecosystem tools...
 
Accounting, Reporting, Auditing, and Analysis in a Digital Environmentintro
Accounting, Reporting, Auditing, and Analysis in a Digital EnvironmentintroAccounting, Reporting, Auditing, and Analysis in a Digital Environmentintro
Accounting, Reporting, Auditing, and Analysis in a Digital Environmentintro
 
Customer Insights Workshop - Consumer Text Analytics Conference
Customer Insights Workshop - Consumer Text Analytics ConferenceCustomer Insights Workshop - Consumer Text Analytics Conference
Customer Insights Workshop - Consumer Text Analytics Conference
 
Designing for the Prime Interface
Designing for the Prime InterfaceDesigning for the Prime Interface
Designing for the Prime Interface
 
(Webinar Slides) Choosing Legal Tech for Your Office
(Webinar Slides) Choosing Legal Tech for Your Office(Webinar Slides) Choosing Legal Tech for Your Office
(Webinar Slides) Choosing Legal Tech for Your Office
 

More from David Jimenez

IAFIE Essay Comptetion 2019
IAFIE Essay Comptetion 2019 IAFIE Essay Comptetion 2019
IAFIE Essay Comptetion 2019
David Jimenez
 
Iafie 2019 call for proposals (extended deadline)
Iafie 2019   call for proposals (extended deadline)Iafie 2019   call for proposals (extended deadline)
Iafie 2019 call for proposals (extended deadline)
David Jimenez
 
Iafie instructor of_the_year_award_2017 (final) (2)
Iafie instructor of_the_year_award_2017 (final) (2)Iafie instructor of_the_year_award_2017 (final) (2)
Iafie instructor of_the_year_award_2017 (final) (2)
David Jimenez
 
Iafie call for proposals (extended deadline) (1)
Iafie call for proposals (extended deadline) (1)Iafie call for proposals (extended deadline) (1)
Iafie call for proposals (extended deadline) (1)
David Jimenez
 
Call for proposals (iafie) (2)
Call for proposals (iafie) (2)Call for proposals (iafie) (2)
Call for proposals (iafie) (2)
David Jimenez
 
Iafie instructor of_the_year_award_2017 (final) (1)
Iafie instructor of_the_year_award_2017 (final) (1)Iafie instructor of_the_year_award_2017 (final) (1)
Iafie instructor of_the_year_award_2017 (final) (1)
David Jimenez
 
Quarterly training agenda january 2017
Quarterly training agenda   january 2017Quarterly training agenda   january 2017
Quarterly training agenda january 2017
David Jimenez
 
Jan 24 registration form
Jan 24 registration formJan 24 registration form
Jan 24 registration form
David Jimenez
 
Call for proposals (iafie) (1)
Call for proposals (iafie) (1)Call for proposals (iafie) (1)
Call for proposals (iafie) (1)
David Jimenez
 
Iafie europe 2017
Iafie europe 2017Iafie europe 2017
Iafie europe 2017
David Jimenez
 
IAFIE Instructor of The Year Award 2017
IAFIE Instructor of The Year Award 2017 IAFIE Instructor of The Year Award 2017
IAFIE Instructor of The Year Award 2017
David Jimenez
 
Mercyhurst
MercyhurstMercyhurst
Mercyhurst
David Jimenez
 
Save the Date! Monday December 5, 2016 with Joe Caddell
Save the Date!  Monday December 5, 2016 with Joe CaddellSave the Date!  Monday December 5, 2016 with Joe Caddell
Save the Date! Monday December 5, 2016 with Joe Caddell
David Jimenez
 
Nisa congres20160601 25_years_nisa_detailed_programme_v05(1)
Nisa congres20160601 25_years_nisa_detailed_programme_v05(1)Nisa congres20160601 25_years_nisa_detailed_programme_v05(1)
Nisa congres20160601 25_years_nisa_detailed_programme_v05(1)
David Jimenez
 
Globalytica breda workshops jun 2016
Globalytica breda workshops jun 2016Globalytica breda workshops jun 2016
Globalytica breda workshops jun 2016
David Jimenez
 
Washington DC Area Chapter of IAFIE Spring 2016 Newsletter
Washington DC Area Chapter of IAFIE Spring 2016 NewsletterWashington DC Area Chapter of IAFIE Spring 2016 Newsletter
Washington DC Area Chapter of IAFIE Spring 2016 Newsletter
David Jimenez
 
The domestic intel flyer 3
The domestic intel flyer 3The domestic intel flyer 3
The domestic intel flyer 3
David Jimenez
 
Lint Center Launches Partnership with IAFIE to Advance Intelligence Education
Lint Center Launches Partnership with IAFIE to Advance Intelligence EducationLint Center Launches Partnership with IAFIE to Advance Intelligence Education
Lint Center Launches Partnership with IAFIE to Advance Intelligence Education
David Jimenez
 
Iafie clift
Iafie cliftIafie clift
Iafie clift
David Jimenez
 
Iafie instructor of_the_year_award_2016_final
Iafie instructor of_the_year_award_2016_finalIafie instructor of_the_year_award_2016_final
Iafie instructor of_the_year_award_2016_final
David Jimenez
 

More from David Jimenez (20)

IAFIE Essay Comptetion 2019
IAFIE Essay Comptetion 2019 IAFIE Essay Comptetion 2019
IAFIE Essay Comptetion 2019
 
Iafie 2019 call for proposals (extended deadline)
Iafie 2019   call for proposals (extended deadline)Iafie 2019   call for proposals (extended deadline)
Iafie 2019 call for proposals (extended deadline)
 
Iafie instructor of_the_year_award_2017 (final) (2)
Iafie instructor of_the_year_award_2017 (final) (2)Iafie instructor of_the_year_award_2017 (final) (2)
Iafie instructor of_the_year_award_2017 (final) (2)
 
Iafie call for proposals (extended deadline) (1)
Iafie call for proposals (extended deadline) (1)Iafie call for proposals (extended deadline) (1)
Iafie call for proposals (extended deadline) (1)
 
Call for proposals (iafie) (2)
Call for proposals (iafie) (2)Call for proposals (iafie) (2)
Call for proposals (iafie) (2)
 
Iafie instructor of_the_year_award_2017 (final) (1)
Iafie instructor of_the_year_award_2017 (final) (1)Iafie instructor of_the_year_award_2017 (final) (1)
Iafie instructor of_the_year_award_2017 (final) (1)
 
Quarterly training agenda january 2017
Quarterly training agenda   january 2017Quarterly training agenda   january 2017
Quarterly training agenda january 2017
 
Jan 24 registration form
Jan 24 registration formJan 24 registration form
Jan 24 registration form
 
Call for proposals (iafie) (1)
Call for proposals (iafie) (1)Call for proposals (iafie) (1)
Call for proposals (iafie) (1)
 
Iafie europe 2017
Iafie europe 2017Iafie europe 2017
Iafie europe 2017
 
IAFIE Instructor of The Year Award 2017
IAFIE Instructor of The Year Award 2017 IAFIE Instructor of The Year Award 2017
IAFIE Instructor of The Year Award 2017
 
Mercyhurst
MercyhurstMercyhurst
Mercyhurst
 
Save the Date! Monday December 5, 2016 with Joe Caddell
Save the Date!  Monday December 5, 2016 with Joe CaddellSave the Date!  Monday December 5, 2016 with Joe Caddell
Save the Date! Monday December 5, 2016 with Joe Caddell
 
Nisa congres20160601 25_years_nisa_detailed_programme_v05(1)
Nisa congres20160601 25_years_nisa_detailed_programme_v05(1)Nisa congres20160601 25_years_nisa_detailed_programme_v05(1)
Nisa congres20160601 25_years_nisa_detailed_programme_v05(1)
 
Globalytica breda workshops jun 2016
Globalytica breda workshops jun 2016Globalytica breda workshops jun 2016
Globalytica breda workshops jun 2016
 
Washington DC Area Chapter of IAFIE Spring 2016 Newsletter
Washington DC Area Chapter of IAFIE Spring 2016 NewsletterWashington DC Area Chapter of IAFIE Spring 2016 Newsletter
Washington DC Area Chapter of IAFIE Spring 2016 Newsletter
 
The domestic intel flyer 3
The domestic intel flyer 3The domestic intel flyer 3
The domestic intel flyer 3
 
Lint Center Launches Partnership with IAFIE to Advance Intelligence Education
Lint Center Launches Partnership with IAFIE to Advance Intelligence EducationLint Center Launches Partnership with IAFIE to Advance Intelligence Education
Lint Center Launches Partnership with IAFIE to Advance Intelligence Education
 
Iafie clift
Iafie cliftIafie clift
Iafie clift
 
Iafie instructor of_the_year_award_2016_final
Iafie instructor of_the_year_award_2016_finalIafie instructor of_the_year_award_2016_final
Iafie instructor of_the_year_award_2016_final
 

Connor big data

  • 1. copyright Earthsongs Holistic LLC 2014 Big Data:Big Data: Strategies and SynergiesStrategies and Synergies Melinda H. ConnorMelinda H. Connor D.D., Ph.D., AMP, FAMD.D., Ph.D., AMP, FAM Adjunct Professor, Akamai UniversityAdjunct Professor, Akamai University
  • 2. copyright Earthsongs Holistic LLC 2014 Melinda H. Connor, D.D., Ph.D., AMP, FAMMelinda H. Connor, D.D., Ph.D., AMP, FAM Adjunct Professor, Akamai University, Hilo, HawaiiAdjunct Professor, Akamai University, Hilo, Hawaii Science Advisor, Spirituals for the 21st Century, GeorgiaScience Advisor, Spirituals for the 21st Century, Georgia and Nolan Payton Archive of Sacred Music, Californiaand Nolan Payton Archive of Sacred Music, California State University Dominguez HillsState University Dominguez Hills CEO, National Foundation for Energy HealingCEO, National Foundation for Energy Healing Dr. Connor is the former team lead level 3 support forDr. Connor is the former team lead level 3 support for IBM’s Business Intelligence Technical Support Group.IBM’s Business Intelligence Technical Support Group. Melinda_Connor@mindspring.comMelinda_Connor@mindspring.com
  • 3. copyright Earthsongs Holistic LLC 2014 What are the “Big Issues”What are the “Big Issues” around “Big Data”?around “Big Data”?
  • 4. copyright Earthsongs Holistic LLC 2014 Challenges:Challenges: • Quality of programming skills of theQuality of programming skills of the computer programmers.computer programmers. • Level of problem definition.Level of problem definition. • Level of actual problemLevel of actual problem understanding in the specific area.understanding in the specific area. • Correct hardware to solve the issue.Correct hardware to solve the issue. • Correct software to solve the issue.Correct software to solve the issue.
  • 5. copyright Earthsongs Holistic LLC 2014 Challenges con’t:Challenges con’t: • Intersection and compatibility of theIntersection and compatibility of the hardware and software.hardware and software. • Intersection and compatibility of theIntersection and compatibility of the software on multiple platforms.software on multiple platforms. • Understanding of the end user needs.Understanding of the end user needs. • Production of the reports in a formatProduction of the reports in a format that the end user can understand.that the end user can understand.
  • 6. copyright Earthsongs Holistic LLC 2014 Client QuoteClient Quote I don’t care how your softwareI don’t care how your software works. I don’t want to spendworks. I don’t want to spend time with your software. I justtime with your software. I just want the data I need to run mywant the data I need to run my business!business!
  • 7. copyright Earthsongs Holistic LLC 2014 Flip Side:Flip Side: • Poorly trained user community wantingPoorly trained user community wanting turn key solutions.turn key solutions. • The incorrect people making theThe incorrect people making the purchasing decisions.purchasing decisions. • Poorly defined understanding of what thePoorly defined understanding of what the “real” problem is that they are trying to“real” problem is that they are trying to solve.solve. • Poor quality problem reports.Poor quality problem reports.
  • 8. copyright Earthsongs Holistic LLC 2014 Where to start...Where to start...
  • 9. copyright Earthsongs Holistic LLC 2014 How can utilize the terabytes per hourHow can utilize the terabytes per hour that you are receiving?that you are receiving? • Define the needs closely as possible to matchDefine the needs closely as possible to match the needs of the business or situationthe needs of the business or situation • Do data mining! There will be more that youDo data mining! There will be more that you can usecan use • Select the correct platform to do theSelect the correct platform to do the processing at speedprocessing at speed • Understand all of the tools that are available –Understand all of the tools that are available – do not limit yourself to one companies toolsdo not limit yourself to one companies tools but do write in clauses that the software mustbut do write in clauses that the software must work together or no one gets paid.work together or no one gets paid.
  • 10. copyright Earthsongs Holistic LLC 2014 What is the most effectiveWhat is the most effective management of this “big data”?management of this “big data”? • Play both ends against the middle!Play both ends against the middle! –One end is the problem you are trying to solve.One end is the problem you are trying to solve. –The other end is the report the end user needs.The other end is the report the end user needs. • Build fast platforms that are correctly sized for theBuild fast platforms that are correctly sized for the load.load. • Limit the bottlenecks in the hardware.Limit the bottlenecks in the hardware. • Have the correct people do the purchasing and useHave the correct people do the purchasing and use industry specialists.industry specialists.
  • 11. copyright Earthsongs Holistic LLC 2014 SPEED,SPEED, CORRECT PLATFORM,CORRECT PLATFORM, CORRECT FORM OF DATA BASE,CORRECT FORM OF DATA BASE, CORRECT TOOLS for ANALYSISCORRECT TOOLS for ANALYSIS and theand the CORRECT FORM OF THE REPORTCORRECT FORM OF THE REPORT
  • 12. copyright Earthsongs Holistic LLC 2014 What are the most effective ways ofWhat are the most effective ways of understanding the ecologicalunderstanding the ecological landscape of the data you arelandscape of the data you are receiving?receiving? • Start by understanding the types of data you areStart by understanding the types of data you are collecting.collecting. • Then understand the tools available.Then understand the tools available. • For example: Object oriented vs relationalFor example: Object oriented vs relational databases which do you use and when do youdatabases which do you use and when do you use one or the other?use one or the other?
  • 13. copyright Earthsongs Holistic LLC 2014 How do you determine new corporateHow do you determine new corporate strategic direction based on the datastrategic direction based on the data when the shape of the data itself is notwhen the shape of the data itself is not clear?clear? By defining the problem that youBy defining the problem that you areare trying to solve very tightly. Thentrying to solve very tightly. Then you get the data which answers theyou get the data which answers the
  • 14. copyright Earthsongs Holistic LLC 2014 How long do you keep the raw data?How long do you keep the raw data? • How much storage space do you have available and how fastHow much storage space do you have available and how fast are you getting the data?are you getting the data? • What are your storage processing speeds and how fast can youWhat are your storage processing speeds and how fast can you process the data that is available.process the data that is available. • Know where the bottlenecks are in the physical limitations ofKnow where the bottlenecks are in the physical limitations of your hardware:your hardware: • For example: if you have a slow IO handler?For example: if you have a slow IO handler? • Know the limitations in the way your database is designed:Know the limitations in the way your database is designed: • File vs table vs row/column locking!File vs table vs row/column locking! • What about threading?What about threading? • When is the OS software going to start thrashing?When is the OS software going to start thrashing? • What about speed of allocation of memory space?What about speed of allocation of memory space? • What are the legal requirements?What are the legal requirements?
  • 15. copyright Earthsongs Holistic LLC 2014 Real World Example:Real World Example: • Internet broadcast of a scienceInternet broadcast of a science experiment:experiment: • 8k users logged on a system designed8k users logged on a system designed for 2400 users with differentfor 2400 users with different businesses.businesses. • RESULTRESULT • Crashed every server in the system.Crashed every server in the system.
  • 16. copyright Earthsongs Holistic LLC 2014 And what data will you dump?And what data will you dump? • Everything you can! You will be gettingEverything you can! You will be getting more!more! • Life/data runs in cycles. You will not hear orLife/data runs in cycles. You will not hear or see the information only once. There are wayssee the information only once. There are ways to back up the raw data and keep it for ato back up the raw data and keep it for a number of years but do you REALLY neednumber of years but do you REALLY need that data?that data?
  • 17. copyright Earthsongs Holistic LLC 2014 What about the limitations of theWhat about the limitations of the hardware of the various platforms andhardware of the various platforms and the network structure itself?the network structure itself? • Problem definition skills of decision makers.Problem definition skills of decision makers. • They do not define the needs of the business closelyThey do not define the needs of the business closely enough because they are not using the actual data.enough because they are not using the actual data. • Do not understand sizing the volume of data properlyDo not understand sizing the volume of data properly so that the correct processing platform is selected.so that the correct processing platform is selected. • Do not understand what shape the final product needsDo not understand what shape the final product needs to be in to be useful to the team.to be in to be useful to the team.
  • 18. copyright Earthsongs Holistic LLC 2014 Real World Example:Real World Example: • Hospital System (50 hospitals)Hospital System (50 hospitals) – Wanted to have end users on PC’s so selected a PCWanted to have end users on PC’s so selected a PC based system which could not handle thebased system which could not handle the processing load.processing load. – Decided on centralized servers without tieredDecided on centralized servers without tiered support.support. – Did not purchase enough servers.Did not purchase enough servers. – Did not distribute network load effectively.Did not distribute network load effectively. – Did not provide enough training on the software toDid not provide enough training on the software to medical personnel.medical personnel.
  • 19. copyright Earthsongs Holistic LLC 2014 Programmer TrainingProgrammer Training • Issues with the training of the programmers:Issues with the training of the programmers: – Many do not understand how to write theMany do not understand how to write the software to use the hardware mostsoftware to use the hardware most effectively.effectively. – AND they do not understand the stacking.AND they do not understand the stacking. – AND they do not understand how toAND they do not understand how to optimize the code to make the best use ofoptimize the code to make the best use of the compilers.the compilers.
  • 20. copyright Earthsongs Holistic LLC 2014 Use an industry specialist!Use an industry specialist!
  • 21. copyright Earthsongs Holistic LLC 2014 What are the most effective ways ofWhat are the most effective ways of data-mining?data-mining? • Specialized software for the platform.Specialized software for the platform. • Build the algorithms to determine if there areBuild the algorithms to determine if there are any random correspondences.any random correspondences. • Know what data you what to review.Know what data you what to review. • Build meta-data platforms whenever possible.Build meta-data platforms whenever possible. • Have the people doing the design and buildsHave the people doing the design and builds understand the shape of the data before theyunderstand the shape of the data before they start!start!
  • 22. copyright Earthsongs Holistic LLC 2014 Real World Example:Real World Example: • Soft Drink Company in 122 countries:Soft Drink Company in 122 countries: • Need to understand peek load days forNeed to understand peek load days for manufacture and distribution.manufacture and distribution. • Problem trying to address was concurrenceProblem trying to address was concurrence when one country would have to support thewhen one country would have to support the overload of another.overload of another. • Meta-data critical to understanding andMeta-data critical to understanding and defining the shape of the data.defining the shape of the data.
  • 23. copyright Earthsongs Holistic LLC 2014 What about cross platform portabilityWhat about cross platform portability of the final product?of the final product? Wolf Geiger (1992) - Data is only asWolf Geiger (1992) - Data is only as good as the format in which it isgood as the format in which it is presented to the person who has to usepresented to the person who has to use it. If it is not in a format that they canit. If it is not in a format that they can use there is no point in spending theuse there is no point in spending the time to do any of the processing.time to do any of the processing.
  • 24. copyright Earthsongs Holistic LLC 2014 Real World Example:Real World Example: • Asked the end user to write down exactly whatAsked the end user to write down exactly what they wanted in the report.they wanted in the report. • Asked the manager to write down exactly whatAsked the manager to write down exactly what they wanted in the report.they wanted in the report. • Asked the computer programmer to writeAsked the computer programmer to write down exactly what the clients wanted in thedown exactly what the clients wanted in the report.report. • Two of three matched. Which one did not?Two of three matched. Which one did not?
  • 25. copyright Earthsongs Holistic LLC 2014 Cell Phone Data: How should it beCell Phone Data: How should it be parsed?parsed? • Has to be done on super computers to start based on the volume of the dataHas to be done on super computers to start based on the volume of the data but it has to end in PC formats!but it has to end in PC formats! • Object oriented db with full variable length fields.Object oriented db with full variable length fields. • Needs Multi-dimensional processing:Needs Multi-dimensional processing: – Computational linguistics.Computational linguistics. • Analysis of word stressors.Analysis of word stressors. • Analysis of grammatical syntax.Analysis of grammatical syntax. – Cognitive focus (topic basis).Cognitive focus (topic basis). – Recognized vocal stress vs topic.Recognized vocal stress vs topic. – Risk factor assignment.Risk factor assignment. – Background noise assessment.Background noise assessment. – Probability analysis of each of the factors to determine further review.Probability analysis of each of the factors to determine further review. • Data presentation tools have to be in a format that is currently used thatData presentation tools have to be in a format that is currently used that everyone understands where to look to find the important information.everyone understands where to look to find the important information. • Cross platform portability!!!!Cross platform portability!!!!
  • 26. copyright Earthsongs Holistic LLC 2014 Questions?Questions?
  • 27. copyright Earthsongs Holistic LLC 2014 Thank you!Thank you!