An introductory presentation about possibilities that Big Data opens to public safety company like e.g. taking advantage of smart city grids, crime and accident databases
Technology has forced organizations to adapt to a workforce that insists on being mobile, but there are still challenges. Learn more in this infographic. Learn more: http://del.ly/cD9jD8
Despite un-unprecedented technology innovation, since 2004, US labor productivity growth has been going down to a pathetic 0.5% per year. Why Technology doesn't drive growth anymore?
This presentation is an exploration of what Information Technology will look like in 2021. We have spread our predictions across three categories: IT Trends, IT Practice and Impacts on Everyday Life.
Technology has forced organizations to adapt to a workforce that insists on being mobile, but there are still challenges. Learn more in this infographic. Learn more: http://del.ly/cD9jD8
Despite un-unprecedented technology innovation, since 2004, US labor productivity growth has been going down to a pathetic 0.5% per year. Why Technology doesn't drive growth anymore?
This presentation is an exploration of what Information Technology will look like in 2021. We have spread our predictions across three categories: IT Trends, IT Practice and Impacts on Everyday Life.
'Psychometrics Predicting Behaviour, In Theory and In Practice' Dr David Stil...Chinwag
Psychometrics predicting behaviour, in theory and in practice
This two-part talk, the first will look at how personality influences behaviour in social media, tracing the limits and analysis and prediction and comparing to behavioural and keyword targeting. The second part will take a practical look at how data this type of technique can be used to obtain psychological insights and targeting strategies for any key brands or audiences.
Find more info at: http://chinwag.com/insight/psychology
Space Mission UK - Mission 3 Lookbook - 5-11 Nov 2016Chinwag
Space Mission UK is a series of entrepreneur-led missions specifically designed for the UK's top space and satellite application startups. This lookbook covers the ten companies taking part in the third mission to San Francisco, Silicon Valley and Los Angeles.
For more information:
http://spacemissionuk.org
Space Mission UK is supported by Innovate UK and produced by trade mission specialists, Chinwag - http://chinwag.com
The REAL Impact of Big Data on PrivacyClaudiu Popa
The awesome promise of Big Data is tempered by the need to protect personal information. Data scientists must expertly navigate the legislative waters and acquire the skills to protect privacy and security. This talk provides enterprise leaders with answers and suggests questions to ask when the time comes to consider the vast opportunities offered by big data.
W-JAX Keynote - Big Data and Corporate Evolutionjstogdill
A look at corporate evolution from the industrial revolution to the information age - with a focus on how Big Data will make an impact.
Presented at W-JAX Java Conference in Munich Germany, 11-8-11
Social Media Group SMWTO: Mining Data - Developing Foundations & Social GoodSocial Media Group
Big Data has quickly become an industry buzz term and with it promises of exciting opportunities for consumer intelligence. However, increasing privacy concerns and the logistics associated with data refinement bring their own unique set of challenges for marketers. In his session, Cam will outline some emerging solutions to address these concerns as well as innovative opportunities for social good through real-time data analysis.
With the computer revolution vast amount of digital data has become available. With the Internet and smart connected product, the data is growing exponentially. It is estimated that every year, more data is generated than all history prior. And this has repeated over several years.
With all this data, it becomes a platform for something new of its own. In this lecture, we look at what big data is and look at several examples of how to use data. There are many well-know algorithms to analyse data, like clustering and machine learning.
Data Science in the Real World: Making a Difference Srinath Perera
We use the terms “Big Data” and “Data Science” for use of data processing to make sense of the world around us. Spanning many fields, Big Data brings together technologies like Distributed Systems, Machine Learning, Statistics, and Internet of Things together. It is a multi-billion-dollar industry including use cases like targeted advertising, fraud detection, product recommendations, and market surveys. With new technologies like Internet of Things (IoT), these use cases are expanding to scenarios like Smart Cities, Smart health, and Smart Agriculture.
These usecases use basic analytics, advanced statistical methods, and predictive technologies like Machine Learning. However, it is not just about crunching the data. Some usecases like Urban Planning can be slow, and there is enough time to process the data. However, with use cases like traffic, patient monitoring, surveillance the the value of results degrades much faster with time and needs results within milliseconds to seconds. Collecting data from many sources, cleaning them up, processing them using computation clusters, and doing all these fast is a major challenge.
This talk will discuss motivation behind big data and data science and how it can make a difference. Then it will discuss the challenges, systems, and methodologies for implementing and sustaining a data science pipeline.
Technology tech trends 2022 and beyond Brian Pichman
It's that time of year again, where we get to look ahead and finally have some good news. Tech enthusiast Brian Pichman of the Evolve Project will showcase the latest technology trends and how that impact our learning spaces and spaces at home. It is guaranteed to make you forget about all of 2020 and 2021....well maybe that's a new technology about to be released, the MIB memory eraser. Join this exciting webinar and leave with some high hopes of new technology to explore!
Fiz Yazdi & Jesmond Allen talk presented at UX Scotland and UX Cambridge 2016.
We talk about our experience setting up our own UX team at cxpartners, and about working with clients including Nationwide Building Society and Bristol City Council to help them grow their own UX teams.
GDG Cloud Southlake #15: Mihir Mistry: Cybersecurity and Data Privacy in an A...James Anderson
Addressing Cybersecurity and Data Privacy concerns in the evolving world of AR, VR and Metaverse.
Mihir Mistry is a leader in Strategy, Governance, Risk and Compliance. Leads the delivery service lines for Controls & Compliance, Risk & Advisory, Cloud and Architecture. Takes a very programmatic approach to solving and delivering security based projects. Proven business and entrepreneurial skills to deliver custom, highly visible projects in front of the C-suite and Board of Directors. Diverse knowledge base and framework expertise that includes NIST, HIPAA, HITRUST, CIS, GLBA, ISO, GDPR, CLOUD, PCI and others. Global experience across North America, Europe, Asia and Australia.
https://gdg.community.dev/events/details/google-gdg-cloud-southlake-presents-gdg-cloud-southlake-15-mihir-mistry-cybersecurity-and-data-privacy-in-an-arvr-metaverse-world/
'Psychometrics Predicting Behaviour, In Theory and In Practice' Dr David Stil...Chinwag
Psychometrics predicting behaviour, in theory and in practice
This two-part talk, the first will look at how personality influences behaviour in social media, tracing the limits and analysis and prediction and comparing to behavioural and keyword targeting. The second part will take a practical look at how data this type of technique can be used to obtain psychological insights and targeting strategies for any key brands or audiences.
Find more info at: http://chinwag.com/insight/psychology
Space Mission UK - Mission 3 Lookbook - 5-11 Nov 2016Chinwag
Space Mission UK is a series of entrepreneur-led missions specifically designed for the UK's top space and satellite application startups. This lookbook covers the ten companies taking part in the third mission to San Francisco, Silicon Valley and Los Angeles.
For more information:
http://spacemissionuk.org
Space Mission UK is supported by Innovate UK and produced by trade mission specialists, Chinwag - http://chinwag.com
The REAL Impact of Big Data on PrivacyClaudiu Popa
The awesome promise of Big Data is tempered by the need to protect personal information. Data scientists must expertly navigate the legislative waters and acquire the skills to protect privacy and security. This talk provides enterprise leaders with answers and suggests questions to ask when the time comes to consider the vast opportunities offered by big data.
W-JAX Keynote - Big Data and Corporate Evolutionjstogdill
A look at corporate evolution from the industrial revolution to the information age - with a focus on how Big Data will make an impact.
Presented at W-JAX Java Conference in Munich Germany, 11-8-11
Social Media Group SMWTO: Mining Data - Developing Foundations & Social GoodSocial Media Group
Big Data has quickly become an industry buzz term and with it promises of exciting opportunities for consumer intelligence. However, increasing privacy concerns and the logistics associated with data refinement bring their own unique set of challenges for marketers. In his session, Cam will outline some emerging solutions to address these concerns as well as innovative opportunities for social good through real-time data analysis.
With the computer revolution vast amount of digital data has become available. With the Internet and smart connected product, the data is growing exponentially. It is estimated that every year, more data is generated than all history prior. And this has repeated over several years.
With all this data, it becomes a platform for something new of its own. In this lecture, we look at what big data is and look at several examples of how to use data. There are many well-know algorithms to analyse data, like clustering and machine learning.
Data Science in the Real World: Making a Difference Srinath Perera
We use the terms “Big Data” and “Data Science” for use of data processing to make sense of the world around us. Spanning many fields, Big Data brings together technologies like Distributed Systems, Machine Learning, Statistics, and Internet of Things together. It is a multi-billion-dollar industry including use cases like targeted advertising, fraud detection, product recommendations, and market surveys. With new technologies like Internet of Things (IoT), these use cases are expanding to scenarios like Smart Cities, Smart health, and Smart Agriculture.
These usecases use basic analytics, advanced statistical methods, and predictive technologies like Machine Learning. However, it is not just about crunching the data. Some usecases like Urban Planning can be slow, and there is enough time to process the data. However, with use cases like traffic, patient monitoring, surveillance the the value of results degrades much faster with time and needs results within milliseconds to seconds. Collecting data from many sources, cleaning them up, processing them using computation clusters, and doing all these fast is a major challenge.
This talk will discuss motivation behind big data and data science and how it can make a difference. Then it will discuss the challenges, systems, and methodologies for implementing and sustaining a data science pipeline.
Technology tech trends 2022 and beyond Brian Pichman
It's that time of year again, where we get to look ahead and finally have some good news. Tech enthusiast Brian Pichman of the Evolve Project will showcase the latest technology trends and how that impact our learning spaces and spaces at home. It is guaranteed to make you forget about all of 2020 and 2021....well maybe that's a new technology about to be released, the MIB memory eraser. Join this exciting webinar and leave with some high hopes of new technology to explore!
Fiz Yazdi & Jesmond Allen talk presented at UX Scotland and UX Cambridge 2016.
We talk about our experience setting up our own UX team at cxpartners, and about working with clients including Nationwide Building Society and Bristol City Council to help them grow their own UX teams.
GDG Cloud Southlake #15: Mihir Mistry: Cybersecurity and Data Privacy in an A...James Anderson
Addressing Cybersecurity and Data Privacy concerns in the evolving world of AR, VR and Metaverse.
Mihir Mistry is a leader in Strategy, Governance, Risk and Compliance. Leads the delivery service lines for Controls & Compliance, Risk & Advisory, Cloud and Architecture. Takes a very programmatic approach to solving and delivering security based projects. Proven business and entrepreneurial skills to deliver custom, highly visible projects in front of the C-suite and Board of Directors. Diverse knowledge base and framework expertise that includes NIST, HIPAA, HITRUST, CIS, GLBA, ISO, GDPR, CLOUD, PCI and others. Global experience across North America, Europe, Asia and Australia.
https://gdg.community.dev/events/details/google-gdg-cloud-southlake-presents-gdg-cloud-southlake-15-mihir-mistry-cybersecurity-and-data-privacy-in-an-arvr-metaverse-world/
Bigger than Any One: Solving Large Scale Data Problems with People and MachinesTyler Bell
The informatic challenges of 2013 and beyond are bigger than any one company. This presentation provides an overview of a number of recent, successful crowd-sourced and community-driven applications that combine ‘Big Data’ approaches with Community involvement. The speaker dives into the numbers and specific details of Factual’s approach to large-scale, multi-authored data collection and aggregation, and how the company’s data ethos and business positioning dictates both the shape of its technology and its vision of large-scale, collective data ecosystems.
Why CxOs care about Data Governance; the roadblock to digital masteryCoert Du Plessis (杜康)
This talk covered how data governance are scaled in large organisations, defining self-sustaining ownership models, a mechanism for managing risk and delegating decisions to those with the most knowhow.
Data-Ed Webinar: Demystifying Big Data DATAVERSITY
We are in the middle of a data flood and we need to figure out how to tame it without drowning. Most of what has been written about Big Data is focused on selling hardware and services. But what about a Big Data Strategy that guides hardware and software decisions? While virtually every major organization is faced with the challenge of figuring out the approach for and the requirements of this new development, jumping into the fray hastily and unprepared will only reproduce the same dismal IT project results as previously experienced. Join Dr. Peter Aiken as he will debunk a number of misconceptions about Big Data as your un-typical IT project. He will provide guidance on how to establish realistic Big Data management plans and expectations, and help demonstrate the value of such actions to both internal and external decision makers without getting lost in the hype.
Takeaways:
- The means by which Big Data techniques can complement existing data management practices
- The prototyping nature of practicing Big Data techniques
- The distinct ways in which utilizing Big Data can generate business value
- Bigger Data isn’t always Better Data
We are in the middle of a data flood and we need to figure out how to tame it without drowning. Most of what has been written about Big Data is focused on selling hardware and services. But what about a Big Data Strategy that guides hardware and software decisions? While virtually every major organization is faced with the challenge of figuring out the approach for and the requirements of this new development, jumping into the fray hastily and unprepared will only reproduce the same dismal IT project results as previously experienced. Join Dr. Peter Aiken as he will debunk a number of misconceptions about Big Data as your un-typical IT project. He will provide guidance on how to establish realistic Big Data management plans and expectations, and help demonstrate the value of such actions to both internal and external decision makers without getting lost in the hype.
Check out more of our Data-Ed webinars here: www.datablueprint.com/webinar-schedule
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeWalaa Eldin Moustafa
Dynamic policy enforcement is becoming an increasingly important topic in today’s world where data privacy and compliance is a top priority for companies, individuals, and regulators alike. In these slides, we discuss how LinkedIn implements a powerful dynamic policy enforcement engine, called ViewShift, and integrates it within its data lake. We show the query engine architecture and how catalog implementations can automatically route table resolutions to compliance-enforcing SQL views. Such views have a set of very interesting properties: (1) They are auto-generated from declarative data annotations. (2) They respect user-level consent and preferences (3) They are context-aware, encoding a different set of transformations for different use cases (4) They are portable; while the SQL logic is only implemented in one SQL dialect, it is accessible in all engines.
#SQL #Views #Privacy #Compliance #DataLake
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
2. Winning the customer
More: How Target Figured Out A Teen Girl Was Pregnant Before Her Father Did, How Companies Learn Your Secrets
Photo illustration by James Emmerman. Photo Courtesy of Shutterstock.
3. Predicting and influencing behavior
imgsrc:https://www.seoclerk.com/pics/122043-1.jpg
Imgsrc:http://www.thesleuthjournal.com/wp-content/uploads/2013/10/mind-control2.jpg
More: Facebook 'likes' can reveal your secrets, study finds,
Everything You Need to Know About Facebook’s Controversial Emotion Experiment
Could Facebook swing an election?
4. Gimme moar money!
Moar: How Kaggle Solves Big Problems With Big Data Contests,
Improve HealthCare, win $3,000,000, NASA’s Asteroid Grand Challenge
9. Why now?
„We've always had the data, but it is only now
we have tools to ask a question of that data
and get the answer back before we forget why
we asked the question in the first place.”
-- Hilary Mason
More: Dirty Secrets of Data Science by Hilary Mason
13. Smart Parking
Trafic
congestion
Smart Roads
City Sensors
Air pollution
Landslide and
Avalanche
Earthquake
early
detection
Environment
Sensors
Chemical
leakage
detection
Leakages
detection
River floods
Water
Sensors
Smart Grids
Tank level (oil,
gas)
Water flow
Metering
Sensors
Explosive and
Hazardous
gases
Radiation
levels
Liquid
presence
Security &
Emergency
Sensors
Sensors for Smart Cities
More: 50 Sensor Applications for a Smarter World
14. Smart Cities Now
cities around the world are expected to spend $41 trillion in the
next 20 years ($108 billion by 2020) on infrastructure upgrade
Source: In the wake of Intel’s deal with San José, what makes a smart city?
15. More about Smart Cities
• IBM SmarterCity Public Safety (video)
• IBM Smarter Public Safety (video)
• IBM Miami-Dade Police Department - IBM Smarter Planet
Leadership Series Client Success Video (video)
• TED: Kent Larson: Brilliant designs to fit more people in every
city (video)
• Miami-Dade County and IBM Establish Public-Private
Partnership on Smarter Cities Initiative
• Cisco will make KC a 'smart city‘
• Governments can bridge costs and services gaps with sensor
networks
16. SMAC in Public Safety
•How to best allocate limited
resources to multiple events
•money efficiency
•Better prioritization
•Spend less on IT infrastructure –
more on critical resources
•On-demand, real-time, scalable
•Gives ability to deliver realtime
data to responders in the field
•Makes actions more traceable
(source for data analytics)
•Historical event analysis
•Making improvements for the
future
•Source for data analysis
•Additional information for
responders
•Sometimes easier to call for help
•Collaboration between
emergency agencies & public
Social Mobile
(Data)
Analytics
Cloud
More: How Mobility, Analytics, Cloud and Social Media are Changing the Game in Emergency Management
17. „Social” in action
Source: http://mashable.com/2012/02/06/social-media-public-safety-2/
Photo source: http://www.preppersworldusa.com/2014/01/25/columbia-mall-shooting-press-conference-police-confirmed-3-people-killed/
19. Data Science, stupid!
in fact, it’s data analysis on steroids
more: The Culture of Big Data Analytics, Interactive chart: Drew Conway blog, Image source: Data Science e Python
20. OSEMN model
Obtain
• From other location
• Query from
database or API
• Extract from
another file
• Generate data (e.g.
Sensors)
Scrub
• Filtering lines
• Extracting columns
or words
• Rreplacing values
• Handling missing
values
• Converting formats
Explore
• Understanding data
• Deriving statistics
• Creating
visualizations
Model
• Clustering
• Classification
• Regression
• Dimensionality
reduction
iNterpret
• Drawing conclusion
from data
• Evaluating meaning
of results
• Communicating
result
Source: A Taxonomy of Data Science
21. Who is Data Scientist
• Key skills
• Math & Stats
• NoSQL (& SQL)
• Python, R & Java
• Algorithms
• Soft skills
• Visualization
Up until 2006 there was some data available from various web pages, but contemporary technical solutions were inadequate to process it (SQL databases, single core servers)
In 1998 Google was founded
In 2006 Google announced BigTable and GFS
In coming years, many open source implementations of BigTable were created, many companies went into it like Facebook, Twitter, Amazon
Data analytics already existed, but were limited to small amounts of data
Customers are attached to their routines that are so ingrained in their mind, that it is almost impossible to change their habits
Target found out, that if woman is in the second trimester, if she’s going to do shopping there, most likely they attach her to the brand and she will be doing more and more shopping there
Entire thing was discovered when father of teenage girl in Minessota learned that his daughter received an email with coupons for baby care products. He was outraged, but then his daughter admit being pregnant.
University of Cambridge findings from ~60k people – ability to predict based on „likes” only:
Caucasian or African-American, gender, male sexuality, Democratic or Republican leanings, and detecting Christians and Muslims (80-95% of accuracy)
Smoking, using drugs, drinking alcohol (~70% accuraccy)
Single vs in relationship (67% accurracy)
Parents divorced before age of 21 (60% accurracy)
For one week in 2012, Facebook altered the algorithms it uses to determine which status updates appeared in the News Feed of 689,003 randomly selected users (about 1 of every 2,500 Facebook users).
For people who had positive content reduced in their News Feed, a larger percentage of words in people’s status updates were negative and a smaller percentage were positive. When negativity was reduced, the opposite pattern occurred. These results suggest that the emotions expressed by friends, via online social networks, influence our own moods, constituting, to our knowledge, the first experimental evidence for massive-scale emotional contagion via social networks
The firm announced plans to mine data of millions of American voters, just as an experiment which tried to encourage people to vote was disclosed.
Netflix contest $1,000,000 – recommendations for new movies
Heritage Provider Network - $3,000,000 - Identify patients who will be admitted to a hospital within the next year
NASA contest $35,000 – spotting earth-threatening asteroids
General Electric $250,000 – optimization of flight routes
There is no one good definition
Standard data storage solutions (SQL databases) are not sufficient
Multiple sources
Plethora of sources
Enormous data ingestion per minute
Smarter resource management
Water
Energy
Green gases generation
Cost optimization
Disaster detection
Forest fires
Leakages
Traffic accidents
Life & health quality improvement
Interesting sensors from public safety perspective
96% of public safety agencies use social networks for push/pull
Several larger agencies have established dedicated units (often within police departments) to provide real-time intelligence. Real-time crime centers operate in several major U.S. cities — notably, New York City and Houston, TX. These centers have access to powerful data aggregation and decision support tools. The New York City Police Department has created a social media unit within its intelligence division.
Criminals are stupid enough to use social media during commiting a crime
Crime analysis
Situational awareness
Crime reduction strategies
Patrol resource optimization
140,000 to 190,000 people with deep analytic skills as well as 1.5 million managers and analysts will be needed by 2018 to fill jobs in Big Data (Source: McKinsey)
There are two types of data scientist:
PhD level, where they invent new stuff
Data Engineer level, where they process data