SlideShare a Scribd company logo
Click through for eight facts and eight fictions regarding Big Data, as identified by
ThreatTrack Security.

1
Fact #1: You are Big Data. Much of the world’s Big Data is
created as metadata from users’ smartphones and GPS traffic.
Every day you create metadata with smartphones that enable GPS location services.
Every picture you take, every website you visit, every route you map creates
metadata that is stored and available for analysis. With more than five billion mobile
phones in use, including more than one billion smartphones in 2012, according to
research firm Strategy Analytics, it’s no wonder that many enterprises and
government organizations are interested in gleaning valuable content from the
information.

2
Fact #2: Big Data tends to be mined poorly to build
ineffective threat analysis algorithms.
With all the metadata that exists, we are only now figuring out how to make sense of
it and how to cultivate beneficial data from it. For one, enterprises traditionally
haven’t had the resources in place to analyze metadata. As those investments
increase, the mining for trends and useful analysis will increase as well.
3
Fact #3: Big Data is automating tasks that used to
involve tedious manual labor.
Software companies are developing better business intelligence tools that can not
only analyze metadata, but also automate tasks to more quickly make use of that
data to their advantage. This allows companies to be more flexible and also make the
analysis of Big Data much less costly than in the past.
4
Fact #4: Big Data is being used to categorize and classify malware more
effectively, grouping bad files the same way Google ranks pages.
As more information is gleaned about malware and more analysis picks up on trends, algorithms for categorizing
and classifying malware are being developed to help security providers. We at ThreatTrack Security use Big Data
in four ways: first, to discuss CART (Classification and Regression Trees) for predictive classification of event
modifiers; second, to make use of Shewhart Control Charts for outlier threat detection; third, we use Splines for
non-linear exploratory modeling; lastly, we apply the Goodness of fit principle to check for stability of historical
threat data and constructing a parsimonious model for APTs.
Our case study works by using a closed loop system beginning with identifying a file/URL, correlating the
information and finding where the file initially came from, where it was downloaded from, how it entered the
company's data space, what it downloaded, what it installed, its current payload and so on.

5
Fact #5: Big Data theory is moving faster than the reality of what an
enterprise is capable of from both a technology and manpower
standpoint.
Since much of Big Data is derived from user-centric behavior and usage, it moves a lot faster than what an enterprise typically
generates from its application systems. About 70 percent of the digital universe has been created by individuals, not
corporations.
The primary reason why the Big Data theory is moving faster than its practice is to address the solution for managing such
humongous amounts of data. Oceans of data will be created between now and the year 2020, resulting in a 4,300 percent
increase in annual data generation as the macro drivers of user-generated data, along with the shift from analog to digital
media, propel us to the next frontier.
The Big Data tsunami is also causing technologies to be modernized. What used to be stored in conventional RDBMS and later in
NoSQL databases are insufficient and cannot be accessed by direct record access methods. The current technology of choice is
not conventional RDBMS but a map-reduced database like Hadoop that operates off distributed hardware substrate.

6
Fact #6: Big Data will create a major shift in
visualization of threats within the next three years.
Visualization of objects in excess of a few million in quantity requires thinking differently. For
instance, imagine the complexity of modeling huge data sets that are increasingly being
gathered by ubiquitous information-sensing mobile devices, aerial sensory
technologies, software logs, cameras, microphones, radio-frequency identification readers
and wireless sensor networks. Right now, the largest memory requirements for visualizing Big
Data working sets can’t be addressed by conventional computing models. That’s why the
science of visualization will have to be re-imagined and re-visited in the next three years.

7
Fact #7: Yesterday's antivirus endpoints are becoming
tomorrow's real threat-track vectors.
First, antivirus is a well-understood and mature market where users have a
reasonable idea of what spam is, what viruses are and how to remediate them using
up-to-date antivirus software. Second, the conventional endpoint of corporations is
the edge of the network. Both these paradigms are outdated now. First, antivirus is
not enough to protect against Advanced Persistent Threats. Enterprises need an antithreat type of software to combat sophisticated attacks that find their way in through
endpoints in new and creative ways.

8
Fact #8: Yesterday's endpoints of the brick-and-mortar enterprise have
shifted to the users, with the proliferation of the BYOD paradigm where
user devices are the real endpoints.
This extends the point made in Fact #7. With the advent of BYOD becoming the norm
in the corporate environment, the real vulnerable endpoint of enterprises has turned
out to be handheld smartphones. As more smartphones connect to corporate
networks and data, it increases the vulnerabilities organizations face trying to secure
all those additional points of entry.
9
Fiction #1: Security companies are equipped to handle
the volume and velocity of Big Data.
Like many enterprises, security companies are also learning to wrap their hands
around Big Data, and the theory of Big Data for that matter, eliminating potential
vulnerabilities to ensure that the data remains clean for analysis and production. As
the concept of Big Data grows and evolves, security companies must perpetually
grow and evolve too.
10
Fiction #2: Security developers are easily extracting
value from collected data.
The old saying “you don’t know what you don’t know” applies to security developers.
Without proper analysis tools in place, security companies aren’t able to extract
valuable content from the collected data. Only with those analysis tools, algorithms
and applications can developers truly garner valuable insight from collected data.
11
Fiction #3: Analytics technology is ready-made for
security.
From the phrase “finding a needle in the haystack,” analytics is useless in haystacks of
data where there are no needles to begin with. The hype has caused us to create
massive data stacks with poor references (or indices) around those stacks. Any data
analyst will attest to the fact that a better index of smaller data sets yields better
analytics than a larger data set with lame indices.
12
Fiction #4: Leveraging Big Data in a security context is as
simple as using it for any generalized purpose.
Successfully leveraging Big Data first must address the point in Fiction #3, that analytics is ready made for security.
Second, establishing a security context is the next problem. Security context can be established connecting the
relationships (after map reducing the data itself) between data sets to reveal valuable insights in the patterns that
were previously not correlated or compared. Mining for trends requires that data be managed coherently at first.
Similarly, mining for relationships requires that trends be understood. Only after you have the data map
reduced, and the trends in it understood, can you then mine for relationships among the trends of the mapreduced data farms. Only after all of these prerequisites are achievable, can you establish the big security context
of Big Data.
Think of security context as the metadata fabric of relationships, which is a lot more powerful and useful for
visualizing risks, threats and predictive analytics.

13
Fiction #5: Big Data will cause major change in the
security industry within the next year.
Big Data won’t cause major change in the security industry. Instead, the major change
will be in identifying anomalies that can be identified as advanced security attacks.
And both concepts will join together and work in concert to realize value for
enterprises.
14
Fiction #6: There is a widespread belief that Big Data sets offer a higher
form of intelligence that can generate insights that were previously
impossible.
That’s not true by itself. We need more algorithms that can offer more
intelligence, not bigger data sets. The two kinds of algorithms are: Bayesian
algorithms, which deal with prior occurrences, and predictive analytics, which is
forward facing. Looking at the future, big context in security is going to be more
innovative than Big Data in security.
15
Fiction #7: Big Data searched with dumb algorithms fails to yield what
little data can yield using smarter algorithms.
The concept of Big Data should be about the algorithms and not about the data itself.
Better precision and better searching techniques will trap the breaches. Better
algorithms and lesser data stacks will provide more value than lesser algorithms and
bigger data stacks. The better net will catch better stuff.
16
Fiction #8: Most data scientists have experience with Big Data. This isn’t true
because much of Big Data isn’t directly used; rather, it is summarized or “map
reduced” before being analyzed, which is often not very big.
Data science is a new branch of study. Most data scientists fall into two groups:
statisticians turned programmers, and programmers turned statisticians that compete
for data scientist jobs. While they used to work with data sets and map-reduced data
sets, they’re not as used to working with user profile data, which is what Big Data
primarily consists of.
17

More Related Content

What's hot

Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
IT Support Engineer
 
Graphs in Government
Graphs in GovernmentGraphs in Government
Graphs in Government
Neo4j
 
Big Data Trends
Big Data TrendsBig Data Trends
Big Data Trends
David Feinleib
 
Data Lake-based Approaches to Regulatory-Driven Technology Challenges
Data Lake-based Approaches to Regulatory-Driven Technology ChallengesData Lake-based Approaches to Regulatory-Driven Technology Challenges
Data Lake-based Approaches to Regulatory-Driven Technology Challenges
Booz Allen Hamilton
 
Data set The Future of Big Data
Data set The Future of Big DataData set The Future of Big Data
Data set The Future of Big Data
Data-Set
 
Big data
Big dataBig data
Big data
Ami Redwan Haq
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Geoffrey Fox
 
Big data brings big opportunities for insurers
Big data brings big opportunities for insurersBig data brings big opportunities for insurers
Big data brings big opportunities for insurers
Sarah Freemantle
 
A proposed model_for_cybercrime_detectio
A proposed model_for_cybercrime_detectioA proposed model_for_cybercrime_detectio
A proposed model_for_cybercrime_detectio
Hossam Al-Ansary
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Onyebuchi nosiri
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Onyebuchi nosiri
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Oomph! Recruitment
 
MCAP Big Data Security Intelligence Platform
MCAP Big Data Security Intelligence PlatformMCAP Big Data Security Intelligence Platform
MCAP Big Data Security Intelligence Platform
Sean Ben
 
Disruptive technologies - Session 2 - Blockchain smart_contracts
Disruptive technologies - Session 2 - Blockchain smart_contractsDisruptive technologies - Session 2 - Blockchain smart_contracts
Disruptive technologies - Session 2 - Blockchain smart_contracts
Bohitesh Misra, PMP
 
Big Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityBig Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research Activity
Andry Alamsyah
 
Global Technology Outlook 2012 Booklet
Global Technology Outlook 2012 BookletGlobal Technology Outlook 2012 Booklet
Global Technology Outlook 2012 Booklet
IBM Danmark
 
Data Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data ScienceData Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data Science
DataMites
 
Seminarppt
SeminarpptSeminarppt
Seminarppt
Monali Akhare
 
1. The Importance of Graphs in Government
1. The Importance of Graphs in Government1. The Importance of Graphs in Government
1. The Importance of Graphs in Government
Neo4j
 

What's hot (20)

Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
 
Graphs in Government
Graphs in GovernmentGraphs in Government
Graphs in Government
 
Big Data Trends
Big Data TrendsBig Data Trends
Big Data Trends
 
Data Lake-based Approaches to Regulatory-Driven Technology Challenges
Data Lake-based Approaches to Regulatory-Driven Technology ChallengesData Lake-based Approaches to Regulatory-Driven Technology Challenges
Data Lake-based Approaches to Regulatory-Driven Technology Challenges
 
1
11
1
 
Data set The Future of Big Data
Data set The Future of Big DataData set The Future of Big Data
Data set The Future of Big Data
 
Big data
Big dataBig data
Big data
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
 
Big data brings big opportunities for insurers
Big data brings big opportunities for insurersBig data brings big opportunities for insurers
Big data brings big opportunities for insurers
 
A proposed model_for_cybercrime_detectio
A proposed model_for_cybercrime_detectioA proposed model_for_cybercrime_detectio
A proposed model_for_cybercrime_detectio
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
 
MCAP Big Data Security Intelligence Platform
MCAP Big Data Security Intelligence PlatformMCAP Big Data Security Intelligence Platform
MCAP Big Data Security Intelligence Platform
 
Disruptive technologies - Session 2 - Blockchain smart_contracts
Disruptive technologies - Session 2 - Blockchain smart_contractsDisruptive technologies - Session 2 - Blockchain smart_contracts
Disruptive technologies - Session 2 - Blockchain smart_contracts
 
Big Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityBig Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research Activity
 
Global Technology Outlook 2012 Booklet
Global Technology Outlook 2012 BookletGlobal Technology Outlook 2012 Booklet
Global Technology Outlook 2012 Booklet
 
Data Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data ScienceData Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data Science
 
Seminarppt
SeminarpptSeminarppt
Seminarppt
 
1. The Importance of Graphs in Government
1. The Importance of Graphs in Government1. The Importance of Graphs in Government
1. The Importance of Graphs in Government
 

Viewers also liked

Capitalizing on the Internet of Things: a primer
Capitalizing on the Internet of Things: a primerCapitalizing on the Internet of Things: a primer
Capitalizing on the Internet of Things: a primerThe Marketing Distillery
 
7 steps to business success on the Internet of Things
7 steps to business success on the Internet of Things7 steps to business success on the Internet of Things
7 steps to business success on the Internet of ThingsThe Marketing Distillery
 
Internet of Things: manage the complexity, seize the opportunity
Internet of Things: manage the complexity, seize the opportunityInternet of Things: manage the complexity, seize the opportunity
Internet of Things: manage the complexity, seize the opportunityThe Marketing Distillery
 
Big Data 2.0
Big Data 2.0Big Data 2.0
Plantas nativas de Costa Rica en los jardines
Plantas nativas de Costa Rica en los jardinesPlantas nativas de Costa Rica en los jardines
Plantas nativas de Costa Rica en los jardines
Rose Menacho
 

Viewers also liked (7)

Making sense of consumer data
Making sense of consumer dataMaking sense of consumer data
Making sense of consumer data
 
The M2M platform for a connected world
The M2M platform for a connected worldThe M2M platform for a connected world
The M2M platform for a connected world
 
Capitalizing on the Internet of Things: a primer
Capitalizing on the Internet of Things: a primerCapitalizing on the Internet of Things: a primer
Capitalizing on the Internet of Things: a primer
 
7 steps to business success on the Internet of Things
7 steps to business success on the Internet of Things7 steps to business success on the Internet of Things
7 steps to business success on the Internet of Things
 
Internet of Things: manage the complexity, seize the opportunity
Internet of Things: manage the complexity, seize the opportunityInternet of Things: manage the complexity, seize the opportunity
Internet of Things: manage the complexity, seize the opportunity
 
Big Data 2.0
Big Data 2.0Big Data 2.0
Big Data 2.0
 
Plantas nativas de Costa Rica en los jardines
Plantas nativas de Costa Rica en los jardinesPlantas nativas de Costa Rica en los jardines
Plantas nativas de Costa Rica en los jardines
 

Similar to Big Data: 8 facts and 8 fictions

Big Data - Fact or Fiction ?
Big Data -  Fact or Fiction ?Big Data -  Fact or Fiction ?
Big Data - Fact or Fiction ?
Tyrone Systems
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
Shahbaz Anjam
 
Smart Data Module 6 d drive the future
Smart Data Module 6 d drive the futureSmart Data Module 6 d drive the future
Smart Data Module 6 d drive the future
caniceconsulting
 
130214 copy
130214   copy130214   copy
130214 copy
Arpit Arora
 
Data foundation for analytics excellence
Data foundation for analytics excellenceData foundation for analytics excellence
Data foundation for analytics excellence
Mudit Mangal
 
Business with Big data
Business with Big dataBusiness with Big data
Business with Big data
Bruno Curtarelli
 
Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online
caniceconsulting
 
Real callenges in big data security
Real callenges in big data securityReal callenges in big data security
Real callenges in big data security
balasahebcomp
 
Big Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraBig Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraVin Malhotra
 
The future of big data analytics
The future of big data analyticsThe future of big data analytics
The future of big data analytics
Ahmed Banafa
 
The REAL Impact of Big Data on Privacy
The REAL Impact of Big Data on PrivacyThe REAL Impact of Big Data on Privacy
The REAL Impact of Big Data on Privacy
Claudiu Popa
 
Gartner eBook on Big Data
Gartner eBook on Big DataGartner eBook on Big Data
Gartner eBook on Big Data
Jyrki Määttä
 
Global Data Management: Governance, Security and Usefulness in a Hybrid World
Global Data Management: Governance, Security and Usefulness in a Hybrid WorldGlobal Data Management: Governance, Security and Usefulness in a Hybrid World
Global Data Management: Governance, Security and Usefulness in a Hybrid World
Neil Raden
 
Big Data and the Future of Journalism (Futurist Keynote Speaker Gerd Leonhard...
Big Data and the Future of Journalism (Futurist Keynote Speaker Gerd Leonhard...Big Data and the Future of Journalism (Futurist Keynote Speaker Gerd Leonhard...
Big Data and the Future of Journalism (Futurist Keynote Speaker Gerd Leonhard...
Gerd Leonhard
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big Data
IRJET Journal
 
Big Data Dectives
Big Data DectivesBig Data Dectives
Big Data Dectives
- Mark - Fullbright
 
What exactly is big data? What exactly is big data? .pptx
What exactly is big data? What exactly is big data? .pptxWhat exactly is big data? What exactly is big data? .pptx
What exactly is big data? What exactly is big data? .pptx
TusharSengar6
 
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
Edgar Alejandro Villegas
 

Similar to Big Data: 8 facts and 8 fictions (20)

Big Data - Fact or Fiction ?
Big Data -  Fact or Fiction ?Big Data -  Fact or Fiction ?
Big Data - Fact or Fiction ?
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 
Smart Data Module 6 d drive the future
Smart Data Module 6 d drive the futureSmart Data Module 6 d drive the future
Smart Data Module 6 d drive the future
 
130214 copy
130214   copy130214   copy
130214 copy
 
Data foundation for analytics excellence
Data foundation for analytics excellenceData foundation for analytics excellence
Data foundation for analytics excellence
 
Business with Big data
Business with Big dataBusiness with Big data
Business with Big data
 
Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online
 
Real callenges in big data security
Real callenges in big data securityReal callenges in big data security
Real callenges in big data security
 
Big Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraBig Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin Malhotra
 
The future of big data analytics
The future of big data analyticsThe future of big data analytics
The future of big data analytics
 
The REAL Impact of Big Data on Privacy
The REAL Impact of Big Data on PrivacyThe REAL Impact of Big Data on Privacy
The REAL Impact of Big Data on Privacy
 
Gartner eBook on Big Data
Gartner eBook on Big DataGartner eBook on Big Data
Gartner eBook on Big Data
 
Global Data Management: Governance, Security and Usefulness in a Hybrid World
Global Data Management: Governance, Security and Usefulness in a Hybrid WorldGlobal Data Management: Governance, Security and Usefulness in a Hybrid World
Global Data Management: Governance, Security and Usefulness in a Hybrid World
 
Big Data and the Future of Journalism (Futurist Keynote Speaker Gerd Leonhard...
Big Data and the Future of Journalism (Futurist Keynote Speaker Gerd Leonhard...Big Data and the Future of Journalism (Futurist Keynote Speaker Gerd Leonhard...
Big Data and the Future of Journalism (Futurist Keynote Speaker Gerd Leonhard...
 
Big data
Big data Big data
Big data
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big Data
 
Big Data Dectives
Big Data DectivesBig Data Dectives
Big Data Dectives
 
What exactly is big data? What exactly is big data? .pptx
What exactly is big data? What exactly is big data? .pptxWhat exactly is big data? What exactly is big data? .pptx
What exactly is big data? What exactly is big data? .pptx
 
The state of the Big Data market
The state of the Big Data marketThe state of the Big Data market
The state of the Big Data market
 
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
 

More from The Marketing Distillery

From the Internet of Computers to the Internet of Things
From the Internet of Computers to the Internet of ThingsFrom the Internet of Computers to the Internet of Things
From the Internet of Computers to the Internet of ThingsThe Marketing Distillery
 
Smart networked objects and the Internet of Things
Smart networked objects and the Internet of ThingsSmart networked objects and the Internet of Things
Smart networked objects and the Internet of ThingsThe Marketing Distillery
 
Enhancing intelligence with the Internet of Things
Enhancing intelligence with the Internet of ThingsEnhancing intelligence with the Internet of Things
Enhancing intelligence with the Internet of ThingsThe Marketing Distillery
 
M2M innovations invigorate warehouse management
M2M innovations invigorate warehouse managementM2M innovations invigorate warehouse management
M2M innovations invigorate warehouse managementThe Marketing Distillery
 
How Big Data can help optimize business marketing efforts
How Big Data can help optimize business marketing effortsHow Big Data can help optimize business marketing efforts
How Big Data can help optimize business marketing effortsThe Marketing Distillery
 

More from The Marketing Distillery (20)

Capitalizing on the Internet of Things
Capitalizing on the Internet of ThingsCapitalizing on the Internet of Things
Capitalizing on the Internet of Things
 
Managing the Internet of Things
Managing the Internet of ThingsManaging the Internet of Things
Managing the Internet of Things
 
From the Internet of Computers to the Internet of Things
From the Internet of Computers to the Internet of ThingsFrom the Internet of Computers to the Internet of Things
From the Internet of Computers to the Internet of Things
 
Getting started in Big Data
Getting started in Big DataGetting started in Big Data
Getting started in Big Data
 
Smart networked objects and the Internet of Things
Smart networked objects and the Internet of ThingsSmart networked objects and the Internet of Things
Smart networked objects and the Internet of Things
 
Enhancing intelligence with the Internet of Things
Enhancing intelligence with the Internet of ThingsEnhancing intelligence with the Internet of Things
Enhancing intelligence with the Internet of Things
 
Internet of Things application platforms
Internet of Things application platformsInternet of Things application platforms
Internet of Things application platforms
 
Internet of Things building blocks
Internet of Things building blocksInternet of Things building blocks
Internet of Things building blocks
 
Smart cities and the Internet of Things
Smart cities and the Internet of ThingsSmart cities and the Internet of Things
Smart cities and the Internet of Things
 
The ABCs of Big Data
The ABCs of Big DataThe ABCs of Big Data
The ABCs of Big Data
 
M2M innovations invigorate warehouse management
M2M innovations invigorate warehouse managementM2M innovations invigorate warehouse management
M2M innovations invigorate warehouse management
 
How Big Data can help optimize business marketing efforts
How Big Data can help optimize business marketing effortsHow Big Data can help optimize business marketing efforts
How Big Data can help optimize business marketing efforts
 
Big Data analytics
Big Data analyticsBig Data analytics
Big Data analytics
 
The Business Analyst as a leader
The Business Analyst as a leaderThe Business Analyst as a leader
The Business Analyst as a leader
 
Big Data strategy components
Big Data strategy componentsBig Data strategy components
Big Data strategy components
 
Banking on analytics
Banking on analyticsBanking on analytics
Banking on analytics
 
The promise and challenge of Big Data
The promise and challenge of Big DataThe promise and challenge of Big Data
The promise and challenge of Big Data
 
Transforming Big Data into business value
Transforming Big Data into business valueTransforming Big Data into business value
Transforming Big Data into business value
 
Analytics 101
Analytics 101Analytics 101
Analytics 101
 
Exploiting the Internet of Things
Exploiting the Internet of ThingsExploiting the Internet of Things
Exploiting the Internet of Things
 

Recently uploaded

FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 

Big Data: 8 facts and 8 fictions

  • 1. Click through for eight facts and eight fictions regarding Big Data, as identified by ThreatTrack Security. 1
  • 2. Fact #1: You are Big Data. Much of the world’s Big Data is created as metadata from users’ smartphones and GPS traffic. Every day you create metadata with smartphones that enable GPS location services. Every picture you take, every website you visit, every route you map creates metadata that is stored and available for analysis. With more than five billion mobile phones in use, including more than one billion smartphones in 2012, according to research firm Strategy Analytics, it’s no wonder that many enterprises and government organizations are interested in gleaning valuable content from the information. 2
  • 3. Fact #2: Big Data tends to be mined poorly to build ineffective threat analysis algorithms. With all the metadata that exists, we are only now figuring out how to make sense of it and how to cultivate beneficial data from it. For one, enterprises traditionally haven’t had the resources in place to analyze metadata. As those investments increase, the mining for trends and useful analysis will increase as well. 3
  • 4. Fact #3: Big Data is automating tasks that used to involve tedious manual labor. Software companies are developing better business intelligence tools that can not only analyze metadata, but also automate tasks to more quickly make use of that data to their advantage. This allows companies to be more flexible and also make the analysis of Big Data much less costly than in the past. 4
  • 5. Fact #4: Big Data is being used to categorize and classify malware more effectively, grouping bad files the same way Google ranks pages. As more information is gleaned about malware and more analysis picks up on trends, algorithms for categorizing and classifying malware are being developed to help security providers. We at ThreatTrack Security use Big Data in four ways: first, to discuss CART (Classification and Regression Trees) for predictive classification of event modifiers; second, to make use of Shewhart Control Charts for outlier threat detection; third, we use Splines for non-linear exploratory modeling; lastly, we apply the Goodness of fit principle to check for stability of historical threat data and constructing a parsimonious model for APTs. Our case study works by using a closed loop system beginning with identifying a file/URL, correlating the information and finding where the file initially came from, where it was downloaded from, how it entered the company's data space, what it downloaded, what it installed, its current payload and so on. 5
  • 6. Fact #5: Big Data theory is moving faster than the reality of what an enterprise is capable of from both a technology and manpower standpoint. Since much of Big Data is derived from user-centric behavior and usage, it moves a lot faster than what an enterprise typically generates from its application systems. About 70 percent of the digital universe has been created by individuals, not corporations. The primary reason why the Big Data theory is moving faster than its practice is to address the solution for managing such humongous amounts of data. Oceans of data will be created between now and the year 2020, resulting in a 4,300 percent increase in annual data generation as the macro drivers of user-generated data, along with the shift from analog to digital media, propel us to the next frontier. The Big Data tsunami is also causing technologies to be modernized. What used to be stored in conventional RDBMS and later in NoSQL databases are insufficient and cannot be accessed by direct record access methods. The current technology of choice is not conventional RDBMS but a map-reduced database like Hadoop that operates off distributed hardware substrate. 6
  • 7. Fact #6: Big Data will create a major shift in visualization of threats within the next three years. Visualization of objects in excess of a few million in quantity requires thinking differently. For instance, imagine the complexity of modeling huge data sets that are increasingly being gathered by ubiquitous information-sensing mobile devices, aerial sensory technologies, software logs, cameras, microphones, radio-frequency identification readers and wireless sensor networks. Right now, the largest memory requirements for visualizing Big Data working sets can’t be addressed by conventional computing models. That’s why the science of visualization will have to be re-imagined and re-visited in the next three years. 7
  • 8. Fact #7: Yesterday's antivirus endpoints are becoming tomorrow's real threat-track vectors. First, antivirus is a well-understood and mature market where users have a reasonable idea of what spam is, what viruses are and how to remediate them using up-to-date antivirus software. Second, the conventional endpoint of corporations is the edge of the network. Both these paradigms are outdated now. First, antivirus is not enough to protect against Advanced Persistent Threats. Enterprises need an antithreat type of software to combat sophisticated attacks that find their way in through endpoints in new and creative ways. 8
  • 9. Fact #8: Yesterday's endpoints of the brick-and-mortar enterprise have shifted to the users, with the proliferation of the BYOD paradigm where user devices are the real endpoints. This extends the point made in Fact #7. With the advent of BYOD becoming the norm in the corporate environment, the real vulnerable endpoint of enterprises has turned out to be handheld smartphones. As more smartphones connect to corporate networks and data, it increases the vulnerabilities organizations face trying to secure all those additional points of entry. 9
  • 10. Fiction #1: Security companies are equipped to handle the volume and velocity of Big Data. Like many enterprises, security companies are also learning to wrap their hands around Big Data, and the theory of Big Data for that matter, eliminating potential vulnerabilities to ensure that the data remains clean for analysis and production. As the concept of Big Data grows and evolves, security companies must perpetually grow and evolve too. 10
  • 11. Fiction #2: Security developers are easily extracting value from collected data. The old saying “you don’t know what you don’t know” applies to security developers. Without proper analysis tools in place, security companies aren’t able to extract valuable content from the collected data. Only with those analysis tools, algorithms and applications can developers truly garner valuable insight from collected data. 11
  • 12. Fiction #3: Analytics technology is ready-made for security. From the phrase “finding a needle in the haystack,” analytics is useless in haystacks of data where there are no needles to begin with. The hype has caused us to create massive data stacks with poor references (or indices) around those stacks. Any data analyst will attest to the fact that a better index of smaller data sets yields better analytics than a larger data set with lame indices. 12
  • 13. Fiction #4: Leveraging Big Data in a security context is as simple as using it for any generalized purpose. Successfully leveraging Big Data first must address the point in Fiction #3, that analytics is ready made for security. Second, establishing a security context is the next problem. Security context can be established connecting the relationships (after map reducing the data itself) between data sets to reveal valuable insights in the patterns that were previously not correlated or compared. Mining for trends requires that data be managed coherently at first. Similarly, mining for relationships requires that trends be understood. Only after you have the data map reduced, and the trends in it understood, can you then mine for relationships among the trends of the mapreduced data farms. Only after all of these prerequisites are achievable, can you establish the big security context of Big Data. Think of security context as the metadata fabric of relationships, which is a lot more powerful and useful for visualizing risks, threats and predictive analytics. 13
  • 14. Fiction #5: Big Data will cause major change in the security industry within the next year. Big Data won’t cause major change in the security industry. Instead, the major change will be in identifying anomalies that can be identified as advanced security attacks. And both concepts will join together and work in concert to realize value for enterprises. 14
  • 15. Fiction #6: There is a widespread belief that Big Data sets offer a higher form of intelligence that can generate insights that were previously impossible. That’s not true by itself. We need more algorithms that can offer more intelligence, not bigger data sets. The two kinds of algorithms are: Bayesian algorithms, which deal with prior occurrences, and predictive analytics, which is forward facing. Looking at the future, big context in security is going to be more innovative than Big Data in security. 15
  • 16. Fiction #7: Big Data searched with dumb algorithms fails to yield what little data can yield using smarter algorithms. The concept of Big Data should be about the algorithms and not about the data itself. Better precision and better searching techniques will trap the breaches. Better algorithms and lesser data stacks will provide more value than lesser algorithms and bigger data stacks. The better net will catch better stuff. 16
  • 17. Fiction #8: Most data scientists have experience with Big Data. This isn’t true because much of Big Data isn’t directly used; rather, it is summarized or “map reduced” before being analyzed, which is often not very big. Data science is a new branch of study. Most data scientists fall into two groups: statisticians turned programmers, and programmers turned statisticians that compete for data scientist jobs. While they used to work with data sets and map-reduced data sets, they’re not as used to working with user profile data, which is what Big Data primarily consists of. 17