SlideShare a Scribd company logo
Big data and cloud computing for
management
By
Simen Fivelstad Smaaberg (n8661260)
Abstract
 Big data
 What is it?
 What possibilities and challenges lays within?
 Cloud computing
 Software as a Service
 Platform as a Service
 Infrastructure as a Service
 How does big data relate to cloud computing?
 Google / Facebook
 How does Google and Facebook handle big data?
 What big data/cloud services do they offer for others to
use?
Background
 Internet today: a social web
 Huge amounts of data are created by users daily
 Creates your digital footprint
 Need for methods to handle all these data
 The data is used for analysis purposes
 Improve customer experiences
 Increase revenues
Big Data
 The vast volume of data in existence (Arthur, E
2013)
 Tweets, likes, videos, images, comments and so on
 Group of collected information, spans 3 V’s of data
management (Gartner 2011)
Figure 1: Big data 3V’s ( Datameer 2013)
Volume
 Big data is Big
 Exists in one size: large
 Challenges
 Physical storage space
 Logical structuring
 Scaling
Volume
 Big data is Big
 Exists in one size: large
 Challenges
 Physical storage space
 Logical structuring
 Scaling
 Opportunities
 Finding trends, patterns and relationships between data
 Possibilities for in depth analysis
Variety
 Challenges
 Infrastructure to handle
different kinds of media is
required.
 Challenging for
engineers
Figure 2: Big Data (Orange 2011)
Variety
 Challenges
 Infrastructure to handle
different kinds of media is
required.
 Challenging for
engineers
 Opportunities
 Finding patterns and
relationships between
different types of data
Figure 2: Big Data (Orange 2011)
Velocity
 Big data can have different kinds of time-sensitivity
 Real time vs non real time
 Challenges
 Infrastructure that can handle different kinds of time-
sensitivity
Velocity
 Big data can have different kinds of time-sensitivity
 Real time vs non real time
 Challenges
 Infrastructure that can handle different kinds of time-
sensitivity
 Opportunities
 Combine slow moving data with fast moving time
constrained data to give the user a better user experience
Big data success story: Santam Insurance
 About
 South Africas largest short-term insurance company
 Problem
 6-10% of premium revenue were fraud
 Solution
 Big Data prediction analysis
Big data success story: Santam Insurance
 About
 South Africas largest short-term insurance company
 Problem
 6-10% of premium revenue were fraud
 Solution
 Big Data prediction analysis
 Result
 First four months: 1.98 million USD saved
 First three years: ROI of 244%
 Insurance fraud syndicate disceovered
« Big data has the potential to change the way
governments, organizations, and academic
institutions conduct business and make
discoveries, and its likely to change how everyone
lives their day-to-day lives »
– Susan Hauser, VP Microsoft Enterprise and partner
group (Microsoft Enterprise team 2013)
Cloud computing
 Running applications elsewhere and accessing them through your
computer
 You get an “infinite” amount of storage space and computing power
 You pay for what you use
Figure 3: Cloud computing
Big Data and Cloud Computing
 Big data requires enormous amounts of storage
space
 Costly to build and maintain
 Huge engineering challenges
Big Data and Cloud Computing
 Big data requires enormous amounts of storage
space
 Costly to build and maintain
 Huge engineering challenges
 Solution: Cloud Computing!
 Put your data and programs into “the cloud”
 Avoid the hardware problem of big data
 Pay for the computing power you actually use,
 Opens up for smaller companies stepping into big data analysis
Big Data and Cloud Computing
 Big data requires enormous amounts of storage
space
 Costly to build and maintain
 Huge engineering challenges
 Solution: Cloud Computing!
 Put your data and programs into “the cloud”
 Avoid the hardware problem of big data
 Pay for the computing power you actually use,
 Opens up for smaller companies stepping into big data analysis
 Issues: Privacy and trust
 You put your data into someone else's hand
 Is that someone trustworthy?
Google and Hadoop
 2004: Google revolutionized the field of Big Data
and Cloud Computing
 Released papers describing how they handled these
topics
Google and Hadoop
 2004: Google revolutionized the field of Big Data
and Cloud Computing
 Released papers describing how they handled these
topics
 From this Yahoo spawned Hadoop
 Platform that can process and analyse huge amounts of
data on interconnected commodity servers
 Great fit for cloud computing
 Data is spread out and duplicated across servers
 Data is analysed in parallel through MapReduce
 Backbone of Twitter, Facebook, Yahoo and eBay
Google
 Huge competitor in the field of big data and cloud
computing
 Big data used internally
 Index searches
 Provide email services
 Provide advertizing
 External Big data and cloud services
 Software as a Service: Google docs, Gmail etc
 Platform as a Service: Google app engine
 Infrastructure as a Service: Google compute engine
 Gives developers access to the same infrastructure google itself
is run on
Facebook
 Forerunner in the field of Big Data
 Handling massive amounts of data daily
 2.5 billion status updates, wall posts, photos, videos and
comments
 2.7 billion likes
 300 million uploaded photos
 500 Tb new integrated data every day (2012)
Facebook
 Forerunner in the field of Big Data
 Handling massive amounts of data daily
 2.5 billion status updates, wall posts, photos, videos and
comments
 2.7 billion likes
 300 million uploaded photos
 500 Tb new integrated data every day (2012)
 Facebook is run on top of Hadoop
 100 Petabytes of storage
 Underpins analysis and everyday services
 New data is put into one of their Hadoop clusters physically
residing in one of their data centers
 Data is analysed when needed or at specific intervals
(hourly/daily) through MapReduce
Facebook Insight
 Tool that provides page owners (both facebook pages and
ordinary web pages) with metrics about their content.
(Facebook 2013)
 Number of visits
 Facebook referrals
 Visitor demographics (age, gender, location, language)
 Connects Facebook’s big data with users visiting your page to
provide metrics that can be used to improve your business’s
online performance
 Generated through Facebook’s Hadoop cluster
Figure 4: User demographics (Campalyst 2012)
Facebook graph search
 Search engine for your social circle (Bea, F 2013)
 Allows for search on relationships between
people, likes, comments, photos etc.
 Possible because of
Facebook’s big data
 Privacy concerns
Figure 5: Facebook restaurant search Figure 6: Facebook TV show search
The Future
 Big data and cloud computing is in constant change
 More and more usage areas are found
 Medical diagnostics
 Weather forecasts
 Particle physics
 Fraud prevention and detection
 Etc.
 Prediction: We have barely seen the start of its
dominance
References
 Arthur, E. 2013. "Big Data“. Alaska Business Monthly, vol. 29, no. 1, pp. 72-72. Retrieved from
http://search.proquest.com.ezp01.library.qut.edu.au/docview/1271622055
 Gartner. 2011. “Solving Big Data Challenge involves more than just managing volumes of data”.
Accessed June 4, 2013. http://www.gartner.com/newsroom/id/1731916
 Strickland, J. “How Cloud Computing Works”. Accessed June 6, 2013.
http://computer.howstuffworks.com/cloud-computing/cloud-computing.htm
 Gartner. “Software as a Service (SaaS)”. Accessed June 6. 2013. http://www.gartner.com/it-
glossary/software-as-a-service-saas/
 Chong, R. 2011. “The perfect marriage: Hadoop and Cloud”. Accessed June 4, 2013.
http://thoughtsoncloud.com/index.php/2011/10/the-perfect-marriage-hadoop-and-cloud/
 Microsoft Enterprise Team. “The Big Bang: How the Big Data Explosion Is Changing the
World”. Last Modified March 27, 2013.
http://www.microsoft.com/enterprise/it-trends/big-data/articles/The-Big-Bang-How-the-Big-Data-
Explosion-Is-Changing-the-World.aspx#fbid=8RIFw1BLCG2
 Metz, C. 2011. “How Yahoo Spawner Hadoop, the Future of Big Data”. Accessed June 5, 2013.
http://www.wired.com/wiredenterprise/2011/10/how-yahoo-spawned-hadoop/all/1
 Big Data Insights. 2013. “How Facebook uses Hadoop and Hive”. Accessed June 05, 2013.
http://hortonworks.com/blog/how-facebook-uses-hadoop-and-hive/
 Facebook. “Insights”. Last modified May 30, 2013.
https://developers.facebook.com/docs/insights/
 Bea, F. 2013. “How Facebook’s Graph Search Works…Sort Of". Accessed June 05, 2013.
http://www.digitaltrends.com/social-media/how-facebook-graph-search-works/
 IBM. 2013. “IBM Business Analytics SPSS: Santam insurance”. Last modified May 28, 2013.
http://www-01.ibm.com/software/success/cssdb.nsf/CS/SANS-
985HX2?OpenDocument&Site=default&cty=en_us
References - Illustrations
 Datameer. 2013. “What is Big Data?”. Digital
Image. Viewed June 8, 2013.
http://www.datameer.com/product/big-data.html
 Orange. 2011. “Analyst insight”. Digital Image.
Viewed June 11, 2013. http://www.orange-
business.com/en/magazine/analyst-insight-
december-2011
 Campalyst. 2012. “How to measure website visitors’
demographics: hidden Facebook Insights gem”.
Digital Image. Viewed June 11, 2013.
http://blog.campalyst.com/2012/10/10/how-to-
measure-website-visitors-demographics-hidden-
facebook-insights-gem/

More Related Content

What's hot

Introduction to Cloud Computing
Introduction to Cloud ComputingIntroduction to Cloud Computing
Introduction to Cloud Computing
Unmesh Ballal
 
Ey35869874
Ey35869874Ey35869874
Ey35869874
IJERA Editor
 
About Cloud Computing
About Cloud ComputingAbout Cloud Computing
About Cloud Computing
Naman Talati
 
Cloud Computing Documentation Report
Cloud Computing Documentation ReportCloud Computing Documentation Report
Cloud Computing Documentation Report
Ajit Yadav
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computing
vishnu varunan
 
cloud computing documentation
cloud computing documentationcloud computing documentation
cloud computing documentation
shilpa bojji
 
A REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENT
A REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENTA REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENT
A REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENT
INTERNATIONAL JOURNAL OF COMPUTERS AND TECHNOLOGY
 
A proposal for implementing cloud computing in newspaper company
A proposal for implementing cloud computing in newspaper companyA proposal for implementing cloud computing in newspaper company
A proposal for implementing cloud computing in newspaper company
Kingsley Mensah
 
Cloud computing report
Cloud computing reportCloud computing report
Cloud computing report
Vamsi Krishna Vanama
 
Cloud Computing- Proposal (Autosaved)
Cloud Computing- Proposal (Autosaved)Cloud Computing- Proposal (Autosaved)
Cloud Computing- Proposal (Autosaved)
Zuhair Haroon khan
 
Cloud computing by Rajat Shukla
Cloud computing by Rajat ShuklaCloud computing by Rajat Shukla
Cloud computing by Rajat Shukla
Rajat Shukla
 
Cloud Computing? What is it and its future trends?
Cloud Computing? What is it and its future trends?Cloud Computing? What is it and its future trends?
Cloud Computing? What is it and its future trends?
ziaurrehman4484
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computing
vishnu varunan
 
International journal of computer science and innovation vol 2015-n2-paper2
International journal of computer science and innovation  vol 2015-n2-paper2International journal of computer science and innovation  vol 2015-n2-paper2
International journal of computer science and innovation vol 2015-n2-paper2
sophiabelthome
 
Cloud
CloudCloud
Cloud
CloudCloud
The Cloud Presentation 2016
The Cloud Presentation 2016The Cloud Presentation 2016
The Cloud Presentation 2016
Joel Kline
 
CLOUD COMPUTING_proposal
CLOUD COMPUTING_proposalCLOUD COMPUTING_proposal
CLOUD COMPUTING_proposal
Laud Randy Amofah
 
Cloud Computing & Big Data
Cloud Computing & Big DataCloud Computing & Big Data
Cloud Computing & Big Data
Mrinal Kumar
 
Debunking common cloud hosting myths
Debunking common cloud hosting mythsDebunking common cloud hosting myths
Debunking common cloud hosting myths
manoharparakh
 

What's hot (20)

Introduction to Cloud Computing
Introduction to Cloud ComputingIntroduction to Cloud Computing
Introduction to Cloud Computing
 
Ey35869874
Ey35869874Ey35869874
Ey35869874
 
About Cloud Computing
About Cloud ComputingAbout Cloud Computing
About Cloud Computing
 
Cloud Computing Documentation Report
Cloud Computing Documentation ReportCloud Computing Documentation Report
Cloud Computing Documentation Report
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computing
 
cloud computing documentation
cloud computing documentationcloud computing documentation
cloud computing documentation
 
A REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENT
A REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENTA REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENT
A REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENT
 
A proposal for implementing cloud computing in newspaper company
A proposal for implementing cloud computing in newspaper companyA proposal for implementing cloud computing in newspaper company
A proposal for implementing cloud computing in newspaper company
 
Cloud computing report
Cloud computing reportCloud computing report
Cloud computing report
 
Cloud Computing- Proposal (Autosaved)
Cloud Computing- Proposal (Autosaved)Cloud Computing- Proposal (Autosaved)
Cloud Computing- Proposal (Autosaved)
 
Cloud computing by Rajat Shukla
Cloud computing by Rajat ShuklaCloud computing by Rajat Shukla
Cloud computing by Rajat Shukla
 
Cloud Computing? What is it and its future trends?
Cloud Computing? What is it and its future trends?Cloud Computing? What is it and its future trends?
Cloud Computing? What is it and its future trends?
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computing
 
International journal of computer science and innovation vol 2015-n2-paper2
International journal of computer science and innovation  vol 2015-n2-paper2International journal of computer science and innovation  vol 2015-n2-paper2
International journal of computer science and innovation vol 2015-n2-paper2
 
Cloud
CloudCloud
Cloud
 
Cloud
CloudCloud
Cloud
 
The Cloud Presentation 2016
The Cloud Presentation 2016The Cloud Presentation 2016
The Cloud Presentation 2016
 
CLOUD COMPUTING_proposal
CLOUD COMPUTING_proposalCLOUD COMPUTING_proposal
CLOUD COMPUTING_proposal
 
Cloud Computing & Big Data
Cloud Computing & Big DataCloud Computing & Big Data
Cloud Computing & Big Data
 
Debunking common cloud hosting myths
Debunking common cloud hosting mythsDebunking common cloud hosting myths
Debunking common cloud hosting myths
 

Similar to INN530 - Assignment 2, Big data and cloud computing for management

IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET Journal
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
Shahbaz Anjam
 
Complete-SRS.doc
Complete-SRS.docComplete-SRS.doc
Complete-SRS.doc
jadhavpravin920
 
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTINGSECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
IJNSA Journal
 
Security issues associated with big data in cloud computing
Security issues associated with big data in cloud computingSecurity issues associated with big data in cloud computing
Security issues associated with big data in cloud computing
IJNSA Journal
 
Analysis on big data concepts and applications
Analysis on big data concepts and applicationsAnalysis on big data concepts and applications
Analysis on big data concepts and applications
IJARIIT
 
Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.
saranya270513
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...
johnmutiso245
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...
johnmutiso245
 
R180305120123
R180305120123R180305120123
R180305120123
IOSR Journals
 
Big Data.pdf
Big Data.pdfBig Data.pdf
Big Data.pdf
AnilaAbid2
 
Big dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyondBig dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyond
Patrick Bouillaud
 
Big data upload
Big data uploadBig data upload
Big data upload
Bhavin Tandel
 
big-data.pdf
big-data.pdfbig-data.pdf
big-data.pdf
aditi276464
 
Big data mining
Big data miningBig data mining
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsWhitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Happiest Minds Technologies
 
BIG Data and Methodology-A review
BIG Data and Methodology-A reviewBIG Data and Methodology-A review
BIG Data and Methodology-A review
Shilpa Soi
 
The What, Why and How of Big Data
The What, Why and How of Big DataThe What, Why and How of Big Data
The What, Why and How of Big Data
Luca Naso
 
Big data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docxBig data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docx
hartrobert670
 

Similar to INN530 - Assignment 2, Big data and cloud computing for management (20)

IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 
Complete-SRS.doc
Complete-SRS.docComplete-SRS.doc
Complete-SRS.doc
 
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTINGSECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
 
Security issues associated with big data in cloud computing
Security issues associated with big data in cloud computingSecurity issues associated with big data in cloud computing
Security issues associated with big data in cloud computing
 
Analysis on big data concepts and applications
Analysis on big data concepts and applicationsAnalysis on big data concepts and applications
Analysis on big data concepts and applications
 
Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...
 
R180305120123
R180305120123R180305120123
R180305120123
 
Big Data.pdf
Big Data.pdfBig Data.pdf
Big Data.pdf
 
Big dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyondBig dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyond
 
Big data upload
Big data uploadBig data upload
Big data upload
 
big-data.pdf
big-data.pdfbig-data.pdf
big-data.pdf
 
Big data mining
Big data miningBig data mining
Big data mining
 
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsWhitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
 
BIG Data and Methodology-A review
BIG Data and Methodology-A reviewBIG Data and Methodology-A review
BIG Data and Methodology-A review
 
The What, Why and How of Big Data
The What, Why and How of Big DataThe What, Why and How of Big Data
The What, Why and How of Big Data
 
Big data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docxBig data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docx
 

Recently uploaded

Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
marufrahmanstratejm
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
SAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloudSAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloud
maazsz111
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Precisely
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 

Recently uploaded (20)

Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
SAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloudSAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloud
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 

INN530 - Assignment 2, Big data and cloud computing for management

  • 1. Big data and cloud computing for management By Simen Fivelstad Smaaberg (n8661260)
  • 2. Abstract  Big data  What is it?  What possibilities and challenges lays within?  Cloud computing  Software as a Service  Platform as a Service  Infrastructure as a Service  How does big data relate to cloud computing?  Google / Facebook  How does Google and Facebook handle big data?  What big data/cloud services do they offer for others to use?
  • 3. Background  Internet today: a social web  Huge amounts of data are created by users daily  Creates your digital footprint  Need for methods to handle all these data  The data is used for analysis purposes  Improve customer experiences  Increase revenues
  • 4. Big Data  The vast volume of data in existence (Arthur, E 2013)  Tweets, likes, videos, images, comments and so on  Group of collected information, spans 3 V’s of data management (Gartner 2011) Figure 1: Big data 3V’s ( Datameer 2013)
  • 5. Volume  Big data is Big  Exists in one size: large  Challenges  Physical storage space  Logical structuring  Scaling
  • 6. Volume  Big data is Big  Exists in one size: large  Challenges  Physical storage space  Logical structuring  Scaling  Opportunities  Finding trends, patterns and relationships between data  Possibilities for in depth analysis
  • 7. Variety  Challenges  Infrastructure to handle different kinds of media is required.  Challenging for engineers Figure 2: Big Data (Orange 2011)
  • 8. Variety  Challenges  Infrastructure to handle different kinds of media is required.  Challenging for engineers  Opportunities  Finding patterns and relationships between different types of data Figure 2: Big Data (Orange 2011)
  • 9. Velocity  Big data can have different kinds of time-sensitivity  Real time vs non real time  Challenges  Infrastructure that can handle different kinds of time- sensitivity
  • 10. Velocity  Big data can have different kinds of time-sensitivity  Real time vs non real time  Challenges  Infrastructure that can handle different kinds of time- sensitivity  Opportunities  Combine slow moving data with fast moving time constrained data to give the user a better user experience
  • 11. Big data success story: Santam Insurance  About  South Africas largest short-term insurance company  Problem  6-10% of premium revenue were fraud  Solution  Big Data prediction analysis
  • 12. Big data success story: Santam Insurance  About  South Africas largest short-term insurance company  Problem  6-10% of premium revenue were fraud  Solution  Big Data prediction analysis  Result  First four months: 1.98 million USD saved  First three years: ROI of 244%  Insurance fraud syndicate disceovered
  • 13. « Big data has the potential to change the way governments, organizations, and academic institutions conduct business and make discoveries, and its likely to change how everyone lives their day-to-day lives » – Susan Hauser, VP Microsoft Enterprise and partner group (Microsoft Enterprise team 2013)
  • 14. Cloud computing  Running applications elsewhere and accessing them through your computer  You get an “infinite” amount of storage space and computing power  You pay for what you use Figure 3: Cloud computing
  • 15. Big Data and Cloud Computing  Big data requires enormous amounts of storage space  Costly to build and maintain  Huge engineering challenges
  • 16. Big Data and Cloud Computing  Big data requires enormous amounts of storage space  Costly to build and maintain  Huge engineering challenges  Solution: Cloud Computing!  Put your data and programs into “the cloud”  Avoid the hardware problem of big data  Pay for the computing power you actually use,  Opens up for smaller companies stepping into big data analysis
  • 17. Big Data and Cloud Computing  Big data requires enormous amounts of storage space  Costly to build and maintain  Huge engineering challenges  Solution: Cloud Computing!  Put your data and programs into “the cloud”  Avoid the hardware problem of big data  Pay for the computing power you actually use,  Opens up for smaller companies stepping into big data analysis  Issues: Privacy and trust  You put your data into someone else's hand  Is that someone trustworthy?
  • 18. Google and Hadoop  2004: Google revolutionized the field of Big Data and Cloud Computing  Released papers describing how they handled these topics
  • 19. Google and Hadoop  2004: Google revolutionized the field of Big Data and Cloud Computing  Released papers describing how they handled these topics  From this Yahoo spawned Hadoop  Platform that can process and analyse huge amounts of data on interconnected commodity servers  Great fit for cloud computing  Data is spread out and duplicated across servers  Data is analysed in parallel through MapReduce  Backbone of Twitter, Facebook, Yahoo and eBay
  • 20. Google  Huge competitor in the field of big data and cloud computing  Big data used internally  Index searches  Provide email services  Provide advertizing  External Big data and cloud services  Software as a Service: Google docs, Gmail etc  Platform as a Service: Google app engine  Infrastructure as a Service: Google compute engine  Gives developers access to the same infrastructure google itself is run on
  • 21. Facebook  Forerunner in the field of Big Data  Handling massive amounts of data daily  2.5 billion status updates, wall posts, photos, videos and comments  2.7 billion likes  300 million uploaded photos  500 Tb new integrated data every day (2012)
  • 22. Facebook  Forerunner in the field of Big Data  Handling massive amounts of data daily  2.5 billion status updates, wall posts, photos, videos and comments  2.7 billion likes  300 million uploaded photos  500 Tb new integrated data every day (2012)  Facebook is run on top of Hadoop  100 Petabytes of storage  Underpins analysis and everyday services  New data is put into one of their Hadoop clusters physically residing in one of their data centers  Data is analysed when needed or at specific intervals (hourly/daily) through MapReduce
  • 23. Facebook Insight  Tool that provides page owners (both facebook pages and ordinary web pages) with metrics about their content. (Facebook 2013)  Number of visits  Facebook referrals  Visitor demographics (age, gender, location, language)  Connects Facebook’s big data with users visiting your page to provide metrics that can be used to improve your business’s online performance  Generated through Facebook’s Hadoop cluster Figure 4: User demographics (Campalyst 2012)
  • 24. Facebook graph search  Search engine for your social circle (Bea, F 2013)  Allows for search on relationships between people, likes, comments, photos etc.  Possible because of Facebook’s big data  Privacy concerns Figure 5: Facebook restaurant search Figure 6: Facebook TV show search
  • 25. The Future  Big data and cloud computing is in constant change  More and more usage areas are found  Medical diagnostics  Weather forecasts  Particle physics  Fraud prevention and detection  Etc.  Prediction: We have barely seen the start of its dominance
  • 26. References  Arthur, E. 2013. "Big Data“. Alaska Business Monthly, vol. 29, no. 1, pp. 72-72. Retrieved from http://search.proquest.com.ezp01.library.qut.edu.au/docview/1271622055  Gartner. 2011. “Solving Big Data Challenge involves more than just managing volumes of data”. Accessed June 4, 2013. http://www.gartner.com/newsroom/id/1731916  Strickland, J. “How Cloud Computing Works”. Accessed June 6, 2013. http://computer.howstuffworks.com/cloud-computing/cloud-computing.htm  Gartner. “Software as a Service (SaaS)”. Accessed June 6. 2013. http://www.gartner.com/it- glossary/software-as-a-service-saas/  Chong, R. 2011. “The perfect marriage: Hadoop and Cloud”. Accessed June 4, 2013. http://thoughtsoncloud.com/index.php/2011/10/the-perfect-marriage-hadoop-and-cloud/  Microsoft Enterprise Team. “The Big Bang: How the Big Data Explosion Is Changing the World”. Last Modified March 27, 2013. http://www.microsoft.com/enterprise/it-trends/big-data/articles/The-Big-Bang-How-the-Big-Data- Explosion-Is-Changing-the-World.aspx#fbid=8RIFw1BLCG2  Metz, C. 2011. “How Yahoo Spawner Hadoop, the Future of Big Data”. Accessed June 5, 2013. http://www.wired.com/wiredenterprise/2011/10/how-yahoo-spawned-hadoop/all/1  Big Data Insights. 2013. “How Facebook uses Hadoop and Hive”. Accessed June 05, 2013. http://hortonworks.com/blog/how-facebook-uses-hadoop-and-hive/  Facebook. “Insights”. Last modified May 30, 2013. https://developers.facebook.com/docs/insights/  Bea, F. 2013. “How Facebook’s Graph Search Works…Sort Of". Accessed June 05, 2013. http://www.digitaltrends.com/social-media/how-facebook-graph-search-works/  IBM. 2013. “IBM Business Analytics SPSS: Santam insurance”. Last modified May 28, 2013. http://www-01.ibm.com/software/success/cssdb.nsf/CS/SANS- 985HX2?OpenDocument&Site=default&cty=en_us
  • 27. References - Illustrations  Datameer. 2013. “What is Big Data?”. Digital Image. Viewed June 8, 2013. http://www.datameer.com/product/big-data.html  Orange. 2011. “Analyst insight”. Digital Image. Viewed June 11, 2013. http://www.orange- business.com/en/magazine/analyst-insight- december-2011  Campalyst. 2012. “How to measure website visitors’ demographics: hidden Facebook Insights gem”. Digital Image. Viewed June 11, 2013. http://blog.campalyst.com/2012/10/10/how-to- measure-website-visitors-demographics-hidden- facebook-insights-gem/