Big data Analytics

Introduction
• Unstructured data comprises many different types of data.
• "Unstructured data" is a generic label for data that is not contained in a database or some other type of data structure.
• Unstructured data takes every form and is present everywhere, globally.
• More than 90% of social media data is unstructured.

Importance of Unstructured Data
• Every minute, more than 6,000 pictures are shared on social media sites and more than 200 million emails are sent.
• Analyzing social content such as tweets, Facebook posts and transcripts of support calls gives a clear view of how customers perceive a company's value and the issues they encounter.
• Unstructured data isn't well organized or easy to access, but companies that analyze this data and integrate it into their information management landscape can significantly improve employee productivity.

Examples of Unstructured Data
• E-mail messages, word processing documents, videos, photos, audio files, presentations, web pages.
• Other examples may include books, journals, documents, metadata, health records, analog data, images and files.

Influence of Unstructured Data on Social Media
• Social media needs to be part of the business strategy, as a channel for interacting with clients and customers.
• Useful statistics include the number of Twitter followers, Facebook likes, LinkedIn connections and blog subscribers.
• Social media platforms such as Facebook are growing enormously, along with the massive amount of unstructured data they collect.
• Twitter sees about 175 million tweets each day and has more than 465 million accounts.

Technologies
• Data mining
• Pattern recognition
• Operations research
• Social network analytics (Facebook, Twitter, LinkedIn)
• Natural language processing

Tools to analyze
• R language
• RapidMiner
• Weka
• Hadoop
• Python

RapidMiner
• RapidMiner provides an integrated environment for machine learning, data mining, text mining and predictive analytics.
• It is a powerful tool with an easy-to-use, intuitive graphical interface for designing analytic processes.
• It is written in Java.
• It runs on all major platforms and operating systems.
• It saves time by identifying possible errors and suggesting quick fixes.
• It supports .csv, Excel and binary files.

RapidMiner
• Data imported from CSV and Excel files.
• Statistics and charts.

Weka
• Weka is a collection of machine learning algorithms.
• It contains tools for data pre-processing, classification, regression, clustering, association rules and visualization.
• It is written in Java and runs on almost any platform.
• It offers a large collection of different data mining algorithms.

Python
• Python can be connected to R by installing the R package "Rserve".
• It is a high-level, interpreted language that is easy to read.
• It is free and open source, and runs on all platforms.

R language
• R is a very effective statistical tool and well worth the effort to learn.
• R is polymorphic, which means that the same function can be applied to different types of objects.
• R has more than 4000 packages, in various specializations, available from multiple repositories such as CRAN (Comprehensive R Archive Network).
• R can import data from CSV, Excel and SAS files, and can produce output in PDF, JPG and PNG formats as well as tables.
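
The polymorphism point can be illustrated by analogy. In R, generic functions such as summary() pick an implementation based on the class of their argument. The sketch below mimics that idea in Python with functools.singledispatch; the function name summarize and the three registered types are assumptions chosen for illustration, not something the slides show.

```python
from functools import singledispatch


@singledispatch
def summarize(obj):
    """Fallback used when no type-specific implementation is registered."""
    return f"object of type {type(obj).__name__}"


@summarize.register
def _(obj: list):
    # Implementation chosen when the argument is a list.
    return f"list of {len(obj)} items"


@summarize.register
def _(obj: str):
    # Implementation chosen when the argument is a string.
    return f"text of {len(obj)} characters"


@summarize.register
def _(obj: dict):
    # Implementation chosen when the argument is a dict (column name -> values).
    return f"table with columns {sorted(obj)}"
```

The caller always writes summarize(x); which body runs depends only on the type of x, which is the behaviour the slide describes for R functions.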

R language
• Working with RStudio: loading packages and extracting the tweets.

Unstructured Data Analysis for Motor Insurance
• Extract data related to the motor insurance sector from social media.
• Search by company names and keywords.
• Get the tweets from Twitter and analyze the data.
• Sentiment analysis.
• User interface.
• What type of insurance can be offered, and can any fraud be detected?
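
The keyword step above can be sketched as a simple filter over already-collected posts. This is a minimal illustration using only the Python standard library; the keyword list, the sample posts and the function names are hypothetical, since the slides only say "company names, keywords".

```python
import re

# Hypothetical watch list (assumption); a real list would hold the
# company names and keywords chosen for the study.
KEYWORDS = ["motor insurance", "car insurance", "claim", "premium"]

# Made-up sample posts, for illustration only.
SAMPLE_POSTS = [
    "My car insurance premium went up again this year",
    "Lovely weather for a drive today",
    "Filed a claim after the accident, still waiting",
]


def matching_keywords(text, keywords=KEYWORDS):
    """Return the keywords that occur in text as whole words or phrases."""
    lowered = text.lower()
    return [
        kw for kw in keywords
        if re.search(r"\b" + re.escape(kw) + r"\b", lowered)
    ]


def filter_posts(posts, keywords=KEYWORDS):
    """Keep only the posts that mention at least one keyword."""
    return [post for post in posts if matching_keywords(post, keywords)]
```

Matching on word boundaries avoids hits inside longer words, at the cost of missing inflected forms such as "claims"; stemming would be a natural next step.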

Extracting data from Twitter using R
• A Twitter app needs to be created to obtain the following credentials:
• api_key
• api_token
• access_token
• access_secret
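
One way to handle the four credentials listed above is to keep them out of the script and read them from environment variables. The sketch below does only that and makes no call to Twitter; the TWITTER_ prefix, the variable names derived from it and the function name are assumptions for illustration.

```python
import os

# Credential names as listed on the slide.
REQUIRED_KEYS = ("api_key", "api_token", "access_token", "access_secret")


def load_twitter_credentials(env=None, prefix="TWITTER_"):
    """Collect the four credentials from environment variables.

    The variable names (e.g. TWITTER_API_KEY) are a hypothetical
    convention, not something prescribed by the slides or by Twitter.
    """
    env = os.environ if env is None else env
    creds, missing = {}, []
    for key in REQUIRED_KEYS:
        var = prefix + key.upper()
        value = env.get(var)
        if value:
            creds[key] = value
        else:
            missing.append(var)
    if missing:
        # Fail early and name every missing variable at once.
        raise KeyError("missing credentials: " + ", ".join(missing))
    return creds
```

Passing a plain dictionary as env makes the function easy to try without touching the real environment.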
Sentiment Analysis from Twitter
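
The slide does not say which sentiment method was used. As a rough illustration of the general idea, the sketch below scores a text by counting words from small positive and negative word lists. Both lists and the function names are assumptions, far smaller than any usable lexicon.

```python
import re

# Tiny illustrative lexicons (assumptions, not taken from the slides).
POSITIVE = {"good", "great", "helpful", "fast", "happy", "easy"}
NEGATIVE = {"bad", "slow", "poor", "unhappy", "fraud", "expensive"}


def sentiment_score(text):
    """Number of positive words minus number of negative words."""
    words = re.findall(r"[a-z']+", text.lower())
    positives = sum(word in POSITIVE for word in words)
    negatives = sum(word in NEGATIVE for word in words)
    return positives - negatives


def label(text):
    """Map the score to a coarse label."""
    score = sentiment_score(text)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"
```

A word-count approach like this ignores negation and sarcasm, so it is only a starting point for the kind of analysis the slide refers to.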

Comparison between data mining tools

Characteristic | R | RapidMiner | Weka
Purpose | Statistics, clustering and analytics | Data mining, classification | Data mining, association
Data import | .xlsx, .csv, RODBC, .txt | .csv, .xlsx, binary files | .csv, .arff
Specialization | Has a large number of users in the fields of bioinformatics and social science | Specialized for business solutions that include predictive analysis and statistical computing | Best suited for mining association rules and machine learning techniques
Advantages | Purely statistical | Visualization, parameter optimization | Ease of use and machine learning
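
The data import formats in the comparison differ mainly in Weka's use of .arff (Attribute-Relation File Format). As a rough sketch of how the two formats relate, the function below turns CSV text into ARFF text. It is deliberately limited: it assumes a header row, attribute names without spaces, no missing values, and only numeric or string attributes. The function names and the sample data are hypothetical.

```python
import csv
import io


def _is_number(value):
    """True if the text parses as a float."""
    try:
        float(value)
        return True
    except ValueError:
        return False


def _quote(value):
    """Single-quote a string value, escaping backslashes and quotes."""
    return "'" + value.replace("\\", "\\\\").replace("'", "\\'") + "'"


def csv_to_arff(csv_text, relation="data"):
    """Convert CSV text (with a header row) into minimal ARFF text."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, records = rows[0], rows[1:]
    # A column is numeric only if every one of its values parses as a number.
    numeric = [
        all(_is_number(record[i]) for record in records)
        for i in range(len(header))
    ]
    lines = [f"@relation {relation}", ""]
    for name, is_num in zip(header, numeric):
        lines.append(f"@attribute {name} {'numeric' if is_num else 'string'}")
    lines += ["", "@data"]
    for record in records:
        lines.append(",".join(
            value if is_num else _quote(value)
            for value, is_num in zip(record, numeric)
        ))
    return "\n".join(lines) + "\n"
```

For example, csv_to_arff("insurer,rating\nAcme Motor,4.5\n", relation="reviews") declares insurer as a string attribute and rating as a numeric one, then lists the rows under @data.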