SlideShare a Scribd company logo
1 of 21
Impetus Technologies Inc. 
Big Data Technologies for Social 
© 2014 1 Impetus Technologies 
Media Analytics 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=4 
8
Outline 
• Social Media Analytics- Need and Benefits 
• Effective convergence of disparate data sources 
• Big Data technologies to enable Social Analytics 
• Our recommended approach 
• Industry relevant use cases 
© 2014 2 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Social Analytics 
Recommendation 
Engine 
© 2014 3 Impetus Technologies 
Reports and 
Statistics 
Data visualization Sentiment Analysis 
via Interactive 
Interface 
Social Media Sources 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Business Intelligence & Product Research 
 Customer Analysis 
 Identifies users from different geographies, 
locations 
 Tracks users activities to determine usage 
patterns 
 Feature Analysis 
 Track the usage of various social features 
 Product Growth Analysis 
 Track customer feedback on products 
 Target the right customers 
 Recommendation Engine 
 Related products and customers 
 Third Party Data Analysis 
 Analysis of customers on third party sites 
© 2014 4 Impetus Technologies 
Social Analytics provides smarter 
ways of data tracking, powerful 
analytics and metrics for informed 
decision making 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
How it Helps? 
Outcome Based Approach 
• Customer retention 
• Brand building and recall (harvests/ address sentiment) 
• Simplifies customer service 
• Reduces operational cost 
• Builds up the customer base 
• Understands customer’s opinions and addresses their 
needs 
• Competition benchmarking 
• Proactive on demographic changes 
© 2014 5 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Convergence of Data Sources 
Data Sources 
© 2014 6 Impetus Technologies 
Website Traffic Analysis 
(On-site web analytics) 
Internal CSR Logs, Customer 
Queries 
Automated Agent discussions 
Complaints and Resolutions 
Employee Insights 
External Data Sources 
(Off-site web analytics) 
Industry Reports 
Market Research 
Social Media 
Social Media Analytics 
Social Media Analytics effectively converges on-site, social media and third party data 
to extract useful information 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Technical Tenets of Social Media Analytics 
Data Sources 
© 2014 7 Impetus Technologies 
Website Traffic Analysis 
(On-site web analytics) 
Internal CSR Logs, Customer 
Queries 
Automated Agent discussions 
Complaints and Resolutions 
Employee Insights 
External Data Sources 
(Off-site web analytics) 
Industry Reports 
Market Research 
Social Media 
Social Media Analytics 
Clustering Classification Sequential classification 
Entity extraction Event extraction Communication graph 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Why Big Data for Social Analytics? 
• Large data volumes in the order TBs and PBs 
• Complex unstructured data from social sources 
• Deeper insights into customers and trends 
• Storing images, videos 
• The bottom-line - $/TB 
© 2014 8 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Our Recommended Approach 
Technologies 
• Data collection - Social media data 
– Live feeds 
– Historical bulk data 
• NLP (NLTK is a good option) 
• Data preparation/ Mashup 
– M/R, PIG, Hive, Oozie, R, Sqoop 
• Classification/ Clustering (Mahout) 
• Recommendation (Mahout) 
• Loopback/ Feed output to live applications 
• Analytical reporting and deep mining 
© 2014 9 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Our Recommended Approach 
• Collecting Twitter Feed (Streaming feed) using filter fire 
hose 
– Tweets for keywords 
– Based on brand, product, category, industry, product 
segment, special offers and marketing buzz words 
– Streaming API and HBASE based sink for high writes 
• Collect/create training data 
– Standalone Tweets for individual keywords 
© 2014 10 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Our Recommended Approach 
• Creating or classifying text data and demographics 
• Quantitative analytics 
• Ascertaining daily trend 
• General tweets v/s product-specific tweets 
• Tweets targeted at competitors v/s own product 
• Location based trends (for available data sets) 
• Identifying and categorizing the output 
• Sentiment analysis of own product - Good, Neutral, 
Bad 
• Use training data for classification - Mahout/NLTK 
• Run trained models on Tweet data - Mahout/NLTK 
© 2014 11 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Our Recommended Approach 
• Mash up Analytics from RDBMS with Social media 
analytics 
• Using customer data to recommend new/related 
products 
• Preparing mock customer data for Social ID mapping 
• Running recommendations (item or user based) using 
Mahout 
• Analytical Reporting 
• Demonstrates drill down reports on data generated by 
Mahout 
• Reports over Hive/MySQL using a traditional Reporting 
product or framework 
© 2014 12 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
© 2014 13 Impetus Technologies 
iLaDaP 
Impetus Large Data Analytics Platform
iLaDaP- Technology Stack 
• Scalable data store 
– Hadoop HDFS 
– Hbase 
• Connectors (In/Out) 
– Flume 
– Sqoop 
– Messaging queue 
– ESB- Apache Camel 
• Analytics and ETL 
– Mahout for NL and text mining 
• Classification/ Clustering 
• Recommendation 
– Oozie for complex ETL and workflow 
– JDBC/ODBC compliant Analytics tools – Intellicus, Jasper etc. 
© 2014 14 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Case Study- Financial Services 
The Client 
– Leading financial services company 
Key Challenge 
– Recommend products based on User profile/location 
– Recommend alternate products based Social Media feedback 
Impetus Solution 
• Proposed iLaDaP based solution 
• Sentiment Analysis using Naïve Bayesian algorithm for 
classification/sentiment analysis 
• Clustering using k-means algorithm of Mahout 
• Apache Mahout based recommendation engine 
Benefits Realised 
• Better product recommendations 
© 2014 15 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Case Study- Online Retailer 
The Client 
– Leading online product retailer 
Key Challenge 
• Recommendation engine 
• Cross product customer analysis 
• Provide ‘Big Picture’ across business units 
Impetus Solution 
• Proposed iLaDaP based solution 
• Clustering using k-means algorithm of Mahout 
• Apache Mahout based recommendation engine 
Benefits Realised 
• True centralized business overview across product and business lines 
© 2014 16 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Summing Up 
• Using Big Data technologies for Social Analytics needs a 
well-thought of strategy 
• Open source yields better results for social media data 
• Hadoop based Big Data Analytics is a scalable and cost 
effective option. 
• Selecting the right tools is the key to build a successful 
Social Analytics EDW using Big Data 
• Easy extension of the existing Data Warehouse and 
Analytics infrastructure is possible to leverage existing 
investments 
© 2014 17 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
© 2014 18 Impetus Technologies 
About Impetus
• Strategic partners for software product engineering and 
R&D 
• Thought leaders in cutting-edge technologies 
• Mature processes and practices that are methodical, yet 
flexible 
• Diverse domain expertise 
© 2014 19 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
© 2014 20 Impetus Technologies 
Q & A
© 2014 21 Impetus Technologies 
Thank You 
Write to us at inquiry@impetus.com 
Follow us on Twitter @impetustech 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48

More Related Content

Similar to Big Data Technologies for Social Media Analytics- Impetus Webinar

The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016StampedeCon
 
Webinar: Take Your Brandwatch Data Anywhere
Webinar: Take Your Brandwatch Data AnywhereWebinar: Take Your Brandwatch Data Anywhere
Webinar: Take Your Brandwatch Data AnywhereBrandwatch
 
The Importance of an Analytics Platform
The Importance of an Analytics PlatformThe Importance of an Analytics Platform
The Importance of an Analytics PlatformLou Bajuk
 
Come fare business con i big data in concreto
Come fare business con i big data in concretoCome fare business con i big data in concreto
Come fare business con i big data in concretoHP Enterprise Italia
 
CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014Hortonworks
 
From Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessFrom Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessNG DATA
 
First Friday Forum December 5th Featuring Pentaho
First Friday Forum December 5th Featuring PentahoFirst Friday Forum December 5th Featuring Pentaho
First Friday Forum December 5th Featuring PentahoArchipelagoIS
 
Why Your Product Needs A Data & Analytics Strategy
Why Your Product Needs A Data & Analytics StrategyWhy Your Product Needs A Data & Analytics Strategy
Why Your Product Needs A Data & Analytics StrategyAIPMM Administration
 
Extending BI with Big Data Analytics
Extending BI with Big Data AnalyticsExtending BI with Big Data Analytics
Extending BI with Big Data AnalyticsDatameer
 
Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...
Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...
Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...SAP Ariba
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopHortonworks
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopHortonworks
 
HyperHack 2023 Global Presentation - AMER Enablement_070623.pdf
HyperHack 2023 Global Presentation - AMER Enablement_070623.pdfHyperHack 2023 Global Presentation - AMER Enablement_070623.pdf
HyperHack 2023 Global Presentation - AMER Enablement_070623.pdfDianaGray10
 
Why Your Product Needs an Analytic Strategy
Why Your Product Needs an Analytic Strategy Why Your Product Needs an Analytic Strategy
Why Your Product Needs an Analytic Strategy Pentaho
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMBig Data Joe™ Rossi
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMBig Data Joe™ Rossi
 
Tips and Tricks for Beginning Cognos Report Studio Authors
Tips and Tricks for Beginning Cognos Report Studio AuthorsTips and Tricks for Beginning Cognos Report Studio Authors
Tips and Tricks for Beginning Cognos Report Studio AuthorsSenturus
 
How to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics CloudHow to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics CloudPerficient, Inc.
 

Similar to Big Data Technologies for Social Media Analytics- Impetus Webinar (20)

The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
 
Webinar: Take Your Brandwatch Data Anywhere
Webinar: Take Your Brandwatch Data AnywhereWebinar: Take Your Brandwatch Data Anywhere
Webinar: Take Your Brandwatch Data Anywhere
 
The Importance of an Analytics Platform
The Importance of an Analytics PlatformThe Importance of an Analytics Platform
The Importance of an Analytics Platform
 
Come fare business con i big data in concreto
Come fare business con i big data in concretoCome fare business con i big data in concreto
Come fare business con i big data in concreto
 
CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014
 
From Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessFrom Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your business
 
First Friday Forum December 5th Featuring Pentaho
First Friday Forum December 5th Featuring PentahoFirst Friday Forum December 5th Featuring Pentaho
First Friday Forum December 5th Featuring Pentaho
 
Why Your Product Needs A Data & Analytics Strategy
Why Your Product Needs A Data & Analytics StrategyWhy Your Product Needs A Data & Analytics Strategy
Why Your Product Needs A Data & Analytics Strategy
 
Mar-Tech Oversight
Mar-Tech OversightMar-Tech Oversight
Mar-Tech Oversight
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
 
Extending BI with Big Data Analytics
Extending BI with Big Data AnalyticsExtending BI with Big Data Analytics
Extending BI with Big Data Analytics
 
Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...
Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...
Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
HyperHack 2023 Global Presentation - AMER Enablement_070623.pdf
HyperHack 2023 Global Presentation - AMER Enablement_070623.pdfHyperHack 2023 Global Presentation - AMER Enablement_070623.pdf
HyperHack 2023 Global Presentation - AMER Enablement_070623.pdf
 
Why Your Product Needs an Analytic Strategy
Why Your Product Needs an Analytic Strategy Why Your Product Needs an Analytic Strategy
Why Your Product Needs an Analytic Strategy
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBM
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBM
 
Tips and Tricks for Beginning Cognos Report Studio Authors
Tips and Tricks for Beginning Cognos Report Studio AuthorsTips and Tricks for Beginning Cognos Report Studio Authors
Tips and Tricks for Beginning Cognos Report Studio Authors
 
How to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics CloudHow to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics Cloud
 

More from Impetus Technologies

Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...Impetus Technologies
 
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix WebinarFuture-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix WebinarImpetus Technologies
 
Building Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus WebinarBuilding Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus WebinarImpetus Technologies
 
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...Impetus Technologies
 
Impetus White Paper- Handling Data Corruption in Elasticsearch
Impetus White Paper- Handling  Data Corruption  in ElasticsearchImpetus White Paper- Handling  Data Corruption  in Elasticsearch
Impetus White Paper- Handling Data Corruption in ElasticsearchImpetus Technologies
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarImpetus Technologies
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarImpetus Technologies
 
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...Impetus Technologies
 
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...Impetus Technologies
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Impetus Technologies
 
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...Impetus Technologies
 
Enterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus WebcastEnterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus WebcastImpetus Technologies
 
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...Impetus Technologies
 
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...Impetus Technologies
 
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...Impetus Technologies
 
Big Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabBig Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabImpetus Technologies
 
Webinar maturity of mobile test automation- approaches and future trends
Webinar  maturity of mobile test automation- approaches and future trendsWebinar  maturity of mobile test automation- approaches and future trends
Webinar maturity of mobile test automation- approaches and future trendsImpetus Technologies
 
Next generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labNext generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labImpetus Technologies
 
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...Impetus Technologies
 
Performance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus WebcastPerformance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus WebcastImpetus Technologies
 

More from Impetus Technologies (20)

Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
 
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix WebinarFuture-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
 
Building Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus WebinarBuilding Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus Webinar
 
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
 
Impetus White Paper- Handling Data Corruption in Elasticsearch
Impetus White Paper- Handling  Data Corruption  in ElasticsearchImpetus White Paper- Handling  Data Corruption  in Elasticsearch
Impetus White Paper- Handling Data Corruption in Elasticsearch
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
 
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
 
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
 
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
 
Enterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus WebcastEnterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus Webcast
 
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
 
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
 
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
 
Big Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabBig Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLab
 
Webinar maturity of mobile test automation- approaches and future trends
Webinar  maturity of mobile test automation- approaches and future trendsWebinar  maturity of mobile test automation- approaches and future trends
Webinar maturity of mobile test automation- approaches and future trends
 
Next generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labNext generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph lab
 
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
 
Performance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus WebcastPerformance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus Webcast
 

Recently uploaded

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 

Recently uploaded (20)

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 

Big Data Technologies for Social Media Analytics- Impetus Webinar

  • 1. Impetus Technologies Inc. Big Data Technologies for Social © 2014 1 Impetus Technologies Media Analytics Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=4 8
  • 2. Outline • Social Media Analytics- Need and Benefits • Effective convergence of disparate data sources • Big Data technologies to enable Social Analytics • Our recommended approach • Industry relevant use cases © 2014 2 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 3. Social Analytics Recommendation Engine © 2014 3 Impetus Technologies Reports and Statistics Data visualization Sentiment Analysis via Interactive Interface Social Media Sources Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 4. Business Intelligence & Product Research  Customer Analysis  Identifies users from different geographies, locations  Tracks users activities to determine usage patterns  Feature Analysis  Track the usage of various social features  Product Growth Analysis  Track customer feedback on products  Target the right customers  Recommendation Engine  Related products and customers  Third Party Data Analysis  Analysis of customers on third party sites © 2014 4 Impetus Technologies Social Analytics provides smarter ways of data tracking, powerful analytics and metrics for informed decision making Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 5. How it Helps? Outcome Based Approach • Customer retention • Brand building and recall (harvests/ address sentiment) • Simplifies customer service • Reduces operational cost • Builds up the customer base • Understands customer’s opinions and addresses their needs • Competition benchmarking • Proactive on demographic changes © 2014 5 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 6. Convergence of Data Sources Data Sources © 2014 6 Impetus Technologies Website Traffic Analysis (On-site web analytics) Internal CSR Logs, Customer Queries Automated Agent discussions Complaints and Resolutions Employee Insights External Data Sources (Off-site web analytics) Industry Reports Market Research Social Media Social Media Analytics Social Media Analytics effectively converges on-site, social media and third party data to extract useful information Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 7. Technical Tenets of Social Media Analytics Data Sources © 2014 7 Impetus Technologies Website Traffic Analysis (On-site web analytics) Internal CSR Logs, Customer Queries Automated Agent discussions Complaints and Resolutions Employee Insights External Data Sources (Off-site web analytics) Industry Reports Market Research Social Media Social Media Analytics Clustering Classification Sequential classification Entity extraction Event extraction Communication graph Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 8. Why Big Data for Social Analytics? • Large data volumes in the order TBs and PBs • Complex unstructured data from social sources • Deeper insights into customers and trends • Storing images, videos • The bottom-line - $/TB © 2014 8 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 9. Our Recommended Approach Technologies • Data collection - Social media data – Live feeds – Historical bulk data • NLP (NLTK is a good option) • Data preparation/ Mashup – M/R, PIG, Hive, Oozie, R, Sqoop • Classification/ Clustering (Mahout) • Recommendation (Mahout) • Loopback/ Feed output to live applications • Analytical reporting and deep mining © 2014 9 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 10. Our Recommended Approach • Collecting Twitter Feed (Streaming feed) using filter fire hose – Tweets for keywords – Based on brand, product, category, industry, product segment, special offers and marketing buzz words – Streaming API and HBASE based sink for high writes • Collect/create training data – Standalone Tweets for individual keywords © 2014 10 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 11. Our Recommended Approach • Creating or classifying text data and demographics • Quantitative analytics • Ascertaining daily trend • General tweets v/s product-specific tweets • Tweets targeted at competitors v/s own product • Location based trends (for available data sets) • Identifying and categorizing the output • Sentiment analysis of own product - Good, Neutral, Bad • Use training data for classification - Mahout/NLTK • Run trained models on Tweet data - Mahout/NLTK © 2014 11 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 12. Our Recommended Approach • Mash up Analytics from RDBMS with Social media analytics • Using customer data to recommend new/related products • Preparing mock customer data for Social ID mapping • Running recommendations (item or user based) using Mahout • Analytical Reporting • Demonstrates drill down reports on data generated by Mahout • Reports over Hive/MySQL using a traditional Reporting product or framework © 2014 12 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 13. © 2014 13 Impetus Technologies iLaDaP Impetus Large Data Analytics Platform
  • 14. iLaDaP- Technology Stack • Scalable data store – Hadoop HDFS – Hbase • Connectors (In/Out) – Flume – Sqoop – Messaging queue – ESB- Apache Camel • Analytics and ETL – Mahout for NL and text mining • Classification/ Clustering • Recommendation – Oozie for complex ETL and workflow – JDBC/ODBC compliant Analytics tools – Intellicus, Jasper etc. © 2014 14 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 15. Case Study- Financial Services The Client – Leading financial services company Key Challenge – Recommend products based on User profile/location – Recommend alternate products based Social Media feedback Impetus Solution • Proposed iLaDaP based solution • Sentiment Analysis using Naïve Bayesian algorithm for classification/sentiment analysis • Clustering using k-means algorithm of Mahout • Apache Mahout based recommendation engine Benefits Realised • Better product recommendations © 2014 15 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 16. Case Study- Online Retailer The Client – Leading online product retailer Key Challenge • Recommendation engine • Cross product customer analysis • Provide ‘Big Picture’ across business units Impetus Solution • Proposed iLaDaP based solution • Clustering using k-means algorithm of Mahout • Apache Mahout based recommendation engine Benefits Realised • True centralized business overview across product and business lines © 2014 16 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 17. Summing Up • Using Big Data technologies for Social Analytics needs a well-thought of strategy • Open source yields better results for social media data • Hadoop based Big Data Analytics is a scalable and cost effective option. • Selecting the right tools is the key to build a successful Social Analytics EDW using Big Data • Easy extension of the existing Data Warehouse and Analytics infrastructure is possible to leverage existing investments © 2014 17 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 18. © 2014 18 Impetus Technologies About Impetus
  • 19. • Strategic partners for software product engineering and R&D • Thought leaders in cutting-edge technologies • Mature processes and practices that are methodical, yet flexible • Diverse domain expertise © 2014 19 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 20. © 2014 20 Impetus Technologies Q & A
  • 21. © 2014 21 Impetus Technologies Thank You Write to us at inquiry@impetus.com Follow us on Twitter @impetustech Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48