Submit Search
Upload
bigdata_datascience_methodology
•
0 likes
•
58 views
S
Sitaraman Ramachandran
Follow
Report
Share
Report
Share
1 of 1
Download now
Download to read offline
Recommended
YELP Data Set Challenge
YELP Data Set Challenge
Vegard Ølstad
Eat, Rate, Love -- Presentation Slides v3
Eat, Rate, Love -- Presentation Slides v3
Robert Chen
DataScience presents insights to customers in an easily digestible, interactive format via a collaborative web application. This presentation outlines the technology behind the DataScience application, as well as future plans to enhance it.
Delivering Insights: Building the DataScience Web Application
Delivering Insights: Building the DataScience Web Application
DataScience
The business cases for Hadoop can be made on the tremendous operational cost savings that it affords. But why stop there? The integration of R-powered analytics in Hadoop presents a totally new value proposition. Organizations can write R code and deploy it natively in Hadoop without data movement or the need to write their own MapReduce. Bringing R-powered predictive analytics into Hadoop will accelerate Hadoop’s value to organizations by allowing them to break through performance and scalability challenges and solve new analytic problems. Use all the data in Hadoop to discover more, grow more quickly, and operate more efficiently. Ask bigger questions. Ask new questions. Get better, faster results and share them.
R+Hadoop - Ask Bigger (and New) Questions and Get Better, Faster Answers
R+Hadoop - Ask Bigger (and New) Questions and Get Better, Faster Answers
Revolution Analytics
This project involves predictive analysis on user reviews, review count, neighborhoods of businesses etc. and deriving star rating for different businesses in the YELP application. Technologies used: Machine Learning Models (Navie Bayes, KNN, Decision Tree), Weka, Hadoop, Hive, Java You can also find the blog at https://sjsubigdata.wordpress.com/2014/12/19/data-mining-on-yelp-dataset-2/
Data mining on yelp dataset
Data mining on yelp dataset
Parineetha Tirumali
Yelp final
Yelp final
xourico24
Analysis of Yelp Academic Datasets
Yelp Academic Dataset
Yelp Academic Dataset
MandaniKeyur
A restaurant's average rating and reviews on Yelp in influence customers to an incredible degree. An extra half-star rating causes restaurants to sell out 19 percentage points (49%) more frequently. Despite the impact on the restaurant's business, achieving a better overall rating is not straightforward. A user may give only one star to the restaurant just because he or she found the quality of service to be abysmal even though the food and the restaurant's location were up to his or her standard. These facts may have been mentioned in the review in detail but the final rating would just reflect the poor quality of service. The user rating alone does not provide any additional details, and as a result, the restaurant may not be able to understand which aspects create a negative impact on user experience. Another case may be that a certain popular dish will make users give the restaurant five star ratings, but they would not be satisfied with another aspect of the restaurant such as the dessert. The high user ratings may hide the fact that some aspects of the user experience was negative and that the restaurant has room to improve. Traditional recommender systems usually use only the aggregated ratings without considering the hidden factors in the preference of the users and the properties of the restaurants. For the restaurant domain, this could mean main cuisine, dessert, service, staff friendliness, knowledge of staff, location, ambiance, price and many more aspects. Without considering the ratings for individual aspects, it is likely that the recommendation systems will give inaccurate predictions to restaurants as well as users. In this project, we aim to uncover hidden details about the users' preferences with respect to restaurant properties. With this information, we can provide precise recommendations to the restaurants regarding what aspects they should concentrate on to improve user experience. Since we are backed by more meaningful information about users' preferences we can provide better recommendations to users as to which restaurants they would prefer and why. To summarize, from the results of this project, we can answer the following questions: "what does a particular user care about when dining from a restaurant?", "which aspect should the restaurant improve in order to effectively increase the rating?", and "which restaurant is the best for a particular user?"
Yelp Data Challenge - Discovering Latent Factors using Ratings and Reviews
Yelp Data Challenge - Discovering Latent Factors using Ratings and Reviews
Tharindu Mathew
Recommended
YELP Data Set Challenge
YELP Data Set Challenge
Vegard Ølstad
Eat, Rate, Love -- Presentation Slides v3
Eat, Rate, Love -- Presentation Slides v3
Robert Chen
DataScience presents insights to customers in an easily digestible, interactive format via a collaborative web application. This presentation outlines the technology behind the DataScience application, as well as future plans to enhance it.
Delivering Insights: Building the DataScience Web Application
Delivering Insights: Building the DataScience Web Application
DataScience
The business cases for Hadoop can be made on the tremendous operational cost savings that it affords. But why stop there? The integration of R-powered analytics in Hadoop presents a totally new value proposition. Organizations can write R code and deploy it natively in Hadoop without data movement or the need to write their own MapReduce. Bringing R-powered predictive analytics into Hadoop will accelerate Hadoop’s value to organizations by allowing them to break through performance and scalability challenges and solve new analytic problems. Use all the data in Hadoop to discover more, grow more quickly, and operate more efficiently. Ask bigger questions. Ask new questions. Get better, faster results and share them.
R+Hadoop - Ask Bigger (and New) Questions and Get Better, Faster Answers
R+Hadoop - Ask Bigger (and New) Questions and Get Better, Faster Answers
Revolution Analytics
This project involves predictive analysis on user reviews, review count, neighborhoods of businesses etc. and deriving star rating for different businesses in the YELP application. Technologies used: Machine Learning Models (Navie Bayes, KNN, Decision Tree), Weka, Hadoop, Hive, Java You can also find the blog at https://sjsubigdata.wordpress.com/2014/12/19/data-mining-on-yelp-dataset-2/
Data mining on yelp dataset
Data mining on yelp dataset
Parineetha Tirumali
Yelp final
Yelp final
xourico24
Analysis of Yelp Academic Datasets
Yelp Academic Dataset
Yelp Academic Dataset
MandaniKeyur
A restaurant's average rating and reviews on Yelp in influence customers to an incredible degree. An extra half-star rating causes restaurants to sell out 19 percentage points (49%) more frequently. Despite the impact on the restaurant's business, achieving a better overall rating is not straightforward. A user may give only one star to the restaurant just because he or she found the quality of service to be abysmal even though the food and the restaurant's location were up to his or her standard. These facts may have been mentioned in the review in detail but the final rating would just reflect the poor quality of service. The user rating alone does not provide any additional details, and as a result, the restaurant may not be able to understand which aspects create a negative impact on user experience. Another case may be that a certain popular dish will make users give the restaurant five star ratings, but they would not be satisfied with another aspect of the restaurant such as the dessert. The high user ratings may hide the fact that some aspects of the user experience was negative and that the restaurant has room to improve. Traditional recommender systems usually use only the aggregated ratings without considering the hidden factors in the preference of the users and the properties of the restaurants. For the restaurant domain, this could mean main cuisine, dessert, service, staff friendliness, knowledge of staff, location, ambiance, price and many more aspects. Without considering the ratings for individual aspects, it is likely that the recommendation systems will give inaccurate predictions to restaurants as well as users. In this project, we aim to uncover hidden details about the users' preferences with respect to restaurant properties. With this information, we can provide precise recommendations to the restaurants regarding what aspects they should concentrate on to improve user experience. Since we are backed by more meaningful information about users' preferences we can provide better recommendations to users as to which restaurants they would prefer and why. To summarize, from the results of this project, we can answer the following questions: "what does a particular user care about when dining from a restaurant?", "which aspect should the restaurant improve in order to effectively increase the rating?", and "which restaurant is the best for a particular user?"
Yelp Data Challenge - Discovering Latent Factors using Ratings and Reviews
Yelp Data Challenge - Discovering Latent Factors using Ratings and Reviews
Tharindu Mathew
This project encompasses the sentiment expressed in social media (Twitter) for smartphones. How to Perform text mining on Hadoop data and analyze the same by integrating R with Hadoop.
Integrating R & Hadoop - Text Mining & Sentiment Analysis
Integrating R & Hadoop - Text Mining & Sentiment Analysis
Aravind Babu
Goal: Demonstrate popular practices when mining big dissimilar texts Object of mining: texts from site: http://blogs.korrespondent.net Tool: R
TextMining with R
TextMining with R
Aleksei Beloshytski
RDataMining Slides Series: Text Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter Data
Yanchang Zhao
A quick tutorial for the Boston Predictive Analytics MeetUp to demonstrate the use of R in the context of text mining Twitter. We implement a very crude algorithm for sentiment analysis but still get a plausible result.
R by example: mining Twitter for consumer attitudes towards airlines
R by example: mining Twitter for consumer attitudes towards airlines
Jeffrey Breen
This is an introduction for the layman to how sentiment analysis in machines works.
How Sentiment Analysis works
How Sentiment Analysis works
CJ Jenkins
Tutorial of Sentiment Analysis
Tutorial of Sentiment Analysis
Fabio Benedetti
An Introduction to Sentiment Analysis
Introduction to Sentiment Analysis
Introduction to Sentiment Analysis
Jaganadh Gopinadhan
High school extra curricular certificates.6
High school extra curricular certificates.6
Sitaraman Ramachandran
High school extra curricular certificates.5
High school extra curricular certificates.5
Sitaraman Ramachandran
High school extra curricular certificates.4
High school extra curricular certificates.4
Sitaraman Ramachandran
High school extra curricular certificates.3
High school extra curricular certificates.3
Sitaraman Ramachandran
High school extra curricular certificates.2
High school extra curricular certificates.2
Sitaraman Ramachandran
High school extra curricular certificates.1
High school extra curricular certificates.1
Sitaraman Ramachandran
Engi extra curricular certificate
Engi extra curricular certificate
Sitaraman Ramachandran
Noah's Ark
Noah's Ark
Sitaraman Ramachandran
bigdata_pig
bigdata_pig
Sitaraman Ramachandran
bigdata_nosql_DBaaS
bigdata_nosql_DBaaS
Sitaraman Ramachandran
bigdata_mapreduce
bigdata_mapreduce
Sitaraman Ramachandran
bigdata_hive
bigdata_hive
Sitaraman Ramachandran
bigdata_hadoop_fundamentals
bigdata_hadoop_fundamentals
Sitaraman Ramachandran
More Related Content
Viewers also liked
This project encompasses the sentiment expressed in social media (Twitter) for smartphones. How to Perform text mining on Hadoop data and analyze the same by integrating R with Hadoop.
Integrating R & Hadoop - Text Mining & Sentiment Analysis
Integrating R & Hadoop - Text Mining & Sentiment Analysis
Aravind Babu
Goal: Demonstrate popular practices when mining big dissimilar texts Object of mining: texts from site: http://blogs.korrespondent.net Tool: R
TextMining with R
TextMining with R
Aleksei Beloshytski
RDataMining Slides Series: Text Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter Data
Yanchang Zhao
A quick tutorial for the Boston Predictive Analytics MeetUp to demonstrate the use of R in the context of text mining Twitter. We implement a very crude algorithm for sentiment analysis but still get a plausible result.
R by example: mining Twitter for consumer attitudes towards airlines
R by example: mining Twitter for consumer attitudes towards airlines
Jeffrey Breen
This is an introduction for the layman to how sentiment analysis in machines works.
How Sentiment Analysis works
How Sentiment Analysis works
CJ Jenkins
Tutorial of Sentiment Analysis
Tutorial of Sentiment Analysis
Fabio Benedetti
An Introduction to Sentiment Analysis
Introduction to Sentiment Analysis
Introduction to Sentiment Analysis
Jaganadh Gopinadhan
Viewers also liked
(7)
Integrating R & Hadoop - Text Mining & Sentiment Analysis
Integrating R & Hadoop - Text Mining & Sentiment Analysis
TextMining with R
TextMining with R
Text Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter Data
R by example: mining Twitter for consumer attitudes towards airlines
R by example: mining Twitter for consumer attitudes towards airlines
How Sentiment Analysis works
How Sentiment Analysis works
Tutorial of Sentiment Analysis
Tutorial of Sentiment Analysis
Introduction to Sentiment Analysis
Introduction to Sentiment Analysis
More from Sitaraman Ramachandran
High school extra curricular certificates.6
High school extra curricular certificates.6
Sitaraman Ramachandran
High school extra curricular certificates.5
High school extra curricular certificates.5
Sitaraman Ramachandran
High school extra curricular certificates.4
High school extra curricular certificates.4
Sitaraman Ramachandran
High school extra curricular certificates.3
High school extra curricular certificates.3
Sitaraman Ramachandran
High school extra curricular certificates.2
High school extra curricular certificates.2
Sitaraman Ramachandran
High school extra curricular certificates.1
High school extra curricular certificates.1
Sitaraman Ramachandran
Engi extra curricular certificate
Engi extra curricular certificate
Sitaraman Ramachandran
Noah's Ark
Noah's Ark
Sitaraman Ramachandran
bigdata_pig
bigdata_pig
Sitaraman Ramachandran
bigdata_nosql_DBaaS
bigdata_nosql_DBaaS
Sitaraman Ramachandran
bigdata_mapreduce
bigdata_mapreduce
Sitaraman Ramachandran
bigdata_hive
bigdata_hive
Sitaraman Ramachandran
bigdata_hadoop_fundamentals
bigdata_hadoop_fundamentals
Sitaraman Ramachandran
More from Sitaraman Ramachandran
(13)
High school extra curricular certificates.6
High school extra curricular certificates.6
High school extra curricular certificates.5
High school extra curricular certificates.5
High school extra curricular certificates.4
High school extra curricular certificates.4
High school extra curricular certificates.3
High school extra curricular certificates.3
High school extra curricular certificates.2
High school extra curricular certificates.2
High school extra curricular certificates.1
High school extra curricular certificates.1
Engi extra curricular certificate
Engi extra curricular certificate
Noah's Ark
Noah's Ark
bigdata_pig
bigdata_pig
bigdata_nosql_DBaaS
bigdata_nosql_DBaaS
bigdata_mapreduce
bigdata_mapreduce
bigdata_hive
bigdata_hive
bigdata_hadoop_fundamentals
bigdata_hadoop_fundamentals
bigdata_datascience_methodology
1.
Sitaraman Ramachandran Data Science
Methodology July 31, 2015
Download now