SlideShare a Scribd company logo
DATA SOCIETY © 2015
TM
“If you can’t explain it simply, you don’t understand it well enough.”
- Albert Einstein
DATA SOCIETY © 2015
Contrarian Question
con·trar·i·an
kəәnˈtre(əә)rēəәn,kän-/
noun
noun: contrarian; plural noun: contrarians
1. 

a person who opposes or rejects popular opinion, especially in stock exchange dealing.

adjective
adjective: contrarian
1. 

opposing or rejecting popular opinion; going against current practice.
”the comment came more from a contrarian disposition than moral conviction"
DATA SOCIETY © 2015
Contrarian Question
Data has NO intrinsic value!!
DATA SOCIETY © 2015
Facts vs. Opinion(s)
Two main types of textual information.
• Facts and Opinions
Search engines are optimized for Facts;
Sentiment Analysis is a growing attempt (not completely
solved) to optimize the discovery of opinions.
Opinions Mining or Sentiment Analysis is an attempt to
recognize the opinion or sentiment that a person holds
toward an object.
DATA SOCIETY © 2015
Let's have a look at this*:
• 91 percent of people report having gone into a store because of an
online experience.
• 89 percent of consumers conduct research using search engines.
• 78 percent of consumers say that posts made by companies on
social media influence their purchases.
• 72 percent of consumers trust online reviews as much as personal
recommendations.
• 62 percent of consumers end up making a purchase in a store after
researching it online.
*Blogs, Comments (i.e. YouTube, Facebook), Reviews, Forums, Microblogging (i.e. Twitter) http://www.investopedia.com/terms/a/asymmetricinformation.asp
Information Asymmetry
DATA SOCIETY © 2015
Information Asymmetry
seller
buyer
information
information
information $
$
$
$
$
$
$
$
DATA SOCIETY © 2015
Where do we find sentiment?
• Movie / Books: Are the reviews on this movie/book positive/negative?

• Product Sales: What is thought of the new iPhone? 

• Public Sentiment: How do consumers feel about the economy? How is
consumer sentiment effecting sales by sector?

• Politics: How are voters polarized, if at all around a candidate or policy? 

• Prediction: Stock Prices, Election Outcomes, Market Trends, Product Sales
DATA SOCIETY © 2015
Scherer’s Typology of Emotions:
Scherer's typology of emotions is briefly explained as follows:
• Emotion: This is a brief, organically synchronized evaluation of a major event, for
example, being angry, sad, joyful, ashamed, proud, and elated i.e. fired or promoted at
work.
• Mood: This is a diffused, non-caused, low-intensity, long-duration change in subjective
feeling, for example, being cheerful, gloomy, irritable, listless, depressed, and buoyant
• Interpersonal stance: This is an affective stance towards another person in a specific
interaction, for example, being friendly, flirtatious, distant, cold, warm, supportive, and
contemptuous
• Attitudes: This is enduring, affectively colored beliefs or dispositions towards objects or
persons, for example, being liking, loving, hating, valuing, and desiring
• Personality traits: These are stable personality dispositions and typical behavior
tendencies, for example, being nervous, anxious, reckless, morose, hostile, and jealous
DATA SOCIETY © 2015
Goal: all measurement is to arrange items on a continuum (observed or unobserved).
Measurement:
DATA SOCIETY © 2015
1. Dictionary Based Sentiment Analysis
• i.e. Is an attitude toward an object positive or negative?
• e.g. Jeffrey Breen’s method, qdap
2. Supervised Learning for Sentiment Analysis.
• i.e. Given data we have seen in the past, can we predict class
assignment for our polarity measure (positive/neutral/negative)
• e.g. Naive Bayes, MaxEnt, SVM
3. Unsupervised Sentiment Analysis
• i.e. No dictionaries. No labeled data. No training algorithms. And,
scale words (often bi-grams) and users on a single dimension.
• e.g. latent variable models - IRT
Sentiment Analysis: Ordered Sophistication Lexicon Based
(Supervised)
DATA SOCIETY © 2015
• The Beige Book (http://www.federalreserve.gov/monetarypolicy/beigebook), more formally
called the Summary of Commentary on Current Economic Conditions, is a report published
by the United States Federal Research Board (FRB) eight times a year.
• The Beige Book has been in publication since 1985 and is now published online.
• The report is published by each (n=12) of the Federal Reserve Bank districts (e.g. Beige
Book (October 2013) is below)
• The content is rather anecdotal. The report interviews key business contacts, economists,
market experts, and others to get their opinion about the economy. 

• The data used in this book can be found on GitHub (https://github.com/
SocialMediaMininginR/beigebook), as well as the Python code for all the scraping and
parsing.
Beige Book
DATA SOCIETY © 2015
“Consumer spending grew modestly in most Districts. Auto sales
continued to be strong, particularly in the New York District where they
were said to be increasingly robust. In contrast, Chicago, Kansas City,
and Dallas indicated slower growth in auto sales in September.”
Beige Book
DATA SOCIETY © 2015
qdap
Determine polarity (a few ways to do this… none of which are perfect). qdap uses word clusters.
> pol.bb<- polarity(bb$text, grouping.var = bb$location, polarity.frame = POLKEY,
constrain = TRUE, negators = qdapDictionaries::negation.words,
amplifiers = qdapDictionaries::amplification.words,
deamplifiers = qdapDictionaries::deamplification.words, question.weight = 0,
amplifier.weight = .3, n.before = 4, n.after = 2, rm.incomplete = FALSE, digits = 3)
xi = word of polarity
xit = neutral (xi
0
); negator (xi
N
); amplifier(xi
a
); de-amplifier (xi
d
)
Each polarized word (xi) is then weighted w based on the weights from polarity frame
xi
T
xixi-1xi-2xi-3xi-4 xi+1 xi+2
DATA SOCIETY © 2015
Summary
Beige Book (1996 - 2013) with recession bars (pink)
DATA SOCIETY © 2015
thank you
Blog: http://socialmediaminingr.com
Twitter: @rheimann
http://datatactics.blogspot.com/2015/02/modern-approaches-to-sentiment-analysis.html

More Related Content

Viewers also liked

Data Tactics Data Science Brown Bag (April 2014)
Data Tactics Data Science Brown Bag (April 2014)Data Tactics Data Science Brown Bag (April 2014)
Data Tactics Data Science Brown Bag (April 2014)
Rich Heimann
 
Enterprise Cloud Best Practices
Enterprise Cloud Best PracticesEnterprise Cloud Best Practices
Enterprise Cloud Best Practices
Open Data Center Alliance
 
Forecast 2014: Opening Keynote
Forecast 2014: Opening KeynoteForecast 2014: Opening Keynote
Forecast 2014: Opening Keynote
Open Data Center Alliance
 
Why L-3 Data Tactics Data Science?
Why L-3 Data Tactics Data Science?Why L-3 Data Tactics Data Science?
Why L-3 Data Tactics Data Science?
Rich Heimann
 
Human Terrain Analysis at George Mason University (DAY 1)
Human Terrain Analysis at George Mason University (DAY 1)Human Terrain Analysis at George Mason University (DAY 1)
Human Terrain Analysis at George Mason University (DAY 1)
Rich Heimann
 
Workshop: Big Data Visualization for Security
Workshop: Big Data Visualization for SecurityWorkshop: Big Data Visualization for Security
Workshop: Big Data Visualization for Security
Raffael Marty
 
SlideShare 101
SlideShare 101SlideShare 101
SlideShare 101
Amit Ranjan
 

Viewers also liked (7)

Data Tactics Data Science Brown Bag (April 2014)
Data Tactics Data Science Brown Bag (April 2014)Data Tactics Data Science Brown Bag (April 2014)
Data Tactics Data Science Brown Bag (April 2014)
 
Enterprise Cloud Best Practices
Enterprise Cloud Best PracticesEnterprise Cloud Best Practices
Enterprise Cloud Best Practices
 
Forecast 2014: Opening Keynote
Forecast 2014: Opening KeynoteForecast 2014: Opening Keynote
Forecast 2014: Opening Keynote
 
Why L-3 Data Tactics Data Science?
Why L-3 Data Tactics Data Science?Why L-3 Data Tactics Data Science?
Why L-3 Data Tactics Data Science?
 
Human Terrain Analysis at George Mason University (DAY 1)
Human Terrain Analysis at George Mason University (DAY 1)Human Terrain Analysis at George Mason University (DAY 1)
Human Terrain Analysis at George Mason University (DAY 1)
 
Workshop: Big Data Visualization for Security
Workshop: Big Data Visualization for SecurityWorkshop: Big Data Visualization for Security
Workshop: Big Data Visualization for Security
 
SlideShare 101
SlideShare 101SlideShare 101
SlideShare 101
 

Similar to Guest Talk for Data Society's "INTRO TO DATA SCIENCE BOOT CAMP"

Annotated Literature Review and Supporting DataThe topic for thi.docx
Annotated Literature Review and Supporting DataThe topic for thi.docxAnnotated Literature Review and Supporting DataThe topic for thi.docx
Annotated Literature Review and Supporting DataThe topic for thi.docx
durantheseldine
 
The ultimate guide to data storytelling | Materclass
The ultimate guide to data storytelling | MaterclassThe ultimate guide to data storytelling | Materclass
The ultimate guide to data storytelling | Materclass
Gramener
 
Understanding local networks
Understanding local networksUnderstanding local networks
Understanding local networks
Anna De Vera
 
What do you really mean when you tweet? Challenges for opinion mining on soci...
What do you really mean when you tweet? Challenges for opinion mining on soci...What do you really mean when you tweet? Challenges for opinion mining on soci...
What do you really mean when you tweet? Challenges for opinion mining on soci...
Diana Maynard
 
What makes good research
What makes good research What makes good research
What makes good research
CharityComms
 
Using research to generate positive media coverage - improve your approach in...
Using research to generate positive media coverage - improve your approach in...Using research to generate positive media coverage - improve your approach in...
Using research to generate positive media coverage - improve your approach in...
CharityComms
 
Leader data
Leader dataLeader data
Leader data
Shourya Simha
 
Practical sentiment analysis
Practical sentiment analysisPractical sentiment analysis
Practical sentiment analysis
Diana Maynard
 
You are Political
You are PoliticalYou are Political
You are Political
Mr. Finnie
 
Social Data Sentiment Analysis
Social Data Sentiment AnalysisSocial Data Sentiment Analysis
Social Data Sentiment Analysis
Seth Grimes
 
Can Social Media Analysis Improve Collective Awareness of Climate Change?
Can Social Media Analysis Improve Collective Awareness of Climate Change?Can Social Media Analysis Improve Collective Awareness of Climate Change?
Can Social Media Analysis Improve Collective Awareness of Climate Change?Diana Maynard
 
The best stats you have ever seen
The best stats you have ever seenThe best stats you have ever seen
The best stats you have ever seen
Shourya Simha
 
Managing stakeholders from the disengaged to the difficult
Managing stakeholders from the disengaged to the difficultManaging stakeholders from the disengaged to the difficult
Managing stakeholders from the disengaged to the difficult
Mahmoud Ghoz
 
Ethos (think ETHICAL Appeal of the Writer)This appeal invo.docx
Ethos (think ETHICAL Appeal of the Writer)This appeal invo.docxEthos (think ETHICAL Appeal of the Writer)This appeal invo.docx
Ethos (think ETHICAL Appeal of the Writer)This appeal invo.docx
SANSKAR20
 
Effect of perceptional bias on decision making
Effect of perceptional bias on decision making Effect of perceptional bias on decision making
Effect of perceptional bias on decision making
Amrendra Roy
 
Sentiment in Social Media: The Genie in the Bottle
Sentiment in Social Media: The Genie in the BottleSentiment in Social Media: The Genie in the Bottle
Sentiment in Social Media: The Genie in the Bottle
Seth Grimes
 

Similar to Guest Talk for Data Society's "INTRO TO DATA SCIENCE BOOT CAMP" (20)

Annotated Literature Review and Supporting DataThe topic for thi.docx
Annotated Literature Review and Supporting DataThe topic for thi.docxAnnotated Literature Review and Supporting DataThe topic for thi.docx
Annotated Literature Review and Supporting DataThe topic for thi.docx
 
Cls8 decarbonet
Cls8 decarbonetCls8 decarbonet
Cls8 decarbonet
 
Unit 2 Human Development and Capability
Unit 2 Human Development and Capability Unit 2 Human Development and Capability
Unit 2 Human Development and Capability
 
The ultimate guide to data storytelling | Materclass
The ultimate guide to data storytelling | MaterclassThe ultimate guide to data storytelling | Materclass
The ultimate guide to data storytelling | Materclass
 
Understanding local networks
Understanding local networksUnderstanding local networks
Understanding local networks
 
What do you really mean when you tweet? Challenges for opinion mining on soci...
What do you really mean when you tweet? Challenges for opinion mining on soci...What do you really mean when you tweet? Challenges for opinion mining on soci...
What do you really mean when you tweet? Challenges for opinion mining on soci...
 
What makes good research
What makes good research What makes good research
What makes good research
 
Using research to generate positive media coverage - improve your approach in...
Using research to generate positive media coverage - improve your approach in...Using research to generate positive media coverage - improve your approach in...
Using research to generate positive media coverage - improve your approach in...
 
Leader data
Leader dataLeader data
Leader data
 
Practical sentiment analysis
Practical sentiment analysisPractical sentiment analysis
Practical sentiment analysis
 
Sentiment Analysis.pptx
Sentiment Analysis.pptxSentiment Analysis.pptx
Sentiment Analysis.pptx
 
You are Political
You are PoliticalYou are Political
You are Political
 
Social Data Sentiment Analysis
Social Data Sentiment AnalysisSocial Data Sentiment Analysis
Social Data Sentiment Analysis
 
Can Social Media Analysis Improve Collective Awareness of Climate Change?
Can Social Media Analysis Improve Collective Awareness of Climate Change?Can Social Media Analysis Improve Collective Awareness of Climate Change?
Can Social Media Analysis Improve Collective Awareness of Climate Change?
 
The best stats you have ever seen
The best stats you have ever seenThe best stats you have ever seen
The best stats you have ever seen
 
Managing stakeholders from the disengaged to the difficult
Managing stakeholders from the disengaged to the difficultManaging stakeholders from the disengaged to the difficult
Managing stakeholders from the disengaged to the difficult
 
Ethos (think ETHICAL Appeal of the Writer)This appeal invo.docx
Ethos (think ETHICAL Appeal of the Writer)This appeal invo.docxEthos (think ETHICAL Appeal of the Writer)This appeal invo.docx
Ethos (think ETHICAL Appeal of the Writer)This appeal invo.docx
 
Sentiment Analysis.pptx
Sentiment Analysis.pptxSentiment Analysis.pptx
Sentiment Analysis.pptx
 
Effect of perceptional bias on decision making
Effect of perceptional bias on decision making Effect of perceptional bias on decision making
Effect of perceptional bias on decision making
 
Sentiment in Social Media: The Genie in the Bottle
Sentiment in Social Media: The Genie in the BottleSentiment in Social Media: The Genie in the Bottle
Sentiment in Social Media: The Genie in the Bottle
 

More from Rich Heimann

GES673 SP2014 Intro Lecture
GES673 SP2014 Intro LectureGES673 SP2014 Intro Lecture
GES673 SP2014 Intro LectureRich Heimann
 
Data Tactics Analytics Brown Bag (November 2013)
Data Tactics Analytics Brown Bag (November 2013)Data Tactics Analytics Brown Bag (November 2013)
Data Tactics Analytics Brown Bag (November 2013)
Rich Heimann
 
A Blended Approach to Analytics at Data Tactics Corporation
A Blended Approach to Analytics at Data Tactics CorporationA Blended Approach to Analytics at Data Tactics Corporation
A Blended Approach to Analytics at Data Tactics Corporation
Rich Heimann
 
Data Tactics Analytics Brown Bag (Aug 22, 2013)
Data Tactics Analytics Brown Bag (Aug 22, 2013)Data Tactics Analytics Brown Bag (Aug 22, 2013)
Data Tactics Analytics Brown Bag (Aug 22, 2013)
Rich Heimann
 
Spatial Analysis; The Primitives at UMBC
Spatial Analysis; The Primitives at UMBCSpatial Analysis; The Primitives at UMBC
Spatial Analysis; The Primitives at UMBC
Rich Heimann
 
Spatial Analysis and Geomatics
Spatial Analysis and GeomaticsSpatial Analysis and Geomatics
Spatial Analysis and Geomatics
Rich Heimann
 
Week 1 Lecture @ UMBC
Week 1 Lecture @ UMBCWeek 1 Lecture @ UMBC
Week 1 Lecture @ UMBC
Rich Heimann
 

More from Rich Heimann (8)

DS4G
DS4GDS4G
DS4G
 
GES673 SP2014 Intro Lecture
GES673 SP2014 Intro LectureGES673 SP2014 Intro Lecture
GES673 SP2014 Intro Lecture
 
Data Tactics Analytics Brown Bag (November 2013)
Data Tactics Analytics Brown Bag (November 2013)Data Tactics Analytics Brown Bag (November 2013)
Data Tactics Analytics Brown Bag (November 2013)
 
A Blended Approach to Analytics at Data Tactics Corporation
A Blended Approach to Analytics at Data Tactics CorporationA Blended Approach to Analytics at Data Tactics Corporation
A Blended Approach to Analytics at Data Tactics Corporation
 
Data Tactics Analytics Brown Bag (Aug 22, 2013)
Data Tactics Analytics Brown Bag (Aug 22, 2013)Data Tactics Analytics Brown Bag (Aug 22, 2013)
Data Tactics Analytics Brown Bag (Aug 22, 2013)
 
Spatial Analysis; The Primitives at UMBC
Spatial Analysis; The Primitives at UMBCSpatial Analysis; The Primitives at UMBC
Spatial Analysis; The Primitives at UMBC
 
Spatial Analysis and Geomatics
Spatial Analysis and GeomaticsSpatial Analysis and Geomatics
Spatial Analysis and Geomatics
 
Week 1 Lecture @ UMBC
Week 1 Lecture @ UMBCWeek 1 Lecture @ UMBC
Week 1 Lecture @ UMBC
 

Recently uploaded

Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 

Recently uploaded (20)

Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 

Guest Talk for Data Society's "INTRO TO DATA SCIENCE BOOT CAMP"

  • 1. DATA SOCIETY © 2015 TM “If you can’t explain it simply, you don’t understand it well enough.” - Albert Einstein
  • 2. DATA SOCIETY © 2015 Contrarian Question con·trar·i·an kəәnˈtre(əә)rēəәn,kän-/ noun noun: contrarian; plural noun: contrarians 1. 
 a person who opposes or rejects popular opinion, especially in stock exchange dealing.
 adjective adjective: contrarian 1. 
 opposing or rejecting popular opinion; going against current practice. ”the comment came more from a contrarian disposition than moral conviction"
  • 3. DATA SOCIETY © 2015 Contrarian Question Data has NO intrinsic value!!
  • 4. DATA SOCIETY © 2015 Facts vs. Opinion(s) Two main types of textual information. • Facts and Opinions Search engines are optimized for Facts; Sentiment Analysis is a growing attempt (not completely solved) to optimize the discovery of opinions. Opinions Mining or Sentiment Analysis is an attempt to recognize the opinion or sentiment that a person holds toward an object.
  • 5. DATA SOCIETY © 2015 Let's have a look at this*: • 91 percent of people report having gone into a store because of an online experience. • 89 percent of consumers conduct research using search engines. • 78 percent of consumers say that posts made by companies on social media influence their purchases. • 72 percent of consumers trust online reviews as much as personal recommendations. • 62 percent of consumers end up making a purchase in a store after researching it online. *Blogs, Comments (i.e. YouTube, Facebook), Reviews, Forums, Microblogging (i.e. Twitter) http://www.investopedia.com/terms/a/asymmetricinformation.asp Information Asymmetry
  • 6. DATA SOCIETY © 2015 Information Asymmetry seller buyer information information information $ $ $ $ $ $ $ $
  • 7. DATA SOCIETY © 2015 Where do we find sentiment? • Movie / Books: Are the reviews on this movie/book positive/negative? • Product Sales: What is thought of the new iPhone? • Public Sentiment: How do consumers feel about the economy? How is consumer sentiment effecting sales by sector? • Politics: How are voters polarized, if at all around a candidate or policy? • Prediction: Stock Prices, Election Outcomes, Market Trends, Product Sales
  • 8. DATA SOCIETY © 2015 Scherer’s Typology of Emotions: Scherer's typology of emotions is briefly explained as follows: • Emotion: This is a brief, organically synchronized evaluation of a major event, for example, being angry, sad, joyful, ashamed, proud, and elated i.e. fired or promoted at work. • Mood: This is a diffused, non-caused, low-intensity, long-duration change in subjective feeling, for example, being cheerful, gloomy, irritable, listless, depressed, and buoyant • Interpersonal stance: This is an affective stance towards another person in a specific interaction, for example, being friendly, flirtatious, distant, cold, warm, supportive, and contemptuous • Attitudes: This is enduring, affectively colored beliefs or dispositions towards objects or persons, for example, being liking, loving, hating, valuing, and desiring • Personality traits: These are stable personality dispositions and typical behavior tendencies, for example, being nervous, anxious, reckless, morose, hostile, and jealous
  • 9. DATA SOCIETY © 2015 Goal: all measurement is to arrange items on a continuum (observed or unobserved). Measurement:
  • 10. DATA SOCIETY © 2015 1. Dictionary Based Sentiment Analysis • i.e. Is an attitude toward an object positive or negative? • e.g. Jeffrey Breen’s method, qdap 2. Supervised Learning for Sentiment Analysis. • i.e. Given data we have seen in the past, can we predict class assignment for our polarity measure (positive/neutral/negative) • e.g. Naive Bayes, MaxEnt, SVM 3. Unsupervised Sentiment Analysis • i.e. No dictionaries. No labeled data. No training algorithms. And, scale words (often bi-grams) and users on a single dimension. • e.g. latent variable models - IRT Sentiment Analysis: Ordered Sophistication Lexicon Based (Supervised)
  • 11. DATA SOCIETY © 2015 • The Beige Book (http://www.federalreserve.gov/monetarypolicy/beigebook), more formally called the Summary of Commentary on Current Economic Conditions, is a report published by the United States Federal Research Board (FRB) eight times a year. • The Beige Book has been in publication since 1985 and is now published online. • The report is published by each (n=12) of the Federal Reserve Bank districts (e.g. Beige Book (October 2013) is below) • The content is rather anecdotal. The report interviews key business contacts, economists, market experts, and others to get their opinion about the economy. 
 • The data used in this book can be found on GitHub (https://github.com/ SocialMediaMininginR/beigebook), as well as the Python code for all the scraping and parsing. Beige Book
  • 12. DATA SOCIETY © 2015 “Consumer spending grew modestly in most Districts. Auto sales continued to be strong, particularly in the New York District where they were said to be increasingly robust. In contrast, Chicago, Kansas City, and Dallas indicated slower growth in auto sales in September.” Beige Book
  • 13. DATA SOCIETY © 2015 qdap Determine polarity (a few ways to do this… none of which are perfect). qdap uses word clusters. > pol.bb<- polarity(bb$text, grouping.var = bb$location, polarity.frame = POLKEY, constrain = TRUE, negators = qdapDictionaries::negation.words, amplifiers = qdapDictionaries::amplification.words, deamplifiers = qdapDictionaries::deamplification.words, question.weight = 0, amplifier.weight = .3, n.before = 4, n.after = 2, rm.incomplete = FALSE, digits = 3) xi = word of polarity xit = neutral (xi 0 ); negator (xi N ); amplifier(xi a ); de-amplifier (xi d ) Each polarized word (xi) is then weighted w based on the weights from polarity frame xi T xixi-1xi-2xi-3xi-4 xi+1 xi+2
  • 14. DATA SOCIETY © 2015 Summary Beige Book (1996 - 2013) with recession bars (pink)
  • 15. DATA SOCIETY © 2015 thank you Blog: http://socialmediaminingr.com Twitter: @rheimann http://datatactics.blogspot.com/2015/02/modern-approaches-to-sentiment-analysis.html