SlideShare a Scribd company logo
Qualitative Content Analysis
Ricky Bilakhia
Advisor – Prof. Aron Lindberg
Approach
Content analysis (CA) is a technique for systematically describing written, spoken or visual
communication and provides quantitative (numerical) description.
This type of analysis has gained popularity because other automated techniques prove to be
expensive or are unavailable in other cases. And is used to spot patterns or peculiar qualities in the
given data that automatic analysis can easily miss.
Future Work
Also we will be using Python for classification because it provides a number of packages which
automates the analysis and has a high accuracy rate
Business Intelligence & Analytics
http://www.stevens.edu/howe/academics/graduate/business-intelligence-analytics
Executive Summary
“Marketplace for work that requires Human Intelligence”
The emergence of online labor markets makes it far easier to use individual human raters to
evaluate materials for data collection and analysis in the real world.
Mechanical Turk is a well developed, commonly used and has the most information
available to assess its suitability for research.
Mechanical Turk is being used for research in disciplines such as natural language
processing, machine learning and human computer interaction.
Technology
R for generating the statistical model.
Amazon Web Service for the verification of work done through R.
Excel or HTML to fetch inputs and write results of the model.
Current Work
Successfully able to implement the creation, assignment and acquiring the HIT
response from Turkers.
Currently working on analyzing the reactions from the Turkers using “Qualitative
Content analysis”
Content Analysis has a wide appeal as providing information about the subjective
dimension of texts. It can be regarded as a classification technique, either binary
(polarity classification into positive/negative) or multi-class categorization (e.g.
positive/neutral/negative).
Purpose
•In a quest to develop a solution which will perform content analysis on a given source, we
came up with a system which is explained in the figure below.
•This solution is a holistic approach to extract, load, host, conduct, refine and analyze number
of user reviews based on Qualitative Content Analysis principle.
•The system is being developed in two different parts where the first part deals with extracting
user comments from any specified source and convert it to a .csv file which will form input to
our system.
•Our system will be responsible for reading these reviews from the csv file and create
questionnaires to present to the turkers.
•These questionnaires are then turned into HITs for our workers with help of Mechanical
Turk.
•Mechanical Turk also will help us filter the genuine workers from the spammers.
•Once we have all the results from the workers, its time to move to the analysis of reviews at
hand.
Research Stimulus
Data about a Research Stimulus:
This possibility involves that the researcher is
studying some collection of objects that need
humans to provide data about them.
 We refer these objects generically as research stimuli;
an individual Turker may work on many stimuli.
In this case, we assume that the data of interest
are inherent in the stimuli (i.e., different individuals
are not expected to have different interpretations) and
require observation with minimal interpretation.
This case describes many uses of AMT for research. For example we will provide
reviews on specific project; the Turkers identify whether they offered information or
emotional support.
Data Source
Deconstruct into
Tasks
Design HITs for Task
Register HITType
Create HIT(s)
Qualification
Requirements
Assignments
Review
Qualify Contact Bonus
Analyze Data
Installation of
MturkR
package and
Its Library.
Requestor
Login to the
AWS through
Credential()
Function
Data Source
for the HIT
Create Single
HIT
Create Bulk HIT
using Template
Submit HIT
Worker Logs
to AWS
Pre-Qualification
Requirement for the
worker
Qualification
Test
Fail
Request to take HIT
Pass
Completes
the HIT
Incomplete
HIT
End
HIT Approval?
HIT Status
Payment
made to
Worker
Accept
End
Rejected
Payment
Received
End
Request HIT
Accepted
End
Rejected
Worker
Contact Worker
Creation of HIT
Accept HIT
Requester
Worker

More Related Content

What's hot

Thinking About Guideline for Data Interoperability - Design concept and workf...
Thinking About Guideline for Data Interoperability - Design concept and workf...Thinking About Guideline for Data Interoperability - Design concept and workf...
Thinking About Guideline for Data Interoperability - Design concept and workf...
Open Cyber University of Korea
 
Sentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big DataSentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big Data
Iswarya M
 
AtanuResume_SBU
AtanuResume_SBUAtanuResume_SBU
AtanuResume_SBU
Atanu Ghosh
 
Artemenko-poster
Artemenko-posterArtemenko-poster
Best Python Libraries For Data Science & Machine Learning | Edureka
Best Python Libraries For Data Science & Machine Learning | EdurekaBest Python Libraries For Data Science & Machine Learning | Edureka
Best Python Libraries For Data Science & Machine Learning | Edureka
Edureka!
 
Prashobh_Resume_2_001
Prashobh_Resume_2_001Prashobh_Resume_2_001
Prashobh_Resume_2_001
Prashobh Paul Maliyekal Nambadan
 
Explore The Machine Learning and TensorFlow
Explore The Machine Learning and TensorFlowExplore The Machine Learning and TensorFlow
Explore The Machine Learning and TensorFlow
MahaKhalidALhobishi
 
Lectures 1,2,3
Lectures 1,2,3Lectures 1,2,3
Lectures 1,2,3
alaa223
 
Artificial Intelligence for Automating Data Analysis
Artificial Intelligence for Automating Data AnalysisArtificial Intelligence for Automating Data Analysis
Artificial Intelligence for Automating Data Analysis
Manuel Martín
 
User friendly pattern search paradigm
User friendly pattern search paradigmUser friendly pattern search paradigm
User friendly pattern search paradigm
Migrant Systems
 
Technical Whitepaper: A Knowledge Correlation Search Engine
Technical Whitepaper: A Knowledge Correlation Search EngineTechnical Whitepaper: A Knowledge Correlation Search Engine
Technical Whitepaper: A Knowledge Correlation Search Engine
s0P5a41b
 
Vector space model of information retrieval
Vector space model of information retrievalVector space model of information retrieval
Vector space model of information retrieval
Nanthini Dominique
 
Nikolay Karpov - Evolvable Semantic Platform for Facilitating Knowledge Exchange
Nikolay Karpov - Evolvable Semantic Platform for Facilitating Knowledge ExchangeNikolay Karpov - Evolvable Semantic Platform for Facilitating Knowledge Exchange
Nikolay Karpov - Evolvable Semantic Platform for Facilitating Knowledge Exchange
AIST
 
Algorithm Procedure and Pseudo Code Mining
Algorithm Procedure and Pseudo Code MiningAlgorithm Procedure and Pseudo Code Mining
Algorithm Procedure and Pseudo Code Mining
IRJET Journal
 
Information retrieval-systems notes
Information retrieval-systems notesInformation retrieval-systems notes
Information retrieval-systems notes
BAIRAVI T
 
IRJET- Determining Document Relevance using Keyword Extraction
IRJET-  	  Determining Document Relevance using Keyword ExtractionIRJET-  	  Determining Document Relevance using Keyword Extraction
IRJET- Determining Document Relevance using Keyword Extraction
IRJET Journal
 
ResumeAmanRajJuly2016
ResumeAmanRajJuly2016ResumeAmanRajJuly2016
ResumeAmanRajJuly2016
Aman Raj
 
Data analytics beyond data processing and how it affects Industry 4.0
Data analytics beyond data processing and how it affects Industry 4.0Data analytics beyond data processing and how it affects Industry 4.0
Data analytics beyond data processing and how it affects Industry 4.0
Mathieu d'Aquin
 
Resume
ResumeResume
Findability through Traceability - A Realistic Application of Candidate Tr...
Findability through Traceability  - A Realistic Application of Candidate Tr...Findability through Traceability  - A Realistic Application of Candidate Tr...
Findability through Traceability - A Realistic Application of Candidate Tr...
Markus Borg
 

What's hot (20)

Thinking About Guideline for Data Interoperability - Design concept and workf...
Thinking About Guideline for Data Interoperability - Design concept and workf...Thinking About Guideline for Data Interoperability - Design concept and workf...
Thinking About Guideline for Data Interoperability - Design concept and workf...
 
Sentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big DataSentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big Data
 
AtanuResume_SBU
AtanuResume_SBUAtanuResume_SBU
AtanuResume_SBU
 
Artemenko-poster
Artemenko-posterArtemenko-poster
Artemenko-poster
 
Best Python Libraries For Data Science & Machine Learning | Edureka
Best Python Libraries For Data Science & Machine Learning | EdurekaBest Python Libraries For Data Science & Machine Learning | Edureka
Best Python Libraries For Data Science & Machine Learning | Edureka
 
Prashobh_Resume_2_001
Prashobh_Resume_2_001Prashobh_Resume_2_001
Prashobh_Resume_2_001
 
Explore The Machine Learning and TensorFlow
Explore The Machine Learning and TensorFlowExplore The Machine Learning and TensorFlow
Explore The Machine Learning and TensorFlow
 
Lectures 1,2,3
Lectures 1,2,3Lectures 1,2,3
Lectures 1,2,3
 
Artificial Intelligence for Automating Data Analysis
Artificial Intelligence for Automating Data AnalysisArtificial Intelligence for Automating Data Analysis
Artificial Intelligence for Automating Data Analysis
 
User friendly pattern search paradigm
User friendly pattern search paradigmUser friendly pattern search paradigm
User friendly pattern search paradigm
 
Technical Whitepaper: A Knowledge Correlation Search Engine
Technical Whitepaper: A Knowledge Correlation Search EngineTechnical Whitepaper: A Knowledge Correlation Search Engine
Technical Whitepaper: A Knowledge Correlation Search Engine
 
Vector space model of information retrieval
Vector space model of information retrievalVector space model of information retrieval
Vector space model of information retrieval
 
Nikolay Karpov - Evolvable Semantic Platform for Facilitating Knowledge Exchange
Nikolay Karpov - Evolvable Semantic Platform for Facilitating Knowledge ExchangeNikolay Karpov - Evolvable Semantic Platform for Facilitating Knowledge Exchange
Nikolay Karpov - Evolvable Semantic Platform for Facilitating Knowledge Exchange
 
Algorithm Procedure and Pseudo Code Mining
Algorithm Procedure and Pseudo Code MiningAlgorithm Procedure and Pseudo Code Mining
Algorithm Procedure and Pseudo Code Mining
 
Information retrieval-systems notes
Information retrieval-systems notesInformation retrieval-systems notes
Information retrieval-systems notes
 
IRJET- Determining Document Relevance using Keyword Extraction
IRJET-  	  Determining Document Relevance using Keyword ExtractionIRJET-  	  Determining Document Relevance using Keyword Extraction
IRJET- Determining Document Relevance using Keyword Extraction
 
ResumeAmanRajJuly2016
ResumeAmanRajJuly2016ResumeAmanRajJuly2016
ResumeAmanRajJuly2016
 
Data analytics beyond data processing and how it affects Industry 4.0
Data analytics beyond data processing and how it affects Industry 4.0Data analytics beyond data processing and how it affects Industry 4.0
Data analytics beyond data processing and how it affects Industry 4.0
 
Resume
ResumeResume
Resume
 
Findability through Traceability - A Realistic Application of Candidate Tr...
Findability through Traceability  - A Realistic Application of Candidate Tr...Findability through Traceability  - A Realistic Application of Candidate Tr...
Findability through Traceability - A Realistic Application of Candidate Tr...
 

Similar to Qualitative Content Analysis

Recommendation system (1).pptx
Recommendation system (1).pptxRecommendation system (1).pptx
Recommendation system (1).pptx
prathammishra28
 
recommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdfrecommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdf
13DikshaDatir
 
Prescriptive Analytics-1.pptx
Prescriptive Analytics-1.pptxPrescriptive Analytics-1.pptx
Prescriptive Analytics-1.pptx
Karthik132344
 
IRJET - Twitter Sentiment Analysis using Machine Learning
IRJET -  	  Twitter Sentiment Analysis using Machine LearningIRJET -  	  Twitter Sentiment Analysis using Machine Learning
IRJET - Twitter Sentiment Analysis using Machine Learning
IRJET Journal
 
Co-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online ReviewsCo-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online Reviews
Editor IJCATR
 
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
IRJET Journal
 
Visualization of Computer Forensics Analysis on Digital Evidence
Visualization of Computer Forensics Analysis on Digital EvidenceVisualization of Computer Forensics Analysis on Digital Evidence
Visualization of Computer Forensics Analysis on Digital Evidence
Muhd Mu'izuddin
 
Empirical Model of Supervised Learning Approach for Opinion Mining
Empirical Model of Supervised Learning Approach for Opinion MiningEmpirical Model of Supervised Learning Approach for Opinion Mining
Empirical Model of Supervised Learning Approach for Opinion Mining
IRJET Journal
 
Sentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data MiningSentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data Mining
IRJET Journal
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
Sumit Raj
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptx
amitparashar42
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptx
amitparashar42
 
Neural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment AnalysisNeural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment Analysis
Editor IJCATR
 
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
inventionjournals
 
Methods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature StudyMethods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature Study
vivatechijri
 
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET Journal
 
Introduction
IntroductionIntroduction
Introduction
sarojbhavaraju5
 
NLP based Mining on Movie Critics
NLP based Mining on Movie Critics NLP based Mining on Movie Critics
NLP based Mining on Movie Critics
supraja reddy
 
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - TrivadisTechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
Trivadis
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
Mahir Haque
 

Similar to Qualitative Content Analysis (20)

Recommendation system (1).pptx
Recommendation system (1).pptxRecommendation system (1).pptx
Recommendation system (1).pptx
 
recommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdfrecommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdf
 
Prescriptive Analytics-1.pptx
Prescriptive Analytics-1.pptxPrescriptive Analytics-1.pptx
Prescriptive Analytics-1.pptx
 
IRJET - Twitter Sentiment Analysis using Machine Learning
IRJET -  	  Twitter Sentiment Analysis using Machine LearningIRJET -  	  Twitter Sentiment Analysis using Machine Learning
IRJET - Twitter Sentiment Analysis using Machine Learning
 
Co-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online ReviewsCo-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online Reviews
 
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
 
Visualization of Computer Forensics Analysis on Digital Evidence
Visualization of Computer Forensics Analysis on Digital EvidenceVisualization of Computer Forensics Analysis on Digital Evidence
Visualization of Computer Forensics Analysis on Digital Evidence
 
Empirical Model of Supervised Learning Approach for Opinion Mining
Empirical Model of Supervised Learning Approach for Opinion MiningEmpirical Model of Supervised Learning Approach for Opinion Mining
Empirical Model of Supervised Learning Approach for Opinion Mining
 
Sentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data MiningSentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data Mining
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptx
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptx
 
Neural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment AnalysisNeural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment Analysis
 
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
 
Methods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature StudyMethods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature Study
 
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
 
Introduction
IntroductionIntroduction
Introduction
 
NLP based Mining on Movie Critics
NLP based Mining on Movie Critics NLP based Mining on Movie Critics
NLP based Mining on Movie Critics
 
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - TrivadisTechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 

Qualitative Content Analysis

  • 1. Qualitative Content Analysis Ricky Bilakhia Advisor – Prof. Aron Lindberg Approach Content analysis (CA) is a technique for systematically describing written, spoken or visual communication and provides quantitative (numerical) description. This type of analysis has gained popularity because other automated techniques prove to be expensive or are unavailable in other cases. And is used to spot patterns or peculiar qualities in the given data that automatic analysis can easily miss. Future Work Also we will be using Python for classification because it provides a number of packages which automates the analysis and has a high accuracy rate Business Intelligence & Analytics http://www.stevens.edu/howe/academics/graduate/business-intelligence-analytics Executive Summary “Marketplace for work that requires Human Intelligence” The emergence of online labor markets makes it far easier to use individual human raters to evaluate materials for data collection and analysis in the real world. Mechanical Turk is a well developed, commonly used and has the most information available to assess its suitability for research. Mechanical Turk is being used for research in disciplines such as natural language processing, machine learning and human computer interaction. Technology R for generating the statistical model. Amazon Web Service for the verification of work done through R. Excel or HTML to fetch inputs and write results of the model. Current Work Successfully able to implement the creation, assignment and acquiring the HIT response from Turkers. Currently working on analyzing the reactions from the Turkers using “Qualitative Content analysis” Content Analysis has a wide appeal as providing information about the subjective dimension of texts. It can be regarded as a classification technique, either binary (polarity classification into positive/negative) or multi-class categorization (e.g. positive/neutral/negative). Purpose •In a quest to develop a solution which will perform content analysis on a given source, we came up with a system which is explained in the figure below. •This solution is a holistic approach to extract, load, host, conduct, refine and analyze number of user reviews based on Qualitative Content Analysis principle. •The system is being developed in two different parts where the first part deals with extracting user comments from any specified source and convert it to a .csv file which will form input to our system. •Our system will be responsible for reading these reviews from the csv file and create questionnaires to present to the turkers. •These questionnaires are then turned into HITs for our workers with help of Mechanical Turk. •Mechanical Turk also will help us filter the genuine workers from the spammers. •Once we have all the results from the workers, its time to move to the analysis of reviews at hand. Research Stimulus Data about a Research Stimulus: This possibility involves that the researcher is studying some collection of objects that need humans to provide data about them.  We refer these objects generically as research stimuli; an individual Turker may work on many stimuli. In this case, we assume that the data of interest are inherent in the stimuli (i.e., different individuals are not expected to have different interpretations) and require observation with minimal interpretation. This case describes many uses of AMT for research. For example we will provide reviews on specific project; the Turkers identify whether they offered information or emotional support. Data Source Deconstruct into Tasks Design HITs for Task Register HITType Create HIT(s) Qualification Requirements Assignments Review Qualify Contact Bonus Analyze Data Installation of MturkR package and Its Library. Requestor Login to the AWS through Credential() Function Data Source for the HIT Create Single HIT Create Bulk HIT using Template Submit HIT Worker Logs to AWS Pre-Qualification Requirement for the worker Qualification Test Fail Request to take HIT Pass Completes the HIT Incomplete HIT End HIT Approval? HIT Status Payment made to Worker Accept End Rejected Payment Received End Request HIT Accepted End Rejected Worker Contact Worker Creation of HIT Accept HIT Requester Worker