SlideShare a Scribd company logo
Crowdsourcing Predictors of
Behavioral Outcomes
Presented by
Alekya.Yermal(Lead)
(10NJ1A0502)
I.Aiysha Sham (10NJ1A0514)
S.M.V.N.Sowndarya(10NJ1A0537)
V.Padmavathi (10NJ1A0542)
Under the guidance of
Mrs.D.Usha Rajeswari
CROWD SOURCING
PREDICTORS OF
BEHAVIORAL OUTCOMES
Retrieving bulk of data from crowd
Introduction
Text
ext
Text
Text
Text
•Generating models from large data sets—and determining which subsets of data to
mine —is becoming increasingly automated.
• This was accomplished by building a Web platform in which human groups
interact to both respond to questions likely to help predict a behavioral outcome
and pose new questions to their peers.
• Result- dynamically growing online survey, this behavior also leads to models
that can predict the user’s outcomes based on their responses to the user-generated
survey questions.
• Example:
1) Predict users’ monthly electric energy consumption.
2) Predict users’ body mass index.
Existing System
• Statistical tools such as multiple regression or neural networks provide mature
methods for computing model parameters when the set of predictive covariates
and the model structure are pre-specified.
• The task of choosing which potentially predictive variables to study is largely
a qualitative task that requires substantial domain expertise.
•Example:
1) A survey designer must have domain expertise to choose questions
that will identify predictive covariates.
2) An engineer must develop substantial familiarity with a design in order to
determine which variables can be systematically adjusted in order to optimize
performance.
Disadvantages of existing system
• There are many problems in which one seeks to develop predictive
models to map between a set of predictor variables and an outcome.
• One aspect of the scientific method that has not yet yielded to automation is the
selection of variables for which data should be collected to evaluate hypotheses.
• In the case of a prediction problem, machine science is not yet able to select the
independent variables that might predict an outcome of interest, and for which
data collection is required.
Proposed System
TEXT TEXT TEXT TEXT
• The goal of this research is to test an alternative approach to model in which the wisdom of
crowds is harnessed to both propose which potentially predictive variables to study by asking
questions and to respond to those questions, in order to develop a predictive model.
• This paper introduces, for the first time, a method by which non-domain experts can be
motivated to formulate independent variables as well as populate enough of these variables for
successful modeling.
• Users arrive at a Web site in which a behavioral outcome is to be modeled.
Users provide their own outcome and then answer questions that may be predictive of that
outcome .
• Periodically, models are constructed against the growing data set that predict each user’s
behavioral outcome. Users may also pose their own questions that, when answered by other
users, become new independent variables in the modeling process.
Advantages of proposed system
• Participants successfully uncovered at least one statistically significant predictor of
the outcome variable.
• For the BMI outcome, the participants successfully formulated many of the correlates
known to predict BMI and provided sufficiently honest values for those correlates to
become predictive during the experiment.
• While, our instantiations focus on energy and BMI, the proposed method is general
and might, as the method improves, be useful to answer many difficult questions
regarding why some outcomes are different than others.
SOFTWARE CONFIGURATION
Operating System : Windows 7, 8.
Coding Language: JAVA/JSP
Front End design: HTML, CSS, JavaScript
Database : MYSQL
Database Connectivity: JDBC
Server: Apache Tomcat V.7
Hardware requirements
TEXT TEXTTEXT TEXT
Processor - Pentium –IV
Speed - 1.1 GHz
RAM - 256 MB(min)
Hard Disk - 20 GB
Key Board - Standard Windows Keyboard
Monitor - SVGA
Main Modules:-
1. Investigator Behavior
2. User Behavior
3. Model Behavior
Investigator Behavior
The investigator is responsible for initially creating the web platform, and
seeding it with a starting question. Then, as the experiment runs they
filter new survey questions generated by the users.
 However, once posed, the question was filtered by the investigator as
to its suitability . A question was deemed unsuitable if any of the
following conditions were met:
(1) the question revealed the identity of its author (e.g. “Hi, I am John
Doe. I would like to know if...”) thereby contravening the Institutional
Review Board approval for these experiments;
(2) the question contained profanity or hateful text;
(3) the question was inappropriately correlated with the outcome (e.g.
“What is your BMI?”).
User Behavior
 Users who visit the site first provide their individual value for
the outcome of interest. Users may then respond to questions
found on the site .
 Their answers are stored in a common data set and made
available to the modeling engine.
 At any time a user may elect to pose a question of their own
devising. Users could pose questions that required a yes/no
response, a five-level Liker rating, or a number. Users were
not constrained in what kinds of questions to pose.
Model Behavior
•The modeling engine continually generates predictive
models using the survey questions as candidate
predictors of the outcome and users’ responses as the
training data.
Architecture
System Design
 Admin
User
Use Case Diagram
Class Diagram
Activity Diagram
Sequence Diagram
Sample screens of
implementation
•Creating user profile
•User login page
•Home Page
User creates quiz question
User creates poll question
Select quiz
Add question to quiz
User playing quiz
User selects poll
Poll question-user
User submits poll question
Rate quiz
Rating
Admin login
Add question - pool
Conclusion
This paper introduced a new approach to social science modeling in which the
participants themselves are motivated to uncover the correlates of some human
behavior outcome, such as homeowner electricity usage or body mass index.
In both cases participants successfully uncovered at least one statistically significant
predictor of the outcome variable.
The main goal of this paper is to demonstrate a system that enables non domain
experts to collectively formulate many of the known (and possibly unknown) predictors
of a behavioral outcome, and that this system is independent of the outcome of
interest.

More Related Content

What's hot

Active Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a SurveyActive Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a Survey
University of Bergen
 
Evaluation and User Study in HCI
Evaluation and User Study in HCIEvaluation and User Study in HCI
Evaluation and User Study in HCI
Byungkyu (Jay) Kang
 
Artificial Intelligence for Societal Impact
Artificial Intelligence for Societal ImpactArtificial Intelligence for Societal Impact
Artificial Intelligence for Societal Impact
Amit Sharma
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
Milind Gokhale
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System Challenges
Alan Said
 
ICELW Conference Slides
ICELW Conference SlidesICELW Conference Slides
ICELW Conference Slides
toolboc
 
Machine learning meets user analytics - Metageni tech talk
Machine learning meets user analytics - Metageni tech talkMachine learning meets user analytics - Metageni tech talk
Machine learning meets user analytics - Metageni tech talk
Gabriel Hughes PhD
 
Evaluating Collaborative Filtering Recommender Systems
Evaluating Collaborative Filtering Recommender SystemsEvaluating Collaborative Filtering Recommender Systems
Evaluating Collaborative Filtering Recommender Systems
MegaVjohnson
 
Ontological and clustering approach for content based recommendation systems
Ontological and clustering approach for content based recommendation systemsOntological and clustering approach for content based recommendation systems
Ontological and clustering approach for content based recommendation systems
vikramadityajakkula
 
An Independent Study Comparing SPSS to Intellectus Statistics: Preliminary ...
 An Independent Study Comparing SPSS to  Intellectus Statistics: Preliminary ... An Independent Study Comparing SPSS to  Intellectus Statistics: Preliminary ...
An Independent Study Comparing SPSS to Intellectus Statistics: Preliminary ...
Statistics Solutions
 
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
David Zibriczky
 
[WI 2017] Affective Prediction By Collaborative Chains In Movie Recommendation
[WI 2017] Affective Prediction By Collaborative Chains In Movie Recommendation[WI 2017] Affective Prediction By Collaborative Chains In Movie Recommendation
[WI 2017] Affective Prediction By Collaborative Chains In Movie Recommendation
YONG ZHENG
 
Auditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographicsAuditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographics
Amit Sharma
 
Example: movielens data with mahout
Example: movielens data with mahoutExample: movielens data with mahout
Example: movielens data with mahout
Gregg Barrett
 
The Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practiceThe Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practice
Amit Sharma
 
Literature review
Literature reviewLiterature review
Literature review
Darshit Kanziya
 
Towards Automatic Analysis of Online Discussions among Hong Kong Students
Towards Automatic Analysis of Online Discussions among Hong Kong StudentsTowards Automatic Analysis of Online Discussions among Hong Kong Students
Towards Automatic Analysis of Online Discussions among Hong Kong Students
CITE
 

What's hot (19)

De carlo rizk 2010 icelw
De carlo rizk 2010 icelwDe carlo rizk 2010 icelw
De carlo rizk 2010 icelw
 
Active Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a SurveyActive Learning in Collaborative Filtering Recommender Systems : a Survey
Active Learning in Collaborative Filtering Recommender Systems : a Survey
 
Evaluation and User Study in HCI
Evaluation and User Study in HCIEvaluation and User Study in HCI
Evaluation and User Study in HCI
 
Artificial Intelligence for Societal Impact
Artificial Intelligence for Societal ImpactArtificial Intelligence for Societal Impact
Artificial Intelligence for Societal Impact
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System Challenges
 
ICELW Conference Slides
ICELW Conference SlidesICELW Conference Slides
ICELW Conference Slides
 
Machine learning meets user analytics - Metageni tech talk
Machine learning meets user analytics - Metageni tech talkMachine learning meets user analytics - Metageni tech talk
Machine learning meets user analytics - Metageni tech talk
 
Evaluating Collaborative Filtering Recommender Systems
Evaluating Collaborative Filtering Recommender SystemsEvaluating Collaborative Filtering Recommender Systems
Evaluating Collaborative Filtering Recommender Systems
 
Ontological and clustering approach for content based recommendation systems
Ontological and clustering approach for content based recommendation systemsOntological and clustering approach for content based recommendation systems
Ontological and clustering approach for content based recommendation systems
 
People issues
People issuesPeople issues
People issues
 
An Independent Study Comparing SPSS to Intellectus Statistics: Preliminary ...
 An Independent Study Comparing SPSS to  Intellectus Statistics: Preliminary ... An Independent Study Comparing SPSS to  Intellectus Statistics: Preliminary ...
An Independent Study Comparing SPSS to Intellectus Statistics: Preliminary ...
 
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
A Combination of Simple Models by Forward Predictor Selection for Job Recomme...
 
[WI 2017] Affective Prediction By Collaborative Chains In Movie Recommendation
[WI 2017] Affective Prediction By Collaborative Chains In Movie Recommendation[WI 2017] Affective Prediction By Collaborative Chains In Movie Recommendation
[WI 2017] Affective Prediction By Collaborative Chains In Movie Recommendation
 
Auditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographicsAuditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographics
 
Example: movielens data with mahout
Example: movielens data with mahoutExample: movielens data with mahout
Example: movielens data with mahout
 
The Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practiceThe Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practice
 
Literature review
Literature reviewLiterature review
Literature review
 
Towards Automatic Analysis of Online Discussions among Hong Kong Students
Towards Automatic Analysis of Online Discussions among Hong Kong StudentsTowards Automatic Analysis of Online Discussions among Hong Kong Students
Towards Automatic Analysis of Online Discussions among Hong Kong Students
 

Viewers also liked

Formulas estructurales
Formulas estructuralesFormulas estructurales
Formulas estructurales
Mauricio Vazquez
 
Aesthetics Shows
Aesthetics ShowsAesthetics Shows
Aesthetics Shows
Berivan FAYARD
 
Curriculum vitae English.doc
Curriculum vitae English.docCurriculum vitae English.doc
Curriculum vitae English.docalfred janssen
 
Quartiere Piazzo: il progetto del M5s Biella
Quartiere Piazzo: il progetto del M5s BiellaQuartiere Piazzo: il progetto del M5s Biella
Quartiere Piazzo: il progetto del M5s Biella
Nicolò Sapellani
 
Who is who around the table presentation
Who is who around the table   presentationWho is who around the table   presentation
Who is who around the table presentation
Jan Gleisner
 
Datapoint Systems 04-02-2016
Datapoint Systems 04-02-2016Datapoint Systems 04-02-2016
Datapoint Systems 04-02-2016
BNI Exponential
 
Клинические испытания в гомеопатии: аргументы для скептиков
Клинические испытания в гомеопатии: аргументы для  скептиковКлинические испытания в гомеопатии: аргументы для  скептиков
Клинические испытания в гомеопатии: аргументы для скептиков
Vubuntu Vera
 
Заболевания новорожденных
Заболевания новорожденныхЗаболевания новорожденных
Заболевания новорожденных
Vubuntu Vera
 

Viewers also liked (10)

Formulas estructurales
Formulas estructuralesFormulas estructurales
Formulas estructurales
 
Jamie Naik CV
Jamie Naik CVJamie Naik CV
Jamie Naik CV
 
Aesthetics Shows
Aesthetics ShowsAesthetics Shows
Aesthetics Shows
 
Curriculum vitae English.doc
Curriculum vitae English.docCurriculum vitae English.doc
Curriculum vitae English.doc
 
Quartiere Piazzo: il progetto del M5s Biella
Quartiere Piazzo: il progetto del M5s BiellaQuartiere Piazzo: il progetto del M5s Biella
Quartiere Piazzo: il progetto del M5s Biella
 
Who is who around the table presentation
Who is who around the table   presentationWho is who around the table   presentation
Who is who around the table presentation
 
Grant Proposal
Grant ProposalGrant Proposal
Grant Proposal
 
Datapoint Systems 04-02-2016
Datapoint Systems 04-02-2016Datapoint Systems 04-02-2016
Datapoint Systems 04-02-2016
 
Клинические испытания в гомеопатии: аргументы для скептиков
Клинические испытания в гомеопатии: аргументы для  скептиковКлинические испытания в гомеопатии: аргументы для  скептиков
Клинические испытания в гомеопатии: аргументы для скептиков
 
Заболевания новорожденных
Заболевания новорожденныхЗаболевания новорожденных
Заболевания новорожденных
 

Similar to Crowdsourcing Predictors of Behavioral Outcomes

JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...
JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...
JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...
IEEEGLOBALSOFTTECHNOLOGIES
 
Behavioural Modelling Outcomes prediction using Casual Factors
Behavioural Modelling Outcomes prediction using Casual  FactorsBehavioural Modelling Outcomes prediction using Casual  Factors
Behavioural Modelling Outcomes prediction using Casual Factors
IJMER
 
IRJET- Analysis of Question and Answering Recommendation System
IRJET-  	  Analysis of Question and Answering Recommendation SystemIRJET-  	  Analysis of Question and Answering Recommendation System
IRJET- Analysis of Question and Answering Recommendation System
IRJET Journal
 
Student Assestment Questionnaire
Student Assestment QuestionnaireStudent Assestment Questionnaire
Student Assestment QuestionnaireMichael Mendoza
 
NEXT- A System for Real-World Development, Evaluation, and Application of Act...
NEXT- A System for Real-World Development, Evaluation, and Application of Act...NEXT- A System for Real-World Development, Evaluation, and Application of Act...
NEXT- A System for Real-World Development, Evaluation, and Application of Act...Nicholas Glattard
 
An effective search on web log from most popular downloaded content
An effective search on web log from most popular downloaded contentAn effective search on web log from most popular downloaded content
An effective search on web log from most popular downloaded content
ijdpsjournal
 
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...Sc Huang
 
Open domain question answering system using semantic role labeling
Open domain question answering system using semantic role labelingOpen domain question answering system using semantic role labeling
Open domain question answering system using semantic role labeling
eSAT Publishing House
 
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...
Dr. Cornelius Ludmann
 
Preference Elicitation Interface
Preference Elicitation InterfacePreference Elicitation Interface
Preference Elicitation Interface
晓愚 孟
 
Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement
Roi Blanco
 
Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...
Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...
Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...
Waqas Tariq
 
Learning-Based Evaluation of Visual Analytic Systems.
Learning-Based Evaluation of Visual Analytic Systems.Learning-Based Evaluation of Visual Analytic Systems.
Learning-Based Evaluation of Visual Analytic Systems.
BELIV Workshop
 
IRJET- Automated Exam Question Generator using Genetic Algorithm
IRJET-  	  Automated Exam Question Generator using Genetic AlgorithmIRJET-  	  Automated Exam Question Generator using Genetic Algorithm
IRJET- Automated Exam Question Generator using Genetic Algorithm
IRJET Journal
 
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
20211a05p7
 
Thesis
ThesisThesis
Thesis
Lashanda83
 
Forecasting movie rating using k-nearest neighbor based collaborative filtering
Forecasting movie rating using k-nearest neighbor based  collaborative filteringForecasting movie rating using k-nearest neighbor based  collaborative filtering
Forecasting movie rating using k-nearest neighbor based collaborative filtering
IJECEIAES
 
Measuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systemsMeasuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systems
Amit Sharma
 

Similar to Crowdsourcing Predictors of Behavioral Outcomes (20)

JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...
JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...
JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...
 
Behavioural Modelling Outcomes prediction using Casual Factors
Behavioural Modelling Outcomes prediction using Casual  FactorsBehavioural Modelling Outcomes prediction using Casual  Factors
Behavioural Modelling Outcomes prediction using Casual Factors
 
IRJET- Analysis of Question and Answering Recommendation System
IRJET-  	  Analysis of Question and Answering Recommendation SystemIRJET-  	  Analysis of Question and Answering Recommendation System
IRJET- Analysis of Question and Answering Recommendation System
 
Student Assestment Questionnaire
Student Assestment QuestionnaireStudent Assestment Questionnaire
Student Assestment Questionnaire
 
NEXT- A System for Real-World Development, Evaluation, and Application of Act...
NEXT- A System for Real-World Development, Evaluation, and Application of Act...NEXT- A System for Real-World Development, Evaluation, and Application of Act...
NEXT- A System for Real-World Development, Evaluation, and Application of Act...
 
De carlo rizk 2010 icelw
De carlo rizk 2010 icelwDe carlo rizk 2010 icelw
De carlo rizk 2010 icelw
 
An effective search on web log from most popular downloaded content
An effective search on web log from most popular downloaded contentAn effective search on web log from most popular downloaded content
An effective search on web log from most popular downloaded content
 
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
 
Open domain question answering system using semantic role labeling
Open domain question answering system using semantic role labelingOpen domain question answering system using semantic role labeling
Open domain question answering system using semantic role labeling
 
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...
Continuous Evaluation of Collaborative Recommender Systems in Data Stream Man...
 
Preference Elicitation Interface
Preference Elicitation InterfacePreference Elicitation Interface
Preference Elicitation Interface
 
Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement
 
Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...
Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...
Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...
 
Learning-Based Evaluation of Visual Analytic Systems.
Learning-Based Evaluation of Visual Analytic Systems.Learning-Based Evaluation of Visual Analytic Systems.
Learning-Based Evaluation of Visual Analytic Systems.
 
final
finalfinal
final
 
IRJET- Automated Exam Question Generator using Genetic Algorithm
IRJET-  	  Automated Exam Question Generator using Genetic AlgorithmIRJET-  	  Automated Exam Question Generator using Genetic Algorithm
IRJET- Automated Exam Question Generator using Genetic Algorithm
 
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
 
Thesis
ThesisThesis
Thesis
 
Forecasting movie rating using k-nearest neighbor based collaborative filtering
Forecasting movie rating using k-nearest neighbor based  collaborative filteringForecasting movie rating using k-nearest neighbor based  collaborative filtering
Forecasting movie rating using k-nearest neighbor based collaborative filtering
 
Measuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systemsMeasuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systems
 

Crowdsourcing Predictors of Behavioral Outcomes

  • 1. Crowdsourcing Predictors of Behavioral Outcomes Presented by Alekya.Yermal(Lead) (10NJ1A0502) I.Aiysha Sham (10NJ1A0514) S.M.V.N.Sowndarya(10NJ1A0537) V.Padmavathi (10NJ1A0542) Under the guidance of Mrs.D.Usha Rajeswari
  • 3. Retrieving bulk of data from crowd
  • 4. Introduction Text ext Text Text Text •Generating models from large data sets—and determining which subsets of data to mine —is becoming increasingly automated. • This was accomplished by building a Web platform in which human groups interact to both respond to questions likely to help predict a behavioral outcome and pose new questions to their peers. • Result- dynamically growing online survey, this behavior also leads to models that can predict the user’s outcomes based on their responses to the user-generated survey questions. • Example: 1) Predict users’ monthly electric energy consumption. 2) Predict users’ body mass index.
  • 5. Existing System • Statistical tools such as multiple regression or neural networks provide mature methods for computing model parameters when the set of predictive covariates and the model structure are pre-specified. • The task of choosing which potentially predictive variables to study is largely a qualitative task that requires substantial domain expertise. •Example: 1) A survey designer must have domain expertise to choose questions that will identify predictive covariates. 2) An engineer must develop substantial familiarity with a design in order to determine which variables can be systematically adjusted in order to optimize performance.
  • 6. Disadvantages of existing system • There are many problems in which one seeks to develop predictive models to map between a set of predictor variables and an outcome. • One aspect of the scientific method that has not yet yielded to automation is the selection of variables for which data should be collected to evaluate hypotheses. • In the case of a prediction problem, machine science is not yet able to select the independent variables that might predict an outcome of interest, and for which data collection is required.
  • 7. Proposed System TEXT TEXT TEXT TEXT • The goal of this research is to test an alternative approach to model in which the wisdom of crowds is harnessed to both propose which potentially predictive variables to study by asking questions and to respond to those questions, in order to develop a predictive model. • This paper introduces, for the first time, a method by which non-domain experts can be motivated to formulate independent variables as well as populate enough of these variables for successful modeling. • Users arrive at a Web site in which a behavioral outcome is to be modeled. Users provide their own outcome and then answer questions that may be predictive of that outcome . • Periodically, models are constructed against the growing data set that predict each user’s behavioral outcome. Users may also pose their own questions that, when answered by other users, become new independent variables in the modeling process.
  • 8. Advantages of proposed system • Participants successfully uncovered at least one statistically significant predictor of the outcome variable. • For the BMI outcome, the participants successfully formulated many of the correlates known to predict BMI and provided sufficiently honest values for those correlates to become predictive during the experiment. • While, our instantiations focus on energy and BMI, the proposed method is general and might, as the method improves, be useful to answer many difficult questions regarding why some outcomes are different than others.
  • 9. SOFTWARE CONFIGURATION Operating System : Windows 7, 8. Coding Language: JAVA/JSP Front End design: HTML, CSS, JavaScript Database : MYSQL Database Connectivity: JDBC Server: Apache Tomcat V.7
  • 10. Hardware requirements TEXT TEXTTEXT TEXT Processor - Pentium –IV Speed - 1.1 GHz RAM - 256 MB(min) Hard Disk - 20 GB Key Board - Standard Windows Keyboard Monitor - SVGA
  • 11. Main Modules:- 1. Investigator Behavior 2. User Behavior 3. Model Behavior
  • 12. Investigator Behavior The investigator is responsible for initially creating the web platform, and seeding it with a starting question. Then, as the experiment runs they filter new survey questions generated by the users.  However, once posed, the question was filtered by the investigator as to its suitability . A question was deemed unsuitable if any of the following conditions were met: (1) the question revealed the identity of its author (e.g. “Hi, I am John Doe. I would like to know if...”) thereby contravening the Institutional Review Board approval for these experiments; (2) the question contained profanity or hateful text; (3) the question was inappropriately correlated with the outcome (e.g. “What is your BMI?”).
  • 13. User Behavior  Users who visit the site first provide their individual value for the outcome of interest. Users may then respond to questions found on the site .  Their answers are stored in a common data set and made available to the modeling engine.  At any time a user may elect to pose a question of their own devising. Users could pose questions that required a yes/no response, a five-level Liker rating, or a number. Users were not constrained in what kinds of questions to pose.
  • 14. Model Behavior •The modeling engine continually generates predictive models using the survey questions as candidate predictors of the outcome and users’ responses as the training data.
  • 27. User creates quiz question
  • 28. User creates poll question
  • 34. User submits poll question
  • 39. Conclusion This paper introduced a new approach to social science modeling in which the participants themselves are motivated to uncover the correlates of some human behavior outcome, such as homeowner electricity usage or body mass index. In both cases participants successfully uncovered at least one statistically significant predictor of the outcome variable. The main goal of this paper is to demonstrate a system that enables non domain experts to collectively formulate many of the known (and possibly unknown) predictors of a behavioral outcome, and that this system is independent of the outcome of interest.