SlideShare a Scribd company logo
1 of 10
International Journal of Artificial Intelligence &
Applications (IJAIA)
http://www.airccse.org/journal/ijaia/ijaia.html
ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)
ISSN : 0975-900X ( Online ); 0976-2191 (Print)
ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)
November 2018, Volume 9, Number 6
Utilizing Imbalanced Data and Classification
Cost Matrix to Predict Movie Preferences
Haifeng Wang,
Penn State University New Kensington, USA.
ABSTRACT
• In this paper, we propose a movie genre recommendation
system based on imbalanced survey data and unequal
classification costs for small and medium-sized enterprises
(SMEs) who need a data-based and analytical approach to
stock favored movies and target marketing to young people..
 These predictors do not include movies’ information such as
actors or directors. The paper applies Gentle boost, Adaboost
and Bagged tree ensembles as well as SVM machine learning
algorithms to learn classification from one thousand
observations and predict movie genre preferences with adjusted
classification costs. The proposed recommendation system also
selects important predictors.
1. INTRODUCTION
• In this paper, we propose a movie genre recommendation system
based on imbalanced survey data and unequal classification costs for
small and medium-sized enterprises (SMEs) who need a data-based
and analytical approach to stock favored movies and target
marketing to young people.
 These predictors do not include movies’ information such as actors
or directors. The paper applies Gentle boost, Adaboost and Bagged
tree ensembles as well as SVM machine learning algorithms to learn
classification from one thousand observations and predict movie
genre preferences with adjusted classification costs. The proposed
recommendation system also selects important predictors.
CONCLUSION
• Large corporations such as Netflix and Redbox have gained a great
competitive edge from collecting and analyzing customer demographic
information to find customer interest and target market segments. Many
small-medium entities (SMEs) in physical movie rental industry also need
localized data-based analytical and recommendation systems to determine
which targeted customer segments are most likely to buy or rent specific
genres of movies.
 Finally, in comparison with other state-of-art prediction models, the
proposed prediction model has satisfied prediction power. Therefore the
proposed machine learning algorithm allows us to overcome shortcomings
of traditional massive dataset prerequisite and can be more flexible applied
based on the misclassification cost. We will continue to improve the
algorithm and enrich the dataset because the small dataset is not enough to
predict the current mega-trends in which demographics and personal
attributes contribute to individual movie genre preference across the
country.
International Journal of Artificial Intelligence
and Applications (IJAIA)
http://www.airccse.org/journal/ijaia/ijaia.html
For More Details : http://aircconline.com/ijaia/V9N6/9618ijaia01.pdf

More Related Content

Similar to Utilizing Imbalanced Data and Classification Cost Matrix to Predict Movie Preferences

Supervision Reporting Service
Supervision Reporting ServiceSupervision Reporting Service
Supervision Reporting Service
m0gabr79
 

Similar to Utilizing Imbalanced Data and Classification Cost Matrix to Predict Movie Preferences (20)

Meetup Niort Data - AWS Intelligence Artificielle
Meetup Niort Data - AWS Intelligence ArtificielleMeetup Niort Data - AWS Intelligence Artificielle
Meetup Niort Data - AWS Intelligence Artificielle
 
Big Data Analytics for Commercial aviation and Aerospace
Big Data Analytics for Commercial aviation and AerospaceBig Data Analytics for Commercial aviation and Aerospace
Big Data Analytics for Commercial aviation and Aerospace
 
Heed the Agile Commerce Imperative Keeping Pace in a Rapidly Evolving World
Heed the Agile Commerce Imperative Keeping Pace in a Rapidly Evolving WorldHeed the Agile Commerce Imperative Keeping Pace in a Rapidly Evolving World
Heed the Agile Commerce Imperative Keeping Pace in a Rapidly Evolving World
 
SplunkLive! Zurich 2017 - Advanced Analytics / Machine Learning
SplunkLive! Zurich 2017 - Advanced Analytics / Machine LearningSplunkLive! Zurich 2017 - Advanced Analytics / Machine Learning
SplunkLive! Zurich 2017 - Advanced Analytics / Machine Learning
 
Machine Learning: Decision Trees
Machine Learning: Decision TreesMachine Learning: Decision Trees
Machine Learning: Decision Trees
 
Analog Devices ADIS16460 6-axis MEMS Inertial Sensor
Analog Devices ADIS16460 6-axis MEMS Inertial SensorAnalog Devices ADIS16460 6-axis MEMS Inertial Sensor
Analog Devices ADIS16460 6-axis MEMS Inertial Sensor
 
Mobile Bang Theory - Gartner Mobile & Wireless 2009
Mobile Bang Theory - Gartner Mobile & Wireless 2009Mobile Bang Theory - Gartner Mobile & Wireless 2009
Mobile Bang Theory - Gartner Mobile & Wireless 2009
 
Japan adtech industry report 2014
Japan adtech industry report 2014Japan adtech industry report 2014
Japan adtech industry report 2014
 
Price cast fuel product folder
Price cast fuel product folderPrice cast fuel product folder
Price cast fuel product folder
 
Supervision Reporting Service
Supervision Reporting ServiceSupervision Reporting Service
Supervision Reporting Service
 
MSXI Global Benchmarker July_2017
MSXI Global Benchmarker July_2017MSXI Global Benchmarker July_2017
MSXI Global Benchmarker July_2017
 
Testing the Next Generation of Technologies: IoT, Mobile, and Cloud … Oh My!
Testing the Next Generation of Technologies: IoT, Mobile, and Cloud … Oh My!Testing the Next Generation of Technologies: IoT, Mobile, and Cloud … Oh My!
Testing the Next Generation of Technologies: IoT, Mobile, and Cloud … Oh My!
 
AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...
AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...
AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...
 
NEW LAUNCH! Amazon Neptune Overview and Customer Use Cases - DAT319 - re:Inve...
NEW LAUNCH! Amazon Neptune Overview and Customer Use Cases - DAT319 - re:Inve...NEW LAUNCH! Amazon Neptune Overview and Customer Use Cases - DAT319 - re:Inve...
NEW LAUNCH! Amazon Neptune Overview and Customer Use Cases - DAT319 - re:Inve...
 
Cango Technologies awarded by Frost and Sullivan
Cango Technologies awarded by Frost and SullivanCango Technologies awarded by Frost and Sullivan
Cango Technologies awarded by Frost and Sullivan
 
Software Defined Operations Research Presentation
Software Defined Operations Research PresentationSoftware Defined Operations Research Presentation
Software Defined Operations Research Presentation
 
Driving the Data Pipelines for Connected Vehicles with Spring Cloud Data Flow
Driving the Data Pipelines for Connected Vehicles with Spring Cloud Data FlowDriving the Data Pipelines for Connected Vehicles with Spring Cloud Data Flow
Driving the Data Pipelines for Connected Vehicles with Spring Cloud Data Flow
 
Digital transformation on aws
Digital transformation on awsDigital transformation on aws
Digital transformation on aws
 
Microservices: The Building Blocks for a Digital Future
Microservices: The Building Blocks for a Digital FutureMicroservices: The Building Blocks for a Digital Future
Microservices: The Building Blocks for a Digital Future
 
[Big] Data For Marketers: Targeting the Right Market
[Big] Data For Marketers: Targeting the Right Market[Big] Data For Marketers: Targeting the Right Market
[Big] Data For Marketers: Targeting the Right Market
 

More from gerogepatton

A Machine Learning Ensemble Model for the Detection of Cyberbullying
A Machine Learning Ensemble Model for the Detection of CyberbullyingA Machine Learning Ensemble Model for the Detection of Cyberbullying
A Machine Learning Ensemble Model for the Detection of Cyberbullying
gerogepatton
 
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...
gerogepatton
 

More from gerogepatton (20)

10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
THE TRANSFORMATION RISK-BENEFIT MODEL OF ARTIFICIAL INTELLIGENCE:BALANCING RI...
THE TRANSFORMATION RISK-BENEFIT MODEL OF ARTIFICIAL INTELLIGENCE:BALANCING RI...THE TRANSFORMATION RISK-BENEFIT MODEL OF ARTIFICIAL INTELLIGENCE:BALANCING RI...
THE TRANSFORMATION RISK-BENEFIT MODEL OF ARTIFICIAL INTELLIGENCE:BALANCING RI...
 
13th International Conference on Software Engineering and Applications (SEA 2...
13th International Conference on Software Engineering and Applications (SEA 2...13th International Conference on Software Engineering and Applications (SEA 2...
13th International Conference on Software Engineering and Applications (SEA 2...
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
AN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATION
AN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATIONAN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATION
AN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATION
 
10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
A Machine Learning Ensemble Model for the Detection of Cyberbullying
A Machine Learning Ensemble Model for the Detection of CyberbullyingA Machine Learning Ensemble Model for the Detection of Cyberbullying
A Machine Learning Ensemble Model for the Detection of Cyberbullying
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
10th International Conference on Data Mining (DaMi 2024)
10th International Conference on Data Mining (DaMi 2024)10th International Conference on Data Mining (DaMi 2024)
10th International Conference on Data Mining (DaMi 2024)
 
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...
 
WAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATION
WAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATIONWAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATION
WAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATION
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...
 
10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...
 
The International Journal of Artificial Intelligence & Applications (IJAIA)
The International Journal of Artificial Intelligence & Applications (IJAIA)The International Journal of Artificial Intelligence & Applications (IJAIA)
The International Journal of Artificial Intelligence & Applications (IJAIA)
 
Foundations of ANNs: Tolstoy’s Genius Explored using Transformer Architecture
Foundations of ANNs: Tolstoy’s Genius Explored using Transformer ArchitectureFoundations of ANNs: Tolstoy’s Genius Explored using Transformer Architecture
Foundations of ANNs: Tolstoy’s Genius Explored using Transformer Architecture
 
2nd International Conference on Computer Science, Engineering and Artificial ...
2nd International Conference on Computer Science, Engineering and Artificial ...2nd International Conference on Computer Science, Engineering and Artificial ...
2nd International Conference on Computer Science, Engineering and Artificial ...
 

Recently uploaded

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
SanaAli374401
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
MateoGardella
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Recently uploaded (20)

PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 

Utilizing Imbalanced Data and Classification Cost Matrix to Predict Movie Preferences

  • 1. International Journal of Artificial Intelligence & Applications (IJAIA) http://www.airccse.org/journal/ijaia/ijaia.html ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print) ISSN : 0975-900X ( Online ); 0976-2191 (Print) ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)ISSN : 0975-900X ( Online ); 0976-2191 (Print)
  • 2. November 2018, Volume 9, Number 6 Utilizing Imbalanced Data and Classification Cost Matrix to Predict Movie Preferences Haifeng Wang, Penn State University New Kensington, USA.
  • 3. ABSTRACT • In this paper, we propose a movie genre recommendation system based on imbalanced survey data and unequal classification costs for small and medium-sized enterprises (SMEs) who need a data-based and analytical approach to stock favored movies and target marketing to young people..  These predictors do not include movies’ information such as actors or directors. The paper applies Gentle boost, Adaboost and Bagged tree ensembles as well as SVM machine learning algorithms to learn classification from one thousand observations and predict movie genre preferences with adjusted classification costs. The proposed recommendation system also selects important predictors.
  • 4. 1. INTRODUCTION • In this paper, we propose a movie genre recommendation system based on imbalanced survey data and unequal classification costs for small and medium-sized enterprises (SMEs) who need a data-based and analytical approach to stock favored movies and target marketing to young people.  These predictors do not include movies’ information such as actors or directors. The paper applies Gentle boost, Adaboost and Bagged tree ensembles as well as SVM machine learning algorithms to learn classification from one thousand observations and predict movie genre preferences with adjusted classification costs. The proposed recommendation system also selects important predictors.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9. CONCLUSION • Large corporations such as Netflix and Redbox have gained a great competitive edge from collecting and analyzing customer demographic information to find customer interest and target market segments. Many small-medium entities (SMEs) in physical movie rental industry also need localized data-based analytical and recommendation systems to determine which targeted customer segments are most likely to buy or rent specific genres of movies.  Finally, in comparison with other state-of-art prediction models, the proposed prediction model has satisfied prediction power. Therefore the proposed machine learning algorithm allows us to overcome shortcomings of traditional massive dataset prerequisite and can be more flexible applied based on the misclassification cost. We will continue to improve the algorithm and enrich the dataset because the small dataset is not enough to predict the current mega-trends in which demographics and personal attributes contribute to individual movie genre preference across the country.
  • 10. International Journal of Artificial Intelligence and Applications (IJAIA) http://www.airccse.org/journal/ijaia/ijaia.html For More Details : http://aircconline.com/ijaia/V9N6/9618ijaia01.pdf