SlideShare a Scribd company logo
1 of 23
Download to read offline
Data Mining
● "Data mining" refers to any set of software
tools or automated processes that can
access data from one or more databases,
and present them in a way that highlights
previously unknown relationships and
patterns.
● Data mining itself is not new. It has been
used for years in academic, government, and
market research in areas such as risk
Who will Mine the Election Data?
The answer is explicit ,
● Election commission
● Political parties
● Media
Where does Data Mining
technology enter in?
● There have always been voter roles, information
stored in databases and you could always break
down by census tract or by community or by city.
● The technology is now allowing parties to do
something which it has never been able to do
● take incompatible data types from all these different databases
and actually put data together so that now they have a much
more clear view
● who the people are who are the best, most likely targets for the
door to door outreach efforts.
Election commission
Election commission (EC)
● Mine the data base,
● To minimize the number of polling booths.
● As a result of mining the data The Election commission of
India was able to cut down the figures,
state No of booths in 1999 No in 2004
TN 54,847 45,729
● To Improve the security in the polling booths by looking at the
past history, where polling was not peaceful (data base may
contain info about booths where repolling was ordered).
EC (contd.,)
● look for patterns that imply a poor polling
percentage (such as terrorist threats etc.,)
● To find patterns that cause the number of
contestants in a particular constituency to be high
(to minimize expenses).
● If patterns indicate that the number contestants in a
constituency is always maximum, they can try to set
a higher deposit amount for that constituency.
Percentage of polling in assembly
elections in India over the past years
year male female total
1962 63.31 46.63 55.42
1967 66.73 55.48 61.33
1971 60.90 49.11 55.29
1977 65.63 54.91 60.49
1980 62.16 51.22 56.62
1984 68.18 58.60 63.56
1989 66.13 57.32 61.95
1991 61.58 51.35 56.93
1996 62.06 53.41 57.94
1998 63.88 57.88 60.88
1999 63.97 55.64 59.99
Visualizing the results
DATA
FOCUSED DATA SUBSET
PREPROCESSED and FORMATTED
DATA
PREDICITIVE MODELS
KNOWLEDGE
Selection
Preprocessing and
Transformation
Data Mining
Human
Interpretation
The Technology-centric view of the data mining
process
year expenditure
1996 597344100
1998 6662216000
1999 8800000000
2004 13000000000
Selection
Preprocessing and
Transformation
MODEL1
MODEL2
Data
Mining
Human
Interpretation
KNOWLEDG
DATAB
ASE
Data
Political Parties
Politics and Data Mining
● In the political sphere, data mining technologies
are useful in,
○ door-to-door canvassing strategy.
○ helping to map out an efficient and effective
mail.
○ To enable campaigns to customize and
personalize messages down to specific
households with great ease.
How are they doing that?
More particularly, there are 4-5 states that they think
will be decisive in the election because the
electorate is so polarized right now and there are
such a small number of undecided voters out there.
Finding who those voters are really matters a lot.
What Political parties do
with the data ……..??
● In elections to the four state assemblies (MP,Delhi,
chattisgar,Rajasthan) conducted a year back, BJP used
an elaborate data analysis. (now they are ruling in three
states!!!!).
● Data analysis is to target messages to specific groups
based on castes ,age, income and profession .
● Data analysis essential when we look for voting patterns
in prior elections.
● Political parties may follow the Benchmarking process
to improve their results.
4. Set
improvement
goals
2. Identify
the best
performers
3. Collect and
analyze data
to identify
gaps
7. Repeat
evaluations
6.
Evaluate
results
5. Develop and
implement plans to
close gaps
The Benchmarking Process
1. Define
the
domain
Data mining Methodology
● The methodology used today in mining consists of just a
few very important concepts.
They are,
○ Finding a pattern in the data.
○ Validating the predictive models.
What is a Pattern?
● Consider the simple problem of trying to determine
the next number in the following sequence:
1212121….?,because the pattern “12” is found often
enough ,you have some confidence in the predictive
model that says “if 1,then 2 will follow.”
● So Pattern is an event or combination of events in a
database that occurs more often than expected.
What is a Model?
● Model is one that can be successfully applied to
a new data in order to make predictions about
missing values or to make statements about
expected values.
● There may not be crisp dividing line between
pattern and a model (in the number sequence
example, the pattern “12” was also the model),
in general pattern are driven by data, whereas a
model generally reflects a purpose and may not
be driven by data.
Picking the best model
Media group NDA+ Cong+ Others
Sahara 263-278 92-102 171-181
Star 263-275 174-186 86-98
Aaj Tak 248 189 105
Zee 249 117 176
NDTV 230-250 190-205 100-130
Opinion polls conducted by various media groups
Picking the best model (Contd.,)
● But the actual results
were,
NDA+ 185
Cong+ 220
Others 137
Media group NDA+ Cong+ Others
Sahara 263-278 92-102 171-181
Star 263-275 174-186 86-98
Aaj Tak 248 189 105
Zee 249 117 176
NDTV 230-250 190-205 100-130
(The predictive model used by NDTV is some what closer to the actual
results)
Media
● Mine the data
● To predict what will happen, Which party will win.
● To make opinion polls and exit polls effective.
Data Mining of Elections 2004
● Most regional parties lost nearly 95% outside
their states.
● Those who losing deposits (on account of less
than 5% of votes) include,
BJP -27%
Cong -30%
CPM -58%
CPI -85%.
Thank you

More Related Content

Similar to Data mining

Using Data-Mining Technique for Census Analysis to Give Geo-Spatial Distribut...
Using Data-Mining Technique for Census Analysis to Give Geo-Spatial Distribut...Using Data-Mining Technique for Census Analysis to Give Geo-Spatial Distribut...
Using Data-Mining Technique for Census Analysis to Give Geo-Spatial Distribut...IOSR Journals
 
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...raihansikdar
 
[DSC Europe 23] Alen Kisic - How can do Facebook data and machine learning al...
[DSC Europe 23] Alen Kisic - How can do Facebook data and machine learning al...[DSC Europe 23] Alen Kisic - How can do Facebook data and machine learning al...
[DSC Europe 23] Alen Kisic - How can do Facebook data and machine learning al...DataScienceConferenc1
 
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...Coert Du Plessis (杜康)
 
Regression and correlation
Regression and correlationRegression and correlation
Regression and correlationVrushaliSolanke
 
MGMT205 International Business.docx
MGMT205 International Business.docxMGMT205 International Business.docx
MGMT205 International Business.docxstirlingvwriters
 
stock price prediction using sentiment analysis
stock price prediction using sentiment analysisstock price prediction using sentiment analysis
stock price prediction using sentiment analysisSurbhiSharma889936
 
Market Research OverviewWhat is the idea of using marketing re.docx
Market Research OverviewWhat is the idea of using marketing re.docxMarket Research OverviewWhat is the idea of using marketing re.docx
Market Research OverviewWhat is the idea of using marketing re.docxdesteinbrook
 
Using systemic models in games and simulations for participatory planning
Using systemic models in games and simulations for participatory planningUsing systemic models in games and simulations for participatory planning
Using systemic models in games and simulations for participatory planningSystemic Design Association (SDA)
 
GSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-EditionGSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-EditionGang Li
 
Opportunities and methodological challenges of Big Data for official statist...
Opportunities and methodological challenges of  Big Data for official statist...Opportunities and methodological challenges of  Big Data for official statist...
Opportunities and methodological challenges of Big Data for official statist...Piet J.H. Daas
 
Campaign Sciences Analytics White Paper
Campaign Sciences Analytics White PaperCampaign Sciences Analytics White Paper
Campaign Sciences Analytics White PaperWPA Intelligence
 
Revolution in Election System Using Big Data Analytics
Revolution in Election System Using Big Data AnalyticsRevolution in Election System Using Big Data Analytics
Revolution in Election System Using Big Data AnalyticsMphasis
 
Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...
Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...
Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...Ferdin Joe John Joseph PhD
 
Online Survey Tool.pdf
Online Survey Tool.pdfOnline Survey Tool.pdf
Online Survey Tool.pdfssuser353e3f
 
Real Life Machine Learning Case on Mobile Advertisement
Real Life Machine Learning Case on Mobile AdvertisementReal Life Machine Learning Case on Mobile Advertisement
Real Life Machine Learning Case on Mobile Advertisementşadi şeker
 
Overview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and FoundationsOverview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and FoundationsNUS-ISS
 

Similar to Data mining (20)

Using Data-Mining Technique for Census Analysis to Give Geo-Spatial Distribut...
Using Data-Mining Technique for Census Analysis to Give Geo-Spatial Distribut...Using Data-Mining Technique for Census Analysis to Give Geo-Spatial Distribut...
Using Data-Mining Technique for Census Analysis to Give Geo-Spatial Distribut...
 
New Age Of Polling
New Age Of PollingNew Age Of Polling
New Age Of Polling
 
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...
Data mining-implementation-to-predict-sales-using-time-series-method By Raiha...
 
[DSC Europe 23] Alen Kisic - How can do Facebook data and machine learning al...
[DSC Europe 23] Alen Kisic - How can do Facebook data and machine learning al...[DSC Europe 23] Alen Kisic - How can do Facebook data and machine learning al...
[DSC Europe 23] Alen Kisic - How can do Facebook data and machine learning al...
 
Demonitization data
Demonitization dataDemonitization data
Demonitization data
 
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...
 
Regression and correlation
Regression and correlationRegression and correlation
Regression and correlation
 
MGMT205 International Business.docx
MGMT205 International Business.docxMGMT205 International Business.docx
MGMT205 International Business.docx
 
stock price prediction using sentiment analysis
stock price prediction using sentiment analysisstock price prediction using sentiment analysis
stock price prediction using sentiment analysis
 
Market Research OverviewWhat is the idea of using marketing re.docx
Market Research OverviewWhat is the idea of using marketing re.docxMarket Research OverviewWhat is the idea of using marketing re.docx
Market Research OverviewWhat is the idea of using marketing re.docx
 
Using systemic models in games and simulations for participatory planning
Using systemic models in games and simulations for participatory planningUsing systemic models in games and simulations for participatory planning
Using systemic models in games and simulations for participatory planning
 
5 data analysis approaches dr. hueihsia holloman
5 data analysis approaches dr. hueihsia holloman5 data analysis approaches dr. hueihsia holloman
5 data analysis approaches dr. hueihsia holloman
 
GSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-EditionGSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-Edition
 
Opportunities and methodological challenges of Big Data for official statist...
Opportunities and methodological challenges of  Big Data for official statist...Opportunities and methodological challenges of  Big Data for official statist...
Opportunities and methodological challenges of Big Data for official statist...
 
Campaign Sciences Analytics White Paper
Campaign Sciences Analytics White PaperCampaign Sciences Analytics White Paper
Campaign Sciences Analytics White Paper
 
Revolution in Election System Using Big Data Analytics
Revolution in Election System Using Big Data AnalyticsRevolution in Election System Using Big Data Analytics
Revolution in Election System Using Big Data Analytics
 
Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...
Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...
Twitter Based Outcome Predictions of 2019 Indian General Elections Using Deci...
 
Online Survey Tool.pdf
Online Survey Tool.pdfOnline Survey Tool.pdf
Online Survey Tool.pdf
 
Real Life Machine Learning Case on Mobile Advertisement
Real Life Machine Learning Case on Mobile AdvertisementReal Life Machine Learning Case on Mobile Advertisement
Real Life Machine Learning Case on Mobile Advertisement
 
Overview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and FoundationsOverview of Data and Analytics Essentials and Foundations
Overview of Data and Analytics Essentials and Foundations
 

Recently uploaded

B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 

Recently uploaded (20)

B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 

Data mining

  • 1. Data Mining ● "Data mining" refers to any set of software tools or automated processes that can access data from one or more databases, and present them in a way that highlights previously unknown relationships and patterns. ● Data mining itself is not new. It has been used for years in academic, government, and market research in areas such as risk
  • 2. Who will Mine the Election Data? The answer is explicit , ● Election commission ● Political parties ● Media
  • 3. Where does Data Mining technology enter in? ● There have always been voter roles, information stored in databases and you could always break down by census tract or by community or by city. ● The technology is now allowing parties to do something which it has never been able to do ● take incompatible data types from all these different databases and actually put data together so that now they have a much more clear view ● who the people are who are the best, most likely targets for the door to door outreach efforts.
  • 5. Election commission (EC) ● Mine the data base, ● To minimize the number of polling booths. ● As a result of mining the data The Election commission of India was able to cut down the figures, state No of booths in 1999 No in 2004 TN 54,847 45,729 ● To Improve the security in the polling booths by looking at the past history, where polling was not peaceful (data base may contain info about booths where repolling was ordered).
  • 6. EC (contd.,) ● look for patterns that imply a poor polling percentage (such as terrorist threats etc.,) ● To find patterns that cause the number of contestants in a particular constituency to be high (to minimize expenses). ● If patterns indicate that the number contestants in a constituency is always maximum, they can try to set a higher deposit amount for that constituency.
  • 7. Percentage of polling in assembly elections in India over the past years year male female total 1962 63.31 46.63 55.42 1967 66.73 55.48 61.33 1971 60.90 49.11 55.29 1977 65.63 54.91 60.49 1980 62.16 51.22 56.62 1984 68.18 58.60 63.56 1989 66.13 57.32 61.95 1991 61.58 51.35 56.93 1996 62.06 53.41 57.94 1998 63.88 57.88 60.88 1999 63.97 55.64 59.99
  • 9. DATA FOCUSED DATA SUBSET PREPROCESSED and FORMATTED DATA PREDICITIVE MODELS KNOWLEDGE Selection Preprocessing and Transformation Data Mining Human Interpretation The Technology-centric view of the data mining process
  • 10. year expenditure 1996 597344100 1998 6662216000 1999 8800000000 2004 13000000000 Selection Preprocessing and Transformation MODEL1 MODEL2 Data Mining Human Interpretation KNOWLEDG DATAB ASE Data
  • 12. Politics and Data Mining ● In the political sphere, data mining technologies are useful in, ○ door-to-door canvassing strategy. ○ helping to map out an efficient and effective mail. ○ To enable campaigns to customize and personalize messages down to specific households with great ease.
  • 13. How are they doing that? More particularly, there are 4-5 states that they think will be decisive in the election because the electorate is so polarized right now and there are such a small number of undecided voters out there. Finding who those voters are really matters a lot.
  • 14. What Political parties do with the data ……..?? ● In elections to the four state assemblies (MP,Delhi, chattisgar,Rajasthan) conducted a year back, BJP used an elaborate data analysis. (now they are ruling in three states!!!!). ● Data analysis is to target messages to specific groups based on castes ,age, income and profession . ● Data analysis essential when we look for voting patterns in prior elections. ● Political parties may follow the Benchmarking process to improve their results.
  • 15. 4. Set improvement goals 2. Identify the best performers 3. Collect and analyze data to identify gaps 7. Repeat evaluations 6. Evaluate results 5. Develop and implement plans to close gaps The Benchmarking Process 1. Define the domain
  • 16. Data mining Methodology ● The methodology used today in mining consists of just a few very important concepts. They are, ○ Finding a pattern in the data. ○ Validating the predictive models.
  • 17. What is a Pattern? ● Consider the simple problem of trying to determine the next number in the following sequence: 1212121….?,because the pattern “12” is found often enough ,you have some confidence in the predictive model that says “if 1,then 2 will follow.” ● So Pattern is an event or combination of events in a database that occurs more often than expected.
  • 18. What is a Model? ● Model is one that can be successfully applied to a new data in order to make predictions about missing values or to make statements about expected values. ● There may not be crisp dividing line between pattern and a model (in the number sequence example, the pattern “12” was also the model), in general pattern are driven by data, whereas a model generally reflects a purpose and may not be driven by data.
  • 19. Picking the best model Media group NDA+ Cong+ Others Sahara 263-278 92-102 171-181 Star 263-275 174-186 86-98 Aaj Tak 248 189 105 Zee 249 117 176 NDTV 230-250 190-205 100-130 Opinion polls conducted by various media groups
  • 20. Picking the best model (Contd.,) ● But the actual results were, NDA+ 185 Cong+ 220 Others 137 Media group NDA+ Cong+ Others Sahara 263-278 92-102 171-181 Star 263-275 174-186 86-98 Aaj Tak 248 189 105 Zee 249 117 176 NDTV 230-250 190-205 100-130 (The predictive model used by NDTV is some what closer to the actual results)
  • 21. Media ● Mine the data ● To predict what will happen, Which party will win. ● To make opinion polls and exit polls effective.
  • 22. Data Mining of Elections 2004 ● Most regional parties lost nearly 95% outside their states. ● Those who losing deposits (on account of less than 5% of votes) include, BJP -27% Cong -30% CPM -58% CPI -85%.