SlideShare a Scribd company logo
1 of 28
Download to read offline
Machine learning for fair redistricting
Soledad Villar
Center for Data Science
Courant Institute of Mathematical Sciences
Workshop on Quantitative Gerrymandering
October 9, 2018
A few lines of research
Statistical/computational analysis of random maps.
Compare current maps properties with those of random maps.
Demonstrate mathematically that certain maps are unfair.
A few lines of research
Statistical/computational analysis of random maps.
Compare current maps properties with those of random maps.
Demonstrate mathematically that certain maps are unfair.
Theoretical analysis.
How much can one party gerrymander with constraints.
How different constraints interact.
Effect of geographic distribution on the gerrymandering power.
Computational hardness.
Understanding the limits of the problem.
A few lines of research
Statistical/computational analysis of random maps.
Compare current maps properties with those of random maps.
Demonstrate mathematically that certain maps are unfair.
Theoretical analysis.
How much can one party gerrymander with constraints.
How different constraints interact.
Effect of geographic distribution on the gerrymandering power.
Computational hardness.
Understanding the limits of the problem.
Algorithms for “fairer redistricting”.
Analyzed by simulations or game theory.
Provide a baseline to compare maps.
Provide a way to compute fair maps.
This talk
Hardness result: “fair” redistricting is NP-hard.
Use ideas from k-means clustering in the plane is NP-hard.
Game for fair redistricting.
Goal: train a neural network to learn how to play the game.
Question: other ways machine learning can be useful here?
Fair redistricting is hard
Joint work with Richard Kueng and Dustin Mixon
Redistricting is connected to NP-hard problems. [Altman 1997]
Argument against “automatic redistricting”.
Computational intractability is inherent to the redistricting problem.
Worst-case complexity says very little about real-world maps, but
identifies the limits of the problem.
What performance guarantees are possible for redistricting
algorithms?
Fair maps among legal compliant maps
Compliant maps
all districts have approximately the same population
mild notion of geographic compactness
Fair maps
both parties receive at least some level of representation.
Fair maps among legal compliant maps
Compliant maps
all districts have approximately the same population
mild notion of geographic compactness
Fair maps
both parties receive at least some level of representation.
Theorem
Deciding whether there exists a fair redistricting among compliant
maps is NP-hard.
Proof idea: reduction from planar 3-SAT
Inspired by Mahanjan, Minbhorkar and Varadarajan proof that planar k-means is NP-hard
3-SAT: Deciding whether there exists a boolean assignment that
satisfies a formula of the form:
(¬x1 ∨ x2 ∨ ¬x4) ∧ (¬x2 ∨ ¬x4 ∨ ¬x3) ∧ . . .
Proof idea: reduction from planar 3-SAT
Inspired by Mahanjan, Minbhorkar and Varadarajan proof that planar k-means is NP-hard
3-SAT: Deciding whether there exists a boolean assignment that
satisfies a formula of the form:
(¬x1 ∨ x2 ∨ ¬x4) ∧ (¬x2 ∨ ¬x4 ∨ ¬x3) ∧ . . .
Planar 3-SAT: Consider the bipartite graph: V={variables, clauses}
Proof idea: reduction from planar 3-SAT
Inspired by Mahanjan, Minbhorkar and Varadarajan proof that planar k-means is NP-hard
3-SAT: Deciding whether there exists a boolean assignment that
satisfies a formula of the form:
(¬x1 ∨ x2 ∨ ¬x4) ∧ (¬x2 ∨ ¬x4 ∨ ¬x3) ∧ . . .
Planar 3-SAT: Consider the bipartite graph: V={variables, clauses}
Planar 3-SAT is NP-complete.
Reduction
Every planar 3-SAT instance can be posed as deciding whether there exists a fair
redistricting.
Town Pop D R
Big: L L 0
Small: 2γ
3
L 2γ
3
L 0
Adjacent: L
2
+ γL
6
L
4
L
4
+ γL
6
Edge: L
2
L
2
− γL
4
L
2
+ γL
4
Population per district ∈ [L, L+γ].
D wins at most 2k districts
even with almost half of the vote and Total pop 2k
Formula is satisfiable iff D wins 2k districts.
Ideas illustrate what Jonathan Rodden was saying yesterday.
Ideas illustrate what Jonathan Rodden was saying yesterday.
Next: Algorithms for fairer maps.
Example 1: Shortest-Splitline Algorithm
Idea:
Even number of districts. Split among the shortest line that
divides the population in half.
Odd number of districts. Split among the shortest line that
divides the population in appropriate proportion.
Smith and Ryan, Center for Range Voting, http://www.rangevoting.org/GerryExamples.html
Example 1: Shortest-Splitline Algorithm
Idea:
Even number of districts. Split among the shortest line that
divides the population in half.
Odd number of districts. Split among the shortest line that
divides the population in appropriate proportion.
Example
Smith and Ryan, Center for Range Voting, http://www.rangevoting.org/GerryExamples.html
Example 2: “I-Cut-You-Freeze” protocol
Two political parties sequentially divide up a state:
First player divides a map of a state into the allotted number
of districts, each with equal numbers of voters.
Second player chooses one district to “freeze,” so no further
changes could be made to it, and re-map the remaining
districts as it likes.
Pedgen, Procaccia and Yu, A partisan districting protocol with provably nonpartisan outcomes
“I-Cut-You-Freeze” protocol
Leverages the competition between Republicans and
Democrats to produce an equitable result1.
Each party can pursue a strategy that guarantees it something
that it wants.
Is it possible to produce a protocol that no player has an
advantage even in the finite district setting?
1
When the number of districts goes to infinity no player has an advantage with
respect to the other one
Ongoing work with Dustin Mixon
Game: Two players take turns assigning precincts to districts so
that:
Districts are always connected.
In the end all district have equal population.
In the end districts satisfy some form of compactness.
Theorem
In the symmetric non geometrically constrained setting no player
has an advantage.
Conjecture: players have a strategy that allows them to win the
proportion of seats corresponding with their proportion of voters.
Theorem
In the symmetric non geometrically constrained setting no player
has an advantage.
Conjecture: players have a strategy that allows them to win the
proportion of seats corresponding with their proportion of voters.
Compare to efficiency gap [Stephanopoulos, McGhee]
EG is minimized at doubly proportionality [Bernstein, Duchin].
EG=0 ⇒ 55% of vote share → 60% of seats
Theorem
In the symmetric non geometrically constrained setting no player
has an advantage.
Conjecture: players have a strategy that allows them to win the
proportion of seats corresponding with their proportion of voters.
Compare to efficiency gap [Stephanopoulos, McGhee]
EG is minimized at doubly proportionality [Bernstein, Duchin].
EG=0 ⇒ 55% of vote share → 60% of seats
Is it true for the geometrically constrained case?
Goal: run this game in real maps with machine learning
Reinforcement learning
There exists a function (unknown):
Q : (states, actions) → R
The objective is to maximize the cumulative reward:
R =
∞
t=0
γt
Qt, (γ ∈ [0, 1] discount rate)
Learn a policy π : S × A → [0, 1] where π(s, a) gives the
probability of taking action a while in state s.
Learning how to play
Learning how to play
Let Q be a neural network:
Q(s, a) = ρ(AL(. . . ρ(A1(s, a)) . . .))
Idea:
Train Q through self-play.
Explore the space of states via
Monte-Carlo Tree Search
Difficulties
The game is too large.
Combine large scale strategy with small scale strategy.
Locally valid moves produce maps that will be invalid later.
Ask players for a witness that shows that the map can be
completed.
Difficulties
The game is too large.
Combine large scale strategy with small scale strategy.
Locally valid moves produce maps that will be invalid later.
Ask players for a witness that shows that the map can be
completed.
Challenging for gerrymandering and for machine learning.
Thanks
Fair redistricting is hard
Richard Kueng, Dustin G. Mixon, Soledad Villar
arXiv:1808.08905

More Related Content

Similar to Quantitative Redistricting Workshop - Machine Learning for Fair Redistricting- Soledad Villar, October 9, 2018

Towards controlling evolutionary dynamics through network geometry: some very...
Towards controlling evolutionary dynamics through network geometry: some very...Towards controlling evolutionary dynamics through network geometry: some very...
Towards controlling evolutionary dynamics through network geometry: some very...Kolja Kleineberg
 
An Extension of Downs Model of Political Competition using Fuzzy Logic (Soci...
An Extension of Downs Model of Political Competition using Fuzzy Logic (Soci...An Extension of Downs Model of Political Competition using Fuzzy Logic (Soci...
An Extension of Downs Model of Political Competition using Fuzzy Logic (Soci...Camilo Pechâ Garz
 
Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Frank Nielsen
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Jonathan Stray
 
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks Christopher Morris
 
Reproducibility and differential analysis with selfish
Reproducibility and differential analysis with selfishReproducibility and differential analysis with selfish
Reproducibility and differential analysis with selfishtuxette
 
GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Comple...
GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Comple...GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Comple...
GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Comple...Fabio Petroni, PhD
 
fuzzy fuzzification and defuzzification
fuzzy fuzzification and defuzzificationfuzzy fuzzification and defuzzification
fuzzy fuzzification and defuzzificationNourhan Selem Salm
 
Spatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexesSpatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexesKolja Kleineberg
 
Using Spectral Clustering to find Successful Hero Combinations in DOTA 2
Using Spectral Clustering to find Successful Hero Combinations in DOTA 2Using Spectral Clustering to find Successful Hero Combinations in DOTA 2
Using Spectral Clustering to find Successful Hero Combinations in DOTA 2Roderick Cox
 

Similar to Quantitative Redistricting Workshop - Machine Learning for Fair Redistricting- Soledad Villar, October 9, 2018 (20)

tutorial_kearns
tutorial_kearnstutorial_kearns
tutorial_kearns
 
nips-gg
nips-ggnips-gg
nips-gg
 
Towards controlling evolutionary dynamics through network geometry: some very...
Towards controlling evolutionary dynamics through network geometry: some very...Towards controlling evolutionary dynamics through network geometry: some very...
Towards controlling evolutionary dynamics through network geometry: some very...
 
Quantitative Redistricting Workshop - Partisan Gerrymandering Versus Geograph...
Quantitative Redistricting Workshop - Partisan Gerrymandering Versus Geograph...Quantitative Redistricting Workshop - Partisan Gerrymandering Versus Geograph...
Quantitative Redistricting Workshop - Partisan Gerrymandering Versus Geograph...
 
An Extension of Downs Model of Political Competition using Fuzzy Logic (Soci...
An Extension of Downs Model of Political Competition using Fuzzy Logic (Soci...An Extension of Downs Model of Political Competition using Fuzzy Logic (Soci...
An Extension of Downs Model of Political Competition using Fuzzy Logic (Soci...
 
Linear algebra havard university
Linear algebra havard universityLinear algebra havard university
Linear algebra havard university
 
Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...
 
Alpha Go: in few slides
Alpha Go: in few slidesAlpha Go: in few slides
Alpha Go: in few slides
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
 
XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...
XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...
XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...
 
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks
 
Reproducibility and differential analysis with selfish
Reproducibility and differential analysis with selfishReproducibility and differential analysis with selfish
Reproducibility and differential analysis with selfish
 
GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Comple...
GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Comple...GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Comple...
GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Comple...
 
Population Density Mapping using the Dasymetric Method
Population Density Mapping using the Dasymetric MethodPopulation Density Mapping using the Dasymetric Method
Population Density Mapping using the Dasymetric Method
 
Chapter 2. Know Your Data.ppt
Chapter 2. Know Your Data.pptChapter 2. Know Your Data.ppt
Chapter 2. Know Your Data.ppt
 
fuzzy fuzzification and defuzzification
fuzzy fuzzification and defuzzificationfuzzy fuzzification and defuzzification
fuzzy fuzzification and defuzzification
 
HW1 MIT Fall 2005
HW1 MIT Fall 2005HW1 MIT Fall 2005
HW1 MIT Fall 2005
 
Spatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexesSpatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexes
 
K033049053
K033049053K033049053
K033049053
 
Using Spectral Clustering to find Successful Hero Combinations in DOTA 2
Using Spectral Clustering to find Successful Hero Combinations in DOTA 2Using Spectral Clustering to find Successful Hero Combinations in DOTA 2
Using Spectral Clustering to find Successful Hero Combinations in DOTA 2
 

More from The Statistical and Applied Mathematical Sciences Institute

More from The Statistical and Applied Mathematical Sciences Institute (20)

Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
 
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
 
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
 
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
 
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
 
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
 
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
 
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
 
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
 
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
 
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
 
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
 
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
 
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
 
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
 
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
 
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
 
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
 
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
 
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
 

Recently uploaded

Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupJonathanParaisoCruz
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxEyham Joco
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU
 

Recently uploaded (20)

Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized Group
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptx
 

Quantitative Redistricting Workshop - Machine Learning for Fair Redistricting- Soledad Villar, October 9, 2018

  • 1. Machine learning for fair redistricting Soledad Villar Center for Data Science Courant Institute of Mathematical Sciences Workshop on Quantitative Gerrymandering October 9, 2018
  • 2. A few lines of research Statistical/computational analysis of random maps. Compare current maps properties with those of random maps. Demonstrate mathematically that certain maps are unfair.
  • 3. A few lines of research Statistical/computational analysis of random maps. Compare current maps properties with those of random maps. Demonstrate mathematically that certain maps are unfair. Theoretical analysis. How much can one party gerrymander with constraints. How different constraints interact. Effect of geographic distribution on the gerrymandering power. Computational hardness. Understanding the limits of the problem.
  • 4. A few lines of research Statistical/computational analysis of random maps. Compare current maps properties with those of random maps. Demonstrate mathematically that certain maps are unfair. Theoretical analysis. How much can one party gerrymander with constraints. How different constraints interact. Effect of geographic distribution on the gerrymandering power. Computational hardness. Understanding the limits of the problem. Algorithms for “fairer redistricting”. Analyzed by simulations or game theory. Provide a baseline to compare maps. Provide a way to compute fair maps.
  • 5. This talk Hardness result: “fair” redistricting is NP-hard. Use ideas from k-means clustering in the plane is NP-hard. Game for fair redistricting. Goal: train a neural network to learn how to play the game. Question: other ways machine learning can be useful here?
  • 6. Fair redistricting is hard Joint work with Richard Kueng and Dustin Mixon Redistricting is connected to NP-hard problems. [Altman 1997] Argument against “automatic redistricting”. Computational intractability is inherent to the redistricting problem. Worst-case complexity says very little about real-world maps, but identifies the limits of the problem. What performance guarantees are possible for redistricting algorithms?
  • 7. Fair maps among legal compliant maps Compliant maps all districts have approximately the same population mild notion of geographic compactness Fair maps both parties receive at least some level of representation.
  • 8. Fair maps among legal compliant maps Compliant maps all districts have approximately the same population mild notion of geographic compactness Fair maps both parties receive at least some level of representation. Theorem Deciding whether there exists a fair redistricting among compliant maps is NP-hard.
  • 9. Proof idea: reduction from planar 3-SAT Inspired by Mahanjan, Minbhorkar and Varadarajan proof that planar k-means is NP-hard 3-SAT: Deciding whether there exists a boolean assignment that satisfies a formula of the form: (¬x1 ∨ x2 ∨ ¬x4) ∧ (¬x2 ∨ ¬x4 ∨ ¬x3) ∧ . . .
  • 10. Proof idea: reduction from planar 3-SAT Inspired by Mahanjan, Minbhorkar and Varadarajan proof that planar k-means is NP-hard 3-SAT: Deciding whether there exists a boolean assignment that satisfies a formula of the form: (¬x1 ∨ x2 ∨ ¬x4) ∧ (¬x2 ∨ ¬x4 ∨ ¬x3) ∧ . . . Planar 3-SAT: Consider the bipartite graph: V={variables, clauses}
  • 11. Proof idea: reduction from planar 3-SAT Inspired by Mahanjan, Minbhorkar and Varadarajan proof that planar k-means is NP-hard 3-SAT: Deciding whether there exists a boolean assignment that satisfies a formula of the form: (¬x1 ∨ x2 ∨ ¬x4) ∧ (¬x2 ∨ ¬x4 ∨ ¬x3) ∧ . . . Planar 3-SAT: Consider the bipartite graph: V={variables, clauses} Planar 3-SAT is NP-complete.
  • 12. Reduction Every planar 3-SAT instance can be posed as deciding whether there exists a fair redistricting. Town Pop D R Big: L L 0 Small: 2γ 3 L 2γ 3 L 0 Adjacent: L 2 + γL 6 L 4 L 4 + γL 6 Edge: L 2 L 2 − γL 4 L 2 + γL 4 Population per district ∈ [L, L+γ]. D wins at most 2k districts even with almost half of the vote and Total pop 2k Formula is satisfiable iff D wins 2k districts.
  • 13. Ideas illustrate what Jonathan Rodden was saying yesterday.
  • 14. Ideas illustrate what Jonathan Rodden was saying yesterday. Next: Algorithms for fairer maps.
  • 15. Example 1: Shortest-Splitline Algorithm Idea: Even number of districts. Split among the shortest line that divides the population in half. Odd number of districts. Split among the shortest line that divides the population in appropriate proportion. Smith and Ryan, Center for Range Voting, http://www.rangevoting.org/GerryExamples.html
  • 16. Example 1: Shortest-Splitline Algorithm Idea: Even number of districts. Split among the shortest line that divides the population in half. Odd number of districts. Split among the shortest line that divides the population in appropriate proportion. Example Smith and Ryan, Center for Range Voting, http://www.rangevoting.org/GerryExamples.html
  • 17. Example 2: “I-Cut-You-Freeze” protocol Two political parties sequentially divide up a state: First player divides a map of a state into the allotted number of districts, each with equal numbers of voters. Second player chooses one district to “freeze,” so no further changes could be made to it, and re-map the remaining districts as it likes. Pedgen, Procaccia and Yu, A partisan districting protocol with provably nonpartisan outcomes
  • 18. “I-Cut-You-Freeze” protocol Leverages the competition between Republicans and Democrats to produce an equitable result1. Each party can pursue a strategy that guarantees it something that it wants. Is it possible to produce a protocol that no player has an advantage even in the finite district setting? 1 When the number of districts goes to infinity no player has an advantage with respect to the other one
  • 19. Ongoing work with Dustin Mixon Game: Two players take turns assigning precincts to districts so that: Districts are always connected. In the end all district have equal population. In the end districts satisfy some form of compactness.
  • 20. Theorem In the symmetric non geometrically constrained setting no player has an advantage. Conjecture: players have a strategy that allows them to win the proportion of seats corresponding with their proportion of voters.
  • 21. Theorem In the symmetric non geometrically constrained setting no player has an advantage. Conjecture: players have a strategy that allows them to win the proportion of seats corresponding with their proportion of voters. Compare to efficiency gap [Stephanopoulos, McGhee] EG is minimized at doubly proportionality [Bernstein, Duchin]. EG=0 ⇒ 55% of vote share → 60% of seats
  • 22. Theorem In the symmetric non geometrically constrained setting no player has an advantage. Conjecture: players have a strategy that allows them to win the proportion of seats corresponding with their proportion of voters. Compare to efficiency gap [Stephanopoulos, McGhee] EG is minimized at doubly proportionality [Bernstein, Duchin]. EG=0 ⇒ 55% of vote share → 60% of seats Is it true for the geometrically constrained case?
  • 23. Goal: run this game in real maps with machine learning Reinforcement learning There exists a function (unknown): Q : (states, actions) → R The objective is to maximize the cumulative reward: R = ∞ t=0 γt Qt, (γ ∈ [0, 1] discount rate) Learn a policy π : S × A → [0, 1] where π(s, a) gives the probability of taking action a while in state s.
  • 25. Learning how to play Let Q be a neural network: Q(s, a) = ρ(AL(. . . ρ(A1(s, a)) . . .)) Idea: Train Q through self-play. Explore the space of states via Monte-Carlo Tree Search
  • 26. Difficulties The game is too large. Combine large scale strategy with small scale strategy. Locally valid moves produce maps that will be invalid later. Ask players for a witness that shows that the map can be completed.
  • 27. Difficulties The game is too large. Combine large scale strategy with small scale strategy. Locally valid moves produce maps that will be invalid later. Ask players for a witness that shows that the map can be completed. Challenging for gerrymandering and for machine learning.
  • 28. Thanks Fair redistricting is hard Richard Kueng, Dustin G. Mixon, Soledad Villar arXiv:1808.08905