We propose a game where two players take turns assigning precincts to districts. In a simplified setting where districts have no geographic constraints, both players have a strategy that allows them to win a number of districts proportional to their share of voters. For the game on real maps (with geographic constraints) we are developing a player based on neural networks and reinforcement learning that aims to learn to play this game optimally through self-play (inspired by AlphaZero). As in other simulation-based gerrymandering research, the difficulty in this approach is the size of the problem. In fact, we show that the problem of deciding whether there exists a 'fair map' in the set of 'legal maps' (for appropriately simple definitions of 'legal' and 'fair') is NP-complete.
Quantitative Redistricting Workshop - Machine Learning for Fair Redistricting - Soledad Villar, October 9, 2018
1. Machine learning for fair redistricting
Soledad Villar
Center for Data Science
Courant Institute of Mathematical Sciences
Workshop on Quantitative Gerrymandering
October 9, 2018
4. A few lines of research
Statistical/computational analysis of random maps.
Compare current maps properties with those of random maps.
Demonstrate mathematically that certain maps are unfair.
Theoretical analysis.
How much can one party gerrymander with constraints.
How different constraints interact.
Effect of geographic distribution on the gerrymandering power.
Computational hardness.
Understanding the limits of the problem.
Algorithms for “fairer redistricting”.
Analyzed by simulations or game theory.
Provide a baseline to compare maps.
Provide a way to compute fair maps.
5. This talk
Hardness result: “fair” redistricting is NP-hard.
Uses ideas from the proof that k-means clustering in the plane is NP-hard.
Game for fair redistricting.
Goal: train a neural network to learn how to play the game.
Question: other ways machine learning can be useful here?
6. Fair redistricting is hard
Joint work with Richard Kueng and Dustin Mixon
Redistricting is connected to NP-hard problems. [Altman 1997]
Argument against “automatic redistricting”.
Computational intractability is inherent to the redistricting problem.
Worst-case complexity says very little about real-world maps, but
identifies the limits of the problem.
What performance guarantees are possible for redistricting
algorithms?
8. Fair maps among legally compliant maps
Compliant maps:
all districts have approximately the same population
a mild notion of geographic compactness
Fair maps:
both parties receive at least some level of representation.
Theorem
Deciding whether there exists a fair redistricting among compliant maps is NP-hard.
11. Proof idea: reduction from planar 3-SAT
Inspired by Mahajan, Nimbhorkar and Varadarajan's proof that planar k-means is NP-hard
3-SAT: Deciding whether there exists a boolean assignment that
satisfies a formula of the form:
(¬x1 ∨ x2 ∨ ¬x4) ∧ (¬x2 ∨ ¬x4 ∨ ¬x3) ∧ . . .
Planar 3-SAT: Consider the bipartite graph: V={variables, clauses}
Planar 3-SAT is NP-complete.
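To make the 3-SAT decision problem concrete, here is a minimal brute-force checker in Python (exponential in the number of variables, for illustration only), using the standard encoding where literal k means x_k and −k means ¬x_k:

```python
from itertools import product

def satisfiable(clauses, n_vars):
    """Brute-force 3-SAT: try every boolean assignment and check
    whether each clause contains at least one satisfied literal."""
    for assignment in product([False, True], repeat=n_vars):
        if all(any(assignment[abs(lit) - 1] == (lit > 0) for lit in clause)
               for clause in clauses):
            return True
    return False

# The two clauses shown above: (¬x1 ∨ x2 ∨ ¬x4) ∧ (¬x2 ∨ ¬x4 ∨ ¬x3)
clauses = [[-1, 2, -4], [-2, -4, -3]]
print(satisfiable(clauses, n_vars=4))  # True (e.g. all variables False)
```

The reduction takes instances of this decision problem and encodes them as redistricting instances.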
12. Reduction
Every planar 3-SAT instance can be posed as deciding whether there exists a fair redistricting.

Town       Pop             D               R
Big        L               L               0
Small      (2γ/3)L         (2γ/3)L         0
Adjacent   L/2 + γL/6      L/4             L/4 + γL/6
Edge       L               L/2 − γL/4      L/2 + γL/4

Population per district ∈ [L, L + γ]. Total pop 2k.
D wins at most 2k districts, even with almost half of the vote.
Formula is satisfiable iff D wins 2k districts.
14. These ideas illustrate what Jonathan Rodden was saying yesterday.
Next: Algorithms for fairer maps.
16. Example 1: Shortest-Splitline Algorithm
Idea:
Even number of districts: split along the shortest line that divides the population in half.
Odd number of districts: split along the shortest line that divides the population in the appropriate proportion.
Example
Smith and Ryan, Center for Range Voting, http://www.rangevoting.org/GerryExamples.html
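The idea above can be sketched in Python. This is a simplification of the Shortest-Splitline Algorithm: it treats precincts as equal-weight points and splits only along the wider coordinate axis, whereas the actual algorithm searches over all line orientations for the shortest splitting line:

```python
def splitline(points, n_districts):
    """Recursively split a list of equal-weight (x, y) points into
    n_districts of near-equal population (simplified sketch)."""
    if n_districts == 1:
        return [points]
    a = n_districts // 2          # split n into a and b, with a <= b
    b = n_districts - a
    # Sort along the axis with the larger spread, then cut in proportion a:b.
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    axis = 0 if max(xs) - min(xs) >= max(ys) - min(ys) else 1
    pts = sorted(points, key=lambda p: p[axis])
    cut = round(len(pts) * a / n_districts)
    return splitline(pts[:cut], a) + splitline(pts[cut:], b)
```

For example, `splitline([(i, j) for i in range(4) for j in range(4)], 4)` partitions a 4×4 grid of points into four districts of four points each.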
17. Example 2: “I-Cut-You-Freeze” protocol
Two political parties sequentially divide up a state:
First player divides a map of a state into the allotted number
of districts, each with equal numbers of voters.
Second player chooses one district to "freeze," so no further changes can be made to it, and re-maps the remaining districts as it likes.
Pegden, Procaccia and Yu, A partisan districting protocol with provably nonpartisan outcomes
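The turn structure of the protocol can be sketched with hypothetical player callbacks (`draw_map[i]` re-maps the unfrozen territory, `choose_freeze[i]` picks a district to freeze); this illustrates the alternation only, not the parties' optimal strategies:

```python
def i_cut_you_freeze(n_districts, draw_map, choose_freeze):
    """Sketch of the Pegden-Procaccia-Yu protocol. Each round, the
    current cutter partitions the unfrozen territory, the other player
    freezes one district, then the freezer becomes the next cutter."""
    frozen = []
    cutter = 0  # player 0 draws the first map
    districts = draw_map[cutter](n_districts, frozen)
    while len(frozen) < n_districts - 1:
        freezer = 1 - cutter
        pick = choose_freeze[freezer](districts)
        frozen.append(districts.pop(pick))
        cutter = freezer  # the freezer re-maps the rest next round
        districts = draw_map[cutter](n_districts - len(frozen), frozen)
    frozen.extend(districts)  # the final leftover district is fixed
    return frozen

# Toy demo: 6 precincts, 3 districts, both players use naive callbacks.
def equal_chunks(k, frozen):
    used = {p for d in frozen for p in d}
    rest = [p for p in range(6) if p not in used]
    size = len(rest) // k
    return [rest[i * size:(i + 1) * size] for i in range(k)]

final = i_cut_you_freeze(3, [equal_chunks, equal_chunks],
                         [lambda ds: 0, lambda ds: 0])
```

Real players would implement `draw_map` and `choose_freeze` strategically over actual precinct geography.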
18. “I-Cut-You-Freeze” protocol
Leverages the competition between Republicans and Democrats to produce an equitable result¹.
Each party can pursue a strategy that guarantees it something that it wants.
Is it possible to produce a protocol in which no player has an advantage, even in the finite-district setting?
¹ When the number of districts goes to infinity, neither player has an advantage over the other.
19. Ongoing work with Dustin Mixon
Game: Two players take turns assigning precincts to districts so
that:
Districts are always connected.
In the end all districts have equal population.
In the end districts satisfy some form of compactness.
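The "districts are always connected" rule is a per-move check; a minimal sketch, assuming precincts form an adjacency graph (the `adjacency` dict is a hypothetical representation) and using breadth-first search:

```python
from collections import deque

def is_connected(precincts, adjacency):
    """Check whether a district's precincts induce a connected subgraph.
    adjacency[p] lists the precincts neighboring precinct p."""
    if not precincts:
        return True
    todo = deque([next(iter(precincts))])
    seen = {todo[0]}
    while todo:
        p = todo.popleft()
        for q in adjacency[p]:
            if q in precincts and q not in seen:
                seen.add(q)
                todo.append(q)
    return seen == set(precincts)
```

For example, on a 2×2 grid of precincts, `{0, 1}` (adjacent cells) passes while `{1, 2}` (diagonal cells) fails.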
22. Theorem
In the symmetric, non-geometrically-constrained setting, no player has an advantage.
Conjecture: players have a strategy that allows them to win the proportion of seats corresponding to their proportion of voters.
Compare to the efficiency gap [Stephanopoulos, McGhee]:
EG is minimized at double proportionality [Bernstein, Duchin].
EG = 0 ⇒ 55% of vote share → 60% of seats.
Is it true for the geometrically constrained case?
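For reference, the efficiency gap compares the two parties' wasted votes (a winner's votes beyond the 50% threshold, and all of a loser's votes); a minimal sketch, together with the seats-votes relation that EG = 0 forces under equal turnout:

```python
def efficiency_gap(district_votes):
    """Efficiency gap [Stephanopoulos, McGhee]: (wasted_D - wasted_R) / total.
    district_votes is a list of (votes_D, votes_R) pairs, one per district."""
    wasted_d = wasted_r = total = 0.0
    for d, r in district_votes:
        total += d + r
        threshold = (d + r) / 2  # votes beyond half are wasted by the winner
        if d > r:
            wasted_d += d - threshold
            wasted_r += r
        else:
            wasted_r += r - threshold
            wasted_d += d
    return (wasted_d - wasted_r) / total

# With equal-turnout districts, EG = 0 forces seats = 2 * votes - 1/2,
# so 55% of the vote share corresponds to 60% of the seats:
print(round(2 * 0.55 - 0.5, 2))  # 0.6
```

For instance, three 75-25 wins for D plus two 25-75 losses give D 55% of the votes, 60% of the seats, and an efficiency gap of exactly zero.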
23. Goal: run this game in real maps with machine learning
Reinforcement learning
There exists a function (unknown):
Q : (states, actions) → ℝ
The objective is to maximize the cumulative reward:
R = Σ_{t=0}^{∞} γ^t Q_t, (γ ∈ [0, 1] discount rate)
Learn a policy π : S × A → [0, 1], where π(s, a) gives the probability of taking action a while in state s.
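The cumulative reward above, truncated to a finite horizon for this toy sketch, is a one-liner:

```python
def discounted_return(rewards, gamma):
    """R = sum over t of gamma**t * Q_t: the discounted cumulative
    reward, truncated to the finite list of rewards given."""
    return sum(gamma ** t * q for t, q in enumerate(rewards))

print(discounted_return([1, 1, 1], 0.5))  # 1 + 0.5 + 0.25 = 1.75
```

The discount rate γ controls how strongly early moves (e.g. early precinct assignments) outweigh later ones.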
25. Learning how to play
Let Q be a neural network:
Q(s, a) = ρ(A_L(... ρ(A_1(s, a)) ...))
Idea:
Train Q through self-play.
Explore the space of states via
Monte-Carlo Tree Search
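The layered form of Q can be sketched with NumPy. The layer sizes and random weights below are arbitrary placeholders, with ρ taken to be ReLU; a real player would train these weights through self-play:

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp_q(s, a, weights, rho=lambda x: np.maximum(x, 0)):
    """Q(s, a) = rho(A_L(... rho(A_1(s, a)) ...)): affine maps A_i
    interleaved with the nonlinearity rho; the last layer is linear."""
    x = np.concatenate([s, a])  # state-action pair as one input vector
    for W, b in weights[:-1]:
        x = rho(W @ x + b)
    W, b = weights[-1]
    return (W @ x + b).item()   # scalar value estimate

# Toy network: (state dim 4 + action dim 2) -> 8 hidden units -> 1 output.
weights = [(rng.normal(size=(8, 6)), rng.normal(size=8)),
           (rng.normal(size=(1, 8)), rng.normal(size=1))]
q = mlp_q(np.ones(4), np.zeros(2), weights)
```

In the AlphaZero-style setup, such a network scores candidate precinct assignments and Monte-Carlo Tree Search uses those scores to explore the game tree.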
27. Difficulties
The game is too large.
Combine large-scale strategy with small-scale strategy.
Locally valid moves produce maps that will be invalid later.
Ask players for a witness that shows that the map can be
completed.
Challenging for gerrymandering and for machine learning.