SlideShare a Scribd company logo
1 of 21
THE BEST FIT LINE
a study of how that one group was wrong, and
how modeling is impossible
By Maneesha Palakurthi
PART 1
Let’s talk about why that one group was wrong.
THIS IS THE GRAPH THAT GROUP E3 USED TO
REPRESENT PEOPLE’S ECONOMIC PATHS.
As you can see, their concept was that in the cycle of loss and gain, the
rich gain more than they lose (and so get continually richer), the middle class
breaks even (and so remain stagnant wealth-wise), and the poor's gains don't
make up for their losses (and so get progressively poorer). This graphs refers to
wealth in amounts. There are two glaring mistakes in this representation.
THE FIRST PROBLEM
The first is that the three lines begin
so close together. The wealth gap does widen
over time, true, but that doesn't mean that it
starts so small. In this case it would be more
accurate to have the following:
For reference:
THE SECOND PROBLEM
The other large issue with their graph
is that there are far too many "outliers".
Considering that their wealth axis is set on a
basis of specific economic values, only people
who begin with the exact amount of the three
y-intercepts are shown on this graph. So if you
were to give the axes the following values: (see
below graph)
For reference:
…only people who begin with 4,
8, or 12 units of wealth are
represented. Those who begin
with any other amount are
unaccounted for.
SOLUTION FOR THE SECOND PROBLEM
The graph should really look more like the
following one.
For reference:
However, it would
need infinitely more lines, but
creating this same basic shape.
The pink line would be
stationed at whatever y-value
is considered at the time to be
median wealth/wage.
ANOTHER APPROACH
Now there's another entirely different approach that yields the same
meaning, using relativity and percentiles. It is drawn as one of the most basic
graphs of all, the y=x graph. Here, the x and y axes both represent a wealth
percentile. The x axis represents your financial standing when you start, and your
y axis represents your financial standing when you finish.
A SIMPLIFIED EXPLANATION
Say there's 100 people in a city. You happen to be the
second richest person in the city, so you are the 99th
percentile. Similarly to the fashion that group E3 had been
trying to demonstrate with their graph, you as one of the rich,
are getting richer. However, your wealth doesn't increase
faster than the one richest person, or slower than the people
who started poorer than you, so you continue to rank as the
second richest person in town. Thus, your final percentile
value is 99, and you fall on the y=x line.
For reference:
SCATTER PLOTTED
Now that graph also has outliers
not taken into account, though in this
case they are actually a minority. If you
were to scatter plot the percentile graph,
it would look something like this:
For reference:
The red dots are outliers.
However, as you can see the
line is what is commonly
known as a best fit line, and is
therefore acceptable.
BUT.
(Here starts part two in which I contradict some of what
I just said mostly because I'm switching topics to the
best fit line discussion.)
Best fit lines should never be
considered acceptable! The are
one of the most prevalent
examples of humans and our
dependency on taxonomy!
HERE’S WHY…
When given the below
scatterplot…
People attempt to draw
what they call a best fit
line. Which ends up looking
like the following.
From four unconnected points on the graph, we conjure up
a best fit line, and say that it represents them. But it doesn't! Only
one of the points is actually on the line, yet we deem this the line
that represents the data. Unless your best fit line goes through at
least 90% of the points on the graph, it should be considered invalid.
SO WHY DO WE DRAW BEST FIT LINES?
It's a form of taxonomy. A way to say
that things (or points) are a part of a
category (or line) that they really aren't. It
helps us to associate things with whatever
is similar, and ignore outliers.
WHICH BRINGS ME TO MY NEXT POINT.
(HAHA, POINT, GET IT?)
The gravest form of the best fit line and
taxonomy is...
THE MODEL.
Modeling is the worst kind of best fit line.
WHY MODELS ARE UNACCEPTABLE
So we study three different revolutions, and then create a
model/system based off of those three that should be applicable to
every other situation. Not only are outliers disregarded, but nearly
all other events are too!
Let me explain visually.
The red dots represent the events that we studied. This
graph shows only the data that we studied.
Key:
Red dots = Events Studied
Here the gray line is the best fit line or the model, made
based on the data.
Key:
Red Dots = Events Studied
Gray Line = Best Fit Line for Red Dots
The green dots are all the events that took place that we
didn't study. So it's obvious that the events fell in a very
wide range.
Key:
Red Dots = Events Studied
Gray Line = Best Fit Line for
Red Dots
Green Dots = Unstudied Events
in History
So here are all the events throughout history, and the one
model that was made to represent all of them. I don't think
it quite works, do you?
Key:
Red Dots = Events Studied
Gray Line = Best Fit Line/Model
for Red Dots
Green Dots = Unstudied Events
in History
IN CONCLUSION…
We draw best fit lines and create models in order to
fool ourselves into thinking that:
a) It's actually possible to categorize the world
b) There is actually one line or model that applies
to every point of data
c) We understand the world
EL FIN

More Related Content

Similar to Maneesha Palakurthi

TitlenameInstructor namedateWhere do you live.docx
TitlenameInstructor namedateWhere do you live.docxTitlenameInstructor namedateWhere do you live.docx
TitlenameInstructor namedateWhere do you live.docxedwardmarivel
 
Chapter 4 Problem 31. For problem three in chapter four, a teac.docx
Chapter 4 Problem 31. For problem three in chapter four,   a teac.docxChapter 4 Problem 31. For problem three in chapter four,   a teac.docx
Chapter 4 Problem 31. For problem three in chapter four, a teac.docxrobertad6
 
Compare And Contrast Essay Conclusion.pdf
Compare And Contrast Essay Conclusion.pdfCompare And Contrast Essay Conclusion.pdf
Compare And Contrast Essay Conclusion.pdfJessica Summers
 
Digital txtbook final
Digital txtbook finalDigital txtbook final
Digital txtbook finalneetaa2014
 
Intro to Bubble Charts by BECKON
Intro to Bubble Charts by BECKONIntro to Bubble Charts by BECKON
Intro to Bubble Charts by BECKONAmanda Roberts
 
Similarities Essay. Comparative Essay - 10 Examples, Format, Pdf Examples
Similarities Essay. Comparative Essay - 10 Examples, Format, Pdf  ExamplesSimilarities Essay. Comparative Essay - 10 Examples, Format, Pdf  Examples
Similarities Essay. Comparative Essay - 10 Examples, Format, Pdf ExamplesClaire Flanagan
 
Edith And Teriya
Edith And TeriyaEdith And Teriya
Edith And Teriyasnorrishk
 
Data Visualization Inspiration: Analysis To Insights To Action, Faster!
Data Visualization Inspiration: Analysis To Insights To Action, Faster!Data Visualization Inspiration: Analysis To Insights To Action, Faster!
Data Visualization Inspiration: Analysis To Insights To Action, Faster!gtmarketing8688
 
Charts, Graphs and Tables
Charts, Graphs and TablesCharts, Graphs and Tables
Charts, Graphs and TablesLiliyaNersesyan
 
data handling class 8
data handling class 8data handling class 8
data handling class 8HimakshiKava
 
A brief history of math
A brief history of mathA brief history of math
A brief history of mathFadli Dly
 
Spearman after priory man
Spearman after priory manSpearman after priory man
Spearman after priory manWill Williams
 

Similar to Maneesha Palakurthi (20)

Visualization-1
Visualization-1Visualization-1
Visualization-1
 
TitlenameInstructor namedateWhere do you live.docx
TitlenameInstructor namedateWhere do you live.docxTitlenameInstructor namedateWhere do you live.docx
TitlenameInstructor namedateWhere do you live.docx
 
Chapter 4 Problem 31. For problem three in chapter four, a teac.docx
Chapter 4 Problem 31. For problem three in chapter four,   a teac.docxChapter 4 Problem 31. For problem three in chapter four,   a teac.docx
Chapter 4 Problem 31. For problem three in chapter four, a teac.docx
 
LIB300 Week 6 Using Visuals
LIB300 Week 6 Using VisualsLIB300 Week 6 Using Visuals
LIB300 Week 6 Using Visuals
 
Compare And Contrast Essay Conclusion.pdf
Compare And Contrast Essay Conclusion.pdfCompare And Contrast Essay Conclusion.pdf
Compare And Contrast Essay Conclusion.pdf
 
assignment 2
assignment 2assignment 2
assignment 2
 
Digital txtbook final
Digital txtbook finalDigital txtbook final
Digital txtbook final
 
Intro to Bubble Charts by BECKON
Intro to Bubble Charts by BECKONIntro to Bubble Charts by BECKON
Intro to Bubble Charts by BECKON
 
Similarities Essay. Comparative Essay - 10 Examples, Format, Pdf Examples
Similarities Essay. Comparative Essay - 10 Examples, Format, Pdf  ExamplesSimilarities Essay. Comparative Essay - 10 Examples, Format, Pdf  Examples
Similarities Essay. Comparative Essay - 10 Examples, Format, Pdf Examples
 
Edith And Teriya
Edith And TeriyaEdith And Teriya
Edith And Teriya
 
Lesson 13
Lesson 13Lesson 13
Lesson 13
 
Data Visualization Inspiration: Analysis To Insights To Action, Faster!
Data Visualization Inspiration: Analysis To Insights To Action, Faster!Data Visualization Inspiration: Analysis To Insights To Action, Faster!
Data Visualization Inspiration: Analysis To Insights To Action, Faster!
 
Skills booklet
Skills bookletSkills booklet
Skills booklet
 
Charts, Graphs and Tables
Charts, Graphs and TablesCharts, Graphs and Tables
Charts, Graphs and Tables
 
data handling class 8
data handling class 8data handling class 8
data handling class 8
 
The grand tradition v007
The grand tradition v007The grand tradition v007
The grand tradition v007
 
Statistics and probability
Statistics and probabilityStatistics and probability
Statistics and probability
 
Sales Forecasting Basics
Sales Forecasting BasicsSales Forecasting Basics
Sales Forecasting Basics
 
A brief history of math
A brief history of mathA brief history of math
A brief history of math
 
Spearman after priory man
Spearman after priory manSpearman after priory man
Spearman after priory man
 

Maneesha Palakurthi

  • 1. THE BEST FIT LINE a study of how that one group was wrong, and how modeling is impossible By Maneesha Palakurthi
  • 2. PART 1 Let’s talk about why that one group was wrong.
  • 3. THIS IS THE GRAPH THAT GROUP E3 USED TO REPRESENT PEOPLE’S ECONOMIC PATHS. As you can see, their concept was that in the cycle of loss and gain, the rich gain more than they lose (and so get continually richer), the middle class breaks even (and so remain stagnant wealth-wise), and the poor's gains don't make up for their losses (and so get progressively poorer). This graphs refers to wealth in amounts. There are two glaring mistakes in this representation.
  • 4. THE FIRST PROBLEM The first is that the three lines begin so close together. The wealth gap does widen over time, true, but that doesn't mean that it starts so small. In this case it would be more accurate to have the following: For reference:
  • 5. THE SECOND PROBLEM The other large issue with their graph is that there are far too many "outliers". Considering that their wealth axis is set on a basis of specific economic values, only people who begin with the exact amount of the three y-intercepts are shown on this graph. So if you were to give the axes the following values: (see below graph) For reference: …only people who begin with 4, 8, or 12 units of wealth are represented. Those who begin with any other amount are unaccounted for.
  • 6. SOLUTION FOR THE SECOND PROBLEM The graph should really look more like the following one. For reference: However, it would need infinitely more lines, but creating this same basic shape. The pink line would be stationed at whatever y-value is considered at the time to be median wealth/wage.
  • 7. ANOTHER APPROACH Now there's another entirely different approach that yields the same meaning, using relativity and percentiles. It is drawn as one of the most basic graphs of all, the y=x graph. Here, the x and y axes both represent a wealth percentile. The x axis represents your financial standing when you start, and your y axis represents your financial standing when you finish.
  • 8. A SIMPLIFIED EXPLANATION Say there's 100 people in a city. You happen to be the second richest person in the city, so you are the 99th percentile. Similarly to the fashion that group E3 had been trying to demonstrate with their graph, you as one of the rich, are getting richer. However, your wealth doesn't increase faster than the one richest person, or slower than the people who started poorer than you, so you continue to rank as the second richest person in town. Thus, your final percentile value is 99, and you fall on the y=x line. For reference:
  • 9. SCATTER PLOTTED Now that graph also has outliers not taken into account, though in this case they are actually a minority. If you were to scatter plot the percentile graph, it would look something like this: For reference: The red dots are outliers. However, as you can see the line is what is commonly known as a best fit line, and is therefore acceptable.
  • 10. BUT. (Here starts part two in which I contradict some of what I just said mostly because I'm switching topics to the best fit line discussion.)
  • 11. Best fit lines should never be considered acceptable! The are one of the most prevalent examples of humans and our dependency on taxonomy!
  • 12. HERE’S WHY… When given the below scatterplot… People attempt to draw what they call a best fit line. Which ends up looking like the following. From four unconnected points on the graph, we conjure up a best fit line, and say that it represents them. But it doesn't! Only one of the points is actually on the line, yet we deem this the line that represents the data. Unless your best fit line goes through at least 90% of the points on the graph, it should be considered invalid.
  • 13. SO WHY DO WE DRAW BEST FIT LINES? It's a form of taxonomy. A way to say that things (or points) are a part of a category (or line) that they really aren't. It helps us to associate things with whatever is similar, and ignore outliers.
  • 14. WHICH BRINGS ME TO MY NEXT POINT. (HAHA, POINT, GET IT?) The gravest form of the best fit line and taxonomy is... THE MODEL. Modeling is the worst kind of best fit line.
  • 15. WHY MODELS ARE UNACCEPTABLE So we study three different revolutions, and then create a model/system based off of those three that should be applicable to every other situation. Not only are outliers disregarded, but nearly all other events are too! Let me explain visually.
  • 16. The red dots represent the events that we studied. This graph shows only the data that we studied. Key: Red dots = Events Studied
  • 17. Here the gray line is the best fit line or the model, made based on the data. Key: Red Dots = Events Studied Gray Line = Best Fit Line for Red Dots
  • 18. The green dots are all the events that took place that we didn't study. So it's obvious that the events fell in a very wide range. Key: Red Dots = Events Studied Gray Line = Best Fit Line for Red Dots Green Dots = Unstudied Events in History
  • 19. So here are all the events throughout history, and the one model that was made to represent all of them. I don't think it quite works, do you? Key: Red Dots = Events Studied Gray Line = Best Fit Line/Model for Red Dots Green Dots = Unstudied Events in History
  • 20. IN CONCLUSION… We draw best fit lines and create models in order to fool ourselves into thinking that: a) It's actually possible to categorize the world b) There is actually one line or model that applies to every point of data c) We understand the world