New NYC Business Incorporation 2005-2013
An Exploration of Non-Minority and Minority-Owned Enterprise Creation
By Shelby Ahern
stahern@gmail.com
NYC Data Science Academy
Student Demo Day 07-21-2014
R005: Data Science by R (Beginner level)
Explore
• New Business Incorporation in NYC between 2005 and 2013, and
• New Business Incorporation, by Minority and Non-Minority Ownership
Data Sources
• Active New York Corporations: Beginning in 1800¹
• NYC Online Directory of Certified Businesses: Minority-Owned Business Enterprises (MBE)²,³
• U.S. Census Population Estimates⁴
• Entity Type:
  • Domestic Business Corporation
  • Domestic Cooperative Corporation
  • Domestic Professional Corporation
Parameters and Notes
• 2005-2013 (9 years)
• Borough = County (i.e., Manhattan = New York County, Brooklyn = Kings County, Queens = Queens County, Bronx = Bronx County, Staten Island = Richmond County)
Create Data Frames of Data from Each Source
Run Summary Statistics for Validation
Split by Borough and Combine DFs from Different Sources
Perform Calculations, i.e., New Incorporations per Capita
DataViz!
Test:
“Density” of New MBE Corps per Minority Population ≠ “Density” of New Non-MBE Corps per Non-Minority Population
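The "split by borough and combine" step can be sketched as follows. The original analysis was done in R; this is a minimal Python sketch using only the standard library. The dictionary layout, the `combine` helper, and the upper-case county names other than NEW YORK are assumptions made for illustration. The single record uses the Manhattan 2005 figures from the example data frame.

```python
# Hypothetical sketch of the "split by borough and combine" step.
# Borough-to-county mapping as listed under Parameters and Notes.
# Upper-case spelling is assumed from the "NEW YORK" rows in the data frame.
BOROUGH_TO_COUNTY = {
    "Manhattan": "NEW YORK",
    "Brooklyn": "KINGS",
    "Queens": "QUEENS",
    "Bronx": "BRONX",
    "Staten Island": "RICHMOND",
}

# One illustrative record per source, keyed by (county, year).
# Values are the Manhattan 2005 figures from the example data frame.
new_corps = {("NEW YORK", 2005): 5101}      # corporations file
new_mbe_corps = {("NEW YORK", 2005): 35}    # MBE directory
population = {("NEW YORK", 2005): {"Tot_Pop": 1529774, "MBE_pop": 690696}}


def combine(borough, year):
    """Merge the three sources into one row for a borough and year."""
    key = (BOROUGH_TO_COUNTY[borough], year)
    row = {"County": key[0], "year": year}
    row["NewCorps"] = new_corps[key]
    # Assumption: a borough-year with no MBE on record counts as zero.
    row["NewMBECorps"] = new_mbe_corps.get(key, 0)
    row.update(population[key])
    return row


row = combine("Manhattan", 2005)
print(row)
```

Keying every source by (county, year) keeps the merge explicit, so a missing borough-year in the corporation or population data surfaces as a `KeyError` rather than a silently misaligned row.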
An initial review of the summaries of the corporation data and the MBE-certified corporations shows…
A major disparity between the number of incorporations per year and the number of MBEs established in that year.
Why?
- Data quality: change in ownership structure, restrictions on MBE certification, and/or filing lag
- Importantly, what the data actually represent: the purpose and process of an MBE application
> MH_Corps
  County   year NewCorps NewMBECorps Tot_Pop MBE_pop¹ NwCorpsperCap NwMBECorpsperCap NwMBECorpsperMBECap NwNonMBECorpsperCap
1 NEW YORK 2005     5101          35 1529774   690696        0.0033         2.30E-05            5.10E-05               0.006
2 NEW YORK 2006     5395          42 1611581   738221        0.0033         2.60E-05            5.70E-05              0.0061
3 NEW YORK 2007     5373          39 1620867   724926        0.0033         2.40E-05            5.40E-05               0.006
4 NEW YORK 2008     5602          38 1634795   696413        0.0034         2.30E-05            5.50E-05              0.0059
5 NEW YORK 2009     7617          39 1629054   669583        0.0047         2.40E-05            5.80E-05              0.0079
6 NEW YORK 2010     9872          34 1585873   674800        0.0062         2.10E-05            5.00E-05              0.0108
7 NEW YORK 2011     9909          24 1601948   703250        0.0062         1.50E-05            3.40E-05               0.011
8 NEW YORK 2012    10326          15 1619090   697407        0.0064         9.30E-06            2.20E-05              0.0112
9 NEW YORK 2013    10345           3 1585873   546732        0.0065         1.90E-06            5.50E-06                0.01
After merging data from the different data frames, we can calculate the number of new corporations filed per capita on a yearly basis.
Further, we calculate new incorporations per capita within specific populations: MBE incorporations per minority resident and non-MBE incorporations per non-minority resident.
Example Data Frame, Manhattan
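The per-capita columns can be reproduced with simple ratios. The sketch below is in Python rather than the R used for the original analysis, and the formulas are inferred from the printed values, not taken from the author's code. In particular, the last column is assumed to be non-MBE incorporations divided by the non-minority population (Tot_Pop minus MBE_pop). The function name `per_capita_columns` is hypothetical.

```python
def per_capita_columns(new_corps, new_mbe_corps, tot_pop, mbe_pop):
    """Compute the four per-capita rates for one county and year.

    Formulas are inferred from the example data frame (an assumption).
    """
    non_mbe_corps = new_corps - new_mbe_corps
    non_minority_pop = tot_pop - mbe_pop
    return {
        "NwCorpsperCap": new_corps / tot_pop,
        "NwMBECorpsperCap": new_mbe_corps / tot_pop,
        "NwMBECorpsperMBECap": new_mbe_corps / mbe_pop,
        "NwNonMBECorpsperCap": non_mbe_corps / non_minority_pop,
    }


# Manhattan, 2005 (first row of the example data frame).
rates = per_capita_columns(5101, 35, 1529774, 690696)
for name, value in rates.items():
    print(f"{name}: {value:.2e}")
```

For the 2005 row these ratios agree with the printed values after rounding, which supports the inferred definitions without confirming them.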
[Figure: Incorporations per Capita and MBE Incorporations per Capita, 2005-2013 (panels: $NwCorpsperCap, $NwMBECorpsperCap)]
[Figure: MBE Incorporations per Capita, 2005-2013 (panels: $NwCorpsperCap, $NwMBECorpsperCap)]
Findings:
• The per-capita incidence of incorporations increased across all boroughs from 2005 to 2013.
• Manhattan, Queens, and Brooklyn had the highest per-capita incorporations.
• Queens appears to have the steepest increase in corporation filings.
• MBE incorporations per capita are orders of magnitude smaller than overall per-capita incorporations (in Manhattan, roughly 130 to 3,400 times smaller, depending on the year).
• The per-capita incidence of MBE incorporations varied by borough (led by Manhattan) and trended downward after 2009.
Hypothesis: The number of MBE incorporations per non-white person is not equal to the number of non-MBE incorporations per white person.
The approach:
1. Select the values to test:
  • MBE Corps per Minority capita
  • Non-MBE Corps per Non-Minority capita
  • Utilize data from all years and boroughs (5 boroughs x 9 years x 2 categories = 90 obs.)
2. Evaluate which test(s) to conduct.
  • Parametric vs. Non-parametric
  • Means test vs. Other
3. Conduct the test(s) and analyze results.
[Figures: Histogram and QQ-plot, MBE Incorporations per Minority capita]
[Figures: Histogram and QQ-plot, Non-MBE Incorporations per Non-Minority capita]
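A QQ-plot compares sorted observations with the quantiles a normal distribution would produce. The Python sketch below (standard library only; the original plots were made in R) computes those pairs and their correlation. It uses only the nine Manhattan values of NwMBECorpsperMBECap from the example data frame, not the full set of boroughs, so it is illustrative and does not reproduce the plots. The plotting positions (i - 0.5) / n are one common convention and an assumption here; R's qqnorm may use slightly different positions.

```python
from statistics import NormalDist, mean

# Manhattan only, NwMBECorpsperMBECap, 2005-2013 (illustrative subset).
sample = [5.10e-05, 5.70e-05, 5.40e-05, 5.50e-05, 5.80e-05,
          5.00e-05, 3.40e-05, 2.20e-05, 5.50e-06]


def qq_points(values):
    """Return (theoretical, observed) quantiles for a normal QQ-plot."""
    n = len(values)
    observed = sorted(values)
    std_normal = NormalDist()
    # Plotting positions (i - 0.5) / n: an assumed, common convention.
    theoretical = [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return theoretical, observed


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


theoretical, observed = qq_points(sample)
r = pearson_r(theoretical, observed)
print(f"QQ correlation: {r:.3f}")
```

A correlation close to 1 means the points fall near a straight line, as expected for normal data; lower values point to skew or heavy tails. With only nine points this is a rough indication, not a formal test.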
Neither MBE nor Non-MBE per capita data appear to be normally distributed.
Hence, we’ll consider the following two non-parametric tests:
Mood’s Median Test
A nonparametric test of the null hypothesis that the medians of the populations from which two or more samples are drawn are identical. (Wikipedia)
H0: The medians of MBE incorporations per minority capita and non-MBE incorporations per non-minority capita are equal.
H1: The medians of MBE incorporations per minority capita and non-MBE incorporations per non-minority capita are NOT equal.
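The mechanics of Mood's median test can be sketched as below, in Python with the standard library only (the original test was run in R). The data are the nine Manhattan rows from the example data frame, not the full 90 observations across five boroughs, so the output illustrates the method and does not reproduce the deck's result. The function name `moods_median_test` is hypothetical, and no continuity correction is applied.

```python
from math import erfc, sqrt
from statistics import median

# Manhattan only, 2005-2013 (illustrative subset of the 90 observations).
mbe = [5.10e-05, 5.70e-05, 5.40e-05, 5.50e-05, 5.80e-05,
       5.00e-05, 3.40e-05, 2.20e-05, 5.50e-06]
non_mbe = [0.006, 0.0061, 0.006, 0.0059, 0.0079,
           0.0108, 0.011, 0.0112, 0.01]


def moods_median_test(a, b):
    """Two-sample Mood's median test; returns (chi-square, p-value)."""
    grand_median = median(a + b)
    # Count values strictly above the pooled median in each group.
    a_above = sum(x > grand_median for x in a)
    b_above = sum(x > grand_median for x in b)
    a_below = len(a) - a_above
    b_below = len(b) - b_above
    n = len(a) + len(b)
    # Pearson chi-square for a 2x2 table, no continuity correction.
    numerator = n * (a_above * b_below - a_below * b_above) ** 2
    denominator = len(a) * len(b) * (a_above + b_above) * (a_below + b_below)
    chi2 = numerator / denominator
    # Upper tail of a chi-square distribution with 1 degree of freedom.
    p_value = erfc(sqrt(chi2 / 2))
    return chi2, p_value


chi2, p_value = moods_median_test(mbe, non_mbe)
print(f"chi-square = {chi2:.2f}, p = {p_value:.2g}")
```

With samples this small the chi-square approximation is rough, and the function would divide by zero if no value lay above the pooled median; an exact test would be the safer choice in practice.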
Mann-Whitney-Wilcoxon Test
A nonparametric test of the null hypothesis that two populations are the same against an alternative hypothesis, especially that a particular population tends to have larger values than the other. (Wikipedia)
H0: MBE incorporations per minority capita and non-MBE incorporations per non-minority capita are drawn from the same distribution.
H1: MBE incorporations per minority capita and non-MBE incorporations per non-minority capita are NOT drawn from the same distribution.
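The Mann-Whitney-Wilcoxon statistic can be computed directly from pairwise comparisons. The Python sketch below (standard library only; the original test was run in R) again uses only the nine Manhattan rows from the example data frame, so it illustrates the method and does not reproduce the deck's result. The function name `mann_whitney_u` is hypothetical, and the p-value uses a plain normal approximation without tie or continuity corrections, which is a simplification.

```python
from math import erfc, sqrt

# Manhattan only, 2005-2013 (illustrative subset of the 90 observations).
mbe = [5.10e-05, 5.70e-05, 5.40e-05, 5.50e-05, 5.80e-05,
       5.00e-05, 3.40e-05, 2.20e-05, 5.50e-06]
non_mbe = [0.006, 0.0061, 0.006, 0.0059, 0.0079,
           0.0108, 0.011, 0.0112, 0.01]


def mann_whitney_u(a, b):
    """Return (U for sample a, two-sided p-value by normal approximation)."""
    # U counts the (a, b) pairs in which the value from a is larger;
    # a tied pair counts as one half.
    u = sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in a for y in b)
    n1, n2 = len(a), len(b)
    mean_u = n1 * n2 / 2
    # No tie or continuity correction: a simplification for this sketch.
    sd_u = sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (u - mean_u) / sd_u
    p_value = erfc(abs(z) / sqrt(2))
    return u, p_value


u, p_value = mann_whitney_u(mbe, non_mbe)
print(f"U = {u}, p = {p_value:.2g}")
```

Every Manhattan MBE rate is smaller than every non-MBE rate, so U takes its minimum value here. For samples this small an exact p-value would be preferable to the normal approximation.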
In both tests, the null hypothesis is rejected. We therefore find that the incidence of new business incorporations per capita differs between the two populations.


Editor's Notes

  1. Notes:
     1. https://data.ny.gov/Economic-Development/Active-Corporations-Beginning-1800/g5xh-vgry. Accessed 7/1/14.
     2. http://mtprawvwsbswtp1-1.nyc.gov/Search.aspx. Accessed 7/9/14.
     3. Under Article 15-A of the Executive Law, an MBE is a business enterprise in which at least fifty-one percent (51%) is owned, operated and controlled by citizens or permanent resident aliens who meet the ethnic definitions: Black, Hispanic, Asian-Pacific, Asian-Indian Subcontinent, Native American. http://www.esd.ny.gov/MWBE/Qualifications.html. Accessed 7/16/2014.
     4. Population sources, by year:
        2005: Source: U.S. Census Bureau, 2005 American Community Survey, DP01, General Demographic Characteristics: 2005.
        2006: http://www.socialexplorer.com/tables/ACS2006/R10763189. Accessed 7/15/2014.
        2007: http://www.socialexplorer.com/tables/ACS2007/R10763198. Accessed 7/15/2014.
        2008: http://www.socialexplorer.com/tables/ACS2008/R10763200. Accessed 7/15/2014.
        2009: http://www.socialexplorer.com/tables/ACS2009/R10763202. Accessed 7/15/2014.
        2010: http://www.socialexplorer.com/tables/C2010/R10763203. Accessed 7/15/2014.
        2011: http://www.socialexplorer.com/tables/ACS2011/R10763211. Accessed 7/15/2014.
        2012: http://www.socialexplorer.com/tables/ACS2012/R10763214. Accessed 7/15/2014.
        2013: Population Estimates, County Characteristics: Vintage 2013. “Annual Estimates of the Resident Population by Sex, Race, and Hispanic Origin: April 1, 2010 to July 1, 2013.” http://www.census.gov/popest/data/counties/asrh/2013/index.html. Accessed 7/15/2014.
  2. 1. “MBE_pop” denotes minority population. It is the sum of the populations of the following groups: Black or African American Alone, American Indian and Alaska Native Alone, Asian Alone, Native Hawaiian and Other Pacific Islander Alone, Some Other Race Alone, and “Two or More Races”. Note: in 2013, the U.S. Census Bureau stopped bucketing “Some Other Race Alone,” so the 2013 figures are not strictly comparable with those for 2005-2012.