Big Data Science analysis of economic drivers impacting US broadband development using Census data and State Broadband Initiative Broadband Map data from 2011-2014.
No More Half Fast: Improving US Broadband Download Speed. Georgetown University Data Science Capstone
1. No More “Half-Fast”: Improving US Broadband Download Speed
Georgetown University – 2015 Data Science Capstone
Brittne Nelson PhD, Amgad Sirag, Ernest S.
2. Approach and Overview
What?
• Broadband Data Story
• Research Problem
• Data Science Pipeline
So What?
• Data Visualization Story
• Findings
• Lessons Learned
Now What?
• Future Research
• Conclusions
4. Data Story
• There were communities with no broadband access.
• Every day, residents and businesses had limited or no access to resources, services, content, new customers, and new technology, limiting opportunities and community empowerment.
• One day, the US government created the SBI to facilitate the integration of broadband and information technology into state and local economies.
• Because of that, states did more to quickly expand broadband to more areas.
• Because of that, the SBI, decision makers, and researchers, including us, were able to assess how broadband is being implemented across the US.
• Until finally, residents and businesses gained more access to resources, services, content, new customers, and technology that empowered and gave them a competitive edge.
5. Benefits of Broadband
• Increased job opportunities
• Increased employment opportunities due to telework
• Higher pay
• Increased economic security
• Recruitment of job seekers, especially in rural areas
• Increased access to and quality of healthcare
• Availability of a wide variety of entertainment
• Increased participation in everyday economic, social, and community life
• Improved social connections to existing friends and acquaintances
• Creation of new relationships based on common interests
• Improved social integration of minority populations
• More positive attitudes toward aging
• Higher levels of perceived social support and connectivity among seniors
• Lower prices for online purchases
• Improved variety of items available for purchase
• Better purchasing decisions based on online information
• Savings in time and money for online vs. paper-based activities
• Improved connectivity for social or political action
Sources: Center for Social Inclusion. (2010). The Promise and Challenge of Community Broadband Models. New York City: Center for Social Inclusion.
ASR Analytics. (2014). Final Report: Social and Economic Impacts of the Broadband Technology Opportunities Program. Potomac, Maryland.
6. Research Problem
• Do broadband availability and speed make a
state’s economy and its residents competitive?
• When will every state reach 98% broadband
connectivity?
• How are community economic features impacting
or related to broadband development?
7. Hypotheses
• Broadband speed and accessibility will cluster in
urban areas
• Areas with more broadband speed will have lower
unemployment, more businesses, and larger
populations
• Broadband growth is not consistent across all
counties
• Based on past growth, broadband coverage is not
expected to be available in 98% of all counties in
2016
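One way the second hypothesis could be checked is with a simple correlation between county download speed and an economic indicator. The sketch below is only an illustration under stated assumptions: the function name `pearson_r` and the toy values are hypothetical and are not taken from the project's data or code.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of co-deviations and sums of squared deviations
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Hypothetical toy values for five counties (NOT the project's data)
max_download_mbps = [10.0, 25.0, 50.0, 100.0, 1000.0]
unemployment_rate = [9.1, 7.4, 8.0, 6.2, 6.9]

r = pearson_r(max_download_mbps, unemployment_rate)
print(f"r = {r:.2f}")
```

A value of r near -1 would be consistent with the hypothesis that faster areas have lower unemployment; a value near 0 would not. Correlation alone does not show that broadband drives the economic outcome.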
8. Data Sources
• National Broadband Map Maximum and Minimum Download Speed by
County, June 2011-June 2014
– National Telecommunications and Information Administration
http://www.broadbandmap.gov/data-download
• Labor Force Data by County Annual Average, 2011-2013
– U.S. Department of Labor Local Area Unemployment Statistics
http://www.bls.gov/lau/
• Demographic Population by County, 2010
– U.S. Census Bureau
http://factfinder.census.gov/faces/nav/jsf/pages/index.xhtml
• Total Number of Business Establishments, 2011-2012
– U.S. Census Bureau
http://factfinder.census.gov/faces/nav/jsf/pages/index.xhtml
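Because all four sources are reported by county, they can be combined into one record per county before analysis. The following is a minimal sketch that assumes each source has been loaded into a dictionary keyed by a county FIPS code. The FIPS key, the field names, the helper names, and the placeholder values are all assumptions for illustration, not details from the slides.

```python
def normalize_fips(code):
    """Zero-pad a county code to five characters so keys match across sources."""
    return str(code).strip().zfill(5)


def join_by_fips(*tables):
    """Inner-join dictionaries keyed by county FIPS code.

    Only counties present in every table are kept, so each merged row
    has all fields available.
    """
    if not tables:
        return {}
    common = set(tables[0])
    for table in tables[1:]:
        common &= set(table)
    merged = {}
    for fips in sorted(common):
        row = {}
        for table in tables:
            row.update(table[fips])
        merged[fips] = row
    return merged


# Hypothetical placeholder rows (codes and values are made up)
broadband = {normalize_fips(99001): {"max_download_mbps": 50.0},
             normalize_fips(99003): {"max_download_mbps": 10.0}}
labor = {normalize_fips("99001"): {"unemployment_rate": 7.2},
         normalize_fips("99005"): {"unemployment_rate": 5.9}}
population = {normalize_fips("99001"): {"population_2010": 12000}}

counties = join_by_fips(broadband, labor, population)
print(counties)
```

An inner join is used here so that incomplete counties drop out. A real pipeline might instead keep them and record which fields are missing.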
13. Hypotheses Results
• Broadband speed and accessibility will cluster in urban areas
– “URBAN” TOO DIFFICULT TO DEFINE GIVEN PROJECT TIMELINE, NOT ANALYZED
• Areas with higher broadband speed have lower unemployment, more businesses, and larger populations
– NOT TRUE
• Broadband growth is not consistent across all counties
– TRUE
• Based on past growth, broadband coverage is not expected to be available in 98% of all counties in 2016
– NOT ENOUGH DATA TO COMFORTABLY FORECAST
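The forecasting question can be illustrated with a straight-line trend fitted to yearly coverage shares and extended to the 98% target. This is a sketch of the mechanics only, assuming a linear trend. The function names and the four coverage values are hypothetical, and it is not presented as the method the team used.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def year_reaching(target, slope, intercept):
    """Year at which the fitted line reaches `target`, or None if it never rises."""
    if slope <= 0:
        return None
    return (target - intercept) / slope


# Hypothetical share of counties covered, one value per annual release
years = [2011, 2012, 2013, 2014]
coverage = [0.80, 0.84, 0.87, 0.91]

slope, intercept = fit_line(years, coverage)
estimate = year_reaching(0.98, slope, intercept)
print(f"trend: {slope:.3f} per year, 98% reached around {estimate:.1f}")
```

With only four yearly points, a small change in any one value moves the estimate noticeably, which fits the slide's conclusion that there was not enough data to forecast comfortably.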
14. Summary of Findings
• Identified economic features are mild drivers of
technology implementation, specifically broadband
speed.
• Broadband availability makes a state economy and
its residents competitive.
• Implementing broadband is not the silver bullet for
community development or economic growth; it
should be incorporated with other economic and
social features.
15. Lessons Learned
• Quantity of data is important for forecasting
• Source of data is important. SBI reports data from
providers, which makes it somewhat difficult to
assess
• Plan a significant amount of time for data
wrangling
• Master each step of the data science pipeline
before moving on
• Operationalize more factors to provide a clear
picture of relationships when identifying
hypotheses
17. Future Research
• Develop a matched pairs analysis framework that
compares changes in the availability of broadband at the
state level between counties
• Measure how much of the growth in availability within
these counties occurred due to funding (Grants, Federal
Government, Private Organizations)
• Examine broadband’s long-term quantitative
extrapolations and its social and economic impact
• Index and model additional community factors such as
education, adoption, tax rate, etc., in order to broadly
define economic impact
18. Conclusions
• There is a business case for continued focus on
broadband improvement
• Broadband improves communities overall
• Drives economic development and shared
opportunities
• Improves quality of life across the United
States
19. Thank You to the Georgetown University 2015 Data Science Program
Faculty
Benjamin Bengfort
Allen Leis
Sacha Litman
Laura Lorenz
Salil Mehta
Tony Ojeda
(and lady!)