SlideShare a Scribd company logo
1 of 58
MEETUP #17
Big Data, Open Data – What are
they really?
Our Speakers
Dennis Brink, Executive Director of Canadian Open
Data Institute
Trish Garner, Manager at Open Data at City of
Toronto
Mark MacDonnell, Software Developer at SELA
Canada
Adam Muise, Principal Architect at Horton Works
Jason Lavigne, Founder & CEO, Black & White Logic
Open Data – What is it?
Dennis Brink
Executive Director, Canadian Open Data
Institute
IAMCP Director and RIC Advisor
Open
Government
Open Data
Open
Knowledge
Open Data Definition from
Open Knowledge Foundation (OKF)
• https://okfn.org/opendata/
‘Open knowledge’ is any content,
information or data that people are
free to use, re-use and redistribute
— without any legal, technological
or social restriction.
Creative Commons & License Types
• Attribution-ShareAlike 4.0 International
• “CC BY-SA 4.0”
• Attribution — You must give appropriate
credit, provide a link to the license, and indicate
if changes were made.
• ShareAlike — If you remix, transform, or build
upon the material, you must distribute your
contributions under the same license as the
original.
Creative Commons & License Types
• Attribution-ShareAlike 4.0 International
• “CC BY-SA 4.0”
• You are free to:
• Share — copy and redistribute the material
in any medium or format
• Adapt — remix, transform, and build upon
the material for any purpose, even
commercially.
• The licensor cannot revoke these freedoms
as long as you follow the license terms.
Examples of Open Data
Not For Profit’s
Sunlight Foundation (US)
sunlightfoundation.com
The Open Data Institute (UK) theodi.org
Directory for Open Data
http://data.gc.ca/eng/maps/open-data-canada
Example of Federal Data - Census
http://data.gc.ca/data/en/dataset/e3586bbf-93b8-40e8-8b5a-
14023e1e705e
Population by home language, by province and territory (2011 Census)
Language spoken most often at home[1] Canada
number
Total 33,121,175
English 21,457,075
French 6,827,865
Non-official language 3,673,865
English and French 131,205
English and non-official language 875,135
French and non-official language 109,705
English, French and non-official language 46,325
[E] : use with caution. 1. Refers to the language spoken most often at home by the in
Example of Provincial Data - baby
names
http://www.ontario.ca/government/ontario-top-baby-names-male
Example of City Data - parking
tickets
http://www1.toronto.ca/wps/portal/contentonly?vgnextoid=9e56e0
3bb8d1e310VgnVCM10000071d60f89RCRD
Shawn Petersen aka “Saint John Shawn”
• Propertize.ca a New Brunswick property tax
assessment comparison tool.
• location detection to see all nearby properties
Address PID PAN Assessment (Year, Amount, Levy) Change Last Sale(s) Property Description Tax Class
228 Lancaster
Avenue
33357 1698246 2013 - $633,000.00 - $30,386.54
2012 - $600,900.00 - $29,341.35
2011 - $573,400.00 - $27,998.55
5.34% PARTS DEPOT Fully Taxable
266 Lancaster
Avenue
33225 1698115 2013 - $358,300.00 - $11,645.83
2012 - $340,600.00 - $11,274.88
2011 - $317,900.00 - $10,523.45
5.20% APT HOUSES & LAND Fully Taxable
Lancaster Avenue 55146310 5103560 2013 - $113,500.00 - $3,633.94
2012 - $111,900.00 - $3,649.85
2011 - $103,900.00 - $3,388.92
1.43% PARK AREA Fully Taxable
248 Lancaster
Avenue
55012132 1701756 2013 - $736,900.00 - $35,374.15
2012 - $732,800.00 - $35,781.89
2011 - $722,100.00 - $35,259.43
0.56% BOWLING CENTRE Fully Taxable
Data Marketplace – Microsoft
http://datamarket.azure.com/browse/data?price=paid
Big Data vs. Open Data
How are they Different?
• Look at the 3 V’s in a Big Data
World
Big Data vs. Open Data
• Volume
– ever increasing
– storage issue
• Variety
– unstructured
– external
– social/mobile/IoT’s
• Velocity
– daily or live streaming
Big Data vs. Open Data
How are they Different?
• All V’s are different
• Census:
– static, structured, and 60 GB in
size
NYU’s GovLab Survey
• OpenData500.com
• Survey of companies using Open Data for commercial
purposes.
• Attempting to expand to other countries.
–We want Canada represented!
NYU’s GovLab Open Data 500
Questions?
Please consider helping with the
Canadian Open Data survey project!
App Example from Sela Canada
Silicon Peel Meetup
About SELA Group
• Over 20 years in IT industry
• A global training, consulting and outsourcing
company, with offices in Canada, US, India,
Singapore & Israel
• Known for its high quality and lead in the
demanding Hi-Tech market
© Copyright SELA
software &
Education Labs
23
SELA Group - Selected
Customers
© Copyright SELA
software &
Education Labs
24
SELA and Microsoft
SELA & Microsoft
© Copyright SELA
software &
Education Labs
26
SELA and YOU
SELA & YOU
© Copyright SELA
software &
Education Labs
28
Training
• .NET
• Java
• C/C++
• ISTQB
• Onsite
• Online
Consulting
• Mentorship
• Short term
• Long term
• Onsite
• Remote
Outsourcing
• Near shore
• Offshore
Contact SELA Canada
Eran Barlev
eran@selacanada.ca
www.selacanada.ca
© Copyright SELA
software &
Education Labs
29
Thank You
Making Money with Open
Data
Mark MacDonnell, Software Developer, SELA
Diamond Program
What is (good) open data?
• Has a license declaring it open
• Can be accessed freely and easily
• Provided in a standard, structured way
• Reliable
Why publish your data?
• Provide a data subset or sample
• Allows you to focus on core
services
• Bring attention to your services
• Better information sharing
• Foster more open data
Why consume open data?
• Avoid maintenance costs
• Amalgamate information
• Support your own data
• Foster more open data
Communicating: The Triangle
Data Owners
Developers
Public
The “actual” Triangle
Data
Owners
DevelopersPublic
Getting the word out
• To developers
– Blogs
– Social media
– Emails
• To the public
– Press
releases
– News
conferences
To data owners
◦ Emails
◦ Social media
◦ Comments
To the public
◦ Apps
◦ Blog
To data owners
◦ Contact
◦ FAQs
To developers
◦ Reviews
◦ Downloads
◦ Un-installs
Data Owners Developers The Public
What to do?
• Have a clear audience
– Don’t forget the other audience
• Be proactive
• Say where you’re going
– Secrets don’t help
• Always be communicating
– Make it easy (RSS, Twitter, etc.)
Provision
Create data sets from
webpages
Automatically crawls site
for data
Currently growing
Useful for internal activities
Tools for building APIs from
data
Value added to the open data
Live in the wild
Good for exposing your data
Consumption
• Gather the data and do research
• Combine open data with closed data
• Process the data
• Value added
Thank you
Mark MacDonnell
✉ markm@selacanada.ca
🌐 markmacdonnell.wordpress.com
@MacDonnellMark
Break
Demo Me
Harpreet Singh, Lab B
logo
1. 76 percent reported an
increase in productivity
2. 93 percent said their social
circle had increased a lot
3. 86 percent said their
business network had
grown
BENEFITS
Source: Global Coworking Survey
2013
What makes coworking different from
business office centers?
Community > Space
Source: Global Coworking Survey
2013
Introductory Membership for
the First 10:
$150
Drop in: $10
COHIP
(Coworking Ontario Health Insurance Plan)
Word from our Sponsors
Thank You

More Related Content

Similar to Sp meetup 17 slidedeck

What Open Source Is and How Your Nonprofit Can Benefit
What Open Source Is and How Your Nonprofit Can BenefitWhat Open Source Is and How Your Nonprofit Can Benefit
What Open Source Is and How Your Nonprofit Can BenefitTechSoup
 
Webinar- Computer and Tech Hardware for Nonprofits and Libraries 2017-04-13
Webinar- Computer and Tech Hardware for Nonprofits and Libraries 2017-04-13Webinar- Computer and Tech Hardware for Nonprofits and Libraries 2017-04-13
Webinar- Computer and Tech Hardware for Nonprofits and Libraries 2017-04-13TechSoup
 
Webinar - Maximize Your Technology Donations Through TechSoup - 2017-03-28
Webinar - Maximize Your Technology Donations Through TechSoup - 2017-03-28Webinar - Maximize Your Technology Donations Through TechSoup - 2017-03-28
Webinar - Maximize Your Technology Donations Through TechSoup - 2017-03-28TechSoup
 
Webinar: Is the Cloud Right for You 2016-10-18
Webinar: Is the Cloud Right for You 2016-10-18Webinar: Is the Cloud Right for You 2016-10-18
Webinar: Is the Cloud Right for You 2016-10-18TechSoup
 
Webinar:How to Get Tech Donations a Special Tech soup Tour-2017-08-17
Webinar:How to Get Tech Donations a Special Tech soup Tour-2017-08-17Webinar:How to Get Tech Donations a Special Tech soup Tour-2017-08-17
Webinar:How to Get Tech Donations a Special Tech soup Tour-2017-08-17TechSoup
 
Webinar-Libraries Supporting Social Good: Tools and Tips for Outreach to Nonp...
Webinar-Libraries Supporting Social Good: Tools and Tips for Outreach to Nonp...Webinar-Libraries Supporting Social Good: Tools and Tips for Outreach to Nonp...
Webinar-Libraries Supporting Social Good: Tools and Tips for Outreach to Nonp...TechSoup
 
Webinar: Bring Your Data to Life with Power BI-2016-01-28
Webinar: Bring Your Data to Life with Power BI-2016-01-28Webinar: Bring Your Data to Life with Power BI-2016-01-28
Webinar: Bring Your Data to Life with Power BI-2016-01-28TechSoup
 
Webinar: Getting Technology Donations through TechSoup - 2016-01-07
Webinar: Getting Technology Donations through TechSoup - 2016-01-07Webinar: Getting Technology Donations through TechSoup - 2016-01-07
Webinar: Getting Technology Donations through TechSoup - 2016-01-07TechSoup
 
Graph tour keynote 2019
Graph tour keynote 2019Graph tour keynote 2019
Graph tour keynote 2019Neo4j
 
Understanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyUnderstanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyInside Analysis
 
9. Beechwood Reverse Pitch FINALISTS 6.14.17
9. Beechwood  Reverse Pitch FINALISTS 6.14.17 9. Beechwood  Reverse Pitch FINALISTS 6.14.17
9. Beechwood Reverse Pitch FINALISTS 6.14.17 Joel Bennett
 
Open Data Briefing for Alameda County Data Sharing Committee
Open Data Briefing for Alameda County Data Sharing CommitteeOpen Data Briefing for Alameda County Data Sharing Committee
Open Data Briefing for Alameda County Data Sharing CommitteeUrban Strategies Council
 
The Very Best of the Digital Workplace & Intranet Global Forum 2018
The Very Best of the Digital Workplace & Intranet Global Forum 2018The Very Best of the Digital Workplace & Intranet Global Forum 2018
The Very Best of the Digital Workplace & Intranet Global Forum 2018Toby Ward
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigManish Chopra
 
The Very Best Intranets and Digital Workplaces of 2017
The Very Best Intranets and Digital Workplaces of 2017The Very Best Intranets and Digital Workplaces of 2017
The Very Best Intranets and Digital Workplaces of 2017Prescient Digital Media
 
Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected - He...
Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected - He...Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected - He...
Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected - He...CambridgeshireInsight
 
London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20LG Inform Plus
 

Similar to Sp meetup 17 slidedeck (20)

What Open Source Is and How Your Nonprofit Can Benefit
What Open Source Is and How Your Nonprofit Can BenefitWhat Open Source Is and How Your Nonprofit Can Benefit
What Open Source Is and How Your Nonprofit Can Benefit
 
Webinar- Computer and Tech Hardware for Nonprofits and Libraries 2017-04-13
Webinar- Computer and Tech Hardware for Nonprofits and Libraries 2017-04-13Webinar- Computer and Tech Hardware for Nonprofits and Libraries 2017-04-13
Webinar- Computer and Tech Hardware for Nonprofits and Libraries 2017-04-13
 
Open Sesame: Open Data, Data Liberation and Opportunities for Librarians
Open Sesame: Open Data, Data Liberation and Opportunities for LibrariansOpen Sesame: Open Data, Data Liberation and Opportunities for Librarians
Open Sesame: Open Data, Data Liberation and Opportunities for Librarians
 
Webinar - Maximize Your Technology Donations Through TechSoup - 2017-03-28
Webinar - Maximize Your Technology Donations Through TechSoup - 2017-03-28Webinar - Maximize Your Technology Donations Through TechSoup - 2017-03-28
Webinar - Maximize Your Technology Donations Through TechSoup - 2017-03-28
 
Webinar: Is the Cloud Right for You 2016-10-18
Webinar: Is the Cloud Right for You 2016-10-18Webinar: Is the Cloud Right for You 2016-10-18
Webinar: Is the Cloud Right for You 2016-10-18
 
Webinar:How to Get Tech Donations a Special Tech soup Tour-2017-08-17
Webinar:How to Get Tech Donations a Special Tech soup Tour-2017-08-17Webinar:How to Get Tech Donations a Special Tech soup Tour-2017-08-17
Webinar:How to Get Tech Donations a Special Tech soup Tour-2017-08-17
 
Webinar-Libraries Supporting Social Good: Tools and Tips for Outreach to Nonp...
Webinar-Libraries Supporting Social Good: Tools and Tips for Outreach to Nonp...Webinar-Libraries Supporting Social Good: Tools and Tips for Outreach to Nonp...
Webinar-Libraries Supporting Social Good: Tools and Tips for Outreach to Nonp...
 
Webinar: Bring Your Data to Life with Power BI-2016-01-28
Webinar: Bring Your Data to Life with Power BI-2016-01-28Webinar: Bring Your Data to Life with Power BI-2016-01-28
Webinar: Bring Your Data to Life with Power BI-2016-01-28
 
Webinar: Getting Technology Donations through TechSoup - 2016-01-07
Webinar: Getting Technology Donations through TechSoup - 2016-01-07Webinar: Getting Technology Donations through TechSoup - 2016-01-07
Webinar: Getting Technology Donations through TechSoup - 2016-01-07
 
Graph tour keynote 2019
Graph tour keynote 2019Graph tour keynote 2019
Graph tour keynote 2019
 
Understanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyUnderstanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data Quickly
 
9. Beechwood Reverse Pitch FINALISTS 6.14.17
9. Beechwood  Reverse Pitch FINALISTS 6.14.17 9. Beechwood  Reverse Pitch FINALISTS 6.14.17
9. Beechwood Reverse Pitch FINALISTS 6.14.17
 
Open Data Briefing for Alameda County Data Sharing Committee
Open Data Briefing for Alameda County Data Sharing CommitteeOpen Data Briefing for Alameda County Data Sharing Committee
Open Data Briefing for Alameda County Data Sharing Committee
 
The Very Best of the Digital Workplace & Intranet Global Forum 2018
The Very Best of the Digital Workplace & Intranet Global Forum 2018The Very Best of the Digital Workplace & Intranet Global Forum 2018
The Very Best of the Digital Workplace & Intranet Global Forum 2018
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
 
The Very Best Intranets and Digital Workplaces of 2017
The Very Best Intranets and Digital Workplaces of 2017The Very Best Intranets and Digital Workplaces of 2017
The Very Best Intranets and Digital Workplaces of 2017
 
Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected - He...
Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected - He...Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected - He...
Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected - He...
 
London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20
 
New data product recipes
New data product recipesNew data product recipes
New data product recipes
 
ION Hangzhou - Opening Remarks
ION Hangzhou - Opening RemarksION Hangzhou - Opening Remarks
ION Hangzhou - Opening Remarks
 

Recently uploaded

Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Sp meetup 17 slidedeck

  • 1. MEETUP #17 Big Data, Open Data – What are they really?
  • 2. Our Speakers Dennis Brink, Executive Director of Canadian Open Data Institute Trish Garner, Manager at Open Data at City of Toronto Mark MacDonnell, Software Developer at SELA Canada Adam Muise, Principal Architect at Horton Works Jason Lavigne, Founder & CEO, Black & White Logic
  • 3. Open Data – What is it? Dennis Brink Executive Director, Canadian Open Data Institute IAMCP Director and RIC Advisor
  • 5. Open Data Definition from Open Knowledge Foundation (OKF) • https://okfn.org/opendata/ ‘Open knowledge’ is any content, information or data that people are free to use, re-use and redistribute — without any legal, technological or social restriction.
  • 6. Creative Commons & License Types • Attribution-ShareAlike 4.0 International • “CC BY-SA 4.0” • Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. • ShareAlike — If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.
  • 7. Creative Commons & License Types • Attribution-ShareAlike 4.0 International • “CC BY-SA 4.0” • You are free to: • Share — copy and redistribute the material in any medium or format • Adapt — remix, transform, and build upon the material for any purpose, even commercially. • The licensor cannot revoke these freedoms as long as you follow the license terms.
  • 8. Examples of Open Data Not For Profit’s Sunlight Foundation (US) sunlightfoundation.com The Open Data Institute (UK) theodi.org
  • 9. Directory for Open Data http://data.gc.ca/eng/maps/open-data-canada
  • 10. Example of Federal Data - Census http://data.gc.ca/data/en/dataset/e3586bbf-93b8-40e8-8b5a- 14023e1e705e Population by home language, by province and territory (2011 Census) Language spoken most often at home[1] Canada number Total 33,121,175 English 21,457,075 French 6,827,865 Non-official language 3,673,865 English and French 131,205 English and non-official language 875,135 French and non-official language 109,705 English, French and non-official language 46,325 [E] : use with caution. 1. Refers to the language spoken most often at home by the in
  • 11. Example of Provincial Data - baby names http://www.ontario.ca/government/ontario-top-baby-names-male
  • 12. Example of City Data - parking tickets http://www1.toronto.ca/wps/portal/contentonly?vgnextoid=9e56e0 3bb8d1e310VgnVCM10000071d60f89RCRD
  • 13. Shawn Petersen aka “Saint John Shawn” • Propertize.ca a New Brunswick property tax assessment comparison tool. • location detection to see all nearby properties Address PID PAN Assessment (Year, Amount, Levy) Change Last Sale(s) Property Description Tax Class 228 Lancaster Avenue 33357 1698246 2013 - $633,000.00 - $30,386.54 2012 - $600,900.00 - $29,341.35 2011 - $573,400.00 - $27,998.55 5.34% PARTS DEPOT Fully Taxable 266 Lancaster Avenue 33225 1698115 2013 - $358,300.00 - $11,645.83 2012 - $340,600.00 - $11,274.88 2011 - $317,900.00 - $10,523.45 5.20% APT HOUSES & LAND Fully Taxable Lancaster Avenue 55146310 5103560 2013 - $113,500.00 - $3,633.94 2012 - $111,900.00 - $3,649.85 2011 - $103,900.00 - $3,388.92 1.43% PARK AREA Fully Taxable 248 Lancaster Avenue 55012132 1701756 2013 - $736,900.00 - $35,374.15 2012 - $732,800.00 - $35,781.89 2011 - $722,100.00 - $35,259.43 0.56% BOWLING CENTRE Fully Taxable
  • 14. Data Marketplace – Microsoft http://datamarket.azure.com/browse/data?price=paid
  • 15. Big Data vs. Open Data How are they Different? • Look at the 3 V’s in a Big Data World
  • 16. Big Data vs. Open Data • Volume – ever increasing – storage issue • Variety – unstructured – external – social/mobile/IoT’s • Velocity – daily or live streaming
  • 17. Big Data vs. Open Data How are they Different? • All V’s are different • Census: – static, structured, and 60 GB in size
  • 18. NYU’s GovLab Survey • OpenData500.com • Survey of companies using Open Data for commercial purposes. • Attempting to expand to other countries. –We want Canada represented!
  • 20. Questions? Please consider helping with the Canadian Open Data survey project!
  • 21. App Example from Sela Canada
  • 23. About SELA Group • Over 20 years in IT industry • A global training, consulting and outsourcing company, with offices in Canada, US, India, Singapore & Israel • Known for its high quality and lead in the demanding Hi-Tech market © Copyright SELA software & Education Labs 23
  • 24. SELA Group - Selected Customers © Copyright SELA software & Education Labs 24
  • 26. SELA & Microsoft © Copyright SELA software & Education Labs 26
  • 28. SELA & YOU © Copyright SELA software & Education Labs 28 Training • .NET • Java • C/C++ • ISTQB • Onsite • Online Consulting • Mentorship • Short term • Long term • Onsite • Remote Outsourcing • Near shore • Offshore
  • 29. Contact SELA Canada Eran Barlev eran@selacanada.ca www.selacanada.ca © Copyright SELA software & Education Labs 29
  • 31. Making Money with Open Data Mark MacDonnell, Software Developer, SELA Diamond Program
  • 32. What is (good) open data? • Has a license declaring it open • Can be accessed freely and easily • Provided in a standard, structured way • Reliable
  • 33. Why publish your data? • Provide a data subset or sample • Allows you to focus on core services • Bring attention to your services • Better information sharing • Foster more open data
  • 34. Why consume open data? • Avoid maintenance costs • Amalgamate information • Support your own data • Foster more open data
  • 35. Communicating: The Triangle Data Owners Developers Public
  • 37. Getting the word out • To developers – Blogs – Social media – Emails • To the public – Press releases – News conferences To data owners ◦ Emails ◦ Social media ◦ Comments To the public ◦ Apps ◦ Blog To data owners ◦ Contact ◦ FAQs To developers ◦ Reviews ◦ Downloads ◦ Un-installs Data Owners Developers The Public
  • 38. What to do? • Have a clear audience – Don’t forget the other audience • Be proactive • Say where you’re going – Secrets don’t help • Always be communicating – Make it easy (RSS, Twitter, etc.)
  • 39. Provision Create data sets from webpages Automatically crawls site for data Currently growing Useful for internal activities Tools for building APIs from data Value added to the open data Live in the wild Good for exposing your data
  • 40. Consumption • Gather the data and do research • Combine open data with closed data • Process the data • Value added
  • 41. Thank you Mark MacDonnell ✉ markm@selacanada.ca 🌐 markmacdonnell.wordpress.com @MacDonnellMark
  • 42. Break
  • 44. logo
  • 45.
  • 46.
  • 47.
  • 48. 1. 76 percent reported an increase in productivity 2. 93 percent said their social circle had increased a lot 3. 86 percent said their business network had grown BENEFITS Source: Global Coworking Survey 2013
  • 49. What makes coworking different from business office centers?
  • 51. Source: Global Coworking Survey 2013 Introductory Membership for the First 10: $150
  • 54.
  • 55.
  • 56.
  • 57. Word from our Sponsors

Editor's Notes

  1. Intro Level Presentation
  2. Three interlocking rings as they are interconnected.
  3. The Federal government has updated their license to follow these standards.
  4. Programs at Sunlight – Backed by philanthropic $ Program Spending, iPhone App for legislation, Influence Explorer for fundraising Programs at ODI – Intersection of public, private, and citizens. Evaluating and improving govt. O.D. programs, Training journalists, helping incubate companies, promoting OD usage and applications.
  5. Available in CSV and usually with supporting documents in HTML Federal site is now including the dataset metadata available in JSON format Home language is 65% English, 21% French, 11% Other, 3% Multiple This is nice and neat data – it all adds up and is not ambiguous
  6. Available in CSV and usually with supporting documents in HTML Federal site is now including the dataset metadata available in JSON format Home language is 65% English, 21% French, 11% Other, 3% Multiple This is nice and neat data – it all adds up and is not ambiguous Feds have over 200,000 data sets but more than 90% of that is Geo-type data.
  7. My Son is named Conrad Only 6 people born in Ontario in 1999 with that name. Suppressed for less than 5 names, privacy issue. Only 200 data sets
  8. Data Science Problem: 2.4 Million tickets in 2012. Cannot analyze in Excel  Most popular streets was Yonge Street – but also the longest. Now we need geo data to determine the density of tickets. ETL problem also as data is dirty – Yong, Yongge, Yonge, St., Street, Stret Data sets are over a thousand.
  9. Propertize.ca  to compare their property tax assessments. Address or nearby search, last three years, increases, etc. This program was quite successful and shows how open data had limited use until a developer exposed it in a better fashion. Easy to use tool – notice how everyone’s taxes went up! Ongoing challenge with data restrictions.
  10. Paid and unpaid data sets. The paid are the most useful – Zip+4 demographics.
  11. The person who coined this is a Gartner employee who lives in Chicago. He tweeted at me a year ago.
  12. Primer on how a data centric environment changes things: Volume – technology issue Variety – Organizational issue Velocity – Organizational issue
  13. Static every 4 years. Structured – table driven and from a multiple choice questionnaire. Analysis Size is within current database sizes, calculations can be done in memory.
  14. A Project that we are undertaking The GovLab is part of NYU. Takes after the Fortune 500 but not by revenue.
  15. The GovLab is part of NYU. Sectors for companies vs. Types of Data sets by Dept.