Data Analytics & Visualization Cool Tools: OpenRefine
Martin Magdinier @magdmartin
01/05/2015
About Me
● Contributor since 2011
● Committer since 2012
● http://openrefine.org
● @OpenRefine
● Toronto OpenRefine Meetup Organizer
● Next Sessions: May 21 and June 18
● Founder 2014
● OpenRefine Hosting
● http://refinepro.com
● @RefinePro
80% of the time spent on data analysis goes into cleaning, transforming and integrating the data
Cleaning
● Duplicate values
● Typos
● Multi-value cells
● Data in the wrong field
● Missing / partial values
● Encoding errors
● Wrong format (text, number, date ...)
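Several of the cleaning problems listed above (typos, duplicate values, multi-value cells) can be illustrated with a small sketch. This is not OpenRefine's own code: it is a simplified key-collision approach written for illustration, and the function names `fingerprint`, `cluster` and `split_multi_value` as well as the `;` separator are assumptions.

```python
import string
import unicodedata
from collections import defaultdict


def fingerprint(value: str) -> str:
    """Build a normalized key so near-identical spellings collide."""
    # Decompose accented characters and drop the combining marks.
    decomposed = unicodedata.normalize("NFKD", value)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Lowercase and remove ASCII punctuation.
    cleaned = no_accents.lower().translate(str.maketrans("", "", string.punctuation))
    # Sort and de-duplicate the tokens so word order does not matter.
    return " ".join(sorted(set(cleaned.split())))


def cluster(values):
    """Group raw values that share a fingerprint but are spelled differently."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    # Keep only keys that have more than one distinct spelling.
    return {key: vals for key, vals in groups.items() if len(set(vals)) > 1}


def split_multi_value(cell: str, separator: str = ";"):
    """Split a multi-value cell into trimmed, non-empty parts."""
    return [part.strip() for part in cell.split(separator) if part.strip()]
```

Values that land in the same cluster are candidates for merging; a person should still review each group before replacing anything.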
Integration & Transformation
● Flat to relational data set
● Schema alignment
● Transpose
● Join data sets
● Enrichment from other sources
● ...
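Two of the operations above, joining data sets and transposing, can be sketched in a few lines of plain Python. The helper names `join_rows` and `transpose` and the sample columns in the usage note are hypothetical and only illustrate the idea; they are not part of OpenRefine.

```python
def join_rows(left, right, key):
    """Left-join two lists of dicts on a shared key column."""
    # Index the right-hand rows by key for constant-time lookup.
    lookup = {row[key]: row for row in right}
    joined = []
    for row in left:
        # Rows without a match keep only their own columns.
        match = lookup.get(row[key], {})
        # Values from the left row win if both sides share a column name.
        joined.append({**match, **row})
    return joined


def transpose(rows):
    """Turn a list of equal-length rows into a list of columns."""
    return [list(column) for column in zip(*rows)]
```

For example, `join_rows(permits, wards, "ward")` would enrich each permit row with the columns of the ward row that has the same `ward` value, assuming both lists use that column name.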
Bridging The Skill Gap
Spreadsheet
Basic Knowledge of Scripting (Python, R, command line ...)
ETL Engineer
Data Science
Data Visualization / Interpretation
Understand The Data (Business Skills)
Know How To Transform Data (Technical Skills)
A Lean Data Model
Discovery Wrangling
In-application feedback (personal usage)
Profiling Preparation
Ad hoc usage, reporting, migration
Quality Transformation
Industrialization
Integration
Measure Check
Build - Do
Learn Think
Plan - Act
History
● 2009: Freebase Gridworks release
● 2010: Gridworks becomes Google Refine
● 2010: Google Refine 2.0 release
● 2011: Google Refine 2.5 release
● 2012: Google Refine becomes OpenRefine
● 2013: OpenRefine 2.6-beta release
OpenRefine Ecosystem
Getting Started
Setting Up Refine
● Download OpenRefine 2.6-beta: http://openrefine.org/download.html
● Unzip the file
● Start Refine
– Win: Double-click refine.exe
– Linux: ./refine
– Mac: Use RefinePro
● Register on RefinePro Beta: http://app.refinepro.com
● Use Chrome or Firefox
● Create an instance and click Access Instance (can take up to 5 min the first time)
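After starting a local copy, a quick way to confirm that Refine is answering before opening the browser is a small reachability check. The sketch below assumes the local server listens on 127.0.0.1 port 3333, which the slides do not state; the helper names `refine_url` and `is_running` are made up for illustration.

```python
import urllib.request


def refine_url(host: str = "127.0.0.1", port: int = 3333) -> str:
    """Build the address of a local Refine instance (host and port are assumptions)."""
    return f"http://{host}:{port}/"


def is_running(url: str, timeout: float = 2.0) -> bool:
    """Return True if the server at `url` answers with HTTP 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # Connection errors, timeouts and HTTP error statuses all count as "not running".
        return False
```

Calling `is_running(refine_url())` a few seconds after launching Refine should return True once the server is ready; if it keeps returning False, the server may be using a different port.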
Workshop / Demo: 2014 Toronto Cleared Building Permits
● Presentation page: http://ow.ly/Js8GD
● Download Data: http://ow.ly/Js8Ho

Data analytics martinmagdinier-go open data 2015
