SlideShare a Scribd company logo
1 of 14
Data JournalismStudio MDST 3559:  DataestheticsProf. Alvarado1/27/2011
Business Late comers Readings still required for mid-term
Review:Features of Data Journalism Depends on emergence of the datasphere Transparency (Politics 2.0) All data leaks ... and freely available tools for publishing and visualizing data (Web 2.0) Google Docs, Zoho, Factual ManyEyes Data converted into a common format CSV = “comma separated vales” = tabular data in a text file
Features of Data Journalism (ii) Stories directly reference the data they use e.g. via embedded links to Google Docs Definition of story changes ... Visualizations can be stories in themselves The act of data curation itself considered a journalistic act Journalism, as the Fifth Estate, still mediates between power and people, but in new ways A new relationship of power is opened up
TBL says the future of journalism scholarship "lies with journalists scholars who know their CSV from their RDF, can throw together some quick MySQL queries for a PHP or Python output … and discover the story lurking in datasets released by governments, local authorities, agencies, [libraries, museums] or any combination of them – even across national borders."   http://www.guardian.co.uk/media/2010/nov/22/data-analysis-tim-berners-lee
Examples	 Data source Data structure and content Visualization Story/thesis
Overview Download a CSV file from Google Format as tab separated file with Excel Open up with a text editor Cut and paste into ManyEyes Explore ManyEyes visualization Upload to Google Explore Google Docs
Preliminaries Download jEdit A powerful, open source, cross platform text editor for programmers http://http://www.jedit.org/index.php?page=download Get an account on Google If you do not have one, or if you want a new one for this class Get an account on ManyEyes http://www-958.ibm.com/software/data/cognos/manyeyes/
Grab Some Data Go to links on Dataesthetics site Click on each link Should send you to Google Docs For each file, do:  File > Download As > Excel Note where you are saving your files
Convert the Data Open each file up in Excel Do:  Save as > tab delimited text Close file (resave if necessary) Open file in jEdit Make sure that ... Tabs are not converted to spaces File is saved as a Windows or Unix file These options found in Utilities > Buffer Options
View in ManyEyes Log in to ManyEyes For each spreadsheet, do: Participate > Upload a Dataset Cut and paste the content of the jEdit window into the text box Do: Ctrl-A, Ctrl-C, Ctrl-V  Add metadata and press Create ...
ManyEyes What kind of visualization to we choose? See Learn More > Visualization Types (Open in new window or tab) Start with first two visualizations
Visualization Types See relationships among data points Network Diagram Scatterplot Matrix Chart Compare a set of values Bar Chart Block Histogram Bubble Chart Track rises and falls over time Line Graph Stack Graph Stack Graph for Categories See the parts of a whole Pie Chart Treemap Treemap for Comparisons Analyze a text Word Tree Tag Cloud Word Cloud Generator Phrase Net See the world Massachusetts Map World Map US County Map New Jersey Map  http://www-958.ibm.com/software/data/cognos/manyeyes/page/Visualization_Options.html
Combos Social networks in the world Two rows of names Matrix Chart, Treemap, Map (custom) Owners of US Treasury Bonds  One row of numbers, one row of names Bubble Chart, Bar Chart Combined Two rows of names + row of numbers Bubble Chart

More Related Content

What's hot

GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMSGRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
Marcus Paradies
 
Self-Service Linked Government Data with dcat and Gridworks
Self-Service Linked Government Data with dcat and GridworksSelf-Service Linked Government Data with dcat and Gridworks
Self-Service Linked Government Data with dcat and Gridworks
Richard Cyganiak
 

What's hot (18)

Why should you trust my data code4lib 2016
Why should you trust my data code4lib 2016Why should you trust my data code4lib 2016
Why should you trust my data code4lib 2016
 
Semantics for Big Data Integration and Analysis
Semantics for Big Data Integration and AnalysisSemantics for Big Data Integration and Analysis
Semantics for Big Data Integration and Analysis
 
Making data typing efforts or automatically detecting data types for automat...
Making data typing efforts or automatically detecting data types  for automat...Making data typing efforts or automatically detecting data types  for automat...
Making data typing efforts or automatically detecting data types for automat...
 
20110830 Introducing the Social Media Research Foundation
20110830 Introducing the Social Media Research Foundation20110830 Introducing the Social Media Research Foundation
20110830 Introducing the Social Media Research Foundation
 
Linked Data - Overview and Potentials
Linked Data - Overview and PotentialsLinked Data - Overview and Potentials
Linked Data - Overview and Potentials
 
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMSGRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
 
Reframing Public Housing: Visualization and Data Analytics in History
Reframing Public Housing: Visualization and Data Analytics in History Reframing Public Housing: Visualization and Data Analytics in History
Reframing Public Housing: Visualization and Data Analytics in History
 
Data integration
Data integrationData integration
Data integration
 
A distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics Amsterdam
 
Session 1.2 improving access to digital content by semantic enrichment
Session 1.2   improving access to digital content by semantic enrichmentSession 1.2   improving access to digital content by semantic enrichment
Session 1.2 improving access to digital content by semantic enrichment
 
NCompass Live: Life After MARC: Cataloging Tools of the Future
NCompass Live: Life After MARC: Cataloging Tools of the FutureNCompass Live: Life After MARC: Cataloging Tools of the Future
NCompass Live: Life After MARC: Cataloging Tools of the Future
 
Webmining Overview
Webmining OverviewWebmining Overview
Webmining Overview
 
Self-Service Linked Government Data with dcat and Gridworks
Self-Service Linked Government Data with dcat and GridworksSelf-Service Linked Government Data with dcat and Gridworks
Self-Service Linked Government Data with dcat and Gridworks
 
non-slides-Thatcamp
non-slides-Thatcampnon-slides-Thatcamp
non-slides-Thatcamp
 
Dealing with Open Data in Istat
Dealing with Open Data in IstatDealing with Open Data in Istat
Dealing with Open Data in Istat
 
Mest
MestMest
Mest
 
From Records to Data
From Records to DataFrom Records to Data
From Records to Data
 
Session 1 and 2 "Challenges and Opportunities with Big Linked Data Visualiza...
Session 1 and 2  "Challenges and Opportunities with Big Linked Data Visualiza...Session 1 and 2  "Challenges and Opportunities with Big Linked Data Visualiza...
Session 1 and 2 "Challenges and Opportunities with Big Linked Data Visualiza...
 

Viewers also liked

Mdst3703 graph-theory-11-20-2012
Mdst3703 graph-theory-11-20-2012Mdst3703 graph-theory-11-20-2012
Mdst3703 graph-theory-11-20-2012
Rafael Alvarado
 
Mdst3705 2013-02-12-finding-data
Mdst3705 2013-02-12-finding-dataMdst3705 2013-02-12-finding-data
Mdst3705 2013-02-12-finding-data
Rafael Alvarado
 
MDST 3705 2012-03-05 Databases to Visualization
MDST 3705 2012-03-05 Databases to VisualizationMDST 3705 2012-03-05 Databases to Visualization
MDST 3705 2012-03-05 Databases to Visualization
Rafael Alvarado
 
UVA MDST 3703 Studio 01 2012-08-30
UVA MDST 3703 Studio 01 2012-08-30UVA MDST 3703 Studio 01 2012-08-30
UVA MDST 3703 Studio 01 2012-08-30
Rafael Alvarado
 
UVA MDST 3073 CSS 2012-09-20
UVA MDST 3073 CSS 2012-09-20UVA MDST 3073 CSS 2012-09-20
UVA MDST 3073 CSS 2012-09-20
Rafael Alvarado
 
Mdst3703 shiva-2012-10-18
Mdst3703 shiva-2012-10-18Mdst3703 shiva-2012-10-18
Mdst3703 shiva-2012-10-18
Rafael Alvarado
 
Mdst3703 maps-and-timelines-2012-11-13
Mdst3703 maps-and-timelines-2012-11-13Mdst3703 maps-and-timelines-2012-11-13
Mdst3703 maps-and-timelines-2012-11-13
Rafael Alvarado
 
MDST 3703 F10 Seminar 14
MDST 3703 F10 Seminar 14MDST 3703 F10 Seminar 14
MDST 3703 F10 Seminar 14
Rafael Alvarado
 
Mdst3703 2013-10-08-thematic-research-collections
Mdst3703 2013-10-08-thematic-research-collectionsMdst3703 2013-10-08-thematic-research-collections
Mdst3703 2013-10-08-thematic-research-collections
Rafael Alvarado
 
Mdst3705 2013-02-26-db-as-genre
Mdst3705 2013-02-26-db-as-genreMdst3705 2013-02-26-db-as-genre
Mdst3705 2013-02-26-db-as-genre
Rafael Alvarado
 
Mdst3703 2013-09-12-semantic-html
Mdst3703 2013-09-12-semantic-htmlMdst3703 2013-09-12-semantic-html
Mdst3703 2013-09-12-semantic-html
Rafael Alvarado
 
UVA MDST 3703 The Stack of Scholarship 2012-09-24
UVA MDST 3703 The Stack of Scholarship 2012-09-24UVA MDST 3703 The Stack of Scholarship 2012-09-24
UVA MDST 3703 The Stack of Scholarship 2012-09-24
Rafael Alvarado
 
UVA MDST 3703 2013 08-27 Introduction
UVA MDST 3703 2013 08-27 IntroductionUVA MDST 3703 2013 08-27 Introduction
UVA MDST 3703 2013 08-27 Introduction
Rafael Alvarado
 

Viewers also liked (18)

Mdst3703 graph-theory-11-20-2012
Mdst3703 graph-theory-11-20-2012Mdst3703 graph-theory-11-20-2012
Mdst3703 graph-theory-11-20-2012
 
Mdst3705 2013-02-12-finding-data
Mdst3705 2013-02-12-finding-dataMdst3705 2013-02-12-finding-data
Mdst3705 2013-02-12-finding-data
 
MDST 3705 2012-03-05 Databases to Visualization
MDST 3705 2012-03-05 Databases to VisualizationMDST 3705 2012-03-05 Databases to Visualization
MDST 3705 2012-03-05 Databases to Visualization
 
Hd Overview
Hd OverviewHd Overview
Hd Overview
 
UVA MDST 3703 Studio 01 2012-08-30
UVA MDST 3703 Studio 01 2012-08-30UVA MDST 3703 Studio 01 2012-08-30
UVA MDST 3703 Studio 01 2012-08-30
 
MDST 3703 F10 Studio 12
MDST 3703 F10 Studio 12MDST 3703 F10 Studio 12
MDST 3703 F10 Studio 12
 
UVA MDST 3073 CSS 2012-09-20
UVA MDST 3073 CSS 2012-09-20UVA MDST 3073 CSS 2012-09-20
UVA MDST 3073 CSS 2012-09-20
 
Mdst3703 shiva-2012-10-18
Mdst3703 shiva-2012-10-18Mdst3703 shiva-2012-10-18
Mdst3703 shiva-2012-10-18
 
Mdst3703 maps-and-timelines-2012-11-13
Mdst3703 maps-and-timelines-2012-11-13Mdst3703 maps-and-timelines-2012-11-13
Mdst3703 maps-and-timelines-2012-11-13
 
MDST 3703 F10 Seminar 14
MDST 3703 F10 Seminar 14MDST 3703 F10 Seminar 14
MDST 3703 F10 Seminar 14
 
MDST 3703 F10 Studio 2
MDST 3703 F10 Studio 2 MDST 3703 F10 Studio 2
MDST 3703 F10 Studio 2
 
Mdst3703 2013-10-08-thematic-research-collections
Mdst3703 2013-10-08-thematic-research-collectionsMdst3703 2013-10-08-thematic-research-collections
Mdst3703 2013-10-08-thematic-research-collections
 
Mdst3705 2013-02-26-db-as-genre
Mdst3705 2013-02-26-db-as-genreMdst3705 2013-02-26-db-as-genre
Mdst3705 2013-02-26-db-as-genre
 
Mdst 3559-02-22-sql1
Mdst 3559-02-22-sql1Mdst 3559-02-22-sql1
Mdst 3559-02-22-sql1
 
Mdst3703 2013-09-12-semantic-html
Mdst3703 2013-09-12-semantic-htmlMdst3703 2013-09-12-semantic-html
Mdst3703 2013-09-12-semantic-html
 
UVA MDST 3703 The Stack of Scholarship 2012-09-24
UVA MDST 3703 The Stack of Scholarship 2012-09-24UVA MDST 3703 The Stack of Scholarship 2012-09-24
UVA MDST 3703 The Stack of Scholarship 2012-09-24
 
UVA MDST 3703 2013 08-27 Introduction
UVA MDST 3703 2013 08-27 IntroductionUVA MDST 3703 2013 08-27 Introduction
UVA MDST 3703 2013 08-27 Introduction
 
Mdst 3559-02-15-php
Mdst 3559-02-15-phpMdst 3559-02-15-php
Mdst 3559-02-15-php
 

Similar to Mdst 3559-01-27-data-journalism-studio

George thomas gtra2010
George thomas gtra2010George thomas gtra2010
George thomas gtra2010
George Thomas
 
Open data Websmatch
Open data WebsmatchOpen data Websmatch
Open data Websmatch
data publica
 
Wisdom Of Crowds
Wisdom Of CrowdsWisdom Of Crowds
Wisdom Of Crowds
guest5dedec
 
Silverlight week5
Silverlight week5Silverlight week5
Silverlight week5
iedotnetug
 

Similar to Mdst 3559-01-27-data-journalism-studio (20)

Ben Ryan (University of Leeds) – Timescapes Project
Ben Ryan (University of Leeds) – Timescapes ProjectBen Ryan (University of Leeds) – Timescapes Project
Ben Ryan (University of Leeds) – Timescapes Project
 
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
 
Cloud Libraries
Cloud LibrariesCloud Libraries
Cloud Libraries
 
Where 2.0 NoSQL Presentation 2008 - GeoIQ
Where 2.0 NoSQL Presentation 2008 - GeoIQWhere 2.0 NoSQL Presentation 2008 - GeoIQ
Where 2.0 NoSQL Presentation 2008 - GeoIQ
 
Data Wrangling with Open Refine
Data Wrangling with Open RefineData Wrangling with Open Refine
Data Wrangling with Open Refine
 
George thomas gtra2010
George thomas gtra2010George thomas gtra2010
George thomas gtra2010
 
The Social Semantic Web
The Social Semantic WebThe Social Semantic Web
The Social Semantic Web
 
Open data Websmatch
Open data WebsmatchOpen data Websmatch
Open data Websmatch
 
An Ontology for K-12 Education and the NIEM
An Ontology for K-12 Education and the NIEMAn Ontology for K-12 Education and the NIEM
An Ontology for K-12 Education and the NIEM
 
WEB EVOLUTION - THE SHIFT FROM INFORMATION PUBLISHING TO REASONING
WEB EVOLUTION - THE SHIFT FROM INFORMATION PUBLISHING TO REASONINGWEB EVOLUTION - THE SHIFT FROM INFORMATION PUBLISHING TO REASONING
WEB EVOLUTION - THE SHIFT FROM INFORMATION PUBLISHING TO REASONING
 
Wisdom Of Crowds
Wisdom Of CrowdsWisdom Of Crowds
Wisdom Of Crowds
 
Evolving social data mining and affective analysis
Evolving social data mining and affective analysis  Evolving social data mining and affective analysis
Evolving social data mining and affective analysis
 
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
 
Object models and object representation
Object models and object representationObject models and object representation
Object models and object representation
 
ORE en Fedora Op Klompen
ORE en Fedora Op KlompenORE en Fedora Op Klompen
ORE en Fedora Op Klompen
 
Silverlight week5
Silverlight week5Silverlight week5
Silverlight week5
 
ALA Interoperability
ALA InteroperabilityALA Interoperability
ALA Interoperability
 
Topic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep WebpagesTopic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep Webpages
 
Topic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep WebpagesTopic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep Webpages
 
Metadata and Tagging
Metadata and TaggingMetadata and Tagging
Metadata and Tagging
 

More from Rafael Alvarado

Mdst3703 2013-10-01-hypertext-and-history
Mdst3703 2013-10-01-hypertext-and-historyMdst3703 2013-10-01-hypertext-and-history
Mdst3703 2013-10-01-hypertext-and-history
Rafael Alvarado
 
Mdst3703 2013-09-24-hypertext
Mdst3703 2013-09-24-hypertextMdst3703 2013-09-24-hypertext
Mdst3703 2013-09-24-hypertext
Rafael Alvarado
 
Mdst3703 2013-09-17-text-models
Mdst3703 2013-09-17-text-modelsMdst3703 2013-09-17-text-models
Mdst3703 2013-09-17-text-models
Rafael Alvarado
 
Mdst3703 2013-09-10-textual-signals
Mdst3703 2013-09-10-textual-signalsMdst3703 2013-09-10-textual-signals
Mdst3703 2013-09-10-textual-signals
Rafael Alvarado
 
Mdst3703 2013-09-05-studio2
Mdst3703 2013-09-05-studio2Mdst3703 2013-09-05-studio2
Mdst3703 2013-09-05-studio2
Rafael Alvarado
 
Mdst3703 2013-09-03-plato2
Mdst3703 2013-09-03-plato2Mdst3703 2013-09-03-plato2
Mdst3703 2013-09-03-plato2
Rafael Alvarado
 
Mdst3703 2013-08-29-hello-world
Mdst3703 2013-08-29-hello-worldMdst3703 2013-08-29-hello-world
Mdst3703 2013-08-29-hello-world
Rafael Alvarado
 
Mdst3705 2013-02-19-text-into-data
Mdst3705 2013-02-19-text-into-dataMdst3705 2013-02-19-text-into-data
Mdst3705 2013-02-19-text-into-data
Rafael Alvarado
 
Mdst3705 2013-02-05-databases
Mdst3705 2013-02-05-databasesMdst3705 2013-02-05-databases
Mdst3705 2013-02-05-databases
Rafael Alvarado
 
Mdst3705 2013-01-29-praxis
Mdst3705 2013-01-29-praxisMdst3705 2013-01-29-praxis
Mdst3705 2013-01-29-praxis
Rafael Alvarado
 
Mdst3705 2013-01-31-php3
Mdst3705 2013-01-31-php3Mdst3705 2013-01-31-php3
Mdst3705 2013-01-31-php3
Rafael Alvarado
 
Mdst3705 2012-01-22-code-as-language
Mdst3705 2012-01-22-code-as-languageMdst3705 2012-01-22-code-as-language
Mdst3705 2012-01-22-code-as-language
Rafael Alvarado
 
Mdst3705 2013-01-24-php2
Mdst3705 2013-01-24-php2Mdst3705 2013-01-24-php2
Mdst3705 2013-01-24-php2
Rafael Alvarado
 
Mdst3705 2012-01-15-introduction
Mdst3705 2012-01-15-introductionMdst3705 2012-01-15-introduction
Mdst3705 2012-01-15-introduction
Rafael Alvarado
 
Mdst3703 culturomics-2012-11-01
Mdst3703 culturomics-2012-11-01Mdst3703 culturomics-2012-11-01
Mdst3703 culturomics-2012-11-01
Rafael Alvarado
 
Mdst3703 visualization-2012-10-23
Mdst3703 visualization-2012-10-23Mdst3703 visualization-2012-10-23
Mdst3703 visualization-2012-10-23
Rafael Alvarado
 
Mdst3703 ontology-overrated-2012-10-16
Mdst3703 ontology-overrated-2012-10-16Mdst3703 ontology-overrated-2012-10-16
Mdst3703 ontology-overrated-2012-10-16
Rafael Alvarado
 
Mdst3703 projects-2012-10-11
Mdst3703 projects-2012-10-11Mdst3703 projects-2012-10-11
Mdst3703 projects-2012-10-11
Rafael Alvarado
 
UVA MDST 3703 JavaScript (ii) 2012-10-04
UVA MDST 3703 JavaScript (ii) 2012-10-04UVA MDST 3703 JavaScript (ii) 2012-10-04
UVA MDST 3703 JavaScript (ii) 2012-10-04
Rafael Alvarado
 

More from Rafael Alvarado (20)

Mdst3703 2013-10-01-hypertext-and-history
Mdst3703 2013-10-01-hypertext-and-historyMdst3703 2013-10-01-hypertext-and-history
Mdst3703 2013-10-01-hypertext-and-history
 
Mdst3703 2013-09-24-hypertext
Mdst3703 2013-09-24-hypertextMdst3703 2013-09-24-hypertext
Mdst3703 2013-09-24-hypertext
 
Presentation1
Presentation1Presentation1
Presentation1
 
Mdst3703 2013-09-17-text-models
Mdst3703 2013-09-17-text-modelsMdst3703 2013-09-17-text-models
Mdst3703 2013-09-17-text-models
 
Mdst3703 2013-09-10-textual-signals
Mdst3703 2013-09-10-textual-signalsMdst3703 2013-09-10-textual-signals
Mdst3703 2013-09-10-textual-signals
 
Mdst3703 2013-09-05-studio2
Mdst3703 2013-09-05-studio2Mdst3703 2013-09-05-studio2
Mdst3703 2013-09-05-studio2
 
Mdst3703 2013-09-03-plato2
Mdst3703 2013-09-03-plato2Mdst3703 2013-09-03-plato2
Mdst3703 2013-09-03-plato2
 
Mdst3703 2013-08-29-hello-world
Mdst3703 2013-08-29-hello-worldMdst3703 2013-08-29-hello-world
Mdst3703 2013-08-29-hello-world
 
Mdst3705 2013-02-19-text-into-data
Mdst3705 2013-02-19-text-into-dataMdst3705 2013-02-19-text-into-data
Mdst3705 2013-02-19-text-into-data
 
Mdst3705 2013-02-05-databases
Mdst3705 2013-02-05-databasesMdst3705 2013-02-05-databases
Mdst3705 2013-02-05-databases
 
Mdst3705 2013-01-29-praxis
Mdst3705 2013-01-29-praxisMdst3705 2013-01-29-praxis
Mdst3705 2013-01-29-praxis
 
Mdst3705 2013-01-31-php3
Mdst3705 2013-01-31-php3Mdst3705 2013-01-31-php3
Mdst3705 2013-01-31-php3
 
Mdst3705 2012-01-22-code-as-language
Mdst3705 2012-01-22-code-as-languageMdst3705 2012-01-22-code-as-language
Mdst3705 2012-01-22-code-as-language
 
Mdst3705 2013-01-24-php2
Mdst3705 2013-01-24-php2Mdst3705 2013-01-24-php2
Mdst3705 2013-01-24-php2
 
Mdst3705 2012-01-15-introduction
Mdst3705 2012-01-15-introductionMdst3705 2012-01-15-introduction
Mdst3705 2012-01-15-introduction
 
Mdst3703 culturomics-2012-11-01
Mdst3703 culturomics-2012-11-01Mdst3703 culturomics-2012-11-01
Mdst3703 culturomics-2012-11-01
 
Mdst3703 visualization-2012-10-23
Mdst3703 visualization-2012-10-23Mdst3703 visualization-2012-10-23
Mdst3703 visualization-2012-10-23
 
Mdst3703 ontology-overrated-2012-10-16
Mdst3703 ontology-overrated-2012-10-16Mdst3703 ontology-overrated-2012-10-16
Mdst3703 ontology-overrated-2012-10-16
 
Mdst3703 projects-2012-10-11
Mdst3703 projects-2012-10-11Mdst3703 projects-2012-10-11
Mdst3703 projects-2012-10-11
 
UVA MDST 3703 JavaScript (ii) 2012-10-04
UVA MDST 3703 JavaScript (ii) 2012-10-04UVA MDST 3703 JavaScript (ii) 2012-10-04
UVA MDST 3703 JavaScript (ii) 2012-10-04
 

Mdst 3559-01-27-data-journalism-studio

  • 1. Data JournalismStudio MDST 3559: DataestheticsProf. Alvarado1/27/2011
  • 2. Business Late comers Readings still required for mid-term
  • 3. Review:Features of Data Journalism Depends on emergence of the datasphere Transparency (Politics 2.0) All data leaks ... and freely available tools for publishing and visualizing data (Web 2.0) Google Docs, Zoho, Factual ManyEyes Data converted into a common format CSV = “comma separated vales” = tabular data in a text file
  • 4. Features of Data Journalism (ii) Stories directly reference the data they use e.g. via embedded links to Google Docs Definition of story changes ... Visualizations can be stories in themselves The act of data curation itself considered a journalistic act Journalism, as the Fifth Estate, still mediates between power and people, but in new ways A new relationship of power is opened up
  • 5. TBL says the future of journalism scholarship "lies with journalists scholars who know their CSV from their RDF, can throw together some quick MySQL queries for a PHP or Python output … and discover the story lurking in datasets released by governments, local authorities, agencies, [libraries, museums] or any combination of them – even across national borders."   http://www.guardian.co.uk/media/2010/nov/22/data-analysis-tim-berners-lee
  • 6. Examples Data source Data structure and content Visualization Story/thesis
  • 7. Overview Download a CSV file from Google Format as tab separated file with Excel Open up with a text editor Cut and paste into ManyEyes Explore ManyEyes visualization Upload to Google Explore Google Docs
  • 8. Preliminaries Download jEdit A powerful, open source, cross platform text editor for programmers http://http://www.jedit.org/index.php?page=download Get an account on Google If you do not have one, or if you want a new one for this class Get an account on ManyEyes http://www-958.ibm.com/software/data/cognos/manyeyes/
  • 9. Grab Some Data Go to links on Dataesthetics site Click on each link Should send you to Google Docs For each file, do: File > Download As > Excel Note where you are saving your files
  • 10. Convert the Data Open each file up in Excel Do: Save as > tab delimited text Close file (resave if necessary) Open file in jEdit Make sure that ... Tabs are not converted to spaces File is saved as a Windows or Unix file These options found in Utilities > Buffer Options
  • 11. View in ManyEyes Log in to ManyEyes For each spreadsheet, do: Participate > Upload a Dataset Cut and paste the content of the jEdit window into the text box Do: Ctrl-A, Ctrl-C, Ctrl-V Add metadata and press Create ...
  • 12. ManyEyes What kind of visualization to we choose? See Learn More > Visualization Types (Open in new window or tab) Start with first two visualizations
  • 13. Visualization Types See relationships among data points Network Diagram Scatterplot Matrix Chart Compare a set of values Bar Chart Block Histogram Bubble Chart Track rises and falls over time Line Graph Stack Graph Stack Graph for Categories See the parts of a whole Pie Chart Treemap Treemap for Comparisons Analyze a text Word Tree Tag Cloud Word Cloud Generator Phrase Net See the world Massachusetts Map World Map US County Map New Jersey Map http://www-958.ibm.com/software/data/cognos/manyeyes/page/Visualization_Options.html
  • 14. Combos Social networks in the world Two rows of names Matrix Chart, Treemap, Map (custom) Owners of US Treasury Bonds One row of numbers, one row of names Bubble Chart, Bar Chart Combined Two rows of names + row of numbers Bubble Chart
  • 15. Workflow (Pipeline) Grab Google Convert Excel Copy jEdit Visualize ManyEyes
  • 16. Google Docs Go to docs.google.com Upload the files you had previously saved Use the drag and drop feature or just upload one at a time Create a folder an move them into it Click on an item Explore freezing, sorting, sharing, gadgets ...

Editor's Notes

  1. See http://www-958.ibm.com/software/data/cognos/manyeyes/page/Visualization_Options.html