Adventure in
       Computer-Assisted Reporting
                     AAN
                   Las Vegas
                 October, 2010
Julia Goldberg
What do we mean by computer-assisted reporting?




  Not this…
Not This Either
Computer-Assisted Reporting

Mining the internet for data,
working with the information
and presenting it to readers
Where’s the money?
              http://www.sfreporter.com/santafe/article-4860-wheres-the-money.html
Socrata
public information
    databases
  embeddable
  visualizations
Spreadsheets

 the real reason you became a
  journalist was probably not
because you love spreadsheets
Spread sheet magic!
Google gadgets
Maps

  Map mashups
  google maps
interactive maps
      maps
      maps
      maps
Parks


   Zeemaps
Geobatch
timeline
           Docsdocsdocs
Data scraping

        the next frontier

using tools to pull information off
             the web
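
To make "pulling information off the web" concrete, here is a minimal sketch of a scraper that turns an HTML table into rows you could paste into a spreadsheet. It uses only Python's standard library. The sample page, the names in it, and the helper names (TableScraper, scrape_table) are made up for illustration; a real site would need its own handling, and this is not how Outwit or ScraperWiki work internally.

```python
from html.parser import HTMLParser


class TableScraper(HTMLParser):
    """Collect the text of every table cell, one list per table row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows
        self._row = None    # row currently being read, or None
        self._cell = None   # text pieces of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def scrape_table(html):
    """Return the table rows found in an HTML string (hypothetical helper)."""
    parser = TableScraper()
    parser.feed(html)
    parser.close()
    return parser.rows


# Made-up sample page standing in for a lobbyist listing.
SAMPLE_HTML = """
<table>
  <tr><th>Lobbyist</th><th>Organization</th></tr>
  <tr><td>Jane Doe</td><td>Example Oil Co.</td></tr>
  <tr><td>John Roe</td><td>Example Realty</td></tr>
</table>
"""

if __name__ == "__main__":
    for row in scrape_table(SAMPLE_HTML):
        print(row)
```

In practice you would fetch the page first and feed its text to scrape_table; the sample string keeps the sketch runnable without a network connection.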
VisualizeThis
Money bubbles

    for all

AAN Writers Conference/Las Vegas

Editor's Notes

  1. More like this…
  2. And this…
  3. And this
  4. And this
  5. I want to share with you some of the tools we’ve been using at the Reporter for computer-assisted projects, all of which are very accessible, easy to use, free and, for us, are making an impact on how we work with information. For Where’s the Money, Corey Pein had decided he wanted to do basically a Forbes-style piece: who are the richest people in Santa Fe? And he actually talked to someone at Forbes to find out how they do it, and it’s basically a guesstimate based on a lot of indicators, like land and home ownership, charitable donations, political contributions. By the time he was done, he had amassed a lot of spreadsheets, because you really can’t do a story like this without working in spreadsheets. You have to be able to keep track of the information, and in some cases you’re going to need to do calculations. When he was done, we wanted to be able to present not just the results, but share some of that reporting with our readers. So we used a site that we use quite often, called Socrata. Go to Socrata
  6. http://www.socrata.com first line linkable
  7. I know. I don’t actually love working with spreadsheets either, but there are a lot of things you will never really do very well if you can’t handle working with them. For campaign finance spending, or any sort of enterprise reporting involving lists or money or anything you want to be able to sort, you have to suck it up sometimes and work with them. We go back and forth between Excel and Google spreadsheets, and lately more Google spreadsheets because we’ve been playing around with Google gadgets, which I’ll talk about in a minute. But first, a few resources.
  8. One of the things that’s really tedious about spreadsheets is cleaning them up, especially when you download a spreadsheet, let’s say with political contributions. We had this a lot this year with our Secretary of State’s website: you can download contributions in Excel, but the fields are a nightmare, it’s just a total mess, and if you want to work with it, you spend as much time cleaning up the spreadsheet as you would just looking at the data. This is where Magic/Replace comes in. I don’t know of another site like this, though there may be others, but this one has been consistently really helpful. I’m just going to show you how it works.
  9. Google, evil, evil Google. Despite being evil, Google does have a lot of useful tools that we’re still playing around with, and once you’ve actually put together a Google spreadsheet, it’s really easy to use some of their gadgets to see if the different visualizations they offer help you interrogate your information a little bit. From regular charts to motion charts to pivot tables. And that’s just the tip of the iceberg. Alexa did a heat chart this week that tracked the changes in election spending, using a Google gadget.
  10. There are so many mapping tools out there right now that I just chose a few that are super easy and that I think can add value to the kinds of stories our papers often do, and the kinds of stories we should do. There’s going to be a lot of new census information this year, and there are a lot of mapping tools out there that can be used in conjunction with census data to really mine that information for stories and presentations. But the census is its own conversation, so let’s talk about parks instead.
  11. State of Play is a story Alexa did in the fall. It was actually a story idea generated at the Toronto AAN convention: going out and rating all your city’s playgrounds and seeing where the money is being spent and which ones are in disarray and that sort of thing. And Alexa did that and put together the information about how all the money gets spent in the old wealthy neighborhoods where there aren’t really any kids, and less on the other side of town.
  12. Oil vs. Real Estate was a story we did over the summer, looking at where the money was coming from in our gubernatorial race. We put some of that information into Socrata spreadsheets, but we also wanted to map it and see where it was coming from, and so we used BatchGeo, which will take your spreadsheet and make an interactive Google map for you. Now, if you are already comfortable batching data through Google using the API, then you don’t need this, but if you’re not, this is idiot-proof.
  13. Document Cloud is a project funded by the Knight News Challenge, and it was dreamed up by some journalists from ProPublica and The New York Times. It’s for journalists: it’s a primary source repository where you can put your documents, annotate them, edit them and publish them. If you’re on the back end it’s pretty cool because it really creates a compendium of your documents.
  14. So, what is data scraping? It’s basically writing code to pull information off of websites. It’s the reason I have 1,000 emails a day trying to sell me Viagra. It’s super annoying. But for journalists, it’s an emergent field because although there is a lot of information on the internet, sometimes it’s not just there to download, or not in the form you want to be able to get. More advanced journalists with coding skills are spending a fair amount of time writing code to scrape jail websites, and auto scraping to get their information. I’ve included some information that will give you an introduction to that concept, but if this is brand new, I’m going to suggest you start with Outwit. Outwit is a Firefox extension that can make your work on the internet a lot easier. ScraperWiki is a site where you can build your own, but you can also put out requests for particular scrapers. Demo data scraper on Outwit of SOS site; pull lobbyist/organizations.
  15. Many Eyes application, link on Visualize This
  16. Links to Many Eyes presentation on http://www.sfreporter.com
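
For anyone who wants to see what the spreadsheet cleanup described in notes 7 and 8 might look like in code, here is a minimal sketch in Python using only the standard library. It tidies a messy contributions download and then totals and sorts by contributor. The column names, the sample rows, and the helper names are all made up for illustration; a real Secretary of State export will have its own quirks, and this is not a description of how Magic/Replace works.

```python
import csv
import io
from collections import defaultdict

# Made-up sample of a messy download: stray spaces, inconsistent
# capitalization, and dollar amounts stored as text.
RAW_CSV = """Contributor Name , Amount ,City
"  SMITH, JOHN ","$1,000.00",santa fe
"Smith, John","$250",Santa Fe
"DOE, JANE "," $500.00 ",ALBUQUERQUE
"""


def clean_text(value):
    """Collapse extra whitespace and normalize capitalization."""
    return " ".join(value.split()).title()


def clean_amount(value):
    """Turn a string such as ' $1,000.00 ' into the number 1000.0."""
    return float(value.strip().replace("$", "").replace(",", ""))


def clean_rows(raw_text):
    """Yield one tidy dictionary per data row of the raw CSV text."""
    reader = csv.reader(io.StringIO(raw_text))
    header = [column.strip() for column in next(reader)]
    for row in reader:
        if not row:
            continue  # skip blank lines
        record = dict(zip(header, row))
        yield {
            "name": clean_text(record["Contributor Name"]),
            "amount": clean_amount(record["Amount"]),
            "city": clean_text(record["City"]),
        }


def totals_by_contributor(rows):
    """Sum amounts per contributor, largest total first."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["name"]] += row["amount"]
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for name, total in totals_by_contributor(clean_rows(RAW_CSV)):
        print(f"{name}: ${total:,.2f}")
```

The point of cleaning first is that "SMITH, JOHN" and "Smith, John" only add up as one donor once the names match; the same idea applies whether you do it in code, in Excel, or with a cleanup tool.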