This document contains summaries and links from an online data journalism course. It discusses techniques for cleaning data, visualizing data in charts and maps, using tools like Google Refine and Tableau, and mashing up multiple data sources. The document provides advice on key things to know about each topic and links to online resources for practicing and learning more about data journalism skills.
The business world is increasingly adopting the Moneyball principle of using data to predict and gain a competitive advantage in healthcare, telecommunications, retail, media, energy, and many other industries. Some argue that organizations that do not possess strong data and the skills to create value out of it will not survive. How can companies leverage data - sometimes described as the “new gold” - for consumer insights, improved processes or new product ideas? Can data assets be leveraged effectively for the overall business?
Bigit Keynote - Big Data & Critical ThinkingLutz Finger
BIGIT Technology Malaysia 2016, the Anchor Event of the Big Data Week Asia featuring concurrent conferences themed Data Security World Show and the 4th Big Data World Show will be held on 19th-20th September 2016 at KLCC Convention Centre, Malaysia.
As the leading Big Data World Show in Asia, BIGIT Technology Malaysia 2016 is co-organized with Malaysia Digital Economy Corporation (MDEC) - Malaysia's government agency leading the national Big Data Analytics initiative.
Making sense of your data doesn't require mountains of data; it requires a systematic approach that leads to actionable insights. But how to get there? This keynote (given at "Predictive Analytics and Business Insights") shows how to extract significant business value from big data with Ask-Measure-Learn, a system that helps you ask the right questions, measure the right data, and then learn from the results. Using this system can help you learn to:
* Focus on business-related questions
* Find measures that have high causation, a low error rate and a low cost
* Create actionable insights by starting with predictions, benchmarks or recommendations
Slide deck from my SharePoint Governance Group Therapy session at SharePoint Saturday Houston. This sessions is structured as a workshop, so there aren't a lot of slides, Feel free to contact me if you have questions!
Contingut encapsulat del programa "Sarrià de Ter en Xarxa" de 21 de juny de 2013 emès a Ràdio Sarrià. Més info i tots els enllaços a http://sarriadeterenxarxa.cat
The business world is increasingly adopting the Moneyball principle of using data to predict and gain a competitive advantage in healthcare, telecommunications, retail, media, energy, and many other industries. Some argue that organizations that do not possess strong data and the skills to create value out of it will not survive. How can companies leverage data - sometimes described as the “new gold” - for consumer insights, improved processes or new product ideas? Can data assets be leveraged effectively for the overall business?
Bigit Keynote - Big Data & Critical ThinkingLutz Finger
BIGIT Technology Malaysia 2016, the Anchor Event of the Big Data Week Asia featuring concurrent conferences themed Data Security World Show and the 4th Big Data World Show will be held on 19th-20th September 2016 at KLCC Convention Centre, Malaysia.
As the leading Big Data World Show in Asia, BIGIT Technology Malaysia 2016 is co-organized with Malaysia Digital Economy Corporation (MDEC) - Malaysia's government agency leading the national Big Data Analytics initiative.
Making sense of your data doesn't require mountains of data; it requires a systematic approach that leads to actionable insights. But how to get there? This keynote (given at "Predictive Analytics and Business Insights") shows how to extract significant business value from big data with Ask-Measure-Learn, a system that helps you ask the right questions, measure the right data, and then learn from the results. Using this system can help you learn to:
* Focus on business-related questions
* Find measures that have high causation, a low error rate and a low cost
* Create actionable insights by starting with predictions, benchmarks or recommendations
Slide deck from my SharePoint Governance Group Therapy session at SharePoint Saturday Houston. This sessions is structured as a workshop, so there aren't a lot of slides, Feel free to contact me if you have questions!
Contingut encapsulat del programa "Sarrià de Ter en Xarxa" de 21 de juny de 2013 emès a Ràdio Sarrià. Més info i tots els enllaços a http://sarriadeterenxarxa.cat
Michael mahlberg exploratory-testing-the_missing_half_of_bddMichael Mahlberg
"We should just call it testing - when it's not exploratory testing it's not real testing anyway" -Twitter, Summer 2011 Lately many professional testers have started to make a clear distinctions between thing that we call testing (like TDD and BDD) and what they consider testing - referring to TDD and BDD mostly as checking. And actually I – and I would think many of you as well – have seen projects with a test coverage of 80% and more that still fail to meet the clients' needs. Even though they meet the specifications perfectly. This points to some value that could be added to techniques like BDD and TDD by embracing the ideas from people like James Marcus Bach, Paul Carvalho and Michael Bolton. After giving an overview of current trends in the testing community like ET (exploratory testing) and ATDD (Acceptance Test Driven Development) this session will try to do exactly that: discuss the - often missing - intersection between BDD and exploratory testing and suggest ways to fill it.
Andrew will share with you a 10 step architecture to building a successful social media strategy. Andrew will pull from real client case studies tbk Creative has been involved in to help you draw your own parallels on how social media can apply to your non-profit. Last, Andrew will discuss all the facets of digital (social media, websites, email marketing) and how you can create cohesion across all these platforms to drive maximum brand awareness and donor dollars for your cause.
Progress Report on Government Linked Data Worldwide3 Round Stones
The W3C Government Linked Data Working Group is chartered for two years (2011-2013) to update the W3C Recommendations related to the publication of government open data as high quality "5 star" Linked Data. This progress report highlights the progress of the Government Linked Data Working Group.
Include:
* Best Practices and the Government Linked Data Cookbook;
* Recommended vocabularies specific to government use; * Guidance on handling legacy data, versioning and procurement; and
* Advice on the value proposition of publishing open government content as Linked Data to support project funding.
This presentation reports on the results of an international collaborative project with 100 libraries to benchmark the marketing of electronic resources.
"Are you planning on organizing an Adventure Time fan club in the near future? How about a furry meet-up? Or even the next Pixar? Whatever your aspirations, there are several key things you should keep in mind, from conducting a resource inventory, knowing your strengths and planning for your weaknesses, creating milestones, and being able to delegate tasks. Jamie Schumacher, founder of Altered Esthetics gallery in sunny Northeast Minneapolis, will stop by to share the secrets of her success, from planning a successful camping trip to organizing a successful project or group." - Kevin Cannon.
Top 5 Social Media Tactics Every School Should Implementfrank barry
Top 5 Social Media Tactics Every School Should Implement. If you're looking to start or improve your social media efforts at your schools then these 5 Social Media Tactics are for you. You'll walk away with great tips and tactics for effective social media use.
Workshop "Open Data 4 Start-up", organizzato dall'Associazione Luoghi di Relazione in collaborazione con TOP-IX all'interno del Digital Experience Festival - 30 maggio 2012 - Intervento di Massimo Zaglio (Open Data Ninja, Consorzio TOP-IX) e di Saverino Reale (Open Data Specialist, CSI Piemonte)
A/B testing, optimization and results analysis by Mariia Bocheva, ATD'18Mariia Bocheva
While working with data we usually face several problems: we don't have enough data, we have too much data, we don't know what to do with this data.
In this session, I'll show how to make sure you can rely on your data and share my favorite ideas on how you can use Google Analytics and other for A/B testing, optimization and analysis.
You’ll gain a better understanding on what to look at to answer your UX questions, how to run a test properly and evaluate the its results.
Telling factual stories in virtual reality, 360 degree video and augmented re...Paul Bradshaw
Slides from a lecture on the MA in Data Journalism and the MA in Media Production at Birmingham City University, explaining what types of stories and projects suit immersive technologies such as VR and AR, considerations when using them, and techniques employed in the field.
Generative AI tools like ChatGPT and Google Bard are already changing journalism workflows - in this talk for the BBC Local Democracy Reporters conference 2023, Paul Bradshaw walks through a number of ways those tools can help local journalists - and how to avoid the pitfalls and weaknesses of AI including bias and hallucinations.
More Related Content
Similar to Data Journalism 2: cleaning, combining, communicating
Michael mahlberg exploratory-testing-the_missing_half_of_bddMichael Mahlberg
"We should just call it testing - when it's not exploratory testing it's not real testing anyway" -Twitter, Summer 2011 Lately many professional testers have started to make a clear distinctions between thing that we call testing (like TDD and BDD) and what they consider testing - referring to TDD and BDD mostly as checking. And actually I – and I would think many of you as well – have seen projects with a test coverage of 80% and more that still fail to meet the clients' needs. Even though they meet the specifications perfectly. This points to some value that could be added to techniques like BDD and TDD by embracing the ideas from people like James Marcus Bach, Paul Carvalho and Michael Bolton. After giving an overview of current trends in the testing community like ET (exploratory testing) and ATDD (Acceptance Test Driven Development) this session will try to do exactly that: discuss the - often missing - intersection between BDD and exploratory testing and suggest ways to fill it.
Andrew will share with you a 10 step architecture to building a successful social media strategy. Andrew will pull from real client case studies tbk Creative has been involved in to help you draw your own parallels on how social media can apply to your non-profit. Last, Andrew will discuss all the facets of digital (social media, websites, email marketing) and how you can create cohesion across all these platforms to drive maximum brand awareness and donor dollars for your cause.
Progress Report on Government Linked Data Worldwide3 Round Stones
The W3C Government Linked Data Working Group is chartered for two years (2011-2013) to update the W3C Recommendations related to the publication of government open data as high quality "5 star" Linked Data. This progress report highlights the progress of the Government Linked Data Working Group.
Include:
* Best Practices and the Government Linked Data Cookbook;
* Recommended vocabularies specific to government use; * Guidance on handling legacy data, versioning and procurement; and
* Advice on the value proposition of publishing open government content as Linked Data to support project funding.
This presentation reports on the results of an international collaborative project with 100 libraries to benchmark the marketing of electronic resources.
"Are you planning on organizing an Adventure Time fan club in the near future? How about a furry meet-up? Or even the next Pixar? Whatever your aspirations, there are several key things you should keep in mind, from conducting a resource inventory, knowing your strengths and planning for your weaknesses, creating milestones, and being able to delegate tasks. Jamie Schumacher, founder of Altered Esthetics gallery in sunny Northeast Minneapolis, will stop by to share the secrets of her success, from planning a successful camping trip to organizing a successful project or group." - Kevin Cannon.
Top 5 Social Media Tactics Every School Should Implementfrank barry
Top 5 Social Media Tactics Every School Should Implement. If you're looking to start or improve your social media efforts at your schools then these 5 Social Media Tactics are for you. You'll walk away with great tips and tactics for effective social media use.
Workshop "Open Data 4 Start-up", organizzato dall'Associazione Luoghi di Relazione in collaborazione con TOP-IX all'interno del Digital Experience Festival - 30 maggio 2012 - Intervento di Massimo Zaglio (Open Data Ninja, Consorzio TOP-IX) e di Saverino Reale (Open Data Specialist, CSI Piemonte)
A/B testing, optimization and results analysis by Mariia Bocheva, ATD'18Mariia Bocheva
While working with data we usually face several problems: we don't have enough data, we have too much data, we don't know what to do with this data.
In this session, I'll show how to make sure you can rely on your data and share my favorite ideas on how you can use Google Analytics and other for A/B testing, optimization and analysis.
You’ll gain a better understanding on what to look at to answer your UX questions, how to run a test properly and evaluate the its results.
Telling factual stories in virtual reality, 360 degree video and augmented re...Paul Bradshaw
Slides from a lecture on the MA in Data Journalism and the MA in Media Production at Birmingham City University, explaining what types of stories and projects suit immersive technologies such as VR and AR, considerations when using them, and techniques employed in the field.
Generative AI tools like ChatGPT and Google Bard are already changing journalism workflows - in this talk for the BBC Local Democracy Reporters conference 2023, Paul Bradshaw walks through a number of ways those tools can help local journalists - and how to avoid the pitfalls and weaknesses of AI including bias and hallucinations.
How to generate a 100+ page website using parameterisation in RPaul Bradshaw
Parameterisation can be used to build a website with a page for every region/category/row in your data. This talk at DataHarvest/EIJC 2023 walks through how to do that, with example code and tips.
ChatGPT (and generative AI) in journalismPaul Bradshaw
A brief roundup of tips and examples of using ChatGPT and generative AI for journalism (especially data journalism) - presentation from DataHarvest 2023
A brief history of data in journalism, how data journalism differs from forms such as CAR, and what qualities and skills modern data journalism roles involve.
Talk for the Comet Research Centre at Tampere University, Helsinki, Finland, March 2023.
Using narrative structures in shortform and longform journalismPaul Bradshaw
How an understanding of narrative structures can help you write for different platforms and formats, from shortform (Twitter) to news articles and longform features. The second part of a presentation to the Civic Journalism Lab at Newcastle University - you can find the first part at https://www.slideshare.net/onlinejournalist/narrative-and-multiplatform-journalism-part-1
Narrative and multiplatform journalism (part 1)Paul Bradshaw
How an understanding of narrative concepts can help you get to grips with new (and old) platforms and genres. Presentation to the Civic Journalism Lab at Newcastle University - you can find the second part at https://www.slideshare.net/onlinejournalist/using-narrative-structures-in-shortform-and-longform-journalism
Storytelling in the database era: uncertainty and science reportingPaul Bradshaw
Presentation at the Humboldt Foundation's International Journalists' Programmes 2020 about the changes within journalism around using interactivity for telling stories, and communicating uncertainty. The slides also include recommendations around avoiding mistakes.
Read| The latest issue of The Challenger is here! We are thrilled to announce that our school paper has qualified for the NATIONAL SCHOOLS PRESS CONFERENCE (NSPC) 2024. Thank you for your unwavering support and trust. Dive into the stories that made us stand out!
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
This presentation provides a briefing on how to upload submissions and documents in Google Classroom. It was prepared as part of an orientation for new Sainik School in-service teacher trainees. As a training officer, my goal is to ensure that you are comfortable and proficient with this essential tool for managing assignments and fostering student engagement.
Honest Reviews of Tim Han LMA Course Program.pptxtimhan337
Personal development courses are widely available today, with each one promising life-changing outcomes. Tim Han’s Life Mastery Achievers (LMA) Course has drawn a lot of interest. In addition to offering my frank assessment of Success Insider’s LMA Course, this piece examines the course’s effects via a variety of Tim Han LMA course reviews and Success Insider comments.
Embracing GenAI - A Strategic ImperativePeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
Model Attribute Check Company Auto PropertyCeline George
In Odoo, the multi-company feature allows you to manage multiple companies within a single Odoo database instance. Each company can have its own configurations while still sharing common resources such as products, customers, and suppliers.
Palestine last event orientationfvgnh .pptxRaedMohamed3
An EFL lesson about the current events in Palestine. It is intended to be for intermediate students who wish to increase their listening skills through a short lesson in power point.
Acetabularia Information For Class 9 .docxvaibhavrinwa19
Acetabularia acetabulum is a single-celled green alga that in its vegetative state is morphologically differentiated into a basal rhizoid and an axially elongated stalk, which bears whorls of branching hairs. The single diploid nucleus resides in the rhizoid.
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
8. “With the help of just Benford’s law
and data sets to compare he’s
able to demonstrate how the police
are systematically hiding over a
thousand murders a year in a
single state, and that’s just in one
small part of the article”
Monday, 5 March 2012
- Pete Warden
10. 5 things you need to know about
cleaning data
1. Data always needs cleaning up
2. Treat the ‘source’ like a source
3. Use the right ‘average’ and
percentage
4. Watch for changing context: inflation,
boundaries, classification
5. Always work on copies of raw data
Monday, 5 March 2012
12. “What the Independent have done
is confuse the UK’s deficit with our
debt [making] the debt problem
look around eight times worse than
it is. And it used the whole of its
front page to do so.”
- James Ball
Monday, 5 March 2012
14. Question?
A town has two hospitals. Hospital A is
bigger than hospital B. One of them has
a birth rate of 60% boys. Which one is it
more likely to be?
Monday, 5 March 2012
15. Question?
The smaller hospital is more likely to
have a 60% birth rate - larger samples
are more stable.
Monday, 5 March 2012
17. What is the data worth?
Measurement doesn't answer anything if
there's only one variable
Statistical significance
Sample size and selection
Controls and the placebo effect
Regression to the mean
Read up.
Monday, 5 March 2012
18. Getting data ready to answer
questions
Data > Text to columns or =SPLIT
Find & replace
=IF(condition, if met, if not)
=TRIM, =CONCATENATE
=RIGHT, =LEFT, =MID
=REPLACE, =SUBSTITUTE
=LEN
Monday, 5 March 2012
19. Walkthrough: cleaning data in
Google Refine
Edit cells > common transforms
Edit cells > split multi-valued cells
Facet > text facet
Export...
Monday, 5 March 2012
22. 5 things you need to know about
visualising data
1. Choose the chart for the purpose
2. For answers or for story?
3. Good design is when there’s nothing
more to take away
4. It should be self-contained & have refs
5. Be careful with scales and classes
Monday, 5 March 2012
29. Visualisation tools
ManyEyes, Tableau, Number Picture
Wordle, Tagxedo
BatchGeo, FusionTables
Gephi
Delicious.com/paulb/vis+tools
Monday, 5 March 2012
30. Distribution: getting social
Publish embed code & link to data
Have or join a Flickr group for
visualisations, comment on others
Tumblr blog
Digg, Reddit, Stumbleupon
Buzzdata
Monday, 5 March 2012
32. 5 things you need to know about
mashing data
1. It is what a journalist does best
2. Look for a point of connection: place?
Person? Company? Date? Code?
3. Mashups can be live, updated or
static
4. What an API can do
5. What APIs there are
Monday, 5 March 2012
34. Mashup tools
Yahoo! Pipes, xFruits
OpenHeatMap
Mapalist, Maptube, FusionTables
Scraperwiki
Google Refine
Monday, 5 March 2012
35. Walkthrough: grabbing geo data
with Google Refine
Edit column > Add column by fetching
URLs
Use GREL (Google Refine Expression
Language)
Search web for help & examples
Monday, 5 March 2012
38. Lab
Before the lab: play with these
techniques yourself, have problems,
find solutions, raise questions. Install
Google Refine and Tableau on your
laptop to use.
- Visualise, interrogate or mash data
Monday, 5 March 2012
39. Books
Kaiser Fung - Numbers Rule Your World
Ben Goldacre - Bad Science
Donna Wong - The WSJ Guide to
Information Graphics
Brian Suda - A Practical Guide to
Designing with Data
Monday, 5 March 2012