Data Journalism Training @ Southern Metropolis Daily, Guangdong, China

  • 3,404 views
Uploaded on

A 3-hour lecture on data journalism, with focus on data visualization, for the Southern Metropolis Daily, one of the biggest circulation newspapers in Guangdong Province, China.

A 3-hour lecture on data journalism, with focus on data visualization, for the Southern Metropolis Daily, one of the biggest circulation newspapers in Guangdong Province, China.

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
No Downloads

Views

Total Views
3,404
On Slideshare
0
From Embeds
0
Number of Embeds
1

Actions

Shares
Downloads
0
Comments
0
Likes
14

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide
  • \n
  • \n
  • \n
  • \n
  • natural-language processing, pattern recognition and machine learning.\n
  • http://www.nytimes.com/2011/09/11/business/computer-generated-articles-are-gaining-traction.html?pagewanted=all\n
  • http://www.alibuybuy.com/posts/73006.html#jtss-tsina\n
  • \n
  • \n
  • \n
  • \n
  • \n
  • http://www.texastribune.org/library/data/government-employee-salaries/#_rank_and_file\n
  • \n
  • 重思路,不重制作\n
  • \n
  • 问题?切入点?数据?展现方式?阳光/政见\n
  • \n
  • \n
  • don’t assume everything’s easy in the west\n
  • http://www.censtatd.gov.hk/home/index.jsp\n Hong Kong’s Census and Statistics Department is a one-stop shop for government statistics on economic indicators, demographics, health, labor, and many other areas.\n
  • http://www.hkexnews.hk/listedco/listconews/mainindex/SEHK_LISTEDCO_DATETIME_TODAY.htm\n Hong Kong Exchange: Warehouse of information for companies listed in Hong Kong, including interim and annual reports, and required disclosures such as acquisitions of more than 15 per cent of a company, change of senior executives.\n
  • Webb-site: An independent website that tracks HK company disclosures, news and other events. A good place to start, but it is always good practice to check back with the original source.\n Webb-site.com was established in 1998 by David M. Webb, a former investment banker who has lived in Hong Kong since 1991. We provide anindependent commentary on corporate and economic governance, business, finance, investment and regulatory affairs in Hong Kong. Webb-site.com is run on a not-for-profit basis.\n
  • http://www.google.com/publicdata/home\n
  • \n
  • back to the question of information vs. data\n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • Fields: Design, Communication, Information and their mix: Visual Communication, Data  journalism, User Interface\nRaw elements: Look & Feel, Idea, Data\nDisciplines: Journalism, Information Architecture, Typography\nProcess elements: Visual Design, Objective, Dataset\nOutputs: Layout, Story, Report, Data Analysis, Dashboard, Interface\nFinal result: Form, Concept, Knowledge\nCore competencies: Readability, Logic, Usability\nCore values: Simplicity, Informativeness, Relevance\n
  • \n
  • \n
  • http://www.guardian.co.uk/world/interactive/2012/may/08/gay-rights-united-states\n
  • \n
  • \n
  • http://www.whitehouse.gov/omb/budget\n
  • http://www.guardian.co.uk/news/datablog/2011/oct/26/government-spending-department-2010-11#graphic\n
  • http://www.nytimes.com/interactive/2010/02/01/us/budget.html\nHierarchy\n
  • http://www.nytimes.com/interactive/2012/02/13/us/politics/2013-budget-proposal-graphic.html\n
  • \n
  • \n
  • https://www.nytimes.com/interactive/2012/04/24/world/asia/all-in-the-family.html\n
  • \n
  • go online\n
  • https://www.recordedfuture.com/2012/04/09/piece-together-web-data-like-a-detective/\n
  • \n
  • \n
  • \n
  • \n
  • http://www.guardian.co.uk/technology/datablog/interactive/2012/apr/16/web-filtering-censorship\n
  • \n
  • \n
  • \n
  • go online\n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • http://www.ipe.org.cn/pollution/index.aspx\n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • http://www.lasvegassun.com/hospital-care/events-chart/\n
  • http://www.lasvegassun.com/hospital-care/infections-interactive/\nhttp://www.lasvegassun.com/hospital-care/surgical-injuries-interactive/\n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • It takes 300,000 Web pages per hour from 40,000 to 50,000 Internet sources and digests the information to create a database.\nRecorded Future works with structure such as people, places, products and companies; events such as meetings, travels, acquisitions, earning calls and natural disasters; and ontologies or hierarchies that explain groupings such as world leaders, corporations or technology areas. \n
  • \n
  • \n
  • \n

Transcript

  • 1. 数据新闻 Data Journalism 南方都市报 2012.5.14
  • 2. 信息?数字?
  • 3. 设计 多媒体媒体 数据可视化 数据新闻 社交媒体 计算机/ 信息科学
  • 4. 大数据 Big Data
  • 5. “大数据不仅仅是一个时髦词汇,我相信它有真正的未来,我们需要分辨出 些是未来的趋势, 些是时髦的概念,而大数据无疑是个趋势。” 硅谷顶级风险投资机构 Draper Fisher Jurvweston 创始合 人Tim Draper
  • 6. 生产:新闻由机器所写
  • 7. 消费:新闻由机器所读Wavii,采用自然语言处理方法,从Web(包括新闻网站、Twitter和博客)上获得跟大量话题有 的新闻,并对文章、微博、视频等内容进行分类,然后自动创建 于某事实或者新闻事件的“状态更新”——往往用一句话来总结某个新闻,再附上一个查看全文的链接。呈现在Wavii用户面前的是一个类似Facebook风格的新闻源,里面包含了和他们 注的主题有 的所有新闻。简单来说,它是通过结构化的方式呈现非结构化的新闻。
  • 8. 商业模式:温水煮青蛙
  • 9. 媒体何为?记者何为?
  • 10. Data drives the story 数据为先 文字在后 Data with the story 数据文字 相辅相成 Data for the story 数据为辅
  • 11. St. Petersburg Times, Politifact, The Obameter, fact-checking of statements.
  • 12. 中国革命后代 − 华尔街日报WSJ, Nov 26, Children of the Revolution
  • 13. 公务员收入 660,000 机构 职位 收入
  • 14. OnlineJournalism Blog
  • 15. 团队?
  • 16. 数据选题 Data Topics
  • 17. 南海 十八大 油价 省委选举水污染 512四周年 贪官
  • 18. 避免情况✤ 数据过于有限✤ 没有趋势或结论✤ 文字/多媒体更适合✤ 地图不成地图✤ 表格已经足
  • 19. 数据获取Data Acquisition
  • 20. ✤ 表格✤ 报告 pdf to excel✤ 网页 whois✤ 视频✤ 微博
  • 21. 数据分析 Data Analytics
  • 22. Document Viewer / OpenCalais
  • 23. ✤ 转化数据格式——Excel✤ 缩小数据范围✤ 透过数据看本质✤ 莫轻信数字
  • 24. 数据可视化 Data visualization
  • 25. 事实 分析 FACT 展示 Analysis Presentation 理解 美化 Understand Illustrate探索性产品 解释性产品Exploratory Explanatory Product Product
  • 26. 文本分析:伊战/维基解密
  • 27. 对比图:美国各州同性恋权利
  • 28. WhoRunsHK - SCMP
  • 29. 从枯燥到有趣
  • 30. 政府预算
  • 31. 政府预算:卫报 2011
  • 32. 政府预算:纽约时报 2012
  • 33. 政府预算:纽约时报 2013 - 互动版
  • 34. 政府预算:政见
  • 35. 从 杂到清晰
  • 36. 薄之 系图 − 纽约时报
  • 37. 薄之 系图 − 路透
  • 38. 薄之 系图 Silobreaker
  • 39. 薄之 系图 Recorded Future
  • 40. 王之时间轴 Recorded Future
  • 41. 中国之未来 Recorded Future
  • 42. 地图Mapping
  • 43. 日本震后核辐射 − 纽约时报NYT, Mar 16, 2011, The Evacuation Zones Around the Fukushima Daiichi Nuclear Plant
  • 44. 网络审查 − 卫报
  • 45. 公民地图 − 南华早报http://citizenmap.scmp.com
  • 46. 80,000+粉丝
  • 47. 最初版本 案例数量 香港地图 互动时间轴 社交媒体 传统新闻
  • 48. 众包
  • 49. 编辑判断
  • 50. 推广与合作
  • 51. 追踪事件发展
  • 52. 市民 问题 被发现发现报告 被表达帮助 被报道 被重视南华早报 环保团体深入调查 认事件持续报道 教育民众公民意识 政策推动
  • 53. 可 制,可繁殖
  • 54. 中国污染地图 − 马军
  • 55. 伦敦骚乱UK riots: every verified incident - interactive map
  • 56. 伦敦骚乱http://www.guardian.co.uk/news/datablog/interactive/2011/aug/09/uk-riots-incident-map
  • 57. 伦敦骚乱http://www.guardian.co.uk/news/datablog/2011/aug/15/riots-map-happened-suspects-addresses
  • 58. 伦敦骚乱 One 24 year-old man described his car journey from Chingford to the riot in Tottenham: "We [saw] cars and there were groups of boys. Only group of boys in vans. And they were speeding down the motorway tryna get to that direction … everyone was communicating. Everyone was putting their hazards on. It was all a laugh. It was like a, just a fun day out. There was no law. Nothing to control us...So we got there. They blocked off the exit towards the Tottenham junction and we saw about 12 police vans parading down the motorway at that same time. Everyone was speeding past them. Everyone was swearing at them. People were flashing their hazards, putting on their beams. The police just carried on driving. They didnt stop for no-one."http://www.guardian.co.uk/uk/datablog/video/2011/dec/05/england-riots-commute-maphttp://www.guardian.co.uk/uk/datablog/2011/dec/05/england-riots-distance-travelled-map
  • 59. 伦敦骚乱 "These riots were not about poverty," said David Cameron. "That insults the millions of people who, whatever the hardship, would never dream of making others suffer like this." But the question is: how do we know? If poverty affects health, education and crime, could it be a factor in the events of last week? We wanted to know what would happen if we overlayed those addresses with the poverty indicators mapped by Englands Indices of Multiple Deprivation, which cover very small areas. We had already done this with the riot locations themselves, but knowing where people came from seems a better indicator, especially if people were travelling.http://www.guardian.co.uk/news/datablog/2011/aug/16/riots-poverty-map-suspects
  • 60. 伦敦骚乱http://www.guardian.co.uk/news/datablog/interactive/2011/aug/16/riots-poverty-map
  • 61. Guardian, Data desk
  • 62. iPad/Fathom: 公共健康
  • 63. 数据本地化 Data localization
  • 64. 拉斯维加斯医疗事故
  • 65. 拉斯维加斯医疗事故
  • 66. 数据工具 Data Tools
  • 67. 数据前景 Future of Data
  • 68. 内容聚合 Aggregated content 语义 掘 Semantic text-mining Contextual insight 深度透视 系分析 Relational analysis 视觉解释 Explanatory graphicshttp://blog.twingly.com/2011/03/18/an-interview-with-the-ceo-of-silobreaker/
  • 69. Insight no longer comes from access toinformation but from your ability to makesense of it.And we cannot solve information overloadsimply by trying to read more articles. Wedon’t have the time nor the brain capacity. Kristofer Månsson CEO of Silobreaker
  • 70. Recorded Future “What companies are working on fuel cell products expected between 2012 and 2015?“ “Which heads of state visited Libya in 2010?“ “What pharma companies are releasing new products in the first quarter of 2012?”
  • 71. 放数据Open Data
  • 72. @马金馨majinxin.cn@gmail.com