Use of Open Data in Hong Kong (LegCo 2014), Sammy Fung
Presentation on use of open data in HK given to Legislative Council Secretariat. Content is mixed from my presentations at startmeup 2013 and opendatahk meetup.
Use of Open Data in Hong Kong
1. Use of Open Data in Hong Kong
Sammy Fung
sammy.hk
Incu-Lab ICE in StartMeUpHK - Open Data Initiative Gathering
2013/12/04
http://slidesha.re/1cleS2y
3. We want an easier way to access public data.
4. Agenda
● What is Open Data?
● Use of Open Source Software in web crawling.
● Starting a new Open Source project, hk0weather, to create Open Weather Data.
5. Sammy Fung
● Software Developer
  – uses and develops open source software.
  – Perl → PHP → Python.
  – interested in Data Mining / Web Crawling.
  – owns a web and mobile technology startup.
6. Sammy Fung
● 15+ years in Open Source Communities.
  – Founding Chairman, Hong Kong Linux User Group.
  – Founding Chairman, Open Source Hong Kong.
  – Member, GNOME Asia committee.
  – Mozilla Representative.
  – Member, program committee at COSCUP.
    ● Conference for Open Source Coders, Users and Developers.
    ● Largest open source conference in Taiwan.
8. Open Data
Three Laws of Open Government Data by David Eaves.
1. If it can't be spidered or indexed, it doesn't exist.
2. If it isn't available in open and machine readable format, it can't engage.
3. If a legal framework doesn't allow it to be repurposed, it doesn't empower.
http://eaves.ca/2009/09/30/three-law-of-open-government-data/
10. * One Star - Open Data
1. make your stuff available on the Web (whatever format) under an open license.
2. make it available as structured data (e.g., Excel instead of image scan of a table)
3. use non-proprietary formats (e.g., CSV instead of Excel)
4. use URIs to denote things, so that people can point at your stuff.
5. link your data to other data to provide context.
5stardata.info by Tim Berners-Lee, the inventor of the Web.
11. ** Two Star - Open Data
12. *** Three Star - Open Data
13. **** Four Star - Open Data
14. ***** Five Star - Open Data
16. Open Data in Hong Kong
● Data.One
  – http://www.gov.hk/en/theme/psi
  – released on 2011/3/31.
  – First App Competition on Data.One
    ● Call for Submission now till 2014/02/28.
17. Weather Information in Hong Kong
● Hong Kong Observatory
  – Hourly Hong Kong Weather Report
  – Regional Weather in Hong Kong (10 min updates)
  – Weather Forecast and Weekly Weather Forecast
  – Typhoon Report and Forecast
20. Weather at Data.One
● I posted a blog entry, 'Progress of Open Government Data in Hong Kong', on 2013/01/17.
● Weather at Data.One provides 7 dataset URLs, which return RSS (XML) format (Eng/TChi/SChi).
  – One word: Useless.
  – The Data.One dataset (RSS) is completely different from HKO's own paid service (XML).
21. Weather at Data.One
● Example - Current local weather report:
● Plain text report in RSS.
● The only difference is how the report content is wrapped:
  – Website: a pair of HTML tags, e.g. <PRE>....</PRE>.
  – Data.One: a pair of RSS description tags, <description>....</description>.
● Other weather data is missing, e.g. regional temperature updates every 12 mins.
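The point about the report being plain text wrapped in a `<description>` tag can be seen by trying to read a value out of such a feed. The following is a minimal sketch using only the Python standard library; the RSS snippet, its wording and the `Air temperature` pattern are made up for illustration and are not the actual Data.One feed.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical RSS snippet: the report is plain text inside <description>.
SAMPLE_RSS = """<rss version="2.0">
  <channel>
    <title>Current Weather Report (sample)</title>
    <item>
      <title>Bulletin at 14:00</title>
      <description>Air temperature : 18 degrees Celsius
Relative Humidity : 72 per cent</description>
    </item>
  </channel>
</rss>
"""


def report_text(rss_xml):
    """Return the plain-text report held in the first item's <description>."""
    root = ET.fromstring(rss_xml)
    return root.findtext("./channel/item/description")


def air_temperature(report):
    """Pull the temperature out of the free text with a regular expression."""
    match = re.search(r"Air temperature\s*:\s*(-?\d+)", report)
    return int(match.group(1)) if match else None


text = report_text(SAMPLE_RSS)
print(air_temperature(text))
```

The XML parser only gets as far as the report text; every individual value still has to be recovered with a hand-written pattern that breaks whenever the wording of the report changes.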
22. Weather at Data.One
● Weather at Data.One is a 'report', not 'data'.
● The weather RSS was already released by HKO before the launch of Data.One.
● Technically, JSON/XML formats are easier for computer programs to read.
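To illustrate the last point, here is a small sketch of how weather readings could be consumed if they were published as structured JSON. The field names (`updated`, `readings`, `station`, `temperature_c`) and the values are assumptions for illustration; they do not describe any existing Data.One dataset.

```python
import json

# Hypothetical JSON payload; field names and values are illustrative only.
SAMPLE_JSON = """
{
  "updated": "2013-12-04T14:00:00+08:00",
  "readings": [
    {"station": "Station A", "temperature_c": 18},
    {"station": "Station B", "temperature_c": 17}
  ]
}
"""

data = json.loads(SAMPLE_JSON)

# Each value is addressed by name; no text pattern matching is needed.
temperatures = {r["station"]: r["temperature_c"] for r in data["readings"]}
print(temperatures["Station A"])
```

With named fields, a program reads values directly and keeps working even if the human-readable wording of a report changes.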
24. Data.One
● JSON/XML (18 datasets)
  – Air Pollution.
    ● Past 24-hour Air Pollution Index from stations.
  – Approved Charitable Fund-raising Activities.
  – Restaurant and Food Licences.
  – Details of facility locations.
  – Reward Notices from Police Force.
  – Marine Traffic (Arrival/Departure).
  – Traffic Speed and special news.
  – EventHK information.
25. Data.One
● RSS (10 datasets)
  – Weather Information (7 datasets)
  – Beach Water Quality (1 dataset)
  – Current Air Pollution Index range and forecast (2 datasets)
27. Data.One
● CSV
  – Locations of Public Facility and GovWifi.
  – Past Record of Air Pollution Index.
  – Marine Shipping directory of HK.
● HTML
  – HTML version of Marine Traffic.
● XLS, MDB
  – 2011 Population Census.
  – Property Market Statistics.
  – Monthly Digested Stats and Registers of Auth Persons from Building Dept.
  – Routes and fares of public transport.
28. Data.One
● Many departments do not release their useful data; they release only the information already available on their websites.
  – Few of them make open data available on their own.
● Most of them do not understand what 'real' open data is.
  – Open data formats instead of proprietary data formats.
  – Data instead of information.
  – Usefulness of the data.
● Some departments should manage their open data in a better data structure.
31. LegCo Meeting Minutes and Voting Results
● In October 2013, LegCo started to publish voting results of the House Committee in XML.
● It is not a part of the Data.One project.
● My open source software for the LegCo vote result XML:
  – http://github.com/sammyfung/legcovotes
32. Digital21 Strategy
Public Consultation Document
(G) Public Sector Information (PSI) as Default
"34. Through different channels (like press releases, publications, websites, etc.), the
Government releases a lot of information in different areas. However, most of such
information can only be read but cannot be used. In view of the immense benefits of
widening access to PSI for free and easy re-use, we propose to make all Government
information released for public consumption machine-readable by default. Where
appropriate, datasets will be released with application programming interfaces (APIs),
providing predefined functions to make their retrieval easier."
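(G) Making Public Sector Information Widely Available (translated from the Chinese version)
"34. The Government releases a large amount of information in different areas through various channels (e.g. press releases, publications, websites). However, most of this information can only be read and cannot be used. In view of the great benefits of opening up public sector information for free re-use, we propose that all Government information released for public use be prepared in digital format. Where applicable, application programming interfaces will be released together with the data to provide predefined functions, so that the public can retrieve the data easily."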
33. Digital21 Strategy
Public Consultation Document
"33. PSI datasets can be used and meshed together to create innovative new applications, as
demonstrated by the creative and useful products and services developed from PSI in Hong Kong
and around the world. For example, using PSI datasets on traffic snapshot images, a number of
mobile apps have been developed to provide real-time traffic situation for users to avoid traffic jams
in planning their traffic routes. Experience from other developed economies shows that widening
access to PSI datasets can open up lucrative business opportunities and bring social benefits. By
tapping the creativity of the community and entrepreneurs, the use of PSI can lead to positive social
outcomes. For instance, in some cities in the United States, application of PSI on hygiene inspections
has led to a significant drop in food poisoning incidents."
35. Digital21 Strategy
Public Consultation Document
"35. Apart from Government data, there are vast amounts of PSI handled,
collected and disseminated by public organisations, which are equally useful
for the development of innovative services and products. Therefore, we
propose to encourage public organisations (e.g. public utilities and transport
operators) to release data owned by them in machine-readable format."
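(Translated from the Chinese version)
"35. Apart from Government information, Hong Kong also has a large amount of public sector information handled, collected and released by public organisations, which is equally useful for developing innovative services and products. We therefore propose to encourage public organisations (e.g. public utilities and transport operators) to release information prepared in digital format."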
38. Web Scraping
● A computer software technique of extracting information from websites. (Wikipedia)
● For business, hobbies, research purposes.
39. Web Scraping
● Look for the right URLs to scrape.
● Look for the right content in the webpages.
● Save the data into a data store.
● When to run the web scraping program?
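The questions above can be sketched as a tiny pipeline. This is a minimal illustration using only the Python standard library, run against an inline HTML string instead of a live site; the page markup, the `reading` class, the table name and the URL are all hypothetical.

```python
import sqlite3
from html.parser import HTMLParser

# Step 1: the URL to scrape (hypothetical; this sketch never fetches it).
URL = "http://example.com/weather"

# Stand-in for the downloaded page.
PAGE = """
<html><body>
<table>
  <tr class="reading"><td>Station A</td><td>18</td></tr>
  <tr class="reading"><td>Station B</td><td>17</td></tr>
</table>
</body></html>
"""


class ReadingParser(HTMLParser):
    """Step 2: collect the cell text of every <tr class="reading"> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None       # cells of the row being read, or None
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr" and ("class", "reading") in attrs:
            self._row = []
        elif tag == "td" and self._row is not None:
            self._in_cell = True

    def handle_data(self, data):
        if self._in_cell:
            self._row.append(data.strip())

    def handle_endtag(self, tag):
        if tag == "td":
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self.rows.append(tuple(self._row))
            self._row = None


def scrape(page):
    """Return (station, temperature) pairs found in the page."""
    parser = ReadingParser()
    parser.feed(page)
    return [(station, int(temp)) for station, temp in parser.rows]


def save(rows, conn):
    """Step 3: store the extracted rows in a data store (SQLite here)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS readings (url TEXT, station TEXT, temp_c INTEGER)"
    )
    conn.executemany(
        "INSERT INTO readings VALUES (?, ?, ?)",
        [(URL, station, temp) for station, temp in rows],
    )
    conn.commit()


conn = sqlite3.connect(":memory:")
rows = scrape(PAGE)
save(rows, conn)
print(conn.execute("SELECT station, temp_c FROM readings").fetchall())
```

The last question, when to run the program, is left out of the sketch; it depends on how often the source page is updated and is usually handled by a scheduler outside the script.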
40. Use of Open Source Software in Web Crawling
● Use Open Source Tools to collect useful and meaningful machine-readable data.
● No need to wait for the provider to release data in machine-readable format.
41. Open Source Tools
● Python programming language
● with Regular Expression library
● Scrapy web crawling framework
42. Why Python + Scrapy?
● Python: my favourite programming language for the past few years.
● Scrapy: a web crawling framework written in Python.
43. What is Scrapy?
● An open source web scraping framework for Python.
● Scrapy is a fast high-level screen scraping and web crawling framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.
44. Scrapy Features
● Define the data you want to scrape
● Write a spider to extract the data
● Built-in: selecting and extracting data from HTML and XML
● Built-in: JSON, CSV, XML output
● Interactive shell console
● Built-in: web service, telnet console, logging
● Others
46. Programme List of Paid TVs in 2004
● I wanted to know which channel was showing a live football match.
● Paid TV web site = M$ + IIS + ASP + Flash
● Slow....... Very Slow...... Extremely Slow!
● Couldn't connect at any peak hours!
● Wrote my first web crawler in PHP in 2004.
47. Public Transportation in 2006-2010
● Kowloon Motor Bus (KMB)
  – No map view for a bus route.
● Public Transportation Enquiry System (PTES)
  – Extremely poor, ugly (or much worse) map UI on PTES.
48. HK Observatory and Joint Typhoon Warning Center
● Is any typhoon coming to Hong Kong? And when will it come?
● No easy data exchange format.
● No RSS nor ATOM.
● We don't check websites every day.
69. Agenda
● What is Open Data?
● Use of Open Source Software in web crawling.
● Starting a new Open Source project, hk0weather, to create Open Weather Data.
70. We want an easier way to access public data.