5. Canonical Web 2.0
• Amazon
– Customer Reviews
– Amazon Recommends
• Google
– PageRank™
– Making money out of links
– Google Mail, Maps, APIs, Desktop Search, ...
6. Web 2.0 Technology (nothing new)
• Minimum
– CGI (e.g. Perl, PHP, Python, C/C++)
– Database (e.g. MySQL, Postgres, Oracle)
• More recent additions
– Java
– XML
– Web Services
– AJAX
– Ruby on Rails
8. Google PageRank™
• Sergey Brin and Lawrence Page (Stanford, 1995)
• Intuition behind PageRank:
– Web is a network (graph) connected by links
– A link is a "vote" for the destination page
– Strength of vote is a fraction of the PageRank
of the page casting the vote
9. PageRank of a page is the
probability of a random
surfer arriving at that page
after many clicks.
(By Markov Theory)
15. Semantic Web
The Semantic Web is a graph-based
knowledge representation of data, spanning
the Web, traditional databases, the desktop
and mobile devices.
16. Friend of a Friend (FOAF)
"The FOAF project is about creating a Web of
machine-readable homepages describing people, the
links between them and the things they create and do."
http://www.foaf-project.org/
18. PARIP
• PARIP = Practice As Research In Performance
– 5 year national project
– Led by University of Bristol's Department of Drama:
Theatre, Film, Television
– Professor Baz Kershaw and Dr Angela Piccini
• PARIP Explorer
– Innovative contacts and research database
– Developed by ILRT
– Semantic Web technology
20. PARIP - User Perspective
• Dual interface:
– Text View cross-database search-engine
– Map View visual link discovery and browsing
21. PARIP - Technical Perspective
• Semantic Web: RDF/XML and FOAF
• Prolog running as a Web Service (WSDL+SOAP)
• SPARQL query interface for programmatic access
• XHTML AJAX client
• Visualisation via Flash
24. Exabyte Scale Informatics
• 1 Exabyte = 1018
bytes
i.e. 1,000,000,000,000,000,000 bytes
• 1 Exabyte is approximately everything ever:
• written,
• composed,
• filmed,
• painted
• or in any other way 'recorded' by humans.
• Manual classification and retrieval is inadequate;
machine learning and data mining are essential.