Ld4d 2013 part 2

Slides for the second part of the Linked Data for Development (LD4D) Tutorial, held at WSSF2013 in Montreal, Canada.

In this presentation I talk about downscaling the Semantic Web, taking into account issues around 1) infrastructure and hardware, 2) interfaces, and 3) relevant data.

Usage Rights

CC Attribution-ShareAlike License

  • More affiliations and who I am. Choice point. Martin Murillo.
  • We all love the Web, and all of us appreciate the influence it has had on our social, political and economic lives. This empowerment of businesses, people and societies through the sharing of knowledge is reflected in the rapid growth of the Internet and the World Wide Web. Yet 4.5 billion people are still unconnected to the Web.
  • Spice this up. Relate this to W4RA.
  • Information sharing needs. Arrange icons properly. Actually built and running.
  • Using voice technologies, we make the benefits of the Web available to those with simple mobile phones.
  • IATI was on the Web of Data, but no longer. However… they do have an API, so….
  • Also Applications!
  • Emphasise that the SE staff member records in multiple languages; that there is a user profile, which includes language; that farmers get the information in their language; and that they can respond and the response is stored in a database.

Ld4d 2013 part 2 Presentation Transcript

  • Linked Data for Development: Part 2: Downscaling Linked Data Victor de Boer With significant input from Christophe Guéret, Martin Murillo, Stephane Boyera, Stefan Schlobach, Bernie Innocenti, Walter Bender, Claudia Urrea, Anna Bon, Hans Akkermans, Nana Gyan, Amadou Tangara, Mary Allen, …
  • LD4D at ISWC2012
  • Outline
    • Part 2:
      – Why Linked Data for Development
      – Bringing the Semantic Web and Linked Data to the Base of the Pyramid
        • Relevancy
        • Infrastructure and connectivity
        • Interfaces
      – IATI as Linked Data
      – Voice-based access to Market data in the Sahel
      – Distributed data sharing: OLPC and ERS
    • Part 3: Hands-on session!
  • CAUTION! DIGITAL DIVIDE AHEAD Img: Internet World Stats
  • Digital divide in classrooms
  • ICT4D
    • Technology is a development tool
      – Education
      – Healthcare
      – Livelihood
      – etc.
    • Leveraging communication independently of physical/geographical barriers
    • Improving transparency, accountability, efficiency of governments
    • Developing nations can leapfrog directly into the information age, jumping many phases of immature technologies
    Based on Sbc4d.com
  • Information sharing needs
    • Agriculture
      – Market Prices
      – Business opportunities
      – Support
      – Sharing indigenous knowledge
      – Etc.
    • Health
      – Prevention
      – Access to healthcare
      – Detection of disease outbreak
      – etc.
    • Education
    • Etc.
    Based on Sbc4d.com
  • Web Alliance for Regreening in Africa (W4RA): Information exchange and knowledge sharing in rural Africa Washington, 13-15 May 2013
  • World Wide Web as Instrument of Empowerment Sir Tim Berners-Lee, inventor of the Web: “Our success will be measured by how well we foster the creativity of our children. Whether future scientists have the tools to cure diseases. Whether people, in developed and developing economies alike, can distinguish reliable information from propaganda or commercial chaff. Whether the next generation will build systems that support democracy and accountable debate. I hope that you will join this global effort to advance the Web to empower people.”
  • Why the Semantic Web?
    • Information (from NGOs) in silos
      – Specific products
      – Specific communities
    • A lot of knowledge is lost due to lack of publication → Sharing (heterogeneous) knowledge is essential
    • LD is well-suited because it is:
      – Language-agnostic
      – Interface-agnostic
      – Suited to de-centralised authoring
        • Slicing
      – Re-usable
        • Local
        • Global
    Img: flickr/elcovs
  • Why linked data (1/2) Slide stolen from Christophe Gueret
  • Why linked data (2/2) Slide stolen from Christophe Gueret
  • Web of Documents (WWW) Linked Documents
  • Web of Data Linked Data
  • Barriers to the Internet 1. Technology: The lack of connectivity and electricity, cost of devices and cost of connection are limiting the adoption and usage of new technologies; 2. Capacity: Lack of time and resources limits the participation in data sharing processes. There are also issues related to low education levels, low capacity to interpret data, and illiteracy; 3. Relevance: Power balance, culture, apathy, lack of incentives, lack of interest and dis-empowerment are also all threats to having citizens engage in data sharing. Stephane Boyera (SBC4D.com)
  • Semantic technologies / Linked Data should be made 1. usable on small, affordable hardware deployed in various connectivity contexts (Infrastructure); 2. accessible to individuals with varied cultural backgrounds / literacy levels (Interface); 3. relevant and directly useful to the target public they aim to empower (Relevancy).
  • Infrastructure • No internet • No bandwidth • No computer • No electricity • Cost – Total cost of ownership
  • Interface • Low literacy • Low education • Small languages • Low capacity to interpret data, and illiteracy
  • Relevancy • No local content • No local ownership • Power balance, culture, apathy, lack of incentives, dis-empowerment Subsecretario de transparencia, Alcaldes y la gente http://www.youtube.com/watch?v=q0S3juRQXR0 Max Rodriguez
  • New ways of connecting to the (Semantic) Web
  • Mobile phones
  • Radio •No. 1 source of information •Interactive radio programs •Huge listening base
  • Low-powered hardware • OLPC XO laptop • Raspberry Pi • Sheevaplug etc.
  • With the mainstream
    • Developing countries can leapfrog directly into the information age,
      – jumping many phases of immature technologies
    • Linked Data is mainstream computer science research.
      – Let’s worry about the 4.5 B unconnected prosumers now!
    Img: flickr/n3v3rv0id
  • Voice-based Web access in Africa
  • • Integrate local community radios and mobile ICT for knowledge sharing • Better support and integrate local languages in voice-based services – Development of appropriate speech elements (text-to-speech and speech recognition) • Develop a free and open source toolbox for local developers. – Investigate self-sustainability – Develop appropriate business models – In collaboration with local communities.
  • Bottom-up
    • Involvement of local communities
      – Trust and ownership
      – Co-creation
    • Bottom-up: field visits, workshops, demos, roadshows, etc.
    • Local communities: innovation co-creation, “Living Labs” socio-technical approach
      – Use case gathering
      – Observation and prototyping
      – Test, adapt
  • From 20 use cases to 3 voice systems
    1 m-Milk ordering and delivery service of Tominian Milk producers and NGO
    2 m-Tree protection alert service Sahel Eco Farmers and NGO
    3 mobile-web Event organizer for vaccination of herds Farmers
    4 m-Farmer-expert directory service Farmer organization
    5 NGO info-line about legal issues in several languages Sahel Eco
    6 Leave announcement or select your favourite song Radio
    7 Shea butter and honey trading service Radio and Sahel Eco
    8 Access radio programs and announcements on your phone Radio
    9 Gourcy seed producers seed certification service Farmer organization
    10 Radio questions and answers about agricultural issues Radio
    11 m-collective purchase organizing service Local buyers
    12 m-GIS regreening service Sahel Eco
    13 m-Farmer social network Sahel Eco
    14 mobile-web regional market system Farmer organization
    15 Sahel Eco portal to Regreening and access to m-services Sahel Eco
    16 m-event organizer for re-greening events Sahel Eco, farmers
    Voice systems: Market Information, Citizen Journalism, Event Organiser
  • Local market data Communiqué Web Interface Text-To-Speech GSM/Voice interface Sahel Eco operative Buyers Community radio
  • “Slot and Filler” Text-to-Speech • Spoken Language Elements Repository • English: “15 liters of honey offered by Zakari Diarra” • Bambara: 15_ba.wav L_ba.wav Of_ba.wav
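The slot-and-filler idea on this slide can be sketched in a few lines: a sentence template has fixed words and variable slots, and each filled slot maps to a pre-recorded audio fragment named `<element>_<language>.wav`. This is only an illustration, not the RadioMarché implementation. The `TEMPLATES` table, the slot order, and the fragment names beyond the three shown on the slide (`15_ba.wav`, `L_ba.wav`, `Of_ba.wav`) are assumptions.

```python
# Hypothetical template: the order of fixed words and slots for one language.
# A real deployment would need one template per language, because word
# order differs between languages.
TEMPLATES = {
    "ba": ["{quantity}", "{unit}", "Of", "{product}", "offered_by", "{contact}"],
}


def communique_playlist(lang, **slots):
    """Return the ordered audio fragments that together speak one offering."""
    template = TEMPLATES[lang]
    # Fill each slot, then map the resulting element to its recorded fragment.
    return [f"{part.format(**slots)}_{lang}.wav" for part in template]


playlist = communique_playlist(
    "ba", quantity=15, unit="L", product="honey", contact="zakari_diarra"
)
print(playlist)
```

The first three entries of `playlist` are the fragments named on the slide. Playing the files back to back yields the spoken communiqué without a full text-to-speech engine for the language.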
  • VoiceXML (DTMF = Dual-tone multi-frequency signaling)
    <?xml version="1.0" encoding="ISO-8859-1"?>
    <vxml version="2.0" xml:lang="en">
      <form>
        <field name="choice">
          <prompt bargein="false">
            Welcome to RadioMarche!
            <audio src="audio/communique_1_bambara.wav"/>
          </prompt>
          <option dtmf="1" value="1">Press one for X</option>
          <option dtmf="2" value="2">Press two for Y</option>
          ...
        </field>
      </form>
    </vxml>
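A document like the one on this slide can also be generated from data, which helps when the same menu is needed for several communiqués or languages. The sketch below uses Python's standard `xml.etree.ElementTree`. It swaps the slide's `<form>`/`<option>` structure for VoiceXML's `<menu>`/`<choice>` elements, and the jump targets `#offerings` and `#contacts` are hypothetical.

```python
import xml.etree.ElementTree as ET

VXML_NS = "http://www.w3.org/2001/vxml"


def build_menu(audio_src, choices):
    """Build a VoiceXML 2.0 menu: one audio prompt plus DTMF choices."""
    vxml = ET.Element("vxml", {"version": "2.0", "xmlns": VXML_NS})
    menu = ET.SubElement(vxml, "menu", {"id": "main"})
    # bargein="false": the caller cannot interrupt the prompt with a key press.
    prompt = ET.SubElement(menu, "prompt", {"bargein": "false"})
    ET.SubElement(prompt, "audio", {"src": audio_src})
    for key, target in choices:
        choice = ET.SubElement(menu, "choice", {"dtmf": str(key), "next": target})
        choice.text = f"Press {key}"
    return ET.tostring(vxml, encoding="unicode")


doc = build_menu(
    "audio/communique_1_bambara.wav",
    [(1, "#offerings"), (2, "#contacts")],  # hypothetical dialog targets
)
print(doc)
```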
  • Foroba Blon
  • Web for ALL. Using voice technologies and available tools… we make the benefits of the Web available to people who use simple mobile phones.
  • Results
    • RadioMarché -- Increased market for farmers.
      – Political, social, economic and ecological factors play a great role
      – Too successful: not the entire value chain is served
    • Foroba Blon -- Facilitating rural citizen journalism.
      – Privacy and security
      – New business models
    Voice platform with reusable components for different use cases.
  • Linked Data for RadioMarche
  • http://semanticweb.cs.vu.nl/radiomarche
  • Linked Market Data
    • 1,952 RDF triples
      – 90 offerings
      – 19 contacts
    • Links to
      – Data: DBPedia, GeoNames, Agrovoc
      – Vocabularies: Foaf, GoodRelations
    [Diagram: Local market data; Data / communique layer; Interface handler layer (Web, Email, SMS, GSM/Voice); Local radio; Farmers (producers); Buyers (consumers)]
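To make "offerings and contacts as RDF triples" concrete, here is a minimal sketch that writes one offering and its contact as Turtle using only string formatting. The `rm:` namespace URI is a placeholder, and the way the real dataset applies GoodRelations and FOAF is not shown on the slides, so the class choices (`gr:Offering`, `foaf:Person`) are assumptions. The identifiers `offering0001`, `Mazankuy_Diarra` and the property `rm:has_contact` come from the Speakle slide.

```python
PREFIXES = {
    "rm": "http://example.org/radiomarche/",  # placeholder, not the real namespace
    "gr": "http://purl.org/goodrelations/v1#",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
}


def offering_to_turtle(offering_id, product_label, contact_id, contact_name):
    """Serialise one market offering and its contact person as Turtle."""
    # Note: labels are inserted verbatim; a real serialiser must escape quotes.
    lines = [f"@prefix {prefix}: <{ns}> ." for prefix, ns in PREFIXES.items()]
    lines += [
        "",
        f"rm:{offering_id} a gr:Offering ;",
        f'    rdfs:label "{product_label}"@en ;',
        f"    rm:has_contact rm:{contact_id} .",
        "",
        f"rm:{contact_id} a foaf:Person ;",
        f'    foaf:name "{contact_name}" .',
    ]
    return "\n".join(lines)


ttl = offering_to_turtle(
    "offering0001", "Shea Nuts", "Mazankuy_Diarra", "Mazankuy Diarra"
)
print(ttl)
```

Because the vocabularies are shared, a consumer that already understands GoodRelations or FOAF can read such data without knowing anything about RadioMarché itself.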
  • Sharing across regions/NGOs [Diagram: the RadioMarché market information system and a RadioMarché in a second region, each with Local market data, a Data / communique layer, an Interface handler layer (Web, Email, SMS, GSM/Voice), Local radio, Farmers (producers) and Buyers (consumers), connected through a Data / communique platform]
  • Re-use: EcoMash Henk Kroon
  • Speakle voice labels
    rm:offering0001 rm:has_contact rm:Mazankuy_Diarra .
    rm:1000 rdfs:label "1000" ; speakle:voicelabel_ba rm:audio_1000_ba.wav ; speakle:voicelabel_nl rm:audio_1000_nl.wav .
    rm:kilo rdfs:label "kilo"@en ; speakle:voicelabel_ba rm:audio_kilo_ba.wav ; speakle:voicelabel_nl rm:audio_kilo_nl.wav .
    rm:shea_butter rdfs:label "Amande de Karité"@fr , "Shea Nuts"@en ; speakle:voicelabel_ba rm:audio_shea_ba.wav ; speakle:voicelabel_nl rm:audio_shea_nl.wav .
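A voice application can use these labels by looking up, for each resource, the audio file recorded in the caller's language. The sketch below holds the slide's triples in a plain Python list instead of a triple store. The lookup function and the choice to return `None` when a recording is missing are illustrative assumptions, not part of Speakle.

```python
# Triples copied from the slide, written as (subject, predicate, object).
TRIPLES = [
    ("rm:1000", "rdfs:label", '"1000"'),
    ("rm:1000", "speakle:voicelabel_ba", "rm:audio_1000_ba.wav"),
    ("rm:1000", "speakle:voicelabel_nl", "rm:audio_1000_nl.wav"),
    ("rm:kilo", "rdfs:label", '"kilo"@en'),
    ("rm:kilo", "speakle:voicelabel_ba", "rm:audio_kilo_ba.wav"),
    ("rm:kilo", "speakle:voicelabel_nl", "rm:audio_kilo_nl.wav"),
    ("rm:shea_butter", "rdfs:label", '"Shea Nuts"@en'),
    ("rm:shea_butter", "speakle:voicelabel_ba", "rm:audio_shea_ba.wav"),
    ("rm:shea_butter", "speakle:voicelabel_nl", "rm:audio_shea_nl.wav"),
]


def voice_label(resource, lang):
    """Return the audio resource for `resource` in `lang`, or None if absent."""
    wanted = f"speakle:voicelabel_{lang}"
    for subject, predicate, obj in TRIPLES:
        if subject == resource and predicate == wanted:
            return obj
    return None


# Speak "1000 kilo shea butter" in Bambara by chaining the voice labels.
utterance = [voice_label(r, "ba") for r in ("rm:1000", "rm:kilo", "rm:shea_butter")]
print(utterance)
```

Keeping the recordings as properties of the data, next to the text labels, means a new voice application can reuse them without re-recording.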
  • https://github.com/biktorrr/speakle
  • Voice browser [Flow diagram: Welcome → Choose application and language (dtmf 1, 2, 3) → About which product (EN) / About which product (NL) / List all products (EN) → dtmf 1..n → List product offerings] Tel: +31208080855 Skype: +990009369996162208
  • Current status
    • Linked Market Data
      – Locally created
      – Linked Data makes re-use possible (NGO, others)
      – LD voice labels
        • Can be (re)used to develop voice applications with this data
    • To go beyond proof-of-concept
      – More localization needed
      – Local hardware/services (Emerginov / OfficeRoute)
      – User testing
      – More sophisticated translations (VoiceSPARQL)
  • Infrastructure Interface Relevancy
  • Icon-based interaction
  • Icon-based interaction NCR ATM interface for illiterate 'grammar' - ISOTYPE by Otto Neurath available at http://imaginarymuseum.org/MHV/PZImhv/NeurathPictureLanguage.html
  • Crowdsourcing voice fragment gathering
  • One Laptop Per Child (OLPC), Sugar and the Entity Registry System Bernie Innocenti, Walter Bender, Christophe Guéret, Claudia Urrea
  • OLPC mission and vision • Develop (and deploy) a low-cost laptop in order to revolutionize how we educate the world's children • What motivates learning is not carrots or sticks, but rather: – autonomy, – mastery, and – a sense of purpose. • A laptop makes learning more flexible: Children learn by teaching and actively helping each other; the teacher is free to focus expertise where it is needed
  • How is learning with the XO different? OLPC Computer for learning Student-centric Teacher as mentor Voice, text Learning to learn Critical thinking
  • Sugar • Operating system for XO laptops • Learner centric • Activities (Apps)
  • Different activities
  • The numbers (2012) • 2,000,000+ children with XOs • 1,000,000,000 children w/o laptops • 150+ language projects • 40+ countries • 500+ Sugar activities
  • Efficient Knowledge sharing with SemanticXO and ERS
  • Mesh VS Infrastructure network
  • Christophe Gueret
  • Hybrid solution http://www.firstmilesolutions.com/documents/DakNet_IEEE_Computer.pdf
  • Sneakernet Latency Throughput “Never underestimate the bandwidth of a station wagon full of tapes hurtling down the highway.” —Andrew Tanenbaum
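The latency/throughput trade-off behind the quote is simple arithmetic: nothing arrives until the carrier does, but once it arrives the whole payload arrives at once. A small sketch, with a 32 GB storage device and a two-hour trip chosen purely as illustrative numbers:

```python
def sneakernet(payload_gigabytes, trip_hours):
    """Latency (s) and effective throughput (Mbit/s) of carrying data by hand."""
    payload_bits = payload_gigabytes * 8e9   # 1 GB = 8 * 10^9 bits
    latency_seconds = trip_hours * 3600      # nothing arrives before the trip ends
    throughput_mbps = payload_bits / latency_seconds / 1e6
    return latency_seconds, throughput_mbps


latency, throughput = sneakernet(payload_gigabytes=32, trip_hours=2)
print(f"latency: {latency:.0f} s, throughput: {throughput:.1f} Mbit/s")
# prints "latency: 7200 s, throughput: 35.6 Mbit/s"
```

The throughput is respectable while the latency is measured in hours, so the approach fits bulk synchronisation of data rather than interactive use.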
  • Infrastructure Interface Relevancy
  • Linked Data for IATI Kasper Brandt, Victor de Boer
  • Introduction - IATI “IATI is a voluntary, multi-stakeholder initiative that seeks to improve the transparency of aid in order to increase its effectiveness in tackling poverty.” As of 2013, over 150 donors, NGOs and governments have registered at IATIregistry.org by publishing their aid activities in this XML standard. Now: 180+.
  • Introduction - IATI users
    • Funders
      o Where is the money of my organisation spent?
      o Where do other organisations spend their money?
    • Governments
      o How much money is spent in my country?
      o What are the budgets or planned disbursements for my country?
    • Locals
      o What organisations are working in my area?
      o What projects are currently going on in my area?
    • Public
      o Where is my tax money going?
      o What are the organisations doing with my donations?
  • Introduction - IATI model Activities Organisations
  • Introduction - Why IATI Linked Data? 1. Reusable vocabularies o Extract information automatically from the IATI data by making use of applications which are able to interpret standard vocabularies 2. Enrich IATI data o Link IATI data to external datasets in order to enrich the IATI data with additional information or metadata. 3. Donors can use their own Linked Data specification. o @Linked-data-uri attribute already exists in the IATI model.
  • Model and links based on requirements elicited from experts Iterative Requirements Engineering Process Model by Loucopoulos and Karakostas
  • Linked Data model - Example iati:activity/GB-CHC-285776-CHA024 iati:activity-transaction iati:activity/GB-CHC-285776-CHA024/transaction/42737 . iati:activity/GB-CHC-285776-CHA024/transaction/42737 iati:transaction-tied-status iati:codelist/TiedStatus/5 .
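The URI pattern in this example is regular enough to generate from the identifiers in an IATI record. The sketch below rebuilds the two triples above and prints them as N-Triples. The expansion of the `iati:` prefix to `http://purl.org/collections/iati/` is an assumption based on the dereferenceable URI quoted on the triple store slide, and the helper names are hypothetical.

```python
IATI = "http://purl.org/collections/iati/"  # assumed expansion of the iati: prefix


def activity_uri(activity_id):
    return f"{IATI}activity/{activity_id}"


def transaction_uri(activity_id, transaction_id):
    return f"{activity_uri(activity_id)}/transaction/{transaction_id}"


def transaction_triples(activity_id, transaction_id, tied_status_code):
    """Link an activity to one transaction, and the transaction to its tied status."""
    activity = activity_uri(activity_id)
    transaction = transaction_uri(activity_id, transaction_id)
    return [
        (activity, f"{IATI}activity-transaction", transaction),
        (
            transaction,
            f"{IATI}transaction-tied-status",
            f"{IATI}codelist/TiedStatus/{tied_status_code}",
        ),
    ]


def to_ntriples(triples):
    # Every term here is a URI, so each one is wrapped in angle brackets.
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


triples = transaction_triples("GB-CHC-285776-CHA024", 42737, 5)
print(to_ntriples(triples))
```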
  • Linked Data model - Provenance
    • On file level
      o Not on activity level
    • A named graph per file, e.g.: iati:graph/dataset/Worldbank
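File-level provenance with named graphs means every triple carries a fourth term naming the file it came from. A minimal sketch, assuming N-Quads as the exchange format and the same assumed `iati:` namespace expansion (`http://purl.org/collections/iati/`); neither is confirmed by the slides, and the function names are hypothetical:

```python
from collections import Counter

IATI = "http://purl.org/collections/iati/"  # assumed expansion of the iati: prefix


def graph_uri(dataset_name):
    """One named graph per source file, e.g. iati:graph/dataset/Worldbank."""
    return f"{IATI}graph/dataset/{dataset_name}"


def to_nquads(triples, dataset_name):
    """Tag every triple from one file with that file's graph URI."""
    graph = graph_uri(dataset_name)
    return [f"<{s}> <{p}> <{o}> <{graph}> ." for s, p, o in triples]


def triples_per_graph(quads):
    # The graph URI is the fourth term, just before the closing " ."
    return Counter(line.rsplit(" ", 2)[-2] for line in quads)


activity = f"{IATI}activity/GB-CHC-285776-CHA024"
transaction = f"{activity}/transaction/42737"
file_triples = [
    (activity, f"{IATI}activity-transaction", transaction),
    (transaction, f"{IATI}transaction-tied-status", f"{IATI}codelist/TiedStatus/5"),
]

# Illustration only: the slides do not say which file these triples came from.
quads = to_nquads(file_triples, "Worldbank")
counts = triples_per_graph(quads)
print(counts)
```

Counting triples per graph in this way is also how one would find the largest publisher graph in a store.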
  • Linked Data model - Vocabularies
  • Linked Data model - Triple store [Diagram labels: Python, RDFLib, RDF/Turtle]
    • Triples loaded into a ClioPatria triple store:
      o http://semanticweb.cs.vu.nl/iati/
      o SPARQL endpoint
      o Dereferenceable URIs (http://purl.org/collections/iati/codelist/Sector/11420)
    • Total number of triples: 36,629,017
    • Total number of named graphs: 4,790
      o Largest activities graph is UNOPS, containing 1,231,896 triples
    • Takes approximately 30 minutes to load all data into the triple store.
  • Linking datasets - Approach 1. In total, how much does a given country receive in aid? 2. A comparative index of aid versus the Human Development Index. 3. What is the geographic location of a project? How much aid went to a given province, constituency or village? o Is the aid spent in places where the need is highest? Is it well distributed across the country? o Can we attribute sub-national breakdowns for aid so we can see how much goes to different parts of recipient countries? 4. How does violent conflict in recipient countries affect aid activities? 5. How does aid spending as registered in the IATI standard compare to World Bank indicators?
  • Linking datasets
  • Linking Data applications - Approach 1. In total, how much does a given country receive in aid? 2. A comparative index of aid versus the Human Development Index. 3. What is the geographic location of a project? How much aid went to a given province, constituency or village? o Is the aid spent in places where the need is highest? Is it well distributed across the country? o Can we attribute sub-national breakdowns for aid so we can see how much goes to different parts of recipient countries? 4. How does violent conflict in recipient countries affect aid activities? 5. How does aid spending as registered in the IATI standard compare to World Bank indicators?
  • http://iati2lod.appspot.com/ 1. In total, how much does a given country receive in aid?
  • http://iati2lod.appspot.com/ 2. A comparative index of aid versus the Human Development Index.
  • http://iati2lod.appspot.com/ 4. How does violent conflict in recipient countries affect aid activities? 5. How does aid spending as registered in the IATI standard compare to World Bank indicators?
  • Links to DBPedia Theme: “Food aid emergencies” IDS: document 0001 Theme: “Food Security” Analysis of approaches to understanding and addressing food security issues; examination of the structural causes of food insecurity and different policy responses “Voedselzekerheid”@NL DBPedia: “Food Security” Person: “David Pimentel” Organisation: “FAO”
  • Links to IATI Theme: “Education” IDS: document 0003 Theme: “Higher education” Degree and diploma programmes at universities, colleges and polytechnics; scholarships. IATI Sector: “Higher Education” Organisation: UN Habitat Activity: Multi donor fund to support civil society in democracy related issues
  • Linked Data for Landportal.info • The Land Portal is an easy access, easy-to-use platform to share land related information, to monitor trends, and identify information gaps to promote effective and sustainable land governance. [M.Sc. thesis by Alan Chavoshe]
  • Nichesourcing for pluvial data digitization for the Sahel [M.Sc. thesis by Binyam Tesfa]
  • Linked Data for Development (LD4D) Inst. of Development Studies LOD Sahel Pluvial data SemanticXO Citizen Journalism data DBpedia GeoNames Agrovoc RadioMarché Linked market data IATI data
  • Infrastructure Interface Relevancy
  • Take home • Knowledge sharing is a tool for development • Linked Data is well-suited because of – Language- and interface agnostic characteristics – Decentralizability – Reusability outside of original context • Downscaling – Interface – Infrastructure – Relevancy Img: flickr/TomJByrne
  • What we need from you? • Data • Cases – Transparency, Governance, Democracy – Economic development, Healthcare • Reflection – Ethics of ICT4D • Open Data • Linked Data Img: flickr/wetwebwork
  • More information? http://worldwidesemanticweb.org http://w4ra.org http://iati2lod.appspot.com/ http://victordeboer.com v.de.boer@vu.nl
  • The Tabale Platform [Diagram: NGO staff record multiple messages in different languages (Bomu, Bambara, Malian French); responses: yes, no, don’t know]
  • VUI design (three languages)