Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Mobipedia
Building a Mobile Applications Knowledge Base for
the Linked Data Cloud
Primal Pappachan, Roberto Yus, Prajit Ku...
Lots of Smartphones
Lots of Apps
1.6M Android apps
Apps need Semantics!
international semantic
web conference apps
iswc 2015 iswc
semantic web
conference apps
Not so uniform access
• App stores
• PlayDrone
• Privacy Grade
• Android Permissions website
wow
Much non-semantics
Such a...
Who you gonna call?
Our Goal
Build a Knowledge Base for apps
1. Create an ontology for apps
2. Identify interesting sources
3. Add semantics t...
Apps Ontology
Existing ones
• Dubline Core
• DOAP (Description of a Project)
Apps Ontology
Data Properties
Mobipedia Overview
Current Sources
1. Scalable Google Play Store Crawler:
includes 1.1 M apps (Viennot et. al
2014)
2. Website which assigns ...
PlayDrone Project
https://archive.org/details/android_apps
PrivacyGrade
http://privacygrade.org/
Permissions!
http://developer.android.com/reference/android/Manifest.permission.html
Extraction Process
• Download the data
• JSON files (PlayDrone)
• HTML files (PrivacyGrade, Android
Permissions)
• Crawler...
Semantic Labeling
• Match data with entities
o Mobipedia’s ontology
o Custom code
• Create RDF data
o We used the OWL API
Access Mechanisms
Linked Data
Mobipedia
Not to scale
Linking with DBpedia
• Why not Xpedia?
o We couldn’t find other KBs talking about Apps
• Found two categories related to a...
Benefits
• Centralized repository with different sources
• Common format for representation of Apps
• Derive inferences fr...
Challenges
• Stayin’ relevant!
• Stayin’ accurate!
• Stayin’ alive!
• Stayin’ alive!
• Ah, ha, ha, ha!
23
Mobipedia
Usecases
SPARQL Query Support
• SELECT App
• WHERE
o App description contains “flashlight” and
o App has version Version
o Version ...
Query Results
Semantic Search
soccer games no gps needed least # permissions required
Not soccer
games!!!
Semantic Search
• Support Searches like
o Superhero games with parental control
o Todo list with location reminder
• Trans...
Recommendation
Apps
• Requires user history of
app usage
• Augment with user
context
28
App Permissions
• Verify permissio...
Policy Representation
• Rule languages used to represent policies about
apps
• Capture user preferences about apps and wha...
Others
• Linking application user experiences
o Capture user experiences during app usage and link to the app
entity
• Min...
Where do we go now
• Include more data sources
o Android Malware Genome Project
o BlueSeal project
o Amazon.com, GetJar, A...
Take aways
• Mobipedia is the Knowledge Base of mobile apps
• Current version has information of ~1 million
• Three Access...
Bonus Slide
Powered by
Mobipedia ;)
Upcoming SlideShare
Loading in …5
×

Mobipedia presentation

508 views

Published on

Building a Mobile Applications Knowledge Base for the Linked Data Cloud

Published in: Internet
  • Be the first to like this

Mobipedia presentation

  1. 1. Mobipedia Building a Mobile Applications Knowledge Base for the Linked Data Cloud Primal Pappachan, Roberto Yus, Prajit Kumar Das, Sharad Mehrotra, Tim Finin, and Anupam Joshi bit.ly/MPSlides
  2. 2. Lots of Smartphones
  3. 3. Lots of Apps 1.6M Android apps
  4. 4. Apps need Semantics! international semantic web conference apps iswc 2015 iswc semantic web conference apps
  5. 5. Not so uniform access • App stores • PlayDrone • Privacy Grade • Android Permissions website wow Much non-semantics Such access methods
  6. 6. Who you gonna call?
  7. 7. Our Goal Build a Knowledge Base for apps 1. Create an ontology for apps 2. Identify interesting sources 3. Add semantics to the information 4. Interlink the KB with others 5. Establish access mechanisms
  8. 8. Apps Ontology Existing ones • Dubline Core • DOAP (Description of a Project)
  9. 9. Apps Ontology
  10. 10. Data Properties
  11. 11. Mobipedia Overview
  12. 12. Current Sources 1. Scalable Google Play Store Crawler: includes 1.1 M apps (Viennot et. al 2014) 2. Website which assigns privacy grades to apps 3. Android permissions model: includes 152 official permissions used by Android apps 1 2 3
  13. 13. PlayDrone Project https://archive.org/details/android_apps
  14. 14. PrivacyGrade http://privacygrade.org/
  15. 15. Permissions! http://developer.android.com/reference/android/Manifest.permission.html
  16. 16. Extraction Process • Download the data • JSON files (PlayDrone) • HTML files (PrivacyGrade, Android Permissions) • Crawler - crawler4j • Parse the data • JSON – GSON • HTML - jsoup
  17. 17. Semantic Labeling • Match data with entities o Mobipedia’s ontology o Custom code • Create RDF data o We used the OWL API
  18. 18. Access Mechanisms
  19. 19. Linked Data Mobipedia Not to scale
  20. 20. Linking with DBpedia • Why not Xpedia? o We couldn’t find other KBs talking about Apps • Found two categories related to apps o Android_(operating_system)_software: 409 o Mobile_software: 221 • Filtered for duplicates and identified 600 entities • Linked using owl:sameAs property o Retrieve list of links based on name of the entity o Manually select the appropriate ones
  21. 21. Benefits • Centralized repository with different sources • Common format for representation of Apps • Derive inferences from different sources of related information
  22. 22. Challenges • Stayin’ relevant! • Stayin’ accurate! • Stayin’ alive! • Stayin’ alive! • Ah, ha, ha, ha!
  23. 23. 23 Mobipedia Usecases
  24. 24. SPARQL Query Support • SELECT App • WHERE o App description contains “flashlight” and o App has version Version o Version has Permission • Group by App and order by Permissions
  25. 25. Query Results
  26. 26. Semantic Search soccer games no gps needed least # permissions required Not soccer games!!!
  27. 27. Semantic Search • Support Searches like o Superhero games with parental control o Todo list with location reminder • Translate search terms to SPARQL queries (Han et. al.) • Execute converted queries on Mobipedia for results • Convert RDF results into App store combatable format Lushan Han, Tim Finin, and Anupam Joshi. 2011. GoRelations: an intuitive query system for DBpedia. In Proceedings of the 2011 joint international conference on The Semantic Web(JIST'11), 27
  28. 28. Recommendation Apps • Requires user history of app usage • Augment with user context 28 App Permissions • Verify permissions and third party libraries used by similar apps • Evolution of a privacy guideline
  29. 29. Policy Representation • Rule languages used to represent policies about apps • Capture user preferences about apps and what data it can access • Leverage Mobipedia for representation of concepts and obtaining data for policies
  30. 30. Others • Linking application user experiences o Capture user experiences during app usage and link to the app entity • Mining app reviews o Link concepts from app reviews to apps itself capturing user sentiment in the knowledge base • Saving the world one App at a time ;)
  31. 31. Where do we go now • Include more data sources o Android Malware Genome Project o BlueSeal project o Amazon.com, GetJar, Apple App store, Baidu store, Tencent App Gem • Community participation o User submissions o Moderation o Tools for easy contribution
  32. 32. Take aways • Mobipedia is the Knowledge Base of mobile apps • Current version has information of ~1 million • Three Access Mechanisms o Linked Data Interface o SPARQL endpoint o RDF Dumps Thank you NSF for the travel grant Go to http://mobipedia.link today
  33. 33. Bonus Slide Powered by Mobipedia ;)

×