Mobipedia
Building a Mobile Applications Knowledge Base for
the Linked Data Cloud
Primal Pappachan, Roberto Yus, Prajit Kumar Das,
Sharad Mehrotra, Tim Finin, and Anupam Joshi
bit.ly/MPSlides
Lots of Smartphones
Lots of Apps
1.6M Android apps
Apps need Semantics!
international semantic
web conference apps
iswc 2015 iswc
semantic web
conference apps
Not so uniform access
• App stores
• PlayDrone
• Privacy Grade
• Android Permissions website
wow
Much non-semantics
Such access methods
Who you gonna call?
Our Goal
Build a Knowledge Base for apps
1. Create an ontology for apps
2. Identify interesting sources
3. Add semantics to the information
4. Interlink the KB with others
5. Establish access mechanisms
Apps Ontology
Existing ones
• Dublin Core
• DOAP (Description of a Project)
Apps Ontology
Data Properties
Mobipedia Overview
Current Sources
1. Scalable Google Play Store Crawler:
includes 1.1 M apps (Viennot et al.,
2014)
2. Website which assigns privacy grades
to apps
3. Android permissions model: includes
152 official permissions used by
Android apps
PlayDrone Project
https://archive.org/details/android_apps
PrivacyGrade
http://privacygrade.org/
Permissions!
http://developer.android.com/reference/android/Manifest.permission.html
Extraction Process
• Download the data
• JSON files (PlayDrone)
• HTML files (PrivacyGrade, Android
Permissions)
• Crawler - crawler4j
• Parse the data
• JSON – GSON
• HTML - jsoup
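The parse step above can be sketched in a few lines. The original pipeline used Java libraries (crawler4j, GSON, jsoup); the sketch below swaps in Python's standard library as a stand-in. The JSON field names (`app_id`, `title`, `permission`) and the `td class="perm"` HTML layout are assumptions made for illustration, not the actual PlayDrone or Android documentation formats.

```python
import json
from html.parser import HTMLParser

# Hypothetical PlayDrone-style record; the real field names may differ.
RAW_JSON = (
    '{"app_id": "com.example.flashlight", "title": "Example Flashlight", '
    '"permission": ["android.permission.CAMERA"]}'
)

# Hypothetical HTML fragment listing permissions, one per table cell.
RAW_HTML = (
    "<table><tr><td class='perm'>CAMERA</td></tr>"
    "<tr><td class='perm'>INTERNET</td></tr></table>"
)


class PermissionCellParser(HTMLParser):
    """Collects the text of every <td class="perm"> cell."""

    def __init__(self):
        super().__init__()
        self.in_cell = False
        self.permissions = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs with quotes already removed.
        if tag == "td" and ("class", "perm") in attrs:
            self.in_cell = True

    def handle_endtag(self, tag):
        if tag == "td":
            self.in_cell = False

    def handle_data(self, data):
        if self.in_cell and data.strip():
            self.permissions.append(data.strip())


def parse_app(raw_json):
    """Parse one JSON record into a plain dict with the fields we keep."""
    record = json.loads(raw_json)
    return {
        "id": record["app_id"],
        "title": record["title"],
        "permissions": list(record.get("permission", [])),
    }


def parse_permissions(raw_html):
    """Extract permission names from the (assumed) HTML table layout."""
    parser = PermissionCellParser()
    parser.feed(raw_html)
    return parser.permissions


app = parse_app(RAW_JSON)
official = parse_permissions(RAW_HTML)
```

Keeping the two parsers separate mirrors the slide: JSON sources and HTML sources go through different tools and only meet again at the labeling step.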
Semantic Labeling
• Match data with entities
o Mobipedia’s ontology
o Custom code
• Create RDF data
o We used the OWL API
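As a rough illustration of the labeling step, the sketch below turns one parsed app record into RDF written as N-Triples. The authors used the OWL API (Java) with custom code; this is a Python stand-in, not their implementation. The `http://example.org/mobipedia/` namespace, the `App` class, and the `hasPermission` property are hypothetical names, and attaching permissions directly to the app (rather than to a version of it) is a simplification.

```python
MOBI = "http://example.org/mobipedia/"  # hypothetical namespace
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"


def escape_literal(text):
    """Escape the characters N-Triples requires inside a string literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def app_to_ntriples(app):
    """Return one N-Triples line per fact known about the app."""
    subject = f"<{MOBI}app/{app['id']}>"
    lines = [
        # The app is an instance of the (hypothetical) App class.
        f"{subject} <{RDF_TYPE}> <{MOBI}ontology/App> .",
        # Its title becomes a plain string label.
        f'{subject} <{RDFS_LABEL}> "{escape_literal(app["title"])}" .',
    ]
    for perm in app["permissions"]:
        # Each permission is linked as a resource, not a string.
        lines.append(
            f"{subject} <{MOBI}ontology/hasPermission> "
            f"<{MOBI}permission/{perm}> ."
        )
    return lines
```

For example, `app_to_ntriples({"id": "com.example.app", "title": "Example", "permissions": ["CAMERA"]})` yields three lines: a type statement, a label, and one permission link.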
Access Mechanisms
Linked Data
Mobipedia
Not to scale
Linking with DBpedia
• Why not Xpedia?
o We couldn’t find other KBs talking about Apps
• Found two categories related to apps
o Android_(operating_system)_software: 409
o Mobile_software: 221
• Filtered for duplicates and identified 600 entities
• Linked using owl:sameAs property
o Retrieve list of links based on name of the entity
o Manually select the appropriate ones
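The name-based candidate retrieval can be sketched as follows. This is a minimal illustration, assuming a simple normalized-name comparison; the slides do not say how names were matched, and the `http://example.org/mobipedia/app/` namespace is hypothetical. The output is only a candidate list: the manual selection step still decides which links become `owl:sameAs` statements.

```python
import re

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
DBPEDIA = "http://dbpedia.org/resource/"
MOBI = "http://example.org/mobipedia/app/"  # hypothetical namespace


def normalize(name):
    """Lowercase and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", name.lower())


def candidate_links(apps, dbpedia_names):
    """Pair each app with DBpedia resources whose normalized name matches.

    apps maps an app id to its title; dbpedia_names lists resource names.
    """
    index = {}
    for resource in dbpedia_names:
        # Resource names use underscores and may carry a parenthesised
        # disambiguation suffix, e.g. "Foo_(software)".
        base = re.sub(r"_\(.*\)$", "", resource)
        index.setdefault(normalize(base), []).append(resource)
    return {
        app_id: index.get(normalize(title), [])
        for app_id, title in apps.items()
    }


def same_as_triple(app_id, resource):
    """N-Triples line for a link a human reviewer has confirmed."""
    return f"<{MOBI}{app_id}> <{OWL_SAME_AS}> <{DBPEDIA}{resource}> ."
```

Exact matching on normalized names is deliberately conservative; it misses renamed apps but keeps the list short enough to review by hand.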
Benefits
• Centralized repository with different sources
• Common format for representation of Apps
• Derive inferences from different sources of related
information
Challenges
• Stayin’ relevant!
• Stayin’ accurate!
• Stayin’ alive!
• Stayin’ alive!
• Ah, ha, ha, ha!
Mobipedia
Usecases
SPARQL Query Support
• SELECT App
• WHERE
o App description contains “flashlight” and
o App has version Version
o Version has Permission
• Group by App and order by Permissions
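The query outlined above might look like the sketch below, which builds the SPARQL text and a request URL without contacting any server. The `mobi:` namespace and the property names (`description`, `hasVersion`, `hasPermission`) are assumptions; the actual Mobipedia ontology terms may differ, and the endpoint address is supplied by the caller.

```python
from urllib.parse import urlencode

PREFIX = "http://example.org/mobipedia/ontology/"  # hypothetical namespace


def flashlight_query(keyword="flashlight"):
    """Build a SPARQL query counting permissions per matching app."""
    if '"' in keyword or "\\" in keyword:
        # Keep the sketch simple: refuse input that would need escaping.
        raise ValueError("keyword must not contain quotes or backslashes")
    return f"""
PREFIX mobi: <{PREFIX}>
SELECT ?app (COUNT(DISTINCT ?permission) AS ?permissions)
WHERE {{
  ?app mobi:description ?description ;
       mobi:hasVersion ?version .
  ?version mobi:hasPermission ?permission .
  FILTER(CONTAINS(LCASE(?description), "{keyword}"))
}}
GROUP BY ?app
ORDER BY ?permissions
""".strip()


def request_url(endpoint, query):
    """URL for a GET request in the style of the SPARQL 1.1 Protocol."""
    return endpoint + "?" + urlencode({"query": query})
```

Ordering by the permission count in ascending order puts the apps that ask for the fewest permissions first, which is the comparison the flashlight example is after.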
Query Results
Semantic Search
soccer games no gps needed least # permissions required
Not soccer
games!!!
Semantic Search
• Support Searches like
o Superhero games with parental control
o Todo list with location reminder
• Translate search terms to SPARQL queries (Han et
al.)
• Execute converted queries on Mobipedia for results
• Convert RDF results into App store compatible
format
Lushan Han, Tim Finin, and Anupam Joshi. 2011. GoRelations: an intuitive query system for DBpedia.
In Proceedings of the 2011 joint international conference on The Semantic Web (JIST'11).
Recommendation
Apps
• Requires user history of
app usage
• Augment with user
context
App Permissions
• Verify permissions and
third party libraries used
by similar apps
• Evolution of a privacy
guideline
Policy Representation
• Rule languages used to represent policies about
apps
• Capture user preferences about apps and what
data it can access
• Leverage Mobipedia for representation of
concepts and obtaining data for policies
Others
• Linking application user experiences
o Capture user experiences during app usage and link to the app
entity
• Mining app reviews
o Link concepts from app reviews to the apps themselves, capturing user
sentiment in the knowledge base
• Saving the world one App at a time ;)
Where do we go now
• Include more data sources
o Android Malware Genome Project
o BlueSeal project
o Amazon.com, GetJar, Apple App store, Baidu store, Tencent App
Gem
• Community participation
o User submissions
o Moderation
o Tools for easy contribution
Takeaways
• Mobipedia is the Knowledge Base of mobile apps
• Current version has information on ~1 million apps
• Three Access Mechanisms
o Linked Data Interface
o SPARQL endpoint
o RDF Dumps
Thank you NSF for the travel grant
Go to http://mobipedia.link today
Bonus Slide
Powered by
Mobipedia ;)

Editor's Notes

  1. If there's something strange with the semantics, who you gonna call?
  2. The Dublin Core Metadata Initiative (DCMI) vocabulary and Description of a Project (DOAP) are used to describe web resources and software projects, respectively. Neither is focused on mobile development, and their concepts and properties didn't match the requirements for modeling mobile apps. We subclassed some of the terms in the DCMI ontology using rdfs:subClassOf to define the terms in the Mobipedia ontology.
  3. Google Play store has no public API which we could use for crawling it. So we looked around to see if someone had already released a public dataset with metadata about the apps.
  4. 152 official permissions. Other custom permissions can be included too.
  5. Why OWL API? Reuse of code developed for other projects
  6. We set up the SPARQL endpoint using the OpenVirtuoso project. We generated the Linked Data interface for the SPARQL endpoint using the Pubby project. Pubby is a Java web application that makes non-dereferenceable URIs dereferenceable by connecting to the SPARQL endpoint.
  7. Can you see the Mobipedia circle? Are you looking closely?
  8. Execute the query on the website and show the results <- LIVE
  9. Android Malware Genome Project (one of the first): studied the genetic makeup of Android malware, collected 1200 apps, classified them into malware families, and identified what these apps do at various points of their lifetime. BlueSeal: proposed a different permission model for apps, analyzed the flow of permissions or information, and classified apps into flow categories.