1) The document discusses the use of semantics and semantic web technologies in both enterprise applications and on the public web from 1999 to the present.
2) Key applications mentioned include semantic search engines developed in 1999-2001, an active semantic electronic medical record, fraud prevention at global banks, the BBC Sound music index, and social media analytics tools that extract and query semantics from tweets.
3) The emergence of large linked open data sets starting in 2010 and ontologies like Schema.org and GoodRelations are also summarized as important developments for the semantic web.
Amit Sheth, Pramod Anantharam, Krishnaprasad Thirunarayan, "kHealth: Proactive Personalized Actionable Information for Better Healthcare", Workshop on Personal Data Analytics in the Internet of Things at VLDB2014, Hangzhou, China, September 5, 2014.
Accompanying Video: http://youtu.be/pqcbwGYHPuc
Paper: http://www.knoesis.org/library/resource.php?id=2008
Linked Open Data (LOD) has emerged as one of the largest collections of interlinked structured datasets on the Web. Although the adoption of such datasets for applications is
increasing, identifying relevant datasets for a specific task or topic is still challenging. As an initial step to make such identification easier, we provide an approach to automatically identify the topic domains of given datasets. Our method utilizes existing knowledge sources, more specifically Freebase, and we present an evaluation which validates the topic domains we can identify with our system. Furthermore, we evaluate the effectiveness of identified topic domains for the purpose of finding relevant datasets, thus showing that our approach improves reusability of LOD datasets.
Amit Sheth, "Semantic Interoperability and Information Brokering in Global Information Systems," Keynote given at IEEE Meta-Data, Bathesda, MD, April 6 1999.
Krishnaprasad Thirunarayan, Pramod Anantharam, Cory Henson, and Amit Sheth, 'Trust Networks', In: 5th Indian International Conference on Artificial Intelligence (IICAI-11), December 14-16, 2011 (invited tutorial).
Presentation given by Chris Welty (IBM Research) at Knoesis. We get the permission to upload this presentation from Chris Welty. Event details are at: http://j.mp/Welty-at-Knoesis and the associate video is at: https://www.youtube.com/watch?v=grDKpicM5y0
Amit Sheth, Pramod Anantharam, Krishnaprasad Thirunarayan, "kHealth: Proactive Personalized Actionable Information for Better Healthcare", Workshop on Personal Data Analytics in the Internet of Things at VLDB2014, Hangzhou, China, September 5, 2014.
Accompanying Video: http://youtu.be/pqcbwGYHPuc
Paper: http://www.knoesis.org/library/resource.php?id=2008
Linked Open Data (LOD) has emerged as one of the largest collections of interlinked structured datasets on the Web. Although the adoption of such datasets for applications is
increasing, identifying relevant datasets for a specific task or topic is still challenging. As an initial step to make such identification easier, we provide an approach to automatically identify the topic domains of given datasets. Our method utilizes existing knowledge sources, more specifically Freebase, and we present an evaluation which validates the topic domains we can identify with our system. Furthermore, we evaluate the effectiveness of identified topic domains for the purpose of finding relevant datasets, thus showing that our approach improves reusability of LOD datasets.
Amit Sheth, "Semantic Interoperability and Information Brokering in Global Information Systems," Keynote given at IEEE Meta-Data, Bathesda, MD, April 6 1999.
Krishnaprasad Thirunarayan, Pramod Anantharam, Cory Henson, and Amit Sheth, 'Trust Networks', In: 5th Indian International Conference on Artificial Intelligence (IICAI-11), December 14-16, 2011 (invited tutorial).
Presentation given by Chris Welty (IBM Research) at Knoesis. We get the permission to upload this presentation from Chris Welty. Event details are at: http://j.mp/Welty-at-Knoesis and the associate video is at: https://www.youtube.com/watch?v=grDKpicM5y0
Krishnaprasad Thirunarayan and Amit Sheth: Semantics-empowered Approaches to Big Data Processing for Physical-Cyber-Social Applications, In: Proceedings of AAAI 2013 Fall Symposium on Semantics for Big Data, Arlington, Virginia, November 15-17, 2013.
With the rapid proliferation of mobile phones, social media, and sensors, it is critical to collect and convert big data so generated into actionable information that is relevant for decision making. In this session, we explore challenges and approaches for synthesizing relevant background knowledge and inferences that can enable smart healthcare and ultimately benefit community at large.
Paper: http://www.knoesis.org/library/resource.php?id=1903
A statistical and schema independent approach to determine equivalent properties between linked datasets. The approach utilizes interlinking between datasets and property extensions to understand the equivalence of properties.
Talk given by prof. Amit Sheth at the ICMSE-MGI Digital Data Workshop held at Kno.e.sis Center from November 13-14 2013.
workshop page: http://wiki.knoesis.org/index.php/ICMSE-MGI_Digital_Data_Workshop
Harshal Patni, "Real Time Semantic Analysis of Streaming Sensor Data," MS Thesis Defense, Kno.e.sis Center, Wright State University, Dayton OH, March 21, 2001.
More at: http://wiki.knoesis.org/index.php/SSW
Dissertation Advisor: Prof. Amit Sheth
Cursing is not uncommon during conversations in the physical world: 0.5% to 0.7% of all the words we speak are curse words, given that 1% of all the words are first-person plural pronouns (e.g., we, us, our). On social media, people can instantly chat with friends without face-to-face interaction, usually in a more public fashion and broadly disseminated through highly connected social network. Will these distinctive features of social media lead to a change in people’s curs- ing behavior? In this paper, we examine the characteristics of cursing activity on a popular social media platform – Twitter, involving the analysis of about 51 million tweets and about 14 million users. In particular, we explore a set of questions that have been recognized as crucial for understanding curs- ing in offline communications by prior studies, including the ubiquity, utility, and contextual dependencies of cursing.
Original paper: http://knoesis.org/library/resource.php?id=1937
Pavan Kapanipathi, Prateek Jain, Chitra Venkataramani, Amit Sheth, User Interests Identification on Twitter Using a Hierarchical Knowledge Base, ESWC 2014, May 2014.
Paper at: http://j.mp/user-ig
More at: http://wiki.knoesis.org/index.php/Hierarchical_Interest_Graph
Invited talk presented by Hemant Purohit (http://knoesis.org/researchers/hemant) at the NCSU workshop on IT for sustainable tourism development. The talk presents application of technology developed for crisis coordination into more general marketplace coordination via social media for helping suppliers (micro-entrepreneurs) and demanders (tourists).
The recent emergence of the “Linked Data” approach for publishing data represents a major step forward in realizing the original vision of a web that can "understand and satisfy the requests of people and machines to use the web content" – i.e. the Semantic Web. This new approach has resulted in the Linked Open Data (LOD) Cloud, which includes more than 70 large datasets contributed by experts belonging to diverse communities such as geography, entertainment, and life sciences. However, the current interlinks between datasets in the LOD Cloud – as we will illustrate – are too shallow to realize much of the benefits promised. If this limitation is left unaddressed, then the LOD Cloud will merely be more data that suffers from the same kinds of problems, which plague the Web of Documents, and hence the vision of the Semantic Web will fall short.
This thesis presents a comprehensive solution to address the issue of alignment and relationship identification using a bootstrapping based approach. By alignment we mean the process of determining correspondences between classes and properties of ontologies. We identify subsumption, equivalence and part-of relationship between classes. The work identifies part-of relationship between instances. Between properties we will establish subsumption and equivalence relationship. By bootstrapping we mean the process of being able to utilize the information which is contained within the datasets for improving the data within them. The work showcases use of bootstrapping based methods to identify and create richer relationships between LOD datasets. The BLOOMS project (http://wiki.knoesis.org/index.php/BLOOMS) and the PLATO project, both built as part of this research, have provided evidence to the feasibility and the applicability of the solution.
Mending the Gap between Library's Electronic and Print Collections in ILS and...New York University
This presentation proposed a conceptual model to model user's info seeking behavior in the context of their experience and use the model to improve library's collections and services using St. John's University Libraries for case study. It reviewed Web content technologies offered by IT vendors, and compared what offered in content technologies by Library IT vendors. To fill in the gap, It developed the preliminary proposal for 1) required data architecture in SOA framework, 2) desired features for managing library print and electronic content on library's website, 3) adoption of Semantic Web standards and technologies for managing library resources, and 4) the case study scenario with sample conceptual model.
"At the toolbar (menu, whatever) associated with a document there is a button marked "Oh, yeah?". You press it when you lose that feeling of trust. It says to the Web, 'so how do I know I can trust this information?'. The software then goes directly or indirectly back to metainformation about the document, which suggests a number of reasons."
Tim Berners-Lee, W3C Chair, Web Design Issues, September 1997
Provenance is focused on the description and understanding of where and how data is produced, the actors involved in the production of such data, and the processes by which the data was manipulated and transformed until it arrived to the collection from which it is being accessed. Provenance aims at providing the ability to trace the sources of data, enabling the exploration not just of the relationships between datasets, but also of their authors and affiliations, with the goal of preserving data ownership and establishing a notion of trust based on authenticity and reliability.
The Future Internet poses important challenges for provenance, derived from complex and rich scenarios characterized by the presence of large amounts of data stemming from heterogeneous sources like user communities, services, and things. Such challenges span across technical but also socioeconomic dimensions. The former includes aspects like vocabularies for representing provenance, interoperability and scalability issues, and means to produce, acquire, and reason with provenance in order to provide measures of trust and information quality. However, it is probably in the socieconomic dimension where more significant efforts need to be made as to addressing issues like the role of provenance in the overall picture of the Future Internet, entry barriers preventing the generation of provenance-aware internet content, means required to incentivate the production of such content, and ways to prevent provenance forgery.
In this talk, we provide and overview on provenance and the above mentioned challenges and introduce ongoing work in order to address trust issues from the provenance perspective in the Future Internet. We also link provenance to other relevant aspects for trust discussed in the session, like security, legal frameworks, and economics.
Linked data for Enterprise Data IntegrationSören Auer
The Web evolves into a Web of Data. In parallel Intranets of large companies will evolve into Data Intranets based on the Linked Data principles. Linked Data has the potential to complement the SOA paradigm with a light-weight, adaptive data integration approach.
This talk introduces Linked Data and Semantic Web by using two examples - population sciences grid and semantAqua - a semantically enabled environmental monitoring. It shows a few tools and the semantic methodology and opens discussion for LOD and team science
Krishnaprasad Thirunarayan and Amit Sheth: Semantics-empowered Approaches to Big Data Processing for Physical-Cyber-Social Applications, In: Proceedings of AAAI 2013 Fall Symposium on Semantics for Big Data, Arlington, Virginia, November 15-17, 2013.
With the rapid proliferation of mobile phones, social media, and sensors, it is critical to collect and convert big data so generated into actionable information that is relevant for decision making. In this session, we explore challenges and approaches for synthesizing relevant background knowledge and inferences that can enable smart healthcare and ultimately benefit community at large.
Paper: http://www.knoesis.org/library/resource.php?id=1903
A statistical and schema independent approach to determine equivalent properties between linked datasets. The approach utilizes interlinking between datasets and property extensions to understand the equivalence of properties.
Talk given by prof. Amit Sheth at the ICMSE-MGI Digital Data Workshop held at Kno.e.sis Center from November 13-14 2013.
workshop page: http://wiki.knoesis.org/index.php/ICMSE-MGI_Digital_Data_Workshop
Harshal Patni, "Real Time Semantic Analysis of Streaming Sensor Data," MS Thesis Defense, Kno.e.sis Center, Wright State University, Dayton OH, March 21, 2001.
More at: http://wiki.knoesis.org/index.php/SSW
Dissertation Advisor: Prof. Amit Sheth
Cursing is not uncommon during conversations in the physical world: 0.5% to 0.7% of all the words we speak are curse words, given that 1% of all the words are first-person plural pronouns (e.g., we, us, our). On social media, people can instantly chat with friends without face-to-face interaction, usually in a more public fashion and broadly disseminated through highly connected social network. Will these distinctive features of social media lead to a change in people’s curs- ing behavior? In this paper, we examine the characteristics of cursing activity on a popular social media platform – Twitter, involving the analysis of about 51 million tweets and about 14 million users. In particular, we explore a set of questions that have been recognized as crucial for understanding curs- ing in offline communications by prior studies, including the ubiquity, utility, and contextual dependencies of cursing.
Original paper: http://knoesis.org/library/resource.php?id=1937
Pavan Kapanipathi, Prateek Jain, Chitra Venkataramani, Amit Sheth, User Interests Identification on Twitter Using a Hierarchical Knowledge Base, ESWC 2014, May 2014.
Paper at: http://j.mp/user-ig
More at: http://wiki.knoesis.org/index.php/Hierarchical_Interest_Graph
Invited talk presented by Hemant Purohit (http://knoesis.org/researchers/hemant) at the NCSU workshop on IT for sustainable tourism development. The talk presents application of technology developed for crisis coordination into more general marketplace coordination via social media for helping suppliers (micro-entrepreneurs) and demanders (tourists).
The recent emergence of the “Linked Data” approach for publishing data represents a major step forward in realizing the original vision of a web that can "understand and satisfy the requests of people and machines to use the web content" – i.e. the Semantic Web. This new approach has resulted in the Linked Open Data (LOD) Cloud, which includes more than 70 large datasets contributed by experts belonging to diverse communities such as geography, entertainment, and life sciences. However, the current interlinks between datasets in the LOD Cloud – as we will illustrate – are too shallow to realize much of the benefits promised. If this limitation is left unaddressed, then the LOD Cloud will merely be more data that suffers from the same kinds of problems, which plague the Web of Documents, and hence the vision of the Semantic Web will fall short.
This thesis presents a comprehensive solution to address the issue of alignment and relationship identification using a bootstrapping based approach. By alignment we mean the process of determining correspondences between classes and properties of ontologies. We identify subsumption, equivalence and part-of relationship between classes. The work identifies part-of relationship between instances. Between properties we will establish subsumption and equivalence relationship. By bootstrapping we mean the process of being able to utilize the information which is contained within the datasets for improving the data within them. The work showcases use of bootstrapping based methods to identify and create richer relationships between LOD datasets. The BLOOMS project (http://wiki.knoesis.org/index.php/BLOOMS) and the PLATO project, both built as part of this research, have provided evidence to the feasibility and the applicability of the solution.
Mending the Gap between Library's Electronic and Print Collections in ILS and...New York University
This presentation proposed a conceptual model to model user's info seeking behavior in the context of their experience and use the model to improve library's collections and services using St. John's University Libraries for case study. It reviewed Web content technologies offered by IT vendors, and compared what offered in content technologies by Library IT vendors. To fill in the gap, It developed the preliminary proposal for 1) required data architecture in SOA framework, 2) desired features for managing library print and electronic content on library's website, 3) adoption of Semantic Web standards and technologies for managing library resources, and 4) the case study scenario with sample conceptual model.
"At the toolbar (menu, whatever) associated with a document there is a button marked "Oh, yeah?". You press it when you lose that feeling of trust. It says to the Web, 'so how do I know I can trust this information?'. The software then goes directly or indirectly back to metainformation about the document, which suggests a number of reasons."
Tim Berners-Lee, W3C Chair, Web Design Issues, September 1997
Provenance is focused on the description and understanding of where and how data is produced, the actors involved in the production of such data, and the processes by which the data was manipulated and transformed until it arrived to the collection from which it is being accessed. Provenance aims at providing the ability to trace the sources of data, enabling the exploration not just of the relationships between datasets, but also of their authors and affiliations, with the goal of preserving data ownership and establishing a notion of trust based on authenticity and reliability.
The Future Internet poses important challenges for provenance, derived from complex and rich scenarios characterized by the presence of large amounts of data stemming from heterogeneous sources like user communities, services, and things. Such challenges span across technical but also socioeconomic dimensions. The former includes aspects like vocabularies for representing provenance, interoperability and scalability issues, and means to produce, acquire, and reason with provenance in order to provide measures of trust and information quality. However, it is probably in the socieconomic dimension where more significant efforts need to be made as to addressing issues like the role of provenance in the overall picture of the Future Internet, entry barriers preventing the generation of provenance-aware internet content, means required to incentivate the production of such content, and ways to prevent provenance forgery.
In this talk, we provide and overview on provenance and the above mentioned challenges and introduce ongoing work in order to address trust issues from the provenance perspective in the Future Internet. We also link provenance to other relevant aspects for trust discussed in the session, like security, legal frameworks, and economics.
Linked data for Enterprise Data IntegrationSören Auer
The Web evolves into a Web of Data. In parallel Intranets of large companies will evolve into Data Intranets based on the Linked Data principles. Linked Data has the potential to complement the SOA paradigm with a light-weight, adaptive data integration approach.
This talk introduces Linked Data and Semantic Web by using two examples - population sciences grid and semantAqua - a semantically enabled environmental monitoring. It shows a few tools and the semantic methodology and opens discussion for LOD and team science
International Journal of Recent Advances in Mechanical Engineering (IJMECH)ijfcst journal
International Journal of Recent Advances in Mechanical Engineering (IJMECH) is a peer-reviewed, open access journal that addresses the impacts and challenges of Mechanical Engineering. The journal documents practical and theoretical results which make a fundamental contribution for the development of Mechanical Engineering This journal aims to bring together researchers and practitioners in all Mechanical Engineering aspects, including (but not limited to)..
With increase in size of web, volume of information content is becoming huge resulting in difficult to search, access, manage and maintain. Creating machine processible semantic could decrease some of these problems. In this post, we will discuss some of the applications of semantic web as we discussed in earlier post. Before we dive into applications, lets see what are semantic web applications.
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...Dr. Haxel Consult
Synonym breaks search! How? Why is this important? What synonym is and how it breaks search will be explained with real-world examples. AI-based solutions are proposed, and relevant standards are identified. How synonym solutions should be used for search are explained. Learn what you can do yourself. Tools help, but it doesn’t have to be complicated, nor expensive. It is as straight forward as setting priorities!
AI-SDV 2020: Can There Be Profitable Revenue from an AI Deployment? The Upsid...Dr. Haxel Consult
In the last twelve months AI activity has continued to accelerate. While there have been major setbacks in AI over the decades its recent up surge seems to be holding. Many positives stories are hitting the news, but is anyone actually making any money on AI deployments besides the big AI vendors? Have there been significant, meaningful cost reductions from AI deployments? Yes! Brief case studies will be presented from primary and secondary sources illustrating impacts on real world cost savings and revenue enhancements. As is always the case with real world projects there are lessons learned!
Similar to Semantic Computing in Real-World: Vertical and Horizontal application (20)
Biological screening of herbal drugs: Introduction and Need for
Phyto-Pharmacological Screening, New Strategies for evaluating
Natural Products, In vitro evaluation techniques for Antioxidants, Antimicrobial and Anticancer drugs. In vivo evaluation techniques
for Anti-inflammatory, Antiulcer, Anticancer, Wound healing, Antidiabetic, Hepatoprotective, Cardio protective, Diuretics and
Antifertility, Toxicity studies as per OECD guidelines
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
This presentation provides a briefing on how to upload submissions and documents in Google Classroom. It was prepared as part of an orientation for new Sainik School in-service teacher trainees. As a training officer, my goal is to ensure that you are comfortable and proficient with this essential tool for managing assignments and fostering student engagement.
Francesca Gottschalk - How can education support child empowerment.pptxEduSkills OECD
Francesca Gottschalk from the OECD’s Centre for Educational Research and Innovation presents at the Ask an Expert Webinar: How can education support child empowerment?
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
Read| The latest issue of The Challenger is here! We are thrilled to announce that our school paper has qualified for the NATIONAL SCHOOLS PRESS CONFERENCE (NSPC) 2024. Thank you for your unwavering support and trust. Dive into the stories that made us stand out!
Embracing GenAI - A Strategic ImperativePeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
Semantic Computing in Real-World: Vertical and Horizontal application
1. Semantic Computing in Real-World:
Vertical and Horizontal application,
within Enterprise and on the Web
Panel at Intl Conf on Semantic Computing, Palo Alto, CA, Sept 20m 2011
Amit P. Sheth
LexisNexis Ohio Eminent Scholar
Ohio Center of Excellence in Knowledge-enabled Computing (Kno.e.sis)
Wright State University, Dayton, OH
amit@knoesis.org
Ohio Center of Excellence in Knowledge-Enabled Computing
2. Semantics as core enabler, enhancer @
Kno.e.sis
knoesis.org
Ohio Center of Excellence in Knowledge-Enabled Computing
3. Semantics & Semantic Web in 1999-2002
2001
Patent : http://bit.ly/sw-p
Ohio Center of Excellence in Knowledge-Enabled Computing
4. Significant Presence In …
BBC Sound
MediaAnywhere
Index
/Taalee
……….
Twitris
Twarql
Pharmaceutical
Defense
Health
Care ASEMR
Finance
GIS
Life Scooner
Science
App Specific Search Social Sensor ……….
Ohio Center of Excellence in Knowledge-Enabled Computing
5. Semantic Web In Action
Create Domain Model and Semantic Annotations
Knowledge bases
Taalee -
Bio2RDF MediaAnywhere
BestBuy
Linked Open Schema.org
Data
BBC Sound Index
Global Investment Twarql
Scooner Bank
Semagix
ASEMR
Reasoning and Analysis
Ohio Center of Excellence in Knowledge-Enabled Computing
6. Semantic Search 1999 - 2001
Ohio Center of Excellence in Knowledge-Enabled Computing
7. Taalee Semantic/Faceted Search & Browsing
(1999-2001)
Targeted e-shopping/e-commerce
assets access
uniform view of worldwide
distributed assets of similar type
Taalee - Search
Ohio Center of Excellence in Knowledge-Enabled Computing
8. Taalee Semantic/Faceted Search & Browsing
(1999-2001)
BLENDED BROWSING & QUERYING Targeted e-shopping/e-commerce
ATTRIBUTE & KEYWORD
QUERYING
assets access
SEMANTIC BROWSING
uniform view of worldwide
distributed assets of similar type
Taalee - Search
Ohio Center of Excellence in Knowledge-Enabled Computing
9. Fast Forward to 2010 - 2011
Ohio Center of Excellence in Knowledge-Enabled Computing
10. Schema.org
Shared Vocabulary
Amazing things can
happen
Ohio Center of Excellence in Knowledge-Enabled Computing
12. Extracting Semantic Metadata from
Semi structured and Structured Sources (1999 – 2002)
Semagix Freedom for building
ontology-driven information system
Managing Semantic Content on the Web
Ohio Center of Excellence in Knowledge-Enabled Computing
13. Active Semantic Electronic Medical Record
Application
In Use Today at Athens Heart Center For Clinical Decision Support
since January 2006
Amit P. Sheth, S. Agrawal,JonathanLathem, Nicole Oldham, H. Wingate, P. Yadav, and K. Gallagher, Active Semantic
Electronic Medical Record, Proc. of the 5th International Semantic Web Conference
Ohio Center of Excellence in Knowledge-Enabled Computing
14. Global Investment Bank
Law Public World Wide BLOGS,
Watch Lists Enforcement Regulators Records Web content RSS
Semi-structured Government Data Un-structure text, Semi-structured Data
Establishing
New Account
User will be able to navigate
the ontology using a number
of different interfaces
Scores the entity
based on the
content and entity
relationships
Fraud Prevention application used in
financial services – Related KYC
application is deployed at Majority
of Global Banks
Ohio Center of Excellence in Knowledge-Enabled Computing
15. Using large data sets for Structured
Data on the web
Ohio Center of Excellence in Knowledge-Enabled Computing
16. Linked Open Data
Publish Open Data Sets in RDF
By 2010, 203 data data sets
25 billion Triples
Image: http://richard.cyganiak.de/2007/10/lod/
Ohio Center of Excellence in Knowledge-Enabled Computing
17. You publish the raw data…
Semantic Web Adoption and Application
Ohio Center of Excellence in Knowledge-Enabled Computing
18. … and others can use it
Semantic Web Adoption and Application
Ohio Center of Excellence in Knowledge-Enabled Computing
19. Using the LOD to build Web site: BBC
Semantic Web Adoption and Application
Ohio Center of Excellence in Knowledge-Enabled Computing
20. Using the LOD to build Web site: BBC
Semantic Web Adoption and Application
Ohio Center of Excellence in Knowledge-Enabled Computing
21. GoodRelations Ontology - RDFa
Semantic Web Adoption and Application
Ohio Center of Excellence in Knowledge-Enabled Computing
22. GoodRelations Ontology - RDFa
Semantic Web Adoption and Application
Ohio Center of Excellence in Knowledge-Enabled Computing
23. GoodRelations Ontology - RDFa
Semantic Web Adoption and Application
Ohio Center of Excellence in Knowledge-Enabled Computing
25. BBC Sound Index
Using MusicBrainz to spot entities in the BBC Sound Index
60 songs with Merry Christmas
3600 songs with Yesterday
195 releases of American Pie
32 Artists Amrican Pie
Ohio Center of Excellence in Knowledge-Enabled Computing
26. BBC Sound Index
Which ‘Merry Christmas’?; ‘So Good’is also a song
Scoped Relationship graphs
– Using context cues from the content,
webpage title, url…
e.g. new Merry Christmas tune
– Reduce potential entity spot size
e.g. new albums/songs
• Generate candidate entities
• Spot and Disambiguate
Ohio Center of Excellence in Knowledge-Enabled Computing
30. Twitris: Semantic Social Web Mash-up
Select date Select topic
N-gram summaries
Topic tree
Sentiment Spatial Marker
Tweet traffic
Images & Videos Analysis
Related tweets Reference news
Wikipedia articles
TWITRIS Ohio Center of Excellence in Knowledge-Enabled Computing
31. Twarql (Twitter Feeds through SPARQL)
• Semantically annotate tweets with entities, hashtags, URLs,
sentiments, etc.
• Encode content in a structured format (RDF) using shared
vocabularies (FOAT, SIOC, MOAT, etc.)
• Structured querying of tweets
• Subscribe to a stream of tweets that match a given query
• Real-time delivery of streaming data.
TWARQL
Ohio Center of Excellence in Knowledge-Enabled Computing
32. Twarql Architecture
TWARQL
Ohio Center of Excellence in Knowledge-Enabled Computing
33. User Controlled Content Dissemination
• Create Semantic Social Graphs (FOAF) of publishers (followee) and
subscribers (followers).
• Dynamically create subset of subscribers (SPARQL Query on Semantic
Social Graph), based on publishers privacy preference for the content
generated.
• Distribute the content to only the subset of subscribers from the Social
graph in (near) Real-Time
• Example –
• SMOB ( Semantic Microblogging Framework) http://smob.me
• Semantic Hub ( Publisher/Subscriber protocol) http://semantichub.appspot.com
Ohio Center of Excellence in Knowledge-Enabled Computing
34. Semantic Hub
Ohio Center of Excellence in Knowledge-Enabled Computing
35. Scenario ….
• Give me a stream of • Give me all people that
locations where Kinect is have said negative things
being mentioned now about Kinect
TWARQL
Ohio Center of Excellence in Knowledge-Enabled Computing
36. Creating Focus Specific Knowledge
Bases for Knowledge Exploration
Ohio Center of Excellence in Knowledge-Enabled Computing
37. Scooner
HPC keywords Doozer: Base Hierarchy
from Wikipedia Focused Pattern
based extraction
SenseLab Neuroscience
Ontologies
Initial KB Creation
Meta Knowledgebase
PubMed Abstracts
Knoesis: Parsing
Enrich Knowledge Base
based NLP Triples
NLM: Rule based
Final Knowledge Base
BKR Triples
Scooner
Ohio Center of Excellence in Knowledge-Enabled Computing
38. Search for “VIP Peptide”
Scooner
Ohio Center of Excellence in Knowledge-Enabled Computing
40. Interested in more?
• “Citizen Sensor Data Mining, Social Media Analytics and Development
Centric Web Applications” (WWW2011)
• Jorge Cardoso, Martin Hepp and MiltiadisLytras“The Semantic Web :
Real World Applications from Industry”
• Ivan Herman, “Semantic Web Adoption and Application”
• Michael Hausenblas, “Open Data Ireland : From Research to Practice”
• Amit Sheth: “Semantics Scales Up: Beyond Search in Web 3.0”
• See showcase of several applications at: http://knoesis.org/showcase
Ohio Center of Excellence in Knowledge-Enabled Computing
Editor's Notes
Animation needs work
Animation needs work
Obtained from Ivan’s slide
Obtained from Ivan’s slide
Obtained from Ivan’s slide
Obtained from Ivan’s slide
Obtained from Ivan’s slide
Architecture of twarqlStream tweetsCovert it into RDFUse SPARQL Query to filter it