One of the main problems in online advertising is to display ads which are relevant and appropriate with respect to what the user is looking for. Search engines often fail to reach this goal because they do not consider the semantics attached to keywords. In this paper we propose a system that tackles the problem from two different angles: helping (i) advertisers to create more efficient ad campaigns and (ii) ad providers to properly match ad content to keywords in search engines.
We exploit semantic relations stored in the DBpedia dataset and use a hybrid ranking system to rank keywords and to expand queries formulated by the user. The inputs to our ranking system are (i) the DBpedia dataset and (ii) external information sources such as classical search engine results and social tagging systems.
We compare our approach with other RDF similarity measures and validate our algorithm through an extensive evaluation involving real users.
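To make the idea of a hybrid ranking more concrete, here is a minimal sketch, not the authors' actual algorithm. It assumes a tiny hand-made link graph standing in for DBpedia and made-up hit counts standing in for an external source such as a search engine. All names (`LINKS`, `HITS`, `PAIR_HITS`, `link_score`, `cooccurrence_score`, `hybrid_score`, `rank_related`) and the weighting parameter `alpha` are hypothetical and chosen for illustration only.

```python
from typing import Dict, FrozenSet, List, Set, Tuple

# Hypothetical toy graph standing in for DBpedia: resource -> resources it links to.
LINKS: Dict[str, Set[str]] = {
    "Semantic_Web": {"RDF", "Ontology"},
    "RDF": {"Semantic_Web", "SPARQL"},
    "SPARQL": {"RDF"},
    "Ontology": set(),
}

# Hypothetical hit counts from an external source (e.g. a search engine).
HITS: Dict[str, int] = {
    "Semantic_Web": 1000,
    "RDF": 800,
    "Ontology": 2000,
    "SPARQL": 500,
}

# Hypothetical counts of results that mention both resources of a pair.
PAIR_HITS: Dict[FrozenSet[str], int] = {
    frozenset(("Semantic_Web", "RDF")): 400,
    frozenset(("Semantic_Web", "Ontology")): 300,
    frozenset(("Semantic_Web", "SPARQL")): 100,
}


def link_score(a: str, b: str) -> float:
    """Graph evidence: half a point for a -> b, half a point for b -> a."""
    forward = int(b in LINKS.get(a, set()))
    backward = int(a in LINKS.get(b, set()))
    return (forward + backward) / 2


def cooccurrence_score(a: str, b: str) -> float:
    """External evidence: average share of each resource's hits that mention the other."""
    both = PAIR_HITS.get(frozenset((a, b)), 0)
    if both == 0:
        return 0.0
    return (both / HITS[a] + both / HITS[b]) / 2


def hybrid_score(a: str, b: str, alpha: float = 0.5) -> float:
    """Weighted mix of graph evidence and external evidence (alpha is an assumption)."""
    return alpha * link_score(a, b) + (1 - alpha) * cooccurrence_score(a, b)


def rank_related(seed: str, alpha: float = 0.5) -> List[Tuple[str, float]]:
    """Rank every other resource by its hybrid similarity to the seed, best first."""
    scored = [(r, hybrid_score(seed, r, alpha)) for r in LINKS if r != seed]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Calling `rank_related("Semantic_Web")` returns the other resources ordered by the combined score, which could then be used to suggest related keywords or expand a query. Moving `alpha` toward 1 trusts the graph links more, toward 0 the external counts.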
Semantic Tags Generation and Retrieval for Online Advertising - CIKM 2010
1. CIKM 2010 – 19th ACM International Conference on Information and Knowledge Management
October 29, 2010 – Fairmont Royal York, Toronto, Canada
SEMANTIC TAGS GENERATION AND RETRIEVAL FOR ONLINE ADVERTISING
Roberto Mirizzi (1), Azzurra Ragone (1,2), Tommaso Di Noia (1), Eugenio Di Sciascio (1)
(1) Politecnico di Bari, Via Orabona, 4, 70125 Bari (ITALY)
(2) University of Trento, Via Sommarive, 14, 38100 Trento (ITALY)
2. Outline
Tags in Web 2.0 → 3.0
Computational advertising
NOT (Not Only Tag): semantic tag cloud generation
DBpediaRanker: RDF ranking in DBpedia
Conclusion and Future work
3. Who is using tags nowadays?
and many more…
4. What about Tags in Online Advertising?
5. BigG (& co.) helps you… in half (i)
Keyword Tool (https://adwords.google.com/select/KeywordToolExternal)
Based on actual Google search queries
Generates keywords based on the content of a URL, words or phrases
…nice, but there is no “semantics” in it.
You cannot expand your keyword list by exploiting the meaning of a term (keyword/tag/query)
6. BigG (& co.) helps you… in half (ii)
Keyword Tool
Based on actual Google search queries
Generates keywords based on the content of a URL, words or phrases
…nice, but there is no “semantics” in it.
You cannot expand your keyword list by exploiting the meaning of a term (keyword/tag/query)
7. Why not use semantic tags?
Plugged into the Web 3.0
Disambiguation
Relations among tags
Machine understandable
NOT: Not Only Tag
http://sisinflab.poliba.it/not-only-tag/
8. NOT: Not Only Tag
Objectives
Assist advertisers in creating more efficient ad campaigns
Support ad providers in properly matching ad content to keywords in search engines
Improve advertiser experience and ad selection
9. What is behind NOT? (i)
10. What is behind NOT? (ii)
Comments
DBpedia resources are highly interconnected in the RDF graph
Not all the relevant resources for a given node are its direct neighbors
  1. Explore the neighborhood of a resource to discover new relevant resources not directly connected to it
  2. Rank the results
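The exploration step can be illustrated with a bounded breadth-first search. This is only a minimal sketch, not the authors' implementation: the function name `explore`, the `max_depth` parameter, and the toy adjacency list are assumptions made for illustration, and the edges below are not actual DBpedia data.

```python
from collections import deque

# Hypothetical toy graph: adjacency lists between resource labels.
# The edges are illustrative only and are not taken from DBpedia.
TOY_GRAPH = {
    "Drupal": ["Open_source_CMS"],
    "Open_source_CMS": ["Drupal", "Magento", "Content_management_systems"],
    "Magento": ["Open_source_CMS"],
    "Content_management_systems": ["Open_source_CMS", "Web_applications"],
    "Web_applications": ["Content_management_systems"],
}


def explore(graph, start, max_depth=2):
    """Return {resource: hop distance} for resources within max_depth hops of start."""
    distances = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if distances[node] == max_depth:
            continue  # do not expand beyond the depth limit
        for neighbor in graph.get(node, ()):
            if neighbor not in distances:
                distances[neighbor] = distances[node] + 1
                queue.append(neighbor)
    del distances[start]  # the starting resource is not a candidate itself
    return distances


reachable = explore(TOY_GRAPH, "Drupal", max_depth=2)
print(reachable)
```

With a depth limit of 2, resources such as Magento are discovered even though they are not direct neighbors of Drupal in the toy graph; a ranking step would then order these candidates.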
11. DBpedia graph exploration in NOT
[Figure: fragment of the DBpedia graph explored by NOT. Node labels: Open_source_CMS, Web_application_frameworks, Content_management_systems, Free_business_software, Web_development, Web_applications, JavaServer_Faces, Python_web_application_frameworks, Zend_Framework, Joomla_extensions, Magento, PHP, Drupal, … Legend: skos:subject, skos:broader, Category, Article]
12. The functional architecture
[Figure: architecture diagram. Component labels: Back-end, Query engine, Storage, Tag Cloud Generator, GUI, Ext. Info Sources (Delicious, Yahoo!, Bing, Google), DBpedia Lookup Service, Interface, Graph Explorer, SPARQL, Context Analyzer, Ranker]
Offline computation
  1. Linked Data graph exploration
  2. Rank nodes exploiting external information
  3. Store results as pairs of nodes together with their similarity
Runtime search
  1. Start typing a query
  2. Query the system for relevant tags (corresponding to DBpedia resources)
  3. Show the semantic tag cloud
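The split between offline computation and runtime search can be sketched as a small in-memory store: pairs of resources are saved with their similarity offline, then queried at runtime to obtain the tags for the cloud. The class `SimilarityStore`, its methods, the symmetric storage, and the similarity values are all assumptions made for illustration; the slides do not describe the actual storage layer.

```python
from collections import defaultdict


class SimilarityStore:
    """Hypothetical in-memory store of (resource, resource, similarity) triples."""

    def __init__(self):
        self._neighbors = defaultdict(dict)

    def add_pair(self, r1, r2, similarity):
        # Offline step: store the pair in both directions
        # (treating similarity as symmetric is an assumption of this sketch).
        self._neighbors[r1][r2] = similarity
        self._neighbors[r2][r1] = similarity

    def related_tags(self, resource, top_k=5):
        # Runtime step: return the top_k most similar resources, best first.
        ranked = sorted(
            self._neighbors.get(resource, {}).items(),
            key=lambda item: item[1],
            reverse=True,
        )
        return ranked[:top_k]


store = SimilarityStore()
# Made-up similarity values, for illustration only.
store.add_pair("PHP", "Zend_Framework", 0.9)
store.add_pair("PHP", "Drupal", 0.7)
store.add_pair("PHP", "Magento", 0.4)

print(store.related_tags("PHP", top_k=2))
```

Because the expensive exploration and ranking are done offline, the runtime query reduces to a lookup and a sort, which suits an interface that responds while the user is typing.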
13. DBpediaRanker: ranking
[Figure: resources ?r1 and ?r2 connected by an isSimilar relation, which hasValue v]
Ranking based on external sources
$$\mathrm{sim}(r_1, r_2)_{\mathit{info\_source}} = \frac{f(r_1, r_2)}{f(r_1)} + \frac{f(r_1, r_2)}{f(r_2)}$$
Graph-based and text-based ranking
$$\mathrm{wikilinkScore}(r_1, r_2) = \begin{cases} 0, & \text{no wikilink between } r_1 \text{ and } r_2 \\ 1, & \text{wikilink between } r_1 \text{ and } r_2 \text{ or vice versa} \\ 2, & \text{wikilink between } r_1 \text{ and } r_2 \text{ and vice versa} \end{cases}$$
$$\mathrm{abstractScore}(r_1, r_2) = \frac{l(r_2, r_1)}{l(r_2)}$$
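The three cases of wikilinkScore (no link, a link in one direction, links in both directions) can be expressed as a short function. This is a sketch under stated assumptions: wikilinks are represented as a set of directed (source, target) pairs, the name `wikilink_score` is hypothetical, and the toy links are illustrative rather than extracted from DBpedia.

```python
def wikilink_score(links, r1, r2):
    """Return 0, 1 or 2 depending on how many directions are linked."""
    forward = (r1, r2) in links   # r1 links to r2
    backward = (r2, r1) in links  # r2 links to r1
    return int(forward) + int(backward)


# Toy directed wikilinks (illustrative only, not real DBpedia data).
TOY_LINKS = {
    ("Zend_Framework", "PHP"),
    ("PHP", "Zend_Framework"),
    ("Drupal", "PHP"),
}

print(wikilink_score(TOY_LINKS, "Zend_Framework", "PHP"))
print(wikilink_score(TOY_LINKS, "Drupal", "PHP"))
print(wikilink_score(TOY_LINKS, "Drupal", "Magento"))
```

Counting the two directions separately yields exactly the values 0, 1 and 2, and the result is symmetric in its arguments.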
14. DBpediaRanker: an example (i)
wikilinkScore(Zend_Framework, PHP) = 2
abstractScore(Zend_Framework, PHP) = 1.0
15. DBpediaRanker: an example (ii)
Google
sim(Zend_Framework, PHP)_Google = 1.53e6 / 2.96e6 + 1.53e6 / 1.71e9 ≈ 0.52 + 0
Delicious
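The Google example can be reproduced numerically. The sketch below assumes that f denotes the number of results an external source returns for a resource, or for a pair of resources queried together; the slides do not define f explicitly. The function name `external_similarity` and the guard against zero counts are likewise assumptions.

```python
def external_similarity(f_r1, f_r2, f_both):
    """sim = f(r1, r2) / f(r1) + f(r1, r2) / f(r2)."""
    if f_r1 <= 0 or f_r2 <= 0:
        return 0.0  # assumption: no results for a resource means no evidence
    return f_both / f_r1 + f_both / f_r2


# Counts taken from the Google example on the slide.
sim_google = external_similarity(f_r1=2.96e6, f_r2=1.71e9, f_both=1.53e6)
print(round(sim_google, 2))
```

The second term is negligible because the PHP count is about three orders of magnitude larger than the joint count, so the score is dominated by the first term.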
16. DBpediaRanker: context analysis
The same similarity measure is used in the context analysis
[Figure: resource ?r1 linked by belongsTo, with hasValue v, to the context C = {?c1, ?c2, …, ?cN}]
Example:
C = {Programming Languages, Databases, Software}
Does Dennis Ritchie belong to the given context?
Algorithm:
If (v > THRESHOLD) then
  r1 belongs to the context;
  add r1 to the graph exploration queue
Else
  r1 does not belong to the context;
  exclude r1 from graph exploration
EndIf
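The threshold test can be written as a small function that either enqueues a resource for further exploration or discards it. This is a minimal sketch: the value v is passed in directly because the slides do not show how it is aggregated over the context, and the threshold 0.5 and the scores used are hypothetical.

```python
from collections import deque


def enqueue_if_in_context(resource, v, threshold, queue):
    """Append resource to the exploration queue only if v exceeds threshold."""
    if v > threshold:
        queue.append(resource)  # belongs to the context: keep exploring from it
        return True
    return False  # outside the context: excluded from graph exploration


THRESHOLD = 0.5  # hypothetical value
exploration_queue = deque()

# Hypothetical context scores, for illustration only.
kept = enqueue_if_in_context("Dennis_Ritchie", 0.8, THRESHOLD, exploration_queue)
dropped = enqueue_if_in_context("Some_Other_Resource", 0.2, THRESHOLD, exploration_queue)

print(kept, dropped, list(exploration_queue))
```

Filtering at this point keeps the exploration from drifting into unrelated parts of the graph, since excluded resources are never expanded.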
17. Evaluation (i)
http://sisinflab.poliba.it/evaluation
Comparison of 5 different algorithms
50 volunteers
Researchers in the ICT area
244 votes collected (on average 5 votes per user)
Average time to vote: 1 min 40 s
18. Evaluation (ii)
http://sisinflab.poliba.it/evaluation/data
3.91 - Good
19. Conclusion
NOT: a prototype system for tag cloud generation in semantic advertising
DBpediaRanker: ranking algorithms for resources in DBpedia
Future work
Use the back-end of the system to develop new interfaces for exploratory browsing
Improve ranking algorithms
Combine a content-based recommendation and a collaborative-filtering approach
Develop a platform to test our system with real ads about different domains
20. Q&A
Thanks for your attention!
SEMANTIC TAGS GENERATION AND RETRIEVAL FOR ONLINE ADVERTISING (CIKM 2010)
If you're interested in learning more…
1. Roberto Mirizzi, Tommaso Di Noia. From Exploratory Search to Web Search and back. 4th Workshop for Ph.D. Students in Information and Knowledge Management (PIKM 2010)
2. Roberto Mirizzi, Azzurra Ragone, Tommaso Di Noia, Eugenio Di Sciascio. Ranking the Linked Data: the case of DBpedia. 10th International Conference on Web Engineering (ICWE 2010)
3. Roberto Mirizzi, Azzurra Ragone, Tommaso Di Noia, Eugenio Di Sciascio. Semantic tag cloud generation via DBpedia. 11th International Conference on Electronic Commerce and Web Technologies (EC-Web 2010)
4. Roberto Mirizzi, Azzurra Ragone, Tommaso Di Noia, Eugenio Di Sciascio. Semantic tagging for crowd computing. 18th Italian Symposium on Advanced Database Systems (SEBD 2010)
5. Roberto Mirizzi, Azzurra Ragone, Tommaso Di Noia, Eugenio Di Sciascio. Semantic Wonder Cloud: exploratory search in DBpedia. 2nd International Workshop on Semantic Web Information Management (SWIM 2010) - Best Workshop Paper at International Conference on Web Engineering (ICWE 2010)
Roberto Mirizzi - mirizzi@deemail.poliba.it
See you tomorrow at PIKM 2010 in Room Alberta at 4pm with “From Exploratory Search to Web Search and back”