The document discusses word sense disambiguation and induction using Python. It describes using graphs and clustering algorithms to group terms that frequently occur together to induce senses of ambiguous entities. It also discusses evaluating the approach and provides a demo link. Packages like NumPy, NetworkX, and NLTK are used to process text, build graphs, and classify word senses.
1. Using Python for industrial prototypes
Word Sense Induction and Disambiguation
Artem Revenko
Head of Research
semantic-web.at
PyDays Vienna
May 5, 2017
2. Problem Statement Method Implementation
Controlled Vocabularies
Coffee Taxonomy
Coffee
  Filter Coffee
  Espresso-based
    Cappuccino
    Melange
    Espresso
Concepts: first-class citizens
2 / 17
SEMANTIC WEB COMPANY
school • consulting • projects • events • media
3. Problem Statement Method Implementation
Controlled Vocabularies
Coffee Taxonomy
Coffee
  Filter Coffee
  Espresso-based
    Cappuccino
    Melange
    Espresso
Concepts: first-class citizens
Hierarchy: broader / narrower relations
4. Problem Statement Method Implementation
Controlled Vocabularies
Coffee Taxonomy
Coffee
  Filter Coffee
  Espresso-based
    Cappuccino
    Melange
    Espresso
Concepts: first-class citizens
Hierarchy: broader / narrower relations
Auxiliary: alternative forms, custom relations, axioms, ...
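The three ingredients of a controlled vocabulary listed on this slide can be sketched as a small data structure. This is only an illustration: the `Taxonomy` class and its method names are assumptions made for this example, not the API of any tool mentioned in the talk, and the alternative label is hypothetical.

```python
from collections import defaultdict


class Taxonomy:
    """Toy controlled vocabulary (illustrative, not a real tool's API)."""

    def __init__(self):
        self.narrower = defaultdict(set)    # concept -> direct narrower concepts
        self.broader = defaultdict(set)     # concept -> direct broader concepts
        self.alt_labels = defaultdict(set)  # concept -> alternative forms

    def add(self, concept, broader=None, alt_labels=()):
        self.narrower[concept]  # register the concept even if it is a leaf
        if broader is not None:
            self.narrower[broader].add(concept)
            self.broader[concept].add(broader)
        self.alt_labels[concept].update(alt_labels)

    def descendants(self, concept):
        """Return every concept that is transitively narrower than `concept`."""
        seen = set()
        stack = [concept]
        while stack:
            for child in self.narrower[stack.pop()]:
                if child not in seen:
                    seen.add(child)
                    stack.append(child)
        return seen


# The coffee taxonomy from the slide.
coffee = Taxonomy()
coffee.add("Coffee")
# "Drip Coffee" is a hypothetical alternative form, not taken from the slides.
coffee.add("Filter Coffee", broader="Coffee", alt_labels=("Drip Coffee",))
coffee.add("Espresso-based", broader="Coffee")
for name in ("Cappuccino", "Melange", "Espresso"):
    coffee.add(name, broader="Espresso-based")
```

Storing both directions of the hierarchy keeps lookups cheap whether a search walks down from a broad concept or up from a narrow one.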
5. Problem Statement Method Implementation
Entity Extraction for Enabling Search
6. Problem Statement Method Implementation
Exploitation of Developments
Plan
1. Take existing tool with target functionality;
2. Use APIs to get the results / data;
3. Improve the results / data;
4. Push it back or push it further.
Data Market Austria
Flagship project for commercializing data and services.
Call for startups out soon!
Check datamarket.at
7. Problem Statement Method Implementation
Problem Statement
Method
Implementation
8. Finding Correct Concept
Taxonomy
Thing
  Car
    BMW
    Jaguar
  Animal
    Jaguar
Example sentence: "BMW has designed a car that is going to drive Jaguar X1 out of the market."
10. Exact Semantic Search
Coffee Taxonomy
Coffee
  Filter Coffee
  Espresso-based
    Cappuccino
    Melange
    Espresso
Search 1
Query: Coffee;
In document: Espresso;
Result: found.
12. Exact Semantic Search
Search 2
Query: Espresso;
In document: Coffee;
Result: not found.
13. Exact Semantic Search
Document
"One alternative to the Node.js utility is the Coffee Maven Plugin, a plugin for the popular Apache Maven build system."
Search 3
Query: Coffee;
In document: Coffee;
Result: ?
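The three searches can be reproduced with a small sketch. It assumes the taxonomy shape read off the slide (Cappuccino, Melange and Espresso as narrower concepts of Espresso-based) and a deliberately naive string-matching annotator; the names `NARROWER`, `expand` and `search` are illustrative and not taken from the talk.

```python
# Narrower-concept relations of the coffee taxonomy (shape assumed from the slide).
NARROWER = {
    "Coffee": ["Filter Coffee", "Espresso-based"],
    "Espresso-based": ["Cappuccino", "Melange", "Espresso"],
}


def expand(concept):
    """Return the concept together with all of its transitive narrower concepts."""
    found = {concept}
    for child in NARROWER.get(concept, []):
        found |= expand(child)
    return found


def search(query, document_concepts):
    """A document matches if it is annotated with the query concept or a narrower one."""
    return bool(expand(query) & set(document_concepts))


print(search("Coffee", ["Espresso"]))   # Search 1: True
print(search("Espresso", ["Coffee"]))   # Search 2: False

# Search 3: naive string matching annotates the Maven text with "Coffee",
# so the search reports a hit although the text is not about the beverage.
text = ("One alternative to the Node.js utility is the Coffee Maven Plugin, "
        "a plugin for the popular Apache Maven build system.")
annotations = [c for c in expand("Coffee") if c in text]
print(annotations)                      # ['Coffee']
print(search("Coffee", annotations))    # True (a false positive)
```

Search 3 returns a hit for the wrong reason, which is the case that motivates disambiguating each occurrence before annotating a document.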
14. Problem Statement
Word Sense Disambiguation (and Induction)
WSD is the task of identifying which sense (i.e. meaning) of a word is used in a sentence when the word has multiple meanings.
We go further: the task is to induce the senses of the entity and to find the correct sense for each occurrence.
Conventions
The word of interest: entity.
The surrounding words: terms.
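To make the entity / terms convention concrete, here is a minimal sketch that collects the terms around an entity. The window size, the tiny stopword list and the function name `context_terms` are assumptions for illustration; the deck relies on nltk for stopwords and stemming.

```python
import re

# Tiny illustrative stopword list (an assumption; a real run would use a full list).
STOPWORDS = {"a", "an", "the", "is", "of", "to", "that", "has", "out", "going"}


def context_terms(sentence, entity, window=5):
    """Return the non-stopword tokens within `window` tokens of each entity occurrence."""
    tokens = re.findall(r"\w+", sentence.lower())
    target = entity.lower()
    terms = []
    for i, token in enumerate(tokens):
        if token == target:
            nearby = tokens[max(0, i - window):i] + tokens[i + 1:i + 1 + window]
            terms.extend(t for t in nearby if t not in STOPWORDS and t != target)
    return terms


sentence = "BMW has designed a car that is going to drive Jaguar X1 out of the market."
print(context_terms(sentence, "Jaguar"))             # ['drive', 'x1', 'market']
print(context_terms(sentence, "Jaguar", window=10))  # also reaches 'bmw' and 'car'
```

A wider window picks up "car", the term that separates the car sense from the animal sense in this sentence.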
17. Describing Senses
"You shall know a word by the company it keeps." (Firth, J. R., 1957)
Co-occurrence Measures
Dice score: $\frac{2\,AB}{A + B}$, where A and B are the occurrence counts of the two terms and AB is the count of their co-occurrences,
Statistical significance of co-occurrence (z score),
. . .
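As a worked example of the Dice score, the sketch below counts occurrences over a handful of made-up contexts. Treating each context as a set of terms is an assumption about how the counts A, B and AB are obtained.

```python
def dice(contexts, a, b):
    """Dice score 2*AB / (A + B) over a list of contexts, each given as a set of terms."""
    count_a = sum(1 for c in contexts if a in c)
    count_b = sum(1 for c in contexts if b in c)
    count_ab = sum(1 for c in contexts if a in c and b in c)
    if count_a + count_b == 0:
        return 0.0  # neither term occurs at all
    return 2 * count_ab / (count_a + count_b)


# Made-up contexts for illustration only.
contexts = [
    {"jaguar", "car", "engine"},
    {"jaguar", "car", "dealer"},
    {"jaguar", "jungle", "prey"},
    {"car", "engine"},
]
# A = 3 (jaguar), B = 3 (car), AB = 2, so the score is 2 * 2 / (3 + 3) = 2/3.
print(dice(contexts, "jaguar", "car"))
print(dice(contexts, "jungle", "engine"))  # 0.0, the terms never co-occur
```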
18. Co-occurrence Graph
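The graph on this slide is an image, so the following is only a sketch of how such a graph could be built: terms are nodes, and an edge weight counts the contexts in which two terms appear together. It uses a plain dict of dicts to stay dependency-free (the deck itself uses networkx), and raw counts stand in for whichever co-occurrence measure is actually applied.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_graph(contexts):
    """Build an undirected weighted graph {term: {other_term: count}} from contexts."""
    graph = defaultdict(dict)
    for context in contexts:
        # Each unordered pair of distinct terms in a context adds 1 to its edge.
        for a, b in combinations(sorted(set(context)), 2):
            graph[a][b] = graph[a].get(b, 0) + 1
            graph[b][a] = graph[b].get(a, 0) + 1
    return dict(graph)


# Made-up contexts for illustration only.
contexts = [
    ["car", "engine", "dealer"],
    ["car", "engine"],
    ["jungle", "prey"],
]
graph = cooccurrence_graph(contexts)
print(graph["car"])     # {'dealer': 1, 'engine': 2}
print(graph["jungle"])  # {'prey': 1}
```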
19. Graph Clustering Algorithms
Chinese Whispers: propagate the signal (cluster labels) between neighboring nodes.
HyperLex: find the most important hubs (PageRank-like ranking).
Triangulation: count the cliques (complete subgraphs).
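Of the three algorithms, Chinese Whispers is the shortest to sketch. The version below is a simplified illustration, not the implementation behind the talk: it runs a fixed number of iterations, uses a seeded shuffle, and breaks ties by label name rather than at random. The graph format and the helper `undirected` are assumptions.

```python
import random


def chinese_whispers(graph, iterations=10, seed=0):
    """Label propagation on a weighted graph given as {node: {neighbor: weight}}."""
    rng = random.Random(seed)
    labels = {node: node for node in graph}  # every node starts in its own class
    nodes = list(graph)
    for _ in range(iterations):
        rng.shuffle(nodes)
        for node in nodes:
            votes = {}
            for neighbor, weight in graph[node].items():
                label = labels[neighbor]
                votes[label] = votes.get(label, 0.0) + weight
            if votes:
                # Adopt the label with the highest summed edge weight;
                # ties are broken by label name to keep the sketch deterministic.
                labels[node] = max(votes, key=lambda l: (votes[l], l))
    return labels


def undirected(edges):
    """Turn (a, b, weight) triples into a symmetric dict-of-dicts graph."""
    graph = {}
    for a, b, w in edges:
        graph.setdefault(a, {})[b] = w
        graph.setdefault(b, {})[a] = w
    return graph


# Two tightly connected groups of terms joined by one weak edge (made-up weights).
graph = undirected([
    ("car", "engine", 1.0), ("car", "dealer", 1.0), ("engine", "dealer", 1.0),
    ("jungle", "prey", 1.0), ("jungle", "cat", 1.0), ("prey", "cat", 1.0),
    ("dealer", "cat", 0.1),
])
labels = chinese_whispers(graph)
clusters = {}
for node, label in labels.items():
    clusters.setdefault(label, set()).add(node)
print(list(clusters.values()))
```

On this toy graph the weak edge never outweighs the in-group edges, so the two groups end up with different labels.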
22. 2nd Order Co-occurrence Graph
[Figure: the entity is linked to Term1 through Term6; edges carry co-occurrence weights between 0.1 and 0.22; the terms are grouped into two clusters, Sense1 and Sense2.]
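Once the terms are grouped into senses, an occurrence of the entity can be assigned to the sense that shares the most weight with its context. The scoring rule below (sum of the weights of matching terms) is an assumption chosen for illustration, and the term names and weights are hypothetical; the slides do not state how the final assignment is made.

```python
def assign_sense(context_terms, senses):
    """Pick the sense whose weighted terms overlap most with the context terms."""
    scores = {
        name: sum(weight for term, weight in terms.items() if term in context_terms)
        for name, terms in senses.items()
    }
    best = max(scores, key=scores.get)
    # Return no sense at all when the context shares no term with any cluster.
    return (best if scores[best] > 0 else None), scores


# Hypothetical clusters in the {term: weight} form.
senses = {
    "Sense1": {"car": 0.2, "engine": 0.15, "dealer": 0.12},
    "Sense2": {"jungle": 0.22, "prey": 0.16, "cat": 0.2},
}
sense, scores = assign_sense({"drive", "car", "market"}, senses)
print(sense)  # Sense1
```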
31. Packages
numpy;
networkx for graph operations, in particular PageRank;
rdflib for queries and RDF;
requests for HTTP requests;
nltk for stemming and stopwords;
sklearn for classification and vectorization;
matplotlib, plotly for visualizations;
django for the showcase.
32. HyperLex
    import networkx as nx

    def hyperlex(G, co_score, th_hub):
        # Setup assumed here; the slide shows only the loop below.
        pr = nx.pagerank(G)
        ranking = sorted(pr.items(), key=lambda item: item[1], reverse=True)
        max_pr = ranking[0][1]
        left_nodes, hubs, clusters = set(G), [], []
        for node, score in ranking:
            pr_score = pr[node] / max_pr  # PageRank relative to the top node
            if pr_score > th_hub:
                original_neighbors = set(G.neighbors(node))
                left_neighbors = left_nodes & original_neighbors
                # Share of neighbors not yet claimed, scaled by the PageRank score
                # (set sizes are compared; the slide divided the sets themselves).
                value = len(left_neighbors) / max(len(original_neighbors), 1) * pr_score
                if value > th_hub:
                    # The node becomes a hub and claims its remaining neighbors.
                    left_nodes -= {node}
                    hubs.append(node)
                    cluster = {node: pr[node]}
                    for x in left_neighbors:
                        cluster[x] = pr[x] * co_score[x][node]
                    left_nodes -= set(cluster)
                    clusters.append(cluster)
        return hubs, clusters
33. Evaluation and Demo
Method       Macro average   Micro average
Baseline     0.447           0.324
Our method   0.784           0.821
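The slide does not say which measure is averaged, so the sketch below assumes per-entity accuracy to show how the two columns differ: the macro average is the mean of the per-entity scores, while the micro average pools all occurrences first. The counts are hypothetical and are not the data behind the table.

```python
def macro_micro(results):
    """results maps each entity to (correct, total) counts of disambiguated occurrences."""
    per_entity = [correct / total for correct, total in results.values()]
    macro = sum(per_entity) / len(per_entity)
    micro = (sum(correct for correct, _ in results.values())
             / sum(total for _, total in results.values()))
    return macro, micro


# Hypothetical counts for two entities.
results = {"jaguar": (8, 10), "coffee": (45, 90)}
macro, micro = macro_micro(results)
print(round(macro, 3), round(micro, 3))  # 0.65 0.53
```

The frequent entity dominates the micro average, which is why the two numbers can diverge.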
Demo: https://artem.semantic-web.at/wsd
Thank you!
⇒ poolparty.biz ⇐