This document provides information about new features and enhancements to patent search and analysis tools from Minesoft, including:
1. Citation Explorer module integrated with PatBase to allow better examination of forward and backward citations.
2. Legal status enhancements including new groups and indicator of patent family status.
3. Minesoft's new Textmine product allows chemical structure and keyword searching of chemicals extracted from patent texts.
4. Details provided on Textmine's search options, visualization of chemical instances in text, and linking capabilities.
II-SDV 2016 Michael Iarrobino - Improving Text Mining Results with Access to ...Dr. Haxel Consult
Life science companies increasingly rely on text mining to gain important insights from vast amounts of published information. But researchers struggle to get access to full-text articles for text mining. When they do get the full text they must contend with multiple formats and inconsistent license terms – all of which inhibit text mining efforts. In this presentation, we will describe the value in mining full-text scientific literature and outline the issues researchers face in accessing and licensing this content for commercial purposes. We will provide a walkthrough of Copyright Clearance Center’s (CCC) RightFind™ XML for Mining solution and contrast this with other approaches to solving these time-consuming content and licensing challenges. CCC is the parent organization of RightsDirect.
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...Dr. Haxel Consult
WIPO started work in the area of patent analytics in 2010 with a Development Agenda project on “Developing Tools for Access to Patent Information” which resulted in the production of a series of Patent Landscape Reports (WIPO’s patent landscape reports can be found here). These reports, prepared in cooperation with various UN Agencies, non-governmental organizations, research institutes and national IP Offices, analyze patent activity in various topics in the areas of public health, food and agriculture, environment and energy, and disabilities. The key findings are often summarized in an infographic.
In 2013 WIPO started working also on awareness raising and capacity building in the area of patent analytics. Apart from various workshops organized on this topic, WIPO published in September 2015 the “Guidelines for Preparing Patent Landscape Reports”. The Guidelines describe the objectives and motivations for preparing Patent Landscape Reports (PLR) and other types of patent analysis, the tasks associated with patent analytics, as well as the stages in the preparation of PLRs, providing also some insights from WIPO’s experience in the area.
Since 2015 WIPO is exploring open source tools for patent analytics purposes in the framework of the preparation of a Manual on Open Source Tools for Patent Analytics. Open source tools are typically used by other disciplines, usually business/data analysts, statisticians, IT professionals and scientists, rather than with regard to patent data. Nevertheless, in recent years they started emerging as an alternative and/or a complement to ready-to-use tools, providing flexibility and adaptability in different analysis types. In view of the necessary programming related to this type of tools, WIPO developed step-by-step instructions in the Manual with example datasets, and will provide capacity building activities with training on patent analytics for Technology and Innovation Technology Support Centers (TISCs) around the world (for more information on the TISC program please visit www.wipo.int/tisc) .
II-SDV 2016 Michael Iarrobino - Improving Text Mining Results with Access to ...Dr. Haxel Consult
Life science companies increasingly rely on text mining to gain important insights from vast amounts of published information. But researchers struggle to get access to full-text articles for text mining. When they do get the full text they must contend with multiple formats and inconsistent license terms – all of which inhibit text mining efforts. In this presentation, we will describe the value in mining full-text scientific literature and outline the issues researchers face in accessing and licensing this content for commercial purposes. We will provide a walkthrough of Copyright Clearance Center’s (CCC) RightFind™ XML for Mining solution and contrast this with other approaches to solving these time-consuming content and licensing challenges. CCC is the parent organization of RightsDirect.
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...Dr. Haxel Consult
WIPO started work in the area of patent analytics in 2010 with a Development Agenda project on “Developing Tools for Access to Patent Information” which resulted in the production of a series of Patent Landscape Reports (WIPO’s patent landscape reports can be found here). These reports, prepared in cooperation with various UN Agencies, non-governmental organizations, research institutes and national IP Offices, analyze patent activity in various topics in the areas of public health, food and agriculture, environment and energy, and disabilities. The key findings are often summarized in an infographic.
In 2013 WIPO started working also on awareness raising and capacity building in the area of patent analytics. Apart from various workshops organized on this topic, WIPO published in September 2015 the “Guidelines for Preparing Patent Landscape Reports”. The Guidelines describe the objectives and motivations for preparing Patent Landscape Reports (PLR) and other types of patent analysis, the tasks associated with patent analytics, as well as the stages in the preparation of PLRs, providing also some insights from WIPO’s experience in the area.
Since 2015 WIPO is exploring open source tools for patent analytics purposes in the framework of the preparation of a Manual on Open Source Tools for Patent Analytics. Open source tools are typically used by other disciplines, usually business/data analysts, statisticians, IT professionals and scientists, rather than with regard to patent data. Nevertheless, in recent years they started emerging as an alternative and/or a complement to ready-to-use tools, providing flexibility and adaptability in different analysis types. In view of the necessary programming related to this type of tools, WIPO developed step-by-step instructions in the Manual with example datasets, and will provide capacity building activities with training on patent analytics for Technology and Innovation Technology Support Centers (TISCs) around the world (for more information on the TISC program please visit www.wipo.int/tisc) .
ICIC 2017: Publication Analysis and Publication Strategy Dr. Haxel Consult
Dieter Küry (Novartis Pharma, Switzerland)
Using analytical methods are more and more replacing database searching in a knowledge manager's daily activities. In this presentation various facets of publication analysis will be presented and discussed. These new methods were applied for the analysis of publications in scientific journals and visuals were created to deduct publications strategies. On the technical side, the overall analysis process requires diverse tools for reference managing, text analysis and visualization. The impact on skills of the knowledge manager who moves from the expert for query languages to the expert for creation and maintaining of thesauri is also shown. Main benefit of the analytical methods compared to traditional database searching is the manifold use of results, which are easily adaptable to new requirements.
II-SDV 2016 Aalt van de Kuilen - The Art of Patent LandscapingDr. Haxel Consult
This presentation will give some guidelines on how to create a meaningful Patent Landscapes. Generating patent landscaping reports seems simple, but it isn’t. For making patent landscapes you have to take several different issues into consideration.
It’s important at the start to already have in mind what kind of landscape report you are going to prepare, and choose a topic of interest, but preferable not one that is too broad. It’s also extremely important to have a clean (80-90% relevance) dataset that the landscape is based on; otherwise the outcome will be rubbish. And of course, do not use landscapes for questions that require a legal opinion (like Freedom-to-operate conclusions!!). Patent landscapes are not aimed to be as precise as other patent searches.
Some more important issues has to be taken in account and are presented.
ICIC 2014 Chemical Patent Curation and Management – New Tools and Capabilities Dr. Haxel Consult
Understanding competitors’ patent portfolios and protecting their own intellectual properties are key questions for pharmaceutical companies. Extracting and analyzing the chemical space covered by these patents is an extremely complex and time consuming challenge and requires many communication rounds between IP experts and members of drug discovery teams. ChemAxon has been working with researchers in the industry to develop tools to help in this area by building and analyzing project specific databases based on high quality computer-assisted extraction of chemical information from patent documents. These databases can be useful across the full drug discovery process from idea generation to lead candidate selection, drug design and creation of new patents. This way we can eliminate these rounds of communication, because IP experts can precisely translate the content of patent documents to the language of chemistry which is more comprehensible to other actors. This presentation will discuss the results of this development and technologies developed or used, namely: English and Chinese Name to Structure, which dramatically speeds up the extraction process; Markush Editor that helps draw complex Markush structures more easily, Structure Checker and Markush Validation, which confirm the quality of extracted information. We will also introduce our search, enumeration and hit visualization and our latest improvements that allow overlap analysis between Markush structures.
This joint presentation will focus on a project between CENTREDOC and the ARMASUISSE Science and Technology Foresight program to set up an optimized Patent Landscape process. The talk will outline the major bottlenecks identified in the existing process, the solutions considered and implemented by CENTREDOC, as well as the results achieved by ARMASUISSE in its capacity to anticipate and get the necessary understanding of emerging technologies. As technologies can be considered independently of the domain of application, creating a contributory platform providing structured information about technologies is of common general interest at governmental and industrial level, both national and international.
II-SDV 2016 Aleksandar Kapisoda, Klaus Kater - Deep Web SearchDr. Haxel Consult
Boehringer Ingelheim has been developing dedicated Life Science SEARCHCORPORA for startups, scientific literature and news tracking based on the Web Data Analysis platform Deep SEARCH 9.
Using the Deep SEARCH 9 approach, Boehringer Ingelheim is capable of tapping directly any web resources like online websites data bases, web sites or news feeds.
Use case 1: SEARCHCORPUS® for life science startups:
We find startup information we could not find in public search engines.
Use case 2: Life science news SEARCHCORPUS®:
100s of incoming mails and alerts are processed every day and websites and articles behind the news tags are crawled automatically.
The purpose of these applications is that Scientists can subscribe to the services to have compilations of results of personalized deep searches sent to them automatically or that they can alternatively use faceted search on the life science SEARCHCORPORA interactively.
1st International Indian Patent Information Conference for Patent Searchers in Bangalore, India. 2nd - 3rd November 2017: Minsoft is a Platinum sponsor
II-SDV Emmanuelle Fortune - SMEs as Patent Applicants in France in 2014 Dr. Haxel Consult
SMEs represent a prime target for public authority awareness-raising policies especially as regards innovation and filing patents. Yet it is not always easy to get a handle on this population in terms of statistics, meaning that it is particularly difficult to systematically identify in the patent databases those SMEs that do file patent applications.
Two census of SMEs conducted in 1999 and 2007, organised jointly by Bpifrance and INPI, allow the INPI to yearly identify SMEs among the companies that filed a patent application at the French patent office.
This study reveals the importance of SMEs among the total patent applicant population. They show that in France, in 2014, SMEs represented 67% of the French corporate bodies that filed patent applications, but only accounted for 23% of the patent applications published in 2014. This share of SMEs and large companies in the patent applications of French corporate bodies are stable since 2011. The figures highlight the fact that in 2014, on average, an SME filed 1.4 patents, compared to 15.2 for a large company (more than 5,000 employees). We observe regional disparities: regions such as Alsace, Languedoc-Roussillon, Pays de la Loire and Poitou-Charentes are characterised by the highest share of patent application from SMEs. And SMEs are more specialised in medical technologies.
ICIC 2017: Publication Analysis and Publication Strategy Dr. Haxel Consult
Dieter Küry (Novartis Pharma, Switzerland)
Using analytical methods are more and more replacing database searching in a knowledge manager's daily activities. In this presentation various facets of publication analysis will be presented and discussed. These new methods were applied for the analysis of publications in scientific journals and visuals were created to deduct publications strategies. On the technical side, the overall analysis process requires diverse tools for reference managing, text analysis and visualization. The impact on skills of the knowledge manager who moves from the expert for query languages to the expert for creation and maintaining of thesauri is also shown. Main benefit of the analytical methods compared to traditional database searching is the manifold use of results, which are easily adaptable to new requirements.
II-SDV 2016 Aalt van de Kuilen - The Art of Patent LandscapingDr. Haxel Consult
This presentation will give some guidelines on how to create a meaningful Patent Landscapes. Generating patent landscaping reports seems simple, but it isn’t. For making patent landscapes you have to take several different issues into consideration.
It’s important at the start to already have in mind what kind of landscape report you are going to prepare, and choose a topic of interest, but preferable not one that is too broad. It’s also extremely important to have a clean (80-90% relevance) dataset that the landscape is based on; otherwise the outcome will be rubbish. And of course, do not use landscapes for questions that require a legal opinion (like Freedom-to-operate conclusions!!). Patent landscapes are not aimed to be as precise as other patent searches.
Some more important issues has to be taken in account and are presented.
ICIC 2014 Chemical Patent Curation and Management – New Tools and Capabilities Dr. Haxel Consult
Understanding competitors’ patent portfolios and protecting their own intellectual properties are key questions for pharmaceutical companies. Extracting and analyzing the chemical space covered by these patents is an extremely complex and time consuming challenge and requires many communication rounds between IP experts and members of drug discovery teams. ChemAxon has been working with researchers in the industry to develop tools to help in this area by building and analyzing project specific databases based on high quality computer-assisted extraction of chemical information from patent documents. These databases can be useful across the full drug discovery process from idea generation to lead candidate selection, drug design and creation of new patents. This way we can eliminate these rounds of communication, because IP experts can precisely translate the content of patent documents to the language of chemistry which is more comprehensible to other actors. This presentation will discuss the results of this development and technologies developed or used, namely: English and Chinese Name to Structure, which dramatically speeds up the extraction process; Markush Editor that helps draw complex Markush structures more easily, Structure Checker and Markush Validation, which confirm the quality of extracted information. We will also introduce our search, enumeration and hit visualization and our latest improvements that allow overlap analysis between Markush structures.
This joint presentation will focus on a project between CENTREDOC and the ARMASUISSE Science and Technology Foresight program to set up an optimized Patent Landscape process. The talk will outline the major bottlenecks identified in the existing process, the solutions considered and implemented by CENTREDOC, as well as the results achieved by ARMASUISSE in its capacity to anticipate and get the necessary understanding of emerging technologies. As technologies can be considered independently of the domain of application, creating a contributory platform providing structured information about technologies is of common general interest at governmental and industrial level, both national and international.
II-SDV 2016 Aleksandar Kapisoda, Klaus Kater - Deep Web SearchDr. Haxel Consult
Boehringer Ingelheim has been developing dedicated Life Science SEARCHCORPORA for startups, scientific literature and news tracking based on the Web Data Analysis platform Deep SEARCH 9.
Using the Deep SEARCH 9 approach, Boehringer Ingelheim is capable of tapping directly any web resources like online websites data bases, web sites or news feeds.
Use case 1: SEARCHCORPUS® for life science startups:
We find startup information we could not find in public search engines.
Use case 2: Life science news SEARCHCORPUS®:
100s of incoming mails and alerts are processed every day and websites and articles behind the news tags are crawled automatically.
The purpose of these applications is that Scientists can subscribe to the services to have compilations of results of personalized deep searches sent to them automatically or that they can alternatively use faceted search on the life science SEARCHCORPORA interactively.
1st International Indian Patent Information Conference for Patent Searchers in Bangalore, India. 2nd - 3rd November 2017: Minsoft is a Platinum sponsor
II-SDV Emmanuelle Fortune - SMEs as Patent Applicants in France in 2014 Dr. Haxel Consult
SMEs represent a prime target for public authority awareness-raising policies especially as regards innovation and filing patents. Yet it is not always easy to get a handle on this population in terms of statistics, meaning that it is particularly difficult to systematically identify in the patent databases those SMEs that do file patent applications.
Two census of SMEs conducted in 1999 and 2007, organised jointly by Bpifrance and INPI, allow the INPI to yearly identify SMEs among the companies that filed a patent application at the French patent office.
This study reveals the importance of SMEs among the total patent applicant population. They show that in France, in 2014, SMEs represented 67% of the French corporate bodies that filed patent applications, but only accounted for 23% of the patent applications published in 2014. This share of SMEs and large companies in the patent applications of French corporate bodies are stable since 2011. The figures highlight the fact that in 2014, on average, an SME filed 1.4 patents, compared to 15.2 for a large company (more than 5,000 employees). We observe regional disparities: regions such as Alsace, Languedoc-Roussillon, Pays de la Loire and Poitou-Charentes are characterised by the highest share of patent application from SMEs. And SMEs are more specialised in medical technologies.
The use of ontologies to aid in the development of text search queries, the quality and relevance ranking of results, and the
categorization of patents has been well characterized. Similar work has been performed on non-patent scientific literature such as journal articles. We present here the employment of common life science ontologies to search both the entirety of the patent and non-patent literature corpora at the same time. The results of these searches can be readily studied in a single unified search result that allows for the annotation of key patent and non-patent documents in a mixed data-type environment.
II-SDV 2016 Stefan Geißler Navigating complex information landscapes – Semant...Dr. Haxel Consult
Information that is relevant for researchers and decision makers in the Life Sciences comes from many different backgrounds: Scientific publications, patents, news, clinical reports, user-generated content, they all may be required to understand trends, opportunities and threats. A key to providing quick and comprehensive overview is having information from various source in one place and semantically enrich and normalize them and relate them to one another.
We present the key principles of a platform that serves that purpose and that provides users with insights into the scientific, clinical and competitive intelligence landscape of their respective area of interest. Forged in close collaboration with industry practitioners, the Luxid Biopharma Navigator is today used in production by hundreds of experts.
II-SDV 2016 Bob Stembridge We have all the Time in the World; a Review of ho...Dr. Haxel Consult
One of the key elements in understanding a technology sector or a competitor’s activities is to measure and detect any significant changes over time that may indicate a declining interest or a new hot emerging area.
But how do we spot the signal from the noise? What constitutes a significant change? This depends on how we measure change which in turn depends on the measure of time we use. In scientific literature, we have limited choice – publication date (but even that is changing with wide availability of electronic pre-prints). In patent literature, publication date provides a measure of when an invention is publicly disclosed, but priority date is perhaps a truer measure of when the invention was made. And is it better to look at individual dates, or use moving windows of time?
This presentation will consider these questions using a case study approach to determine the impacts and effectiveness of the different approaches.
This paper revisits some of the issues discussed in our 2013 presentation "Challenges in Visualizing Pharmaceutical Business Information," where we analyzed some of the unique challenges in visualizing competitive intelligence information for the pharmaceutical industry. A key challenge for pharmaceutical companies is to evaluate the competitive landscape for drug launches many years in the future, based on a combination of publicly available drug pipeline and clinical trials data and internal company knowledge. This information is often conveyed in hand-drawn PowerPoint slides, which are very time consuming to create and update as the competitive landscape changes. In this paper we'll discuss approaches to developing a toolkit to facilitate the analysis and visualization of competitive drug launch timelines, and then show how to apply the same tools to a different problem -- forecasting the patent expiration landscape.
II-SDV 2015 The International Information Conference on Search, Data Mining a...Dr. Haxel Consult
he II-SDV meeting takes place in Nice in April 2016 for an intensive two days. Venue is the Hotel Plaza in central Nice. The meeting provides an international forum for those in the field of advanced search applications, data and text mining, and visualization technology. The primary focus is on tools for intelligence and the meeting examines the requirements of specialists in scientific and technical information.
The meeting will be of interest to those who wish to update themselves and keep in touch with the leading edge of information search and analysis technologies; it features approximately 22 speakers for the two days. There will be an adjacent, focused exhibition to complement the conference programme.
II-SDV 2017 in Nice - The International Information Conference on Search, Dat...Dr. Haxel Consult
The 2017 II-SDV Conference in Nice, 24 - 25 April 2017
The II-SDV meeting takes place in Nice in April 2017 for an intensive two days. Venue is the Hotel Plaza in central Nice. The meeting provides an international forum for those in the field of advanced search applications, data and text mining, and visualization technology. The primary focus is on tools for intelligence and the meeting examines the requirements of specialists in scientific and technical information.
The meeting will be of interest to those who wish to update themselves and keep in touch with the leading edge of information search and analysis technologies; it features approximately 22 speakers for the two days. There will be an adjacent, focused exhibition to complement the conference programme.
PatSeer as a global Patent database is being used by IPR professionals from various domains ranging from Pharmaceutical, Chemical, Engineering, Law firms etc., and has full-text coverage of 19 authorities in addition to 102+ countries bibliographic coverage.
PatSeer Lite is a professional search edition well suited for search -> filter/narrow down results-> export type of projects.
The key benefits of on-demand access to PatSeer Lite are:
1. Most Flexible Subscription Plans: Available on a Daily, Monthly and Quarterly access (in addition to Annual)
2. Zero Upfront Spending: Get your user-id for free the first time and pay for access only when you need it.
3. Eliminate Guesswork on your expected workload: No long-term commitments or associated hassles!
4. Simplified on-demand activation anytime of the day: Credit-card based subscription purchase or renewal with automatic account activation
5. Professional Search Made Easy: Easy to use search forms suited for patent searchers, technology professionals and end-users
Efficient and Effective Patent Landscaping Using PatBase: a Case Study Dr. Haxel Consult
Key questions for any patent landscape analysis are:
Is this a growing area of interest?
What are the fields of current interest?
Who are the key players?
Delivered from two different perspectives, we will discuss the challenges in creating a patent landscape analysis and the features and functionality of PatBase which enable the efficient creation of clean and reliable patent landscape studies.
From the development of a comprehensive search strategy to the analysis and visualization of results, we will use a case study to demonstrate a number of features and techniques including
Thesauri to craft a comprehensive strategy.
Citation ranking to identify the most pertinent patents.
Advanced keyword highlighting to efficiently review large numbers of documents.
Use of the “Similar” function to identify related documents and
PatBase Analytics to:
Help build a comprehensive strategy and
Visualize the results of the landscape analysis.
The aim of any patent landscape analysis is to accurately identify and visualise results so that answers to key questions are quickly and easily found. This presentation will demonstrate how any user can benefit from the innovative features and functionality in PatBase to craft and visualize a meaningful patent landscape for any technical area.
II-PIC 2017: The Use of Patent Information for Innovation and Competitive Int...Dr. Haxel Consult
Greg Harrop-Griffiths (minesoft, UK)
Patent data is a critical source of information to stimulate innovation and for competitive intelligence. Patents are often the first and only source of disclosure of a new invention and hence, ignoring them will only delay innovation and give an incomplete competitive intelligence picture.
Delivered from the perspective of an experienced patent analyst, we will use case studies to describe the use of patent data to compile a competitive landscape, to stimulate innovation by learning from others and to help identify valuable IP in a portfolio. We will discuss the challenges in using patents for competitive intelligence and the recent innovative features and functionality in PatBase which can help, including:
Using thesauri, semantic and non-patent literature searching to compile a comprehensive competitive landscape
The use of Analytics for customised, multidimensional analysis and to visually compare multiple datasets.
Text-mining to automatically identify and highlight concepts within any full text patent.
Citation analysis to identify key competitors, collaborators or potential infringers.
This presentation will demonstrate how any user can benefit from the innovative features and functionality in PatBase to interrogate and visualize the competitive landscape for any technical area.
PatSeer Premier edition is a complete professional patent research package comprising an online global patent database and research platform with integrated analytics, project workflow, and collaboration capabilities. PatSeer Premier quickly exceeds current systems in its analytics, team collaboration and data sharing capabilities.
PatSeer is a fully-featured global patent database with powerful integrated analytics, project management, and collaboration capabilities. PatSeer includes 74 million full-text records and more than 115 million+ records across 104+ countries. It includes a rich search syntax with all the capabilities needed by professional patent searchers. With powerful filtering, multidimensional analysis, and collaboration capabilities PatSeer helps you get your patent projects done online with ease.
PatSeer Patent Database brings you a fresh Web 2.0 approach to searching, analyzing, comparing, collaborating, sharing and managing patent data projects. It’s simple, smart and serious enough to meet the needs of most demanding professional searchers too. PatSeer includes full text of 15 countries and Biblio data of 95+ countries to ensure that your patent search is comprehensive and reliable.
PatSeer can offer a multi-dimensional solution for the entire company’s patent project requirements:
- It creates a centralized work environment for your internal team to manage and work on patent data projects, carry out analysis and deliver insights
- It can be configured to share access to various projects with members across departments within the company that may need the insights or collaborate on a project
- It can be used to assign projects to external service providers for analysis, litigation analysis or other work while managing control, access and security at every level
- It can be used to urgently pull up, filter and analyze patents for quick insights needed to make immediate decisions from any location, meeting room or device with web access
- It creates a unified patent project management environment integrating all the various resources, functions and people involved in the process eliminating the inefficiencies and challenges usually faced by those who rely on and work with patent projects and data.
PatSeer provides a multiple advantage for service providers:
- It creates a centralized work environment for your team to manage and work on patent data projects, carry out analysis and deliver insights. Managers can assign projects to research associates and monitor progress.
- It doubles as a web based delivery platform for customers giving them a richer engaging experience than Excel, while helping you manage the quality of your deliverables
A wide range of permission settings give you complete control of what to make visible or editable to your customer and also allow you to engage other stakeholders of the project such as external counsel or senior management
- Reduced risk of failure – As compared to developing and maintaining inhouse platforms, with PatSeer, you are using a platform thats been developed with tried and tested practices and where continued product innovation ensures the platform meets markets needs today and in the future.
Both PatSeer Projects and PatSeer Premier have been architecturally designed keeping in mind the most critical needs of service providers while understanding each one can have their own unique requirements.
PatSeer Premier (as well as PatSeer Projects) offers a quick “sign in & get started” solution ready to use in minutes while extending extensive administrative controls that allow you to set up a work environment for patent projects as well as a collaborative online sharing delivery platform for your customers on your own terms, based on your specific requirements.
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...Dr. Haxel Consult
Knowledge Graphs are an increasingly relevant approach to store detailed knowledge in many domains. Recent advances in NLP allow to enrich Knowledge Graphs through automated analysis of large volumes of literature, reducing a lot the efforts in traditional manual information capturing. In our presentation we report the approach taken in a project with partner Fraunhofer SCAI in the life sciences where a knowledge graph organising detailed facts about psychiatric diseases has been computed.
Information of cause-effect relations between proteins, genes, drugs and diseases has been encoded in the BEL (Biological Expression Language) and imported into a Graph database to approach an indication-wide Knowledge Graph for the selected therapeutic area. Ultimately, updating the graph will amount to just rerunning the analysis on the newly published literature.
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...Dr. Haxel Consult
In 2019 the UK was the first major economy to embrace a legal obligation to achieve net zero carbon emissions by 2050. More broadly, the 2021 UK Innovation Strategy sets out the UK government’s vision to make the UK a global hub for innovation by 2035 with a target of increasing public and private sector R&D expenditure to 2.4% of GDP to support the UK being a science superpower with a world-class research and innovation system.
IP rights create an incentive for R&D which ultimately leads to innovation. Analysis and insights from IP data can therefore help provide a better understanding of how the IP system is being used and where and what innovation is taking place. Research and analysis of IP data is a key input to the ongoing work of the UKIPO’s Green Tech Working Group which seeks to:
further the UK’s status as a global leader by making the UK’s IP environment the best for innovating green technology;
develop and deliver IP policies to support government’s ambition on climate change and green technologies; and
to help innovators best protect and commercialise their green tech innovations both at home and internationally.
The UKIPO has been developing a broad portfolio of ‘green’ IP analytics research. A series of patent analytics reports have been published looking at green technologies, and analysis of how the UK’s Green Channel scheme for accelerated processing of green patent applications has been conducted. Patents have been used to identify technological comparative advantage within different green technologies at a country level, and new insights uncovered by mapping green technology patents to the UN Sustainable Development Goals (SDGs). Trade mark data provides a timeliness and closeness to market factor that patent data does not, and complementary trade mark analysis of UK ‘green’ trade marks, identified using a machine learning algorithm, provides a commercialisation angle to our research.
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...Dr. Haxel Consult
Word embeddings, deep learning, transformer models and other pre-trained neural language models (sometimes recently referred to as "foundational models") have fundamentally changed the way state-of-the-art systems for natural language processing and information access are built today. The "Data-to-Value" process methodology (Leidner 2013; Leidner 2022a,b) has been devised to embody best practices for the construction of natural language engineering solutions; it can assist practitioners and has also been used to transfer industrial insights into the university classroom. This talk recaps how the methodology supports engineers in building systems more consistently and then outlines the changes in the methodology to adapt it to the deep learning age. The cost and energy implications will also be discussed.
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...Dr. Haxel Consult
In the patent domain, all types of issues, from very specific search requirements to the linguistic characteristics of the text domain, are accentuated. Consequently, to develop patent text mining tools for scientists and patent experts, we need to understand their daily work tasks, as well as the linguistic character of the text genre (i.e., patentese). Patent text is a mixture of legal and domain-specific terms. In processing technical English texts, a multi-word unit method is often deployed as a word-formation strategy to expand the working vocabulary, i.e., introducing a new concept without the invention of an entirely new word. This productive word formation is a well-known challenge for traditional natural language processing tools utilizing supervised machine learning algorithms due to limited domain-specific training data. Deep learning technologies have been introduced to overcome the reduction in performance of traditional NLP tools. In the Artificial Researcher technologies, we have integrated explicit and implicit linguistic knowledge into the deep learning algorithms, essential for domain-specific text mining tools. In this talk, we will present a step-by-step process of how we have developed the mentioned text mining tools. For the final outline, we will also demonstrate how these tools can be integrated in a cross-genre passage retrieval system, based on a technology from 2016 that still holds the state-of-the-art within the patent text mining research community in 2022.
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...Dr. Haxel Consult
In 2013 we witnessed an evolutionary change in the NLP field evolved thanks to the introduction of space embeddings that, with the use of deep learning architectures, achieved human-level performances in many NLP tasks. With the introduction of the Attention mechanism in 2017 the results were further improved and, as result, embeddings are quickly becoming the de facto standards in solving many NLP problems. In this presentation, you will learn how generate and use space embedding for search purposes and provide comparison metrics to more traditional relevance-based search engines. Moreover, I will provide some initial results on a paper currently under review that provides an insight on hyperparameter tuning during the generation of embeddings.
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...Dr. Haxel Consult
10 years in the making. How real-world business cases have driven the development of CCC's deep search solutions, leading to the capabilities for web-crawling and delivery of targeted intelligence that helps R&D; intensive companies gain a competitive advantage.
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
Machine learning based patent categorization: A success story in monitoring a complex technology with high patenting activity
Susanne Tropf (Syngenta, Switzerland)
Kornel Marko (Averbis, Germany)
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
Machine learning based patent categorization: A success story in monitoring a complex technology with high patenting activity
Susanne Tropf (Syngenta, Switzerland)
Kornel Marko (Averbis, Germany)
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...Dr. Haxel Consult
It is relatively easy for a human to read a document and quickly figure out which concepts are important. However, this task is a difficult challenge for a machine. During the past few decades, there have been two main approaches for concept identification: Natural Language Processing and Machine Learning. During the early part of this century, Machine Learning made great strides as new techniques came into wider use (SVM’s, Topic Modeling, etc..). Sensing the competition, Natural Language Processing responded with deployment of new emerging techniques (sematic networks, finite state automata, etc..). Neither approach has completely solved the WHAT problem. Advances in Artificial Intelligence have the potential to significantly improve the situation. Where AI is making the most impact is as an enhancement to make Machine Learning and Natural Language Processing work better and, more importantly, work together. This presentation looks at some of this history and what might happen in the future when we blend the interpretation of language with pattern prediction.
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...Dr. Haxel Consult
Trademarks serve as key leading indicators for innovation and economic growth. As the vanguards of new and expanding enterprises, trademarks can be used to study entrepreneurship and shifting market demands in response to varying economic factors. This responsiveness has been seen as recently as the COVID-19 pandemic, where trademark research revealed key insights about business reaction to the global upheaval.
At CIPO, we have been delving more deeply than ever before into trademark analysis by leveraging cutting-edge natural language processing (NLP) tools to derive actionable business intelligence from trademark data. In this presentation, we present a survey of NLP in use at CIPO and the insights we have learned applying them. These insights include COVID-19 responses, line-of-business trends based on firm characteristics, and more.
We also discuss ongoing and future trademark research projects at CIPO. These projects include emerging technology detection methods and high-resolution trademark classification systems. We conclude that artificial intelligence-enhanced tools like NLP are key components of future exploitation of trademark data for business and economic intelligence.
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...Dr. Haxel Consult
In our customer projects involving automated document processing, we often encounter document types providing crucial data in the form of tables. While established text analytics algorithms are usually optimized to operate on running text, they tend to produce rather poor results on tables as they do not capture the non-sequential relations inside them (e.g. interpret the content of a table cell relative to its column title, interpret line breaks inside a cell differently from line breaks between cells or rows). While there are elaborate information extraction products in the market for a few highly specific types of tabular documents, there is no general approach out there. The main cause for this is the fact that table structures can be encoded by a heterogenous range of layout means (e.g. column boundaries can be signaled by lines vs. aligned text vs. white space). In this talk, we will illustrate several solutions that we have developed for a range of challenges occurring in this context, both for scanned and digitally generated documents.
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...Dr. Haxel Consult
Most scientific journals request, that the complete set of research data is published simultaneously with the peer-reviewed paper. The publication of the research data usually is carried out as so-called "Supplementary Material", attached to the original paper, or on a "Research Data Repository". Both forms have in common, that the data is published usually unstructured and not in an uniform machine processable format. This makes its further use in electronic tools for AI or data mining unnecessarily difficult or even impossible. A concept is presented, in which the data is digitally recorded, following the principle of FAIR data, as part of the publication process. This digital capture makes the data available to the scientific community for easy use in data mining and AI tools. The data in the repository contains links to the publication to document its origin. The concept is applicable for preprints, peer-review papers, diploma and doctoral theses and is particularly suitable for open access publications. Moreover, the presentation highlights correspondent activities, which were released in scientific publications recently.
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...Dr. Haxel Consult
How do you find video when you only have sparse data? While you can wander the stacks (if you can still find open stacks) for inspiration, video either physical or digital, is difficult to discover. Wandering the virtual stacks is, well, virtually impossible. Discovery platforms on the whole have not replicated the inspirational experience of wandering the stacks.
More companies are using archivable video for internal communication of the various research projects, product developments, test results, and more that are being considered, in progress, or completed. Showing how an experiment was conducted can convey considerably more information that is very difficult to communicate via text. How do you find a company video that might be helpful for your project?
A case study is presented of the problems and the solutions that were implemented by a large, multinational chemical company. A suite of content discovery technologies was used including a video to text to tagging system connected to their documents database and automatically indexed using several chemical as well as conceptual systems (rule-based, NLP, inference engine). To build the system and support the manuscript and video submission there is a metadata extraction program which pulls and inserts the metadata into the submission forms so the author can move quickly through that process.
Copyright Clearance Center
A pioneer in voluntary collective licensing, CCC (Copyright Clearance Center) helps organizations integrate, access, and share information through licensing, content, software, and professional services. With expertise in copyright and information management, CCC and its subsidiary RightsDirect collaborate with stakeholders to design and deliver innovative information solutions that power decision-making by helping people integrate and navigate data sources and content assets. CCC recently acquired the assets and technology of Deep SEARCH 9 (DS9), a knowledge management platform that leverages machine learning to help customers perform semantic search, tag content, and discover new insights.
Lighthouse IP is the world’s leading provider of intellectual property content. The core business of Lighthouse IP is sourcing and creating content from the world’s most challenging authorities. Specialized in IP data, Lighthouse IP provides over 160 countries coverage for patents, over 200 authorities for trademarks and over 90 authorities for designs. Lighthouse IP data is available via several partners. The company is headquartered in Schiphol-Rijk in the Netherlands and has offices in the United States, China, Thailand, Vietnam, Egypt, Indonesia and Belarus. Globally a team of 150 experts works on the creation of this unique data collection.
CENTREDOC was created in 1964 as the technical information center of the swiss watchmaking industry. Building on a strong team of engineers, CENTREDOC now offers a complete range of services and solutions for the monitoring of strategic, technological and competitive information. CENTREDOC is also a leader in the research of patent, technical and business intelligence, and offers consulting expertise in the implementation of monitoring solutions.
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...Dr. Haxel Consult
The everyday use of AI-driven algorithms for data search, analysis and synthesis comes with important time savings, but also reveals the need to understand and accept the limitations of the technology. Practical deployments on concrete topics are inevitable to assess and manage the challenges of neuronal network based AI. A workshop report.
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...Dr. Haxel Consult
What if there was a platform where literature, conference abstracts, patents, clinical trials, news, grants and other sources were fully integrated? What if the data would be harmonized, enriched with standardized concepts and ready for analysis? After building our patent analytics platform we didn’t stop dreaming and built our big data analytics platform by semantically integrating text-rich, scientific sources. In my presentation I will talk about what we built and why we built it. And, of course, I will also address the challenges and hurdles along the way. Was it worth it and what comes next? Let’s talk about it!
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBrad Spiegel Macon GA
Brad Spiegel Macon GA’s journey exemplifies the profound impact that one individual can have on their community. Through his unwavering dedication to digital inclusion, he’s not only bridging the gap in Macon but also setting an example for others to follow.
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024APNIC
Ellisha Heppner, Grant Management Lead, presented an update on APNIC Foundation to the PNG DNS Forum held from 6 to 10 May, 2024 in Port Moresby, Papua New Guinea.
1.Wireless Communication System_Wireless communication is a broad term that i...JeyaPerumal1
Wireless communication involves the transmission of information over a distance without the help of wires, cables or any other forms of electrical conductors.
Wireless communication is a broad term that incorporates all procedures and forms of connecting and communicating between two or more devices using a wireless signal through wireless communication technologies and devices.
Features of Wireless Communication
The evolution of wireless technology has brought many advancements with its effective features.
The transmitted distance can be anywhere between a few meters (for example, a television's remote control) and thousands of kilometers (for example, radio communication).
Wireless communication can be used for cellular telephony, wireless access to the internet, wireless home networking, and so on.
# Internet Security: Safeguarding Your Digital World
In the contemporary digital age, the internet is a cornerstone of our daily lives. It connects us to vast amounts of information, provides platforms for communication, enables commerce, and offers endless entertainment. However, with these conveniences come significant security challenges. Internet security is essential to protect our digital identities, sensitive data, and overall online experience. This comprehensive guide explores the multifaceted world of internet security, providing insights into its importance, common threats, and effective strategies to safeguard your digital world.
## Understanding Internet Security
Internet security encompasses the measures and protocols used to protect information, devices, and networks from unauthorized access, attacks, and damage. It involves a wide range of practices designed to safeguard data confidentiality, integrity, and availability. Effective internet security is crucial for individuals, businesses, and governments alike, as cyber threats continue to evolve in complexity and scale.
### Key Components of Internet Security
1. **Confidentiality**: Ensuring that information is accessible only to those authorized to access it.
2. **Integrity**: Protecting information from being altered or tampered with by unauthorized parties.
3. **Availability**: Ensuring that authorized users have reliable access to information and resources when needed.
## Common Internet Security Threats
Cyber threats are numerous and constantly evolving. Understanding these threats is the first step in protecting against them. Some of the most common internet security threats include:
### Malware
Malware, or malicious software, is designed to harm, exploit, or otherwise compromise a device, network, or service. Common types of malware include:
- **Viruses**: Programs that attach themselves to legitimate software and replicate, spreading to other programs and files.
- **Worms**: Standalone malware that replicates itself to spread to other computers.
- **Trojan Horses**: Malicious software disguised as legitimate software.
- **Ransomware**: Malware that encrypts a user's files and demands a ransom for the decryption key.
- **Spyware**: Software that secretly monitors and collects user information.
### Phishing
Phishing is a social engineering attack that aims to steal sensitive information such as usernames, passwords, and credit card details. Attackers often masquerade as trusted entities in email or other communication channels, tricking victims into providing their information.
### Man-in-the-Middle (MitM) Attacks
MitM attacks occur when an attacker intercepts and potentially alters communication between two parties without their knowledge. This can lead to the unauthorized acquisition of sensitive information.
### Denial-of-Service (DoS) and Distributed Denial-of-Service (DDoS) Attacks
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesSanjeev Rampal
Talk presented at Kubernetes Community Day, New York, May 2024.
Technical summary of Multi-Cluster Kubernetes Networking architectures with focus on 4 key topics.
1) Key patterns for Multi-cluster architectures
2) Architectural comparison of several OSS/ CNCF projects to address these patterns
3) Evolution trends for the APIs of these projects
4) Some design recommendations & guidelines for adopting/ deploying these solutions.
This 7-second Brain Wave Ritual Attracts Money To You.!nirahealhty
Discover the power of a simple 7-second brain wave ritual that can attract wealth and abundance into your life. By tapping into specific brain frequencies, this technique helps you manifest financial success effortlessly. Ready to transform your financial future? Try this powerful ritual and start attracting money today!
3. Patent Families
Analytics
Quality Control
Fast Search
Legal Status
Review
Alerts
• 31 Full Text Collections
• 52 Million Families
• 106 Issuing Authorities
• IPC, CPC US and JP classes
• Quality Controlled content
• Normalised data
5. Citation Enhancements
• New Module Citation Explorer, integrated to PatBase
• Allows for better examination of forward
and backwards citations
• Side by Side view
• Annotate and Flag Citations
• X, Y, citations categories displayable and searchable
for more concise search and review
• Monitor and track Forward citations with Citetracker
• Stay Alerted to new patents citing a particular set of documents
• New citation command available (CTBX, CTBY etc..)
5
6.
7. Legal Status enhancements
7
• New Legal Status Groups Added
• LSAL “Appeal”
• LSRX “Reexamination”
• Integrate the Legal Status View into your
own internal applications with the PatBase API
8. • New status indicator in the family table
• Alive or Dead based on Legal Status
• Enable this option in your user settings
• Faster review of the whole Patent Family
• Faster review during FTO searches
• Can now also be exported
in excel, word, pdf etc..
New status indicator
8
9. • NEW Minesoft product
• Database of chemical entities extracted from full text of patents
• Structure and keyword searchable
• Coverage:
• >12m unique chemicals from >10m full text documents
• English text: WO, EP, US, GB, AU, IL and IN
• Non-Latin text: JP, CN, KR (extracted from original text not from MT)
• French and German text: FR, DE, EP
• US images from 2001 to date (~40% increase in recall)
• Updated daily
• Seamlessly linked to PatBase & PatBase Express
9
13. All the terms
are identified,
grouped,
categorised and
clickable
Textmine offers a unique FT visualisation
14. Upcoming PatBase Roadshows
London: 11th May
Basel: 19th May
Utrecht: 24th May
Paris: 14th June
Munich: 16th June
Madrid: 21st June
Barcelona: 23rd June
Dusseldorf: 24th June