Sequence Search/Comparison/Analysis
Stephen Allen, Solutions Consultant
Country Coverage
World’s Largest Sequence Database

Authority   Document Count   Sequence Count   Database
USA                320,873      215,305,722   Gold+
EPO                108,362       37,883,488   Gold+
WIPO               144,292       74,293,342   Gold+
Japan              104,108       27,355,841   Gold+
China               78,683        1,029,562   Platinum
India                6,446           69,071   Platinum
Canada              57,671       24,026,839   Gold+
Brazil               2,134           39,001   Platinum
Others              81,148        3,913,808   Gold+
Total              903,717      383,916,674
https://www.gqlifesciences.com/genomequest/capabilities-features/
GQ Gold+ vs Platinum
Traditional (Gold+ and Platinum)
• All patent ST.25 listings from US, EPO, WIPO, Korea, Japan
Traditional and Manual Curation (Gold+ and Platinum)
• GQ-Pat sequences (including non-ST.25) from US, EPO, WIPO, Korea, Japan, plus the following authorities: AT, AU, BE, CA, CH, DE, ES, FR, GB, LU, NL, NO, TW
Traditional and Manual Curation (Platinum only)
• BRIC country documents (CN, BR, IN, RU) plus emerging country documents
Features (Gold+ and Platinum)
• Extended Legal Status (ELS)
• Normalized Patent Assignee; Parent
• Unique Family Sequence (UFS)
Features (Platinum only)
• Access to PDF downloads
• Family Portrait Report
Results
Results Pre & Post filtering
560K sequences → Filter → 2K sequences
Getting to Your Results
ALGORITHMS
• Searches can be run broadly and inclusively by selecting the right algorithm and a few basic settings
FILTERS
• Broad searches can be narrowed quickly based on homology data, legal status, and many other criteria
VIEWS
• Views let you tailor the display with specific columns and intelligent grouping
Search Setup
https://www.gqlifesciences.com/blog/category/genepast/
Filters
• Filter your search based on specific legal status,
homology, authority, or many other categories
• Save your favorite, frequently used filters
• Save multiple filters: different filters for different searches
• Filters are categorized for fast access
• Categories include alignment properties, subject text, subject dates, subject properties, etc.
• Filters reduce reported hits based on your criteria
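The filtering idea above can be sketched in a few lines of Python. This is only an illustration of the concept, not GenomeQuest's implementation: the `Hit` fields, the `apply_filter` function, the saved-filter name, and the sample records are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    # Hypothetical hit record; field names are assumptions for this sketch.
    patent_id: str
    authority: str
    legal_status: str
    subject_pct_identity: float


def apply_filter(hits, min_identity=0.0, authorities=None, legal_statuses=None):
    """Return only the hits that satisfy every supplied criterion."""
    kept = []
    for hit in hits:
        if hit.subject_pct_identity < min_identity:
            continue
        if authorities is not None and hit.authority not in authorities:
            continue
        if legal_statuses is not None and hit.legal_status not in legal_statuses:
            continue
        kept.append(hit)
    return kept


# A "saved" filter is just a named set of criteria that can be reused.
SAVED_FILTERS = {
    "granted_us_high_identity": dict(
        min_identity=95.0, authorities={"US"}, legal_statuses={"granted"}
    ),
}

hits = [
    Hit("US-1", "US", "granted", 100.0),
    Hit("US-2", "US", "pending", 100.0),
    Hit("EP-3", "EP", "granted", 98.0),
    Hit("US-4", "US", "granted", 80.0),
]

filtered = apply_filter(hits, **SAVED_FILTERS["granted_us_high_identity"])
print([hit.patent_id for hit in filtered])
```

Keeping the criteria in a dictionary mirrors the idea of saving several filters and picking a different one for each search.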
Views & Grouping
• Choose how to display your data on Results page
• Tailored views are also used for Excel Table Export
• Add Columns to View with Display List
• Display fields are similar to filter fields
• Display categories similar to filter categories
• Save your favorite, frequently used views
• Save multiple views: different views for different searches
• Group based on specific criteria: Patent ID, Patent Family, Patent Assignee
• Display all records in a group, or a subset for streamlined analysis
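Grouping can be pictured as bucketing hit records by a chosen field and optionally showing only the first few records of each bucket. The sketch below assumes hits are plain dictionaries; the `group_hits` function, the field names, and the sample data are hypothetical.

```python
from collections import defaultdict


def group_hits(hits, key, max_per_group=None):
    """Bucket hits by the value of `key`; optionally keep a subset per group."""
    groups = defaultdict(list)
    for hit in hits:
        groups[hit[key]].append(hit)
    if max_per_group is not None:
        return {name: members[:max_per_group] for name, members in groups.items()}
    return dict(groups)


# Hypothetical hit records.
hits = [
    {"patent_id": "EP-A", "patent_family": "F1", "assignee": "Acme"},
    {"patent_id": "US-B", "patent_family": "F1", "assignee": "Acme"},
    {"patent_id": "JP-C", "patent_family": "F2", "assignee": "Beta"},
]

by_family = group_hits(hits, "patent_family")
top_per_family = group_hits(hits, "patent_family", max_per_group=1)

for family, members in by_family.items():
    print(family, [member["patent_id"] for member in members])
```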
Details and Alignments
LifeQuest
Consolidated Sequence & Text Searching
Filter with LQ markup
• Filter by stars
• Filter by color
Sample Workflow
• Sequence search
• Filter
• Export results to LQ
• Mark to distinguish sequence searches
• LQ text search
• ((ttl_abst_clm:IL-17*^5 OR ttl_abst_clm:IL17*^5) AND antibod*)
• Mark to distinguish text search
• Unite!
• Filter
• Highlight key hits
• Export
• Filter within Excel
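The "mark, then unite" step of this workflow can be sketched as a set union that remembers where each patent came from. The patent IDs, the `unite` function, and the mark labels are hypothetical; this is not how LifeQuest stores its markup.

```python
def unite(sequence_hits, text_hits):
    """Merge two result sets, recording which search(es) found each patent."""
    marks = {}
    for patent_id in sequence_hits:
        marks.setdefault(patent_id, set()).add("sequence")
    for patent_id in text_hits:
        marks.setdefault(patent_id, set()).add("text")
    return marks


# Hypothetical patent IDs returned by each search.
sequence_hits = {"P1", "P2"}
text_hits = {"P2", "P3"}

united = unite(sequence_hits, text_hits)

# Patents found by both searches are natural candidates to highlight.
key_hits = sorted(pid for pid, found_by in united.items()
                  if found_by == {"sequence", "text"})
print(sorted(united), key_hits)
```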
Non Sequence IP
Claims & Alignments
Quickly add columns
Post Filtering
Post Filter sequence searches, text searches, or combined searches
Additional Linkouts
Contact us at:
Stephen.Allen@aptean.com
Ellen.Sherin@aptean.com
Bill Perkins@Aptean.com
Questions?
LifeQuest
• Unite Sequence Based & Text Based Searches
• Create Virtual Sequence Database from LQ Results
Nested – Savable Filters
Complex Boolean filters
• Nested filters for fine tuning
• Save standard filters for easy application
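One way to picture nested Boolean filters is as small predicates combined with AND, OR, and NOT. The sketch below is an assumption about the general idea only; the combinator names (`field`, `all_of`, `any_of`, `negate`), the field names, and the hit records are hypothetical.

```python
def field(name, test):
    """Build a predicate that applies `test` to one field of a hit."""
    return lambda hit: test(hit[name])


def all_of(*predicates):
    return lambda hit: all(p(hit) for p in predicates)


def any_of(*predicates):
    return lambda hit: any(p(hit) for p in predicates)


def negate(predicate):
    return lambda hit: not predicate(hit)


# Nested filter: full query coverage AND (US OR EP) AND NOT expired.
saved_filter = all_of(
    field("query_pct_coverage", lambda v: v >= 100),
    any_of(
        field("authority", lambda v: v == "US"),
        field("authority", lambda v: v == "EP"),
    ),
    negate(field("legal_status", lambda v: v == "expired")),
)

# Hypothetical hit records.
hits = [
    {"patent_id": "H1", "query_pct_coverage": 100, "authority": "US", "legal_status": "granted"},
    {"patent_id": "H2", "query_pct_coverage": 100, "authority": "JP", "legal_status": "granted"},
    {"patent_id": "H3", "query_pct_coverage": 100, "authority": "EP", "legal_status": "expired"},
    {"patent_id": "H4", "query_pct_coverage": 80, "authority": "US", "legal_status": "granted"},
    {"patent_id": "H5", "query_pct_coverage": 100, "authority": "EP", "legal_status": "pending"},
]

kept = [hit["patent_id"] for hit in hits if saved_filter(hit)]
print(kept)
```

Because the combined filter is an ordinary value, it can be stored under a name and reapplied to later searches.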
Alerts: See what’s new
Supplementary Slides
Please contact stephen.allen@aptean.com with any questions
Alignment Types
LOCAL ALIGNMENT
Part of the Query matches part of the Subject. BLAST, FASTA, and Smith-Waterman.
GLOBAL ALIGNMENT
All of the Query matches all of the Subject. Needleman-Wunsch and algorithms like it.
BEST FIT ALIGNMENT
All of the Query is fitted into the Subject. GenePAST. Ideal for patent sequence searching.
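The difference between the three alignment types shows up in how the ends of the sequences are treated. The sketch below is a textbook dynamic-programming scorer with a simple linear gap penalty, written for illustration only; it is not the GenePAST, BLAST, or FASTA implementation, and the scoring values and the `align_score` name are assumptions.

```python
def align_score(query, subject, mode, match=1, mismatch=-1, gap=-1):
    """Score query against subject.

    mode "global": both sequences aligned end to end (Needleman-Wunsch style).
    mode "local":  best-scoring pair of substrings (Smith-Waterman style).
    mode "fit":    the whole query against any stretch of the subject.
    """
    n, m = len(query), len(subject)
    # Row 0: only global alignment pays for skipping leading subject residues.
    prev = [j * gap if mode == "global" else 0 for j in range(m + 1)]
    best_local = 0
    for i in range(1, n + 1):
        # Column 0: global and fit must account for every query residue.
        cur = [0 if mode == "local" else i * gap]
        for j in range(1, m + 1):
            pair = match if query[i - 1] == subject[j - 1] else mismatch
            score = max(prev[j - 1] + pair, prev[j] + gap, cur[j - 1] + gap)
            if mode == "local":
                score = max(score, 0)  # a local alignment may restart anywhere
                best_local = max(best_local, score)
            cur.append(score)
        prev = cur
    if mode == "global":
        return prev[m]
    if mode == "fit":
        return max(prev)  # trailing subject residues are free
    return best_local


subject = "XXABCXX"
print(align_score("ABC", subject, "fit"))      # 3: query fits inside the subject
print(align_score("ABC", subject, "global"))   # -1: four subject residues must be gapped
print(align_score("QABC", subject, "local"))   # 3: the unmatched Q is simply dropped
print(align_score("QABC", subject, "fit"))     # 2: the unmatched Q is penalized
```

The last two lines show why a fit-style score is useful for short queries: a local alignment silently ignores the part of the query that does not match, while a fit alignment reports it.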
Query/Subject % Identity Definition
Alignment % identity, corrected for the ratio of the alignment length to either the query or subject length.

Subject % ID   Query % ID   Subject % Coverage   Query % Coverage   Note
100%           100%         100%                 100%
100%           50%          100%                 50%
50%            100%         50%                  100%
50%            50%          50%                  50%
95%            95%          100%                 100%               5 mismatches

This example assumes 100% alignment identity (except the last row, which has 5 mismatches); the longer lines are 100 residues, the shorter lines are 50 residues.
• By filtering for 100% subject coverage you can capture CDR-to-CDR matches
• With variability, % ID can drop, so % coverage is the preferable filter
• This is a key feature to understand: these filters are very powerful
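The definition above can be checked with a little arithmetic. The sketch below assumes a gap-free alignment, so the alignment length equals the number of query residues and subject residues it covers; the function and parameter names are hypothetical.

```python
def identity_and_coverage(identical, alignment_length, query_length, subject_length):
    """Compute query/subject % identity and % coverage for one alignment.

    Assumes a gap-free alignment (an assumption of this sketch).
    """
    alignment_pct_id = 100.0 * identical / alignment_length
    return {
        # Alignment % identity, corrected by alignment length / sequence length.
        "subject_pct_id": alignment_pct_id * (alignment_length / subject_length),
        "query_pct_id": alignment_pct_id * (alignment_length / query_length),
        "subject_pct_coverage": 100.0 * alignment_length / subject_length,
        "query_pct_coverage": 100.0 * alignment_length / query_length,
    }


# The five cases from the slide: (identical, alignment, query, subject) lengths.
cases = [
    (100, 100, 100, 100),  # equal lengths, fully aligned
    (50, 50, 100, 50),     # 50-residue subject inside a 100-residue query
    (50, 50, 50, 100),     # 50-residue query inside a 100-residue subject
    (50, 50, 100, 100),    # 50-residue overlap between two 100-residue sequences
    (95, 100, 100, 100),   # full-length alignment with 5 mismatches
]

for case in cases:
    print(identity_and_coverage(*case))
```

In the third case a short CDR-like query sits inside a longer subject: query % identity and query % coverage stay at 100% while the subject values drop to 50%, which is why the choice between query-side and subject-side filters matters.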
Key Fields
Legal Status
Extended Legal Status And National Phase Legal Status
US PAIR Legal Status
• PAIR legal status: updates from US PAIR occur monthly
Live Links to Reports, Alignments
• Links on the analysis page carry over to Excel reports
• Simple, easy sharing among groups
Short sequences need GenePAST or Motif searches (BLAST may miss patents)
• For short Query sequences, or for easy analysis of variants, GenePAST is the preferred algorithm.
MOTIF on full length – Direct Strike
The long sequence gives hits comprising all three CDRs in the specific order provided. “.*” represents “any number of unspecified residues, including zero”. Motif searches require a 100% match in “defined” residues.
>37-motif
DLSIH.*GFDPQDGETIYAQKFQG.*GSSSSWFDP
>9-motif
RASQGISSWLA.*GASNLES.*QQANSFPWT
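The ".*" notation in these motifs behaves like the same construct in ordinary regular expressions, so the matching rule can be tried out with Python's `re` module. This only illustrates the pattern semantics, not the GenomeQuest motif engine; the subject sequences and the framework filler residues below are invented for the example.

```python
import re

# The first motif from the slide: three CDRs in a fixed order.
motif_37 = "DLSIH.*GFDPQDGETIYAQKFQG.*GSSSSWFDP"
cdr1, cdr2, cdr3 = "DLSIH", "GFDPQDGETIYAQKFQG", "GSSSSWFDP"


def motif_hits(motif, subject):
    """True if the motif occurs anywhere in the subject sequence."""
    return re.search(motif, subject) is not None


# Hypothetical subjects; the filler between the CDRs is made up.
in_order = "AAAA" + cdr1 + "WVRQ" + cdr2 + "RVTM" + cdr3 + "WGQG"
adjacent = cdr1 + cdr2 + cdr3                   # ".*" also matches zero residues
wrong_order = cdr3 + "WVRQ" + cdr2 + "RVTM" + cdr1
one_change = in_order.replace("DLSIH", "DLSIY")  # one defined residue altered

for name, subject in [("in_order", in_order), ("adjacent", adjacent),
                      ("wrong_order", wrong_order), ("one_change", one_change)]:
    print(name, motif_hits(motif_37, subject))
```

Only the first two subjects match: the order of the CDRs is enforced, and a single change in a defined residue removes the hit, which is the case where a variation-tolerant search such as GenePAST is the better choice.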
Unique Family Sequence UFS
• Merge all identical sequences within a family
• Based on strict criteria: identical sequence, patent family, sequence length
• Examine a sequence’s status across authorities
• Group By UFS can replace group by family for finer resolution of unique hits
• UFS Identifier = MD5Sum + Sequence Length + Family ID
• UFS IDs can be transient
Normalized Sequence/Patent Family
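The identifier recipe above (MD5 sum, sequence length, family ID) can be sketched with Python's `hashlib`. The way the three parts are joined, the upper-casing of the sequence, and the family IDs are assumptions for this example; the real UFS format is not given in the slides.

```python
import hashlib


def ufs_id(sequence, family_id):
    """Build a UFS-style key from MD5 sum, sequence length, and family ID.

    The separator and the normalization step are assumptions of this sketch.
    """
    normalized = sequence.strip().upper()
    digest = hashlib.md5(normalized.encode("ascii")).hexdigest()
    return f"{digest}-{len(normalized)}-{family_id}"


# Identical sequences in the same family collapse to one identifier.
a = ufs_id("RASQGISSWLA", "FAM1")
b = ufs_id("rasqgisswla", "FAM1")
# The same sequence in another (hypothetical) family stays separate.
c = ufs_id("RASQGISSWLA", "FAM2")

print(a == b, a == c)
```

Because the key depends on the family ID, regrouping of patent families would change the identifier, which is consistent with the note that UFS IDs can be transient.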
Methodology – Searching CDRs
All 3 CDRs (or primer/amplicon sets) in subject or patent
MOTIF: exact match
GenePAST: variations
By requiring a group size equal to three in the post-search grouping, we show patents that contain all three CDRs
• Submitting FASTA sequences for your search allows multiple queries at once
• GenePAST lets you view patent hits with variability in the CDRs
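The "group size equal to three" rule can be illustrated by grouping hits per patent and keeping only the patents hit by every query. The hit tuples, patent IDs, and function name below are hypothetical.

```python
from collections import defaultdict


def patents_with_all_queries(hits, required_queries):
    """Return patents whose hits cover every required query."""
    queries_by_patent = defaultdict(set)
    for patent_id, query_name in hits:
        queries_by_patent[patent_id].add(query_name)
    required = set(required_queries)
    return sorted(pid for pid, found in queries_by_patent.items()
                  if required <= found)


# Hypothetical (patent, query) hit pairs from a three-CDR search.
hits = [
    ("WO-1", "CDR1"), ("WO-1", "CDR2"), ("WO-1", "CDR3"),
    ("US-2", "CDR1"), ("US-2", "CDR3"),
    ("EP-3", "CDR2"), ("EP-3", "CDR2"),
]

print(patents_with_all_queries(hits, ["CDR1", "CDR2", "CDR3"]))
```

Counting distinct queries rather than raw hits matters here: EP-3 has two hits, but both come from the same CDR.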
Conservative Substitutions
Subjects comprising all 3 CDRs
Up to 1 substitution
Subject and Query Gaps
• Gaps in CDRs and primers can be ignored using the Query/Subject gap filter
• Variations, i.e. the number of differences, can be adjusted without calculating % identity
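Filtering on the number of differences rather than on % identity can be sketched as a simple position-by-position comparison. The example reuses one CDR from the motif slide; the variant sequences and function names are made up, and gaps are deliberately left out of this sketch.

```python
def substitutions(query, subject_region):
    """Count mismatched positions between two equal-length sequences."""
    if len(query) != len(subject_region):
        raise ValueError("this sketch compares equal-length, gap-free regions only")
    return sum(1 for q, s in zip(query, subject_region) if q != s)


def within_variation(query, subject_region, max_differences=1):
    """True if the subject region differs from the query in at most N positions."""
    return substitutions(query, subject_region) <= max_differences


cdr = "GASNLES"
print(substitutions(cdr, "GASNLES"), within_variation(cdr, "GASNLES"))
print(substitutions(cdr, "GASNLQS"), within_variation(cdr, "GASNLQS"))
print(substitutions(cdr, "GTSNLQS"), within_variation(cdr, "GTSNLQS"))
```

For a 7-residue CDR a single substitution already lowers identity to about 86%, so a fixed difference count is easier to reason about than a percentage threshold.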
Database Selection
Tree Structure and Virtual Databases
• Tree structure allows easy database search setup
• Multiple virtual databases can be chosen
• Virtual databases can be shared among teams
• Save your own databases from keyword or IP searches, and search within results
Patent Statistics Report
• For multiple queries, quickly display patents that contain all or a subset of the queries
GenomeQuest 101
