SlideShare a Scribd company logo
©2014 Cambridge Semantics Inc. All rights reserved.
Introduction to Anzo Unstructured
June 29, 2016
Richard Mallah
Director of Unstructured and Advanced Analytics
richard@cambridgesemantics.com
©2013 Cambridge Semantics Inc. All rights reserved. Page 2.
Agenda
• Anzo Unstructured and the Anzo Smart Data Platform
• Core Capabilities of Anzo Unstructured
• Configuration, Operations, and Output
• Example Use Cases in Pharma and Finance
• Exploring Document-Derived Analytics
• Visualizing Additional Annotators and Capabilities
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 3.
Introduction to Cambridge Semantics (CSI)
The Anzo Smart Data Platform is used to create data analytics and
management solutions with diverse data from varied sources
Company:
 Founded in 2007 by senior team from IBM’s Advanced Internet Technology Group
 Privately Funded
 Select customers:
Software:
 Market leading Anzo software suite is built on open Semantic Web standards
 Currently 3rd generation of the product in production use
ApplicationsMiddlewareEnterprise
DataFabric
Anzo.js
Client Library
Anzo Enterprise Server
(SOA; OSGI, RDF & OWL over JMS)
Anzo.Net
Client Library
Anzo .java/.Net
Client Library Anzo Relational Replicator
Reasoning
& Rules
Workflow
Semantic
Services
Anzo
Connect
Enterprise
Directory Connect
Anzo
Unstructured
Anzo for Excel Applications and BI ToolsAnzo on the Web
Anzo
Graph
Database
Anzo
Content
Repository
RDBMS
Data Mart/
Warehouse
Enterprise
Applications
Directory
(LDAP, AD)
• Virtualize data using W3C
semantic standards
• Operationalize industry
standards e.g., FIBO, LEI
• Real-time data events
• Granular security and access
control
• Ontology, Mapping,
Visualization & Service registries
Rich Client Apps
………
 Full/Incremental ETL
 Web Services
 Federated SPARQL
 NLP
 Text Analytics
 Semantic Analysis
3rd Party
Databases &
Applications External Data Sources
Unstructured
Content
 RDBMS
 Teradata
 Hadoop
 SalesForce
The Anzo Smart Data Platform
©2015 Cambridge Semantics Inc. All rights reserved.
Anzo Smart Data Lake
Anzo Smart Data Lake Server
Anzo Enterprise Server
• Self-service analytics,
visualization and data discovery
• Data curation, annotation and
application workflow
• MPP graph query engine for
interactive analytics at scale
• ODATA Integration for 3rd party
analytics tools
• Metadata, ontology and
mapping catalog
• Model-driven data provisioning
and loading
• Text analytics
• Canonical entity linking and
transformation
• Scalable Graph and Document
Storage
Anzo Graph Query Engine
Anzo Ingestion Servers
Anzo Unstructured
©2014 Cambridge Semantics Inc. All rights reserved. Page 6.
Anzo Ontology Editor
What Solutions Benefit From Anzo?
• For aggregation of data from multiple, diverse data sources
• For integration of internal data with external data across the Web or
firewalls
• For solutions involving data sources, business rules, analytics and
actions that are not evident in advance
• For solutions that change often
• For analyzing diverse data sources with a diverse variety of access
control requirements with a need for full provenance and traceability
• For evolving solutions benefiting from ongoing involvement from
domain experts to update data models, data sources, and analytics as
needed
• For formal and informal day-to-day business activities that require
workflow, alerts, and automation
• For collecting & analyzing data that doesn’t currently have any system
of record (e.g. “shadow IT” systems)
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 8.
Anzo Unstructured Capabilities
Overview
• Intake Sources
– Social Media
– Local Directories
– Enterprise CMSs
– Structured Databases
– Web Sites & Boards
– Spreadsheets
– Google Search Appliance
– Mail Servers
– + dozens more
• File Formats
– Office Documents
– PDFs
– Web Pages
– Email Messages
– + dozens more
• Multilingual
– European, Asian, and Middle Eastern Languages
– Native-Language Annotation
– Document Translation
– Annotation Translation
– Phonetic Name Normalization/Indexing
– Cross-Lingual Concepts Automapped
• Extraction Categories
– Entities
– Relationships
– Granular Sentiment
– Topic Classification
– Patterns and Concepts
– and more
• Concept Types Extracted
– MedicalHistoryAilment
– LegalStatuteSection
– BiomarkerForDisease
– AnalystEarningsEstimate
– JobTitle
– SentimentTopic
– + thousands more
– + easily user-extended/customized
• Semantic Analysis
– Concept-Based Relationships
– Relationship Compounding
– Annotation Harmonization
– Multi-NLP Weighting/Voting
– Ontology Growing
– Ontology Alignment
• Semantic Search
– Concept-Based Full-Text Search
– Facet On Concept or Type
– Mix Structured & Unstructured Filters
– Visualize Annotations In Context
– External Index Federation
– Multi-Stage Searching/Filtering/Clustering
• Structured/Unstructured Integration
– Find/link structured resources in text
– Analyze text within structured columns
– Populate new structured resources
from text
– Auto-enrich entities found in unstructured
– Auto-extend schemas from unstructured
properties
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 9.
Anzo Unstructured NLP Plugins
Overview
• Anzo Unstructured is both a pluggable framework supporting a
large number of ready-made third-party NLP integrations, and also
has significant NLP capabilities bundled along with it
– Plugins on the following pages are a small number of our many supported
NLP capabilities from a variety of sources
• Among the annotators include out of the box are:
– Autotagger and Classifier Annotator (Statistical, can fall back to rule-based)
– Autotagger and Classifier Annotator (Rule-Based, can fall back to statistical)
– Standard Entity Extractors (People, companies, locations, job titles, dates, etc.)
– Custom Knowledgebase Annotator (Lever your taxonomies, thesauri, databases)
– Fuzzy Rule Network Annotator (Find concepts by related, surrounding, contextual concepts)
– Significant Phrase Annotator (Automatically extracts the important concepts)
– Document Section Annotator (Autogenerate table of contents and contextualize more)
– Pattern Annotators (Find part no., id no., statute section, or any custom pattern)
– Custom Relationship Annotator (Find events or relationships spanning different extractions)
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 10.
Optional NLP Plugin Technology Partners
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 11.
Semantic Post-Processing of NLP
• Harmonization
– Normalized formats for knowledge integration
• Cooperation
– Multiple annotators strengthen, correct, and increase the network effect
of relationships
• Probabilistic Reasoning
– Semantic knowledge integration includes both deduction and inference
• Filtering
– The set of concepts, overlaps, affects, and relationships can be
automatically filtered down to reduce noise
• Enrichment
– Web services, semantic services, internal and external databases and
knowledgebases, and pluggable computations can be used to add more
context and data to your new domain object
• Machine Learning and Predictive Analytics
– Train on some gold standard and do some supervised classification
– Incrementally build a conceptual cluster space for predictive analytics
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 12.
Point and Click Configuration of Unstructured Pipelines
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 13.
Point and Click Configuration of Annotation
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 14.
Unstructured Pipeline Operations Monitor
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 15.
Dashboarding Structured/Unstructured
Knowledge Integration
Structured
property
Multiple NLP
Technologies
Harmonized
Overlapping
annotations
Enriched
property
Unstructured
entity
Unstructured
relationship
Archived copy for review,
validation & provenance
(both HTML Format &
Original )
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 16.
The CSI Semantic Knowledge Integration Approach
to Enterprise Text Analytics
• Use Multiple NLP Engines or Annotators
• Leverage a Knowledge Integration Platform
– Make the annotators cooperate
– Enrich the annotations with internal or external data
– Link annotations with existing structured data
– Filter them down to the most relevant set
– Harmonize ontologies and instances
– Deal with probabilistic or uncertain information
• Quality Control
– Manual curation and automated QC
– Workflow, provenance lineage
• Easily Deal with Data Changes and Schema Changes
– Both are dealt with in real-time at runtime
– Maintenance is orders of magnitude more efficient
Use Cases in Pharma
• PV & Safety Data Management - Automatic tagging of case reports with
customized curation workflow, text mining, and contextual search
• R&D Competitive Intelligence – Explore the competitive landscape for
Therapeutic Area, Indication, Target, Company, Compound, & Partners
• R&D Informatics– Understand and correlate your internal research and
how it may be related to any external developments or research
• Clinical Trial Site Selection and Optimization - Site selection, KOL search,
trial planning
• Scientific Affairs/Medical Science Liaisons - Track Key Opinion Leaders
(KOL) in literature and clinical trials & analyze feedback from medical
professionals and patients
• Information Landscape - Track and monitor data stewardship and usage
through the organization to drive more efficient usage.
• Commercial Analytics – Sales and Marketing, Rx Data, Text Analytics
Use Cases in Financial Services
• Compliance Policy & Procedure Management - Monitor structured and
unstructured data sources for relevant regulatory changes; have
collaborative workflows for policy & documentation development,
approval, and control; and establish targeted policy dissemination and
attestation workflows.
• Compliance Surveillance & Investigation– Legal and Compliance analysts
can create structures and views that provide analysis, rules, and alert
thresholds easily changed on-the-fly by investigators, who can then
comprehend and interact with the big data picture.
• Market and Customer Intelligence- Understand how clients and prospects
are thinking about your firm and competitors’ offerings
• Research - Automated analytics of news, chatter, IMs, secondary research
reports, emails, sentiment, etc. for research alerts, semantic search, and
relationship visualization, forming an integrated intelligence platform for
analysts, including Complex Event Processing.
• Information Landscape - Track and monitor data stewardship and usage
through the organization to drive more efficient usage.
• Commercial Analytics – Sales and Marketing, Tx Data, Text Analytics
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 19.
Point and Click Configuration of Annotation
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 20.
Point and Click Configuration of Annotation
©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 21
Relationship Explorer
Find Unexpected Connections Between Companies | Follow Paths Out or In From Anything | Follow the Money
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 22.
Incremental Semantic Overlays: Product, Brand, Offering
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 23.
Semantic Correspondence Linking and Overlay
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 24.
Asking Cross-Ontology Questions
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 25.
Cross-Ontology Questions Meet The Network Effect
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 26.
Multi-Ontology Knowledge Graph Exploration
©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 27
Deep News View
Customizable Fundamental, Technical, and Thematic Filters | View Only Most Recent n Minutes | Semantic Search
©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 28
Rapid Concept Drilldown
GPS for Concepts | Assisted Skimming | Interactive Annotation-Driven Navigation | Auto-translates Foreign Languages
©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 29
Example: Customizable Stock Centric Surveillance
Dashboards Per Stock | Per Cohort | Per Industry | Per Custom Sector | Analyst Can Define Filters and Drilldowns
©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 30
Example: Competitor Sentiment Comparison
Longitudinal | Sentiment Aggregation | By Cohort | From Single Stock Selection | Visualize Leaders and Followers
©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 31
Example: Intraday Sentiment
Drill Down | Intraday Topic-Granular Sentiment | Attribute Price Action Drivers | Investigate Unusual Volumes
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 32.
Longitudinal and Outlier Business Intelligence
Unstructured Data Becomes Structured
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 33.
Anzo Unstructured NLP Plugins
CSI Web Scraper Annotator
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 34.
Contextual Semantic Overlay
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 35.
I1
I2 I3
E1
E2
I4
I1
I2 I3
E1
E2
I4
I1
I2 I3
E1
E2
I4
Main
Pipeline
Purple
Helper
Pipeline
Green
Helper
Pipeline
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 36.
Fuzzy Concept Matching Example: Skills
Understanding and Recognition in Semantic Search
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 37.
Fuzzy Concept Matching Example: Skills
Concept Curation
• Use Excel to define each skill concept with any combination of methods
• Multiple values are comma-separated
• Patterns support wildcards, y within n words of x, and intuitive groupings
• Define more atomic concepts before more compound concepts
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 38.
Anzo Unstructured NLP Plugins
CSI Document Classifier
©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 39
Indirect Filters on Domain-Specific Summaries
Auto-Summarization | Extensive Filters | Integration with Multiple Sources of News and Research | Assisted Reader
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 40.
Cross-Lingual Annotation and Optional Translation
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 41.
Multiple Languages, One Concept
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 42.
In Situ Translation and Annotation
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 43.
Automated Redaction
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 44.
Anzo Unstructured NLP Plugins
CSI Significant Phrase Annotator
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 45.
Anzo Unstructured NLP Plugins
CSI Custom Relationship Annotator
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 46.
Anzo Unstructured NLP Plugins
Linguamatics I2E Annotator, Biomarkers for Diseases
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 47.
Anzo Unstructured NLP Plugins
SciBite Termite Annotator
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 48.
Anzo Unstructured NLP Plugins
Lexalytics Salience
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 49.
Simplified Views for Non-Technical Users
Semantic Search Made Easy
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 50.
Anzo Unstructured Capabilities
APIs and SDK
Create new pipeline components for any of these tiers:
– Document Crawler / Listener
• Obtain documents of any format from any source
– Document Rich Text, Thumbnail, and Metadata Extraction
• Deal with custom or less-common file formats completely pluggably
– Document Format Cleansing and Transformation
• Remove unwanted artifacts specific to your documents or translate to a particular
format or language
– Full-Text Indexing
• Pluggable corpus-level indexing and search
– Annotator
• Already supports GATE, UIMA, and FrAU annotation frameworks
• Provides access to annotations from any other annotator, cleansed text, format-
analyzed document, and original file, supporting mixed-representation annotation
• Multithreading safe
– Semantic Postprocessor
• Recombine, filter, and restructure annotations
©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 51.
Click here to view the full webinar

More Related Content

What's hot

Accelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data FabricAccelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data Fabric
Cambridge Semantics
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Cambridge Semantics
 
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data FabricUsing Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Cambridge Semantics
 
Building A Self Service Analytics Platform on Hadoop
Building A Self Service Analytics Platform on HadoopBuilding A Self Service Analytics Platform on Hadoop
Building A Self Service Analytics Platform on Hadoop
Craig Warman
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Cambridge Semantics
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data Democratization
Cambridge Semantics
 
Modern Data Discovery and Integration in Insurance
Modern Data Discovery and Integration in InsuranceModern Data Discovery and Integration in Insurance
Modern Data Discovery and Integration in Insurance
Cambridge Semantics
 
Should a Graph Database Be in Your Next Data Warehouse Stack?
Should a Graph Database Be in Your Next Data Warehouse Stack?Should a Graph Database Be in Your Next Data Warehouse Stack?
Should a Graph Database Be in Your Next Data Warehouse Stack?
Cambridge Semantics
 
Sustainability Investment Research Using Cognitive Analytics
Sustainability Investment Research Using Cognitive AnalyticsSustainability Investment Research Using Cognitive Analytics
Sustainability Investment Research Using Cognitive Analytics
Cambridge Semantics
 
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Cambridge Semantics
 
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Cambridge Semantics
 
Going Beyond Rows and Columns with Graph Analytics
Going Beyond Rows and Columns with Graph AnalyticsGoing Beyond Rows and Columns with Graph Analytics
Going Beyond Rows and Columns with Graph Analytics
Cambridge Semantics
 
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningRisk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Cambridge Semantics
 
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
Cambridge Semantics
 
Semantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational DatabasesSemantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational Databases
Cambridge Semantics
 
Harnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie MacHarnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie Mac
DataWorks Summit
 
Automating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge BaseAutomating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge Base
Vaticle
 
A Dynamic Data Catalog for Autonomy and Self-Service
A Dynamic Data Catalog for Autonomy and Self-ServiceA Dynamic Data Catalog for Autonomy and Self-Service
A Dynamic Data Catalog for Autonomy and Self-Service
Denodo
 
Necessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services SectorNecessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services Sector
DataWorks Summit
 
BigData Analysis
BigData AnalysisBigData Analysis

What's hot (20)

Accelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data FabricAccelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data Fabric
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
 
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data FabricUsing Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
 
Building A Self Service Analytics Platform on Hadoop
Building A Self Service Analytics Platform on HadoopBuilding A Self Service Analytics Platform on Hadoop
Building A Self Service Analytics Platform on Hadoop
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data Democratization
 
Modern Data Discovery and Integration in Insurance
Modern Data Discovery and Integration in InsuranceModern Data Discovery and Integration in Insurance
Modern Data Discovery and Integration in Insurance
 
Should a Graph Database Be in Your Next Data Warehouse Stack?
Should a Graph Database Be in Your Next Data Warehouse Stack?Should a Graph Database Be in Your Next Data Warehouse Stack?
Should a Graph Database Be in Your Next Data Warehouse Stack?
 
Sustainability Investment Research Using Cognitive Analytics
Sustainability Investment Research Using Cognitive AnalyticsSustainability Investment Research Using Cognitive Analytics
Sustainability Investment Research Using Cognitive Analytics
 
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
 
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
 
Going Beyond Rows and Columns with Graph Analytics
Going Beyond Rows and Columns with Graph AnalyticsGoing Beyond Rows and Columns with Graph Analytics
Going Beyond Rows and Columns with Graph Analytics
 
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningRisk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
 
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
 
Semantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational DatabasesSemantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational Databases
 
Harnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie MacHarnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie Mac
 
Automating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge BaseAutomating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge Base
 
A Dynamic Data Catalog for Autonomy and Self-Service
A Dynamic Data Catalog for Autonomy and Self-ServiceA Dynamic Data Catalog for Autonomy and Self-Service
A Dynamic Data Catalog for Autonomy and Self-Service
 
Necessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services SectorNecessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services Sector
 
BigData Analysis
BigData AnalysisBigData Analysis
BigData Analysis
 

Viewers also liked

PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
Daniel Westzaan
 
Appistry WGDAS Presentation
Appistry WGDAS PresentationAppistry WGDAS Presentation
Appistry WGDAS Presentation
elasticdave
 
Data: The Good, The Bad & The Ugly
Data: The Good, The Bad & The UglyData: The Good, The Bad & The Ugly
Data: The Good, The Bad & The Ugly
SciBite Limited
 
Visualization 101 BA4All
Visualization 101 BA4AllVisualization 101 BA4All
Visualization 101 BA4All
Jos van Dongen
 
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
SciBite Limited
 
A Journey to Modern Apps with Containers, Microservices and Big Data
A Journey to Modern Apps with Containers, Microservices and Big DataA Journey to Modern Apps with Containers, Microservices and Big Data
A Journey to Modern Apps with Containers, Microservices and Big Data
Edward Hsu
 
Lambda at Weather Scale - Cassandra Summit 2015
Lambda at Weather Scale - Cassandra Summit 2015Lambda at Weather Scale - Cassandra Summit 2015
Lambda at Weather Scale - Cassandra Summit 2015
Robbie Strickland
 
Data Scientist 101 BI Dutch
Data Scientist 101 BI DutchData Scientist 101 BI Dutch
Data Scientist 101 BI Dutch
Jos van Dongen
 
Hi Speed Datawarehousing
Hi Speed DatawarehousingHi Speed Datawarehousing
Hi Speed Datawarehousing
Jos van Dongen
 
SnappyData overview NikeTechTalk 11/19/15
SnappyData overview NikeTechTalk 11/19/15SnappyData overview NikeTechTalk 11/19/15
SnappyData overview NikeTechTalk 11/19/15
SnappyData
 
Database Shootout: What's best for BI?
Database Shootout: What's best for BI?Database Shootout: What's best for BI?
Database Shootout: What's best for BI?
Jos van Dongen
 
Always On: Building Highly Available Applications on Cassandra
Always On: Building Highly Available Applications on CassandraAlways On: Building Highly Available Applications on Cassandra
Always On: Building Highly Available Applications on Cassandra
Robbie Strickland
 
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch AnalysisNoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
Helena Edelson
 
Scalable On-Demand Hadoop Clusters with Docker and Mesos
Scalable On-Demand Hadoop Clusters with Docker and MesosScalable On-Demand Hadoop Clusters with Docker and Mesos
Scalable On-Demand Hadoop Clusters with Docker and Mesos
DataWorks Summit
 
Online Analytics with Hadoop and Cassandra
Online Analytics with Hadoop and CassandraOnline Analytics with Hadoop and Cassandra
Online Analytics with Hadoop and Cassandra
Robbie Strickland
 
Streaming Big Data & Analytics For Scale
Streaming Big Data & Analytics For ScaleStreaming Big Data & Analytics For Scale
Streaming Big Data & Analytics For Scale
Helena Edelson
 
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
Helena Edelson
 
[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...
[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...
[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...
Insight Technology, Inc.
 
SnappyData, the Spark Database. A unified cluster for streaming, transactions...
SnappyData, the Spark Database. A unified cluster for streaming, transactions...SnappyData, the Spark Database. A unified cluster for streaming, transactions...
SnappyData, the Spark Database. A unified cluster for streaming, transactions...
SnappyData
 

Viewers also liked (19)

PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
 
Appistry WGDAS Presentation
Appistry WGDAS PresentationAppistry WGDAS Presentation
Appistry WGDAS Presentation
 
Data: The Good, The Bad & The Ugly
Data: The Good, The Bad & The UglyData: The Good, The Bad & The Ugly
Data: The Good, The Bad & The Ugly
 
Visualization 101 BA4All
Visualization 101 BA4AllVisualization 101 BA4All
Visualization 101 BA4All
 
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
 
A Journey to Modern Apps with Containers, Microservices and Big Data
A Journey to Modern Apps with Containers, Microservices and Big DataA Journey to Modern Apps with Containers, Microservices and Big Data
A Journey to Modern Apps with Containers, Microservices and Big Data
 
Lambda at Weather Scale - Cassandra Summit 2015
Lambda at Weather Scale - Cassandra Summit 2015Lambda at Weather Scale - Cassandra Summit 2015
Lambda at Weather Scale - Cassandra Summit 2015
 
Data Scientist 101 BI Dutch
Data Scientist 101 BI DutchData Scientist 101 BI Dutch
Data Scientist 101 BI Dutch
 
Hi Speed Datawarehousing
Hi Speed DatawarehousingHi Speed Datawarehousing
Hi Speed Datawarehousing
 
SnappyData overview NikeTechTalk 11/19/15
SnappyData overview NikeTechTalk 11/19/15SnappyData overview NikeTechTalk 11/19/15
SnappyData overview NikeTechTalk 11/19/15
 
Database Shootout: What's best for BI?
Database Shootout: What's best for BI?Database Shootout: What's best for BI?
Database Shootout: What's best for BI?
 
Always On: Building Highly Available Applications on Cassandra
Always On: Building Highly Available Applications on CassandraAlways On: Building Highly Available Applications on Cassandra
Always On: Building Highly Available Applications on Cassandra
 
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch AnalysisNoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
 
Scalable On-Demand Hadoop Clusters with Docker and Mesos
Scalable On-Demand Hadoop Clusters with Docker and MesosScalable On-Demand Hadoop Clusters with Docker and Mesos
Scalable On-Demand Hadoop Clusters with Docker and Mesos
 
Online Analytics with Hadoop and Cassandra
Online Analytics with Hadoop and CassandraOnline Analytics with Hadoop and Cassandra
Online Analytics with Hadoop and Cassandra
 
Streaming Big Data & Analytics For Scale
Streaming Big Data & Analytics For ScaleStreaming Big Data & Analytics For Scale
Streaming Big Data & Analytics For Scale
 
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
 
[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...
[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...
[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...
 
SnappyData, the Spark Database. A unified cluster for streaming, transactions...
SnappyData, the Spark Database. A unified cluster for streaming, transactions...SnappyData, the Spark Database. A unified cluster for streaming, transactions...
SnappyData, the Spark Database. A unified cluster for streaming, transactions...
 

Similar to Introduction to Anzo Unstructured

Sdl use cases
Sdl use casesSdl use cases
Sdl use cases
John Rueter
 
Advanced Use Cases for Analytics Breakout Session
Advanced Use Cases for Analytics Breakout SessionAdvanced Use Cases for Analytics Breakout Session
Advanced Use Cases for Analytics Breakout Session
Splunk
 
Oracle analytics cloud overview feb 2017
Oracle analytics cloud overview   feb 2017Oracle analytics cloud overview   feb 2017
Oracle analytics cloud overview feb 2017
aioughydchapter
 
Webinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment PerformanceWebinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Lucidworks
 
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav MisraFrom Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
Molly Alexander
 
KIT601 Unit I.pptx
KIT601 Unit I.pptxKIT601 Unit I.pptx
KIT601 Unit I.pptx
LBSIMDS, Lucknow
 
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
Dr. Haxel Consult
 
Groundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search WebinarGroundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search Webinar
Concept Searching, Inc
 
Winning with data
Winning with dataWinning with data
Winning with data
NUS-ISS
 
Lingustic Harmony in the Tower of Babel
Lingustic Harmony in the Tower of BabelLingustic Harmony in the Tower of Babel
Lingustic Harmony in the Tower of Babel
Ann Kelly
 
Data Analytics and Big Data on IoT
Data Analytics and Big Data on IoTData Analytics and Big Data on IoT
Data Analytics and Big Data on IoT
Shivam Singh
 
How to Apply Your Taxonomy to Your Content Automatically
How to Apply Your Taxonomy to Your Content AutomaticallyHow to Apply Your Taxonomy to Your Content Automatically
How to Apply Your Taxonomy to Your Content Automatically
Access Innovations, Inc.
 
Azure_Purview.pdf
Azure_Purview.pdfAzure_Purview.pdf
Azure_Purview.pdf
hija7
 
Introduction to Business and Data Analysis Undergraduate.pdf
Introduction to Business and Data Analysis Undergraduate.pdfIntroduction to Business and Data Analysis Undergraduate.pdf
Introduction to Business and Data Analysis Undergraduate.pdf
AbdulrahimShaibuIssa
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologies
enterprisesearchmeetup
 
How To Implement Engineering Search Within Your Organization Webinar
How To Implement Engineering Search Within Your Organization WebinarHow To Implement Engineering Search Within Your Organization Webinar
How To Implement Engineering Search Within Your Organization Webinar
Concept Searching, Inc
 
Data Science for Retail Broking
Data Science for Retail BrokingData Science for Retail Broking
Data Science for Retail Broking
AlgoAnalytics Financial Consultancy Pvt. Ltd.
 
Data Science for Retail Broking
Data Science for Retail BrokingData Science for Retail Broking
Data Science for Retail Broking
AlgoAnalytics Financial Consultancy Pvt. Ltd.
 
Bioschemas Workshop
Bioschemas WorkshopBioschemas Workshop
Bioschemas Workshop
Niall Beard
 

Similar to Introduction to Anzo Unstructured (20)

Sdl use cases
Sdl use casesSdl use cases
Sdl use cases
 
Advanced Use Cases for Analytics Breakout Session
Advanced Use Cases for Analytics Breakout SessionAdvanced Use Cases for Analytics Breakout Session
Advanced Use Cases for Analytics Breakout Session
 
Oracle analytics cloud overview feb 2017
Oracle analytics cloud overview   feb 2017Oracle analytics cloud overview   feb 2017
Oracle analytics cloud overview feb 2017
 
Webinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment PerformanceWebinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment Performance
 
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav MisraFrom Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
 
KIT601 Unit I.pptx
KIT601 Unit I.pptxKIT601 Unit I.pptx
KIT601 Unit I.pptx
 
MECBOT
MECBOTMECBOT
MECBOT
 
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
 
Groundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search WebinarGroundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search Webinar
 
Winning with data
Winning with dataWinning with data
Winning with data
 
Lingustic Harmony in the Tower of Babel
Lingustic Harmony in the Tower of BabelLingustic Harmony in the Tower of Babel
Lingustic Harmony in the Tower of Babel
 
Data Analytics and Big Data on IoT
Data Analytics and Big Data on IoTData Analytics and Big Data on IoT
Data Analytics and Big Data on IoT
 
How to Apply Your Taxonomy to Your Content Automatically
How to Apply Your Taxonomy to Your Content AutomaticallyHow to Apply Your Taxonomy to Your Content Automatically
How to Apply Your Taxonomy to Your Content Automatically
 
Azure_Purview.pdf
Azure_Purview.pdfAzure_Purview.pdf
Azure_Purview.pdf
 
Introduction to Business and Data Analysis Undergraduate.pdf
Introduction to Business and Data Analysis Undergraduate.pdfIntroduction to Business and Data Analysis Undergraduate.pdf
Introduction to Business and Data Analysis Undergraduate.pdf
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologies
 
How To Implement Engineering Search Within Your Organization Webinar
How To Implement Engineering Search Within Your Organization WebinarHow To Implement Engineering Search Within Your Organization Webinar
How To Implement Engineering Search Within Your Organization Webinar
 
Data Science for Retail Broking
Data Science for Retail BrokingData Science for Retail Broking
Data Science for Retail Broking
 
Data Science for Retail Broking
Data Science for Retail BrokingData Science for Retail Broking
Data Science for Retail Broking
 
Bioschemas Workshop
Bioschemas WorkshopBioschemas Workshop
Bioschemas Workshop
 

More from Cambridge Semantics

Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...
Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...
Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...
Cambridge Semantics
 
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Cambridge Semantics
 
The Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge GraphThe Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge Graph
Cambridge Semantics
 
Introduction to RDF*
Introduction to RDF*Introduction to RDF*
Introduction to RDF*
Cambridge Semantics
 
AnzoGraph DB - SPARQL 101
AnzoGraph DB - SPARQL 101AnzoGraph DB - SPARQL 101
AnzoGraph DB - SPARQL 101
Cambridge Semantics
 
Healthcare and Life Sciences: Two Industries Separated by Common Data
Healthcare and Life Sciences: Two Industries Separated by Common DataHealthcare and Life Sciences: Two Industries Separated by Common Data
Healthcare and Life Sciences: Two Industries Separated by Common Data
Cambridge Semantics
 
Knowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceKnowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data Science
Cambridge Semantics
 
Scalable, Fast Analytics with Graph - Why and How
Scalable, Fast Analytics with Graph - Why and HowScalable, Fast Analytics with Graph - Why and How
Scalable, Fast Analytics with Graph - Why and How
Cambridge Semantics
 
Accelerate Pharma R&D with Cross-Study Analytics
Accelerate Pharma R&D with Cross-Study AnalyticsAccelerate Pharma R&D with Cross-Study Analytics
Accelerate Pharma R&D with Cross-Study Analytics
Cambridge Semantics
 
Large Scale Graph Analytics with RDF and LPG Parallel Processing
Large Scale Graph Analytics with RDF and LPG Parallel ProcessingLarge Scale Graph Analytics with RDF and LPG Parallel Processing
Large Scale Graph Analytics with RDF and LPG Parallel Processing
Cambridge Semantics
 

More from Cambridge Semantics (10)

Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...
Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...
Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...
 
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
 
The Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge GraphThe Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge Graph
 
Introduction to RDF*
Introduction to RDF*Introduction to RDF*
Introduction to RDF*
 
AnzoGraph DB - SPARQL 101
AnzoGraph DB - SPARQL 101AnzoGraph DB - SPARQL 101
AnzoGraph DB - SPARQL 101
 
Healthcare and Life Sciences: Two Industries Separated by Common Data
Healthcare and Life Sciences: Two Industries Separated by Common DataHealthcare and Life Sciences: Two Industries Separated by Common Data
Healthcare and Life Sciences: Two Industries Separated by Common Data
 
Knowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceKnowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data Science
 
Scalable, Fast Analytics with Graph - Why and How
Scalable, Fast Analytics with Graph - Why and HowScalable, Fast Analytics with Graph - Why and How
Scalable, Fast Analytics with Graph - Why and How
 
Accelerate Pharma R&D with Cross-Study Analytics
Accelerate Pharma R&D with Cross-Study AnalyticsAccelerate Pharma R&D with Cross-Study Analytics
Accelerate Pharma R&D with Cross-Study Analytics
 
Large Scale Graph Analytics with RDF and LPG Parallel Processing
Large Scale Graph Analytics with RDF and LPG Parallel ProcessingLarge Scale Graph Analytics with RDF and LPG Parallel Processing
Large Scale Graph Analytics with RDF and LPG Parallel Processing
 

Recently uploaded

Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 

Recently uploaded (20)

Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 

Introduction to Anzo Unstructured

  • 1. ©2014 Cambridge Semantics Inc. All rights reserved. Introduction to Anzo Unstructured June 29, 2016 Richard Mallah Director of Unstructured and Advanced Analytics richard@cambridgesemantics.com
  • 2. ©2013 Cambridge Semantics Inc. All rights reserved. Page 2. Agenda • Anzo Unstructured and the Anzo Smart Data Platform • Core Capabilities of Anzo Unstructured • Configuration, Operations, and Output • Example Use Cases in Pharma and Finance • Exploring Document-Derived Analytics • Visualizing Additional Annotators and Capabilities
  • 3. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 3. Introduction to Cambridge Semantics (CSI) The Anzo Smart Data Platform is used to create data analytics and management solutions with diverse data from varied sources Company:  Founded in 2007 by senior team from IBM’s Advanced Internet Technology Group  Privately Funded  Select customers: Software:  Market leading Anzo software suite is built on open Semantic Web standards  Currently 3rd generation of the product in production use
  • 4. ApplicationsMiddlewareEnterprise DataFabric Anzo.js Client Library Anzo Enterprise Server (SOA; OSGI, RDF & OWL over JMS) Anzo.Net Client Library Anzo .java/.Net Client Library Anzo Relational Replicator Reasoning & Rules Workflow Semantic Services Anzo Connect Enterprise Directory Connect Anzo Unstructured Anzo for Excel Applications and BI ToolsAnzo on the Web Anzo Graph Database Anzo Content Repository RDBMS Data Mart/ Warehouse Enterprise Applications Directory (LDAP, AD) • Virtualize data using W3C semantic standards • Operationalize industry standards e.g., FIBO, LEI • Real-time data events • Granular security and access control • Ontology, Mapping, Visualization & Service registries Rich Client Apps ………  Full/Incremental ETL  Web Services  Federated SPARQL  NLP  Text Analytics  Semantic Analysis 3rd Party Databases & Applications External Data Sources Unstructured Content  RDBMS  Teradata  Hadoop  SalesForce The Anzo Smart Data Platform
  • 5. ©2015 Cambridge Semantics Inc. All rights reserved. Anzo Smart Data Lake Anzo Smart Data Lake Server Anzo Enterprise Server • Self-service analytics, visualization and data discovery • Data curation, annotation and application workflow • MPP graph query engine for interactive analytics at scale • ODATA Integration for 3rd party analytics tools • Metadata, ontology and mapping catalog • Model-driven data provisioning and loading • Text analytics • Canonical entity linking and transformation • Scalable Graph and Document Storage Anzo Graph Query Engine Anzo Ingestion Servers Anzo Unstructured
  • 6. ©2014 Cambridge Semantics Inc. All rights reserved. Page 6. Anzo Ontology Editor
  • 7. What Solutions Benefit From Anzo? • For aggregation of data from multiple, diverse data sources • For integration of internal data with external data across the Web or firewalls • For solutions involving data sources, business rules, analytics and actions that are not evident in advance • For solutions that change often • For analyzing diverse data sources with a diverse variety of access control requirements with a need for full provenance and traceability • For evolving solutions benefiting from ongoing involvement from domain experts to update data models, data sources, and analytics as needed • For formal and informal day-to-day business activities that require workflow, alerts, and automation • For collecting & analyzing data that doesn’t currently have any system of record (e.g. “shadow IT” systems)
  • 8. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 8. Anzo Unstructured Capabilities Overview • Intake Sources – Social Media – Local Directories – Enterprise CMSs – Structured Databases – Web Sites & Boards – Spreadsheets – Google Search Appliance – Mail Servers – + dozens more • File Formats – Office Documents – PDFs – Web Pages – Email Messages – + dozens more • Multilingual – European, Asian, and Middle Eastern Languages – Native-Language Annotation – Document Translation – Annotation Translation – Phonetic Name Normalization/Indexing – Cross-Lingual Concepts Automapped • Extraction Categories – Entities – Relationships – Granular Sentiment – Topic Classification – Patterns and Concepts – and more • Concept Types Extracted – MedicalHistoryAilment – LegalStatuteSection – BiomarkerForDisease – AnalystEarningsEstimate – JobTitle – SentimentTopic – + thousands more – + easily user-extended/customized • Semantic Analysis – Concept-Based Relationships – Relationship Compounding – Annotation Harmonization – Multi-NLP Weighting/Voting – Ontology Growing – Ontology Alignment • Semantic Search – Concept-Based Full-Text Search – Facet On Concept or Type – Mix Structured & Unstructured Filters – Visualize Annotations In Context – External Index Federation – Multi-Stage Searching/Filtering/Clustering • Structured/Unstructured Integration – Find/link structured resources in text – Analyze text within structured columns – Populate new structured resources from text – Auto-enrich entities found in unstructured – Auto-extend schemas from unstructured properties
  • 9. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 9. Anzo Unstructured NLP Plugins Overview • Anzo Unstructured is both a pluggable framework supporting a large number of ready-made third-party NLP integrations, and also has significant NLP capabilities bundled along with it – Plugins on the following pages are a small number of our many supported NLP capabilities from a variety of sources • Among the annotators include out of the box are: – Autotagger and Classifier Annotator (Statistical, can fall back to rule-based) – Autotagger and Classifier Annotator (Rule-Based, can fall back to statistical) – Standard Entity Extractors (People, companies, locations, job titles, dates, etc.) – Custom Knowledgebase Annotator (Lever your taxonomies, thesauri, databases) – Fuzzy Rule Network Annotator (Find concepts by related, surrounding, contextual concepts) – Significant Phrase Annotator (Automatically extracts the important concepts) – Document Section Annotator (Autogenerate table of contents and contextualize more) – Pattern Annotators (Find part no., id no., statute section, or any custom pattern) – Custom Relationship Annotator (Find events or relationships spanning different extractions)
  • 10. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 10. Optional NLP Plugin Technology Partners
  • 11. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 11. Semantic Post-Processing of NLP • Harmonization – Normalized formats for knowledge integration • Cooperation – Multiple annotators strengthen, correct, and increase the network effect of relationships • Probabilistic Reasoning – Semantic knowledge integration includes both deduction and inference • Filtering – The set of concepts, overlaps, affects, and relationships can be automatically filtered down to reduce noise • Enrichment – Web services, semantic services, internal and external databases and knowledgebases, and pluggable computations can be used to add more context and data to your new domain object • Machine Learning and Predictive Analytics – Train on some gold standard and do some supervised classification – Incrementally build a conceptual cluster space for predictive analytics
  • 12. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 12. Point and Click Configuration of Unstructured Pipelines
  • 13. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 13. Point and Click Configuration of Annotation
  • 14. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 14. Unstructured Pipeline Operations Monitor
  • 15. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 15. Dashboarding Structured/Unstructured Knowledge Integration Structured property Multiple NLP Technologies Harmonized Overlapping annotations Enriched property Unstructured entity Unstructured relationship Archived copy for review, validation & provenance (both HTML Format & Original )
  • 16. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 16. The CSI Semantic Knowledge Integration Approach to Enterprise Text Analytics • Use Multiple NLP Engines or Annotators • Leverage a Knowledge Integration Platform – Make the annotators cooperate – Enrich the annotations with internal or external data – Link annotations with existing structured data – Filter them down to the most relevant set – Harmonize ontologies and instances – Deal with probabilistic or uncertain information • Quality Control – Manual curation and automated QC – Workflow, provenance lineage • Easily Deal with Data Changes and Schema Changes – Both are dealt with in real-time at runtime – Maintenance is orders of magnitude more efficient
  • 17. Use Cases in Pharma • PV & Safety Data Management - Automatic tagging of case reports with customized curation workflow, text mining, and contextual search • R&D Competitive Intelligence – Explore the competitive landscape for Therapeutic Area, Indication, Target, Company, Compound, & Partners • R&D Informatics– Understand and correlate your internal research and how it may be related to any external developments or research • Clinical Trial Site Selection and Optimization - Site selection, KOL search, trial planning • Scientific Affairs/Medical Science Liaisons - Track Key Opinion Leaders (KOL) in literature and clinical trials & analyze feedback from medical professionals and patients • Information Landscape - Track and monitor data stewardship and usage through the organization to drive more efficient usage. • Commercial Analytics – Sales and Marketing, Rx Data, Text Analytics
  • 18. Use Cases in Financial Services • Compliance Policy & Procedure Management - Monitor structured and unstructured data sources for relevant regulatory changes; have collaborative workflows for policy & documentation development, approval, and control; and establish targeted policy dissemination and attestation workflows. • Compliance Surveillance & Investigation– Legal and Compliance analysts can create structures and views that provide analysis, rules, and alert thresholds easily changed on-the-fly by investigators, who can then comprehend and interact with the big data picture. • Market and Customer Intelligence- Understand how clients and prospects are thinking about your firm and competitors’ offerings • Research - Automated analytics of news, chatter, IMs, secondary research reports, emails, sentiment, etc. for research alerts, semantic search, and relationship visualization, forming an integrated intelligence platform for analysts, including Complex Event Processing. • Information Landscape - Track and monitor data stewardship and usage through the organization to drive more efficient usage. • Commercial Analytics – Sales and Marketing, Tx Data, Text Analytics
  • 19. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 19. Point and Click Configuration of Annotation
  • 20. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 20. Point and Click Configuration of Annotation
  • 21. ©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 21 Relationship Explorer Find Unexpected Connections Between Companies | Follow Paths Out or In From Anything | Follow the Money
  • 22. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 22. Incremental Semantic Overlays: Product, Brand, Offering
  • 23. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 23. Semantic Correspondence Linking and Overlay
  • 24. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 24. Asking Cross-Ontology Questions
  • 25. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 25. Cross-Ontology Questions Meet The Network Effect
  • 26. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 26. Multi-Ontology Knowledge Graph Exploration
  • 27. ©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 27 Deep News View Customizable Fundamental, Technical, and Thematic Filters | View Only Most Recent n Minutes | Semantic Search
  • 28. ©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 28 Rapid Concept Drilldown GPS for Concepts | Assisted Skimming | Interactive Annotation-Driven Navigation | Auto-translates Foreign Languages
  • 29. ©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 29 Example: Customizable Stock Centric Surveillance Dashboards Per Stock | Per Cohort | Per Industry | Per Custom Sector | Analyst Can Define Filters and Drilldowns
  • 30. ©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 30 Example: Competitor Sentiment Comparison Longitudinal | Sentiment Aggregation | By Cohort | From Single Stock Selection | Visualize Leaders and Followers
  • 31. ©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 31 Example: Intraday Sentiment Drill Down | Intraday Topic-Granular Sentiment | Attribute Price Action Drivers | Investigate Unusual Volumes
  • 32. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 32. Longitudinal and Outlier Business Intelligence Unstructured Data Becomes Structured
  • 33. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 33. Anzo Unstructured NLP Plugins CSI Web Scraper Annotator
  • 34. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 34. Contextual Semantic Overlay
  • 35. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 35. I1 I2 I3 E1 E2 I4 I1 I2 I3 E1 E2 I4 I1 I2 I3 E1 E2 I4 Main Pipeline Purple Helper Pipeline Green Helper Pipeline
  • 36. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 36. Fuzzy Concept Matching Example: Skills Understanding and Recognition in Semantic Search
  • 37. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 37. Fuzzy Concept Matching Example: Skills Concept Curation • Use Excel to define each skill concept with any combination of methods • Multiple values are comma-separated • Patterns support wildcards, y within n words of x, and intuitive groupings • Define more atomic concepts before more compound concepts
  • 38. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 38. Anzo Unstructured NLP Plugins CSI Document Classifier
  • 39. ©2013 Cambridge Semantics Inc. All rights reserved. Company Confidential Page 39 Indirect Filters on Domain-Specific Summaries Auto-Summarization | Extensive Filters | Integration with Multiple Sources of News and Research | Assisted Reader
  • 40. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 40. Cross-Lingual Annotation and Optional Translation
  • 41. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 41. Multiple Languages, One Concept
  • 42. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 42. In Situ Translation and Annotation
  • 43. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 43. Automated Redaction
  • 44. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 44. Anzo Unstructured NLP Plugins CSI Significant Phrase Annotator
  • 45. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 45. Anzo Unstructured NLP Plugins CSI Custom Relationship Annotator
  • 46. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 46. Anzo Unstructured NLP Plugins Linguamatics I2E Annotator, Biomarkers for Diseases
  • 47. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 47. Anzo Unstructured NLP Plugins SciBite Termite Annotator
  • 48. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 48. Anzo Unstructured NLP Plugins Lexalytics Salience
  • 49. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 49. Simplified Views for Non-Technical Users Semantic Search Made Easy
  • 50. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 50. Anzo Unstructured Capabilities APIs and SDK Create new pipeline components for any of these tiers: – Document Crawler / Listener • Obtain documents of any format from any source – Document Rich Text, Thumbnail, and Metadata Extraction • Deal with custom or less-common file formats completely pluggably – Document Format Cleansing and Transformation • Remove unwanted artifacts specific to your documents or translate to a particular format or language – Full-Text Indexing • Pluggable corpus-level indexing and search – Annotator • Already supports GATE, UIMA, and FrAU annotation frameworks • Provides access to annotations from any other annotator, cleansed text, format- analyzed document, and original file, supporting mixed-representation annotation • Multithreading safe – Semantic Postprocessor • Recombine, filter, and restructure annotations
  • 51. ©2015 Cambridge Semantics Inc. All rights reserved. Company Confidential. Page 51. Click here to view the full webinar

Editor's Notes

  1. Upper ontology vs lower ontology Operational ontology
  2. Main message: If your business faces challenges that show the characteristics on this slide, then you’d derive value from Anzo. Additional details: (for the first point): often these solutions combine “traditional” data sources (e.g. databases) with “non-traditional” data sources (e.g. spreadsheets or documents) (for the second point): often these solutions involve some degree of back-and-forth collaboration between people inside a company and partners, suppliers, customers, etc. (for the fourth point): Examples: the solution needs ad-hoc data, the solution needs rapidly evolving KPIs, the business is rapidly evolving due to mergers/acquisitions, etc.
  3. At this point, can apply semantic services for reasoning, probabilistic reasoning, predictive analytics
  4. Main message: If your business faces challenges that show the characteristics on this slide, then you’d derive value from Anzo. Additional details: (for the first point): often these solutions combine “traditional” data sources (e.g. databases) with “non-traditional” data sources (e.g. spreadsheets or documents) (for the second point): often these solutions involve some degree of back-and-forth collaboration between people inside a company and partners, suppliers, customers, etc. (for the fourth point): Examples: the solution needs ad-hoc data, the solution needs rapidly evolving KPIs, the business is rapidly evolving due to mergers/acquisitions, etc.
  5. Main message: If your business faces challenges that show the characteristics on this slide, then you’d derive value from Anzo. Additional details: (for the first point): often these solutions combine “traditional” data sources (e.g. databases) with “non-traditional” data sources (e.g. spreadsheets or documents) (for the second point): often these solutions involve some degree of back-and-forth collaboration between people inside a company and partners, suppliers, customers, etc. (for the fourth point): Examples: the solution needs ad-hoc data, the solution needs rapidly evolving KPIs, the business is rapidly evolving due to mergers/acquisitions, etc.
  6. Batch vs. incremental - often not explicit that is different ontology - especially when using same symbols - seldom presented a full other ontology in practice - updating bayesian priors from convincing stories Suppose that instead of both annotators calling Corona a Product, one of them called it a Brand instead. Would we still be able to do this semantic correspondence linking and semantic overlay? Yes, with something called Rough Semantic Overlay.
  7. Note that, because we chose to include a semantic correspondence linker in the pipeline, correspondences were found between Lexalytics and Calais regarding Company and regarding Product. There was another correspondence picked up between Stocks found by our knowledgebase annotator and the company extractions as well, which is what will let us tie this all automatically to our structured data. Semantic Overlay – Four Types: Direct Semantic Overlay; Meaning correspondence via shared class name; Rough Semantic Overlay; Meaning correspondence via compatible class names; Bridge Semantic Overlay Bridging structured and unstructured derived classes; Contextual Correspondence; Leverages the contains-in-some-way relationship Growing or evolving a semantic network by connecting, merging, or overlaying corresponding nodes of meaning from different sources A given resource of class A can be treated the same for some purposes as some other resource of class B A contextualization where the entities correspond in some way other than equivalence Generic subsumption – a ‘contains’ or ‘part-of’ relationship; Type-specific relationships – e.g. third-party modifiers on sentiment
  8. So given all this, we can start to ask some interesting questions. This path in the ontology here represents the question: What stocks from our structured dataset were found mentioned as acquirers in all our news and research reports, and what were the corresponding stocks that were the acquirees, also expressed in our original structured data. An analyst or compliance officer would be able to follow these paths when building a dashboard to ask such a question, of course without the need for any coding.
  9. Another question they can ask is: What stocks/companies from my original structured dataset have a Corporate Interest in Products/brands from a different company? This is even though the CompanyProduct relationship came from Calais and the Corporate Interest relationship, a custom relationship, was based on Lexalytics entities, we are still able to follow the path, ask the question, build the dashboard, because of the semantic overlay.
  10. Which leads us to notice that there was a trade, a buy, on STZ, Constellation Brands. This would not in and of itself be an issue, since only deal team members have knowledge of the deal.
  11. Alignment of contexts Often useful approach in data fusion and knowledge blending