SlideShare a Scribd company logo
K E E N A N A L Y T I C S 1
Semantic
SPEAKERS:
Dr. Arthur Keen, Principal
Keen Analytics
Thomas Kelly, Practice Director
Cognizant Technology Solutions, Inc.
K E E N A N A L Y T I C S 2
Operator, get me Klondike 5-397
K E E N A N A L Y T I C S 3
Data Ecosystems are Growing in Complexity
Tens of thousands of
databases
Millions to billions of
data elements
Dozens of markets
Hundreds to thousands of
social media sites
K E E N A N A L Y T I C S 4
Analytics without semantics
is like having a multi-lingual
conversation without
interpreters
Semantics Manages the Complexity of Data Variety
K E E N A N A L Y T I C S 5
Semantic Analytics
Data Science
Domains
Technologies
Analytic Methods
Semantics
• Knowledge
• Expertise
• Abstraction &
Diversity
• Consistency
Semantics
• Data Meaning
• Context
• Relationships
• Vocabulary
K E E N A N A L Y T I C S 6
Semantic Analytics
Emphasis is on
data relationships,
not just the data
Data focus is on data
concepts (abstraction),
not the diversity of
implementation details
Data assumptions are
made explicit in the
semantic model
The semantics guide
the analytics process,
rather than just the
analyst’s knowledge
SPARQL is a key
component, but not
the only tool in the
semantic analytics
toolbox
K E E N A N A L Y T I C S 7
Challenges in Semantic Analytics
1
2
3
Semantic models that
do not abstract data
concepts from their
implementation details
4
Semantic models that
are missing semantics
Semantic data that is
missing a semantic
model
Rich, accurate
provenance is required
to establish confidence
in the analytics results
5
Data cleansing must
meet requirements for
accuracy, consistency,
and fitness for the
purpose of the analytic
task and result
K E E N A N A L Y T I C S 8
Semantics Analytics in the Data to Action Loop
Analyze
Transform
Classify
Correlate
Predict
Interpret
COA’s
Semantics
Which relationships relevant?
What class? What kind of group?
Define New Relationships?
Inference
Tag/Inference
Representation/Provenance
Wisdom
Knowledge
Information
Data
W
K
I
D
W
K
I
D
W
K
I
D
W
K
I
D
Semantics
Intelligence Pyramid
Analytics
Semantic
Analytics
K E E N A N A L Y T I C S 9
Clustering Through Semantic Tagging
Image Credit: historyinthecity.blogspot.com
Semantic Tags
• Tend to be user- or publisher-defined based on preferences,
including terminology and depth of attribution
• May have ambiguities to resolve (synonyms, reuse/overuse,
too specific, language, jargon)
Key Benefits
• Faster search of content
• Greater precision of search results
Semantic Tags are keywords
used to describe a resource
(webpages, documents, business
transactions)
Source-Directed Tags
• Manual selection and entry by the author
• Automated population by the publisher, such as
professional literature or publication websites
• Automatically excerpted from a corpus through
semantic analysis of the content, guided by a
controlled vocabulary
K E E N A N A L Y T I C S 10
Clustering Through Semantic Tagging
Source: Implementing Iterative Algorithms with SPARQL http://ceur-ws.org/Vol-1133/paper-36.pdf
DROP GRAPH <urn:ga/g/xjz[i+1]> ;
CREATE GRAPH <urn:ga/g/xjz[i+1]> ;
INSERT { GRAPH <urn:ga/g/xjz[i+1]>
{?s <urn:ga/p/inCluster> ?clus3 } }
WHERE {
{ SELECT ?s (SAMPLE(?clus) AS ?clus3) WHERE {
{ SELECT ?s (MAX(?clusCt) AS ?maxClusCt) WHERE {
SELECT ?s ?clus (COUNT(?clus) AS ?clusCt)
WHERE { ?s <urn:ga/p/hasLink> ?o .
GRAPH <urn:ga/g/xjz[i] > ?clus }
} GROUP BY ?s ?clus
} GROUP BY ?s }
{ SELECT ?s ?clus (COUNT(?clus) AS ?clusCt)
WHERE { ?s <urn:ga/p/hasLink> ?o .
GRAPH <urn:ga/g/xjz[i]>
{ ?o <urn:ga/p/inCluster ?clus }
} GROUP BY ?s ?clus
} FILTER (?clusCt = ?maxClusCt)
} GROUP BY ?s } }
DROP GRAPH <urn:ga/g/xjz0> ;
CREATE GRAPH <urn:ga/g/xjz0> ;
INSERT { GRAPH <urn:ga/g/xjz0>
{?s <urn:ga/p/inCluster> ?s } }
WHERE {
{ SELECT DISTINCT ?s WHERE {
{ SELECT ?s <urn:ga/p/hasLink> ?o . } }
Assign Each Tag Vertex to a Cluster
For Each Tag Vertex, Populate Cluster Assignments of Neighbors
Peer-Pressure Clustering
Observation
• No use of semantics features, such as
vocabulary and knowledge
management capabilities
Strengths
• Effective over large volumes of data
• Comprehensive use of RDF data
structure features
K E E N A N A L Y T I C S 11
Clustering Through Semantic Tagging
Positive Negative
Ecstatic Pleased Okay Terms used in
Semantic Tags
Common Taxonomy for Semantic Tags
K E E N A N A L Y T I C S 12
Clustering Through Semantic Tagging
Positive Negative
Ecstatic Inspired Charged
Excited
Exceeds
Need
Very
Satisfied
Satisfied
Somewhat
Satisfied
Preferred Terms,
Synonyms, and
Common
Misspellings
Frequently-Used
Generalizations and
Degrees of Specificity
Knowledge-based Taxonomy for Semantic Tags
Estatic
Extatic
Egstatic
K E E N A N A L Y T I C S 13
Clustering through Semantic Tagging
Process
Cluster resources with highest frequency semantic tag pairs
Map the semantic tags to an N-level taxonomy of preferred tags,
based on exact and synonym matches, and desired degree of
specificity
Select a set of triples containing URIs of the resources, as well as the
semantic tags assigned to the resources
K E E N A N A L Y T I C S 14
INSERT
{ ?SemanticTagEdgeURI
rdf:type :SemanticTagEdge ;
:resourceURI ?resource ;
:edgeNode1 ?clusterTagLabel1 ;
:edgeNode2 ?clusterTagLabel2 . }
WHERE {
?SemanticTagURI1
rdf:type :SemanticTag ;
:resourceURI ?resource ;
:clusterTagValue ?clusterTagLabel1 .
?SemanticTagURI2
rdf:type :SemanticTag ;
:resourceURI ?resource ;
:clusterTagValue ?clusterTagLabel2 .
FILTER ( ?clusterTagLabel1 != ?clusterTagLabel2 )
BIND ( URI( CONCAT( str(?resource),
?clusterTagLabel1, ?clusterTagLabel2 ) ) AS
?SemanticTagEdgeURI ) }
Clustering through Semantic Tagging
:Webpage1 :hasTag “10101” .
:Webpage1 :hasTag “1030303B” .
:Webpage2 :hasTag “10201” .
:Webpage2 :hasTag “1030301” .
:Webpage3 :hasTag “1030303B” .
:Webpage3 :hasTag “10201A” .
:Webpage4 :hasTag “10101B” .
:Webpage4 :hasTag “10302A” .
:Webpage5 :hasTag “1030301” .
:Webpage5 :hasTag “10101A” . …
INSERT { ?SemanticTagURI :clusterTagValue
?clusterTagLabel }
WHERE {
?SemanticTagURI rdf:type :SemanticTag ;
:hasTag ?tagLabel .
?Concept rdf:type skos:Concept ;
( skos:prefLabel|skos:altLabel|skos:hiddenLabel )
?tagLabel .
OPTIONAL {
?Concept :degreeOfSpecificity :<SPECIFICITY> ;
skos:prefLabel ?clusterTagLabel . }
OPTIONAL {
?Concept :degreeOfSpecificity ?Specificity .
?Concept skos:broader* ?BroaderConcept .
?BroaderConcept :degreeOfSpecificity
?BroaderSpecificity .
FILTER ( ?BroaderSpecificity = :<SPECIFICITY> )
?BroaderConcept skos:prefLabel
?clusterTagLabel . } }
Insert Sample Data
Find Preferred/Generalized
Tag Value
Generate Tag Pairs
Concept
- Preferred Tag Term
- Synonyms, Misspellings
- Broader/Generalized
Concepts
- Degree of Specificity
Taxonomy
• Highest Frequency Tag Pairs
• Highest Frequency Solitary Tags *
• Triple and Quadruple Tag Sets *
Results
* Not depicted
K E E N A N A L Y T I C S 15
Semantic Analytics in Two Flavors
Semantics on Analytics Analytics on Semantics
Semantic assisted analysis: Money
laundering, fraud detection,
community detection, insider trading…
Understanding Risk (financial
trading & cyber security),
transaction optimization,
vulnerability assessment…
K E E N A N A L Y T I C S 16
Discover Abnormal BehaviorProbability
Degree Centrality
Rare Occurrence
(Frequent
Communication)
Rare Occurrence
(Infrequent
Communication)
Normal
Communication
Levels
K E E N A N A L Y T I C S 17
Identifying and predicting behavior changes
Observe Orient Decide Act
Network
Density
Time
Classify and predict group behavior using communication network density
What kind of organization is this?
What is their objective/intent?
Distributing food? Terrorist attack? Cyber attack?
Merger/Acquisition? Bank robbery?
When are they going to act?
K E E N A N A L Y T I C S 18
Company
Understanding Risk: Systemic Risk Analysis
Transitive risk exposure in a network of trading partners and holding companies
E
F
A
D H
L
J
B
K
C
G
I
M
O
N
Q
relationship
R
P
K E E N A N A L Y T I C S 19
Company
Systemic Risk Analysis:
Transitive risk exposure in a network of trading partners and holding companies
E
F
A
D H
L
J
B
K
C
G
I
M
O
N
Q
controlledBy
tradesWith
R
P
K E E N A N A L Y T I C S 20
Company
Systemic Risk Analysis:
Transitive risk exposure in a network of trading partners and holding companies
E
F
A
D H
L
J
B
K
C
G
I
M
O
N
Q
controlledBy
tradesWith
R
P
Bank
K E E N A N A L Y T I C S 21
Systemic Risk Analysis:
Transitive risk exposure in a network of trading partners and holding companies
E
F
A
D H
L
J
B
K
C
G
I
M
O
N
Q
controlledBy
tradesWith
R
P
Bank
BankHoldingCompany
A bank holding company controls a bank or controls a bank holding company
K E E N A N A L Y T I C S 22
Systemic Risk Analysis:
Transitive risk exposure in a network of trading partners and holding companies
E
F
A
D
H
L
J
B
K
C
G
I
M O
N
Q
controlledBy
tradesWith
R
P
Bank
BankHoldingCompany
risk
K E E N A N A L Y T I C S 23
In SPARQL
PREFIX : <http://pagerank/>
PREFIX rank: <http://www.ontotext.com/owlim/RDFRank#>
SELECT DISTINCT ?node ?rank
WHERE {GRAPH <http://pagerank>{
{?node :to [].}UNION {[] :to ?node}
?node rank:hasRDFRank ?rank .
}}ORDER BY ?node
PREFIX rank: <http://www.ontotext.com/owlim/RDFRank#>
INSERT DATA { rank:epsilon rank:setParam "0.001" . }
K E E N A N A L Y T I C S 24
Questions?
K E E N A N A L Y T I C S 25
Thank you!
K E E N A N A L Y T I C S 26
Speakers
Thomas (Tom) Kelly
Practice Director, Enterprise Information Management, Cognizant
Thomas Kelly is a Director in Cognizant’s Enterprise Information Management
(EIM) Practice and heads its Semantic Technology Center of Excellence. He has 20-
plus years of technology consulting experience in leading data warehousing,
business intelligence and big data projects, focused primarily on the life sciences,
healthcare, and financial services industries. Tom can be reached at
Thomas.Kelly@cognizant.com.
Dr. Arthur Keen
Principal, Keen Analytics
Arthur Keen possesses a deep understanding of graph analytics, predictive
modeling, unstructured data, categorization, text mining, natural language
processing, data mining algorithms, neural networks, and Artificial Intelligence.
He has used his expertise in these areas to provide thought leadership and
develop applications and evaluations in multiple domains including
intelligence/security informatics, business intelligence, cyber security, financial
analysis, corporate governance, retail and energy. Arthur can be reached at
akeen@keenassoc.com

More Related Content

Similar to Semantic Analytics

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
Enterprise Knowledge
 
SKOS as a key element in Enterprise Linked Data Strategies
SKOS as a key element in Enterprise Linked Data StrategiesSKOS as a key element in Enterprise Linked Data Strategies
SKOS as a key element in Enterprise Linked Data Strategies
Semantic Web Company
 
Assessment In Spreadsheets
Assessment In SpreadsheetsAssessment In Spreadsheets
Assessment In Spreadsheets
guest46de76
 
슬라이드 1
슬라이드 1슬라이드 1
슬라이드 1
butest
 
Text Analytics for Non-Experts
Text Analytics for Non-ExpertsText Analytics for Non-Experts
Text Analytics for Non-Experts
Synaptica, LLC
 
Semantic Perspectives for Contemporary Question Answering Systems
Semantic Perspectives for Contemporary Question Answering SystemsSemantic Perspectives for Contemporary Question Answering Systems
Semantic Perspectives for Contemporary Question Answering Systems
Andre Freitas
 
Different Semantic Perspectives for Question Answering Systems
Different Semantic Perspectives for Question Answering SystemsDifferent Semantic Perspectives for Question Answering Systems
Different Semantic Perspectives for Question Answering Systems
Andre Freitas
 
Introduction of semantic technology for SAS programmers
Introduction of semantic technology for SAS programmersIntroduction of semantic technology for SAS programmers
Introduction of semantic technology for SAS programmers
Kevin Lee
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Max Irwin
 
Recommender Systems and Linked Open Data
Recommender Systems and Linked Open DataRecommender Systems and Linked Open Data
Recommender Systems and Linked Open Data
Polytechnic University of Bari
 
The Power of Data
The Power of DataThe Power of Data
The Power of Data
DataWorks Summit
 
Identifying Security Risks Using Auto-Tagging and Text Analytics
Identifying Security Risks Using Auto-Tagging and Text AnalyticsIdentifying Security Risks Using Auto-Tagging and Text Analytics
Identifying Security Risks Using Auto-Tagging and Text Analytics
Enterprise Knowledge
 
Irmac presentation for website
Irmac presentation for websiteIrmac presentation for website
Irmac presentation for website
Frank Barnes
 
Introduction to Application Profiles
Introduction to Application ProfilesIntroduction to Application Profiles
Introduction to Application Profiles
Diane Hillmann
 
SKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategiesSKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategies
Semantic Web Company
 
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge
 
Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...
Amit Sheth
 
Climbing the Ontology Mountain to Achieve a Successful Knowledge Graph
Climbing the Ontology Mountain to Achieve a Successful Knowledge GraphClimbing the Ontology Mountain to Achieve a Successful Knowledge Graph
Climbing the Ontology Mountain to Achieve a Successful Knowledge Graph
Enterprise Knowledge
 
Taxonomy Quality Assessment
Taxonomy Quality AssessmentTaxonomy Quality Assessment
Taxonomy Quality Assessment
Semantic Web Company
 
Data Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and FutureData Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and Future
feiwin
 

Similar to Semantic Analytics (20)

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
SKOS as a key element in Enterprise Linked Data Strategies
SKOS as a key element in Enterprise Linked Data StrategiesSKOS as a key element in Enterprise Linked Data Strategies
SKOS as a key element in Enterprise Linked Data Strategies
 
Assessment In Spreadsheets
Assessment In SpreadsheetsAssessment In Spreadsheets
Assessment In Spreadsheets
 
슬라이드 1
슬라이드 1슬라이드 1
슬라이드 1
 
Text Analytics for Non-Experts
Text Analytics for Non-ExpertsText Analytics for Non-Experts
Text Analytics for Non-Experts
 
Semantic Perspectives for Contemporary Question Answering Systems
Semantic Perspectives for Contemporary Question Answering SystemsSemantic Perspectives for Contemporary Question Answering Systems
Semantic Perspectives for Contemporary Question Answering Systems
 
Different Semantic Perspectives for Question Answering Systems
Different Semantic Perspectives for Question Answering SystemsDifferent Semantic Perspectives for Question Answering Systems
Different Semantic Perspectives for Question Answering Systems
 
Introduction of semantic technology for SAS programmers
Introduction of semantic technology for SAS programmersIntroduction of semantic technology for SAS programmers
Introduction of semantic technology for SAS programmers
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
 
Recommender Systems and Linked Open Data
Recommender Systems and Linked Open DataRecommender Systems and Linked Open Data
Recommender Systems and Linked Open Data
 
The Power of Data
The Power of DataThe Power of Data
The Power of Data
 
Identifying Security Risks Using Auto-Tagging and Text Analytics
Identifying Security Risks Using Auto-Tagging and Text AnalyticsIdentifying Security Risks Using Auto-Tagging and Text Analytics
Identifying Security Risks Using Auto-Tagging and Text Analytics
 
Irmac presentation for website
Irmac presentation for websiteIrmac presentation for website
Irmac presentation for website
 
Introduction to Application Profiles
Introduction to Application ProfilesIntroduction to Application Profiles
Introduction to Application Profiles
 
SKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategiesSKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategies
 
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024
 
Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...
 
Climbing the Ontology Mountain to Achieve a Successful Knowledge Graph
Climbing the Ontology Mountain to Achieve a Successful Knowledge GraphClimbing the Ontology Mountain to Achieve a Successful Knowledge Graph
Climbing the Ontology Mountain to Achieve a Successful Knowledge Graph
 
Taxonomy Quality Assessment
Taxonomy Quality AssessmentTaxonomy Quality Assessment
Taxonomy Quality Assessment
 
Data Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and FutureData Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and Future
 

More from Thomas Kelly, PMP

Semantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeSemantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data Lake
Thomas Kelly, PMP
 
Enterprise Semantic Technology
Enterprise Semantic TechnologyEnterprise Semantic Technology
Enterprise Semantic Technology
Thomas Kelly, PMP
 
Mobile semantic technology
Mobile semantic technologyMobile semantic technology
Mobile semantic technology
Thomas Kelly, PMP
 
The Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThe Emerging Data Lake IT Strategy
The Emerging Data Lake IT Strategy
Thomas Kelly, PMP
 
Rapid data integration and curation
Rapid data integration and curationRapid data integration and curation
Rapid data integration and curation
Thomas Kelly, PMP
 
Transforming Big Data into Big Value
Transforming Big Data into Big ValueTransforming Big Data into Big Value
Transforming Big Data into Big Value
Thomas Kelly, PMP
 
Semantic Technology for the Data Warehousing Practitioner
Semantic Technology for the Data Warehousing PractitionerSemantic Technology for the Data Warehousing Practitioner
Semantic Technology for the Data Warehousing Practitioner
Thomas Kelly, PMP
 
Semantic Technology for Provider-Payer-Pharma Data Collaboration
Semantic Technology for Provider-Payer-Pharma Data CollaborationSemantic Technology for Provider-Payer-Pharma Data Collaboration
Semantic Technology for Provider-Payer-Pharma Data Collaboration
Thomas Kelly, PMP
 

More from Thomas Kelly, PMP (8)

Semantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeSemantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data Lake
 
Enterprise Semantic Technology
Enterprise Semantic TechnologyEnterprise Semantic Technology
Enterprise Semantic Technology
 
Mobile semantic technology
Mobile semantic technologyMobile semantic technology
Mobile semantic technology
 
The Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThe Emerging Data Lake IT Strategy
The Emerging Data Lake IT Strategy
 
Rapid data integration and curation
Rapid data integration and curationRapid data integration and curation
Rapid data integration and curation
 
Transforming Big Data into Big Value
Transforming Big Data into Big ValueTransforming Big Data into Big Value
Transforming Big Data into Big Value
 
Semantic Technology for the Data Warehousing Practitioner
Semantic Technology for the Data Warehousing PractitionerSemantic Technology for the Data Warehousing Practitioner
Semantic Technology for the Data Warehousing Practitioner
 
Semantic Technology for Provider-Payer-Pharma Data Collaboration
Semantic Technology for Provider-Payer-Pharma Data CollaborationSemantic Technology for Provider-Payer-Pharma Data Collaboration
Semantic Technology for Provider-Payer-Pharma Data Collaboration
 

Recently uploaded

Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
jpupo2018
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 

Recently uploaded (20)

Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 
Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 

Semantic Analytics

  • 1. K E E N A N A L Y T I C S 1 Semantic SPEAKERS: Dr. Arthur Keen, Principal Keen Analytics Thomas Kelly, Practice Director Cognizant Technology Solutions, Inc.
  • 2. K E E N A N A L Y T I C S 2 Operator, get me Klondike 5-397
  • 3. K E E N A N A L Y T I C S 3 Data Ecosystems are Growing in Complexity Tens of thousands of databases Millions to billions of data elements Dozens of markets Hundreds to thousands of social media sites
  • 4. K E E N A N A L Y T I C S 4 Analytics without semantics is like having a multi-lingual conversation without interpreters Semantics Manages the Complexity of Data Variety
  • 5. K E E N A N A L Y T I C S 5 Semantic Analytics Data Science Domains Technologies Analytic Methods Semantics • Knowledge • Expertise • Abstraction & Diversity • Consistency Semantics • Data Meaning • Context • Relationships • Vocabulary
  • 6. K E E N A N A L Y T I C S 6 Semantic Analytics Emphasis is on data relationships, not just the data Data focus is on data concepts (abstraction), not the diversity of implementation details Data assumptions are made explicit in the semantic model The semantics guide the analytics process, rather than just the analyst’s knowledge SPARQL is a key component, but not the only tool in the semantic analytics toolbox
  • 7. K E E N A N A L Y T I C S 7 Challenges in Semantic Analytics 1 2 3 Semantic models that do not abstract data concepts from their implementation details 4 Semantic models that are missing semantics Semantic data that is missing a semantic model Rich, accurate provenance is required to establish confidence in the analytics results 5 Data cleansing must meet requirements for accuracy, consistency, and fitness for the purpose of the analytic task and result
  • 8. K E E N A N A L Y T I C S 8 Semantics Analytics in the Data to Action Loop Analyze Transform Classify Correlate Predict Interpret COA’s Semantics Which relationships relevant? What class? What kind of group? Define New Relationships? Inference Tag/Inference Representation/Provenance Wisdom Knowledge Information Data W K I D W K I D W K I D W K I D Semantics Intelligence Pyramid Analytics Semantic Analytics
  • 9. K E E N A N A L Y T I C S 9 Clustering Through Semantic Tagging Image Credit: historyinthecity.blogspot.com Semantic Tags • Tend to be user- or publisher-defined based on preferences, including terminology and depth of attribution • May have ambiguities to resolve (synonyms, reuse/overuse, too specific, language, jargon) Key Benefits • Faster search of content • Greater precision of search results Semantic Tags are keywords used to describe a resource (webpages, documents, business transactions) Source-Directed Tags • Manual selection and entry by the author • Automated population by the publisher, such as professional literature or publication websites • Automatically excerpted from a corpus through semantic analysis of the content, guided by a controlled vocabulary
  • 10. K E E N A N A L Y T I C S 10 Clustering Through Semantic Tagging Source: Implementing Iterative Algorithms with SPARQL http://ceur-ws.org/Vol-1133/paper-36.pdf DROP GRAPH <urn:ga/g/xjz[i+1]> ; CREATE GRAPH <urn:ga/g/xjz[i+1]> ; INSERT { GRAPH <urn:ga/g/xjz[i+1]> {?s <urn:ga/p/inCluster> ?clus3 } } WHERE { { SELECT ?s (SAMPLE(?clus) AS ?clus3) WHERE { { SELECT ?s (MAX(?clusCt) AS ?maxClusCt) WHERE { SELECT ?s ?clus (COUNT(?clus) AS ?clusCt) WHERE { ?s <urn:ga/p/hasLink> ?o . GRAPH <urn:ga/g/xjz[i] > ?clus } } GROUP BY ?s ?clus } GROUP BY ?s } { SELECT ?s ?clus (COUNT(?clus) AS ?clusCt) WHERE { ?s <urn:ga/p/hasLink> ?o . GRAPH <urn:ga/g/xjz[i]> { ?o <urn:ga/p/inCluster ?clus } } GROUP BY ?s ?clus } FILTER (?clusCt = ?maxClusCt) } GROUP BY ?s } } DROP GRAPH <urn:ga/g/xjz0> ; CREATE GRAPH <urn:ga/g/xjz0> ; INSERT { GRAPH <urn:ga/g/xjz0> {?s <urn:ga/p/inCluster> ?s } } WHERE { { SELECT DISTINCT ?s WHERE { { SELECT ?s <urn:ga/p/hasLink> ?o . } } Assign Each Tag Vertex to a Cluster For Each Tag Vertex, Populate Cluster Assignments of Neighbors Peer-Pressure Clustering Observation • No use of semantics features, such as vocabulary and knowledge management capabilities Strengths • Effective over large volumes of data • Comprehensive use of RDF data structure features
  • 11. K E E N A N A L Y T I C S 11 Clustering Through Semantic Tagging Positive Negative Ecstatic Pleased Okay Terms used in Semantic Tags Common Taxonomy for Semantic Tags
  • 12. K E E N A N A L Y T I C S 12 Clustering Through Semantic Tagging Positive Negative Ecstatic Inspired Charged Excited Exceeds Need Very Satisfied Satisfied Somewhat Satisfied Preferred Terms, Synonyms, and Common Misspellings Frequently-Used Generalizations and Degrees of Specificity Knowledge-based Taxonomy for Semantic Tags Estatic Extatic Egstatic
  • 13. K E E N A N A L Y T I C S 13 Clustering through Semantic Tagging Process Cluster resources with highest frequency semantic tag pairs Map the semantic tags to an N-level taxonomy of preferred tags, based on exact and synonym matches, and desired degree of specificity Select a set of triples containing URIs of the resources, as well as the semantic tags assigned to the resources
  • 14. K E E N A N A L Y T I C S 14 INSERT { ?SemanticTagEdgeURI rdf:type :SemanticTagEdge ; :resourceURI ?resource ; :edgeNode1 ?clusterTagLabel1 ; :edgeNode2 ?clusterTagLabel2 . } WHERE { ?SemanticTagURI1 rdf:type :SemanticTag ; :resourceURI ?resource ; :clusterTagValue ?clusterTagLabel1 . ?SemanticTagURI2 rdf:type :SemanticTag ; :resourceURI ?resource ; :clusterTagValue ?clusterTagLabel2 . FILTER ( ?clusterTagLabel1 != ?clusterTagLabel2 ) BIND ( URI( CONCAT( str(?resource), ?clusterTagLabel1, ?clusterTagLabel2 ) ) AS ?SemanticTagEdgeURI ) } Clustering through Semantic Tagging :Webpage1 :hasTag “10101” . :Webpage1 :hasTag “1030303B” . :Webpage2 :hasTag “10201” . :Webpage2 :hasTag “1030301” . :Webpage3 :hasTag “1030303B” . :Webpage3 :hasTag “10201A” . :Webpage4 :hasTag “10101B” . :Webpage4 :hasTag “10302A” . :Webpage5 :hasTag “1030301” . :Webpage5 :hasTag “10101A” . … INSERT { ?SemanticTagURI :clusterTagValue ?clusterTagLabel } WHERE { ?SemanticTagURI rdf:type :SemanticTag ; :hasTag ?tagLabel . ?Concept rdf:type skos:Concept ; ( skos:prefLabel|skos:altLabel|skos:hiddenLabel ) ?tagLabel . OPTIONAL { ?Concept :degreeOfSpecificity :<SPECIFICITY> ; skos:prefLabel ?clusterTagLabel . } OPTIONAL { ?Concept :degreeOfSpecificity ?Specificity . ?Concept skos:broader* ?BroaderConcept . ?BroaderConcept :degreeOfSpecificity ?BroaderSpecificity . FILTER ( ?BroaderSpecificity = :<SPECIFICITY> ) ?BroaderConcept skos:prefLabel ?clusterTagLabel . } } Insert Sample Data Find Preferred/Generalized Tag Value Generate Tag Pairs Concept - Preferred Tag Term - Synonyms, Misspellings - Broader/Generalized Concepts - Degree of Specificity Taxonomy • Highest Frequency Tag Pairs • Highest Frequency Solitary Tags * • Triple and Quadruple Tag Sets * Results * Not depicted
  • 15. K E E N A N A L Y T I C S 15 Semantic Analytics in Two Flavors Semantics on Analytics Analytics on Semantics Semantic assisted analysis: Money laundering, fraud detection, community detection, insider trading… Understanding Risk (financial trading & cyber security), transaction optimization, vulnerability assessment…
  • 16. K E E N A N A L Y T I C S 16 Discover Abnormal BehaviorProbability Degree Centrality Rare Occurrence (Frequent Communication) Rare Occurrence (Infrequent Communication) Normal Communication Levels
  • 17. K E E N A N A L Y T I C S 17 Identifying and predicting behavior changes Observe Orient Decide Act Network Density Time Classify and predict group behavior using communication network density What kind of organization is this? What is their objective/intent? Distributing food? Terrorist attack? Cyber attack? Merger/Acquisition? Bank robbery? When are they going to act?
  • 18. K E E N A N A L Y T I C S 18 Company Understanding Risk: Systemic Risk Analysis Transitive risk exposure in a network of trading partners and holding companies E F A D H L J B K C G I M O N Q relationship R P
  • 19. K E E N A N A L Y T I C S 19 Company Systemic Risk Analysis: Transitive risk exposure in a network of trading partners and holding companies E F A D H L J B K C G I M O N Q controlledBy tradesWith R P
  • 20. K E E N A N A L Y T I C S 20 Company Systemic Risk Analysis: Transitive risk exposure in a network of trading partners and holding companies E F A D H L J B K C G I M O N Q controlledBy tradesWith R P Bank
  • 21. K E E N A N A L Y T I C S 21 Systemic Risk Analysis: Transitive risk exposure in a network of trading partners and holding companies E F A D H L J B K C G I M O N Q controlledBy tradesWith R P Bank BankHoldingCompany A bank holding company controls a bank or controls a bank holding company
  • 22. K E E N A N A L Y T I C S 22 Systemic Risk Analysis: Transitive risk exposure in a network of trading partners and holding companies E F A D H L J B K C G I M O N Q controlledBy tradesWith R P Bank BankHoldingCompany risk
  • 23. K E E N A N A L Y T I C S 23 In SPARQL PREFIX : <http://pagerank/> PREFIX rank: <http://www.ontotext.com/owlim/RDFRank#> SELECT DISTINCT ?node ?rank WHERE {GRAPH <http://pagerank>{ {?node :to [].}UNION {[] :to ?node} ?node rank:hasRDFRank ?rank . }}ORDER BY ?node PREFIX rank: <http://www.ontotext.com/owlim/RDFRank#> INSERT DATA { rank:epsilon rank:setParam "0.001" . }
  • 24. K E E N A N A L Y T I C S 24 Questions?
  • 25. K E E N A N A L Y T I C S 25 Thank you!
  • 26. K E E N A N A L Y T I C S 26 Speakers Thomas (Tom) Kelly Practice Director, Enterprise Information Management, Cognizant Thomas Kelly is a Director in Cognizant’s Enterprise Information Management (EIM) Practice and heads its Semantic Technology Center of Excellence. He has 20- plus years of technology consulting experience in leading data warehousing, business intelligence and big data projects, focused primarily on the life sciences, healthcare, and financial services industries. Tom can be reached at Thomas.Kelly@cognizant.com. Dr. Arthur Keen Principal, Keen Analytics Arthur Keen possesses a deep understanding of graph analytics, predictive modeling, unstructured data, categorization, text mining, natural language processing, data mining algorithms, neural networks, and Artificial Intelligence. He has used his expertise in these areas to provide thought leadership and develop applications and evaluations in multiple domains including intelligence/security informatics, business intelligence, cyber security, financial analysis, corporate governance, retail and energy. Arthur can be reached at akeen@keenassoc.com