Lucidworks + Thomson Reuters for
Improved Investment Performance
Agenda	
  
•  Introductions
•  Thomson Reuters Intelligent Tagger (TRIT)
•  Fusion
•  Demo of TRIT
•  Key use cases in Finance
•  How Fusion and TRIT work together
•  Demo
•  Q&A
Thomson	
  Reuters	
  Intelligent	
  Tagging	
  Demo	
  
PERMID.ORG
3
Tagging	
  –	
  maximize	
  insights	
  from	
  research	
  reports	
  and	
  
unstructured	
  data	
  	
  
Unstructured content (including Thomson
Reuters)
Connecting entities and PermIDs Calculating relevance scores Enhanced search
PERMID
The Open Identifier Strategy
5
PermID	
  is	
  unique	
  in	
  the	
  market	
  place,	
  due	
  to	
  its	
  breadth	
  &	
  depth,	
  and	
  openness	
  
PermID	
  
Based in San Francisco
Offices in Bangalore, Bangkok,
New York City, Raleigh, Munich
Over 300 customers across the
Fortune 1000
Fusion, a Solr-powered platform
for search-driven apps
Produces the world’s largest open
source user conference dedicated to
Lucene/Solr
Lucidworks is the primary sponsor of
the Apache Solr project
Employs over 40% of the active
committers on the Solr project
Contributes over 70% of Solr's
open source codebase
40%
70%
User expectations changed.
1
2
Search, Discovery and
Analytics are three sides of
the same coin
Understanding users is too
slow and too complicated
3
Next-Gen Results.
No Degree Required.
•  Cognitive: Signals aggregation
learn and automatically tune
relevancy and drive
recommendations out of the box.
•  Rules when you need them:
Point-and-click query pipeline
configuration and rules module
allow fine-grained control of
results.
•  Science, not guessing:
Experiment management and
bandits do the heavy lifting of
systematically testing new ideas.
Simplified App Dev
•  Over 50 connectors and a
robust parsing framework to
seamlessly ingest all your data
•  Powerful pipeline stages:
Customize fields, stages,
synonyms, boosts, facets, and
dozens of other powerful search
stages.
•  Point and click Indexing
configuration and iterative
simulation of results for full
control over your ETL process
•  Your security model enforced
end-to-end from ingest to
search across your different
datasources
Trusted Technologies
SQL
•  Built on the most widely deployed
search engine on the planet
•  Enhanced with scalable machine
learning and analytics
•  Well understood and documented APIs
and client support designed to work
with industry standard tools like
Tableau
•  Enriched with deep expertise and
partners and a large community
Trusted Technologies
SQL
Intelligent	
  Tagging	
  
Dem
o
Financial Use Cases
Use	
  Cases	
  
•  Investment Management Research Search
•  Co-mingled (internal and external) Research content search
•  Wealth Management Portal Search
•  Co-mingled (News, Research, SocialMedia) content search
•  Single unified user-experience search
•  Effective smart/intelligent semantic search
•  Related content
Top Tier Investment Bank
•  Large Scale Analytics for
Customer 360
•  Full Text Search, SQL 2003
compliant
•  Advanced machine learning
and modeling support
•  Fusion is 20X faster than
Teradata for analysis + has
search!
Fusion + TRIT Tech Details
Fusion Architecture
SECURITY BUILT-IN
Shards Shards
Apache Solr
Apache Zookeeper
ZK 1
Leader Election Load Balancing
ZK N
Shared Config
Management
Worker Worker
Apache Spark
Cluster
Manager
RESTAPI
Admin UI
Lucidworks
View/Twigkit
LOGS FILE WEB DATABASE CLOUD
HDFS(Optional)
Core Services
• • •
ETL and Query Pipelines
Recommenders/Signals/Rules
NLP
Machine Learning
Alerting and Messaging
Security
Scheduling
Connectors
Thomson	
  Reuters	
  Intelligent	
  Tagging	
  Deployed	
  
1000s’	
  Content	
  Analysts	
  
Linked	
  Data	
  Storage	
  
AuthoriCes	
  and	
  Data	
  Sources	
  
Intelligent	
  Tagging	
  is	
  updated	
  every	
  15	
  minutes.	
  Hundreds	
  of	
  MBs	
  are	
  downloaded	
  daily.	
  	
  
TR	
  Analysts	
  update	
  corporate	
  acCons	
  for	
  T1	
  companies	
  within	
  7	
  minutes.	
  	
  
	
  
Documents	
  are	
  sent	
  for	
  tagging.	
  No	
  informaCon	
  is	
  sent	
  to	
  TR.	
  
Customer	
  
Meta	
  Data	
  
Versions	
  Server	
  
Intelligent
Tagging
New	
  versions	
  are	
  downloaded	
  and	
  installed	
  automaCcally	
  (future	
  support)	
  
Store	
  Meta	
  Data	
  	
  
and	
  Allow	
  Search	
  	
  
Meta	
  Data	
  
Thomson	
  Reuters	
  Co-­‐mingled	
  Intelligent	
  Search	
  
Search	
  Engine	
  pulls	
  data	
  from	
  customer	
  database	
  and	
  from	
  Thomson	
  Reuters	
  content	
  
Search	
  Engine	
  tag	
  content	
  with	
  Intelligent	
  Tagging	
  
CUSTOMER
1000s’	
  Content	
  
Analysts	
  
Linked	
  Data	
  Storage	
  
AuthoriCes	
  and	
  Data	
  Sources	
  
Versions	
  Server	
  
TRKD	
  
Intelligent
Tagging
Search	
  index	
  enhanced	
  with	
  metadata	
  for	
  intelligent	
  search	
  
Co-mingled
search
Lucidworks	
  Fusion	
  
Intelligent	
  Search	
  
TR	
  Content	
  
Demo
Q & A
Resources	
  
•  Lucidworks Fusion: http://lucidworks.com/fusion
•  Thomson Reuters Intelligent Tagging:
http://financial.thomsonreuters.com/tagit
•  Twigkit UI: http://twigkit.com
•  Webinar recording will be available on http://lucidworks.com

Webinar: Lucidworks + Thomson Reuters for Improved Investment Performance

  • 1.
    Lucidworks + ThomsonReuters for Improved Investment Performance
  • 2.
    Agenda   •  Introductions • Thomson Reuters Intelligent Tagger (TRIT) •  Fusion •  Demo of TRIT •  Key use cases in Finance •  How Fusion and TRIT work together •  Demo •  Q&A
  • 3.
    Thomson  Reuters  Intelligent  Tagging  Demo   PERMID.ORG 3
  • 4.
    Tagging  –  maximize  insights  from  research  reports  and   unstructured  data     Unstructured content (including Thomson Reuters) Connecting entities and PermIDs Calculating relevance scores Enhanced search
  • 5.
    PERMID The Open IdentifierStrategy 5 PermID  is  unique  in  the  market  place,  due  to  its  breadth  &  depth,  and  openness  
  • 6.
  • 7.
    Based in SanFrancisco Offices in Bangalore, Bangkok, New York City, Raleigh, Munich Over 300 customers across the Fortune 1000 Fusion, a Solr-powered platform for search-driven apps Produces the world’s largest open source user conference dedicated to Lucene/Solr Lucidworks is the primary sponsor of the Apache Solr project Employs over 40% of the active committers on the Solr project Contributes over 70% of Solr's open source codebase 40% 70%
  • 8.
  • 9.
    2 Search, Discovery and Analyticsare three sides of the same coin
  • 10.
    Understanding users istoo slow and too complicated 3
  • 12.
  • 13.
    •  Cognitive: Signalsaggregation learn and automatically tune relevancy and drive recommendations out of the box. •  Rules when you need them: Point-and-click query pipeline configuration and rules module allow fine-grained control of results. •  Science, not guessing: Experiment management and bandits do the heavy lifting of systematically testing new ideas.
  • 14.
  • 15.
    •  Over 50connectors and a robust parsing framework to seamlessly ingest all your data •  Powerful pipeline stages: Customize fields, stages, synonyms, boosts, facets, and dozens of other powerful search stages. •  Point and click Indexing configuration and iterative simulation of results for full control over your ETL process •  Your security model enforced end-to-end from ingest to search across your different datasources
  • 16.
  • 17.
    •  Built onthe most widely deployed search engine on the planet •  Enhanced with scalable machine learning and analytics •  Well understood and documented APIs and client support designed to work with industry standard tools like Tableau •  Enriched with deep expertise and partners and a large community Trusted Technologies SQL
  • 18.
  • 19.
  • 20.
    Use  Cases   • Investment Management Research Search •  Co-mingled (internal and external) Research content search •  Wealth Management Portal Search •  Co-mingled (News, Research, SocialMedia) content search •  Single unified user-experience search •  Effective smart/intelligent semantic search •  Related content
  • 22.
    Top Tier InvestmentBank •  Large Scale Analytics for Customer 360 •  Full Text Search, SQL 2003 compliant •  Advanced machine learning and modeling support •  Fusion is 20X faster than Teradata for analysis + has search!
  • 23.
    Fusion + TRITTech Details
  • 24.
    Fusion Architecture SECURITY BUILT-IN ShardsShards Apache Solr Apache Zookeeper ZK 1 Leader Election Load Balancing ZK N Shared Config Management Worker Worker Apache Spark Cluster Manager RESTAPI Admin UI Lucidworks View/Twigkit LOGS FILE WEB DATABASE CLOUD HDFS(Optional) Core Services • • • ETL and Query Pipelines Recommenders/Signals/Rules NLP Machine Learning Alerting and Messaging Security Scheduling Connectors
  • 25.
    Thomson  Reuters  Intelligent  Tagging  Deployed   1000s’  Content  Analysts   Linked  Data  Storage   AuthoriCes  and  Data  Sources   Intelligent  Tagging  is  updated  every  15  minutes.  Hundreds  of  MBs  are  downloaded  daily.     TR  Analysts  update  corporate  acCons  for  T1  companies  within  7  minutes.       Documents  are  sent  for  tagging.  No  informaCon  is  sent  to  TR.   Customer   Meta  Data   Versions  Server   Intelligent Tagging New  versions  are  downloaded  and  installed  automaCcally  (future  support)   Store  Meta  Data     and  Allow  Search    
  • 26.
    Meta  Data   Thomson  Reuters  Co-­‐mingled  Intelligent  Search   Search  Engine  pulls  data  from  customer  database  and  from  Thomson  Reuters  content   Search  Engine  tag  content  with  Intelligent  Tagging   CUSTOMER 1000s’  Content   Analysts   Linked  Data  Storage   AuthoriCes  and  Data  Sources   Versions  Server   TRKD   Intelligent Tagging Search  index  enhanced  with  metadata  for  intelligent  search   Co-mingled search Lucidworks  Fusion   Intelligent  Search   TR  Content  
  • 27.
  • 28.
  • 29.
    Resources   •  LucidworksFusion: http://lucidworks.com/fusion •  Thomson Reuters Intelligent Tagging: http://financial.thomsonreuters.com/tagit •  Twigkit UI: http://twigkit.com •  Webinar recording will be available on http://lucidworks.com