SlideShare a Scribd company logo
1 of 28
Download to read offline
Enhance Your Decisions
with News DataMassively extract value from reliable publications
|
● Computer Scientist + Data Scientist
● Translating business needs to technical
solutions
● Based in Barcelona, and covering EMEA
miguel.ballesteros@dowjones.com
| Who am I?
Customer Solutions Engineer
| Why News?
| News data have proved valuable to...
● Better understand the business
context
● Assess the risk of the whole
economy or even particular entities
● Monitor the changing conditions
around entities
● Anticipating economic trends
● Anticipating the potential
performance of a security
● Deliver high-speed facts that later
are confirmed with reports or
official announcements (aka
“Nowcasting”)
| Quality News
● Follow strict editorial guidelines
● Reduce noise and misinformation
● Use reliable sources
● Some sources focus on Economic
and Financial events
Available behind a paywall !
| Uncertainty boosts news consumption
● Higher need to anticipate trends and risks
● Reliable source to explain behaviours
| Use Cases
● Credit Risk Assessment
● Credit Risk Monitoring
● Economic Research
● Due Diligence
● Compliance
● Securities Sentiment
● Insurance Risk Assessment
● ...
| News Aggregators
| Consolidate Multiple Premium Sources
● Simplify subscription management
● Simplify workflows
● Unify data structure
● Single point of access
● Data enrichment with unified criteria
| API - Data Schema
title
snippet
body
subject_codes
company_codes
region_codes
word_count
an
byline
modification_datetime
,m11,mcroil,c1522,m14,mcat,mnasdq,ncmac,nenac,neqac,nfiac,
,eurcb,eurcb,fed,fed,jyskb,jyskb,nbkden,nbkden,rryce,rryce,
,saarab,usa,asiaz,gulfstz,meastz,namz,wasiaz,
525
...
DJDN000020190920ef9k001ey
ISIN
CUSIP
SEDOL
Market+Ticker
| Traditional News Consumption
| Human-Driven Content Consumption
● Credit Risk Assessment
● Credit Risk Monitoring
● Economic Research
● Due Diligence
● Compliance
● Economic Risk Assessment
● Securities Sentiment
Aggregator DB Aggregator App
| Human-Driven Content Consumption
Disadvantages
● Not scalable in volume or speed
● Operational decisions require human
intervention
● Hard to combine with proprietary or
third-party data
| API-Based News Consumption
| Machine-Driven Content Consumption
Aggregator
API
Aggregator DB
Custom
Processing
● Risk Assessment / Monitoring
● Due Diligence
● Compliance
● Algorithmic Trading (No online HFT)
● ...
● Insurance Risk Assessment
● Platforms B2B4C (e.g. )
● Portfolio Management
● ESG Scoring (e.g. )
| API - Scenarios
Workflow
Automation
Text-Mining
Machine
Learning
Licensing is
different!
Low-Volume
API
High-Volume
API
Aggregator DB
| Low-Volume API Use Cases
| Low-Volume API Use Cases
● Corporate Credit Risk Assessment
● Corporate Credit Risk Monitoring
● Due Diligence
● Compliance
● CRM Integration
● Insurance Risk Assessment
● Platforms B2B4C (e.g. )
Workflow
Automation
Low-Volume
API
Aggregator DB
| Low-Volume API - Corp. Credit Risk Assessment
On-Demand
Data
Aggregator
● Simplifies the data collection for research
● Highlight relevant results according to
predefined criteria
● Increase Productivity
Proprietary
Data
Other Data
Providers
Low-Volume
API
Aggregator DB
| Low-Volume API - Corp. Credit Risk Monitoring
● Automatic process checking for specific
conditions
● Require attention only when necessary
Proprietary
Data
Other Data
Providers
Low-Volume
API
Automated
Monitor
Aggregator DB
| Low-Volume API - News Radar Visualisation
# c11="Corporate Strategy/Planning"
# c15="Financial Performance"
# c16="Bankruptcy"
entities = [ 'aapl', 'msft', 'tsla']
subjects = [ 'c11', 'c15', 'c16']
symbology = 'DJTicker'
nro = newsradar.get_ex(entities, subjects, symbology)
# nro contains volumes for different timeframes for
# each entity and subject
Not only article data!
| High-Volume API Use Cases
| High-Volume API Use Cases
NLP
Aggregate
Statistics
Business
Workflow
X-aaS
ML Models
● Algorithmic Trading (No online HFT)
● Portfolio Management
● Economic Research
● Platforms B2B4C
● Asset Management
● ESG Scoring (e.g. )
● Credit Risk Assessment / Monitoring
High-Volume
API
Aggregator DB*
Extracted
News DB
| High-Volume API - Portfolio Management
Alpha Signals
Risk Signals
Quant
Workflow
● Fits well in automated Quant Workflows
● Updated in near real-time
● Allows Machine-Processing
High-Volume
API
Aggregator DB* Extracted
News DB
ML Models + NLP
| High-Volume API - 2 Services
Now
Snapshots Streams
● Train Models
● Understand past events
● Summarise facts and events
● Identify patterns
● Backtesting
Delivered as files (AVRO, CSV, JSON)
● Predict
● Monitor in near real time
● Get notifications
● Robo-advice
● Calculate signals
Delivered as messages
| High-Volume - Long Running Operation Pattern
Snapshot
● Submits a job instead of
receiving an immediate
response
● Require to check for the
job until completed
● Results are collected by
downloading the
generated files
where_statement = "publication_date >= '2020-09-01 00:00:00' AND
language_code IN ('en')"
snp = Snapshot(query=where_statement)
snp.submit_extraction_job()
snp.get_extraction_job_results()
while(True):
if snp.last_extraction_job.job_state == 'JOB_STATE_DONE':
break
...
else:
time.sleep(const.API_JOB_ACTIVE_WAIT_SPACING)
snp.download_extraction_files()
# Files are downloaded to a folder named as the snapshot ID
| Resources
| More information at
This Slide Deck
http://bit.ly/hackzurichcscdata
Q&A Channel
#HackZurichCSCData
Snippets + Samples
https://github.com/dowjones/developer-platform
Developer Portal
https://developer.dowjones.com
Miguel Ballesteros
miguel.ballesteros@dowjones.com

More Related Content

Similar to apidays LIVE LONDON - Enriching Decision with News Data by Miguel Ballesteros

Insurance - Open Source Analytics Dashboards for Real Time Business Overview
Insurance - Open Source Analytics Dashboards for Real Time Business OverviewInsurance - Open Source Analytics Dashboards for Real Time Business Overview
Insurance - Open Source Analytics Dashboards for Real Time Business OverviewEuro IT Group
 
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...Flink Forward
 
IEGROUP Sundip Gorai v3
IEGROUP Sundip Gorai v3IEGROUP Sundip Gorai v3
IEGROUP Sundip Gorai v3Sundip Gorai
 
Why use ad words api
Why use ad words apiWhy use ad words api
Why use ad words apisupergigas
 
MSIT Adaptive Insights Brochure 2018
MSIT Adaptive Insights Brochure 2018MSIT Adaptive Insights Brochure 2018
MSIT Adaptive Insights Brochure 2018Agus Tirtoredjo
 
Predictive Analytics Solutions, Edsson 2019
Predictive Analytics Solutions, Edsson 2019Predictive Analytics Solutions, Edsson 2019
Predictive Analytics Solutions, Edsson 2019Anna Davydova
 
Applying Robotic Process Automation in Banking: Innovations in Finance and Risk
Applying Robotic Process Automation in Banking: Innovations in Finance and RiskApplying Robotic Process Automation in Banking: Innovations in Finance and Risk
Applying Robotic Process Automation in Banking: Innovations in Finance and Riskaccenture
 
Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Sri Ambati
 
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...Neo4j
 
apidays Dubai & Middle East 2023 - Combining APIs and Events, Richard Bosch, ...
apidays Dubai & Middle East 2023 - Combining APIs and Events, Richard Bosch, ...apidays Dubai & Middle East 2023 - Combining APIs and Events, Richard Bosch, ...
apidays Dubai & Middle East 2023 - Combining APIs and Events, Richard Bosch, ...apidays
 
Solvency II Brochure
Solvency II BrochureSolvency II Brochure
Solvency II BrochureEce Alptunaer
 
MongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB
 
Big Data overview
Big Data overviewBig Data overview
Big Data overviewalexisroos
 
Practical AI use cases in Customer Service
Practical AI use cases in Customer ServicePractical AI use cases in Customer Service
Practical AI use cases in Customer ServiceDenys Holovatyi
 
Business Intelligence for Logistics and Freight Forwarders
Business Intelligence for Logistics and Freight ForwardersBusiness Intelligence for Logistics and Freight Forwarders
Business Intelligence for Logistics and Freight ForwardersSmarten Augmented Analytics
 
Business Analytics Training
Business Analytics TrainingBusiness Analytics Training
Business Analytics TrainingNatalija Pavic
 

Similar to apidays LIVE LONDON - Enriching Decision with News Data by Miguel Ballesteros (20)

Insurance - Open Source Analytics Dashboards for Real Time Business Overview
Insurance - Open Source Analytics Dashboards for Real Time Business OverviewInsurance - Open Source Analytics Dashboards for Real Time Business Overview
Insurance - Open Source Analytics Dashboards for Real Time Business Overview
 
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
 
RPA in Finance v2
RPA in Finance v2RPA in Finance v2
RPA in Finance v2
 
IEGROUP Sundip Gorai v3
IEGROUP Sundip Gorai v3IEGROUP Sundip Gorai v3
IEGROUP Sundip Gorai v3
 
Modern Software Architectures - Overview
Modern Software Architectures - Overview Modern Software Architectures - Overview
Modern Software Architectures - Overview
 
Inawisdom IDP
Inawisdom IDPInawisdom IDP
Inawisdom IDP
 
Why use ad words api
Why use ad words apiWhy use ad words api
Why use ad words api
 
MSIT Adaptive Insights Brochure 2018
MSIT Adaptive Insights Brochure 2018MSIT Adaptive Insights Brochure 2018
MSIT Adaptive Insights Brochure 2018
 
Predictive Analytics Solutions, Edsson 2019
Predictive Analytics Solutions, Edsson 2019Predictive Analytics Solutions, Edsson 2019
Predictive Analytics Solutions, Edsson 2019
 
Applying Robotic Process Automation in Banking: Innovations in Finance and Risk
Applying Robotic Process Automation in Banking: Innovations in Finance and RiskApplying Robotic Process Automation in Banking: Innovations in Finance and Risk
Applying Robotic Process Automation in Banking: Innovations in Finance and Risk
 
Machine Learning For Stock Broking
Machine Learning For Stock BrokingMachine Learning For Stock Broking
Machine Learning For Stock Broking
 
Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...
 
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
 
apidays Dubai & Middle East 2023 - Combining APIs and Events, Richard Bosch, ...
apidays Dubai & Middle East 2023 - Combining APIs and Events, Richard Bosch, ...apidays Dubai & Middle East 2023 - Combining APIs and Events, Richard Bosch, ...
apidays Dubai & Middle East 2023 - Combining APIs and Events, Richard Bosch, ...
 
Solvency II Brochure
Solvency II BrochureSolvency II Brochure
Solvency II Brochure
 
MongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB in a Mainframe World
MongoDB in a Mainframe World
 
Big Data overview
Big Data overviewBig Data overview
Big Data overview
 
Practical AI use cases in Customer Service
Practical AI use cases in Customer ServicePractical AI use cases in Customer Service
Practical AI use cases in Customer Service
 
Business Intelligence for Logistics and Freight Forwarders
Business Intelligence for Logistics and Freight ForwardersBusiness Intelligence for Logistics and Freight Forwarders
Business Intelligence for Logistics and Freight Forwarders
 
Business Analytics Training
Business Analytics TrainingBusiness Analytics Training
Business Analytics Training
 

More from apidays

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Apidays New York 2024 - The secrets to Graph success, by Leah Hurwich Adler, ...
Apidays New York 2024 - The secrets to Graph success, by Leah Hurwich Adler, ...Apidays New York 2024 - The secrets to Graph success, by Leah Hurwich Adler, ...
Apidays New York 2024 - The secrets to Graph success, by Leah Hurwich Adler, ...apidays
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Apidays New York 2024 - API Discovery - From Crawl to Run by Rob Dickinson, G...
Apidays New York 2024 - API Discovery - From Crawl to Run by Rob Dickinson, G...Apidays New York 2024 - API Discovery - From Crawl to Run by Rob Dickinson, G...
Apidays New York 2024 - API Discovery - From Crawl to Run by Rob Dickinson, G...apidays
 
Apidays Singapore 2024 - Building with the Planet in Mind by Sandeep Joshi, M...
Apidays Singapore 2024 - Building with the Planet in Mind by Sandeep Joshi, M...Apidays Singapore 2024 - Building with the Planet in Mind by Sandeep Joshi, M...
Apidays Singapore 2024 - Building with the Planet in Mind by Sandeep Joshi, M...apidays
 
Apidays Singapore 2024 - Connecting Cross Border Commerce with Payments by Gu...
Apidays Singapore 2024 - Connecting Cross Border Commerce with Payments by Gu...Apidays Singapore 2024 - Connecting Cross Border Commerce with Payments by Gu...
Apidays Singapore 2024 - Connecting Cross Border Commerce with Payments by Gu...apidays
 
Apidays Singapore 2024 - Privacy Enhancing Technologies for AI by Mark Choo, ...
Apidays Singapore 2024 - Privacy Enhancing Technologies for AI by Mark Choo, ...Apidays Singapore 2024 - Privacy Enhancing Technologies for AI by Mark Choo, ...
Apidays Singapore 2024 - Privacy Enhancing Technologies for AI by Mark Choo, ...apidays
 
Apidays Singapore 2024 - Blending AI and IoT for Smarter Health by Matthew Ch...
Apidays Singapore 2024 - Blending AI and IoT for Smarter Health by Matthew Ch...Apidays Singapore 2024 - Blending AI and IoT for Smarter Health by Matthew Ch...
Apidays Singapore 2024 - Blending AI and IoT for Smarter Health by Matthew Ch...apidays
 
Apidays Singapore 2024 - OpenTelemetry for API Monitoring by Danielle Kayumbi...
Apidays Singapore 2024 - OpenTelemetry for API Monitoring by Danielle Kayumbi...Apidays Singapore 2024 - OpenTelemetry for API Monitoring by Danielle Kayumbi...
Apidays Singapore 2024 - OpenTelemetry for API Monitoring by Danielle Kayumbi...apidays
 
Apidays Singapore 2024 - Connecting Product and Engineering Teams with Testin...
Apidays Singapore 2024 - Connecting Product and Engineering Teams with Testin...Apidays Singapore 2024 - Connecting Product and Engineering Teams with Testin...
Apidays Singapore 2024 - Connecting Product and Engineering Teams with Testin...apidays
 
Apidays Singapore 2024 - The Growing Carbon Footprint of Digitalization and H...
Apidays Singapore 2024 - The Growing Carbon Footprint of Digitalization and H...Apidays Singapore 2024 - The Growing Carbon Footprint of Digitalization and H...
Apidays Singapore 2024 - The Growing Carbon Footprint of Digitalization and H...apidays
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Apidays Singapore 2024 - API Monitoring x SRE by Ryan Ashneil and Eugene Wong...
Apidays Singapore 2024 - API Monitoring x SRE by Ryan Ashneil and Eugene Wong...Apidays Singapore 2024 - API Monitoring x SRE by Ryan Ashneil and Eugene Wong...
Apidays Singapore 2024 - API Monitoring x SRE by Ryan Ashneil and Eugene Wong...apidays
 
Apidays Singapore 2024 - A nuanced approach on AI costs and benefits for the ...
Apidays Singapore 2024 - A nuanced approach on AI costs and benefits for the ...Apidays Singapore 2024 - A nuanced approach on AI costs and benefits for the ...
Apidays Singapore 2024 - A nuanced approach on AI costs and benefits for the ...apidays
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
Apidays Singapore 2024 - How APIs drive business at BNP Paribas by Quy-Doan D...
Apidays Singapore 2024 - How APIs drive business at BNP Paribas by Quy-Doan D...Apidays Singapore 2024 - How APIs drive business at BNP Paribas by Quy-Doan D...
Apidays Singapore 2024 - How APIs drive business at BNP Paribas by Quy-Doan D...apidays
 

More from apidays (20)

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - The secrets to Graph success, by Leah Hurwich Adler, ...
Apidays New York 2024 - The secrets to Graph success, by Leah Hurwich Adler, ...Apidays New York 2024 - The secrets to Graph success, by Leah Hurwich Adler, ...
Apidays New York 2024 - The secrets to Graph success, by Leah Hurwich Adler, ...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - API Discovery - From Crawl to Run by Rob Dickinson, G...
Apidays New York 2024 - API Discovery - From Crawl to Run by Rob Dickinson, G...Apidays New York 2024 - API Discovery - From Crawl to Run by Rob Dickinson, G...
Apidays New York 2024 - API Discovery - From Crawl to Run by Rob Dickinson, G...
 
Apidays Singapore 2024 - Building with the Planet in Mind by Sandeep Joshi, M...
Apidays Singapore 2024 - Building with the Planet in Mind by Sandeep Joshi, M...Apidays Singapore 2024 - Building with the Planet in Mind by Sandeep Joshi, M...
Apidays Singapore 2024 - Building with the Planet in Mind by Sandeep Joshi, M...
 
Apidays Singapore 2024 - Connecting Cross Border Commerce with Payments by Gu...
Apidays Singapore 2024 - Connecting Cross Border Commerce with Payments by Gu...Apidays Singapore 2024 - Connecting Cross Border Commerce with Payments by Gu...
Apidays Singapore 2024 - Connecting Cross Border Commerce with Payments by Gu...
 
Apidays Singapore 2024 - Privacy Enhancing Technologies for AI by Mark Choo, ...
Apidays Singapore 2024 - Privacy Enhancing Technologies for AI by Mark Choo, ...Apidays Singapore 2024 - Privacy Enhancing Technologies for AI by Mark Choo, ...
Apidays Singapore 2024 - Privacy Enhancing Technologies for AI by Mark Choo, ...
 
Apidays Singapore 2024 - Blending AI and IoT for Smarter Health by Matthew Ch...
Apidays Singapore 2024 - Blending AI and IoT for Smarter Health by Matthew Ch...Apidays Singapore 2024 - Blending AI and IoT for Smarter Health by Matthew Ch...
Apidays Singapore 2024 - Blending AI and IoT for Smarter Health by Matthew Ch...
 
Apidays Singapore 2024 - OpenTelemetry for API Monitoring by Danielle Kayumbi...
Apidays Singapore 2024 - OpenTelemetry for API Monitoring by Danielle Kayumbi...Apidays Singapore 2024 - OpenTelemetry for API Monitoring by Danielle Kayumbi...
Apidays Singapore 2024 - OpenTelemetry for API Monitoring by Danielle Kayumbi...
 
Apidays Singapore 2024 - Connecting Product and Engineering Teams with Testin...
Apidays Singapore 2024 - Connecting Product and Engineering Teams with Testin...Apidays Singapore 2024 - Connecting Product and Engineering Teams with Testin...
Apidays Singapore 2024 - Connecting Product and Engineering Teams with Testin...
 
Apidays Singapore 2024 - The Growing Carbon Footprint of Digitalization and H...
Apidays Singapore 2024 - The Growing Carbon Footprint of Digitalization and H...Apidays Singapore 2024 - The Growing Carbon Footprint of Digitalization and H...
Apidays Singapore 2024 - The Growing Carbon Footprint of Digitalization and H...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Apidays Singapore 2024 - API Monitoring x SRE by Ryan Ashneil and Eugene Wong...
Apidays Singapore 2024 - API Monitoring x SRE by Ryan Ashneil and Eugene Wong...Apidays Singapore 2024 - API Monitoring x SRE by Ryan Ashneil and Eugene Wong...
Apidays Singapore 2024 - API Monitoring x SRE by Ryan Ashneil and Eugene Wong...
 
Apidays Singapore 2024 - A nuanced approach on AI costs and benefits for the ...
Apidays Singapore 2024 - A nuanced approach on AI costs and benefits for the ...Apidays Singapore 2024 - A nuanced approach on AI costs and benefits for the ...
Apidays Singapore 2024 - A nuanced approach on AI costs and benefits for the ...
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Apidays Singapore 2024 - How APIs drive business at BNP Paribas by Quy-Doan D...
Apidays Singapore 2024 - How APIs drive business at BNP Paribas by Quy-Doan D...Apidays Singapore 2024 - How APIs drive business at BNP Paribas by Quy-Doan D...
Apidays Singapore 2024 - How APIs drive business at BNP Paribas by Quy-Doan D...
 

Recently uploaded

Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 

Recently uploaded (20)

Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 

apidays LIVE LONDON - Enriching Decision with News Data by Miguel Ballesteros

  • 1. Enhance Your Decisions with News DataMassively extract value from reliable publications
  • 2. | ● Computer Scientist + Data Scientist ● Translating business needs to technical solutions ● Based in Barcelona, and covering EMEA miguel.ballesteros@dowjones.com | Who am I? Customer Solutions Engineer
  • 4. | News data have proved valuable to... ● Better understand the business context ● Assess the risk of the whole economy or even particular entities ● Monitor the changing conditions around entities ● Anticipating economic trends ● Anticipating the potential performance of a security ● Deliver high-speed facts that later are confirmed with reports or official announcements (aka “Nowcasting”)
  • 5. | Quality News ● Follow strict editorial guidelines ● Reduce noise and misinformation ● Use reliable sources ● Some sources focus on Economic and Financial events Available behind a paywall !
  • 6. | Uncertainty boosts news consumption ● Higher need to anticipate trends and risks ● Reliable source to explain behaviours
  • 7. | Use Cases ● Credit Risk Assessment ● Credit Risk Monitoring ● Economic Research ● Due Diligence ● Compliance ● Securities Sentiment ● Insurance Risk Assessment ● ...
  • 9. | Consolidate Multiple Premium Sources ● Simplify subscription management ● Simplify workflows ● Unify data structure ● Single point of access ● Data enrichment with unified criteria
  • 10. | API - Data Schema title snippet body subject_codes company_codes region_codes word_count an byline modification_datetime ,m11,mcroil,c1522,m14,mcat,mnasdq,ncmac,nenac,neqac,nfiac, ,eurcb,eurcb,fed,fed,jyskb,jyskb,nbkden,nbkden,rryce,rryce, ,saarab,usa,asiaz,gulfstz,meastz,namz,wasiaz, 525 ... DJDN000020190920ef9k001ey ISIN CUSIP SEDOL Market+Ticker
  • 11. | Traditional News Consumption
  • 12. | Human-Driven Content Consumption ● Credit Risk Assessment ● Credit Risk Monitoring ● Economic Research ● Due Diligence ● Compliance ● Economic Risk Assessment ● Securities Sentiment Aggregator DB Aggregator App
  • 13. | Human-Driven Content Consumption Disadvantages ● Not scalable in volume or speed ● Operational decisions require human intervention ● Hard to combine with proprietary or third-party data
  • 14. | API-Based News Consumption
  • 15. | Machine-Driven Content Consumption Aggregator API Aggregator DB Custom Processing ● Risk Assessment / Monitoring ● Due Diligence ● Compliance ● Algorithmic Trading (No online HFT) ● ... ● Insurance Risk Assessment ● Platforms B2B4C (e.g. ) ● Portfolio Management ● ESG Scoring (e.g. )
  • 16. | API - Scenarios Workflow Automation Text-Mining Machine Learning Licensing is different! Low-Volume API High-Volume API Aggregator DB
  • 17. | Low-Volume API Use Cases
  • 18. | Low-Volume API Use Cases ● Corporate Credit Risk Assessment ● Corporate Credit Risk Monitoring ● Due Diligence ● Compliance ● CRM Integration ● Insurance Risk Assessment ● Platforms B2B4C (e.g. ) Workflow Automation Low-Volume API Aggregator DB
  • 19. | Low-Volume API - Corp. Credit Risk Assessment On-Demand Data Aggregator ● Simplifies the data collection for research ● Highlight relevant results according to predefined criteria ● Increase Productivity Proprietary Data Other Data Providers Low-Volume API Aggregator DB
  • 20. | Low-Volume API - Corp. Credit Risk Monitoring ● Automatic process checking for specific conditions ● Require attention only when necessary Proprietary Data Other Data Providers Low-Volume API Automated Monitor Aggregator DB
  • 21. | Low-Volume API - News Radar Visualisation # c11="Corporate Strategy/Planning" # c15="Financial Performance" # c16="Bankruptcy" entities = [ 'aapl', 'msft', 'tsla'] subjects = [ 'c11', 'c15', 'c16'] symbology = 'DJTicker' nro = newsradar.get_ex(entities, subjects, symbology) # nro contains volumes for different timeframes for # each entity and subject Not only article data!
  • 22. | High-Volume API Use Cases
  • 23. | High-Volume API Use Cases NLP Aggregate Statistics Business Workflow X-aaS ML Models ● Algorithmic Trading (No online HFT) ● Portfolio Management ● Economic Research ● Platforms B2B4C ● Asset Management ● ESG Scoring (e.g. ) ● Credit Risk Assessment / Monitoring High-Volume API Aggregator DB* Extracted News DB
  • 24. | High-Volume API - Portfolio Management Alpha Signals Risk Signals Quant Workflow ● Fits well in automated Quant Workflows ● Updated in near real-time ● Allows Machine-Processing High-Volume API Aggregator DB* Extracted News DB ML Models + NLP
  • 25. | High-Volume API - 2 Services Now Snapshots Streams ● Train Models ● Understand past events ● Summarise facts and events ● Identify patterns ● Backtesting Delivered as files (AVRO, CSV, JSON) ● Predict ● Monitor in near real time ● Get notifications ● Robo-advice ● Calculate signals Delivered as messages
  • 26. | High-Volume - Long Running Operation Pattern Snapshot ● Submits a job instead of receiving an immediate response ● Require to check for the job until completed ● Results are collected by downloading the generated files where_statement = "publication_date >= '2020-09-01 00:00:00' AND language_code IN ('en')" snp = Snapshot(query=where_statement) snp.submit_extraction_job() snp.get_extraction_job_results() while(True): if snp.last_extraction_job.job_state == 'JOB_STATE_DONE': break ... else: time.sleep(const.API_JOB_ACTIVE_WAIT_SPACING) snp.download_extraction_files() # Files are downloaded to a folder named as the snapshot ID
  • 28. | More information at This Slide Deck http://bit.ly/hackzurichcscdata Q&A Channel #HackZurichCSCData Snippets + Samples https://github.com/dowjones/developer-platform Developer Portal https://developer.dowjones.com Miguel Ballesteros miguel.ballesteros@dowjones.com