SlideShare a Scribd company logo
WHITE PAPER
Sponsored by
Deriving Business Value from
Big Data using Sentiment analysis
INTRODUCTION
‘Big Data’ are two small words that are widely used to describe the massive growth in data of all forms and that
hold; the promise of delivering huge potential business impact. The question is, how?
Today, and increasingly in the future, businesses are surrounded by masses of data and raw information. Some
of this data is very relevant but much of it is not. Further, most of that data is unstructured in the form of email,
documents, images and different types of social media, blogs, and so on. Unstructured data is notoriously difficult
to access and query, it is scattered across many different locations and formats, and it requires some form of
preprocessing before it can be analyzed and used. Yet, it is this unstructured type data that is primarily exploding
in quantity, representing around 80 per cent of the annual growth of data and doubling in quantity every two years.
A few years ago, ‘Big Data’ was just another buzzword; a fad perhaps that would eventually fade. Today though, big
data is increasingly being used to provide deep insight and predictive analysis in to everything from stock market
movements to individual buying behaviors. Those that are able to make use and harness the power of this disrup-
tive force in markets will benefit by being smarter, faster and more efficient, meaning they are more likely to seize
opportunities early and thereby profit. In the financial services industry, this possibility has not been lost on the
banks who along with associated firms, are investing heavily in applying a variety of technologies and approaches
to unlocking the value of ‘big data’.
How might big data be used practically in the commodity trading and risk management world? This white paper
attempts to answer this question and describes a practical application brought to market by DataGenic that uses
sentiment analysis to predict the price of crude oil.
© Commodity Technology Advisory LLC, 2015, All Rights Reserved.
© Commodity Technology Advisory LLC, 2015, All Rights Reserved.
BIG DATA AND ITS POTENTIAL
APPLICATIONS
A couple of years ago, ComTech undertook some research in the commodity trading and risk management arena
around the value of social media as data. The results were not particularly startling in terms of the interest levels
expressed in something so chaotic and voluminous as social media1
. Yet, when it was explained in the context
of a meaningful example, interest levels heightened considerably. Since then, many big data opportunities and
application areas within commodity trading and risk management have been identified and they encapsulate a
whole raft of new possibilities including, for example
/ Trading Analysis
	- Predictive analytics
	- Pre-trade decision support analytics including senti-
ment analysis
/ Market Risk
	- On demand risk management
	- Predictive indicators
	- Exposure simulations
/ Regulation  Compliance
	- Trade surveillance
	- Fraud management
	- Regulation/Compliance audits
Many of these areas are already being actively explored and
productized. For example, Bloomberg announced that it
is enhancing its enterprise compliance platform to provide
next-generation communications surveillance functionality
and analytics to meet increasingly stringent regulatory guide-
lines, prevent market abuse, and deepen visibility into the com-
mercial use of social media. The financial trading dashboard
managed by Thomson Reuters is another example, and it uses
sentiment analysis data to track news on 20,000 stocks and
thousands of commodities. It parses text from multiple sourc-
es, looks for keywords, tone, relevance and freshness to pro-
vide sentiment analysis for traders to act upon.
Indeed, sentiment analysis has developed rapidly as a tech-
nology that applies machine learning and makes a rapid as-
sessment of the sentiments expressed in the various types of
unstructured data available today in the form of social media,
news and blogs. These sources of information can move the
market and are measured quantitatively. Analysts and inves-
tors digest financial news and their perceptions can rapidly
impact the market and move stock and/or commodity prices.
However, making use of masses of unstructured data of vari-
able quality and reliability isn’t an easy undertaking. It requires
a high degree of specialism, usually provided by data science,
and those with the expertise to deploy the right combination of
analytics, machine learning, data mining and statistical skills
as well as experience with algorithms and coding in order to
explain the significance of data in a way that can easily be un-
derstood by others. Part of the problem is in understanding ex-
actly what is meant by the words used by social commentators
and others. This means dealing with synonyms, spelling errors,
use of different languages such as Latin, polysemy (where one
word can actually have many meanings) and the sheer volume
of data, amongst other issues. If these issues can be resolved,
then one is left with the ability to track brand perception and
business opinion trends that might have real business value.
The types of data that can be analysed using this approach
include news feeds from almost any source, as well as so-
cial media content from popular tools such as Twitter. Taking
the data in near real-time, stripping out noise and irrelevant
content and using Natural Language Processing (NLP) and
machine learning in an attempt to extract useful meaning can
result in information that has immediate and actionable value
in the form of, for example, sentiment analysis.
Deriving Business Value from Big Data using Sentiment analysis A ComTechAdvisory Whitepaper
1) Making More of Data Using AI, Commodity Technology Advisory White Paper, 2014
DATAGENIC’S NEWS ANALYTICS SYSTEM
DataGenic has been deeply engaged in the data management and aggregation business in commodity markets
for many years and has a broad array of blue chip clients in the industry. Recently, it has been developing a news
aggregation service to be a part of its GenicIQ product and designed to provide sentiment analysis that can be
used as an input in to trading decision-making and risk management. The objective was to,
/ Automatically process unstructured textual data in near
real-time to deliver both insight and value,
/ Utilise Twitter and a multitude of news resources avail-
able online as inputs,
/ Provide sentiment Analysis that had value to traders and
risk managers in the commodity space.
The problem for DataGenic was very much as described
above. It involved designing the delivery of a message/infor-
mation system in such a way that it facilitated a greater under-
standing of the market. By utilizing Twitter as one source of
raw information, DataGenic data scientists had to figure out
how to process over 4.5 million tweets in a 34 period (equat-
ing to some 150,000 – 400,000 tweets per day) and over 1.7
million news articles over 73 days of activity. The final product
needed the scalability to handle much more data than that in
real live use and at a greater velocity in order to produce sen-
timent scores, volume and indicators that could be exposed in
an readily consumable form both via DataIQ and through an
API.
By defining a process that involved stripping unnecessary and
superfluous data, utilising NLP and machine learning and the
development of scoring mechanisms, DataGenic has been
able to produce quite remarkable results for crude oil price
sentiment in its test case (Figure 1). The product is now live
and its output available to subscribers. The remarkable aspect
of DatGenic’s efforts at mining big data for its intrinsic busi-
ness value in the form of sentiment analysis is in how closely
the Twitter sentiment score appears to predict crude oil prices.
Deriving Business Value from Big Data using Sentiment analysis A ComTechAdvisory Whitepaper
Figure 1 – DataGenic’s Oil Price Sensitivity Analysis Results
Deriving Business Value from Big Data using Sentiment analysis A ComTechAdvisory Whitepaper
© Commodity Technology Advisory LLC, 2015, All Rights Reserved.
GENIC DATAIQ
Genic DataiQ is a Real-time powerful charting and analytical application designed for commodity market analysts,
traders, procurement managers and researchers that is available on premise or in the cloud. The tool offers a host
of tools to get to grips with data and derive business value including;
/ Real-time collaboration allowing users to share and col-
laborate any part of the onscreen analysis or reporting,
/ Publishing of lockable desktops and reports to internal
and external parties,
/ Powerful search using free-form ‘google-like’ searching,
/ Intelligent drawing tools to focus on trends and patterns
in the data,
/ Excel add-in allowing access to all data and derived data
via Excel.
It also supports multi-currency and cross-commodity compar-
isons and multiple data types including time-series, curves,
options, matrices, unstructured data and more.
It is within the context of the GeniciQ tools that DataGenic has
applied machine learning models to provide the ability to anal-
yse thousands of news and social media feeds including Twit-
ter to determine market sentiment across commodities.
GeniciQ is a collaborative set of tools that are designed to al-
low the benefits of ‘big data’ in the commodities industry to be
captured. It is a leap forward in terms of technology and the
use of advanced data management, visualization and analysis.
The inclusion of sentiment analysis and usage of social media
potentially brings the commodities trading industry mining of
data capabilities closer to that of other industries such as fi-
nancial services. It is in many respects a tool that can help the
industry answer to the question posed at the beginning of this
paper – how?
ABOUT DATAGENIC
DataGenic is the leading global provider of on-premise and in-cloud Smart Commodity Data Management
software, delivering intelligent analytics, real-time data content and proven business value. The innovative
solutions include a data-agnostic multi-commodity data management platform, visual mapping and manage-
ment of business processes, extensive and extensible data quality management, unlimited forward curves
construction and an intelligent decision framework.
DataGenic operates in Europe, Asia and the Americas.
For more information, please contact DataGenic at:
AMERICAS: +1 281 810 8290
EMEA: +44 203 814 8500
APAC: + 91 802 662 2607
info@datagenicgroup.com
ABOUT
Commodity
Technology
Advisory
LLC
Commodity Technology Advisory is the leading analyst organization covering the ETRM and
CTRM markets. We provide the invaluable insights into the issues and trends affecting the
users and providers of the technologies that are crucial for success in the constantly evolving
global commodities markets.
Patrick Reames and Gary Vasey head our team, whose combined 60-plus years in the energy
and commodities markets, provides depth of understanding of the market and its issues that is
unmatched and unrivaled by any analyst group.
For more information, please visit:
www.comtechadvisory.com
ComTech Advisory also hosts the CTRMCenter, your online portal with news and views about
commodity markets and technology as well as a comprehensive online directory of software
and services providers.
Please visit the CTRMCenter at:
www.ctrmcenter.com
19901 Southwest Freeway
Sugar Land TX 77479
+1 281 207 5412
Prague, Czech Republic
+420 775 718 112
ComTechAdvisory.com
Email: info@comtechadvisory.com

More Related Content

What's hot

Glenmark pharma limited
Glenmark pharma limitedGlenmark pharma limited
Glenmark pharma limited
Pavan Reddy
 
Colgate rural marketing jay
Colgate rural marketing  jayColgate rural marketing  jay
Colgate rural marketing jayJay Parekh
 
HUL
HULHUL
HUL Case Study | TIGI Professionals
HUL Case Study | TIGI Professionals HUL Case Study | TIGI Professionals
HUL Case Study | TIGI Professionals
Tarun Gupta
 
HINDUSTAN UNILEVER LIMITED
HINDUSTAN UNILEVER LIMITEDHINDUSTAN UNILEVER LIMITED
HINDUSTAN UNILEVER LIMITED
Nischal16
 
Presentation on Rural Marketing -Colgate
Presentation on Rural Marketing -ColgatePresentation on Rural Marketing -Colgate
Presentation on Rural Marketing -Colgate
Sagar Sawant
 
Amul
AmulAmul
Projeto Rumo a ISO 9001:2015
Projeto Rumo a ISO 9001:2015Projeto Rumo a ISO 9001:2015
Projeto Rumo a ISO 9001:2015
QUALIENG Consultoria, Auditoria e Treinamento
 
6 - Plano de ação para desenvolvimento do mercado residencial e comercial - C...
6 - Plano de ação para desenvolvimento do mercado residencial e comercial - C...6 - Plano de ação para desenvolvimento do mercado residencial e comercial - C...
6 - Plano de ação para desenvolvimento do mercado residencial e comercial - C...encontroresidencial
 
Hul
HulHul
ISO9001 - Aplicação prática no Setor da Construção Civil
ISO9001 -  Aplicação prática no Setor da Construção Civil               ISO9001 -  Aplicação prática no Setor da Construção Civil
Itc interrobang final
Itc interrobang finalItc interrobang final
Itc interrobang final
Robin Jain
 
Oracle Erp solutions
Oracle Erp solutionsOracle Erp solutions
Oracle Erp solutions
Namit Sahai
 
Pepsodent in rural market
Pepsodent in rural marketPepsodent in rural market
Pepsodent in rural market
Savitashindhe
 
Presentation - Sales & Distribution at ITC
Presentation - Sales & Distribution at ITCPresentation - Sales & Distribution at ITC
Presentation - Sales & Distribution at ITCSharad Srivastava
 
Dabur
DaburDabur
Airtel hr policies
Airtel hr policiesAirtel hr policies
Airtel hr policies
SANAL C.WILSON
 
Amul Advertisement STP Analysis
Amul Advertisement STP AnalysisAmul Advertisement STP Analysis
Amul Advertisement STP Analysis
Komal Gupta
 
Presentation on Colgate Toothpaste
Presentation on Colgate ToothpastePresentation on Colgate Toothpaste
Presentation on Colgate Toothpaste
Vipul Mittal
 

What's hot (20)

Glenmark pharma limited
Glenmark pharma limitedGlenmark pharma limited
Glenmark pharma limited
 
Colgate rural marketing jay
Colgate rural marketing  jayColgate rural marketing  jay
Colgate rural marketing jay
 
HUL
HULHUL
HUL
 
HUL Case Study | TIGI Professionals
HUL Case Study | TIGI Professionals HUL Case Study | TIGI Professionals
HUL Case Study | TIGI Professionals
 
HINDUSTAN UNILEVER LIMITED
HINDUSTAN UNILEVER LIMITEDHINDUSTAN UNILEVER LIMITED
HINDUSTAN UNILEVER LIMITED
 
COLGATE
COLGATECOLGATE
COLGATE
 
Presentation on Rural Marketing -Colgate
Presentation on Rural Marketing -ColgatePresentation on Rural Marketing -Colgate
Presentation on Rural Marketing -Colgate
 
Amul
AmulAmul
Amul
 
Projeto Rumo a ISO 9001:2015
Projeto Rumo a ISO 9001:2015Projeto Rumo a ISO 9001:2015
Projeto Rumo a ISO 9001:2015
 
6 - Plano de ação para desenvolvimento do mercado residencial e comercial - C...
6 - Plano de ação para desenvolvimento do mercado residencial e comercial - C...6 - Plano de ação para desenvolvimento do mercado residencial e comercial - C...
6 - Plano de ação para desenvolvimento do mercado residencial e comercial - C...
 
Hul
HulHul
Hul
 
ISO9001 - Aplicação prática no Setor da Construção Civil
ISO9001 -  Aplicação prática no Setor da Construção Civil               ISO9001 -  Aplicação prática no Setor da Construção Civil
ISO9001 - Aplicação prática no Setor da Construção Civil
 
Itc interrobang final
Itc interrobang finalItc interrobang final
Itc interrobang final
 
Oracle Erp solutions
Oracle Erp solutionsOracle Erp solutions
Oracle Erp solutions
 
Pepsodent in rural market
Pepsodent in rural marketPepsodent in rural market
Pepsodent in rural market
 
Presentation - Sales & Distribution at ITC
Presentation - Sales & Distribution at ITCPresentation - Sales & Distribution at ITC
Presentation - Sales & Distribution at ITC
 
Dabur
DaburDabur
Dabur
 
Airtel hr policies
Airtel hr policiesAirtel hr policies
Airtel hr policies
 
Amul Advertisement STP Analysis
Amul Advertisement STP AnalysisAmul Advertisement STP Analysis
Amul Advertisement STP Analysis
 
Presentation on Colgate Toothpaste
Presentation on Colgate ToothpastePresentation on Colgate Toothpaste
Presentation on Colgate Toothpaste
 

Similar to Deriving Business Value from Big Data using Sentiment analysis

Know The What, Why, and How of Big Data_.pdf
Know The What, Why, and How of Big Data_.pdfKnow The What, Why, and How of Big Data_.pdf
Know The What, Why, and How of Big Data_.pdf
Anil
 
What are Big Data, Data Science, and Data Analytics
 What are Big Data, Data Science, and Data Analytics What are Big Data, Data Science, and Data Analytics
What are Big Data, Data Science, and Data Analytics
Ray Business Technologies
 
Big data unit i
Big data unit iBig data unit i
Big data unit i
Navjot Kaur
 
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
IJSCAI Journal
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
ijscai
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
ijscai
 
the influence of machine language and data science in the emerging world
the influence of machine language and data science in the emerging worldthe influence of machine language and data science in the emerging world
the influence of machine language and data science in the emerging world
ijtsrd
 
Mejorar la toma de decisiones con Big Data
Mejorar la toma de decisiones con Big DataMejorar la toma de decisiones con Big Data
Mejorar la toma de decisiones con Big Data
Miguel Ángel Gómez
 
Unit III.pdf
Unit III.pdfUnit III.pdf
Unit III.pdf
PreethaSuresh2
 
Policy paper need for focussed big data & analytics skillset building throu...
Policy  paper  need for focussed big data & analytics skillset building throu...Policy  paper  need for focussed big data & analytics skillset building throu...
Policy paper need for focussed big data & analytics skillset building throu...
Ritesh Shrivastava
 
new.pptx
new.pptxnew.pptx
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth Enhancement
IRJET Journal
 
Unlocking big data
Unlocking big dataUnlocking big data
Impact of Data Analytics in Changing the Future of Business and Challenges Fa...
Impact of Data Analytics in Changing the Future of Business and Challenges Fa...Impact of Data Analytics in Changing the Future of Business and Challenges Fa...
Impact of Data Analytics in Changing the Future of Business and Challenges Fa...
IJSRP Journal
 
Big Data: Where is the Real Opportunity?
Big Data: Where is the Real Opportunity?Big Data: Where is the Real Opportunity?
Big Data: Where is the Real Opportunity?
Cartesian (formerly CSMG)
 
Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.
saranya270513
 

Similar to Deriving Business Value from Big Data using Sentiment analysis (20)

Know The What, Why, and How of Big Data_.pdf
Know The What, Why, and How of Big Data_.pdfKnow The What, Why, and How of Big Data_.pdf
Know The What, Why, and How of Big Data_.pdf
 
What are Big Data, Data Science, and Data Analytics
 What are Big Data, Data Science, and Data Analytics What are Big Data, Data Science, and Data Analytics
What are Big Data, Data Science, and Data Analytics
 
Big data unit i
Big data unit iBig data unit i
Big data unit i
 
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
the influence of machine language and data science in the emerging world
the influence of machine language and data science in the emerging worldthe influence of machine language and data science in the emerging world
the influence of machine language and data science in the emerging world
 
Mejorar la toma de decisiones con Big Data
Mejorar la toma de decisiones con Big DataMejorar la toma de decisiones con Big Data
Mejorar la toma de decisiones con Big Data
 
Unit III.pdf
Unit III.pdfUnit III.pdf
Unit III.pdf
 
Policy paper need for focussed big data & analytics skillset building throu...
Policy  paper  need for focussed big data & analytics skillset building throu...Policy  paper  need for focussed big data & analytics skillset building throu...
Policy paper need for focussed big data & analytics skillset building throu...
 
new.pptx
new.pptxnew.pptx
new.pptx
 
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth Enhancement
 
Unlocking big data
Unlocking big dataUnlocking big data
Unlocking big data
 
Impact of Data Analytics in Changing the Future of Business and Challenges Fa...
Impact of Data Analytics in Changing the Future of Business and Challenges Fa...Impact of Data Analytics in Changing the Future of Business and Challenges Fa...
Impact of Data Analytics in Changing the Future of Business and Challenges Fa...
 
Big Data: Where is the Real Opportunity?
Big Data: Where is the Real Opportunity?Big Data: Where is the Real Opportunity?
Big Data: Where is the Real Opportunity?
 
Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.
 

More from CTRM Center

CTRM - The Next Generation - ComTechAdvisory Vendor Technical Update
CTRM - The Next Generation - ComTechAdvisory Vendor Technical UpdateCTRM - The Next Generation - ComTechAdvisory Vendor Technical Update
CTRM - The Next Generation - ComTechAdvisory Vendor Technical Update
CTRM Center
 
Managing Supply Chain Complexity and Exposures
Managing Supply Chain Complexity and ExposuresManaging Supply Chain Complexity and Exposures
Managing Supply Chain Complexity and Exposures
CTRM Center
 
Global Sugar - A Complex Market that Requires a Fit for Purpose CTRM Solution
Global Sugar - A Complex Market that Requires a Fit for Purpose CTRM SolutionGlobal Sugar - A Complex Market that Requires a Fit for Purpose CTRM Solution
Global Sugar - A Complex Market that Requires a Fit for Purpose CTRM Solution
CTRM Center
 
Putting Data at the Heart of Energy Trading
Putting Data at the Heart of Energy TradingPutting Data at the Heart of Energy Trading
Putting Data at the Heart of Energy Trading
CTRM Center
 
US Dairy Markets – Digitalizing to address complexity and volatility
US Dairy Markets – Digitalizing to address complexity and volatilityUS Dairy Markets – Digitalizing to address complexity and volatility
US Dairy Markets – Digitalizing to address complexity and volatility
CTRM Center
 
Diversifying Into Renewable Energy: Challenges And Opportunities
Diversifying Into Renewable Energy: Challenges And OpportunitiesDiversifying Into Renewable Energy: Challenges And Opportunities
Diversifying Into Renewable Energy: Challenges And Opportunities
CTRM Center
 
Approaches to Accounting Integration
Approaches to Accounting IntegrationApproaches to Accounting Integration
Approaches to Accounting Integration
CTRM Center
 
How can your ETRM / CTRM solution help with credit
How can your ETRM / CTRM solution help with creditHow can your ETRM / CTRM solution help with credit
How can your ETRM / CTRM solution help with credit
CTRM Center
 
Managing the Worlds Metals
Managing the Worlds MetalsManaging the Worlds Metals
Managing the Worlds Metals
CTRM Center
 
RPS and RECs – Managing an Increasing Regulatory Burden
RPS and RECs – Managing an Increasing Regulatory BurdenRPS and RECs – Managing an Increasing Regulatory Burden
RPS and RECs – Managing an Increasing Regulatory Burden
CTRM Center
 
Global Renewables Transition Requires Dedicated ETRM Capabilities
Global Renewables Transition Requires Dedicated ETRM CapabilitiesGlobal Renewables Transition Requires Dedicated ETRM Capabilities
Global Renewables Transition Requires Dedicated ETRM Capabilities
CTRM Center
 
Global LNG Navigating Risks in a Dynamic Market
Global LNG Navigating Risks in a Dynamic MarketGlobal LNG Navigating Risks in a Dynamic Market
Global LNG Navigating Risks in a Dynamic Market
CTRM Center
 
Disruptive Technologies – A 2021 Update
Disruptive Technologies – A 2021 UpdateDisruptive Technologies – A 2021 Update
Disruptive Technologies – A 2021 Update
CTRM Center
 
What is Modern Risk Management?
What is Modern Risk Management?What is Modern Risk Management?
What is Modern Risk Management?
CTRM Center
 
Instant CTRM in the Cloud
Instant CTRM in the CloudInstant CTRM in the Cloud
Instant CTRM in the Cloud
CTRM Center
 
Reimagining Energy Trading and Risk Management (ETRM) With Advanced Delivery ...
Reimagining Energy Trading and Risk Management (ETRM) With Advanced Delivery ...Reimagining Energy Trading and Risk Management (ETRM) With Advanced Delivery ...
Reimagining Energy Trading and Risk Management (ETRM) With Advanced Delivery ...
CTRM Center
 
Risk and Compliance – Lessons learned and looking beyond the COVID-19 Era
Risk and Compliance – Lessons learned and looking beyond the COVID-19 EraRisk and Compliance – Lessons learned and looking beyond the COVID-19 Era
Risk and Compliance – Lessons learned and looking beyond the COVID-19 Era
CTRM Center
 
2021 Trends in Agricultural and Soft Commodities Trading
2021 Trends in Agricultural and Soft Commodities Trading2021 Trends in Agricultural and Soft Commodities Trading
2021 Trends in Agricultural and Soft Commodities Trading
CTRM Center
 
Achieving Digitalization in a Document Intensive Energy Market
Achieving Digitalization in a Document Intensive Energy MarketAchieving Digitalization in a Document Intensive Energy Market
Achieving Digitalization in a Document Intensive Energy Market
CTRM Center
 
Commodity Management for Metals
Commodity Management for MetalsCommodity Management for Metals
Commodity Management for Metals
CTRM Center
 

More from CTRM Center (20)

CTRM - The Next Generation - ComTechAdvisory Vendor Technical Update
CTRM - The Next Generation - ComTechAdvisory Vendor Technical UpdateCTRM - The Next Generation - ComTechAdvisory Vendor Technical Update
CTRM - The Next Generation - ComTechAdvisory Vendor Technical Update
 
Managing Supply Chain Complexity and Exposures
Managing Supply Chain Complexity and ExposuresManaging Supply Chain Complexity and Exposures
Managing Supply Chain Complexity and Exposures
 
Global Sugar - A Complex Market that Requires a Fit for Purpose CTRM Solution
Global Sugar - A Complex Market that Requires a Fit for Purpose CTRM SolutionGlobal Sugar - A Complex Market that Requires a Fit for Purpose CTRM Solution
Global Sugar - A Complex Market that Requires a Fit for Purpose CTRM Solution
 
Putting Data at the Heart of Energy Trading
Putting Data at the Heart of Energy TradingPutting Data at the Heart of Energy Trading
Putting Data at the Heart of Energy Trading
 
US Dairy Markets – Digitalizing to address complexity and volatility
US Dairy Markets – Digitalizing to address complexity and volatilityUS Dairy Markets – Digitalizing to address complexity and volatility
US Dairy Markets – Digitalizing to address complexity and volatility
 
Diversifying Into Renewable Energy: Challenges And Opportunities
Diversifying Into Renewable Energy: Challenges And OpportunitiesDiversifying Into Renewable Energy: Challenges And Opportunities
Diversifying Into Renewable Energy: Challenges And Opportunities
 
Approaches to Accounting Integration
Approaches to Accounting IntegrationApproaches to Accounting Integration
Approaches to Accounting Integration
 
How can your ETRM / CTRM solution help with credit
How can your ETRM / CTRM solution help with creditHow can your ETRM / CTRM solution help with credit
How can your ETRM / CTRM solution help with credit
 
Managing the Worlds Metals
Managing the Worlds MetalsManaging the Worlds Metals
Managing the Worlds Metals
 
RPS and RECs – Managing an Increasing Regulatory Burden
RPS and RECs – Managing an Increasing Regulatory BurdenRPS and RECs – Managing an Increasing Regulatory Burden
RPS and RECs – Managing an Increasing Regulatory Burden
 
Global Renewables Transition Requires Dedicated ETRM Capabilities
Global Renewables Transition Requires Dedicated ETRM CapabilitiesGlobal Renewables Transition Requires Dedicated ETRM Capabilities
Global Renewables Transition Requires Dedicated ETRM Capabilities
 
Global LNG Navigating Risks in a Dynamic Market
Global LNG Navigating Risks in a Dynamic MarketGlobal LNG Navigating Risks in a Dynamic Market
Global LNG Navigating Risks in a Dynamic Market
 
Disruptive Technologies – A 2021 Update
Disruptive Technologies – A 2021 UpdateDisruptive Technologies – A 2021 Update
Disruptive Technologies – A 2021 Update
 
What is Modern Risk Management?
What is Modern Risk Management?What is Modern Risk Management?
What is Modern Risk Management?
 
Instant CTRM in the Cloud
Instant CTRM in the CloudInstant CTRM in the Cloud
Instant CTRM in the Cloud
 
Reimagining Energy Trading and Risk Management (ETRM) With Advanced Delivery ...
Reimagining Energy Trading and Risk Management (ETRM) With Advanced Delivery ...Reimagining Energy Trading and Risk Management (ETRM) With Advanced Delivery ...
Reimagining Energy Trading and Risk Management (ETRM) With Advanced Delivery ...
 
Risk and Compliance – Lessons learned and looking beyond the COVID-19 Era
Risk and Compliance – Lessons learned and looking beyond the COVID-19 EraRisk and Compliance – Lessons learned and looking beyond the COVID-19 Era
Risk and Compliance – Lessons learned and looking beyond the COVID-19 Era
 
2021 Trends in Agricultural and Soft Commodities Trading
2021 Trends in Agricultural and Soft Commodities Trading2021 Trends in Agricultural and Soft Commodities Trading
2021 Trends in Agricultural and Soft Commodities Trading
 
Achieving Digitalization in a Document Intensive Energy Market
Achieving Digitalization in a Document Intensive Energy MarketAchieving Digitalization in a Document Intensive Energy Market
Achieving Digitalization in a Document Intensive Energy Market
 
Commodity Management for Metals
Commodity Management for MetalsCommodity Management for Metals
Commodity Management for Metals
 

Recently uploaded

GOING AOT WITH GRAALVM FOR SPRING BOOT (SPRING IO)
GOING AOT WITH GRAALVM FOR  SPRING BOOT (SPRING IO)GOING AOT WITH GRAALVM FOR  SPRING BOOT (SPRING IO)
GOING AOT WITH GRAALVM FOR SPRING BOOT (SPRING IO)
Alina Yurenko
 
openEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain SecurityopenEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain Security
Shane Coughlan
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
Boni García
 
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdfAutomated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
timtebeek1
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
Paco van Beckhoven
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
NYGGS Automation Suite
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
Octavian Nadolu
 
Graspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code AnalysisGraspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code Analysis
Aftab Hussain
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata
 
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Łukasz Chruściel
 
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Neo4j
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke
 
Transform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR SolutionsTransform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR Solutions
TheSMSPoint
 
A Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of PassageA Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of Passage
Philip Schwarz
 
E-commerce Application Development Company.pdf
E-commerce Application Development Company.pdfE-commerce Application Development Company.pdf
E-commerce Application Development Company.pdf
Hornet Dynamics
 
Using Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional SafetyUsing Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional Safety
Ayan Halder
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
Donna Lenk
 
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
Google
 
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket ManagementUtilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate
 
Launch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in MinutesLaunch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in Minutes
Roshan Dwivedi
 

Recently uploaded (20)

GOING AOT WITH GRAALVM FOR SPRING BOOT (SPRING IO)
GOING AOT WITH GRAALVM FOR  SPRING BOOT (SPRING IO)GOING AOT WITH GRAALVM FOR  SPRING BOOT (SPRING IO)
GOING AOT WITH GRAALVM FOR SPRING BOOT (SPRING IO)
 
openEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain SecurityopenEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain Security
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
 
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdfAutomated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
 
Graspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code AnalysisGraspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code Analysis
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
 
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
 
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
 
Transform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR SolutionsTransform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR Solutions
 
A Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of PassageA Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of Passage
 
E-commerce Application Development Company.pdf
E-commerce Application Development Company.pdfE-commerce Application Development Company.pdf
E-commerce Application Development Company.pdf
 
Using Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional SafetyUsing Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional Safety
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
 
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
 
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket ManagementUtilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
 
Launch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in MinutesLaunch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in Minutes
 

Deriving Business Value from Big Data using Sentiment analysis

  • 1. WHITE PAPER Sponsored by Deriving Business Value from Big Data using Sentiment analysis
  • 2. INTRODUCTION ‘Big Data’ are two small words that are widely used to describe the massive growth in data of all forms and that hold; the promise of delivering huge potential business impact. The question is, how? Today, and increasingly in the future, businesses are surrounded by masses of data and raw information. Some of this data is very relevant but much of it is not. Further, most of that data is unstructured in the form of email, documents, images and different types of social media, blogs, and so on. Unstructured data is notoriously difficult to access and query, it is scattered across many different locations and formats, and it requires some form of preprocessing before it can be analyzed and used. Yet, it is this unstructured type data that is primarily exploding in quantity, representing around 80 per cent of the annual growth of data and doubling in quantity every two years. A few years ago, ‘Big Data’ was just another buzzword; a fad perhaps that would eventually fade. Today though, big data is increasingly being used to provide deep insight and predictive analysis in to everything from stock market movements to individual buying behaviors. Those that are able to make use and harness the power of this disrup- tive force in markets will benefit by being smarter, faster and more efficient, meaning they are more likely to seize opportunities early and thereby profit. In the financial services industry, this possibility has not been lost on the banks who along with associated firms, are investing heavily in applying a variety of technologies and approaches to unlocking the value of ‘big data’. How might big data be used practically in the commodity trading and risk management world? This white paper attempts to answer this question and describes a practical application brought to market by DataGenic that uses sentiment analysis to predict the price of crude oil. © Commodity Technology Advisory LLC, 2015, All Rights Reserved.
  • 3. © Commodity Technology Advisory LLC, 2015, All Rights Reserved. BIG DATA AND ITS POTENTIAL APPLICATIONS A couple of years ago, ComTech undertook some research in the commodity trading and risk management arena around the value of social media as data. The results were not particularly startling in terms of the interest levels expressed in something so chaotic and voluminous as social media1 . Yet, when it was explained in the context of a meaningful example, interest levels heightened considerably. Since then, many big data opportunities and application areas within commodity trading and risk management have been identified and they encapsulate a whole raft of new possibilities including, for example / Trading Analysis - Predictive analytics - Pre-trade decision support analytics including senti- ment analysis / Market Risk - On demand risk management - Predictive indicators - Exposure simulations / Regulation Compliance - Trade surveillance - Fraud management - Regulation/Compliance audits Many of these areas are already being actively explored and productized. For example, Bloomberg announced that it is enhancing its enterprise compliance platform to provide next-generation communications surveillance functionality and analytics to meet increasingly stringent regulatory guide- lines, prevent market abuse, and deepen visibility into the com- mercial use of social media. The financial trading dashboard managed by Thomson Reuters is another example, and it uses sentiment analysis data to track news on 20,000 stocks and thousands of commodities. It parses text from multiple sourc- es, looks for keywords, tone, relevance and freshness to pro- vide sentiment analysis for traders to act upon. Indeed, sentiment analysis has developed rapidly as a tech- nology that applies machine learning and makes a rapid as- sessment of the sentiments expressed in the various types of unstructured data available today in the form of social media, news and blogs. These sources of information can move the market and are measured quantitatively. Analysts and inves- tors digest financial news and their perceptions can rapidly impact the market and move stock and/or commodity prices. However, making use of masses of unstructured data of vari- able quality and reliability isn’t an easy undertaking. It requires a high degree of specialism, usually provided by data science, and those with the expertise to deploy the right combination of analytics, machine learning, data mining and statistical skills as well as experience with algorithms and coding in order to explain the significance of data in a way that can easily be un- derstood by others. Part of the problem is in understanding ex- actly what is meant by the words used by social commentators and others. This means dealing with synonyms, spelling errors, use of different languages such as Latin, polysemy (where one word can actually have many meanings) and the sheer volume of data, amongst other issues. If these issues can be resolved, then one is left with the ability to track brand perception and business opinion trends that might have real business value. The types of data that can be analysed using this approach include news feeds from almost any source, as well as so- cial media content from popular tools such as Twitter. Taking the data in near real-time, stripping out noise and irrelevant content and using Natural Language Processing (NLP) and machine learning in an attempt to extract useful meaning can result in information that has immediate and actionable value in the form of, for example, sentiment analysis. Deriving Business Value from Big Data using Sentiment analysis A ComTechAdvisory Whitepaper 1) Making More of Data Using AI, Commodity Technology Advisory White Paper, 2014
  • 4. DATAGENIC’S NEWS ANALYTICS SYSTEM DataGenic has been deeply engaged in the data management and aggregation business in commodity markets for many years and has a broad array of blue chip clients in the industry. Recently, it has been developing a news aggregation service to be a part of its GenicIQ product and designed to provide sentiment analysis that can be used as an input in to trading decision-making and risk management. The objective was to, / Automatically process unstructured textual data in near real-time to deliver both insight and value, / Utilise Twitter and a multitude of news resources avail- able online as inputs, / Provide sentiment Analysis that had value to traders and risk managers in the commodity space. The problem for DataGenic was very much as described above. It involved designing the delivery of a message/infor- mation system in such a way that it facilitated a greater under- standing of the market. By utilizing Twitter as one source of raw information, DataGenic data scientists had to figure out how to process over 4.5 million tweets in a 34 period (equat- ing to some 150,000 – 400,000 tweets per day) and over 1.7 million news articles over 73 days of activity. The final product needed the scalability to handle much more data than that in real live use and at a greater velocity in order to produce sen- timent scores, volume and indicators that could be exposed in an readily consumable form both via DataIQ and through an API. By defining a process that involved stripping unnecessary and superfluous data, utilising NLP and machine learning and the development of scoring mechanisms, DataGenic has been able to produce quite remarkable results for crude oil price sentiment in its test case (Figure 1). The product is now live and its output available to subscribers. The remarkable aspect of DatGenic’s efforts at mining big data for its intrinsic busi- ness value in the form of sentiment analysis is in how closely the Twitter sentiment score appears to predict crude oil prices. Deriving Business Value from Big Data using Sentiment analysis A ComTechAdvisory Whitepaper Figure 1 – DataGenic’s Oil Price Sensitivity Analysis Results
  • 5. Deriving Business Value from Big Data using Sentiment analysis A ComTechAdvisory Whitepaper © Commodity Technology Advisory LLC, 2015, All Rights Reserved. GENIC DATAIQ Genic DataiQ is a Real-time powerful charting and analytical application designed for commodity market analysts, traders, procurement managers and researchers that is available on premise or in the cloud. The tool offers a host of tools to get to grips with data and derive business value including; / Real-time collaboration allowing users to share and col- laborate any part of the onscreen analysis or reporting, / Publishing of lockable desktops and reports to internal and external parties, / Powerful search using free-form ‘google-like’ searching, / Intelligent drawing tools to focus on trends and patterns in the data, / Excel add-in allowing access to all data and derived data via Excel. It also supports multi-currency and cross-commodity compar- isons and multiple data types including time-series, curves, options, matrices, unstructured data and more. It is within the context of the GeniciQ tools that DataGenic has applied machine learning models to provide the ability to anal- yse thousands of news and social media feeds including Twit- ter to determine market sentiment across commodities. GeniciQ is a collaborative set of tools that are designed to al- low the benefits of ‘big data’ in the commodities industry to be captured. It is a leap forward in terms of technology and the use of advanced data management, visualization and analysis. The inclusion of sentiment analysis and usage of social media potentially brings the commodities trading industry mining of data capabilities closer to that of other industries such as fi- nancial services. It is in many respects a tool that can help the industry answer to the question posed at the beginning of this paper – how?
  • 6. ABOUT DATAGENIC DataGenic is the leading global provider of on-premise and in-cloud Smart Commodity Data Management software, delivering intelligent analytics, real-time data content and proven business value. The innovative solutions include a data-agnostic multi-commodity data management platform, visual mapping and manage- ment of business processes, extensive and extensible data quality management, unlimited forward curves construction and an intelligent decision framework. DataGenic operates in Europe, Asia and the Americas. For more information, please contact DataGenic at: AMERICAS: +1 281 810 8290 EMEA: +44 203 814 8500 APAC: + 91 802 662 2607 info@datagenicgroup.com
  • 7. ABOUT Commodity Technology Advisory LLC Commodity Technology Advisory is the leading analyst organization covering the ETRM and CTRM markets. We provide the invaluable insights into the issues and trends affecting the users and providers of the technologies that are crucial for success in the constantly evolving global commodities markets. Patrick Reames and Gary Vasey head our team, whose combined 60-plus years in the energy and commodities markets, provides depth of understanding of the market and its issues that is unmatched and unrivaled by any analyst group. For more information, please visit: www.comtechadvisory.com ComTech Advisory also hosts the CTRMCenter, your online portal with news and views about commodity markets and technology as well as a comprehensive online directory of software and services providers. Please visit the CTRMCenter at: www.ctrmcenter.com 19901 Southwest Freeway Sugar Land TX 77479 +1 281 207 5412 Prague, Czech Republic +420 775 718 112 ComTechAdvisory.com Email: info@comtechadvisory.com