SlideShare a Scribd company logo
1 of 3
Download to read offline
How SAP HANA Leverages the
Cloud to Glean Business Insights
from Unstructured Data
SAP’s in-memory computing and cloud-based delivery model makes a
strong business case for investing in more innovative ways of analyzing
unstructured data.
Executive Summary
For most organizations, significant volumes of
data are locked in the form of unstructured data
generated by the broad sweep of social and
mobile interactions that pervade our personal
and professional lives. With SAP’s “in-memory”
technology, called HANA,1
made available in
the cloud, organizations can utilize the power of
cloud computing to access tools required to effec-
tively and efficiently analyze unstructured data.
Unstructured data is generated in various forms
during formal and informal operations. Textual
forms of unstructured data include e-mails,
contracts (contained in PDF documents),
medical records and claims and customer reviews
and feedback shared via traditional Web sites
and social media. Existing business intelligence
(BI) and analytical tools are in our estimation
less efficient at analyzing unstructured textual
information as compared with structured data
stored in database tables.
Research institutes such as PRIME Research,
the Thomas J. Watson Research Center and the
Fraunhofer Institute for Algorithms and Scientific
Computing are said to be making use of open
source frameworks like Apache UIMA for extract-
ing information from textual unstructured data
for research purposes. This approach is focused
primarily around feature-based opinion mining,
summarization, aspect-based sentiment analysis,
detecting fake reviews, etc. However, a huge gap
is observed when it comes to applications that can
be used by business to drive decisions based on
insights gained from textual unstructured data.
This paper describes a cloud-based solution that
uses information extractors such as Apache UIMA
for extracting information from textual unstruc-
tured data and SAP HANA for generating business
insights that correlate with structured enterprise
data contained in back-end systems. SAP HANA
cloud platform provides a robust platform as a
service to develop applications that can provide
cognizant 20-20 insights | june 2013
•	 Cognizant 20-20 Insights
cognizant 20-20 insights 2
analytical scalability for in-memory data analysis
via RESTful API, federated user authentication
and integration with back-end enterprise systems.
Architectural Design
Use of SAP HANA Cloud
SAP HANA cloud is a platform that can host
various components of the solution. An
Information extractor such as Apache UIMA is
first deployed on the cloud. Unstructured data is
fed into the information extractor, either in real
time or periodically. The information extractor
uses industry-specific models as a reference for
extracting information. Extracted entities and
structured enterprise data is saved into persis-
tent data storage. The SAP HANA cloud provides
HANA as an in-memory database storage device
stored in industry-specific schemas.
Consider a case of a manufacturing company that
is interested in obtaining insights from unstruc-
tured data generated in the form of official e-mail
communications with its vendors and custom-
ers. Apache UIMA can be trained to identify and
distinguish customers and vendors from their
e-mail communication. Other allied information –
such as product, part, material, location, date,
verbs and adjectives – can be extracted and saved
to the HANA database (containing manufactur-
ing industry-specific schema) in the cloud for
correlation analysis. This can help in sentiment
analysis or sourcing marketing information.
Industry-Specific Apps for Business Insights
Apps specific to an industry are used to analyze
data stored in HANA from which users can derive
insights. These apps make use of HANA’s Predic-
tive Analysis Library (PAL) to execute various
algorithms on the data. Algorithms for cluster
analysis, classification analysis and association
analysis are currently available. New apps can be
custom developed and deployed per the organiza-
tion’s requirements.
API for External Use
The SAP HANA cloud’s application programming
interface is exposed for external consumption.
External applications that may be on a cloud
or reside in on-premises systems can use these
APIs to consume business insights. Developers
can then make creative use of these insights
to develop rich user interfaces using diverse
technologies such as JavaScript or Adobe Flash
or Microsoft Silverlight. APIs can be consumed
by mobile apps as well by using a simple REST
protocol.
Data-Loading Mechanism/Use Cases
•	 Persistent storage scenario: The user of
this cloud service can opt to store data in
the HANA system persistently. This provides
continuously available analytics to the user,
who can then develop his own external apps
that make use of the insights provided by the
cloud service. He can also access custom-built
apps developed by third parties on SAP cloud
that sit on top of HANA.
An Integrated Approach to Meaning Making
Figure 1
On Premise
System Mobile Apps,
Java Apps, etc.
Other
Cloud
HANA
SAP HANA Cloud
Information
Extractor
(UIMA)
Industry
APPS
REST API
Unstructured
Data
(Industry Specific)
E-mail Docs
Web Note
Structured
Enterprise
Data
ERP DW
RDBMS OLAP
About Cognizant
Cognizant (NASDAQ: CTSH) is a leading provider of information technology, consulting, and business process
outsourcing services, dedicated to helping the world’s leading companies build stronger businesses. Headquartered
in Teaneck, New Jersey (U.S.), Cognizant combines a passion for client satisfaction, technology innovation, deep
industry and business process expertise, and a global, collaborative workforce that embodies the future of work.
With over 50 delivery centers worldwide and approximately 162,700 employees as of March 31, 2013, Cognizant is a
member of the NASDAQ-100, the S&P 500, the Forbes Global 2000, and the Fortune 500 and is ranked among the
top performing and fastest growing companies in the world.
Visit us online at www.cognizant.com for more information.
World Headquarters
500 Frank W. Burr Blvd.
Teaneck, NJ 07666 USA
Phone: +1 201 801 0233
Fax: +1 201 801 0243
Toll Free: +1 888 937 3277
Email: inquiry@cognizant.com
European Headquarters
1 Kingdom Street
Paddington Central
London W2 6BD
Phone: +44 (0) 207 297 7600
Fax: +44 (0) 207 121 0102
Email: infouk@cognizant.com
India Operations Headquarters
#5/535, Old Mahabalipuram Road
Okkiyam Pettai, Thoraipakkam
Chennai, 600 096 India
Phone: +91 (0) 44 4209 6000
Fax: +91 (0) 44 4209 6060
Email: inquiryindia@cognizant.com
­­© Copyright 2013, Cognizant. All rights reserved. No part of this document may be reproduced, stored in a retrieval system, transmitted in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise, without the express written permission from Cognizant. The information contained herein is
subject to change without notice. All other trademarks mentioned herein are the property of their respective owners.
About the Author
Rahul Aware is a Senior Associate within Cognizant’s Enterprise Application Software Business Unit. He
has nine-plus years of experience as a SAP technical consultant and is a certified SAP professional with
a flair for new technologies and their applications. He can be reached at Rahul.Aware@cognizant.com.
•	 Temporary storage scenario: This mecha-
nism is suitable for one-time consumption of
the analytic service (e.g., to generate insights
for an advertisement campaign by analyzing
consumer reviews). Once the campaign is
completed, there’s no need to keep data on
the cloud.
•	 On-the-fly scenario: This is purely in-memory
analytics where data is not stored permanently
within the HANA database. Unstructured data
can be manually fed to the information extrac-
tor via a user interface or programmatically via
API services. Extracted information is stored
as temporary tables (in-memory) in HANA;
SAP’s PAL is used to analyze the entities. Once
the output is calculated, the temporary table
is deleted so that there’s no permanent
storage. This will be of use to other sys-
tems’ software and applications that require
analytics to complete a task or transaction.
For example, a vendor may want to query a
partner’s HANA database by feeding in some
of its work-in-progress materials to receive
insights about inventory requirements con-
tained in unstructured data stored in HANA.
Cloud-Powered Analytics Empowers
the Masses
SAP HANA cloud, with its inherent in-memory
analytics, provides the right mix of cloud comput-
ing’s economic advantages with fast and potent
computing power. HANA’s PAL simplifies the
process for converting raw data into insights that
help inform smarter and more timely business
decisions.
The REST API brings this solution into the hands
of all the developers who are interested in
developing their applications around the insights
sourced from unstructured data. As such, the
solution bridges unstructured data analytics with
enterprise functions.
Footnote
1
	 HANA is an abbreviation that stands for high-performance analytical appliance.

More Related Content

What's hot

BigData-Architecture
BigData-ArchitectureBigData-Architecture
BigData-Architecture
Narayana B
 
Power of the SAP HANA Platform – Integrating non-SAP data with custom HANA ap...
Power of the SAP HANA Platform – Integrating non-SAP data with custom HANA ap...Power of the SAP HANA Platform – Integrating non-SAP data with custom HANA ap...
Power of the SAP HANA Platform – Integrating non-SAP data with custom HANA ap...
Dickinson + Associates
 

What's hot (19)

SAP HANA Use Cases for Pharma Research & Development
SAP HANA Use Cases for Pharma Research & DevelopmentSAP HANA Use Cases for Pharma Research & Development
SAP HANA Use Cases for Pharma Research & Development
 
SAP BusinessObjects Cloud Demo
SAP BusinessObjects Cloud DemoSAP BusinessObjects Cloud Demo
SAP BusinessObjects Cloud Demo
 
Business Intelligence tools comparison
Business Intelligence tools comparisonBusiness Intelligence tools comparison
Business Intelligence tools comparison
 
Sap bw4 hana architecture archetypes
Sap bw4 hana architecture archetypesSap bw4 hana architecture archetypes
Sap bw4 hana architecture archetypes
 
BI 解決方案介紹
BI 解決方案介紹BI 解決方案介紹
BI 解決方案介紹
 
Introduction To Pentaho
Introduction To PentahoIntroduction To Pentaho
Introduction To Pentaho
 
Olap
OlapOlap
Olap
 
Dmm117 – SAP HANA Processing Services Text Spatial Graph Series and Predictive
Dmm117 – SAP HANA Processing Services Text Spatial Graph Series and PredictiveDmm117 – SAP HANA Processing Services Text Spatial Graph Series and Predictive
Dmm117 – SAP HANA Processing Services Text Spatial Graph Series and Predictive
 
Enterprise architecture for big data projects
Enterprise architecture for big data projectsEnterprise architecture for big data projects
Enterprise architecture for big data projects
 
Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...
Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...
Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...
 
Dmm212 – Sap Hana Graph Processing
Dmm212 – Sap Hana  Graph ProcessingDmm212 – Sap Hana  Graph Processing
Dmm212 – Sap Hana Graph Processing
 
SaaSRefArch
SaaSRefArchSaaSRefArch
SaaSRefArch
 
Finance and Audit Predictive Analytics
Finance and Audit Predictive AnalyticsFinance and Audit Predictive Analytics
Finance and Audit Predictive Analytics
 
BigData-Architecture
BigData-ArchitectureBigData-Architecture
BigData-Architecture
 
Intranet systems beyond SharePoint
Intranet systems beyond SharePointIntranet systems beyond SharePoint
Intranet systems beyond SharePoint
 
Solution architecture
Solution architectureSolution architecture
Solution architecture
 
Solution architecture for big data projects
Solution architecture for big data projectsSolution architecture for big data projects
Solution architecture for big data projects
 
Power of the SAP HANA Platform – Integrating non-SAP data with custom HANA ap...
Power of the SAP HANA Platform – Integrating non-SAP data with custom HANA ap...Power of the SAP HANA Platform – Integrating non-SAP data with custom HANA ap...
Power of the SAP HANA Platform – Integrating non-SAP data with custom HANA ap...
 
Driving Business Applications with Real-Time Data
Driving Business Applications with Real-Time DataDriving Business Applications with Real-Time Data
Driving Business Applications with Real-Time Data
 

Similar to How SAP HANA Leverages the Cloud to Glean Business Insights from Unstructured Data

ManMachine&Mathematics_Arup_Ray_Ext
ManMachine&Mathematics_Arup_Ray_ExtManMachine&Mathematics_Arup_Ray_Ext
ManMachine&Mathematics_Arup_Ray_Ext
Arup Ray
 
intelligent-data-lake_executive-brief
intelligent-data-lake_executive-briefintelligent-data-lake_executive-brief
intelligent-data-lake_executive-brief
Lindy-Anne Botha
 

Similar to How SAP HANA Leverages the Cloud to Glean Business Insights from Unstructured Data (20)

Sap hana
Sap hanaSap hana
Sap hana
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
 
Getting Started with BI Analytics on HANA
Getting Started with BI Analytics on HANAGetting Started with BI Analytics on HANA
Getting Started with BI Analytics on HANA
 
SAP Data Hub – What is it, and what’s new? (Sefan Linders)
SAP Data Hub – What is it, and what’s new? (Sefan Linders)SAP Data Hub – What is it, and what’s new? (Sefan Linders)
SAP Data Hub – What is it, and what’s new? (Sefan Linders)
 
Business intelligence
Business intelligenceBusiness intelligence
Business intelligence
 
Overview of SAP HANA Cloud Platform
Overview of SAP HANA Cloud PlatformOverview of SAP HANA Cloud Platform
Overview of SAP HANA Cloud Platform
 
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact SheetBig Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
 
Digital Business with SAP B1 - Introduction
Digital Business with SAP B1 - IntroductionDigital Business with SAP B1 - Introduction
Digital Business with SAP B1 - Introduction
 
Capture and Feed Telecom Network Data and More Into SAP HANA - Quicky and Aff...
Capture and Feed Telecom Network Data and More Into SAP HANA - Quicky and Aff...Capture and Feed Telecom Network Data and More Into SAP HANA - Quicky and Aff...
Capture and Feed Telecom Network Data and More Into SAP HANA - Quicky and Aff...
 
Extend SAP S/4HANA to deliver real-time intelligent processes
Extend SAP S/4HANA to deliver real-time intelligent processesExtend SAP S/4HANA to deliver real-time intelligent processes
Extend SAP S/4HANA to deliver real-time intelligent processes
 
SAP Cloud Platform Community NL Kick-off
SAP Cloud Platform Community NL Kick-offSAP Cloud Platform Community NL Kick-off
SAP Cloud Platform Community NL Kick-off
 
Become a Smart Enterprise with SAP Analytics Cloud - ConVista Asia
Become a Smart Enterprise with SAP Analytics Cloud - ConVista AsiaBecome a Smart Enterprise with SAP Analytics Cloud - ConVista Asia
Become a Smart Enterprise with SAP Analytics Cloud - ConVista Asia
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
 
CIO Guide to Using SAP HANA Platform For Big Data
CIO Guide to Using SAP HANA Platform For Big DataCIO Guide to Using SAP HANA Platform For Big Data
CIO Guide to Using SAP HANA Platform For Big Data
 
ManMachine&Mathematics_Arup_Ray_Ext
ManMachine&Mathematics_Arup_Ray_ExtManMachine&Mathematics_Arup_Ray_Ext
ManMachine&Mathematics_Arup_Ray_Ext
 
intelligent-data-lake_executive-brief
intelligent-data-lake_executive-briefintelligent-data-lake_executive-brief
intelligent-data-lake_executive-brief
 
ISV- Connect, Transform and Reimagine- Help your customers take advantage of ...
ISV- Connect, Transform and Reimagine- Help your customers take advantage of ...ISV- Connect, Transform and Reimagine- Help your customers take advantage of ...
ISV- Connect, Transform and Reimagine- Help your customers take advantage of ...
 
Connect, Transform and Reimagine- Help your customers take advantage of Inter...
Connect, Transform and Reimagine- Help your customers take advantage of Inter...Connect, Transform and Reimagine- Help your customers take advantage of Inter...
Connect, Transform and Reimagine- Help your customers take advantage of Inter...
 
Move to S4 HANA
Move to S4 HANA Move to S4 HANA
Move to S4 HANA
 
Project report
Project reportProject report
Project report
 

More from Cognizant

More from Cognizant (20)

Using Adaptive Scrum to Tame Process Reverse Engineering in Data Analytics Pr...
Using Adaptive Scrum to Tame Process Reverse Engineering in Data Analytics Pr...Using Adaptive Scrum to Tame Process Reverse Engineering in Data Analytics Pr...
Using Adaptive Scrum to Tame Process Reverse Engineering in Data Analytics Pr...
 
Data Modernization: Breaking the AI Vicious Cycle for Superior Decision-making
Data Modernization: Breaking the AI Vicious Cycle for Superior Decision-makingData Modernization: Breaking the AI Vicious Cycle for Superior Decision-making
Data Modernization: Breaking the AI Vicious Cycle for Superior Decision-making
 
It Takes an Ecosystem: How Technology Companies Deliver Exceptional Experiences
It Takes an Ecosystem: How Technology Companies Deliver Exceptional ExperiencesIt Takes an Ecosystem: How Technology Companies Deliver Exceptional Experiences
It Takes an Ecosystem: How Technology Companies Deliver Exceptional Experiences
 
Intuition Engineered
Intuition EngineeredIntuition Engineered
Intuition Engineered
 
The Work Ahead: Transportation and Logistics Delivering on the Digital-Physic...
The Work Ahead: Transportation and Logistics Delivering on the Digital-Physic...The Work Ahead: Transportation and Logistics Delivering on the Digital-Physic...
The Work Ahead: Transportation and Logistics Delivering on the Digital-Physic...
 
Enhancing Desirability: Five Considerations for Winning Digital Initiatives
Enhancing Desirability: Five Considerations for Winning Digital InitiativesEnhancing Desirability: Five Considerations for Winning Digital Initiatives
Enhancing Desirability: Five Considerations for Winning Digital Initiatives
 
The Work Ahead in Manufacturing: Fulfilling the Agility Mandate
The Work Ahead in Manufacturing: Fulfilling the Agility MandateThe Work Ahead in Manufacturing: Fulfilling the Agility Mandate
The Work Ahead in Manufacturing: Fulfilling the Agility Mandate
 
The Work Ahead in Higher Education: Repaving the Road for the Employees of To...
The Work Ahead in Higher Education: Repaving the Road for the Employees of To...The Work Ahead in Higher Education: Repaving the Road for the Employees of To...
The Work Ahead in Higher Education: Repaving the Road for the Employees of To...
 
Engineering the Next-Gen Digital Claims Organisation for Australian General I...
Engineering the Next-Gen Digital Claims Organisation for Australian General I...Engineering the Next-Gen Digital Claims Organisation for Australian General I...
Engineering the Next-Gen Digital Claims Organisation for Australian General I...
 
Profitability in the Direct-to-Consumer Marketplace: A Playbook for Media and...
Profitability in the Direct-to-Consumer Marketplace: A Playbook for Media and...Profitability in the Direct-to-Consumer Marketplace: A Playbook for Media and...
Profitability in the Direct-to-Consumer Marketplace: A Playbook for Media and...
 
Green Rush: The Economic Imperative for Sustainability
Green Rush: The Economic Imperative for SustainabilityGreen Rush: The Economic Imperative for Sustainability
Green Rush: The Economic Imperative for Sustainability
 
Policy Administration Modernization: Four Paths for Insurers
Policy Administration Modernization: Four Paths for InsurersPolicy Administration Modernization: Four Paths for Insurers
Policy Administration Modernization: Four Paths for Insurers
 
The Work Ahead in Utilities: Powering a Sustainable Future with Digital
The Work Ahead in Utilities: Powering a Sustainable Future with DigitalThe Work Ahead in Utilities: Powering a Sustainable Future with Digital
The Work Ahead in Utilities: Powering a Sustainable Future with Digital
 
AI in Media & Entertainment: Starting the Journey to Value
AI in Media & Entertainment: Starting the Journey to ValueAI in Media & Entertainment: Starting the Journey to Value
AI in Media & Entertainment: Starting the Journey to Value
 
Operations Workforce Management: A Data-Informed, Digital-First Approach
Operations Workforce Management: A Data-Informed, Digital-First ApproachOperations Workforce Management: A Data-Informed, Digital-First Approach
Operations Workforce Management: A Data-Informed, Digital-First Approach
 
Five Priorities for Quality Engineering When Taking Banking to the Cloud
Five Priorities for Quality Engineering When Taking Banking to the CloudFive Priorities for Quality Engineering When Taking Banking to the Cloud
Five Priorities for Quality Engineering When Taking Banking to the Cloud
 
Getting Ahead With AI: How APAC Companies Replicate Success by Remaining Focused
Getting Ahead With AI: How APAC Companies Replicate Success by Remaining FocusedGetting Ahead With AI: How APAC Companies Replicate Success by Remaining Focused
Getting Ahead With AI: How APAC Companies Replicate Success by Remaining Focused
 
Crafting the Utility of the Future
Crafting the Utility of the FutureCrafting the Utility of the Future
Crafting the Utility of the Future
 
Utilities Can Ramp Up CX with a Customer Data Platform
Utilities Can Ramp Up CX with a Customer Data PlatformUtilities Can Ramp Up CX with a Customer Data Platform
Utilities Can Ramp Up CX with a Customer Data Platform
 
The Work Ahead in Intelligent Automation: Coping with Complexity in a Post-Pa...
The Work Ahead in Intelligent Automation: Coping with Complexity in a Post-Pa...The Work Ahead in Intelligent Automation: Coping with Complexity in a Post-Pa...
The Work Ahead in Intelligent Automation: Coping with Complexity in a Post-Pa...
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 

How SAP HANA Leverages the Cloud to Glean Business Insights from Unstructured Data

  • 1. How SAP HANA Leverages the Cloud to Glean Business Insights from Unstructured Data SAP’s in-memory computing and cloud-based delivery model makes a strong business case for investing in more innovative ways of analyzing unstructured data. Executive Summary For most organizations, significant volumes of data are locked in the form of unstructured data generated by the broad sweep of social and mobile interactions that pervade our personal and professional lives. With SAP’s “in-memory” technology, called HANA,1 made available in the cloud, organizations can utilize the power of cloud computing to access tools required to effec- tively and efficiently analyze unstructured data. Unstructured data is generated in various forms during formal and informal operations. Textual forms of unstructured data include e-mails, contracts (contained in PDF documents), medical records and claims and customer reviews and feedback shared via traditional Web sites and social media. Existing business intelligence (BI) and analytical tools are in our estimation less efficient at analyzing unstructured textual information as compared with structured data stored in database tables. Research institutes such as PRIME Research, the Thomas J. Watson Research Center and the Fraunhofer Institute for Algorithms and Scientific Computing are said to be making use of open source frameworks like Apache UIMA for extract- ing information from textual unstructured data for research purposes. This approach is focused primarily around feature-based opinion mining, summarization, aspect-based sentiment analysis, detecting fake reviews, etc. However, a huge gap is observed when it comes to applications that can be used by business to drive decisions based on insights gained from textual unstructured data. This paper describes a cloud-based solution that uses information extractors such as Apache UIMA for extracting information from textual unstruc- tured data and SAP HANA for generating business insights that correlate with structured enterprise data contained in back-end systems. SAP HANA cloud platform provides a robust platform as a service to develop applications that can provide cognizant 20-20 insights | june 2013 • Cognizant 20-20 Insights
  • 2. cognizant 20-20 insights 2 analytical scalability for in-memory data analysis via RESTful API, federated user authentication and integration with back-end enterprise systems. Architectural Design Use of SAP HANA Cloud SAP HANA cloud is a platform that can host various components of the solution. An Information extractor such as Apache UIMA is first deployed on the cloud. Unstructured data is fed into the information extractor, either in real time or periodically. The information extractor uses industry-specific models as a reference for extracting information. Extracted entities and structured enterprise data is saved into persis- tent data storage. The SAP HANA cloud provides HANA as an in-memory database storage device stored in industry-specific schemas. Consider a case of a manufacturing company that is interested in obtaining insights from unstruc- tured data generated in the form of official e-mail communications with its vendors and custom- ers. Apache UIMA can be trained to identify and distinguish customers and vendors from their e-mail communication. Other allied information – such as product, part, material, location, date, verbs and adjectives – can be extracted and saved to the HANA database (containing manufactur- ing industry-specific schema) in the cloud for correlation analysis. This can help in sentiment analysis or sourcing marketing information. Industry-Specific Apps for Business Insights Apps specific to an industry are used to analyze data stored in HANA from which users can derive insights. These apps make use of HANA’s Predic- tive Analysis Library (PAL) to execute various algorithms on the data. Algorithms for cluster analysis, classification analysis and association analysis are currently available. New apps can be custom developed and deployed per the organiza- tion’s requirements. API for External Use The SAP HANA cloud’s application programming interface is exposed for external consumption. External applications that may be on a cloud or reside in on-premises systems can use these APIs to consume business insights. Developers can then make creative use of these insights to develop rich user interfaces using diverse technologies such as JavaScript or Adobe Flash or Microsoft Silverlight. APIs can be consumed by mobile apps as well by using a simple REST protocol. Data-Loading Mechanism/Use Cases • Persistent storage scenario: The user of this cloud service can opt to store data in the HANA system persistently. This provides continuously available analytics to the user, who can then develop his own external apps that make use of the insights provided by the cloud service. He can also access custom-built apps developed by third parties on SAP cloud that sit on top of HANA. An Integrated Approach to Meaning Making Figure 1 On Premise System Mobile Apps, Java Apps, etc. Other Cloud HANA SAP HANA Cloud Information Extractor (UIMA) Industry APPS REST API Unstructured Data (Industry Specific) E-mail Docs Web Note Structured Enterprise Data ERP DW RDBMS OLAP
  • 3. About Cognizant Cognizant (NASDAQ: CTSH) is a leading provider of information technology, consulting, and business process outsourcing services, dedicated to helping the world’s leading companies build stronger businesses. Headquartered in Teaneck, New Jersey (U.S.), Cognizant combines a passion for client satisfaction, technology innovation, deep industry and business process expertise, and a global, collaborative workforce that embodies the future of work. With over 50 delivery centers worldwide and approximately 162,700 employees as of March 31, 2013, Cognizant is a member of the NASDAQ-100, the S&P 500, the Forbes Global 2000, and the Fortune 500 and is ranked among the top performing and fastest growing companies in the world. Visit us online at www.cognizant.com for more information. World Headquarters 500 Frank W. Burr Blvd. Teaneck, NJ 07666 USA Phone: +1 201 801 0233 Fax: +1 201 801 0243 Toll Free: +1 888 937 3277 Email: inquiry@cognizant.com European Headquarters 1 Kingdom Street Paddington Central London W2 6BD Phone: +44 (0) 207 297 7600 Fax: +44 (0) 207 121 0102 Email: infouk@cognizant.com India Operations Headquarters #5/535, Old Mahabalipuram Road Okkiyam Pettai, Thoraipakkam Chennai, 600 096 India Phone: +91 (0) 44 4209 6000 Fax: +91 (0) 44 4209 6060 Email: inquiryindia@cognizant.com ­­© Copyright 2013, Cognizant. All rights reserved. No part of this document may be reproduced, stored in a retrieval system, transmitted in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the express written permission from Cognizant. The information contained herein is subject to change without notice. All other trademarks mentioned herein are the property of their respective owners. About the Author Rahul Aware is a Senior Associate within Cognizant’s Enterprise Application Software Business Unit. He has nine-plus years of experience as a SAP technical consultant and is a certified SAP professional with a flair for new technologies and their applications. He can be reached at Rahul.Aware@cognizant.com. • Temporary storage scenario: This mecha- nism is suitable for one-time consumption of the analytic service (e.g., to generate insights for an advertisement campaign by analyzing consumer reviews). Once the campaign is completed, there’s no need to keep data on the cloud. • On-the-fly scenario: This is purely in-memory analytics where data is not stored permanently within the HANA database. Unstructured data can be manually fed to the information extrac- tor via a user interface or programmatically via API services. Extracted information is stored as temporary tables (in-memory) in HANA; SAP’s PAL is used to analyze the entities. Once the output is calculated, the temporary table is deleted so that there’s no permanent storage. This will be of use to other sys- tems’ software and applications that require analytics to complete a task or transaction. For example, a vendor may want to query a partner’s HANA database by feeding in some of its work-in-progress materials to receive insights about inventory requirements con- tained in unstructured data stored in HANA. Cloud-Powered Analytics Empowers the Masses SAP HANA cloud, with its inherent in-memory analytics, provides the right mix of cloud comput- ing’s economic advantages with fast and potent computing power. HANA’s PAL simplifies the process for converting raw data into insights that help inform smarter and more timely business decisions. The REST API brings this solution into the hands of all the developers who are interested in developing their applications around the insights sourced from unstructured data. As such, the solution bridges unstructured data analytics with enterprise functions. Footnote 1 HANA is an abbreviation that stands for high-performance analytical appliance.