SlideShare a Scribd company logo
1 of 38
Download to read offline
1 
©2014 SAP AG or an SAP affiliate company. All rights reserved. 
SAP HANA SPS 09 – What’s New? Text Analysis 
SAP HANA Product Management November, 2014 
(Delta from SPS 08 to SPS 09)
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
2 
Public 
Agenda (1/2) 
New or Improved Text Analysis Features 
Custom extraction rules 
Document filters 
Voice of Customer 
TA_PARENT 
Language identification 
Word segmentation 
New or Improved Language Coverage 
Indonesian support 
Part-of-Speech & Noun Groups for Hebrew, Thai, Turkish 
Cyrillic Script for Serbian 
Core extraction for Traditional Chinese 
Voice of Customer for Italian, Portuguese, Russian, Traditional Chinese 
Emoticon & Profanity extraction for Dutch 
Public Sector fact extraction for English 
Enterprise fact extraction for English
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
3 
Public 
Agenda (2/2) 
New Text Mining Features 
Intro 
Top-ranked functions 
Characteristics 
Sample 
Queries 
Demo
New or Improved Text Analysis Features
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
5 
Public 
New Support of Custom Extraction Rules (1/2) 
Rule patterns enable you to identify more complex entity types than text analysis dictionaries can 
Dictionaries are ideal for specifying named entities, whereas Custom Grouper User Language (CGUL) rules enable you to identify events, relationships, etc. 
CGUL rules can leverage linguistic markup, core entities and custom dictionaries 
Several CGUL rules are included in a rule set which is stored as a file in the SAP HANA repository 
#group BuyingEvent: { [SN] <>*? 
[TE PERSON] <>+ [/TE] <>*? 
<STEM:buy> <>*? 
[NP] <>+ [/NP] <>*? 
[TE DATE] <>+ [/TE] 
<>* [/SN] }
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
6 
Public 
New Support of Custom Extraction Rules (2/2) 
Allow users to create CGUL rule sets and store them in the repository 
A single custom rule set may support all supported languages or a single language 
Custom rule sets reside in the SAP HANA repository and benefit from its life cycle management 
Custom rules sets are interchangeable with Text Data Processing on SAP Data Services 
Steps 
1.Choose the project to contain the new rule set in the Development perspective of SAP HANA Studio. 
2.Enter or select a parent folder and enter the rule set file name in the Wizard. The file extension must be .hdbtextrule. Your rule set file is created locally and opens as an empty file in the text editor. 
3.Enter your text analysis rule set specification into the new file and save it locally. 
4.Commit your new rule set. The rule set is now synchronized to the repository as a design time object and the icon shows the rule set is committed. 
5.Activate once you have finished editing your rule set. The rule set is created in the repository as a runtime object and the icon shows the rule set is activated. This allows you and others to use the rule set. If you haven’t done so previously, you will need to create a custom text analysis configuration as well.
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
7 
Public 
Improved Document Filters 
Document filters in the NLP engine automatically detect and extract text content and metadata from almost any type of binary file format from PPT to XLS to PDF, etc. 
•Additional format support – new and enhanced file types 
•Performance and viewing fidelity improvements: 
•Major enhancements to speed, memory usage, and viewing fidelity of MS Excel 
•Improved consistency of output across all formats 
•Bug fixes
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
8 
Public 
Challenge: Make Existing Sentiment Analysis Easier to Use (1/6) 
Redesign the Voice of Customer (VOC) language module for domain customization via simple positive/negative dictionaries instead of complex rule modifications 
This will result in: 
easier customizability 
improved runtime performance 
reliably extended extraction coverage 
Simplify Voice of Customer module for domain customization by non-experts
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
9 
Public 
Improved Voice of Customer (2/6) 
Sentiment analysis modules for previously available English, French, German, Spanish and Simplified Chinese have been redesigned. 
New languages adhere to the redesign: Italian, Portuguese, Russian and Traditional Chinese 
Keyword and profanity dictionaries identify and classify sentiments and are easily customizable, if needed. These dictionaries use the EXTRACTION_CORE_VOICEOFCUSTOMER configuration. 
Dictionaries are located in the sap.hana.ta.voc package of the SAP HANA repository. 
Language 
EXTRACTION_CORE_VOICEOFCUSTOMER 
Chinese (Simplified) 
IMPROVED 
Chinese (Traditional) 
NEW 
Dutch 
NEW (Emoticons & Profanity only) 
English 
IMPROVED 
French 
IMPROVED 
German 
IMPROVED 
Italian 
NEW 
Portuguese 
NEW 
Russian 
NEW 
Spanish 
IMPROVED
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
10 
Public 
Previous Voice of Customer Rule Behavior (3/6) 
Rule patterns tried to extract all the following pieces within one pass: 
Positive or negative keyword 
Possible topic 
Possible presence of an intensifier that modifies the positive/negative keyword 
Possible presence of a negation 
I am [happy].  Weak Positive I am [not happy].  Weak Negative I am [tremendously happy].  Strong Positive I am [not tremendously happy].  Weak Negative I am [happy with my purchase].  Weak Positive I am [not happy with my purchase].  Weak Negative I am [tremendously happy with my purchase].  Strong Positive I am [not tremendously happy with my purchase].  Weak Negative 
Rules became very complex: need to account for all combinations of these 4 pieces of information and output the right sentiment value. 
Not easily customizable: list of positive/negative keywords lived in the CGUL rules. 
Resulting in very large finite state networks which impacted runtime performance.
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
11 
Public 
Rule Behavior with Improved Voice of Customer (4/6) 
Now there are 2 sets of VOC rules that require a post-processing filtering mechanism (embedded in the NLP engine): 
1) One set of rules identifies positive/negative keywords with topics 
2) The other set of rules identifies positive/negative keywords along with their modifiers (negation and intensifier) 
3) The filtering mechanism sorts through the rule results and outputs one final value for each Keyword/Topic/Modifier combination 
Two Rule Sets Output I am <not [happy> with my purchase].  “happy” + “my purchase”  WeakNegative  “happy” + “not” I am <tremendously [happy> with my purchase].  “happy” + “my purchase”  StrongPositive  “happy” + “tremendously” I am <not tremendously [happy> with my purchase].  “happy” + “my purchase”  WeakNegative  “happy” + “tremendously” + “not”
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
12 
Public 
Benefits of Improved Voice of Customer (5/6) 
Rules are easily customizable via a dictionary: 
•Positive/negative keywords are now maintained via a customizable dictionary. 
•Entries in the dictionary get called by the CGUL rules  No need to go through and modify the CGUL code. 
•Users can add or remove entries as well as change the positive/negative categorization of entries to better match their vertical domain. 
Better runtime performance: 
•Best case example: German module is 71% faster 
Extension of extraction coverage to subordinate clauses, such as “I don’t think your product is good” which did not reliably extract before because of the many possible intervening tokens between the negation “don’t” and the keyword “good”. 
•Before: I don’t think [Topic]your product[/Topic] is [WeakPositive]good[/WeakPositive]. 
•After: I don’t think [Topic]your product[/Topic] is [WeakNegative]good[/WeakNegative].
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
13 
Public 
What Differentiates Voice of Customer (6/6) 
SAP solution … 
Other offerings … 
7 sentiment levels (strong positive, weak positive, neutral, weak negative, strong negative, major problem, minor problem) 
Usually limited to 3 (positive, neutral, negative) 
Identification of topic for each sentiment 
Topics not necessarily linked to sentiments 
One sentiment per clause 
Often only one sentiment/document 
Identification of requests 
N/a 
Broad domain coverage – from automotive to electronics 
Usually limited to a particular domain – e.g., hospitality 
Wide input coverage (customer reviews, CRM messages, social media, surveys, etc.) 
Usually limited to social media 
Consistent across languages 
Vary widely across languages
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
14 
Public 
New TA_PARENT 
A new column has been added to the $TA output table. 
The TA_PARENT column stores the TA_COUNTER value of the parent token, or NULL if the token has no parent. This field is used to indicate that there is a semantic relationship between two tokens. 
For example, it is used by the EXTRACTION_CORE_VOICEOFCUSTOMER rules to relate topics to their enclosing sentiments.
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
15 
Public 
Improved Language Identification 
Text analysis automatically detects the language of the input text in order to apply the appropriate linguistic rules. 
Language identification for Farsi, Polish, Thai and Turkish has higher precision.
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
16 
Public 
Improved Word Segmentation for Chinese 
More granular Chinese tokenization improves recall in searches.
New or Improved Language Coverage
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
18 
Public 
Available Text Analysis Configuration Options 
Language 
LINGANALYSIS_BASIC LINGANALYSIS_STEMS 
LINGANALYSIS_FULL 
EXTRACTION_CORE 
EXTRACTION_CORE_VOICEOFCUSTOMER 
EXTRACTION_CORE_PUBLIC_SECTOR 
EXTRACTION_CORE_ENTERPRISE 
Arabic 
 
 
 
Catalan 
 
 
Chinese (Simplified) 
 
 
 
 IMPROVED 
Chinese (Traditional) 
 
 
 NEW 
 NEW 
Croatian 
 
 
Czech 
 
 
Danish 
 
 
Dutch 
 
 
 
 NEW (Emoticons & Profanity only) 
English 
 
 
 
 IMPROVED 
 NEW 
 NEW 
Farsi 
 
 
 
French 
 
 
 
 IMPROVED 
German 
 
 
 
 IMPROVED 
Greek 
 
Hebrew 
 
 NEW 
Hungarian 
 
Indonesian 
 NEW 
 NEW 
Italian 
 
 
 
 NEW 
Japanese 
 
 
 
Korean 
 
 
 
Norwegian (Bokmal) 
 
 
Norwegian (Nynorsk) 
 
 
Polish 
 
Portuguese 
 
 
 
 NEW 
Romanian 
 
Russian 
 
 
 
 NEW 
Serbian 
 
 NEW (Cyrillic support) 
Slovak 
 
 
Slovenian 
 
 
Spanish 
 
 
 
 IMPROVED 
Swedish 
 
 
Thai 
 
 NEW 
Turkish 
 
 NEW
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
19 
Public 
New Indonesian Language Support 
Bahasa Indonesia is the official language of Indonesia. 
Indonesia is the 4th most populous nation in the world. The majority speak Indonesian, making it one of the most widely spoken languages in the world. 
The type of language support: 
•Language identification – apply the appropriate grammatical rules and dictionaries 
•Tokenization – decompose word sequences, e.g. “the quick brown fox” > “the” “quick” “brown” “fox” 
•Stemming – reduce to linguistic base form, e.g. “ran” > “run” 
•Part-of-Speech – tag word categories, e.g. “houses”: Nn-Pl 
•Noun Groups – identify concepts, e.g. “global piracy”
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
20 
Public 
Improved Language Support for Hebrew, Thai, Turkish 
Full linguistic analysis support by adding Part-of-Speech (POS) tagging and Noun Group (concepts) extraction for the following languages: 
•Hebrew 
•Thai 
•Turkish
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
21 
Public 
New Cyrillic Script Support for Serbian 
The Cyrillic script is an alphabetic writing system employed across Eastern Europe, North and Central Asian countries. 
Added Cyrillic script support to the previous Serbian Latin script coverage.
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
22 
Public 
New Predefined Core Extraction for Traditional Chinese 
TITLE President 
PERSON Barak Obama 
PEOPLE Greeks 
LANGUAGE Greek 
ADDRESS1 245 First Street Floor 16 
Cambridge, MA 02142 
LOCALITY Cambridge 
REGION@MAJOR Connecticut 
COUNTRY Brazil 
CONTINENT South America 
GEO_FEATURE Mount Fuji 
GEO_AREA Scandinavia 
ORGANIZATION@COMMERCIAL AT&T 
ORGANIZATION@EDUCATIONAL University of Washington 
ORGANIZATION@OTHER FBI 
PRODUCT iPhone 
SOCIAL_MEDIA@TWITTER_ID @SAP 
SOCIAL_MEDIA@TWITTER_TOPIC #HANA 
DATE 2/14/2011 DAY Monday MONTH June YEAR 2011 TIME 3:47pm TIME_PERIOD 3 days, from 9 to 5pm HOLIDAY Memorial Day CURRENCY 17 euros MEASURE 217 meters PERCENT 4% PHONE 617-677-2030 URI@EMAIL john.smith@sap.com URI@IP 165.14.2.0 URI@URL http://sap.com Syntactic Entities: NOUN_GROUP big umbrella PROP_MISC Cup o’ Soup
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
23 
Public 
New VOC for Italian, Portuguese, Russian, Traditional Chinese 
The following major fact types are classified: 
Sentiments: expression of a customer’s feelings about something 
Problems: a statement about something which impedes a customer’s work 
Requests: expression of a customer’s desire for an enhancement/change 
Profanity: defines a set of pejorative vocabulary 
Emoticons: expression of someone's feelings about the whole sentence or situation
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
24 
Public 
New Profanity & Emoticon Extraction for Dutch 
The following major fact types are classified: 
Sentiments: expression of a customer’s feelings about something 
Problems: a statement about something which impedes a customer’s work 
Requests: expression of a customer’s desire for an enhancement/change 
Profanity: defines a set of pejorative vocabulary 
Emoticons: expression of someone's feelings about the whole sentence or situation 
Note: Dutch does not support the above ‘faint’ sentiment analysis
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
25 
Public 
New Public Sector Extraction for English (1/2) 
Augments predefined entity types for core extraction with a number of entity, event, and relation types targeting public sector’s needs. 
The following major fact types are classified: 
Action: information about action and travel events 
Military Units: information about teams, wings, and squadrons 
Organizational Information: information about organizations 
Person-Alias: information about a person’s possible aliases 
Person-Appearance: information about a person’s appearance 
Person-Attributes: information about a person’s non-appearance attributes 
Person-Relationships: information about a person’s relationships 
Spatial References: distances, cardinal directions, or locations
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
26 
Public 
New Public Sector Extraction for English (2/2) 
The following 15+ types of named entities specific to public sector are extracted: 
VEHICLE/AIR, VEHICLE/LAND, VEHICLE/WATER, VEHICLE/VIN, VEHICLE/LICENCE 
PRECURSOR/CHEMICAL, PRECURSOR/NUCLEAR 
WEAPON/BIOLOGICAL, WEAPON/CHEMICAL, WEAPON/EXPLODING, WEAPON/NUCLEAR, WEAPON/PROJECTILE, WEAPON/SHOOTING 
GEOCOORD and MGRS (Military Grid Reference System) 
The following 40+ types of common entities specific to public sector are extracted: 
COMMON_COUNTRY (beloved country) 
COMMON_FACILITY (commercial airport) 
COMMON_PERSON (math teacher) 
Etc.
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
27 
Public 
New Enterprise Extraction for English 
Rules for the extraction of entities and facts of particular interest to the enterprise domain. 
The following major fact types are classified: 
Membership Information: information about a person’s affiliations 
Management Changes: information about management changes 
Product Releases: information about product releases 
Mergers & Acquisitions: information about mergers and acquisitions 
Organizational Information: founder, location or contact information
New Text Mining Features
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
29 
Public 
New Text Mining 
Text mining works at the document level – making semantic determinations about the overall content of documents relative to other documents. Whereas text analysis does linguistic analysis and extracts information embedded within each document. 
Functions based on Vector Space Model 
Identify similar documents 
Identify key terms of a document 
Identify related terms 
Categorize new documents based on a training corpus 
Scenarios 
Highlight the key terms when viewing a patent document 
Identify similar incidents for faster problem solving 
Categorize new scientific papers along a hierarchy of topics 
t1 
tn 
d1 
d2
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
30 
Public 
Text Mining – Top-Ranked Functions 
Input / Output 
Terms 
Documents 
Terms 
related terms, suggested terms 
relevant documents 
Documents 
relevant terms 
related documents, 
categorize
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
31 
Public 
Text Mining Index – Characteristics 
The text mining index is an optional data structure that is built from the results of linguistic analysis. It is bound to the full-text indexing and text analysis process. 
Full Text Indexing 
Full Text Index 
Text Analysis 
Results Table 
Full Text Indexing with TA and TM 
Text Mining Index 
TM config. 
insert 
ID 
TITLE
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
32 
Public 
New Text Mining – Basic Sample 
Prerequisite 
data is stored in a SAP HANA table 
Goal 
build a text mining index in order to use the text mining functions 
CREATE FULLTEXT INDEX myIndexName ON myTable(myColumn) TEXT MINING ON;
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
33 
Public 
New Text Mining – Queries 
SAP HANA provides a server-side JavaScript interface for the Text Mining engine 
getRelatedTerms 
getRelatdDocuments 
and more 
SAP HANA 
Indexserver 
Tables 
Engine 
HANA App 
Preprocessor 
Linguistic Processing 
Entity, Fact Extraction 
Extended Application Services (XS) 
TM API 
Views 
Text Mining
Demo 
Text Mining in SAP HANA SPS09
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
35 
Public 
Disclaimer 
This presentation outlines our general product direction and should not be relied on in making a purchase decision. This presentation is not subject to your license agreement or any other agreement with SAP. 
SAP has no obligation to pursue any course of business outlined in this presentation or to develop or release any functionality mentioned in this presentation. This presentation and SAP’s strategy and possible future developments are subject to change and may be changed by SAP at any time for any reason without notice. 
This document is provided without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. SAP assumes no responsibility for errors or omissions in this document, except if such damages were caused by SAP intentionally or grossly negligent.
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
36 
Public 
How to find SAP HANA documentation on this topic? 
SAP HANA Platform SPS 09 
What’s New – Release Notes 
Development 
–SAP HANA Developer Guide 
–SAP HANA Text Analysis Developer Guide 
–SAP HANA Text Mining Developer Guide 
References 
–SAP HANA Text Analysis Extraction Customization Guide 
–SAP HANA Text Analysis Language Reference Guide 
•In addition to this learning material, you find SAP HANA documentation on SAP Help Portal knowledge center at http://help.sap.com/hana_platform. 
•The knowledge center is structured according to the product lifecycle: installation, security, administration, development.
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
Thank you 
Anthony Waite 
SAP HANA Product Management 
AskSAPHANA@sap.com
©2014 SAP SE or an SAP affiliate company. All rights reserved. 
38 
Public 
© 2014 SAP SE or an SAP affiliate company. All rights reserved. 
No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP SE or an SAP affiliate company. 
SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP SE (or an SAP affiliate company) in Germany and other countries. Please see http://global12.sap.com/corporate-en/legal/copyright/index.epx for additional trademark information and notices. 
Some software products marketed by SAP SE and its distributors contain proprietary software components of other software vendors. 
National product specifications may vary. 
These materials are provided by SAP SE or an SAP affiliate company for informational purposes only, without representation or warranty of any kind, and SAP SE or its affiliated companies shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP SE or SAP affiliate company products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty. 
In particular, SAP SE or its affiliated companies have no obligation to pursue any course of business outlined in this document or any related presentation, or to develop or release any functionality mentioned therein. This document, or any related presentation, and SAP SE’s or its affiliated companies’ strategy and possible future developments, products, and/or platform directions and functionality are all subject to change and may be changed by SAP SE or its affiliated companies at any time for any reason without notice. The information in this document is not a commitment, promise, or legal obligation to deliver any material, code, or functionality. All forward- looking statements are subject to various risks and uncertainties that could cause actual results to differ materially from expectations. Readers are cautioned not to place undue reliance on these forward-looking statements, which speak only as of their dates, and they should not be relied upon in making purchasing decisions.

More Related Content

What's hot

SAP HANA SPS10- Extended Application Services (XS) Programming Model
SAP HANA SPS10- Extended Application Services (XS) Programming ModelSAP HANA SPS10- Extended Application Services (XS) Programming Model
SAP HANA SPS10- Extended Application Services (XS) Programming ModelSAP Technology
 
HANA SPS07 Fuzzy Search
HANA SPS07 Fuzzy SearchHANA SPS07 Fuzzy Search
HANA SPS07 Fuzzy SearchSAP Technology
 
SAP HANA SPS10- SQLScript
SAP HANA SPS10- SQLScriptSAP HANA SPS10- SQLScript
SAP HANA SPS10- SQLScriptSAP Technology
 
SAP HANA SPS09 - HANA IM Services
SAP HANA SPS09 - HANA IM ServicesSAP HANA SPS09 - HANA IM Services
SAP HANA SPS09 - HANA IM ServicesSAP Technology
 
SAP HANA SPS09 - SAP HANA Core & SQL
SAP HANA SPS09 - SAP HANA Core & SQLSAP HANA SPS09 - SAP HANA Core & SQL
SAP HANA SPS09 - SAP HANA Core & SQLSAP Technology
 
What's New for SAP HANA Smart Data Integration & Smart Data Quality
What's New for SAP HANA Smart Data Integration & Smart Data QualityWhat's New for SAP HANA Smart Data Integration & Smart Data Quality
What's New for SAP HANA Smart Data Integration & Smart Data QualitySAP Technology
 
HANA SPS07 Smart Data Access
HANA SPS07 Smart Data AccessHANA SPS07 Smart Data Access
HANA SPS07 Smart Data AccessSAP Technology
 
SAP HANA SPS09 - XS Programming Model
SAP HANA SPS09 - XS Programming ModelSAP HANA SPS09 - XS Programming Model
SAP HANA SPS09 - XS Programming ModelSAP Technology
 
SAP HANA SPS09 - Development Tools
SAP HANA SPS09 - Development ToolsSAP HANA SPS09 - Development Tools
SAP HANA SPS09 - Development ToolsSAP Technology
 
Spark Usage in Enterprise Business Operations
Spark Usage in Enterprise Business OperationsSpark Usage in Enterprise Business Operations
Spark Usage in Enterprise Business OperationsSAP Technology
 
SAP Helps Reduce Silos Between Business and Spatial Data
SAP Helps Reduce Silos Between Business and Spatial DataSAP Helps Reduce Silos Between Business and Spatial Data
SAP Helps Reduce Silos Between Business and Spatial DataSAP Technology
 
SAP HANA SPS10- SAP HANA Dynamic Tiering
SAP HANA SPS10- SAP HANA Dynamic TieringSAP HANA SPS10- SAP HANA Dynamic Tiering
SAP HANA SPS10- SAP HANA Dynamic TieringSAP Technology
 
Building Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANABuilding Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANASAP Technology
 
Enhancing data sources with badi in SAP ABAP
Enhancing data sources with badi in SAP ABAPEnhancing data sources with badi in SAP ABAP
Enhancing data sources with badi in SAP ABAPAabid Khan
 
What's New in SAP HANA View Modeling
What's New in SAP HANA View ModelingWhat's New in SAP HANA View Modeling
What's New in SAP HANA View ModelingSAP Technology
 
SAP HANA SPS09 - SAP HANA Answers
SAP HANA SPS09 - SAP HANA AnswersSAP HANA SPS09 - SAP HANA Answers
SAP HANA SPS09 - SAP HANA AnswersSAP Technology
 
What's Planned for SAP HANA SPS10
What's Planned for SAP HANA SPS10What's Planned for SAP HANA SPS10
What's Planned for SAP HANA SPS10SAP Technology
 
SAP HANA SPS10- Series Data/ TimeSeries
SAP HANA SPS10- Series Data/ TimeSeriesSAP HANA SPS10- Series Data/ TimeSeries
SAP HANA SPS10- Series Data/ TimeSeriesSAP Technology
 

What's hot (20)

SAP HANA SPS10- Extended Application Services (XS) Programming Model
SAP HANA SPS10- Extended Application Services (XS) Programming ModelSAP HANA SPS10- Extended Application Services (XS) Programming Model
SAP HANA SPS10- Extended Application Services (XS) Programming Model
 
HANA SPS07 Fuzzy Search
HANA SPS07 Fuzzy SearchHANA SPS07 Fuzzy Search
HANA SPS07 Fuzzy Search
 
SAP HANA SPS10- SQLScript
SAP HANA SPS10- SQLScriptSAP HANA SPS10- SQLScript
SAP HANA SPS10- SQLScript
 
SAP HANA SPS09 - HANA IM Services
SAP HANA SPS09 - HANA IM ServicesSAP HANA SPS09 - HANA IM Services
SAP HANA SPS09 - HANA IM Services
 
SAP HANA SPS09 - SAP HANA Core & SQL
SAP HANA SPS09 - SAP HANA Core & SQLSAP HANA SPS09 - SAP HANA Core & SQL
SAP HANA SPS09 - SAP HANA Core & SQL
 
What's New for SAP HANA Smart Data Integration & Smart Data Quality
What's New for SAP HANA Smart Data Integration & Smart Data QualityWhat's New for SAP HANA Smart Data Integration & Smart Data Quality
What's New for SAP HANA Smart Data Integration & Smart Data Quality
 
HANA SPS07 Smart Data Access
HANA SPS07 Smart Data AccessHANA SPS07 Smart Data Access
HANA SPS07 Smart Data Access
 
SAP HANA SPS09 - XS Programming Model
SAP HANA SPS09 - XS Programming ModelSAP HANA SPS09 - XS Programming Model
SAP HANA SPS09 - XS Programming Model
 
SAP HANA SPS09 - Development Tools
SAP HANA SPS09 - Development ToolsSAP HANA SPS09 - Development Tools
SAP HANA SPS09 - Development Tools
 
Spark Usage in Enterprise Business Operations
Spark Usage in Enterprise Business OperationsSpark Usage in Enterprise Business Operations
Spark Usage in Enterprise Business Operations
 
SAP Helps Reduce Silos Between Business and Spatial Data
SAP Helps Reduce Silos Between Business and Spatial DataSAP Helps Reduce Silos Between Business and Spatial Data
SAP Helps Reduce Silos Between Business and Spatial Data
 
Hana sql
Hana sql Hana sql
Hana sql
 
SAP HANA SPS10- SAP HANA Dynamic Tiering
SAP HANA SPS10- SAP HANA Dynamic TieringSAP HANA SPS10- SAP HANA Dynamic Tiering
SAP HANA SPS10- SAP HANA Dynamic Tiering
 
Building Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANABuilding Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANA
 
Enhancing data sources with badi in SAP ABAP
Enhancing data sources with badi in SAP ABAPEnhancing data sources with badi in SAP ABAP
Enhancing data sources with badi in SAP ABAP
 
What's New in SAP HANA View Modeling
What's New in SAP HANA View ModelingWhat's New in SAP HANA View Modeling
What's New in SAP HANA View Modeling
 
SAP HANA SPS09 - SAP HANA Answers
SAP HANA SPS09 - SAP HANA AnswersSAP HANA SPS09 - SAP HANA Answers
SAP HANA SPS09 - SAP HANA Answers
 
Why SAP HANA?
Why SAP HANA?Why SAP HANA?
Why SAP HANA?
 
What's Planned for SAP HANA SPS10
What's Planned for SAP HANA SPS10What's Planned for SAP HANA SPS10
What's Planned for SAP HANA SPS10
 
SAP HANA SPS10- Series Data/ TimeSeries
SAP HANA SPS10- Series Data/ TimeSeriesSAP HANA SPS10- Series Data/ TimeSeries
SAP HANA SPS10- Series Data/ TimeSeries
 

Similar to SAP HANA SPS 09 Text Analysis What's New

What's new for Text in SAP HANA SPS 11
What's new for Text in SAP HANA SPS 11What's new for Text in SAP HANA SPS 11
What's new for Text in SAP HANA SPS 11SAP Technology
 
SAP HANA SPS09 - SQLScript
SAP HANA SPS09 - SQLScriptSAP HANA SPS09 - SQLScript
SAP HANA SPS09 - SQLScriptSAP Technology
 
Transforme la operación de tu negocio en tiempo real.
Transforme la operación de tu negocio en tiempo real.Transforme la operación de tu negocio en tiempo real.
Transforme la operación de tu negocio en tiempo real.SAP Latinoamérica
 
B1 intercompany sizing guide
B1 intercompany sizing guideB1 intercompany sizing guide
B1 intercompany sizing guidewalldorf_share
 
Sap bw abap_analyzer_v3_4_ppt
Sap bw abap_analyzer_v3_4_pptSap bw abap_analyzer_v3_4_ppt
Sap bw abap_analyzer_v3_4_ppt원석 최
 
Choosing The Right Tools For The Right Job
Choosing The Right Tools For The Right JobChoosing The Right Tools For The Right Job
Choosing The Right Tools For The Right Jobguest6159b2
 
Unveiling Our All-New Enhancement Request Model and Customer Support Portal
Unveiling Our All-New Enhancement Request Model and Customer Support PortalUnveiling Our All-New Enhancement Request Model and Customer Support Portal
Unveiling Our All-New Enhancement Request Model and Customer Support PortalSAP Ariba
 
SAP #BOBJ #BI 4.1 Upgrade Webcast Series 8: Converting Desktop Intelligence R...
SAP #BOBJ #BI 4.1 Upgrade Webcast Series 8: Converting Desktop Intelligence R...SAP #BOBJ #BI 4.1 Upgrade Webcast Series 8: Converting Desktop Intelligence R...
SAP #BOBJ #BI 4.1 Upgrade Webcast Series 8: Converting Desktop Intelligence R...SAP Analytics
 
Enhancement packages for sap erp what's new in logistics and operations
Enhancement packages for sap erp what's new in logistics and operationsEnhancement packages for sap erp what's new in logistics and operations
Enhancement packages for sap erp what's new in logistics and operationsaminab
 
Building a content portfolio for multi-channel publishing, presentation @TCEu...
Building a content portfolio for multi-channel publishing, presentation @TCEu...Building a content portfolio for multi-channel publishing, presentation @TCEu...
Building a content portfolio for multi-channel publishing, presentation @TCEu...Minna Vänskä
 
Bug Hunting with the Salesforce Developer Console
Bug Hunting with the Salesforce Developer ConsoleBug Hunting with the Salesforce Developer Console
Bug Hunting with the Salesforce Developer ConsoleMatthew Poe
 
Why your SAP PI/PO system should be updated
Why your SAP PI/PO system should be updatedWhy your SAP PI/PO system should be updated
Why your SAP PI/PO system should be updatedDaniel Graversen
 
011000358700001078532011 e
011000358700001078532011 e011000358700001078532011 e
011000358700001078532011 eRavi Ahmed
 
S4F01_EN_Col17 Financial Accounting in SAP S4HANA for SAP ERP FI Professional...
S4F01_EN_Col17 Financial Accounting in SAP S4HANA for SAP ERP FI Professional...S4F01_EN_Col17 Financial Accounting in SAP S4HANA for SAP ERP FI Professional...
S4F01_EN_Col17 Financial Accounting in SAP S4HANA for SAP ERP FI Professional...lakshmi vara
 
SAP Business One - Value of maintenance for partner and customer v2
SAP Business One - Value of maintenance for partner and customer v2SAP Business One - Value of maintenance for partner and customer v2
SAP Business One - Value of maintenance for partner and customer v2Davinder Singh
 
SAP HANA SPS09 - SAP HANA Workload Management
SAP HANA SPS09 - SAP HANA Workload ManagementSAP HANA SPS09 - SAP HANA Workload Management
SAP HANA SPS09 - SAP HANA Workload ManagementSAP Technology
 
SAP HANA Data Center Intelligence Overview
SAP HANA Data Center Intelligence OverviewSAP HANA Data Center Intelligence Overview
SAP HANA Data Center Intelligence OverviewSAP Technology
 

Similar to SAP HANA SPS 09 Text Analysis What's New (20)

AG_18.ppt
AG_18.pptAG_18.ppt
AG_18.ppt
 
Ten Years of DITA: Are We There Yet?
Ten Years of DITA: Are We There Yet?Ten Years of DITA: Are We There Yet?
Ten Years of DITA: Are We There Yet?
 
What's new for Text in SAP HANA SPS 11
What's new for Text in SAP HANA SPS 11What's new for Text in SAP HANA SPS 11
What's new for Text in SAP HANA SPS 11
 
SAP HANA SPS09 - SQLScript
SAP HANA SPS09 - SQLScriptSAP HANA SPS09 - SQLScript
SAP HANA SPS09 - SQLScript
 
Transforme la operación de tu negocio en tiempo real.
Transforme la operación de tu negocio en tiempo real.Transforme la operación de tu negocio en tiempo real.
Transforme la operación de tu negocio en tiempo real.
 
B1 intercompany sizing guide
B1 intercompany sizing guideB1 intercompany sizing guide
B1 intercompany sizing guide
 
Sap bw abap_analyzer_v3_4_ppt
Sap bw abap_analyzer_v3_4_pptSap bw abap_analyzer_v3_4_ppt
Sap bw abap_analyzer_v3_4_ppt
 
Choosing The Right Tools For The Right Job
Choosing The Right Tools For The Right JobChoosing The Right Tools For The Right Job
Choosing The Right Tools For The Right Job
 
Tailoring the DITA Suit to Fit
Tailoring the DITA Suit to FitTailoring the DITA Suit to Fit
Tailoring the DITA Suit to Fit
 
Unveiling Our All-New Enhancement Request Model and Customer Support Portal
Unveiling Our All-New Enhancement Request Model and Customer Support PortalUnveiling Our All-New Enhancement Request Model and Customer Support Portal
Unveiling Our All-New Enhancement Request Model and Customer Support Portal
 
SAP #BOBJ #BI 4.1 Upgrade Webcast Series 8: Converting Desktop Intelligence R...
SAP #BOBJ #BI 4.1 Upgrade Webcast Series 8: Converting Desktop Intelligence R...SAP #BOBJ #BI 4.1 Upgrade Webcast Series 8: Converting Desktop Intelligence R...
SAP #BOBJ #BI 4.1 Upgrade Webcast Series 8: Converting Desktop Intelligence R...
 
Enhancement packages for sap erp what's new in logistics and operations
Enhancement packages for sap erp what's new in logistics and operationsEnhancement packages for sap erp what's new in logistics and operations
Enhancement packages for sap erp what's new in logistics and operations
 
Building a content portfolio for multi-channel publishing, presentation @TCEu...
Building a content portfolio for multi-channel publishing, presentation @TCEu...Building a content portfolio for multi-channel publishing, presentation @TCEu...
Building a content portfolio for multi-channel publishing, presentation @TCEu...
 
Bug Hunting with the Salesforce Developer Console
Bug Hunting with the Salesforce Developer ConsoleBug Hunting with the Salesforce Developer Console
Bug Hunting with the Salesforce Developer Console
 
Why your SAP PI/PO system should be updated
Why your SAP PI/PO system should be updatedWhy your SAP PI/PO system should be updated
Why your SAP PI/PO system should be updated
 
011000358700001078532011 e
011000358700001078532011 e011000358700001078532011 e
011000358700001078532011 e
 
S4F01_EN_Col17 Financial Accounting in SAP S4HANA for SAP ERP FI Professional...
S4F01_EN_Col17 Financial Accounting in SAP S4HANA for SAP ERP FI Professional...S4F01_EN_Col17 Financial Accounting in SAP S4HANA for SAP ERP FI Professional...
S4F01_EN_Col17 Financial Accounting in SAP S4HANA for SAP ERP FI Professional...
 
SAP Business One - Value of maintenance for partner and customer v2
SAP Business One - Value of maintenance for partner and customer v2SAP Business One - Value of maintenance for partner and customer v2
SAP Business One - Value of maintenance for partner and customer v2
 
SAP HANA SPS09 - SAP HANA Workload Management
SAP HANA SPS09 - SAP HANA Workload ManagementSAP HANA SPS09 - SAP HANA Workload Management
SAP HANA SPS09 - SAP HANA Workload Management
 
SAP HANA Data Center Intelligence Overview
SAP HANA Data Center Intelligence OverviewSAP HANA Data Center Intelligence Overview
SAP HANA Data Center Intelligence Overview
 

More from SAP Technology

SAP Integration Suite L1
SAP Integration Suite L1SAP Integration Suite L1
SAP Integration Suite L1SAP Technology
 
Future-Proof Your Business Processes by Automating SAP S/4HANA processes with...
Future-Proof Your Business Processes by Automating SAP S/4HANA processes with...Future-Proof Your Business Processes by Automating SAP S/4HANA processes with...
Future-Proof Your Business Processes by Automating SAP S/4HANA processes with...SAP Technology
 
7 Top Reasons to Automate Processes with SAP Intelligent Robotic Processes Au...
7 Top Reasons to Automate Processes with SAP Intelligent Robotic Processes Au...7 Top Reasons to Automate Processes with SAP Intelligent Robotic Processes Au...
7 Top Reasons to Automate Processes with SAP Intelligent Robotic Processes Au...SAP Technology
 
Extend SAP S/4HANA to deliver real-time intelligent processes
Extend SAP S/4HANA to deliver real-time intelligent processesExtend SAP S/4HANA to deliver real-time intelligent processes
Extend SAP S/4HANA to deliver real-time intelligent processesSAP Technology
 
Process optimization and automation for SAP S/4HANA with SAP’s Business Techn...
Process optimization and automation for SAP S/4HANA with SAP’s Business Techn...Process optimization and automation for SAP S/4HANA with SAP’s Business Techn...
Process optimization and automation for SAP S/4HANA with SAP’s Business Techn...SAP Technology
 
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology PlatformAccelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology PlatformSAP Technology
 
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...SAP Technology
 
Transform your business with intelligent insights and SAP S/4HANA
Transform your business with intelligent insights and SAP S/4HANATransform your business with intelligent insights and SAP S/4HANA
Transform your business with intelligent insights and SAP S/4HANASAP Technology
 
SAP Cloud Platform for SAP S/4HANA: Accelerate your move to an Intelligent En...
SAP Cloud Platform for SAP S/4HANA: Accelerate your move to an Intelligent En...SAP Cloud Platform for SAP S/4HANA: Accelerate your move to an Intelligent En...
SAP Cloud Platform for SAP S/4HANA: Accelerate your move to an Intelligent En...SAP Technology
 
Innovate collaborative applications with SAP Jam Collaboration & SAP Cloud Pl...
Innovate collaborative applications with SAP Jam Collaboration & SAP Cloud Pl...Innovate collaborative applications with SAP Jam Collaboration & SAP Cloud Pl...
Innovate collaborative applications with SAP Jam Collaboration & SAP Cloud Pl...SAP Technology
 
The IoT Imperative for Consumer Products
The IoT Imperative for Consumer ProductsThe IoT Imperative for Consumer Products
The IoT Imperative for Consumer ProductsSAP Technology
 
The IoT Imperative for Discrete Manufacturers - Automotive, Aerospace & Defen...
The IoT Imperative for Discrete Manufacturers - Automotive, Aerospace & Defen...The IoT Imperative for Discrete Manufacturers - Automotive, Aerospace & Defen...
The IoT Imperative for Discrete Manufacturers - Automotive, Aerospace & Defen...SAP Technology
 
IoT is Enabling a New Era of Shareholder Value in Energy and Natural Resource...
IoT is Enabling a New Era of Shareholder Value in Energy and Natural Resource...IoT is Enabling a New Era of Shareholder Value in Energy and Natural Resource...
IoT is Enabling a New Era of Shareholder Value in Energy and Natural Resource...SAP Technology
 
The IoT Imperative in Government and Healthcare
The IoT Imperative in Government and HealthcareThe IoT Imperative in Government and Healthcare
The IoT Imperative in Government and HealthcareSAP Technology
 
SAP S/4HANA Finance and the Digital Core
SAP S/4HANA Finance and the Digital CoreSAP S/4HANA Finance and the Digital Core
SAP S/4HANA Finance and the Digital CoreSAP Technology
 
Five Reasons To Skip SAP Suite on HANA and Go Directly to SAP S/4HANA
Five Reasons To Skip SAP Suite on HANA and Go Directly to SAP S/4HANAFive Reasons To Skip SAP Suite on HANA and Go Directly to SAP S/4HANA
Five Reasons To Skip SAP Suite on HANA and Go Directly to SAP S/4HANASAP Technology
 
Spotlight on Financial Services with Calypso and SAP ASE
Spotlight on Financial Services with Calypso and SAP ASESpotlight on Financial Services with Calypso and SAP ASE
Spotlight on Financial Services with Calypso and SAP ASESAP Technology
 
SAP ASE 16 SP02 Performance Features
SAP ASE 16 SP02 Performance FeaturesSAP ASE 16 SP02 Performance Features
SAP ASE 16 SP02 Performance FeaturesSAP Technology
 
What's New in SAP HANA SPS 11 Operations
What's New in SAP HANA SPS 11 OperationsWhat's New in SAP HANA SPS 11 Operations
What's New in SAP HANA SPS 11 OperationsSAP Technology
 
What's New in SAP HANA SPS 11 Application Lifecycle Management
What's New in SAP HANA SPS 11 Application Lifecycle ManagementWhat's New in SAP HANA SPS 11 Application Lifecycle Management
What's New in SAP HANA SPS 11 Application Lifecycle ManagementSAP Technology
 

More from SAP Technology (20)

SAP Integration Suite L1
SAP Integration Suite L1SAP Integration Suite L1
SAP Integration Suite L1
 
Future-Proof Your Business Processes by Automating SAP S/4HANA processes with...
Future-Proof Your Business Processes by Automating SAP S/4HANA processes with...Future-Proof Your Business Processes by Automating SAP S/4HANA processes with...
Future-Proof Your Business Processes by Automating SAP S/4HANA processes with...
 
7 Top Reasons to Automate Processes with SAP Intelligent Robotic Processes Au...
7 Top Reasons to Automate Processes with SAP Intelligent Robotic Processes Au...7 Top Reasons to Automate Processes with SAP Intelligent Robotic Processes Au...
7 Top Reasons to Automate Processes with SAP Intelligent Robotic Processes Au...
 
Extend SAP S/4HANA to deliver real-time intelligent processes
Extend SAP S/4HANA to deliver real-time intelligent processesExtend SAP S/4HANA to deliver real-time intelligent processes
Extend SAP S/4HANA to deliver real-time intelligent processes
 
Process optimization and automation for SAP S/4HANA with SAP’s Business Techn...
Process optimization and automation for SAP S/4HANA with SAP’s Business Techn...Process optimization and automation for SAP S/4HANA with SAP’s Business Techn...
Process optimization and automation for SAP S/4HANA with SAP’s Business Techn...
 
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology PlatformAccelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
 
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
 
Transform your business with intelligent insights and SAP S/4HANA
Transform your business with intelligent insights and SAP S/4HANATransform your business with intelligent insights and SAP S/4HANA
Transform your business with intelligent insights and SAP S/4HANA
 
SAP Cloud Platform for SAP S/4HANA: Accelerate your move to an Intelligent En...
SAP Cloud Platform for SAP S/4HANA: Accelerate your move to an Intelligent En...SAP Cloud Platform for SAP S/4HANA: Accelerate your move to an Intelligent En...
SAP Cloud Platform for SAP S/4HANA: Accelerate your move to an Intelligent En...
 
Innovate collaborative applications with SAP Jam Collaboration & SAP Cloud Pl...
Innovate collaborative applications with SAP Jam Collaboration & SAP Cloud Pl...Innovate collaborative applications with SAP Jam Collaboration & SAP Cloud Pl...
Innovate collaborative applications with SAP Jam Collaboration & SAP Cloud Pl...
 
The IoT Imperative for Consumer Products
The IoT Imperative for Consumer ProductsThe IoT Imperative for Consumer Products
The IoT Imperative for Consumer Products
 
The IoT Imperative for Discrete Manufacturers - Automotive, Aerospace & Defen...
The IoT Imperative for Discrete Manufacturers - Automotive, Aerospace & Defen...The IoT Imperative for Discrete Manufacturers - Automotive, Aerospace & Defen...
The IoT Imperative for Discrete Manufacturers - Automotive, Aerospace & Defen...
 
IoT is Enabling a New Era of Shareholder Value in Energy and Natural Resource...
IoT is Enabling a New Era of Shareholder Value in Energy and Natural Resource...IoT is Enabling a New Era of Shareholder Value in Energy and Natural Resource...
IoT is Enabling a New Era of Shareholder Value in Energy and Natural Resource...
 
The IoT Imperative in Government and Healthcare
The IoT Imperative in Government and HealthcareThe IoT Imperative in Government and Healthcare
The IoT Imperative in Government and Healthcare
 
SAP S/4HANA Finance and the Digital Core
SAP S/4HANA Finance and the Digital CoreSAP S/4HANA Finance and the Digital Core
SAP S/4HANA Finance and the Digital Core
 
Five Reasons To Skip SAP Suite on HANA and Go Directly to SAP S/4HANA
Five Reasons To Skip SAP Suite on HANA and Go Directly to SAP S/4HANAFive Reasons To Skip SAP Suite on HANA and Go Directly to SAP S/4HANA
Five Reasons To Skip SAP Suite on HANA and Go Directly to SAP S/4HANA
 
Spotlight on Financial Services with Calypso and SAP ASE
Spotlight on Financial Services with Calypso and SAP ASESpotlight on Financial Services with Calypso and SAP ASE
Spotlight on Financial Services with Calypso and SAP ASE
 
SAP ASE 16 SP02 Performance Features
SAP ASE 16 SP02 Performance FeaturesSAP ASE 16 SP02 Performance Features
SAP ASE 16 SP02 Performance Features
 
What's New in SAP HANA SPS 11 Operations
What's New in SAP HANA SPS 11 OperationsWhat's New in SAP HANA SPS 11 Operations
What's New in SAP HANA SPS 11 Operations
 
What's New in SAP HANA SPS 11 Application Lifecycle Management
What's New in SAP HANA SPS 11 Application Lifecycle ManagementWhat's New in SAP HANA SPS 11 Application Lifecycle Management
What's New in SAP HANA SPS 11 Application Lifecycle Management
 

Recently uploaded

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 

Recently uploaded (20)

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 

SAP HANA SPS 09 Text Analysis What's New

  • 1. 1 ©2014 SAP AG or an SAP affiliate company. All rights reserved. SAP HANA SPS 09 – What’s New? Text Analysis SAP HANA Product Management November, 2014 (Delta from SPS 08 to SPS 09)
  • 2. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 2 Public Agenda (1/2) New or Improved Text Analysis Features Custom extraction rules Document filters Voice of Customer TA_PARENT Language identification Word segmentation New or Improved Language Coverage Indonesian support Part-of-Speech & Noun Groups for Hebrew, Thai, Turkish Cyrillic Script for Serbian Core extraction for Traditional Chinese Voice of Customer for Italian, Portuguese, Russian, Traditional Chinese Emoticon & Profanity extraction for Dutch Public Sector fact extraction for English Enterprise fact extraction for English
  • 3. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 3 Public Agenda (2/2) New Text Mining Features Intro Top-ranked functions Characteristics Sample Queries Demo
  • 4. New or Improved Text Analysis Features
  • 5. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 5 Public New Support of Custom Extraction Rules (1/2) Rule patterns enable you to identify more complex entity types than text analysis dictionaries can Dictionaries are ideal for specifying named entities, whereas Custom Grouper User Language (CGUL) rules enable you to identify events, relationships, etc. CGUL rules can leverage linguistic markup, core entities and custom dictionaries Several CGUL rules are included in a rule set which is stored as a file in the SAP HANA repository #group BuyingEvent: { [SN] <>*? [TE PERSON] <>+ [/TE] <>*? <STEM:buy> <>*? [NP] <>+ [/NP] <>*? [TE DATE] <>+ [/TE] <>* [/SN] }
  • 6. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 6 Public New Support of Custom Extraction Rules (2/2) Allow users to create CGUL rule sets and store them in the repository A single custom rule set may support all supported languages or a single language Custom rule sets reside in the SAP HANA repository and benefit from its life cycle management Custom rules sets are interchangeable with Text Data Processing on SAP Data Services Steps 1.Choose the project to contain the new rule set in the Development perspective of SAP HANA Studio. 2.Enter or select a parent folder and enter the rule set file name in the Wizard. The file extension must be .hdbtextrule. Your rule set file is created locally and opens as an empty file in the text editor. 3.Enter your text analysis rule set specification into the new file and save it locally. 4.Commit your new rule set. The rule set is now synchronized to the repository as a design time object and the icon shows the rule set is committed. 5.Activate once you have finished editing your rule set. The rule set is created in the repository as a runtime object and the icon shows the rule set is activated. This allows you and others to use the rule set. If you haven’t done so previously, you will need to create a custom text analysis configuration as well.
  • 7. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 7 Public Improved Document Filters Document filters in the NLP engine automatically detect and extract text content and metadata from almost any type of binary file format from PPT to XLS to PDF, etc. •Additional format support – new and enhanced file types •Performance and viewing fidelity improvements: •Major enhancements to speed, memory usage, and viewing fidelity of MS Excel •Improved consistency of output across all formats •Bug fixes
  • 8. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 8 Public Challenge: Make Existing Sentiment Analysis Easier to Use (1/6) Redesign the Voice of Customer (VOC) language module for domain customization via simple positive/negative dictionaries instead of complex rule modifications This will result in: easier customizability improved runtime performance reliably extended extraction coverage Simplify Voice of Customer module for domain customization by non-experts
  • 9. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 9 Public Improved Voice of Customer (2/6) Sentiment analysis modules for previously available English, French, German, Spanish and Simplified Chinese have been redesigned. New languages adhere to the redesign: Italian, Portuguese, Russian and Traditional Chinese Keyword and profanity dictionaries identify and classify sentiments and are easily customizable, if needed. These dictionaries use the EXTRACTION_CORE_VOICEOFCUSTOMER configuration. Dictionaries are located in the sap.hana.ta.voc package of the SAP HANA repository. Language EXTRACTION_CORE_VOICEOFCUSTOMER Chinese (Simplified) IMPROVED Chinese (Traditional) NEW Dutch NEW (Emoticons & Profanity only) English IMPROVED French IMPROVED German IMPROVED Italian NEW Portuguese NEW Russian NEW Spanish IMPROVED
  • 10. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 10 Public Previous Voice of Customer Rule Behavior (3/6) Rule patterns tried to extract all the following pieces within one pass: Positive or negative keyword Possible topic Possible presence of an intensifier that modifies the positive/negative keyword Possible presence of a negation I am [happy].  Weak Positive I am [not happy].  Weak Negative I am [tremendously happy].  Strong Positive I am [not tremendously happy].  Weak Negative I am [happy with my purchase].  Weak Positive I am [not happy with my purchase].  Weak Negative I am [tremendously happy with my purchase].  Strong Positive I am [not tremendously happy with my purchase].  Weak Negative Rules became very complex: need to account for all combinations of these 4 pieces of information and output the right sentiment value. Not easily customizable: list of positive/negative keywords lived in the CGUL rules. Resulting in very large finite state networks which impacted runtime performance.
  • 11. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 11 Public Rule Behavior with Improved Voice of Customer (4/6) Now there are 2 sets of VOC rules that require a post-processing filtering mechanism (embedded in the NLP engine): 1) One set of rules identifies positive/negative keywords with topics 2) The other set of rules identifies positive/negative keywords along with their modifiers (negation and intensifier) 3) The filtering mechanism sorts through the rule results and outputs one final value for each Keyword/Topic/Modifier combination Two Rule Sets Output I am <not [happy> with my purchase].  “happy” + “my purchase”  WeakNegative  “happy” + “not” I am <tremendously [happy> with my purchase].  “happy” + “my purchase”  StrongPositive  “happy” + “tremendously” I am <not tremendously [happy> with my purchase].  “happy” + “my purchase”  WeakNegative  “happy” + “tremendously” + “not”
  • 12. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 12 Public Benefits of Improved Voice of Customer (5/6) Rules are easily customizable via a dictionary: •Positive/negative keywords are now maintained via a customizable dictionary. •Entries in the dictionary get called by the CGUL rules  No need to go through and modify the CGUL code. •Users can add or remove entries as well as change the positive/negative categorization of entries to better match their vertical domain. Better runtime performance: •Best case example: German module is 71% faster Extension of extraction coverage to subordinate clauses, such as “I don’t think your product is good” which did not reliably extract before because of the many possible intervening tokens between the negation “don’t” and the keyword “good”. •Before: I don’t think [Topic]your product[/Topic] is [WeakPositive]good[/WeakPositive]. •After: I don’t think [Topic]your product[/Topic] is [WeakNegative]good[/WeakNegative].
  • 13. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 13 Public What Differentiates Voice of Customer (6/6) SAP solution … Other offerings … 7 sentiment levels (strong positive, weak positive, neutral, weak negative, strong negative, major problem, minor problem) Usually limited to 3 (positive, neutral, negative) Identification of topic for each sentiment Topics not necessarily linked to sentiments One sentiment per clause Often only one sentiment/document Identification of requests N/a Broad domain coverage – from automotive to electronics Usually limited to a particular domain – e.g., hospitality Wide input coverage (customer reviews, CRM messages, social media, surveys, etc.) Usually limited to social media Consistent across languages Vary widely across languages
  • 14. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 14 Public New TA_PARENT A new column has been added to the $TA output table. The TA_PARENT column stores the TA_COUNTER value of the parent token, or NULL if the token has no parent. This field is used to indicate that there is a semantic relationship between two tokens. For example, it is used by the EXTRACTION_CORE_VOICEOFCUSTOMER rules to relate topics to their enclosing sentiments.
  • 15. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 15 Public Improved Language Identification Text analysis automatically detects the language of the input text in order to apply the appropriate linguistic rules. Language identification for Farsi, Polish, Thai and Turkish has higher precision.
  • 16. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 16 Public Improved Word Segmentation for Chinese More granular Chinese tokenization improves recall in searches.
  • 17. New or Improved Language Coverage
  • 18. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 18 Public Available Text Analysis Configuration Options Language LINGANALYSIS_BASIC LINGANALYSIS_STEMS LINGANALYSIS_FULL EXTRACTION_CORE EXTRACTION_CORE_VOICEOFCUSTOMER EXTRACTION_CORE_PUBLIC_SECTOR EXTRACTION_CORE_ENTERPRISE Arabic    Catalan   Chinese (Simplified)     IMPROVED Chinese (Traditional)    NEW  NEW Croatian   Czech   Danish   Dutch     NEW (Emoticons & Profanity only) English     IMPROVED  NEW  NEW Farsi    French     IMPROVED German     IMPROVED Greek  Hebrew   NEW Hungarian  Indonesian  NEW  NEW Italian     NEW Japanese    Korean    Norwegian (Bokmal)   Norwegian (Nynorsk)   Polish  Portuguese     NEW Romanian  Russian     NEW Serbian   NEW (Cyrillic support) Slovak   Slovenian   Spanish     IMPROVED Swedish   Thai   NEW Turkish   NEW
  • 19. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 19 Public New Indonesian Language Support Bahasa Indonesia is the official language of Indonesia. Indonesia is the 4th most populous nation in the world. The majority speak Indonesian, making it one of the most widely spoken languages in the world. The type of language support: •Language identification – apply the appropriate grammatical rules and dictionaries •Tokenization – decompose word sequences, e.g. “the quick brown fox” > “the” “quick” “brown” “fox” •Stemming – reduce to linguistic base form, e.g. “ran” > “run” •Part-of-Speech – tag word categories, e.g. “houses”: Nn-Pl •Noun Groups – identify concepts, e.g. “global piracy”
  • 20. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 20 Public Improved Language Support for Hebrew, Thai, Turkish Full linguistic analysis support by adding Part-of-Speech (POS) tagging and Noun Group (concepts) extraction for the following languages: •Hebrew •Thai •Turkish
  • 21. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 21 Public New Cyrillic Script Support for Serbian The Cyrillic script is an alphabetic writing system employed across Eastern Europe, North and Central Asian countries. Added Cyrillic script support to the previous Serbian Latin script coverage.
  • 22. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 22 Public New Predefined Core Extraction for Traditional Chinese TITLE President PERSON Barak Obama PEOPLE Greeks LANGUAGE Greek ADDRESS1 245 First Street Floor 16 Cambridge, MA 02142 LOCALITY Cambridge REGION@MAJOR Connecticut COUNTRY Brazil CONTINENT South America GEO_FEATURE Mount Fuji GEO_AREA Scandinavia ORGANIZATION@COMMERCIAL AT&T ORGANIZATION@EDUCATIONAL University of Washington ORGANIZATION@OTHER FBI PRODUCT iPhone SOCIAL_MEDIA@TWITTER_ID @SAP SOCIAL_MEDIA@TWITTER_TOPIC #HANA DATE 2/14/2011 DAY Monday MONTH June YEAR 2011 TIME 3:47pm TIME_PERIOD 3 days, from 9 to 5pm HOLIDAY Memorial Day CURRENCY 17 euros MEASURE 217 meters PERCENT 4% PHONE 617-677-2030 URI@EMAIL john.smith@sap.com URI@IP 165.14.2.0 URI@URL http://sap.com Syntactic Entities: NOUN_GROUP big umbrella PROP_MISC Cup o’ Soup
  • 23. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 23 Public New VOC for Italian, Portuguese, Russian, Traditional Chinese The following major fact types are classified: Sentiments: expression of a customer’s feelings about something Problems: a statement about something which impedes a customer’s work Requests: expression of a customer’s desire for an enhancement/change Profanity: defines a set of pejorative vocabulary Emoticons: expression of someone's feelings about the whole sentence or situation
  • 24. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 24 Public New Profanity & Emoticon Extraction for Dutch The following major fact types are classified: Sentiments: expression of a customer’s feelings about something Problems: a statement about something which impedes a customer’s work Requests: expression of a customer’s desire for an enhancement/change Profanity: defines a set of pejorative vocabulary Emoticons: expression of someone's feelings about the whole sentence or situation Note: Dutch does not support the above ‘faint’ sentiment analysis
  • 25. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 25 Public New Public Sector Extraction for English (1/2) Augments predefined entity types for core extraction with a number of entity, event, and relation types targeting public sector’s needs. The following major fact types are classified: Action: information about action and travel events Military Units: information about teams, wings, and squadrons Organizational Information: information about organizations Person-Alias: information about a person’s possible aliases Person-Appearance: information about a person’s appearance Person-Attributes: information about a person’s non-appearance attributes Person-Relationships: information about a person’s relationships Spatial References: distances, cardinal directions, or locations
  • 26. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 26 Public New Public Sector Extraction for English (2/2) The following 15+ types of named entities specific to public sector are extracted: VEHICLE/AIR, VEHICLE/LAND, VEHICLE/WATER, VEHICLE/VIN, VEHICLE/LICENCE PRECURSOR/CHEMICAL, PRECURSOR/NUCLEAR WEAPON/BIOLOGICAL, WEAPON/CHEMICAL, WEAPON/EXPLODING, WEAPON/NUCLEAR, WEAPON/PROJECTILE, WEAPON/SHOOTING GEOCOORD and MGRS (Military Grid Reference System) The following 40+ types of common entities specific to public sector are extracted: COMMON_COUNTRY (beloved country) COMMON_FACILITY (commercial airport) COMMON_PERSON (math teacher) Etc.
  • 27. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 27 Public New Enterprise Extraction for English Rules for the extraction of entities and facts of particular interest to the enterprise domain. The following major fact types are classified: Membership Information: information about a person’s affiliations Management Changes: information about management changes Product Releases: information about product releases Mergers & Acquisitions: information about mergers and acquisitions Organizational Information: founder, location or contact information
  • 28. New Text Mining Features
  • 29. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 29 Public New Text Mining Text mining works at the document level – making semantic determinations about the overall content of documents relative to other documents. Whereas text analysis does linguistic analysis and extracts information embedded within each document. Functions based on Vector Space Model Identify similar documents Identify key terms of a document Identify related terms Categorize new documents based on a training corpus Scenarios Highlight the key terms when viewing a patent document Identify similar incidents for faster problem solving Categorize new scientific papers along a hierarchy of topics t1 tn d1 d2
  • 30. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 30 Public Text Mining – Top-Ranked Functions Input / Output Terms Documents Terms related terms, suggested terms relevant documents Documents relevant terms related documents, categorize
  • 31. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 31 Public Text Mining Index – Characteristics The text mining index is an optional data structure that is built from the results of linguistic analysis. It is bound to the full-text indexing and text analysis process. Full Text Indexing Full Text Index Text Analysis Results Table Full Text Indexing with TA and TM Text Mining Index TM config. insert ID TITLE
  • 32. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 32 Public New Text Mining – Basic Sample Prerequisite data is stored in a SAP HANA table Goal build a text mining index in order to use the text mining functions CREATE FULLTEXT INDEX myIndexName ON myTable(myColumn) TEXT MINING ON;
  • 33. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 33 Public New Text Mining – Queries SAP HANA provides a server-side JavaScript interface for the Text Mining engine getRelatedTerms getRelatdDocuments and more SAP HANA Indexserver Tables Engine HANA App Preprocessor Linguistic Processing Entity, Fact Extraction Extended Application Services (XS) TM API Views Text Mining
  • 34. Demo Text Mining in SAP HANA SPS09
  • 35. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 35 Public Disclaimer This presentation outlines our general product direction and should not be relied on in making a purchase decision. This presentation is not subject to your license agreement or any other agreement with SAP. SAP has no obligation to pursue any course of business outlined in this presentation or to develop or release any functionality mentioned in this presentation. This presentation and SAP’s strategy and possible future developments are subject to change and may be changed by SAP at any time for any reason without notice. This document is provided without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. SAP assumes no responsibility for errors or omissions in this document, except if such damages were caused by SAP intentionally or grossly negligent.
  • 36. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 36 Public How to find SAP HANA documentation on this topic? SAP HANA Platform SPS 09 What’s New – Release Notes Development –SAP HANA Developer Guide –SAP HANA Text Analysis Developer Guide –SAP HANA Text Mining Developer Guide References –SAP HANA Text Analysis Extraction Customization Guide –SAP HANA Text Analysis Language Reference Guide •In addition to this learning material, you find SAP HANA documentation on SAP Help Portal knowledge center at http://help.sap.com/hana_platform. •The knowledge center is structured according to the product lifecycle: installation, security, administration, development.
  • 37. ©2014 SAP SE or an SAP affiliate company. All rights reserved. Thank you Anthony Waite SAP HANA Product Management AskSAPHANA@sap.com
  • 38. ©2014 SAP SE or an SAP affiliate company. All rights reserved. 38 Public © 2014 SAP SE or an SAP affiliate company. All rights reserved. No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP SE or an SAP affiliate company. SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP SE (or an SAP affiliate company) in Germany and other countries. Please see http://global12.sap.com/corporate-en/legal/copyright/index.epx for additional trademark information and notices. Some software products marketed by SAP SE and its distributors contain proprietary software components of other software vendors. National product specifications may vary. These materials are provided by SAP SE or an SAP affiliate company for informational purposes only, without representation or warranty of any kind, and SAP SE or its affiliated companies shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP SE or SAP affiliate company products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty. In particular, SAP SE or its affiliated companies have no obligation to pursue any course of business outlined in this document or any related presentation, or to develop or release any functionality mentioned therein. This document, or any related presentation, and SAP SE’s or its affiliated companies’ strategy and possible future developments, products, and/or platform directions and functionality are all subject to change and may be changed by SAP SE or its affiliated companies at any time for any reason without notice. The information in this document is not a commitment, promise, or legal obligation to deliver any material, code, or functionality. All forward- looking statements are subject to various risks and uncertainties that could cause actual results to differ materially from expectations. Readers are cautioned not to place undue reliance on these forward-looking statements, which speak only as of their dates, and they should not be relied upon in making purchasing decisions.