An Intro to Text Analytics on Big Data with a use case
An introduction to performing text analytics on input from Twitter, using the "Emmys" as an example use case.

Presentation Transcript

  • #TOSMAC Toronto SMAC Meetup – Welcome! An Intro to Text Analytics on Big Data with a use case
  • Toronto SMAC Team: Lucas Silva, Felipe Mosquetta, Marcos de Mello (© 2014 IBM Corporation)
  • Twitter's numbers. As you know: 500 million Tweets are sent per day; Twitter supports 35+ languages; there are 255 million monthly active users. A huge amount of data!
  • Overview: Section1, Section2, Section3, Section4, Section5
  • Let’s get started!
  • Input data
  • Section2
  • Demo
  • Next section
  • Next section. Extractor: used to extract structured information from unstructured and semi-structured data. AQL (Annotation Query Language): a rule language with a familiar SQL-like syntax.
  • Next section. Profiler: used for troubleshooting performance problems.
  • Main concepts. Types of extraction specifications: dictionaries, regular expressions, part of speech.
  • Main concepts. Types of extraction specifications: dictionaries, regular expressions, part of speech. Numbers: 7.5, 4, 13.
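
The dictionary and regular-expression specification types can be illustrated outside AQL. The following is a minimal Python sketch, not AQL and not the presenters' code: the unit dictionary, the number pattern, the helper names, and the sample sentence are all assumptions made for illustration. It pulls out numbers such as 7.5, 4, and 13 with a regular expression and matches unit words against a small dictionary.

```python
import re

# Hypothetical dictionary of unit words (an assumption for illustration).
UNIT_DICTIONARY = {"thousand", "million", "billion"}

# Regular expression for integers and decimals such as 7.5, 4, 13.
NUMBER_PATTERN = re.compile(r"\d+(?:\.\d+)?")


def extract_numbers(text):
    """Return (start, end, matched_text) for every number found in text."""
    return [(m.start(), m.end(), m.group()) for m in NUMBER_PATTERN.finditer(text)]


def extract_dictionary_terms(text, dictionary):
    """Return (start, end, word) for every word whose lowercase form is in the dictionary."""
    spans = []
    for m in re.finditer(r"[A-Za-z]+", text):
        if m.group().lower() in dictionary:
            spans.append((m.start(), m.end(), m.group()))
    return spans


# Hypothetical sample sentence.
sample = "The show cost 7.5 million, 4 awards went to 13 nominees."
numbers = extract_numbers(sample)
units = extract_dictionary_terms(sample, UNIT_DICTIONARY)
```

Each result keeps character offsets alongside the matched text, so later steps can combine or compare annotations by position.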
  • Demo
  • Main concepts. Types of extraction specifications: dictionaries, regular expressions, part of speech.
  • AQL Guidelines. Basic feature AQL statements: develop the core building blocks of the extractor.
  • AQL Guidelines. Candidate generation AQL statements: combine basic feature AQL statements.
  • Candidate generation AQL statements: $7.5 million, $4 thousand, $ 7.5 million
  • Candidate generation AQL statements: $7.5 million, $4 thousand, $ 7.5 million, $7.5 million
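
Candidate generation combines basic features into larger annotations. As a rough Python sketch (again not AQL; the three basic-feature patterns, the function name, and the sample sentence are assumptions for illustration), a currency symbol, a number, and a unit word can be combined into a single monetary-amount candidate:

```python
import re

# Basic features (all patterns are assumptions for illustration).
CURRENCY = r"\$"
NUMBER = r"\d+(?:\.\d+)?"
UNIT = r"(?:thousand|million|billion)"

# Candidate: currency symbol, optional whitespace, number, whitespace, unit word.
AMOUNT_PATTERN = re.compile(rf"{CURRENCY}\s*{NUMBER}\s+{UNIT}", re.IGNORECASE)


def generate_candidates(text):
    """Return (start, end, matched_text) for every monetary-amount candidate."""
    return [(m.start(), m.end(), m.group()) for m in AMOUNT_PATTERN.finditer(text)]


# Hypothetical sample sentence built from the amounts on the slide.
sample = "Ads sold for $7.5 million, $4 thousand and $ 7.5 million."
candidates = generate_candidates(sample)
```

Allowing optional whitespace after the currency symbol is what lets the sketch accept both "$7.5 million" and "$ 7.5 million".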
  • AQL Guidelines. Filter and consolidate AQL statements: refine results, remove invalid annotations, and resolve overlap between annotations.
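
The filter-and-consolidate step can be sketched in Python as follows. This is one possible policy chosen for illustration (drop invalid spans, then keep the longest span among any that overlap); it is an assumption, not necessarily what the presenters' AQL extractor does, and the function name and sample offsets are hypothetical.

```python
def consolidate(spans):
    """Drop invalid spans, then keep the longest span among overlapping ones."""
    # Filter: discard invalid spans (empty or reversed offsets).
    valid = [s for s in spans if s[0] < s[1]]
    # Prefer longer spans; break ties by earlier start offset.
    ordered = sorted(valid, key=lambda s: (-(s[1] - s[0]), s[0]))
    kept = []
    for start, end in ordered:
        # Keep the span only if it does not overlap any span already kept.
        if all(end <= k_start or start >= k_end for k_start, k_end in kept):
            kept.append((start, end))
    return sorted(kept)


# Hypothetical annotations as (start, end) character offsets.
annotations = [(0, 12), (1, 4), (5, 12), (20, 20), (30, 41)]
result = consolidate(annotations)
```

With these offsets, the empty span (20, 20) is filtered out and the two spans nested inside (0, 12) are removed, leaving (0, 12) and (30, 41).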
  • Demo
  • Conclusion
  • Check point
  • What we have done: Section1, Section2, Section3
  • What are we going to do? Section4, Section5
  • Demo
  • Also using R: 1.75, 0.32
  • What are we going to do?
  • Demo
  • So what?
  • Companies
  • Exporting to you
  • Thank you! Let's network!