×
  • Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
 

Mahout and Scalable Natural Language Processing

by on Jul 10, 2013

  • 10,358 views

Peter Norvig, the Director of Research at Google, said in the Amazon book review[1] for the book “Statistical Natural Language Processing” “If someone told me I had to make a million bucks in ...

Peter Norvig, the Director of Research at Google, said in the Amazon book review[1] for the book “Statistical Natural Language Processing” “If someone told me I had to make a million bucks in one year, and I could only refer to one book to do it, I`d grab a copy of this book and start a web text-processing company.” Despite this vast availability of free-form text, without a scalable way to process this data we have missed an opportunity. Apache Mahout is an open source machine learning library built atop Hadoop. We will cover how to use it to do some interesting, scalable machine learning (classification, clustering, etc.) on free-form text. We will learn about classification using Random Forests, clustering beyond simple K-Means using Latent Dirichlet Allocation. We will investigate the limitations and abilities of this platform together as it relates to natural language. 1. http://www.amazon.com/review/R3GSYXSKRU8V17

Statistics

Views

Total Views
10,358
Views on SlideShare
2,392
Embed Views
7,966

Actions

Likes
4
Downloads
72
Comments
0

9 Embeds 7,966

http://blog.jubat.us 7859
http://2023884025454259159_72fe9922d69be86dcda13bb0adbe886583a5d5ad.blogspot.com 83
http://cloud.feedly.com 16
http://www.feedspot.com 2
https://www.google.co.jp 2
http://2023884025454259159_72fe9922d69be86dcda13bb0adbe886583a5d5ad.blogspot.jp 1
http://translate.googleusercontent.com 1
http://feedly.com 1
http://webcache.googleusercontent.com 1
More...

Accessibility

Categories

Upload Details

Uploaded via SlideShare as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
Post Comment
Edit your comment

Mahout and Scalable Natural Language Processing Mahout and Scalable Natural Language Processing Presentation Transcript