Presentations and documents tagged hadoop
-
Is There Room For Another Elephant In Tucson
from lenards, posted 1 day ago in Technology. 16 views
Would you like to scale data-intensive tasks horizontally? Would you like an open source project that gives you that foundation?
Well, there is one: Apache Hadoop. It's a Java software framework for supporting data-intensive distributed appli...
-
Nutch - web-scale search engine toolkit
from abial, posted 4 days ago in Technology, Business & Mgmt. 290 views
This slideset presents the Nutch search engine (http://lucene.apache.org/nutch). A high-level architecture is described, as well as some challenges common in web-crawling and solutions implemented in Nutch. The presentation closes with a br...
-
ZooKeeper Futures
from cloudera, posted 5 days ago in Technology, News & Politics. 378 views
Presented at the post-ApacheCon Hadoop meetup in Oakland on November 5th, 2009.
-
Elastic Web Mining
from kkrugler, posted 1 week ago in Technology. 73 views
PDF version (with notes) of my talk at the ACM Data Mining Unconference on 01 Nov 2009. How to use an open source stack (Hadoop, Cascading, Bixo) in EC2 for cost-effective, scalable, and reliable web mining.
-
Elastic Web Mining
from kkrugler, posted 1 week ago in Technology. 132 views
My talk at the ACM Data Mining Unconference on 01 Nov 2009. How to use an open source stack (Hadoop, Cascading, Bixo) in EC2 for cost-effective, scalable, and reliable web mining.
-
Escalando Aplicaciones Web (Scaling Web Applications)
from santiagocoffey, posted 1 week ago in Technology. 129 views
Presentation at BarCamp Buenos Aires 2009 on scaling web applications using memcached, MapReduce, and Amazon Web Services.
-
"Large-Scale Distributed Systems at Google: Cur...
from yarapavan, posted 1 week ago in Technology. 76 views
From the abstract of his LADIS09 talk:
As part of implementing the many products and services offered by Google, we have built a collection of systems and tools that simplify the storing and processing of large-scale data sets, and the construc...
-
Hands on Hadoop
from ptarjan, posted 1 week ago in Education, Books. 138 views
My intro talk on Hadoop and how to use it with Python streaming.
Code is here: http://github.com/ptarjan/hands-on-hadoop-tutorial/
-
Hadoop, Pig, and Twitter (NoSQL East 2009)
from kevinweil, posted 1 week ago in Technology. 8177 views
A talk on the use of Hadoop and Pig inside Twitter, focusing on the flexibility and simplicity of Pig, and the benefits of that for solving real-world big data problems.
-
20091030nasajpl
from jhammerb, posted 1 week ago in Technology. 46 views
-
Introduction to Hive for Hadoop
from ryanlecompte, posted 2 weeks ago in Technology. 301 views
Provides an introduction to Hive. This was given at the 1st Boston Hadoop User Meetup Group on October 28th, 2009.
-
Hadoop Lecture for Harvard's CS 264 -- October ...
from cloudera, posted 2 weeks ago in Technology. 354 views
-
20091027genentech
from jhammerb, posted 2 weeks ago in Technology. 80 views
Presentation at Genentech on October 27, 2009
-
HW09 Social network analysis with Hadoop
from cloudera, posted 2 weeks ago in Technology. 237 views
-
Get involved with the Apache Software Foundation
from shalinmangar, posted 2 weeks ago in Technology. 188 views
Presented at Indian Institute of Information Technology (IIIT) Allahabad on 21 Oct 2009 to students about the Apache Software Foundation, Lucene, Solr, Hadoop and on the benefits of contributing to open source projects. The target audience ...
-
Hw09 Data Processing In The Enterprise
from cloudera, posted 2 weeks ago in Technology. 239 views
-
Hw09 Hadoop Based Data Mining Platform For Th...
from cloudera, posted 2 weeks ago in Technology. 254 views
-
Hw09 Terapot Email Archiving With Hadoop
from cloudera, posted 2 weeks ago in Technology. 161 views
-
Karmasphere Studio for Hadoop
from hadoopusergroup, posted 2 weeks ago in Technology. 309 views
Shevek talks about Karmasphere Studio for Hadoop
-
Mumak
from hadoopusergroup, posted 2 weeks ago in Technology. 238 views
Hong Tang talks about Using Simulation for Large-scale Distributed System Verification and Debugging