Chicago Data Summit: Flume: An Introduction
 

  • 4,335 views

Flume is an open-source, distributed, streaming log collection system designed for ingesting large quantities of data into large-scale data storage and analytics platforms such as Apache Hadoop. It is built with four goals in mind: reliability, scalability, extensibility, and manageability. Its horizontally scalable architecture offers fault-tolerant, end-to-end delivery guarantees, supports low-latency event processing, provides a centralized management interface, and exposes metrics for ingest monitoring and reporting. It natively supports writing data to Hadoop's HDFS, but it also has a simple extension interface that allows it to write to other scalable data systems such as low-latency datastores or incremental search indexers.
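
To make the idea of a pluggable output interface concrete, here is a minimal sketch in Python. It is not Flume's actual API: the names `Event`, `Sink`, `InMemorySink`, and `deliver` are hypothetical and chosen only for illustration. The sketch assumes that a destination only needs to accept one event at a time, so any storage system can be plugged in by implementing a single method.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Protocol


@dataclass
class Event:
    """A single log record plus free-form metadata (hypothetical shape)."""
    body: bytes
    headers: Dict[str, str] = field(default_factory=dict)


class Sink(Protocol):
    """Hypothetical extension point: anything with append() can receive events."""

    def append(self, event: Event) -> None: ...


class InMemorySink:
    """Stand-in for a real destination such as HDFS or a search indexer."""

    def __init__(self) -> None:
        self.events: List[Event] = []

    def append(self, event: Event) -> None:
        self.events.append(event)


def deliver(events: Iterable[Event], sinks: List[Sink]) -> int:
    """Send every event to every sink and return the number of events read."""
    count = 0
    for event in events:
        for sink in sinks:
            sink.append(event)
        count += 1
    return count
```

In this sketch, adding a new destination means writing one more class with an `append` method; the collection loop in `deliver` does not change. Reliability features such as acknowledgements and retries are deliberately left out.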

Presentation Transcript