Flume in 10 Minutes
 


Slides for the video walkthrough at https://www.youtube.com/watch?v=112opbzgBiw

    Presentation Transcript

    • Flume NG Basics
      Copyright © 2012, Oracle and/or its affiliates. All rights reserved.
    • Oracle’s Big Data Approach: 4 Steps to Greater Value
        • Acquire and organize all data
        • Enable greater access to wide data
        • Analyze and refine important data
        • Decide and publish insights
    • How do I get data to my Hadoop cluster?
      Using Flume NG to collect distributed data
    • My log data is not near my Hadoop cluster
      [Diagram: Application Servers (Customer Logs) → ? → Oracle Big Data Appliance]
    • Moving Data with Flume NG
      [Diagram: on the Application Servers, Logs → Flume NG Agent → Avro → Flume NG Agent → HDFS Write on the Oracle Big Data Appliance; three such agent pairs are shown]
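      The agent pairs in this diagram could be wired up roughly as follows. This is a sketch under stated assumptions, not the presenter's configuration: the agent names (app-agent, collector-agent), the component names, the log path, the hostnames, and port 4545 are all hypothetical. Each half would go in the configuration file of its own agent.

      ```properties
      # --- Agent on an application server (hypothetical name: app-agent) ---
      app-agent.sources = tail-logs
      app-agent.channels = mem
      app-agent.sinks = to-collector

      # Follow a log file; the path is a placeholder
      app-agent.sources.tail-logs.type = exec
      app-agent.sources.tail-logs.command = tail -F /var/log/app/app.log
      app-agent.sources.tail-logs.channels = mem

      app-agent.channels.mem.type = memory

      # Avro sink forwards events to the agent near the cluster
      app-agent.sinks.to-collector.type = avro
      app-agent.sinks.to-collector.hostname = collector.example.com
      app-agent.sinks.to-collector.port = 4545
      app-agent.sinks.to-collector.channel = mem

      # --- Agent next to the cluster (hypothetical name: collector-agent) ---
      collector-agent.sources = from-apps
      collector-agent.channels = mem
      collector-agent.sinks = hdfs-write

      # Avro source listens on the port the application-server agents send to
      collector-agent.sources.from-apps.type = avro
      collector-agent.sources.from-apps.bind = 0.0.0.0
      collector-agent.sources.from-apps.port = 4545
      collector-agent.sources.from-apps.channels = mem

      collector-agent.channels.mem.type = memory

      # HDFS sink writes the collected events; the path is a placeholder
      collector-agent.sinks.hdfs-write.type = hdfs
      collector-agent.sinks.hdfs-write.hdfs.path = hdfs://namenode.example.com:8020/flume/logs
      collector-agent.sinks.hdfs-write.channel = mem
      ```

      Note that a source lists its channels with the plural key `channels`, while a sink is bound to exactly one channel with the singular key `channel`.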
    • Building a Basic Flume Agent: One configuration file
        • Flume is flexible
            – Durable transactions
            – In-flight data modification
            – Compresses data
        • Flume is simpler than it used to be
            – No ZooKeeper requirement
            – No master-slave architecture
        • 3 basic pieces
            – Source, Channel, Sink
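      As a rough illustration of the three flexibility points on this slide, the fragment below shows configuration keys that could enable each one. It is a fragment to merge into a full agent file, not a complete configuration. The agent name hdfs-agent comes from the deck's launch command; the component names (netcat-collect, hdfs-write, fileChannel) and the directories are assumptions.

      ```properties
      # Durable transactions: a file channel persists events to disk
      # (directories are placeholders)
      hdfs-agent.channels = fileChannel
      hdfs-agent.channels.fileChannel.type = file
      hdfs-agent.channels.fileChannel.checkpointDir = /var/flume/checkpoint
      hdfs-agent.channels.fileChannel.dataDirs = /var/flume/data

      # In-flight data modification: a timestamp interceptor adds a
      # timestamp header to every event passing through the source
      hdfs-agent.sources.netcat-collect.interceptors = ts
      hdfs-agent.sources.netcat-collect.interceptors.ts.type = timestamp

      # Compression: the HDFS sink writes gzip-compressed streams
      hdfs-agent.sinks.hdfs-write.hdfs.fileType = CompressedStream
      hdfs-agent.sinks.hdfs-write.hdfs.codeC = gzip
      ```

      A file channel trades some throughput for the ability to survive an agent restart, whereas a memory channel loses whatever events it holds when the process stops.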
    • Flume Configuration
        flume-ng agent -f this_file -n hdfs-agent

        …ollect
        …ehannel
        …llect.type = netcat
        …llect.bind = 127.0.0.1
        …llect.port = 11111
        …type = hdfs
        …hdfs.path = hdfs://localhost:8020/user/oracle/sabre_example
        …rollInterval = 30
        …hdfs.writeFormat=Text
        …hdfs.fileType=DataStream
        …annel.type = memory
        …annel.capacity=10000
        …llect.channels=memoryChannel
        …channel=memoryChannel
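      The left edge of the configuration on this slide did not survive in the transcript, so the full property keys cannot be read from it. A complete file consistent with the visible values might look like the sketch below. The agent name hdfs-agent (from the launch command), the channel name memoryChannel, and every value are taken from the slide; the source name netcat-collect and the sink name hdfs-write are assumptions chosen to fit the visible fragments.

      ```properties
      # Declare the three pieces of the agent: source, sink, channel
      hdfs-agent.sources = netcat-collect
      hdfs-agent.sinks = hdfs-write
      hdfs-agent.channels = memoryChannel

      # Source: listen for newline-terminated text on a local TCP port
      hdfs-agent.sources.netcat-collect.type = netcat
      hdfs-agent.sources.netcat-collect.bind = 127.0.0.1
      hdfs-agent.sources.netcat-collect.port = 11111

      # Sink: write events to HDFS as plain text, rolling files every 30 seconds
      hdfs-agent.sinks.hdfs-write.type = hdfs
      hdfs-agent.sinks.hdfs-write.hdfs.path = hdfs://localhost:8020/user/oracle/sabre_example
      hdfs-agent.sinks.hdfs-write.hdfs.rollInterval = 30
      hdfs-agent.sinks.hdfs-write.hdfs.writeFormat = Text
      hdfs-agent.sinks.hdfs-write.hdfs.fileType = DataStream

      # Channel: buffer up to 10000 events in memory
      hdfs-agent.channels.memoryChannel.type = memory
      hdfs-agent.channels.memoryChannel.capacity = 10000

      # Wire the source and the sink to the channel
      hdfs-agent.sources.netcat-collect.channels = memoryChannel
      hdfs-agent.sinks.hdfs-write.channel = memoryChannel
      ```

      Saved to a file, this would be started with the command shown on the slide, passing the file to -f and the agent name hdfs-agent to -n. The name given to -n has to match the prefix of the property keys, otherwise the agent finds no components to start.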
    • Sending Data to the Agent
        • Connect netcat to the host
        • Pipe input to it
        • Records are transmitted on newline
        • head example.xml | nc localhost 11111
    • Alternatives to Flume and Their Trade-Offs
        • Scribe
            – Thrift-based
            – Lightweight, but no support
            – Not designed around Hadoop
        • Kafka
            – Designed to resemble a publish-subscribe system
            – Explicitly distributed
            – Apache Incubator project