An introduction to Apache Falcon

A short introduction to Apache Falcon: what is it and what is it used for?
How can it help with Hadoop-based data life cycle management? What is its
architecture and what are the benefits of using it?


Usage Rights

© All Rights Reserved

An introduction to Apache Falcon: Presentation Transcript

  • Apache Falcon
      ● What is it?
      ● Benefits
      ● Architecture
      ● Example
    www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • Apache Falcon – What is it?
      ● A data life cycle management framework
      ● Created for Hadoop
      ● Logic based in Falcon rather than apps
      ● Simplifies data management
      ● Developed by InMobi and Hortonworks
      ● Falcon can manage
          – Work flows
          – Replication
      ● Provides data abstraction
  • Apache Falcon – What is it?
      ● Falcon provides services
          – Data import / replication
          – Scheduling / coordination
          – Lifecycle policies
          – Cluster management
          – SLA management
      ● An enterprise solution for data lifecycle management
      ● Currently an Apache Incubator project
  • Apache Falcon – Benefits
      ● Reduce workflow / ETL development time
      ● Reduce costs
      ● No need to reimplement functionality
          – Already in Falcon
          – Already tested
      ● Use a single Falcon configuration file to
          – Define replication points
          – Define data processing pipeline
  • Apache Falcon – Architecture
  • Apache Falcon – BI Example
      ● Falcon used to manage work flow
      ● Falcon used to manage cluster data replication
      ● BI example
          – Staged and presented data replicated
          – Presented data visible for reporting and analytics
      ● See next slide
  • Apache Falcon – BI Example
  • Contact Us
      ● Feel free to contact us at
          – www.semtech-solutions.co.nz
          – info@semtech-solutions.co.nz
      ● We offer IT project consultancy
      ● We are happy to hear about your problems
      ● You can pay for just the hours you need to solve your problems
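
The Benefits and BI Example slides describe declaring replication points and a data processing pipeline in configuration rather than in application code. The Python sketch below only illustrates that declarative idea; it is not Falcon's entity format or API. The names `Feed`, `Process`, `replication_plan` and `run_order`, the cluster names `primary` and `backup`, and the `raw` feed are all hypothetical and chosen to mirror the staged/presented BI example.

```python
from dataclasses import dataclass
from graphlib import TopologicalSorter  # standard library, Python 3.9+


@dataclass(frozen=True)
class Feed:
    """Hypothetical dataset declaration: where it lives and where it is copied."""
    name: str
    source: str          # cluster that produces the data
    targets: tuple = ()  # clusters that receive a replica


@dataclass(frozen=True)
class Process:
    """Hypothetical workflow step: reads some feeds, writes others."""
    name: str
    inputs: tuple
    outputs: tuple


def replication_plan(feeds):
    """Return the (feed, source, target) copies implied by the declarations."""
    return [(f.name, f.source, t) for f in feeds for t in f.targets]


def run_order(processes):
    """Order processes so each one runs after the producers of its inputs."""
    producer = {out: p.name for p in processes for out in p.outputs}
    graph = {
        p.name: {producer[i] for i in p.inputs if i in producer}
        for p in processes
    }
    return list(TopologicalSorter(graph).static_order())


# Assumed example data: raw data is staged, then presented for reporting;
# staged and presented data are replicated to a second cluster.
FEEDS = [
    Feed("raw", source="primary"),
    Feed("staged", source="primary", targets=("backup",)),
    Feed("presented", source="primary", targets=("backup",)),
]
PROCESSES = [
    Process("present", inputs=("staged",), outputs=("presented",)),
    Process("stage", inputs=("raw",), outputs=("staged",)),
]

if __name__ == "__main__":
    print(replication_plan(FEEDS))
    print(run_order(PROCESSES))
```

Both the copy list and the execution order are derived from the declarations alone, which is the point the slides make: the pipeline author describes the data sets and steps, and the framework works out what to replicate and when to run each step.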