Big Data in the Microsoft Platform
Upcoming SlideShare
Loading in...5
×
 

Big Data in the Microsoft Platform

on

  • 442 views

 

Statistics

Views

Total Views
442
Views on SlideShare
442
Embed Views
0

Actions

Likes
1
Downloads
11
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Big Data in the Microsoft Platform Big Data in the Microsoft Platform Presentation Transcript

  • •••••
  • •••••
  • ETL Tools BI Reporting RDBMSZookeepr (Coordination) Pig (Data Flow) Hive (SQL) Sqoop Avro (Serialization) MapReduce (Job Scheduling/Execution System) HBase (key-value store) (Streaming/Pipes APIs) HDFS (Hadoop Distributed File System) View slide
  • Block Size = 64MBReplication Factor = 3Cost/GB is a few ¢/month vs $/month View slide
  • HDFSDemo
  • •••
  • Map Reduce Demo
  • Hacking with Hive
  •   
  • Rocking Data Processing with Pig
  • Bulk Data Loading UsingSqoop
  • HDInsight Service Overview