Your SlideShare is downloading. ×
Drive Smarter Decisions with Hadoop and Windows Azure HDInsight
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×

Introducing the official SlideShare app

Stunning, full-screen experience for iPhone and Android

Text the download link to your phone

Standard text messaging rates apply

Drive Smarter Decisions with Hadoop and Windows Azure HDInsight

788
views

Published on

Published in: Technology, Business

0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
788
On Slideshare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
0
Comments
0
Likes
1
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide
  • View from Camp Muir looking to Mount Adams, Mount Rainier National Park, Washington 2011, © matt winkler
  • Innovate across the stack
  • Transcript

    • 1. WINDOWS AZURE Matt Winkler Azure Data Platform HDINSIGHT
    • 2. Windows Azure
    • 3. cloud services applicationbuilding blocks
    • 4. Windows AzureHDInsight Service  elastic  simple  secure
    • 5. Built on HDP Core Pig Hive Oozie Sqoop Ambari HCatalog Templeton
    • 6. Demo
    • 7. Provisionin g
    • 8. Provisionin g
    • 9. Leverage Azure Storage Economic Flexibility Scale Geo-Redundancy
    • 10. Secure IsolatedSingle REST Entrypoint
    • 11. Hive, Pig, Mahout, Cascading, Scalding, Scoobi, Pegasus…C#, F# Map/Reduce, LINQ to Hive, .NET management clientsJavaScript Map/Reduce, Browser hosted console, Node.js management clientsPowerShell, Cross Platform CLI tools
    • 12. Price Compute*(~ + Storage(~
    • 13. The Data Platformfor Modern Apps Any Data, Any Size, Anywhere Data Management and Insights at Scale
    • 14. Resources Windows Azure Free trial Getting Started with HDInsight Pricing .NET SDK For Hadoop Halo 4 Case Study
    • 15. start now.
    • 16. Management  UI Tooling  Cluster usage>_  Job authoring  Result consumption in common tools  PowerShell & Cross platform scripting  API Surface  RDFE – Azure provisioning  Ambari – Cluster monitoring  WebHCatalog – Metadata and job submission  WebHDFS, Blob Storage – Storage
    • 17. Existing Ecosystem Actively contributing to:  Core  Pig  Hive  HCatalog Branching to other projects Simple one-box developer install on Windows
    • 18. .NET Map/Reduce LINQ to Hive Client API’s  WebHCat  Ambari  WebHDFS  Azure Visual Studio Tooling  Local debugging support
    • 19. JavaScript MRjs – Map/Reduce in JavaScript Node.js client API’s  WebHCat  WebHDFS  Ambari  Azure
    • 20. Management  UI Tooling  Cluster usage>_  Job authoring  Result consumption in common tools  PowerShell & Cross platform scripting  API Surface  RDFE – Azure provisioning  Ambari – Cluster monitoring  WebHCatalog – Metadata and job submission  WebHDFS, Blob Storage – Storage
    • 21.  Sources  http://hadoopsdk.codeplex.comopen  http://www.github.com/windowsazure  NuGet packages  Microsoft.Hadoop.MapReduce  Microsoft.Hadoop.Hive  Microsoft.Hadoop.WebHDFS => WebClient  NPM packages  Azure  Azure-cli  Hadoop REST clients pending…