WINDOWS AZURE           Matt Winkler                 Azure Data Platform     HDINSIGHT
Windows Azure
cloud services    applicationbuilding blocks
Windows AzureHDInsight Service  elastic  simple  secure
Built on HDP      Core       Pig      Hive     Oozie     Sqoop    Ambari   HCatalog   Templeton
Demo
Provisionin    g
Provisionin    g
Leverage Azure Storage    Economic Flexibility          Scale     Geo-Redundancy
Secure        IsolatedSingle REST Entrypoint
Hive, Pig, Mahout, Cascading, Scalding, Scoobi, Pegasus…C#, F# Map/Reduce, LINQ to Hive, .NET management clientsJavaScript...
Price     Compute*(~        +     Storage(~
The Data Platformfor Modern Apps Any Data, Any Size, Anywhere Data Management and Insights at Scale
Resources Windows Azure Free trial Getting Started with HDInsight Pricing .NET SDK For Hadoop Halo 4 Case Study
start now.
Management      UI Tooling       Cluster usage>_     Job authoring       Result consumption in common tools      Powe...
Existing Ecosystem Actively contributing to:  Core  Pig  Hive  HCatalog Branching to other projects Simple one-box ...
.NET Map/Reduce LINQ to Hive Client API’s  WebHCat  Ambari  WebHDFS  Azure Visual Studio Tooling    Local debuggi...
JavaScript MRjs – Map/Reduce in JavaScript Node.js client API’s  WebHCat  WebHDFS  Ambari  Azure
Management      UI Tooling       Cluster usage>_     Job authoring       Result consumption in common tools      Powe...
 Sources         http://hadoopsdk.codeplex.comopen     http://www.github.com/windowsazure        NuGet packages       ...
Drive Smarter Decisions with Hadoop and Windows Azure HDInsight
Drive Smarter Decisions with Hadoop and Windows Azure HDInsight
Upcoming SlideShare
Loading in …5
×

Drive Smarter Decisions with Hadoop and Windows Azure HDInsight

1,349 views

Published on

Published in: Technology, Business
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,349
On SlideShare
0
From Embeds
0
Number of Embeds
37
Actions
Shares
0
Downloads
0
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide
  • View from Camp Muir looking to Mount Adams, Mount Rainier National Park, Washington 2011, © matt winkler
  • Innovate across the stack
  • Drive Smarter Decisions with Hadoop and Windows Azure HDInsight

    1. 1. WINDOWS AZURE Matt Winkler Azure Data Platform HDINSIGHT
    2. 2. Windows Azure
    3. 3. cloud services applicationbuilding blocks
    4. 4. Windows AzureHDInsight Service  elastic  simple  secure
    5. 5. Built on HDP Core Pig Hive Oozie Sqoop Ambari HCatalog Templeton
    6. 6. Demo
    7. 7. Provisionin g
    8. 8. Provisionin g
    9. 9. Leverage Azure Storage Economic Flexibility Scale Geo-Redundancy
    10. 10. Secure IsolatedSingle REST Entrypoint
    11. 11. Hive, Pig, Mahout, Cascading, Scalding, Scoobi, Pegasus…C#, F# Map/Reduce, LINQ to Hive, .NET management clientsJavaScript Map/Reduce, Browser hosted console, Node.js management clientsPowerShell, Cross Platform CLI tools
    12. 12. Price Compute*(~ + Storage(~
    13. 13. The Data Platformfor Modern Apps Any Data, Any Size, Anywhere Data Management and Insights at Scale
    14. 14. Resources Windows Azure Free trial Getting Started with HDInsight Pricing .NET SDK For Hadoop Halo 4 Case Study
    15. 15. start now.
    16. 16. Management  UI Tooling  Cluster usage>_  Job authoring  Result consumption in common tools  PowerShell & Cross platform scripting  API Surface  RDFE – Azure provisioning  Ambari – Cluster monitoring  WebHCatalog – Metadata and job submission  WebHDFS, Blob Storage – Storage
    17. 17. Existing Ecosystem Actively contributing to:  Core  Pig  Hive  HCatalog Branching to other projects Simple one-box developer install on Windows
    18. 18. .NET Map/Reduce LINQ to Hive Client API’s  WebHCat  Ambari  WebHDFS  Azure Visual Studio Tooling  Local debugging support
    19. 19. JavaScript MRjs – Map/Reduce in JavaScript Node.js client API’s  WebHCat  WebHDFS  Ambari  Azure
    20. 20. Management  UI Tooling  Cluster usage>_  Job authoring  Result consumption in common tools  PowerShell & Cross platform scripting  API Surface  RDFE – Azure provisioning  Ambari – Cluster monitoring  WebHCatalog – Metadata and job submission  WebHDFS, Blob Storage – Storage
    21. 21.  Sources  http://hadoopsdk.codeplex.comopen  http://www.github.com/windowsazure  NuGet packages  Microsoft.Hadoop.MapReduce  Microsoft.Hadoop.Hive  Microsoft.Hadoop.WebHDFS => WebClient  NPM packages  Azure  Azure-cli  Hadoop REST clients pending…

    ×