24HoP 2013 - Where to Start with Big Data
 

Talk given at 24HoP 2013 (24 Hours of PASS - Portuguese) on where to start with Big Data.


Usage Rights

© All Rights Reserved

    24HoP 2013 - Where to Start with Big Data: Presentation Transcript

    • Where to start with Big Data. Diego Nogare / MVP SQL Server, @DiegoNogare / www.DiegoNogare.net
    • HDInsight
    • Creating the cluster
    • HDInsight
    • Demo – Creating the HDInsight cluster
    • The digital shoebox (word cloud): Business Intelligence Data Science Real-Time Hadoop DB Machine Learning Ad-Hoc Dremel Operational Data Mining Query Self-Service Analytics Hive Predictive Batch Map Reduce SQL Exploratory Unstructured Social Insight Pig Interactive Cloud Scale Pivot Visualization Hadoop Big Query Text Analytics Data Warehouse Reporting Drill
    • Hadoop
    • 15% structured data, 85% unstructured data. Source: Gartner presentation "'Radical' Information Management: The Biggest Challenges for 21st-Century CIOs", Mark Beyer, October 2011
    • Hadoop architecture: Distributed Processing (Map Reduce), Distributed Storage (HDFS)
    • Hadoop architecture: Log file aggregation (Flume); Business Intelligence (Excel, Power View, SSAS…); Pipeline / workflow (Oozie); System Center (Future); Data Integration (ODBC / SQOOP / REST); Active Directory (Future); Distributed Storage (HDFS); Distributed Processing (Map Reduce); NoSQL Database (HBase); Machine Learning (Mahout); Stats processing (RHadoop); Graph (Pegasus); Metadata (HCatalog)
    • Moving data into HDInsight
    • Demo – Moving data
    • Changing the way we think with the 3 Vs of Big Data: Volume, Velocity, Variety
    • HIVE
    • Hive architecture: Hive, Hadoop
    • Demo – Consuming data with HIVE
    • MapReduce
    • Purpose
    • And how is it processed?
    • Technical books on Big Data
    • Books on related topics
    • Links and related topics
      Related links
      • HDInsight launch: http://blogs.technet.com/b/microsoft_blog/archive/2013/10/28/announcing-windows-azure-hdinsight-where-big-data-meets-the-cloud.aspx
      • SQL Continues to Crash the BigData Party: http://visualstudiomagazine.com/blogs/data-driver/2013/11/sql-and-big-data.aspx
      • Facebook SQL-on-Hadoop: http://gigaom.com/2013/11/06/facebook-open-sources-its-sql-on-hadoop-engine-and-the-web-rejoices/
      Related sessions
      • PowerBI (Bruno Basto) – 13/11 – 18h GMT (16h Brasília)
      • Self-Service BI (Manuel Dias) – 13/11 – 22h GMT (20h Brasília)
      • Data-Driven (Rui Quintino) – 14/11 – 20h GMT (18h Brasília)
    • Questions?
    • Thank you for participating!
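
The MapReduce slides in the transcript survive only as titles, so the following is a minimal sketch of the general map, shuffle, and reduce pattern, not the demo shown in the talk. It runs in a single Python process rather than on a Hadoop or HDInsight cluster, and the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample sentences are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group the emitted values by key (done by the framework on a real cluster)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in order over an iterable of text lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input, standing in for files stored in HDFS.
    sample = ["big data starts small", "big clusters process big data"]
    print(word_count(sample))
```

On a real cluster the map and reduce functions run in parallel on many nodes and the shuffle moves data between them over the network; only the shape of the computation is the same here.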