How to Build a Real Time Analytics Enterprise with Open Source

573 views

Published on

Given the ease of scaling from zero to millions of users with very little capital investment and availability of managed infrastructure, building startups or enterprise grade products has become easier, faster and cheaper then ever before.

We will go over what it would take to build an enterprise big data real time analytics system, with insights onto architecture, open source and closed source available software, support requirements and alternatives around managed services.

We will explore the use of open source tools (using cloud providers such as AWS, Google and Microsoft) to build these big data applications. From zero to enterprise will be the theme of this meetup, come join us to learn, explore and contribute.

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
573
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
11
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

How to Build a Real Time Analytics Enterprise with Open Source

  1. 1. Building  enterprise  analy0cs  from     Open  Source   Lecole  Cole,  Founder  &  CEO     1  
  2. 2. A little about me. •  •  •  •  •  •  My  name  is  Lecole  Cole   Worked  in  data  analysis  for  +15  years   TwiEer:  @lecole   Email:  lecole@skydera.com   Company  Skydera  Inc.   Projects:  Chartleaf   (2)  
  3. 3. Interes0ng  Packages   •  Real-­‐0me  Stream   –  AWS  Kinesis   –  TwiEer  Storm   –  Hadoop  v2?   •  Analy0cs  package   –  Apache  Mahout   –  R-­‐project   –  Pandas  (Python)   –  pyBrain  (Python)   –  Custom  Python   •  Visualiza0on   –  D3.js   –  R-­‐project   (#)  
  4. 4. Interes0ng  Packages   •  Batch  Processors   –  Hadoop  V1   –  EMR  (AWS)   •   NoSQL:   –  MongoDB   –  DynamoDB  (AWS)   –  BigTable  (Google)   –  Cassandra   (#)  
  5. 5. Example  Stack   •  Compute:   –  Google  Compute   •  Database   –  MySQL   –  BigTable   •  Analy0cs   •  Language:   –  Java  applica0on  for   Hadoop   •  Data  access   –  Apache  Pig   –  Apache  Hive   –  Hadoop       (#)  
  6. 6. Example  Stack   •  Compute:   –  AWS  EC2   •  Database   –  MySQL   –  DynamoDB   •  Analy0cs  Batch   –  EMR     •  Analy0cs  Real-­‐0me:   –  AWS  Kinesis   •  Data  warehouse   –  Redshib   (#)  
  7. 7. Screen  Shots   (#)  
  8. 8. Screen  Shots   (#)  
  9. 9. Screen  Shots   (#)  
  10. 10. Screen  Shots   (#)  
  11. 11. Screen  Shots   (#)  
  12. 12. Screen  Shots   (#)  

×