Your SlideShare is downloading. ×
0
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

2011 - TDWI Big Data Forum - The New Analytics

707

Published on

Presentation by Casey Kiernan at the 2011 TDWI Big Data Forum - Orlando, FL - Oct 30 / Nov 1 - 2011

Presentation by Casey Kiernan at the 2011 TDWI Big Data Forum - Orlando, FL - Oct 30 / Nov 1 - 2011

Published in: Technology, Business
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
707
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
26
Comments
0
Likes
1
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Hadoop&The New Analytics Casey KiernanSr. Director / Data Architecture - Shopzilla.com November 1, 2011
  • 2. AgendaThe New “Data”The New Business ModelThe New Analytic ScenariosThe New Analytic ArchitecturesThe New Analytic TechnologiesAnd, Yes… The New Data-Center 2
  • 3. SERVICES • SOA • JSON • AVRO • APPLICATIONS • HTML • JAVA • C# • THE CLOUD • HADOOP • OLAP PYTHON • SITES • TENANT • ORG • SSNFINANCE • ID • CUSTOMER • EXPERIAN SQL SERVER • ORACLE • UNIX • SUBVERSION • COMPLIANCE • SECURITY • SALESFORCE • MYSQL The World as I See it
  • 4. My Mountain Bike as a Data Platform Data Collection Heart Rate Data Collection Altitude Data Collection Temperature Speed / Trip Miles Time Guidance Performance Rate of Climb Calories Burned Miles Obtained Total Climbed Elapsed Time Current, Average, Max Values Data Collection Cadence / RPM Data Architecture - on a Local Wireless Network (ANT+ Protocol)
  • 5. “Business” Analytics BUSINESS INTELLIGENCE. DATA WAREHOUSE/OLAP. OLTP DATA. What are our most profitable Movie titles?
  • 6. “Business” Analytics What did Happen? What will Happen? Operational Reporting Tactical Analytics Strategic Months WeeksWeeks Months Years 6
  • 7. “Personal” Analytics SELF-SERVICE. GUIDANCE. BEHAVIOURS. What Movie should I watch tonight?
  • 8. “Personal” Analytics What is Happening NOW? What did Happen? What will Happen? Historical Behaviors Tactical Analytics Strategic Months WeeksWeeks Months Years 8
  • 9. MeaningfulGUIDANCE Massive DATA COLLECTION 9
  • 10. 10
  • 11. 11
  • 12. “Business” Analytics OLTP App Staging Data OLAP / Business Orders App Warehouse Analyst Reports OLTP to OLAP Mapping FIN App What are our most profitable Movie titles? 12
  • 13. “Personal” Analytics End User Application Data Analytics What Movie should I watch tonight? 13
  • 14. End-User Experience Browser, Tablet, Self-Service Application Mobile,… Personalization, Personalized Preferences, State Recommendations App Persistence AnalyticsPersistence/Analytics “State” Persistence “Read” Performance Big Data Behaviors / “Write” Performance “Personal Analytics” Data Architecture 14
  • 15. RDBMSHighly Structured EnvironmentFormalized intake processACID Transactional SemanticsTarget Scenario – OLTP R/WHigh Level Query Syntax - SQLHadoopLate-Binding StructuresNon-Formal Intake (“Copy”)Minimal Transaction SemanticsTarget Scenario - WritersProcedural Query Syntax - MapReduce 15
  • 16. The New Technology StackSpecialization / Individual Scalability / Late-Binding - for each componentTechnology Data Warehousing New AnalyticsAnalytics OLAP OLAP + Open-SourceData Movement ETL Tool MapReduceSQL RDBMS HiveSchema Metadata RDBMS JSON / AVROIndexing (Readers) RDBMS HBaseRI RDBMS Application LogicApp Store (Objects) RDBMS Key/Value - Cassandra,…Schema / Columns RDBMS Column Families / DynamicLogs (Writers) RDBMS Scalable - HadoopInfrastructure Data-Center Cloud 16
  • 17. End-User Experience Browser, Tablet, Self-Service Application Mobile,… Personalization, Personalized Preferences, State Recommendations App Persistence AnalyticsPersistence/Analytics Cassandra (JSON) Hbase (Column-Families) Data-Center or Cloud MapReduce Big Data Hadoop (AVRO) SQL Hive Specialization of Data Technologies 17
  • 18. Personal Analytics + Business Intelligence App Staging Data OLAP / Business OLTP App Warehouse Analyst Reports OLTP to OLAP Mapping OLTP App 18
  • 19. Contact InformationIf you have further questions or comments: Casey Kiernan Sr. Director / Data Architecture Shopzilla.com casey.kiernan@hotmail.com BLOG: www.the-data-platform.com 19
  • 20. A recent ride in AZ

×