Presentation: BI/Big Data Futures - Is it really all about the Cloud?In this survey session, Lynn will bring you up-to-date on what's happening in the world of enterprise Business Intelligence. BigData, NoSQL, Hadoop, Big Analytics, Cloud Storage, what does all of this mean to you as a data professional? Which products and technologies are mature enough for enterprise adoption and which ones are not? Which vendors should you be trying out and why? What is the reality of hosting enterprise data on the cloud? What are the business reasons to explore these new technologies? How do you learn to implement them?Lynn frames this talk with the three major trends that she sees in the Enterprise BI space, highlighting products and technologies that warrant a deeper look.
http://hadoop.apache.org/ & http://www.mongodb.org/
Part of Microsoft's Hadoop efforts include an ODBC driver for Hive, the Hadoop query engine, which will provide “direct realtime querying from business intelligence tools into Hadoop,” Leland said. The Azure Hadoop service and the Hive ODBC driver will be available in preview before the end of 2011, with a community technology preview for Hadoop on Windows Server arriving next year.
From the blog - http://www.thisisthegreenroom.com/2011/data-science-vs-business-intelligence/
From Business Intelligence to BigData
BI/Big Data Futures – the Cloud? “psst…it’s about Data Mining” Lynn Langit Practioner, Author, Instructor Jan 2012- for SoCalCodeCamp
BI = ‘Current State’ Questions • What did we sell? Collecting • When did we sell it?Transactional data • Where did we sell it? • What did we sell with it?
Current State • I define my OLAP Let’s all • Maybe it’s a read-only copy of my OLTP –OR- OLAP • Maybe it’s a cube • Maybe it uses some data mining too • I’ll keep it on premisesIt’s my data • I’ll secure it, tune queries, back it up, etc. Data Mining • Too difficult, expensive, proprietary…. really?
Current State QuestionsWhy did this happen?When did this happen?Where did this happen?Who is responsible?What might happen to this one value in the future?Can you write me a report for…?
BI Data Landscape Storage Processing Query Presentation
Mix-in #1 -- the Cloud and…• Host Data in the Cloud• Process & Query Data in the Cloud – Click to query and (data) mine – Return the data locally – Use Self-service BI visualizers• Mash-up Cloud data – Combine with local data
NoSQL and Cloud-based BI• The Elephant in the room…Hadoop• Over 120+ types of noSQL databases – http://nosql-database.org/
Comparing RDBMS and MapReduce Reference: Tom White’s Hadoop: The Definitive Guide Traditional RDBMS MapReduceData Size Gigabytes (Terabytes) Petabytes (Hexabytes)Access Interactive and Batch BatchUpdates Read / Write many times Write once, Read many timesStructure Static Schema Dynamic SchemaIntegrity High (ACID) LowScaling Nonlinear LinearDBA Ratio 1:40 1:3000
BI vNext for MicrosoftSQL Server 2012 -New BI tools andsemantic model Connectors for• Data Quality Services Hadoop• Master Data Services • SQL Server• Semantic Search • Excel• PowerView • Power Pivot SQL Azure vNext - Full BI IN the Federations, featu cloud (SSAS, SSIS, res, max. size SSRS) increase • Data Explorer – ETL for all
BI >BigData ‘To Do ListStore some (more) data on the cloud• Relational and non-relationalProcess some data in the cloud• Try data mining• Learn about Data ScienceUpdate your client tools• New UI (touch, gestures)• Click to Query• New form factors (phone, tablet)