The Briefing Room with Robin Bloor and Cirro
Live Webcast on Dec. 11, 2012
As the information landscape expands with all kinds of Big Data, businesses are searching for ways to unite their traditional analytics with this new source of insight. One ambitious approach involves federating access to multiple data sources, even across various operating systems. The idea is to take analytic processing to the data, then intelligently assemble the results for a business user. Could this be the long-awaited alternative to data virtualization?
Check out this episode of The Briefing Room to hear veteran Analyst Robin Bloor explain how federated access to data sources can pave the way for a truly integrated data fabric. Bloor will be briefed by Mark Theissen of Cirro, who will tout his company's patent-pending Data Hub, which simplifies data access by federating queries across multiple sources of structured, semi-structured, and unstructured data. He'll discuss Cirro's cost based optimizer, smart caching, dynamic query plan re-optimization, normalization of cost estimates and a metadata repository for unstructured data sources.
Visit: http://www.insideanalysis.com
2. Welcome
Host:
Eric Kavanagh
eric.kavanagh@bloorgroup.com
Twitter Tag: #briefr The Briefing Room
3. Mission
! Reveal the essential characteristics of enterprise software,
good and bad
! Provide a forum for detailed analysis of today s innovative
technologies
! Give vendors a chance to explain their product to savvy
analysts
! Allow audience members to pose serious questions... and get
answers!
Twitter Tag: #briefr The Briefing Room
4. December: Innovators
January: Big Data
February: Analytics
March: Data in Motion
Twitter Tag: #briefr The Briefing Room
5. Innovators
! Charles Babbage conceived the Analytical Engine in 1834.
! Automation and ease of use have driven innovation in
computing ever since.
! The Cloud and Big Data are raising the bar.
Twitter Tag: #briefr The Briefing Room
6. Analyst: Robin Bloor
Robin Bloor is
Chief Analyst at
The Bloor Group
robin.bloor@bloorgroup.com
Twitter Tag: #briefr The Briefing Room
7. Cirro
! Cirro provides a single method to access any type of data,
on any platform, in any environment.
! Its product suite consists of Cirro Data Hub, Analyst for
Excel and Multi Store – all designed to remove complexity
from Big Data analytics.
! Cirro’s products are cloud based and can run in public,
private and on-premise environments.
Twitter Tag: #briefr The Briefing Room
8. Mark Theissen
Mark is CEO at Cirro. He is a respected analytics and data
warehousing expert with more than 22 years in the industry.
Most recently Mark was the worldwide data warehousing
technical lead at Microsoft following the acquisition of
DATAllegro. At DATAllegro Mark was the COO and a member
of the board of directors. Prior to joining DATAllegro, Mark
was Vice President and Research Lead at META Group
(Gartner Group) for Enterprise Analytics Strategies, covering data warehousing,
business intelligence and data integration markets. Before META, Mark was VP
of Professional Services at Accruent where he was responsible for domestic and
overseas services and operations. Mark has a BS in Computer Information
Systems from Chapman University and a MBA from the University of California,
Irvine.
Twitter Tag: #briefr The Briefing Room
28. Hadoop & The Big Data Dynamic
Hadoop has become the de facto reservoir for data
The Bloor Group
29. Hadoop & The Big Data Dynamic
– We witnessed something like this a long time
ago, with ISAM files - before the advent of
RDBMS
– The difference this time is that Hadoop has an
ecosystem and it is growing
– Big Data (usually caught first by Hadoop) is
mostly new data and mostly event data
– Hadoop is not (yet) a performance engine. It is
an all-purpose capability
– It is delivering business benefits in a big way: it
is hot….
The Bloor Group
30. BI Categories
HINDSIGHT Regular reporting/operational BI, Excel
OVERSIGHT Dashboards, OLAP, BPM, Excel
Data mining, statistical analysis
INSIGHT (trends and relationships)
FORESIGHT Predictive analytics
The Bloor Group
32. Data Sources
Graph
DBMS,
XML
Standard DBMS, NoSQL
SQL Flat files
Hadoop
and Metadata
Hadoop Hub?
++
The Bloor Group
33. Problems Of The Data Layer
Hadoop is capable of ETL and often
Hadoop is multi-role and hence
used for ETL, but that usually
can spawn multiple instances
involves coding of a kind
BI tools, which had good-enough
The data layer is more
interfaces to RDBMS, don’t link to
complicated than it was and its
Hadoop directly, and probably
complexity is increasing
shouldn’t
Point to point connectivity usually
A connectivity architecture is
was, is and may always be a bad
needed
idea
IT REQUIRES SIMPLE CONNECTORS
The Bloor Group
34. ! How would one use the Cirro Multi Store?
! Which companies/products do you regard as
competitors (either directly or close competitors)?
! How does a Cirro implementation proceed, i.e.,
where do you start, what are the medium term
goals, what do you replace?
! Conceptually a hub for the data layer is attractive.
But how well does it scale out?
The Bloor Group
35. ! Can the hub be physically distributed, i.e., one
logical instance with multiple physical instances?
! How does your proprietary MapReduce differ from
Hadoop MapReduce?
! Is there any aspect of BI that you don’t or can’t
cater for (CEP, Data governance, MDM, etc.)?
The Bloor Group
37. Upcoming Topics
January: Big Data
February: Analytics
March: Data in Motion
2013 Editorial Calendar
www.insideanalysis.com
Twitter Tag: #briefr The Briefing Room
38. Thank You
for Your
Attention
Twitter Tag: #briefr The Briefing Room