Wednesday, February 22, 12
Eric Kavanagh                             Eric.kavanagh@bloorgroup.com                                             Twitter...
To conduct an Open Research program that                       invites the participation of both IT users and             ...
Your Host: Eric Kavanagh                   Research Leader: Mark Madsen - Third Nature                  Primary Collaborat...
Rajeev Rawat is the founder and CEO of BI             Results. His career has involved leading large             cross-fun...
The Bloor Group       Fit for Purpose: The New Database Revolution       The Bloor Group – February 22, 2012         Five ...
The Bloor Group       The Next Five Years       The Most Exciting Times In Information Acrobatics         New Venture Fund...
The Bloor Group       RDBMS Still Dominates       Reliable Heavy Lifting                                                  ...
The Bloor Group       NoSQL       Being Tested, Validated, Calibrated                                                     ...
The Bloor Group       NoSQL, RDBMS Innovation       Fantastic Opportunity for Growth         Gaps You Can Help Close      ...
Disection &                             Discussion                                           Twitter Tag: #briefrWednesday...
Robin Bloor is Chief                               Analyst at The                                Bloor Group.             ...
Wednesday, February 22, 12
RDBMSWednesday, February 22, 12
The SQL Barrier           SQL has:            DDL (for data definition)                                     SQL            ...
That MapReduce Thing           There are two fundamental           approaches to parallelism             Data Partitioning...
Malcolm Chisholm has 25+ years experience in               data management working in finance, insurance,               ma...
Disection &                             Discussion                                           Twitter Tag: #briefrWednesday...
The New Database Revolution:                        Relational Roundtable                                          The Vir...
“Big Data” Is Used Differently                             Relational Paradigm                     ULS Dataspace in Cloud ...
Sources          Source A                                          Emails          Source B                     Documents ...
Segments in Dataspace                                                         ULS Dataspace in Cloud     Source A         ...
No Common Notation for Columnar Designs                                                                            ?      ...
Need a Data Dictionary    • The ULS dataspace can grow quickly and have many data objects    • Without a DD developers and...
Disection &                             Discussion                                           Twitter Tag: #briefrWednesday...
Mark Madsen is founder of Third Nature, a                research and consulting firm focused on                analytics,...
One Size Doesn’t Fit All                             February 22, 2012                             Mark R. Madsen         ...
The	  future	  of	  data	  is	  the	  databaseWednesday, February 22, 12
You keep using that word.                             I do not think it means                             what you think i...
Good	  conceptual	  model,	  but	  a	  prematurely	                               standardized	  implementa5onThe	  rela*o...
Plus,	  if	  they’re	  all	  the	  same	  why	  are	  there	  so	  many?       Sybase	  IQ,	  ASE                  Enterpr...
The	  future	  of	  data	  is	  the	  rela0onal	  database?                             SQL                       noSQLWed...
The	  future	  of	  data	  is	  the	  rela0onal	  database?                             SQL                       noSQLWed...
Technologies	  are	  not	                               perfect	  replacements	  for	                               one	  ...
Disection &                             Discussion                                           Twitter Tag: #briefrWednesday...
Wednesday, February 22, 12
March:                             Vendor Research                             March 14th: Second Round Table focusing on ...
March 14th: Second DB               Revolution Round Table               March Briefing Room:               Integration   ...
Thank You                              For Your                             AttentionWednesday, February 22, 12
Upcoming SlideShare
Loading in …5
×

Roundtable 1: Relational and Analytic Database Innovations

1,107 views

Published on

Slides from the Live Webcast on Feb. 22, 2012

Watch this Roundtable Webcast to learn about what’s happening in the relational and specialized “analytics” database market. The discussion will include four veteran analysts: Robin Bloor of The Bloor Group, Mark Madsen of Third Nature, Malcolm Chisholm of AskGet, and Rajeev Rawat of BI Results.

For more information visit: http://www.databaserevolution.com

Watch this and the entire series at : http://www.youtube.com/playlist?list=PLE1A2D56295866394

Published in: Technology, Business
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
1,107
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
17
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Roundtable 1: Relational and Analytic Database Innovations

  1. 1. Wednesday, February 22, 12
  2. 2. Eric Kavanagh Eric.kavanagh@bloorgroup.com Twitter Tag: #briefrWednesday, February 22, 12
  3. 3. To conduct an Open Research program that invites the participation of both IT users and technology vendors To assist IT buyers in understanding database technology and the architecture that surrounds it. Allow audience members to pose serious questions... and get answers! Publish all findings Twitter Tag: #briefrWednesday, February 22, 12
  4. 4. Your Host: Eric Kavanagh Research Leader: Mark Madsen - Third Nature Primary Collaborator: Robin Bloor - The Bloor Group Guest Analyst 1: Rajiv Rawat - BI Results Guest Analyst 2: Malcolm Chisholm - ConsultantWednesday, February 22, 12
  5. 5. Rajeev Rawat is the founder and CEO of BI Results. His career has involved leading large cross-functional teams at both IBM and Xerox, where he was involved in direct customer facing roles as well as taking part in headquarters assignments. His headquarters positions with worldwide responsibility included strategic assignments for alliances and relationships with technology partners, product management and product marketing. Other responsibilities include restructuring business models, test of new technology platforms, and sales coverage plans. Rajeev led the introduction of new technologies and solutions for Xerox and IBM. www.biresults.com, biresult@gmail.com LinkedIn: Rajeev Rawat Twitter Tag: #briefWednesday, February 22, 12
  6. 6. The Bloor Group Fit for Purpose: The New Database Revolution The Bloor Group – February 22, 2012 Five Years of Incredible Excitement In Information Acrobatics! -Seismic shift in dataÇ Ç Variety, Volume, Velocity√ √ Rajeev Rawat Serving to achieve your full potential ©Copyright BI Results, LLC 2012 Wednesday, February 22, 12
  7. 7. The Bloor Group The Next Five Years The Most Exciting Times In Information Acrobatics New Venture Funding Key Value Store, Big Table, Graph DB, Document DB New (Needed) FunctionalityÇ Ç New Skills√ √ New Ventures Innovative Code NoSQL Innovation Lots of Great Innovation Apache Project, Amazon, Facebook, Google, Open Source Community, Twitter Reports of the Death of The RDBMS Are Highly Exaggerated ©Copyright BI Results, LLC 2012 Wednesday, February 22, 12
  8. 8. The Bloor Group RDBMS Still Dominates Reliable Heavy Lifting RDBMS Vs. NoSQL? Strengths - Robust (ACID, Fail-proof) - Structure (Granular, Scalable, Fast)Ç - Governance (Backups, Precision) - Tools (ETL, Analytics, Reporting) Ç√ - Ecosystem (Global deep collaboration) - Skills (Certifications, Experience) √ - Policies, Procedures (Reliability) - Documentation (Support, Training) Photo: Watchmojo.com Reports of the Death of RDBMS Are Highly Exaggerated ©Copyright BI Results, LLC 2012 Wednesday, February 22, 12
  9. 9. The Bloor Group NoSQL Being Tested, Validated, Calibrated Key Value Store, Big Table, Graph - Co-Existence, Transition, DB, Document DB NoSQL Only - Meta Tag, Master DataÇ Ç Other scheme/s√ √ - Data Governance, Controls. Authentication, Security - Deep Analytics on Mixed Datasets Complexity, Semi- Structured, Highly Connected Data Fantastic Growth Opportunity Skills, Investing ©Copyright BI Results, LLC 2012 Wednesday, February 22, 12
  10. 10. The Bloor Group NoSQL, RDBMS Innovation Fantastic Opportunity for Growth Gaps You Can Help Close The Race Is On! - Mapping Big Data with Legacy DataÇ - Strategy and Policy for Finish Line Ç√ √ Governance, Precision, Controls - Opportunities at all sides - Enterprise - Legacy Vendors - Innovative Ventures Tested For Prime Time - Technology and Business Time to Rise To The Top Skills, Investing ©Copyright BI Results, LLC 2012 Wednesday, February 22, 12
  11. 11. Disection & Discussion Twitter Tag: #briefrWednesday, February 22, 12
  12. 12. Robin Bloor is Chief Analyst at The Bloor Group. Robin.Bloor@Bloorgroup.com Twitter Tag: #briefrWednesday, February 22, 12
  13. 13. Wednesday, February 22, 12
  14. 14. RDBMSWednesday, February 22, 12
  15. 15. The SQL Barrier SQL has: DDL (for data definition) SQL Barrier DML (for Select, Project and Join) Results Or results But it has no MML or TML processing must be done here processing must be done here Usually result sets are brought to the client for further manipulation, but using them for further data access becomes problematic. SQL Conclusions: Analytic DBMS This separation of data from process is arbitrary and unhelpfulWednesday, February 22, 12
  16. 16. That MapReduce Thing There are two fundamental approaches to parallelism Data Partitioning Process partitioning MapReduce implements an approach which is oriented to the first of these. Thus proves to be suited to many “big data” tasks. It is not the end ofd the parallel processing story by any means.Wednesday, February 22, 12
  17. 17. Malcolm Chisholm has 25+ years experience in data management working in finance, insurance, manufacturing, government, defense, pharmaceuticals, and retail. He specializes in data governance, MDM, metadata engineering, business rules management/execution, data architecture and design. He is a well-known presenter at conferences in the U.S. and Europe, writes columns in trade journals, and has authored the books: Managing Reference Data in Enterprise Databases; How to Build a Business Rules Engine; and Definitions in Information Management. In 2011, Malcolm was presented with the prestigious DAMA International Professional Achievement Award for contributions to Master Data Management. He can be contacted at mchisholm@refdataportal.com. Twitter Tag: #briefrWednesday, February 22, 12
  18. 18. Disection & Discussion Twitter Tag: #briefrWednesday, February 22, 12
  19. 19. The New Database Revolution: Relational Roundtable The Virtual Circle February 22, 2012 San Francisco Malcolm Chisholm Ph.D. mchisholm@refdataportal.com Telephone 732-687-9283 • Fax 407-264-6809 www.refdataportal.com www.bizrulesengine.com © AskGet.com Inc., 2012. All rights reservedWednesday, February 22, 12
  20. 20. “Big Data” Is Used Differently Relational Paradigm ULS Dataspace in Cloud “Set at a time” processing Uncover individual facts Behavior of populations of identical things Much is master data Event data predominates Events are not as much repetitive transactions Exception reporting for singular things/events Can aggregate from individual facts (but bottom- (bust still top-down) up) Heavy data entry supported Surf and drill Data entry is to support analysis • The relational paradigm is different to ULS “Big Data”. [ULS = Ultra-Large Scale - usually Petabyte scale] • Difficult to rely on relational thinking in Cloud databases © AskGet.com Inc., 2012. All rights reservedWednesday, February 22, 12
  21. 21. Sources Source A Emails Source B Documents I ULS Dataspace in Cloud Web Pages N Source C G XML E Source D Relational S T Flat Files I Source E O Audio N Image Video • Sources provide data to the ULS dataspace • One source can provide many data formats • Many sources can provide the same format • Sources may duplicate the same data • HINT – Think metadata © AskGet.com Inc., 2012. All rights reservedWednesday, February 22, 12
  22. 22. Segments in Dataspace ULS Dataspace in Cloud Source A I N Ingested Data Terms in G Store Documents Document-Term Source B E Inverted Index S M/R M/R Source C T I O Source N N Extracted Deduplicated Master Data Master Data M/R M/R • The ULS dataspace is not a single “blob” of data • It will have different segments with different kinds of data in it • The segments will be derived from the originally ingested data • MapReduce (M/R) is the equivalent of ETL to move data around and transform it (filter, summarize) © AskGet.com Inc., 2012. All rights reservedWednesday, February 22, 12
  23. 23. No Common Notation for Columnar Designs ? Col A Col B Col C Col D Col E Row 01 Val1A Row 02 Val2A Val2B Val2C Val2D Val2E Row 03 Val3A Val3C Val3E • E/R diagramming techniques allow us to visualize a relational database • There is nothing that is quite the same for columnar databases • (a) It is sparse and columns may be missing • (b) How do you show the MapReduce transformations (not quite relations)? © AskGet.com Inc., 2012. All rights reservedWednesday, February 22, 12
  24. 24. Need a Data Dictionary • The ULS dataspace can grow quickly and have many data objects • Without a DD developers and users will get hopelessly lost (none of the logic imposed by the relational model) • The fundamental unit is the field – show where it occurs in rows, ColQuals and payloads • Tables less important than in relational © AskGet.com Inc., 2012. All rights reservedWednesday, February 22, 12
  25. 25. Disection & Discussion Twitter Tag: #briefrWednesday, February 22, 12
  26. 26. Mark Madsen is founder of Third Nature, a research and consulting firm focused on analytics, BI and decision-making. Mark spent the past two decades working on analysis and decision support in many industries and countries. He is an award- winning architect and former CTO whose work has been featured in numerous industry publications. Over the past ten years Mark received awards for his work from the American Productivity & Quality Center, TDWI, and the Smithsonian Institute. He is an international speaker, a contributing editor at Intelligent Enterprise, and manages the open source channel at the Business Intelligence Network. For more information or to contact Mark, visit http:// ThirdNature.net. Twitter Tag: #briefrWednesday, February 22, 12
  27. 27. One Size Doesn’t Fit All February 22, 2012 Mark R. Madsen http://ThirdNature.netWednesday, February 22, 12
  28. 28. The  future  of  data  is  the  databaseWednesday, February 22, 12
  29. 29. You keep using that word. I do not think it means what you think it means.Wednesday, February 22, 12
  30. 30. Good  conceptual  model,  but  a  prematurely   standardized  implementa5onThe  rela*onal  database  is  the  franchise  technology  for  storing  and  retrieving  data,  but…1.Global,  sta*c  schema  model2.No  rich  typing  system3.Many  are  not  a  good  fit  for  network  parallel  compu*ng,  aka  cloud4.Limited  API  in  atomic  SQL  statement  syntax    &  simple  result  set  returnWednesday, February 22, 12
  31. 31. Plus,  if  they’re  all  the  same  why  are  there  so  many? Sybase  IQ,  ASE EnterpriseDB   Algebraix Teradata,  Aster  Data LucidDB Intersystems  Caché Oracle,  RAC Vectorwise Streambase MicrosoT  SQLServer,  PDW MonetDB SQLStream IBM  DB2s,  Netezza Exasol Coral8 Paraccel Illuminate Ingres Kogni*o Ver*ca Postgres EMC/Greenplum InfiniDB Cassandra Oracle  Exadata 1010  Data CouchDB SAP  HANA SAND Mongo Infobright Endeca Hbase MySQL Xtreme  Data Redis MarkLogic IMS RainStor Tokyo  Cabinet Hive Scalaris And a few hundred more.Wednesday, February 22, 12
  32. 32. The  future  of  data  is  the  rela0onal  database? SQL noSQLWednesday, February 22, 12
  33. 33. The  future  of  data  is  the  rela0onal  database? SQL noSQLWednesday, February 22, 12
  34. 34. Technologies  are  not   perfect  replacements  for   one  another. When  replacing  the  old   with  the  new  (or  ignoring   the  new  over  the  old)  you   always  make  tradeoffs,  and   usually  you  won’t  see  them   for  a  long  0me.Wednesday, February 22, 12
  35. 35. Disection & Discussion Twitter Tag: #briefrWednesday, February 22, 12
  36. 36. Wednesday, February 22, 12
  37. 37. March: Vendor Research March 14th: Second Round Table focusing on No SQL databases and their application DB Revolution Survey conducted April: Vendor Research Publishing of Round Table Transcripts, with comments May: Authoring of White Paper Publishing of White Paper Publishing of survey activity Twitter Tag: #briefrWednesday, February 22, 12
  38. 38. March 14th: Second DB Revolution Round Table March Briefing Room: Integration April Briefing Room: Discovery May Briefing Room: Analytics Twitter Tag: #briefrWednesday, February 22, 12
  39. 39. Thank You For Your AttentionWednesday, February 22, 12

×