Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

JDV Big Data Webinar v2

1,028 views

Published on

Published in: Technology, Business
  • Be the first to comment

  • Be the first to like this

JDV Big Data Webinar v2

  1. 1. GAIN BETTER INSIGHTS FROM BIG DATA USING RED HAT JBOSS DATA VIRTUALIZATION Syed Rasheed Product Marketing Manager Kenny Peeples Technical Marketing Manager Red Hat Corporation December 4th, 2013
  2. 2. Red Hat is… “By running tests and executing numerous examples for specific teams, we were able to prove […] not only would the solution work, but it will perform better & at a fraction of the costs.” MICHAEL BLAKE, Director, Systems & Architecture 2 RED HAT CONFIDENTIAL
  3. 3. Agenda ● Data challenges getting bigger ● Red Hat Big Data Strategy and Platform ● Data Virtualization Overview ● Customer Use Case for Big Data integration using Data Virtualization ● ● 3 Demo Q&A RED HAT CONFIDENTIAL
  4. 4. Poll Question #1 ● What are your plans regarding usage of Hadoop technology at your company? – – Under consideration – Under development – Project level deployment – 4 No plans Enterprise level deployment RED HAT CONFIDENTIAL
  5. 5. Poll Question #2 ● What are your plans regarding usage of Data Virtualization technology at your company? – – Under consideration – Under development – Project level deployment – 5 No plans Enterprise level deployment RED HAT CONFIDENTIAL
  6. 6. Data Driven Economy Data is becoming the new raw material of business: an economic input almost on a par with capital and labor. “Every day I wake up and ask, ‘how can I flow data better, manage data better, analyze data better?” CIO - Wal-Mart 6 RED HAT CONFIDENTIAL
  7. 7. Data Challenges Getting Bigger Big Data, Cloud, and Mobile Existing Data Integration approaches are not sufficient ● Extracting and moving data adds latency and cost ● Every project solves data access and integration in a different way ● Solutions are tightly coupled to data sources ● Poor flexibility and agility BI Reports Operational Reports Enterprise Applications SOA Applications Mobile Applications Constant Change How to align? Integration Complexity Siloed & Complex Hadoop 7 NoSQL Cloud Apps Data Warehouse & Databases Mainframe RED HAT CONFIDENTIAL XML, CSV & Excel Files Enterprise Apps
  8. 8. Business Objective Turn Data into Actionable Information Only 28% Users have any meaningful data access  Reduce costs for finding and accessing highly fragmented data Over 70% BI project efforts lies in the integration of source data  Improve time to market for new products and services by simplifying data access and integration  Deliver IT solution agility necessary to capitalize on constantly changing market conditions  Transform fragmented data into actionable information that delivers competitive advantage 8 RED HAT CONFIDENTIAL
  9. 9. Red Hat’s Big Data Strategy ● Reduce Information Gap thru cost effectively making ALL data easily consumable for analytics Process Integrate Data to Actionable Information Cycle 9 RED HAT CONFIDENTIAL Analytics Data Capture
  10. 10. Red Hat Big Data Platform Platform RHEL Platform Integration & Optimization Hadoop Integration Middleware JBoss Data Virtualization Fedora Big Data SIG Apache Hadoop Hadoop Distributions Hadoop On Red Hat Storage Storage 10 RED HAT CONFIDENTIAL Hadoop On OpenStack Cloud / Virtualization
  11. 11. Red Hat Big Data Platform Platform RHEL Platform Integration & Optimization Hadoop Integration Middleware Fedora Big Data SIG JBoss Data Virtualization Apache Hadoop Hadoop Distributions Hadoop On Red Hat Storage Storage 11 RED HAT CONFIDENTIAL Hadoop On OpenStack Cloud / Virtualization
  12. 12. What does Data Virtualization software do? Turn Fragmented Data into Actionable Information Data Virtualization software virtually unifies data spread across various disparate sources; and makes it available to applications as a single consolidated data source. DATA CONSUMERS BI Reports The data virtualization software implements 3 steps process to bridge data sources and data consumers: ● ● ● 12 Connect: Fast access to data from diverse data sources Compose: Easily create unified virtual data models and views by combining and transforming data from multiple sources. Consume: Expose consistent information to data consumers in the right form thru standard data access methods. SOA Applications Easy, Real-time Information Access Virtual Consolidated Data Source Data Virtualization Software • • • Consume Compose Connect Oracle DW SAP Hadoop DATA SOURCES RED HAT CONFIDENTIAL Salesforce.com Virtualize Abstract Federate Siloed & Complex
  13. 13. Turn Fragmented Data into Actionable Information Mobile Applications Data Consumers JBoss Data Virtualization ESB, ETL BI Reports & Analytics SOA Applications & Portals Design Tools Standard based Data Provisioning JDBC, ODBC, SOAP, REST, OData Consume Easy, Real-time Information Access Dashboard Unified Virtual Database / Common Data Model Compose Unified Customer View Unified Product View Unified Supplier View Optimization Caching Virtualize Abstract Federate Security Connect Native Data Connectivity Data Sources Metadata Siloed & Complex Hadoop 13 NoSQL Cloud Apps Data Warehouse & Databases Mainframe RED HAT CONFIDENTIAL XML, CSV & Excel Files Enterprise Apps
  14. 14. JBoss Data Virtualization: Supported Data Sources Enterprise RDBMS: • Oracle • IBM DB2 • Microsoft SQL Server • Sybase ASE • MySQL • PostgreSQL • Ingres Enterprise EDW: • Teradata • Netezza • Greenplum 14 Hadoop: • Apache • HortonWorks • Cloudera • More coming… Office Productivity: • Microsoft Excel • Microsoft Access • Google Spreadsheets Specialty Data Sources: • ModeShape Repository • Mondrian • MetaMatrix • LDAP RED HAT CONFIDENTIAL NoSQL: • JBoss Data Grid • MongoDB • More coming… Enterprise & Cloud Applications: • Salesforce.com • SAP Technology Connectors: • Flat Files, XML Files, XML over HTTP • SOAP Web Services • REST Web Services • OData Services
  15. 15. Key New Features and Capabilities ● Data connectivity enhancements – – NoSQL (MongoDB – Tech Preview) and JBoss Data Grid – ● Hadoop Integration (Hive – Big Data), Odata support (SAP integration) Developer Productivity improvements – – Enhanced column level security, – ● New VDB Designer 8 and integration with JBoss Developer Studio v7 VDB import/reuse, and native queries Simplify deployment and packaging – – ● Requires JBoss EAP only; included with subscription Remove dependency with SOA Platform Business Dashboard – 15 New rapid data reporting/visualization capability RED HAT CONFIDENTIAL
  16. 16. JBoss Data Virtualization – Use Cases Self-Service Business Intelligence The virtual, reusable data model provides business-friendly representation of data, allowing the user to interact with their data without having to know the complexities of their database or where the data is stored and allowing multiple BI tools to acquire data from centralized data layer. Gain better insights from Big Data using JBoss Data Virtualization to integrate with existing information sources. 360◦ Unified View Deliver a complete view of master & transactional data in real-time. The virtual data layer serves as a unified, enterprise-wide view of business information that improves users’ ability to understand and leverage enterprise data. Agile SOA Data Services A data virtualization layer deliver the missing data services layer to SOA applications. JBoss Data Virtualization increases agility and loose coupling with virtual data stores without the need to touch underlying sources and creation of data services that encapsulate the data access logic and allowing multiple business service to acquire data from centralized data layer. Regulatory Compliance Data Virtualization layer deliver the data firewall functionality. JBoss Data Virtualization improves data quality via centralized access control, robust security infrastructure and reduction in physical copies of data thus reducing risk. Furthermore, the metadata repository catalogs enterprise data locations and the relationships between the data in various data stores, enabling transparency and visibility. 16 RED HAT CONFIDENTIAL
  17. 17. Big Data integration use case Retail Customer Use Case Gain Better Insight from Big Data for Intelligent Inventory Management ● Objective: – ● Right merchandise, at right time and price JBoss BRMS Problem: – ● Analytical Apps Data Driven Decision Management Cannot utilize social data and sentiment analysis with their inventory and purchase management system Solution: – Leverage JBoss Data Virtualization to mashup Sentiment analysis data with inventory and purchasing system data. Leveraged BRMS to optimize pricing and stocking decisions. Consume Compose Connect JBoss Data Virtualization Hive Purchase Mgmt Application Inventory Databases Sentiment Analysis 17 RED HAT CONFIDENTIAL
  18. 18. Better Together - Big Data and Data Virtualization Hadoop not another Silo - Customers Combine Multiple Technologies ● Combine structured and unstructured analysis – ● Combine high velocity and historical analysis – ● Analyze and react to data in motion; adjust models with deep historical analysis Reuse structured data for analysis – 18 Augment data warehouse with additional external sources, such as social media Experimentation and ad-hoc analysis with structured data RED HAT CONFIDENTIAL
  19. 19. Better Together - Big Data and Data Virtualization BI Analytics (historical, operational, predictive) SOA Composite Applications Data Integration JBoss Data Virtualization Capture & Process In-memory Cache JBoss Data Grid Messaging and Event Processing JBoss A-MQ and JBoss BRMS J Structured Data 19 Streaming Data RED HAT CONFIDENTIAL Hadoop Semi-Structured Data Red Hat Storage Red Hat Enterprise Linux & Virtualization Integrate & Analyze Capture, Process and Integrate Data Volume, Velocity, Variety
  20. 20. Consider... Inconsistent, Incomplete Information Uninformed, Delayed Decisions Costly Business Risk and Exposure How would your organization change… ● ● ● 20 If data were readily reusable in place rather than requiring significant effort to build new intermediary data tiers? If data could be repurposed quickly into new applications and business processes? If all applications and business processes could get all of the information needed in the form needed, where needed and when needed? RED HAT CONFIDENTIAL
  21. 21. Red Hat JBoss Middleware Business Process Management • • JBoss BRMS JBoss BPM Suite Application Integration • • • JBoss A-MQ JBoss Fuse JBoss Fuse Service Works Data Integration Foundation ACCELERATE 21 • • • • JBoss Data Virtualization JBoss EAP JBoss Web Server JBoss Data Grid INTEGRATE RED HAT CONFIDENTIAL AUTOMATE JBoss Operations Network JBoss Developer Studio JBoss Portal • • • Management Tools Development Toolsh User Interaction
  22. 22. Big Data Integration using JBoss Data Virtualization DEMO
  23. 23. Demo Scenario ● Objective: – ● Determine if sentiment data from the first week of the Iron Man 3 movie is a predictor of sales Problem: – ● Excel Powerview and DV Dashboard to analyze the aggregated data Cannot utilize social data and sentiment analysis with sales management system Consume Compose Connect Solution: – JBoss Data Virtualization Leverage JBoss Data Virtualization to mashup Sentiment analysis data with ticket and merchandise sales data on MySQL into a single view of the data. Hive SOURCE 1: Hive/Hadoop contains twitter data including sentiment 23 RED HAT CONFIDENTIAL SOURCE 2: MySQL data that includes ticket and merchandise sales
  24. 24. Demonstration System Requirements • JDK – Oracle JDK 1.6, 1.7 or OpenJDK 1.6 or 1.7 • JBoss Data Virtualization v6 Beta – http://jboss.org/products/datavirt.html • JBoss Developer Studio – http://jboss.org/products • JBoss Integration Stack Tools (Teiid) – https://devstudio.jboss.com/updates/7.0-development/integration-stack/ • Slides, Code and References for demo – https://github.com/DataVirtualizationByExample/Mashup-with-Hive-andMySQL • Hortonworks Data Platform (A VM for testing Hive/Hadoop) – http://hortonworks.com/products/hdp-2/#install • Red Hat Storage – http://www.redhat.com/products/storage-server/ 24 RED HAT CONFIDENTIAL
  25. 25. 25 RED HAT CONFIDENTIAL
  26. 26. 26 RED HAT CONFIDENTIAL
  27. 27. 27 RED HAT CONFIDENTIAL
  28. 28. 28 RED HAT CONFIDENTIAL
  29. 29. 29 RED HAT CONFIDENTIAL
  30. 30. 30 RED HAT CONFIDENTIAL
  31. 31. 31 RED HAT CONFIDENTIAL
  32. 32. 32 RED HAT CONFIDENTIAL
  33. 33. 33 RED HAT CONFIDENTIAL
  34. 34. 34 RED HAT CONFIDENTIAL
  35. 35. 35 RED HAT CONFIDENTIAL
  36. 36. 36 RED HAT CONFIDENTIAL
  37. 37. 37 RED HAT CONFIDENTIAL
  38. 38. 38 RED HAT CONFIDENTIAL
  39. 39. 39 RED HAT CONFIDENTIAL
  40. 40. 40 RED HAT CONFIDENTIAL
  41. 41. 41 RED HAT CONFIDENTIAL
  42. 42. 42 RED HAT CONFIDENTIAL
  43. 43. 43 RED HAT CONFIDENTIAL
  44. 44. 44 RED HAT CONFIDENTIAL
  45. 45. 45 RED HAT CONFIDENTIAL
  46. 46. 46 RED HAT CONFIDENTIAL
  47. 47. 47 RED HAT CONFIDENTIAL
  48. 48. 48 RED HAT CONFIDENTIAL
  49. 49. 49 RED HAT CONFIDENTIAL
  50. 50. 50 RED HAT CONFIDENTIAL
  51. 51. 51 RED HAT CONFIDENTIAL
  52. 52. 52 RED HAT CONFIDENTIAL
  53. 53. 53 RED HAT CONFIDENTIAL
  54. 54. 54 RED HAT CONFIDENTIAL
  55. 55. 55 RED HAT CONFIDENTIAL
  56. 56. 56 RED HAT CONFIDENTIAL
  57. 57. 57 RED HAT CONFIDENTIAL
  58. 58. 58 RED HAT CONFIDENTIAL
  59. 59. 59 RED HAT CONFIDENTIAL
  60. 60. 60 RED HAT CONFIDENTIAL
  61. 61. Why Red Hat for Big Data? ● Transform ALL data into actionable information – Cost Effective, Comprehensive Platform – Community based Innovation – Enterprise Class Software and Support Process Integrate Data to Actionable Information Cycle 61 RED HAT CONFIDENTIAL Information Data Capture
  62. 62. Red Hat Big Data Platform Platform RHEL Platform Integration & Optimization Hadoop Integration Middleware JBoss Data Virtualization Fedora Big Data SIG Apache Hadoop Hadoop Distributions Hadoop On Red Hat Storage Storage 62 RED HAT CONFIDENTIAL Hadoop On OpenStack Cloud / Virtualization
  63. 63. Thank You Q&A

×