Advanced Analytics for “Big Data” with SAP Sybase IQDocument Transcript
In today’s digital world, enterprises need sim-pler and more cost-effective ways to analyzean exploding volume of data and supportexpanding user communities. Users want fastanswers to complex questions involving data,without having to rely on database administra-tors. The latest release of the SAP® Sybase® IQserver helps produce those answers. Designedfor advanced analytics, data warehousing,and business intelligence environments, SAPSybase IQ, version 15.4, works with large vol-umes of structured and unstructured dataand is ideally suited for user-driven analysis.Maximizing Performance, Flexibility, and EconomySAP Sybase IQ is distinguished from conventional databasesby its column-oriented, grid-based architecture; patented datacompression; and advanced query optimizer. It offers a singledatabase management system platform to analyze structured,semistructured, and unstructured data using a variety ofalgorithms.SAP Sybase IQ 15.4 is revolutionizing Big Data analytics by break-ing down silos of data analysis and integrating it into enterpriseanalytic processes.This version expands functionality with thefollowing elements:•• A native MapReduce application programming interface(API)•• Comprehensive and flexible Hadoop integration•• Support for predictive model markup language (PMML)•• An expanded library of statistical and data mining algorithmsthat leverage the power of distributed query processingacross a massively parallel processing (MPP) grid based onPlexQ™ technologyA new API enables application vendors and enterprise develop-ers to quickly and safely implement proprietary algorithmsthat can run in-database, delivering performance acceleration10 to 100 times greater than existing approaches. Additionally,significant improvements have been made for text data com-pression and bulk data loading interfaces.SAP Sybase IQAdvanced Analytics for “Big Data”with SAP® Sybase® IQRun Faster In-Database Analytic Workloadsand Leverage New Analytic Paradigms
Leveraging an Innovative ArchitectureUnlike other MPP solutions, SAP Sybase IQ PlexQ grid technol-ogy can dynamically manage analytics workloads across anexpandable set of compute and storage resources dedicatedto different groups and processes. These attributes make itsimpler and more cost-effective to support escalating volumesof data and rapidly growing user communities.Integrate Data-Analysis Silos with SAP Sybase IQTurn Massive Data into Actionable IntelligenceThe SAP® Sybase® IQ server is built on proven PlexQ™ technology and usesa three-tier architecture shown in the figure:•• Base tier: A massively parallel processing shared-everything analyticdatabase management system (DBMS) engine that supports multiplestyles of complex analytics involving massive data sets, massivenumbers of concurrent users, and unique workflows•• Second tier: The analytics application services layer providing C++and Java in-database application programming interfaces and enablingintegration and federation with external data sources, including fourmethods of Hadoop integration•• Top tier: The SAP Sybase IQ ecosystem, which consists of our strong anddiverse partners and certified applications developed by independentsoftware vendorsWeb 2.0 Java C/C++ SQLIngest + PersistFederationSAP Sybasecontrol centerBusiness objectsSAP® Sybase®PowerDesigner™CertifiedISV toolsDBMSAppservicesEco-systemUnstructureddata (Hadoop,content mgmt)Structureddata (DBMS)Figure: SAP® Sybase® IQ Three-Tier Architecture Based on PlexQ™ Technology
Transforming Big Data intoActionable IntelligenceSAP Sybase IQ builds upon its PlexQ technology to transformBig Data into actionable intelligence for everyone, putting thepower of Big Data analytics easily within reach of users andbusiness processes throughout the entire enterprise. SAPSybase IQ introduces the following key attributes in thenew version.Data Management EnhancementsA number of enhancements improve the data management,deployment, and maintainability of an SAP Sybase IQinstallation.•• Faster bulk loading: Bulk load data inserts into SAP SybaseIQ through open database connectivity (ODBC) and Javadatabase connectivity (JDBC) interfaces, enabling more scal-able applications, with orders-of-magnitude improvement inload performance.•• Better text compression: Better compression of datatypes such as variable character field (VARCHAR), variablebinary (VARBINARY), single character (CHAR), and BINARYdelivers a more efficient and cost-effective way to deployhigh-performance text analytics applications, with significantimprovements in compression rates.Application ServicesThe latest version of SAP Sybase IQ provides a series of APIsand tools to build advanced analytic algorithms that run in-database and leverage MPP through a PlexQ grid.Table parameterized user-defined function (UDF) APIenabling native MapReduce: An API native to SAP Sybase IQthat allows application programmers to build and deploy C++libraries inside an SAP Sybase IQ database server. Use theseAPIs to implement proprietary algorithms or a packaged libraryof algorithms securely inside SAP Sybase IQ to return results10 times faster by executing close to data stored in an SAPSybase IQ database server.This framework allows development and deployment ofMapReduce programs in SAP Sybase IQ to analyze very largedata sets covering structured, semistructured, and unstruc-tured data formats. The C++ map and reduce algorithms arecalled via standard Structured Query Language (SQL) andautomatically distributed and parallelized across the PlexQgrid by the powerful query engine in SAP Sybase IQ.Hadoop integration and federation: Integrate results from aHadoop-based analysis with queries running in SAP Sybase IQ.You can use four different techniques to integrate Hadoop dataand analysis within standard SQL queries (client-side federa-tion; extract, transform, and load processing; data federation;and query federation) with an analytics database.Leverage Hadoop to identify relevant data points from massivesets of structured and unstructured data, and then integratethose relevant data points from Hadoop into SAP Sybase IQfor analysis with transactional data and result sets from otherdata sources.PMML support: Through a certified plug-in from Zementis,automate the execution of analytic models defined usingindustry-standard language that are created in tools like SAS,SPSS,“R,” and other popular predictive workbench products.Leverage popular analytic tools to build predictive models,automate execution of predictive models deployed in SAPSybase IQ, and use industry-standard language to avoid vendorlock-in.“R” integration: Use “R,” the popular open source statisticaltool, to query SAP Sybase IQ databases using an RJDBC inter-face. You can also execute “R” libraries from SAP Sybase IQas a function call within SQL queries and return result sets.In-Database Analytics LibraryA library of advanced analytic, statistical, and data mining algo-rithms run inside SAP Sybase IQ. The latest version providesan updated in-database statistical and data mining library(DBLytix from Fuzzy Logix). The updates enable the library toleverage the MapReduce API in some data mining algorithmsfor MPP, and also include several new functions such as sup-port vector machines, neural networks, and adaptive boosting.SAP Sybase IQ 15.4 is revolutionizingBig Data analytics by breaking downsilos of data analysis and integrating itinto enterprise analytic processes.