SlideShare a Scribd company logo
1 of 5
Download to read offline
EMC GREENPLUM DATABASE
Driving the future of data warehousing and analytics

                                                 MEETING THE CHALLENGES OF A DATA-DRIVEN WORLD
  ESSENTIALS
                                                 Rising IT costs, exploding data volumes, and ever-evolving competitive challenges have
  A shared-nothing, massively parallel
                                                 catalyzed new ways of thinking about effective systems for analytics. All these changes have
  processing (MPP) architecture supports
  extreme performance on commodity
                                                 driven radical changes in database technology, and collectively, these developments have
  infrastructure                                 led to a new approach for effective data exploitation. Decades-old legacy architecture for
  • Enables Big Data Analytics via the support
                                                 data management and analytics is inherently unfit for scaling to today’s dramatic increase in
    of high-performance and flexible data        data volume.
    exchange between Hadoop and Green-
                                                 The EMC® Greenplum® Database is a shared-nothing, massively parallel processing (MPP)
    plum Database
                                                 architecture that has been designed for business intelligence and analytical processing. In
  • Designed with polymorphic storage
                                                 this architecture, each server node acts as a self-contained database management system
    (hybrid of row and column) to best fit the
    unique needs of each business intel-         that owns and manages a distinct portion of the overall data. The system automatically
    ligence and analytical use case              distributes data and parallelizes query workloads across all available hardware.
  • Enables massive data storage, loading,
    and processing with unlimited linear
    scalability
  • Provides automatic parallelization with no
    need for manual partitioning or tuning




                                                 The figure shown summarizes why the EMC Greenplum Database is the best platform on the
                                                 market for mission-critical analytics. The core shared-nothing MPP architecture enables
                                                 massive data storage, loading, and processing with unlimited linear scalability. Adaptive
                                                 services provide worldwide enterprises with high availability, workload management, and
                                                 online expansion of capacity. Key product features enable petabyte-scale loading, hybrid
                                                 storage (row or column) to best fit the unique needs of each analytical use case, and
                                                 embedded support for SQL, MapReduce, and programmable analytics. In addition, all major
                                                 third-party analytic and administration tools are supported through standard client
                                                 interfaces.




D ATA S H E E T
The core principle of the EMC Greenplum Database is to move the processing dramatically
closer to the data and its users. This effectively enables the computational resources to
process every query in a fully parallel manner, use all storage connections simultaneously,
and flow data efficiently between resources as the query plan dictates. The result is that a
wide variety of complex processing can be pushed down in close proximity to the data for
maximum efficiency and incredible expressiveness.

The EMC Greenplum Database is already regarded as the most scalable mission-critical
analytical database and in use at more than 200 leading enterprises worldwide.


GREENPLUM DATABASE FEATURES
DATABASE ARCHITECTURE
Core Massively Parallel Processing Architecture
The Greenplum Database architecture provides automatic parallelization of data and
queries—all data is automatically partitioned across all nodes of the system, and queries are
planned and executed using all nodes working together in a highly coordinated fashion.

Petabyte-Scale Loading
High-performance loading uses MPP Scatter/Gather Streaming™ technology. Loading speeds
scale with each additional node to greater than 10 terabytes per hour, per rack. When
loading a continuous stream, trickle micro-batching and re-usable table objects enable data
to be loaded at frequent intervals (e.g., every five minutes), while maintaining extremely
high data ingest rates.

Polymorphic Data Storage and Execution
Using Greenplum’s Polymorphic Data Storage™ technology, the DBA can select the storage,
execution, and compression settings that suit the way that table will be accessed. With this
feature, customers have the choice of row- or column-oriented storage and processing for
any table or partition. In addition, Greenplum Database also supports the placement of data
on specific storage types, such as SSD media or network-attached storage (NAS) archival
stores.

Anywhere Data Access
Anywhere data access enables queries to be executed from the database against external
data sources, returning data in parallel regardless of location, format, or storage medium.

In-Database Compression
In-database compression uses industry-leading compression technology to increase
performance and dramatically reduce the space required to store data. Customers can
expect to see a three- to 10-time disk space reduction with a corresponding increase in
effective I/O performance.

Multi-level Partitioning
Flexible partitioning of tables is based on date, range, or value. Partitioning is specified
using DDL and enables an arbitrary number of levels. The query optimizer will automatically
prune unneeded partitions from the query plan.

Dynamic Partitioning Elimination and Query Memory Optimization
Greenplum Database supports dynamic partition elimination and query memory
optimization. Dynamic Partition Elimination disregards irrelevant partitions in a table and
allows for significant reduction in the amount of data scanned and results in faster query
execution times. The query memory optimization feature intelligently frees and reallocates
memory to different operators during query processing, allowing for better memory
utilization, higher throughput, and higher concurrency.
IN-DATABASE ANALYTICS
Native MapReduce
Greenplum natively runs MapReduce programs within its parallel engine and supports PL/
Java, optimized C, and Java function support.

High-Performance gNet for Hadoop
Greenplum Database enables high-performance parallel import and export of compressed
and uncompressed data from Hadoop clusters using gNet for Hadoop, a parallel
communications transport with the industry’s first direct query interoperability between
Greenplum Database nodes and corresponding Hadoop nodes. To further streamline
resource consumption during load times, custom-format data (binary, Pig, Hive, etc.) in
Hadoop can be converted to GPDB Format via MapReduce, and then imported into
Greenplum Database. This is a high-speed direct integration option that provides an efficient
and flexible data exchange between Greenplum Database and Hadoop. gNet for Hadoop is
available for both Greenplum HD Community Edition and Enterprise Edition.

Advanced Analytical Functions
Greenplum Database provides analytical functions (t-statistics, p-values, and Naïve Bayes)
for advanced in-database analytics. These functions provide the needed metrics for variable
selection to improve the quality of a regression model, as well as enhance the ability to
understand and reason about the edge cases.

Programmable Analytics
A new level of parallel analysis capabilities for mathematicians and statisticians and support
for R, linear algebra, and machine learning primitives is offered.

Greenplum Database Extension Framework and Turnkey In-Database Analytics
Greenplum Database delivers an agile, extensible platform for in-database analytics,
leveraging the system’s massively parallel architecture. Greenplum Database enables
turnkey in-database analytics via Greenplum Extensions, which can be downloaded from
EMC Subscribenet and installed using the new Greenplum Package Manager. This new
Greenplum Database utility ensures automatic installation and updates of functional
extensions like in-database GeoSpatial functions, PL/R, PL/Java, PL/Python, and PL/Perl.
Greenplum Extensions dramatically simplify the task of enabling and managing advanced
in-database functionality across a cluster. For example, extensions automatically get
deployed on new nodes during expansions of Greenplum clusters.


DATABASE MANAGEMENT TOOLS
Online System Expansion
You can add servers to increase storage capacity, processing performance, and loading
performance. The database can remain online and fully available while the expansion
process takes place in the background. Performance and capacity increase linearly as
servers are added.

Workload Management
With administrative control over system resources and their allocation to queries, users can
be assigned to resource queues that manage the inflow of work to the database. Workload
management also enables priority adjustment of running queries.

Dynamic Query Prioritization
Greenplum’s Advanced Workload Management is extended with patent-pending technology
that provides continuous realtime balancing of the resources of the entire cluster across all
running queries. This gives DBAs the controls they need to meet workload service-level
agreements in complex, mixed-workload environments.
Database Performance Monitor Tool
The Greenplum Database’s Performance Monitor data collection agents gather metrics to
help administrators analyze network patterns of Greenplum Database. Collecting these
metrics allows system administrators to pinpoint the cause of network issues, and separate
hardware issues from software issues.

Simple and Fast Parallel Installation
The parallel installation utility allows system administrators to install the Greenplum
Database software on multiple hosts at once. When run as root, it also automates other
system configuration tasks such as creating the Greenplum system user (gpadmin), setting
the system user’s password, setting the ownership of the Greenplum Database installation
directory, and exchanging ssh keys between all specified host address names.


HIGH-AVAILABILITY, BACKUP, AND DISASTER RECOVERY
SUPPORT
Self-Healing Fault Tolerance
Traditional MPP database fault-tolerance techniques were suitable for environments with
less than 100 servers, but TCO has increased dramatically beyond that scale. Greenplum’s
fault-tolerance capabilities provide intelligent fault detection and fast online differential
recovery, lowering TCO and enabling cloud-scale systems with the highest levels of
availability.

Post-Recovery, Online Segment Rebalancing
After segment recovery, the EMC Greenplum Database segments can be rebalanced while the
system is online. All client sessions remain connected to allow no downtime. The database
remains functional while the system is recovered back into an optimal state.

Simpler, Scalable Backup with Data Domain Boost
Greenplum Database now includes advanced integration with EMC Data Domain®
deduplication storage systems via EMC Data Domain Boost for faster, more efficient backup.
This integration distributes parts of the deduplication process to Greenplum Database
servers, enabling them to send only unique data to the Data Domain system. This
dramatically increases aggregate throughput, reduces the amount of data transferred over
the network, and eliminates the need for NFS mount management.


INTEROPERABILITY
Indexes—B-Tree, Bitmap, and More
Greenplum Database supports a range of index types, including B-Tree and Bitmap.

Comprehensive SQL Support
Greenplum Database offers comprehensive SQL-92 and SQL-99 support with SQL 2003 OLAP
extensions and full standard support, including window functions, rollup, cube, and a wide
range of other expressive functionality. All queries are parallelized and executed across the
entire system.

Greenplum Database offers enhanced SQL support, including native support of 20+ Oracle
functions, correlated sub queries, non-recursive WITH clause, and fixed format loader. These
enhancements streamline support of third-party tools that generate such queries and make
migration to Greenplum Database faster and simpler.

Client Access and Third-Party Tools
Greenplum supports standard database interfaces (PostgreSQL, SQL, ODBC, JDBC, OLEDB,
etc.) and is fully supported and certified by a wide range of business intelligence (BI) and
extract/ transform/load (ETL) tools.
pgAdmin3 for GPDB
                                       pgAdmin3 is the most popular and feature-rich Open Source administration and
                                       development platform for PostgreSQL. Greenplum Database ships with an enhanced version
                                       of pgAdmin3 that has been extended to work with Greenplum Database and provides full
                                       support for Greenplum-specific capabilities.

                                       XML Support
                                       Greenplum Database provides support for XML, enabling high-performance, parallel load of
                                       XML documents into the database, support for XML data type and the XML Path language
                                       (xpath).


                                       MAXIMIZE EMC GREENPLUM DCA BENEFITS WITH EMC
                                       GLOBAL SERVICES
                                       EMC delivers a full complement of services for EMC Greenplum hardware and software to
                                       ensure that your system performs as expected in your environment, while minimizing risk to
                                       your business and budget. Expert planning, design, and implementation services help you
                                       quickly realize the value of your hardware and software in your environment—no matter how
                                       simple or complex. After implementation, EMC’s data migration services can help you plan,
                                       design, and safely migrate your critical data over any distance to your new system. EMC will
                                       also help you integrate your new system into your information architecture and business
                                       intelligence and analytics applications (for example: SAS, MicroStrategy, Business Objects,
                                       Tableau, etc.), and manage your new environment when it is complete.

                                       Extensively trained professional services personnel and project management teams,
                                       leveraging EMC’s extensive data warehousing/business intelligence deployment best
                                       practices and guided by our proven methodology, accelerate the business results you need
                                       without straining the resources you have.




  Contact Us
  For more information, contact your
  EMC Greenplum representative or
  visit www.greemplum.com.




                                       EMC2, EMC, Data Domain, Greenplum, and the EMC logo are registered trademarks or trademarks of EMC Corporation in the United
                                       States and other countries. All other trademarks used herein are the property of their respective owners. © Copyright 2011 EMC
                                       Corporation. All rights reserved. Published in the USA. 11/11 Data Sheet H8995


EMC Greenplum
Division Headquarters
1900 South Norfolk Street
San Mateo, CA 94403
Tel: 650-286-8012
www.greenplum.com

More Related Content

What's hot

Integrating dbm ss as a read only execution layer into hadoop
Integrating dbm ss as a read only execution layer into hadoopIntegrating dbm ss as a read only execution layer into hadoop
Integrating dbm ss as a read only execution layer into hadoopJoão Gabriel Lima
 
The Database Environment Chapter 9
The Database Environment Chapter 9The Database Environment Chapter 9
The Database Environment Chapter 9Jeanie Arnoco
 
An Energy Efficient Data Transmission and Aggregation of WSN using Data Proce...
An Energy Efficient Data Transmission and Aggregation of WSN using Data Proce...An Energy Efficient Data Transmission and Aggregation of WSN using Data Proce...
An Energy Efficient Data Transmission and Aggregation of WSN using Data Proce...IRJET Journal
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Karl Roche
 
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...IJCSIS Research Publications
 
% Razones para alta disponibilidad
% Razones para alta disponibilidad% Razones para alta disponibilidad
% Razones para alta disponibilidadCPVEN
 
Accelerate Oil & Gas Discovery
Accelerate Oil & Gas DiscoveryAccelerate Oil & Gas Discovery
Accelerate Oil & Gas DiscoveryPanasas
 
The Database Environment Chapter 6
The Database Environment Chapter 6The Database Environment Chapter 6
The Database Environment Chapter 6Jeanie Arnoco
 
The Database Environment Chapter 1
The Database Environment Chapter 1The Database Environment Chapter 1
The Database Environment Chapter 1Jeanie Arnoco
 
Data Center Optimization
Data Center OptimizationData Center Optimization
Data Center OptimizationBihag Karnani
 
The Database Environment Chapter 12
The Database Environment Chapter 12The Database Environment Chapter 12
The Database Environment Chapter 12Jeanie Arnoco
 
Netezza Architecture and Administration
Netezza Architecture and AdministrationNetezza Architecture and Administration
Netezza Architecture and AdministrationBraja Krishna Das
 
An-Insight-about-Glusterfs-and-it's-Enforcement-Techniques
An-Insight-about-Glusterfs-and-it's-Enforcement-TechniquesAn-Insight-about-Glusterfs-and-it's-Enforcement-Techniques
An-Insight-about-Glusterfs-and-it's-Enforcement-TechniquesManikandan Selvaganesh
 
Towards a low cost etl system
Towards a low cost etl systemTowards a low cost etl system
Towards a low cost etl systemijdms
 

What's hot (19)

Greenplum Architecture
Greenplum ArchitectureGreenplum Architecture
Greenplum Architecture
 
Integrating dbm ss as a read only execution layer into hadoop
Integrating dbm ss as a read only execution layer into hadoopIntegrating dbm ss as a read only execution layer into hadoop
Integrating dbm ss as a read only execution layer into hadoop
 
The Database Environment Chapter 9
The Database Environment Chapter 9The Database Environment Chapter 9
The Database Environment Chapter 9
 
An Energy Efficient Data Transmission and Aggregation of WSN using Data Proce...
An Energy Efficient Data Transmission and Aggregation of WSN using Data Proce...An Energy Efficient Data Transmission and Aggregation of WSN using Data Proce...
An Energy Efficient Data Transmission and Aggregation of WSN using Data Proce...
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.
 
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...
 
% Razones para alta disponibilidad
% Razones para alta disponibilidad% Razones para alta disponibilidad
% Razones para alta disponibilidad
 
3 olap storage
3 olap storage3 olap storage
3 olap storage
 
Whitepaper
WhitepaperWhitepaper
Whitepaper
 
Accelerate Oil & Gas Discovery
Accelerate Oil & Gas DiscoveryAccelerate Oil & Gas Discovery
Accelerate Oil & Gas Discovery
 
The Database Environment Chapter 6
The Database Environment Chapter 6The Database Environment Chapter 6
The Database Environment Chapter 6
 
The Database Environment Chapter 1
The Database Environment Chapter 1The Database Environment Chapter 1
The Database Environment Chapter 1
 
Oracle D Brg2
Oracle D Brg2Oracle D Brg2
Oracle D Brg2
 
Data Center Optimization
Data Center OptimizationData Center Optimization
Data Center Optimization
 
The Database Environment Chapter 12
The Database Environment Chapter 12The Database Environment Chapter 12
The Database Environment Chapter 12
 
Netezza Architecture and Administration
Netezza Architecture and AdministrationNetezza Architecture and Administration
Netezza Architecture and Administration
 
An-Insight-about-Glusterfs-and-it's-Enforcement-Techniques
An-Insight-about-Glusterfs-and-it's-Enforcement-TechniquesAn-Insight-about-Glusterfs-and-it's-Enforcement-Techniques
An-Insight-about-Glusterfs-and-it's-Enforcement-Techniques
 
IBM Power 730 Express server
IBM Power 730 Express serverIBM Power 730 Express server
IBM Power 730 Express server
 
Towards a low cost etl system
Towards a low cost etl systemTowards a low cost etl system
Towards a low cost etl system
 

Viewers also liked

La meva primera presentació
La meva primera presentacióLa meva primera presentació
La meva primera presentacióhelenbou4352
 
Swipp Brochure
Swipp BrochureSwipp Brochure
Swipp BrochureSwipp
 
Pozo requena fotonovel·la
Pozo requena fotonovel·laPozo requena fotonovel·la
Pozo requena fotonovel·lamgonellgomez
 
Analyst Report: Clearing the Clouds
Analyst Report: Clearing the Clouds  Analyst Report: Clearing the Clouds
Analyst Report: Clearing the Clouds EMC
 
Industrial internet
Industrial internetIndustrial internet
Industrial internetEMC
 
MDP on Financial Statement Analysis
MDP on  Financial Statement AnalysisMDP on  Financial Statement Analysis
MDP on Financial Statement Analysiskanagaraj300
 
Thurs greek and maya
Thurs greek and mayaThurs greek and maya
Thurs greek and mayaTravis Klein
 
02 block productivity pp fs
02 block productivity pp fs02 block productivity pp fs
02 block productivity pp fsTravis Klein
 
Digital content promotion q1 2013
Digital content promotion q1 2013Digital content promotion q1 2013
Digital content promotion q1 2013Rene Summer
 
What must a leader bear in mind when attempting to change workplace culture
What must a leader bear in mind when attempting to change workplace cultureWhat must a leader bear in mind when attempting to change workplace culture
What must a leader bear in mind when attempting to change workplace cultureDaleCarnegieIndia1
 
EMC Enterprise Hybrid Cloud 2.5.1, Federation SDDC Edition: Backup Solution G...
EMC Enterprise Hybrid Cloud 2.5.1, Federation SDDC Edition: Backup Solution G...EMC Enterprise Hybrid Cloud 2.5.1, Federation SDDC Edition: Backup Solution G...
EMC Enterprise Hybrid Cloud 2.5.1, Federation SDDC Edition: Backup Solution G...EMC
 
What's New in VMware Virtual SAN
What's New in VMware Virtual SANWhat's New in VMware Virtual SAN
What's New in VMware Virtual SANEMC
 
Ablation material book
Ablation material   bookAblation material   book
Ablation material bookRahman Hakim
 
What Are Your Servers Doing While You’re Sleeping?
What Are Your Servers Doing While You’re Sleeping?What Are Your Servers Doing While You’re Sleeping?
What Are Your Servers Doing While You’re Sleeping?Tracy McKibben
 
17 λειτουργια πεπτικου - ανιχνευση ουσιων
17   λειτουργια πεπτικου - ανιχνευση ουσιων17   λειτουργια πεπτικου - ανιχνευση ουσιων
17 λειτουργια πεπτικου - ανιχνευση ουσιωνXristos Lyg
 

Viewers also liked (20)

La meva primera presentació
La meva primera presentacióLa meva primera presentació
La meva primera presentació
 
Swipp Brochure
Swipp BrochureSwipp Brochure
Swipp Brochure
 
Monopolistic comp
Monopolistic compMonopolistic comp
Monopolistic comp
 
Day 6 civil war
Day 6 civil warDay 6 civil war
Day 6 civil war
 
Pozo requena fotonovel·la
Pozo requena fotonovel·laPozo requena fotonovel·la
Pozo requena fotonovel·la
 
Analyst Report: Clearing the Clouds
Analyst Report: Clearing the Clouds  Analyst Report: Clearing the Clouds
Analyst Report: Clearing the Clouds
 
Industrial internet
Industrial internetIndustrial internet
Industrial internet
 
Perfect comp
Perfect compPerfect comp
Perfect comp
 
MDP on Financial Statement Analysis
MDP on  Financial Statement AnalysisMDP on  Financial Statement Analysis
MDP on Financial Statement Analysis
 
Discover Great Reasons to move to ConfigMgr 2012 SP1
Discover Great Reasons to move to ConfigMgr 2012 SP1Discover Great Reasons to move to ConfigMgr 2012 SP1
Discover Great Reasons to move to ConfigMgr 2012 SP1
 
Thurs greek and maya
Thurs greek and mayaThurs greek and maya
Thurs greek and maya
 
02 block productivity pp fs
02 block productivity pp fs02 block productivity pp fs
02 block productivity pp fs
 
Digital content promotion q1 2013
Digital content promotion q1 2013Digital content promotion q1 2013
Digital content promotion q1 2013
 
What must a leader bear in mind when attempting to change workplace culture
What must a leader bear in mind when attempting to change workplace cultureWhat must a leader bear in mind when attempting to change workplace culture
What must a leader bear in mind when attempting to change workplace culture
 
2015 day 1
2015 day 12015 day 1
2015 day 1
 
EMC Enterprise Hybrid Cloud 2.5.1, Federation SDDC Edition: Backup Solution G...
EMC Enterprise Hybrid Cloud 2.5.1, Federation SDDC Edition: Backup Solution G...EMC Enterprise Hybrid Cloud 2.5.1, Federation SDDC Edition: Backup Solution G...
EMC Enterprise Hybrid Cloud 2.5.1, Federation SDDC Edition: Backup Solution G...
 
What's New in VMware Virtual SAN
What's New in VMware Virtual SANWhat's New in VMware Virtual SAN
What's New in VMware Virtual SAN
 
Ablation material book
Ablation material   bookAblation material   book
Ablation material book
 
What Are Your Servers Doing While You’re Sleeping?
What Are Your Servers Doing While You’re Sleeping?What Are Your Servers Doing While You’re Sleeping?
What Are Your Servers Doing While You’re Sleeping?
 
17 λειτουργια πεπτικου - ανιχνευση ουσιων
17   λειτουργια πεπτικου - ανιχνευση ουσιων17   λειτουργια πεπτικου - ανιχνευση ουσιων
17 λειτουργια πεπτικου - ανιχνευση ουσιων
 

Similar to EMC Greenplum Database version 4.2

Dataintensive
DataintensiveDataintensive
Dataintensivesulfath
 
Green Plum IIIT- Allahabad
Green Plum IIIT- Allahabad Green Plum IIIT- Allahabad
Green Plum IIIT- Allahabad IIIT ALLAHABAD
 
Data-Intensive Technologies for Cloud Computing
Data-Intensive Technologies for CloudComputingData-Intensive Technologies for CloudComputing
Data-Intensive Technologies for Cloud Computinghuda2018
 
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse EMC
 
White Paper: Advanced Cyber Analytics with Greenplum Database
White Paper: Advanced Cyber Analytics with Greenplum DatabaseWhite Paper: Advanced Cyber Analytics with Greenplum Database
White Paper: Advanced Cyber Analytics with Greenplum DatabaseEMC
 
Netezza Deep Dives
Netezza Deep DivesNetezza Deep Dives
Netezza Deep DivesRush Shah
 
Advancing life sciences with IBM reference architecture for genomics
Advancing life sciences with IBM reference architecture for genomicsAdvancing life sciences with IBM reference architecture for genomics
Advancing life sciences with IBM reference architecture for genomicsPatrick Berghaeger
 
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...In-Memory Computing Summit
 
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataOrca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataEMC
 
Netezza vs teradata
Netezza vs teradataNetezza vs teradata
Netezza vs teradataAsis Mohanty
 
spectrum Storage Whitepaper
spectrum Storage Whitepaperspectrum Storage Whitepaper
spectrum Storage WhitepaperCarina Kordan
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action MapR Technologies
 
Hyperconvergence Facts and FAQs
Hyperconvergence Facts and FAQsHyperconvergence Facts and FAQs
Hyperconvergence Facts and FAQsSpringpath
 
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...IBM India Smarter Computing
 
4870 ibm-storage-solutions-final_nov26_18_34019934_usen
4870  ibm-storage-solutions-final_nov26_18_34019934_usen4870  ibm-storage-solutions-final_nov26_18_34019934_usen
4870 ibm-storage-solutions-final_nov26_18_34019934_usenduc_spt
 
Data Virtualization: An Essential Component of a Cloud Data Lake
Data Virtualization: An Essential Component of a Cloud Data LakeData Virtualization: An Essential Component of a Cloud Data Lake
Data Virtualization: An Essential Component of a Cloud Data LakeDenodo
 
Storage Networking Solutions for High Performance Databases by QLogic
Storage Networking Solutions for High Performance Databases by QLogicStorage Networking Solutions for High Performance Databases by QLogic
Storage Networking Solutions for High Performance Databases by QLogicJone Smith
 
IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015Doug O'Flaherty
 

Similar to EMC Greenplum Database version 4.2 (20)

Dataintensive
DataintensiveDataintensive
Dataintensive
 
Green Plum IIIT- Allahabad
Green Plum IIIT- Allahabad Green Plum IIIT- Allahabad
Green Plum IIIT- Allahabad
 
Data-Intensive Technologies for Cloud Computing
Data-Intensive Technologies for CloudComputingData-Intensive Technologies for CloudComputing
Data-Intensive Technologies for Cloud Computing
 
Greenplum feature
Greenplum featureGreenplum feature
Greenplum feature
 
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
 
White Paper: Advanced Cyber Analytics with Greenplum Database
White Paper: Advanced Cyber Analytics with Greenplum DatabaseWhite Paper: Advanced Cyber Analytics with Greenplum Database
White Paper: Advanced Cyber Analytics with Greenplum Database
 
Facade
FacadeFacade
Facade
 
Netezza Deep Dives
Netezza Deep DivesNetezza Deep Dives
Netezza Deep Dives
 
Advancing life sciences with IBM reference architecture for genomics
Advancing life sciences with IBM reference architecture for genomicsAdvancing life sciences with IBM reference architecture for genomics
Advancing life sciences with IBM reference architecture for genomics
 
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
 
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataOrca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big Data
 
Netezza vs teradata
Netezza vs teradataNetezza vs teradata
Netezza vs teradata
 
spectrum Storage Whitepaper
spectrum Storage Whitepaperspectrum Storage Whitepaper
spectrum Storage Whitepaper
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
 
Hyperconvergence Facts and FAQs
Hyperconvergence Facts and FAQsHyperconvergence Facts and FAQs
Hyperconvergence Facts and FAQs
 
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
 
4870 ibm-storage-solutions-final_nov26_18_34019934_usen
4870  ibm-storage-solutions-final_nov26_18_34019934_usen4870  ibm-storage-solutions-final_nov26_18_34019934_usen
4870 ibm-storage-solutions-final_nov26_18_34019934_usen
 
Data Virtualization: An Essential Component of a Cloud Data Lake
Data Virtualization: An Essential Component of a Cloud Data LakeData Virtualization: An Essential Component of a Cloud Data Lake
Data Virtualization: An Essential Component of a Cloud Data Lake
 
Storage Networking Solutions for High Performance Databases by QLogic
Storage Networking Solutions for High Performance Databases by QLogicStorage Networking Solutions for High Performance Databases by QLogic
Storage Networking Solutions for High Performance Databases by QLogic
 
IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015
 

More from EMC

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDEMC
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote EMC
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOEMC
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremioEMC
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lakeEMC
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereEMC
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History EMC
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewEMC
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeEMC
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic EMC
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityEMC
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeEMC
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015EMC
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesEMC
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsEMC
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookEMC
 

More from EMC (20)

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremio
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis Openstack
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lake
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop Elsewhere
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical Review
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or Foe
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for Security
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure Age
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education Services
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere Environments
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBook
 

Recently uploaded

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

EMC Greenplum Database version 4.2

  • 1. EMC GREENPLUM DATABASE Driving the future of data warehousing and analytics MEETING THE CHALLENGES OF A DATA-DRIVEN WORLD ESSENTIALS Rising IT costs, exploding data volumes, and ever-evolving competitive challenges have A shared-nothing, massively parallel catalyzed new ways of thinking about effective systems for analytics. All these changes have processing (MPP) architecture supports extreme performance on commodity driven radical changes in database technology, and collectively, these developments have infrastructure led to a new approach for effective data exploitation. Decades-old legacy architecture for • Enables Big Data Analytics via the support data management and analytics is inherently unfit for scaling to today’s dramatic increase in of high-performance and flexible data data volume. exchange between Hadoop and Green- The EMC® Greenplum® Database is a shared-nothing, massively parallel processing (MPP) plum Database architecture that has been designed for business intelligence and analytical processing. In • Designed with polymorphic storage this architecture, each server node acts as a self-contained database management system (hybrid of row and column) to best fit the unique needs of each business intel- that owns and manages a distinct portion of the overall data. The system automatically ligence and analytical use case distributes data and parallelizes query workloads across all available hardware. • Enables massive data storage, loading, and processing with unlimited linear scalability • Provides automatic parallelization with no need for manual partitioning or tuning The figure shown summarizes why the EMC Greenplum Database is the best platform on the market for mission-critical analytics. The core shared-nothing MPP architecture enables massive data storage, loading, and processing with unlimited linear scalability. Adaptive services provide worldwide enterprises with high availability, workload management, and online expansion of capacity. Key product features enable petabyte-scale loading, hybrid storage (row or column) to best fit the unique needs of each analytical use case, and embedded support for SQL, MapReduce, and programmable analytics. In addition, all major third-party analytic and administration tools are supported through standard client interfaces. D ATA S H E E T
  • 2. The core principle of the EMC Greenplum Database is to move the processing dramatically closer to the data and its users. This effectively enables the computational resources to process every query in a fully parallel manner, use all storage connections simultaneously, and flow data efficiently between resources as the query plan dictates. The result is that a wide variety of complex processing can be pushed down in close proximity to the data for maximum efficiency and incredible expressiveness. The EMC Greenplum Database is already regarded as the most scalable mission-critical analytical database and in use at more than 200 leading enterprises worldwide. GREENPLUM DATABASE FEATURES DATABASE ARCHITECTURE Core Massively Parallel Processing Architecture The Greenplum Database architecture provides automatic parallelization of data and queries—all data is automatically partitioned across all nodes of the system, and queries are planned and executed using all nodes working together in a highly coordinated fashion. Petabyte-Scale Loading High-performance loading uses MPP Scatter/Gather Streaming™ technology. Loading speeds scale with each additional node to greater than 10 terabytes per hour, per rack. When loading a continuous stream, trickle micro-batching and re-usable table objects enable data to be loaded at frequent intervals (e.g., every five minutes), while maintaining extremely high data ingest rates. Polymorphic Data Storage and Execution Using Greenplum’s Polymorphic Data Storage™ technology, the DBA can select the storage, execution, and compression settings that suit the way that table will be accessed. With this feature, customers have the choice of row- or column-oriented storage and processing for any table or partition. In addition, Greenplum Database also supports the placement of data on specific storage types, such as SSD media or network-attached storage (NAS) archival stores. Anywhere Data Access Anywhere data access enables queries to be executed from the database against external data sources, returning data in parallel regardless of location, format, or storage medium. In-Database Compression In-database compression uses industry-leading compression technology to increase performance and dramatically reduce the space required to store data. Customers can expect to see a three- to 10-time disk space reduction with a corresponding increase in effective I/O performance. Multi-level Partitioning Flexible partitioning of tables is based on date, range, or value. Partitioning is specified using DDL and enables an arbitrary number of levels. The query optimizer will automatically prune unneeded partitions from the query plan. Dynamic Partitioning Elimination and Query Memory Optimization Greenplum Database supports dynamic partition elimination and query memory optimization. Dynamic Partition Elimination disregards irrelevant partitions in a table and allows for significant reduction in the amount of data scanned and results in faster query execution times. The query memory optimization feature intelligently frees and reallocates memory to different operators during query processing, allowing for better memory utilization, higher throughput, and higher concurrency.
  • 3. IN-DATABASE ANALYTICS Native MapReduce Greenplum natively runs MapReduce programs within its parallel engine and supports PL/ Java, optimized C, and Java function support. High-Performance gNet for Hadoop Greenplum Database enables high-performance parallel import and export of compressed and uncompressed data from Hadoop clusters using gNet for Hadoop, a parallel communications transport with the industry’s first direct query interoperability between Greenplum Database nodes and corresponding Hadoop nodes. To further streamline resource consumption during load times, custom-format data (binary, Pig, Hive, etc.) in Hadoop can be converted to GPDB Format via MapReduce, and then imported into Greenplum Database. This is a high-speed direct integration option that provides an efficient and flexible data exchange between Greenplum Database and Hadoop. gNet for Hadoop is available for both Greenplum HD Community Edition and Enterprise Edition. Advanced Analytical Functions Greenplum Database provides analytical functions (t-statistics, p-values, and Naïve Bayes) for advanced in-database analytics. These functions provide the needed metrics for variable selection to improve the quality of a regression model, as well as enhance the ability to understand and reason about the edge cases. Programmable Analytics A new level of parallel analysis capabilities for mathematicians and statisticians and support for R, linear algebra, and machine learning primitives is offered. Greenplum Database Extension Framework and Turnkey In-Database Analytics Greenplum Database delivers an agile, extensible platform for in-database analytics, leveraging the system’s massively parallel architecture. Greenplum Database enables turnkey in-database analytics via Greenplum Extensions, which can be downloaded from EMC Subscribenet and installed using the new Greenplum Package Manager. This new Greenplum Database utility ensures automatic installation and updates of functional extensions like in-database GeoSpatial functions, PL/R, PL/Java, PL/Python, and PL/Perl. Greenplum Extensions dramatically simplify the task of enabling and managing advanced in-database functionality across a cluster. For example, extensions automatically get deployed on new nodes during expansions of Greenplum clusters. DATABASE MANAGEMENT TOOLS Online System Expansion You can add servers to increase storage capacity, processing performance, and loading performance. The database can remain online and fully available while the expansion process takes place in the background. Performance and capacity increase linearly as servers are added. Workload Management With administrative control over system resources and their allocation to queries, users can be assigned to resource queues that manage the inflow of work to the database. Workload management also enables priority adjustment of running queries. Dynamic Query Prioritization Greenplum’s Advanced Workload Management is extended with patent-pending technology that provides continuous realtime balancing of the resources of the entire cluster across all running queries. This gives DBAs the controls they need to meet workload service-level agreements in complex, mixed-workload environments.
  • 4. Database Performance Monitor Tool The Greenplum Database’s Performance Monitor data collection agents gather metrics to help administrators analyze network patterns of Greenplum Database. Collecting these metrics allows system administrators to pinpoint the cause of network issues, and separate hardware issues from software issues. Simple and Fast Parallel Installation The parallel installation utility allows system administrators to install the Greenplum Database software on multiple hosts at once. When run as root, it also automates other system configuration tasks such as creating the Greenplum system user (gpadmin), setting the system user’s password, setting the ownership of the Greenplum Database installation directory, and exchanging ssh keys between all specified host address names. HIGH-AVAILABILITY, BACKUP, AND DISASTER RECOVERY SUPPORT Self-Healing Fault Tolerance Traditional MPP database fault-tolerance techniques were suitable for environments with less than 100 servers, but TCO has increased dramatically beyond that scale. Greenplum’s fault-tolerance capabilities provide intelligent fault detection and fast online differential recovery, lowering TCO and enabling cloud-scale systems with the highest levels of availability. Post-Recovery, Online Segment Rebalancing After segment recovery, the EMC Greenplum Database segments can be rebalanced while the system is online. All client sessions remain connected to allow no downtime. The database remains functional while the system is recovered back into an optimal state. Simpler, Scalable Backup with Data Domain Boost Greenplum Database now includes advanced integration with EMC Data Domain® deduplication storage systems via EMC Data Domain Boost for faster, more efficient backup. This integration distributes parts of the deduplication process to Greenplum Database servers, enabling them to send only unique data to the Data Domain system. This dramatically increases aggregate throughput, reduces the amount of data transferred over the network, and eliminates the need for NFS mount management. INTEROPERABILITY Indexes—B-Tree, Bitmap, and More Greenplum Database supports a range of index types, including B-Tree and Bitmap. Comprehensive SQL Support Greenplum Database offers comprehensive SQL-92 and SQL-99 support with SQL 2003 OLAP extensions and full standard support, including window functions, rollup, cube, and a wide range of other expressive functionality. All queries are parallelized and executed across the entire system. Greenplum Database offers enhanced SQL support, including native support of 20+ Oracle functions, correlated sub queries, non-recursive WITH clause, and fixed format loader. These enhancements streamline support of third-party tools that generate such queries and make migration to Greenplum Database faster and simpler. Client Access and Third-Party Tools Greenplum supports standard database interfaces (PostgreSQL, SQL, ODBC, JDBC, OLEDB, etc.) and is fully supported and certified by a wide range of business intelligence (BI) and extract/ transform/load (ETL) tools.
  • 5. pgAdmin3 for GPDB pgAdmin3 is the most popular and feature-rich Open Source administration and development platform for PostgreSQL. Greenplum Database ships with an enhanced version of pgAdmin3 that has been extended to work with Greenplum Database and provides full support for Greenplum-specific capabilities. XML Support Greenplum Database provides support for XML, enabling high-performance, parallel load of XML documents into the database, support for XML data type and the XML Path language (xpath). MAXIMIZE EMC GREENPLUM DCA BENEFITS WITH EMC GLOBAL SERVICES EMC delivers a full complement of services for EMC Greenplum hardware and software to ensure that your system performs as expected in your environment, while minimizing risk to your business and budget. Expert planning, design, and implementation services help you quickly realize the value of your hardware and software in your environment—no matter how simple or complex. After implementation, EMC’s data migration services can help you plan, design, and safely migrate your critical data over any distance to your new system. EMC will also help you integrate your new system into your information architecture and business intelligence and analytics applications (for example: SAS, MicroStrategy, Business Objects, Tableau, etc.), and manage your new environment when it is complete. Extensively trained professional services personnel and project management teams, leveraging EMC’s extensive data warehousing/business intelligence deployment best practices and guided by our proven methodology, accelerate the business results you need without straining the resources you have. Contact Us For more information, contact your EMC Greenplum representative or visit www.greemplum.com. EMC2, EMC, Data Domain, Greenplum, and the EMC logo are registered trademarks or trademarks of EMC Corporation in the United States and other countries. All other trademarks used herein are the property of their respective owners. © Copyright 2011 EMC Corporation. All rights reserved. Published in the USA. 11/11 Data Sheet H8995 EMC Greenplum Division Headquarters 1900 South Norfolk Street San Mateo, CA 94403 Tel: 650-286-8012 www.greenplum.com