SlideShare a Scribd company logo
1 of 22
SCHOOL OF COMPUTER SCIENCE & IT-
         DAVV, INDORE




        Presentation on
   Oracle 11g Data warehouse
          Introduction
            Presented by:
            Aditya Trivedi
     aadi.trivedi01@gmail.com
Goal:
   The goal of this presentation is to introduce you to the data
    warehousing solutions available in the Oracle Database.
   The primary interfaces used are
       Oracle Enterprise Manager,
       Oracle Warehouse Builder, and
       SQL*Plus.




                         25 November 2011                       2
Data Warehouse:
   A data warehouse is a relational or multidimensional database that
    is designed for query and analysis. They are not optimized for
    transaction processing, which is the domain of OLTP systems.
   Data warehouses usually consolidate historical and transactional
    data derived from multiple sources.
   Data in a data warehouse usually stores many months or years of
    data to support historical analysis.
   The data in a data warehouse is typically loaded through an
    extraction, transformation, and loading (ETL) process from one or
    more data sources such as OLTP applications, mainframe
    applications, or external data providers.




                           25 November 2011                              3
The Key Characteristics of a Data Warehouse:
The key characteristics of a data warehouse are as follows:
      Some data is denormalized for simplification and to improve
       performance.
      Large amounts of historical data are used.
      Queries often retrieve large amounts of data.
      Both planned and ad hoc queries are common.
      The data load is controlled.
 In general, high data throughput is the key to a successful data
  warehouse.




                        25 November 2011                         4
Common Oracle Data Warehousing Tasks:

As an Oracle data warehousing administrator or designer, you can
  expect to be involved in the following tasks:
 Configuring an Oracle database for use as a data warehouse
 Designing data warehouses
 Performing upgrades of the database and software to new release
  levels
 Managing schema objects, such as tables, indexes, and materialized
  views
 Managing users and security
 Developing routines used for the Extraction, Transformation, and
  Loading (ETL) process
 Creating reports based on the data in the data warehouse
 Backing up the data warehouse and performing recovery when
  necessary
 Monitoring the data warehouse's performance and taking preventive
  or corrective action as required

                         25 November 2011                          5
Setting up your Data Warehouse System:
This section discusses how to initially configure your data warehouse
  environment. It includes the following topics:
 General Steps for Setting up a Data Warehouse System
 Preparing the Environment
 Setting Up a Database for a Data Warehouse
 Accessing Oracle Warehouse Builder




                          25 November 2011                              6
General Steps for Setting up a Data Warehouse
System:
   Size and configure your hardware.
   Install the Oracle Database software.
   Optimize the Database for use as a data warehouse.
   Access the Oracle Warehouse Builder software.




                           25 November 2011              7
Preparing the Environment:

For better throughput and performance we need to follow these steps:
 How Many CPUs and What Clock Speed Do I Need?
 How Much Memory Do I Need?
 How Many Disks Do I Need?
 How Do I Determine Sufficient I/O Bandwidth?




                          25 November 2011                             8
How Many CPUs and What Clock Speed Do I
Need?
You must have sufficient CPU power to perform the data warehouse
  operations.
Use the estimated highest throughput as a guideline for the number of
  CPUs you need. As a rough estimate, use the following formula:

<number of CPUs> = <maximum throughput in MB/s> / 200




                          25 November 2011                          9
How Much Memory Do I Need?

Memory in a data warehouse is particularly important for processing
  memory-intensive operations such as large sorts.
The number of CPUs provides you a good guideline for the amount of
  memory you need. Use the following simplified formula to derive
  the amount of memory you need from the CPUs you selected:

<amount of memory in GB> = 2 * <number of CPUs>




                         25 November 2011                         10
How Many Disks Do I Need?

 Use the throughput you require to find out how many disk
  arrays you need.
 Use the storage provider's specifications to find out how
  much throughput a disk array can sustain.
Note: Storage providers measure in Gb per second, and your
  initial throughput estimate is based on MB second. An
  average disk controller has a maximum throughput of 2
  Gb second, which translates to a sustainable throughput
  of about (70% * 2 Gbit/s) /8 = 180 MB/s.
Use the following formula to determine the number of disk
  arrays you need:

<number of disk controllers> = <throughput in MB/s> /
  <individual controller throughput in MB/s>

                      25 November 2011                    11
How Do I Determine Sufficient I/O
Bandwidth?
   Host Bus Adapters (HBAs)- between server and storage
   Switches- between servers and SAN
   Ethernet adapters- for network connectivity
   Wires- to connect individual components
   Use the conversion in the following table to translate the vendors'
    maximum throughput numbers in bits into sustainable throughput
    numbers in bytes.
Component            Bits                        Bytes/second
HBA                  2 Gbit                      200 MB
16 port switches     8*2 Gbits                   1200 MB
Fibre Channel        2 Gbit                      200 MB
GigE NIC             1 Gbit                      80 MB
Inf-2 Gbit           2 Gbit                      160 MB

                              25 November 2011                        12
Verifying the Data Warehouse Hardware
Configuration:
Two tools for verifying throughput are
 ddutility
 Orion
An Oracle-supplied tool (we’ll consider here dd utility only)




                         25 November 2011                       13
dd Utility:
First, the most important options for using dd are the following:
bs=BYTES: Read BYTES bytes at a time; use 1 MB
count=BLOCKS: copy only BLOCKS input blocks
if=FILE: read from FILE;
of=FILE: write to FILE; set to /dev/null to evaluate read performance;
write to disk would erase all existing data!!!
skip=BLOCKS: skip BLOCKS BYTES-sized blocks at start of input




                          25 November 2011                           14
dd Utility (...)

The following dd command performs random sequential disk access
across two devices reading a total of 2 GB. The throughput is 2 GB
divided by the time it takes to finish the following command:




                          25 November 2011                           15
Setting Up a Database for a Data Warehouse:

You can use Oracle Enterprise Manager to set up your data warehouse.
 To view various parameter settings, navigate to the Database page,
  then click,
     ◦ Server
    Under Database Configuration, click
     ◦ Memory Parameters or
     ◦ All Initialization Parameters.




                                25 November 2011                   16
Memory management parameters:
Two types of memory:
 Shared memory(SGA): SGA_TARGET
    ◦ Allow 25% of total memory
    ◦ Minimum 100 MB
   Session-based memory(PGA):PGA_AGGREGATE_TARGET
    ◦ PGA_AGGREGATE_TARGET = 3 * SGA_TARGET




                             25 November 2011        17
Initialization parameters:
   COMPATIBLE
   OPTIMIZER_FEATURES_ENABLE
   DB_BLOCK_SIZE
   DB_FILE_MULTIBLOCK_READ_COUNT
    ◦ DB_FILE_MULTIBLOCK_READ_COUNT * DB_BLOCK_SIZE = <maximum
      operating system I/O size>
   PARALLEL_MAX_SERVERS
    ◦ (CPU_COUNT x PARALLEL_THREADS_PER_CPU x (2 if
      PGA_AGGREGATE_TARGET > 0; otherwise 1) x 5)
    ◦ 2 x DOP x NUMBER_OF_CONCURRENT_USERS
   PARALLEL_ADAPTIVE_MULTI_USER
   QUERY_REWRITE_ENABLED
   QUERY_REWRITE_INTEGRITY
   STAR_TRANSFORMATION_ENABLED



                         25 November 2011                    18
Accessing Oracle Warehouse Builder:

To enable Warehouse Builder, complete the following steps:
 Ensure that you have access to either a Enterprise or Standard Edition
  of the Oracle Database 11g.
 To utilize the default Warehouse Builder schema installed in Oracle
  Database 11g, first unlock the schema.
 Connect to SQL*Plus as the SYS or SYSDBA user. Execute the
  following commands:
   SQL> ALTER USER OWBSYS ACCOUNT UNLOCK;
   SQL> ALTER USER OWBSYS IDENTIFIED BY owbsys_passwd;
 Launch the Warehouse Builder Design Centre.
 For Windows, select Start, Programs, Oracle, Warehouse
  Builder and then select Design Centre.



                              25 November 2011                             19
Accessing Oracle Warehouse Builder:
(cntd.)
   For UNIX, locate owb home/owb/bin/unix and then execute owbclient.sh
   Define a workspace and assign a user to the workspace.
    In the single Warehouse Builder repository, you can define multiple
    workspaces with each workspace corresponding to a set of users working on
    related projects. For instance, you could create a workspace for each of the
    following environments: development, test, and production.
    For simplicity, create one workspace MY_WORKSPACE and assign a user.
    In the Design Centre dialog box, click Show Details and then Workspace
    Management.
    The Repository Assistant displays.
    Following the prompts and accept the default settings in the Repository
    Assistant, you create a workspace and assign a user as the workspace owner.
   Log into the Design Center with the user name and password you created.




                                25 November 2011                               20
References:
   Setting up your data warehouse system
    “http://docs.oracle.com/cd/B28359_01/server.111/b28314/tdpdw_sy
    stem.htm”




                          25 November 2011                        21
Thank you..!!




   25 November 2011   22

More Related Content

What's hot

The Database Environment Chapter 13
The Database Environment Chapter 13The Database Environment Chapter 13
The Database Environment Chapter 13Jeanie Arnoco
 
Introduction to microsoft sql server 2008 r2
Introduction to microsoft sql server 2008 r2Introduction to microsoft sql server 2008 r2
Introduction to microsoft sql server 2008 r2Eduardo Castro
 
Architecting your own DBaaS in a Private Cloud with EM12c (WP)
Architecting your own DBaaS in a Private Cloud with EM12c (WP)Architecting your own DBaaS in a Private Cloud with EM12c (WP)
Architecting your own DBaaS in a Private Cloud with EM12c (WP)Gustavo Rene Antunez
 
Data ware house architecture
Data ware house architectureData ware house architecture
Data ware house architectureDeepak Chaurasia
 
Teradata 13.10
Teradata 13.10Teradata 13.10
Teradata 13.10Teradata
 
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop ProfessionalsBest Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop ProfessionalsCloudera, Inc.
 
Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems divjeev
 
The IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse applianceThe IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse applianceIBM Danmark
 
Hybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsHybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsDavid Portnoy
 
Day 2 Data Stage Manager 11.0
Day 2 Data Stage Manager 11.0Day 2 Data Stage Manager 11.0
Day 2 Data Stage Manager 11.0kshanmug2
 
Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfan
 
Azure Data Factory usage at Aucfanlab
Azure Data Factory usage at AucfanlabAzure Data Factory usage at Aucfanlab
Azure Data Factory usage at AucfanlabAucfan
 
DBA 101 : Calling all New Database Administrators (PPT)
DBA 101 : Calling all New Database Administrators (PPT)DBA 101 : Calling all New Database Administrators (PPT)
DBA 101 : Calling all New Database Administrators (PPT)Gustavo Rene Antunez
 
Breakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data StoreBreakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data StoreCloudera, Inc.
 
Ibm pure data system for analytics n3001
Ibm pure data system for analytics n3001Ibm pure data system for analytics n3001
Ibm pure data system for analytics n3001Abhishek Satyam
 
Hadoop Integration into Data Warehousing Architectures
Hadoop Integration into Data Warehousing ArchitecturesHadoop Integration into Data Warehousing Architectures
Hadoop Integration into Data Warehousing ArchitecturesHumza Naseer
 

What's hot (19)

The Database Environment Chapter 13
The Database Environment Chapter 13The Database Environment Chapter 13
The Database Environment Chapter 13
 
Introduction to microsoft sql server 2008 r2
Introduction to microsoft sql server 2008 r2Introduction to microsoft sql server 2008 r2
Introduction to microsoft sql server 2008 r2
 
Architecting your own DBaaS in a Private Cloud with EM12c (WP)
Architecting your own DBaaS in a Private Cloud with EM12c (WP)Architecting your own DBaaS in a Private Cloud with EM12c (WP)
Architecting your own DBaaS in a Private Cloud with EM12c (WP)
 
Data ware house architecture
Data ware house architectureData ware house architecture
Data ware house architecture
 
Teradata 13.10
Teradata 13.10Teradata 13.10
Teradata 13.10
 
Oracle GoldenGate for Oracle DBAs
Oracle GoldenGate for Oracle DBAsOracle GoldenGate for Oracle DBAs
Oracle GoldenGate for Oracle DBAs
 
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop ProfessionalsBest Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
 
Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems
 
Netezza pure data
Netezza pure dataNetezza pure data
Netezza pure data
 
Open Source Datawarehouse
Open Source DatawarehouseOpen Source Datawarehouse
Open Source Datawarehouse
 
The IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse applianceThe IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse appliance
 
Hybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsHybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop Implementations
 
Day 2 Data Stage Manager 11.0
Day 2 Data Stage Manager 11.0Day 2 Data Stage Manager 11.0
Day 2 Data Stage Manager 11.0
 
Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -
 
Azure Data Factory usage at Aucfanlab
Azure Data Factory usage at AucfanlabAzure Data Factory usage at Aucfanlab
Azure Data Factory usage at Aucfanlab
 
DBA 101 : Calling all New Database Administrators (PPT)
DBA 101 : Calling all New Database Administrators (PPT)DBA 101 : Calling all New Database Administrators (PPT)
DBA 101 : Calling all New Database Administrators (PPT)
 
Breakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data StoreBreakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data Store
 
Ibm pure data system for analytics n3001
Ibm pure data system for analytics n3001Ibm pure data system for analytics n3001
Ibm pure data system for analytics n3001
 
Hadoop Integration into Data Warehousing Architectures
Hadoop Integration into Data Warehousing ArchitecturesHadoop Integration into Data Warehousing Architectures
Hadoop Integration into Data Warehousing Architectures
 

Viewers also liked

Data warehousing labs maunal
Data warehousing labs maunalData warehousing labs maunal
Data warehousing labs maunalEducation
 
Data Warehousing Practical for T.Y.I.T.
Data Warehousing Practical for T.Y.I.T.Data Warehousing Practical for T.Y.I.T.
Data Warehousing Practical for T.Y.I.T.Niraj Bharambe
 
The Data Warehouse Lifecycle
The Data Warehouse LifecycleThe Data Warehouse Lifecycle
The Data Warehouse Lifecyclebartlowe
 
TYBSC IT SEM 6 PROJECT MANAGEMENT NOTES
TYBSC IT SEM 6 PROJECT MANAGEMENT NOTESTYBSC IT SEM 6 PROJECT MANAGEMENT NOTES
TYBSC IT SEM 6 PROJECT MANAGEMENT NOTESWE-IT TUTORIALS
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSINGKing Julian
 

Viewers also liked (8)

Data warehousing labs maunal
Data warehousing labs maunalData warehousing labs maunal
Data warehousing labs maunal
 
Data Warehousing Practical for T.Y.I.T.
Data Warehousing Practical for T.Y.I.T.Data Warehousing Practical for T.Y.I.T.
Data Warehousing Practical for T.Y.I.T.
 
Module Owb Process Flows
Module Owb Process FlowsModule Owb Process Flows
Module Owb Process Flows
 
Module Owb Basics
Module Owb BasicsModule Owb Basics
Module Owb Basics
 
Module Owb Lifecycle
Module Owb LifecycleModule Owb Lifecycle
Module Owb Lifecycle
 
The Data Warehouse Lifecycle
The Data Warehouse LifecycleThe Data Warehouse Lifecycle
The Data Warehouse Lifecycle
 
TYBSC IT SEM 6 PROJECT MANAGEMENT NOTES
TYBSC IT SEM 6 PROJECT MANAGEMENT NOTESTYBSC IT SEM 6 PROJECT MANAGEMENT NOTES
TYBSC IT SEM 6 PROJECT MANAGEMENT NOTES
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 

Similar to Oracle 11g data warehouse introdution

Top 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous DatabaseTop 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous DatabaseSandesh Rao
 
Sql Health in a SharePoint environment
Sql Health in a SharePoint environmentSql Health in a SharePoint environment
Sql Health in a SharePoint environmentEnrique Lima
 
History of Oracle and Databases
History of Oracle and DatabasesHistory of Oracle and Databases
History of Oracle and DatabasesConnor McDonald
 
MySQL Cluster Asynchronous replication (2014)
MySQL Cluster Asynchronous replication (2014) MySQL Cluster Asynchronous replication (2014)
MySQL Cluster Asynchronous replication (2014) Frazer Clement
 
Oracle 11gR2 plain servers vs Exadata - 2013
Oracle 11gR2 plain servers vs Exadata - 2013Oracle 11gR2 plain servers vs Exadata - 2013
Oracle 11gR2 plain servers vs Exadata - 2013Connor McDonald
 
Oracle Database 11g SQL Tuning Workshop - Student Guide.pdf
Oracle Database 11g SQL Tuning Workshop - Student Guide.pdfOracle Database 11g SQL Tuning Workshop - Student Guide.pdf
Oracle Database 11g SQL Tuning Workshop - Student Guide.pdfRajendra Jain
 
SQLintersection keynote a tale of two teams
SQLintersection keynote a tale of two teamsSQLintersection keynote a tale of two teams
SQLintersection keynote a tale of two teamsSumeet Bansal
 
Things learned from OpenWorld 2013
Things learned from OpenWorld 2013Things learned from OpenWorld 2013
Things learned from OpenWorld 2013Connor McDonald
 
Ceph Day Beijing: Newstore: Design, Implementation, and Next Step Work
Ceph Day Beijing: Newstore: Design, Implementation, and Next Step Work Ceph Day Beijing: Newstore: Design, Implementation, and Next Step Work
Ceph Day Beijing: Newstore: Design, Implementation, and Next Step Work Ceph Community
 
Automate DG Best Practices
Automate DG  Best PracticesAutomate DG  Best Practices
Automate DG Best PracticesMohsen B
 
Ashnik EnterpriseDB PostgreSQL - A real alternative to Oracle
Ashnik EnterpriseDB PostgreSQL - A real alternative to Oracle Ashnik EnterpriseDB PostgreSQL - A real alternative to Oracle
Ashnik EnterpriseDB PostgreSQL - A real alternative to Oracle Ashnikbiz
 
Using oracle12c pluggable databases to archive
Using oracle12c pluggable databases to archiveUsing oracle12c pluggable databases to archive
Using oracle12c pluggable databases to archiveSecure-24
 
Long and winding road - 2014
Long and winding road  - 2014Long and winding road  - 2014
Long and winding road - 2014Connor McDonald
 
0396 oracle-goldengate-12c-tutorial
0396 oracle-goldengate-12c-tutorial0396 oracle-goldengate-12c-tutorial
0396 oracle-goldengate-12c-tutorialKlausePaulino
 
GLOC 2014 NEOOUG - Oracle Database 12c New Features
GLOC 2014 NEOOUG - Oracle Database 12c New FeaturesGLOC 2014 NEOOUG - Oracle Database 12c New Features
GLOC 2014 NEOOUG - Oracle Database 12c New FeaturesBiju Thomas
 
Offer faster access to critical data and achieve greater inline data reductio...
Offer faster access to critical data and achieve greater inline data reductio...Offer faster access to critical data and achieve greater inline data reductio...
Offer faster access to critical data and achieve greater inline data reductio...Principled Technologies
 
Large Scale SQL Considerations for SharePoint Deployments
Large Scale SQL Considerations for SharePoint DeploymentsLarge Scale SQL Considerations for SharePoint Deployments
Large Scale SQL Considerations for SharePoint DeploymentsJoel Oleson
 

Similar to Oracle 11g data warehouse introdution (20)

Top 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous DatabaseTop 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous Database
 
11g R2
11g R211g R2
11g R2
 
Oracle Database 12c : Multitenant
Oracle Database 12c : MultitenantOracle Database 12c : Multitenant
Oracle Database 12c : Multitenant
 
Sql Health in a SharePoint environment
Sql Health in a SharePoint environmentSql Health in a SharePoint environment
Sql Health in a SharePoint environment
 
History of Oracle and Databases
History of Oracle and DatabasesHistory of Oracle and Databases
History of Oracle and Databases
 
MySQL Cluster Asynchronous replication (2014)
MySQL Cluster Asynchronous replication (2014) MySQL Cluster Asynchronous replication (2014)
MySQL Cluster Asynchronous replication (2014)
 
Oracle 11gR2 plain servers vs Exadata - 2013
Oracle 11gR2 plain servers vs Exadata - 2013Oracle 11gR2 plain servers vs Exadata - 2013
Oracle 11gR2 plain servers vs Exadata - 2013
 
Oracle Database 11g SQL Tuning Workshop - Student Guide.pdf
Oracle Database 11g SQL Tuning Workshop - Student Guide.pdfOracle Database 11g SQL Tuning Workshop - Student Guide.pdf
Oracle Database 11g SQL Tuning Workshop - Student Guide.pdf
 
SQLintersection keynote a tale of two teams
SQLintersection keynote a tale of two teamsSQLintersection keynote a tale of two teams
SQLintersection keynote a tale of two teams
 
Things learned from OpenWorld 2013
Things learned from OpenWorld 2013Things learned from OpenWorld 2013
Things learned from OpenWorld 2013
 
Resize sga
Resize sgaResize sga
Resize sga
 
Ceph Day Beijing: Newstore: Design, Implementation, and Next Step Work
Ceph Day Beijing: Newstore: Design, Implementation, and Next Step Work Ceph Day Beijing: Newstore: Design, Implementation, and Next Step Work
Ceph Day Beijing: Newstore: Design, Implementation, and Next Step Work
 
Automate DG Best Practices
Automate DG  Best PracticesAutomate DG  Best Practices
Automate DG Best Practices
 
Ashnik EnterpriseDB PostgreSQL - A real alternative to Oracle
Ashnik EnterpriseDB PostgreSQL - A real alternative to Oracle Ashnik EnterpriseDB PostgreSQL - A real alternative to Oracle
Ashnik EnterpriseDB PostgreSQL - A real alternative to Oracle
 
Using oracle12c pluggable databases to archive
Using oracle12c pluggable databases to archiveUsing oracle12c pluggable databases to archive
Using oracle12c pluggable databases to archive
 
Long and winding road - 2014
Long and winding road  - 2014Long and winding road  - 2014
Long and winding road - 2014
 
0396 oracle-goldengate-12c-tutorial
0396 oracle-goldengate-12c-tutorial0396 oracle-goldengate-12c-tutorial
0396 oracle-goldengate-12c-tutorial
 
GLOC 2014 NEOOUG - Oracle Database 12c New Features
GLOC 2014 NEOOUG - Oracle Database 12c New FeaturesGLOC 2014 NEOOUG - Oracle Database 12c New Features
GLOC 2014 NEOOUG - Oracle Database 12c New Features
 
Offer faster access to critical data and achieve greater inline data reductio...
Offer faster access to critical data and achieve greater inline data reductio...Offer faster access to critical data and achieve greater inline data reductio...
Offer faster access to critical data and achieve greater inline data reductio...
 
Large Scale SQL Considerations for SharePoint Deployments
Large Scale SQL Considerations for SharePoint DeploymentsLarge Scale SQL Considerations for SharePoint Deployments
Large Scale SQL Considerations for SharePoint Deployments
 

Recently uploaded

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

Recently uploaded (20)

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

Oracle 11g data warehouse introdution

  • 1. SCHOOL OF COMPUTER SCIENCE & IT- DAVV, INDORE Presentation on Oracle 11g Data warehouse Introduction Presented by: Aditya Trivedi aadi.trivedi01@gmail.com
  • 2. Goal:  The goal of this presentation is to introduce you to the data warehousing solutions available in the Oracle Database.  The primary interfaces used are  Oracle Enterprise Manager,  Oracle Warehouse Builder, and  SQL*Plus. 25 November 2011 2
  • 3. Data Warehouse:  A data warehouse is a relational or multidimensional database that is designed for query and analysis. They are not optimized for transaction processing, which is the domain of OLTP systems.  Data warehouses usually consolidate historical and transactional data derived from multiple sources.  Data in a data warehouse usually stores many months or years of data to support historical analysis.  The data in a data warehouse is typically loaded through an extraction, transformation, and loading (ETL) process from one or more data sources such as OLTP applications, mainframe applications, or external data providers. 25 November 2011 3
  • 4. The Key Characteristics of a Data Warehouse: The key characteristics of a data warehouse are as follows:  Some data is denormalized for simplification and to improve performance.  Large amounts of historical data are used.  Queries often retrieve large amounts of data.  Both planned and ad hoc queries are common.  The data load is controlled.  In general, high data throughput is the key to a successful data warehouse. 25 November 2011 4
  • 5. Common Oracle Data Warehousing Tasks: As an Oracle data warehousing administrator or designer, you can expect to be involved in the following tasks:  Configuring an Oracle database for use as a data warehouse  Designing data warehouses  Performing upgrades of the database and software to new release levels  Managing schema objects, such as tables, indexes, and materialized views  Managing users and security  Developing routines used for the Extraction, Transformation, and Loading (ETL) process  Creating reports based on the data in the data warehouse  Backing up the data warehouse and performing recovery when necessary  Monitoring the data warehouse's performance and taking preventive or corrective action as required 25 November 2011 5
  • 6. Setting up your Data Warehouse System: This section discusses how to initially configure your data warehouse environment. It includes the following topics:  General Steps for Setting up a Data Warehouse System  Preparing the Environment  Setting Up a Database for a Data Warehouse  Accessing Oracle Warehouse Builder 25 November 2011 6
  • 7. General Steps for Setting up a Data Warehouse System:  Size and configure your hardware.  Install the Oracle Database software.  Optimize the Database for use as a data warehouse.  Access the Oracle Warehouse Builder software. 25 November 2011 7
  • 8. Preparing the Environment: For better throughput and performance we need to follow these steps:  How Many CPUs and What Clock Speed Do I Need?  How Much Memory Do I Need?  How Many Disks Do I Need?  How Do I Determine Sufficient I/O Bandwidth? 25 November 2011 8
  • 9. How Many CPUs and What Clock Speed Do I Need? You must have sufficient CPU power to perform the data warehouse operations. Use the estimated highest throughput as a guideline for the number of CPUs you need. As a rough estimate, use the following formula: <number of CPUs> = <maximum throughput in MB/s> / 200 25 November 2011 9
  • 10. How Much Memory Do I Need? Memory in a data warehouse is particularly important for processing memory-intensive operations such as large sorts. The number of CPUs provides you a good guideline for the amount of memory you need. Use the following simplified formula to derive the amount of memory you need from the CPUs you selected: <amount of memory in GB> = 2 * <number of CPUs> 25 November 2011 10
  • 11. How Many Disks Do I Need?  Use the throughput you require to find out how many disk arrays you need.  Use the storage provider's specifications to find out how much throughput a disk array can sustain. Note: Storage providers measure in Gb per second, and your initial throughput estimate is based on MB second. An average disk controller has a maximum throughput of 2 Gb second, which translates to a sustainable throughput of about (70% * 2 Gbit/s) /8 = 180 MB/s. Use the following formula to determine the number of disk arrays you need: <number of disk controllers> = <throughput in MB/s> / <individual controller throughput in MB/s> 25 November 2011 11
  • 12. How Do I Determine Sufficient I/O Bandwidth?  Host Bus Adapters (HBAs)- between server and storage  Switches- between servers and SAN  Ethernet adapters- for network connectivity  Wires- to connect individual components  Use the conversion in the following table to translate the vendors' maximum throughput numbers in bits into sustainable throughput numbers in bytes. Component Bits Bytes/second HBA 2 Gbit 200 MB 16 port switches 8*2 Gbits 1200 MB Fibre Channel 2 Gbit 200 MB GigE NIC 1 Gbit 80 MB Inf-2 Gbit 2 Gbit 160 MB 25 November 2011 12
  • 13. Verifying the Data Warehouse Hardware Configuration: Two tools for verifying throughput are  ddutility  Orion An Oracle-supplied tool (we’ll consider here dd utility only) 25 November 2011 13
  • 14. dd Utility: First, the most important options for using dd are the following: bs=BYTES: Read BYTES bytes at a time; use 1 MB count=BLOCKS: copy only BLOCKS input blocks if=FILE: read from FILE; of=FILE: write to FILE; set to /dev/null to evaluate read performance; write to disk would erase all existing data!!! skip=BLOCKS: skip BLOCKS BYTES-sized blocks at start of input 25 November 2011 14
  • 15. dd Utility (...) The following dd command performs random sequential disk access across two devices reading a total of 2 GB. The throughput is 2 GB divided by the time it takes to finish the following command: 25 November 2011 15
  • 16. Setting Up a Database for a Data Warehouse: You can use Oracle Enterprise Manager to set up your data warehouse.  To view various parameter settings, navigate to the Database page, then click, ◦ Server  Under Database Configuration, click ◦ Memory Parameters or ◦ All Initialization Parameters. 25 November 2011 16
  • 17. Memory management parameters: Two types of memory:  Shared memory(SGA): SGA_TARGET ◦ Allow 25% of total memory ◦ Minimum 100 MB  Session-based memory(PGA):PGA_AGGREGATE_TARGET ◦ PGA_AGGREGATE_TARGET = 3 * SGA_TARGET 25 November 2011 17
  • 18. Initialization parameters:  COMPATIBLE  OPTIMIZER_FEATURES_ENABLE  DB_BLOCK_SIZE  DB_FILE_MULTIBLOCK_READ_COUNT ◦ DB_FILE_MULTIBLOCK_READ_COUNT * DB_BLOCK_SIZE = <maximum operating system I/O size>  PARALLEL_MAX_SERVERS ◦ (CPU_COUNT x PARALLEL_THREADS_PER_CPU x (2 if PGA_AGGREGATE_TARGET > 0; otherwise 1) x 5) ◦ 2 x DOP x NUMBER_OF_CONCURRENT_USERS  PARALLEL_ADAPTIVE_MULTI_USER  QUERY_REWRITE_ENABLED  QUERY_REWRITE_INTEGRITY  STAR_TRANSFORMATION_ENABLED 25 November 2011 18
  • 19. Accessing Oracle Warehouse Builder: To enable Warehouse Builder, complete the following steps:  Ensure that you have access to either a Enterprise or Standard Edition of the Oracle Database 11g.  To utilize the default Warehouse Builder schema installed in Oracle Database 11g, first unlock the schema.  Connect to SQL*Plus as the SYS or SYSDBA user. Execute the following commands: SQL> ALTER USER OWBSYS ACCOUNT UNLOCK; SQL> ALTER USER OWBSYS IDENTIFIED BY owbsys_passwd;  Launch the Warehouse Builder Design Centre.  For Windows, select Start, Programs, Oracle, Warehouse Builder and then select Design Centre. 25 November 2011 19
  • 20. Accessing Oracle Warehouse Builder: (cntd.)  For UNIX, locate owb home/owb/bin/unix and then execute owbclient.sh  Define a workspace and assign a user to the workspace. In the single Warehouse Builder repository, you can define multiple workspaces with each workspace corresponding to a set of users working on related projects. For instance, you could create a workspace for each of the following environments: development, test, and production. For simplicity, create one workspace MY_WORKSPACE and assign a user. In the Design Centre dialog box, click Show Details and then Workspace Management. The Repository Assistant displays. Following the prompts and accept the default settings in the Repository Assistant, you create a workspace and assign a user as the workspace owner.  Log into the Design Center with the user name and password you created. 25 November 2011 20
  • 21. References:  Setting up your data warehouse system “http://docs.oracle.com/cd/B28359_01/server.111/b28314/tdpdw_sy stem.htm” 25 November 2011 21
  • 22. Thank you..!! 25 November 2011 22