Data validation option


Published on

Published in: Technology
1 Like
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Data validation option

  1. 1. • What is Data Validation Option What can be tested
  2. 2. What is Data Validation Option  Data Validation Option is a ETL Testing tool which comes along with power center and enables to test and validate data  Data Validation Option reads table definitions from PowerCenter metadata repositories, and checks the data for inconsistencies What can be tested  Verify that moved or transformed data is complete and accurate and has not been changed because of errors in the movement or transformation process  Will not check the correctness of transformations or mappings designed in Informatica
  3. 3. Data Validation Workflow A typical workflow for data validation consists of multiple tasks-  Data Validation Option reads one or more PowerCenter metadata repositories.  We define the validation rules(creation of tests) in Data Validation Option  We run the rules(tests) to ensure the data conforms to the validation rules. When w Validation Option performs the following tasks: 1. 2. Creates and executes all tests through PowerCenter. Loads results into the Data Validation Option results database and displays them in the Data Valida  We examine the results and identify data inconsistencies in the ETL process  You repeat this process for new records.
  4. 4. Data Validation Option Client Layout Overview The Data Validation Option Client contains multiple areas that allow us to perform different tasks. • allow us to perform different tasks.
  5. 5. Repositories…  Add  Modify  Delete  Refresh  Exporting Repository Metadata
  6. 6. Table Pairs A table pair is the basis for all tests that compare one table to another. We can select a relational table, flat file, lookup view, or SQL view as one or both tables in a table pair.  Add  Modify  Delete
  7. 7. Tests for Table Pairs We can createrun the following types of tests on table pairs: Aggregate • Includes COUNT, COUNT_DISTINCT, COUNT_ROWS, MIN, MAX, AVG, and SUM. • Set • Includes AinB, BinA, and AeqB. • Value • Includes VALUE and OUTER_VALUE . Note: When we run tests, the target folder must be closed in the Designer and Workflow Manager. If the target folder is open, Data Validation Option cannot write to the folder, and the tests return an error
  8. 8. Single Table Constraints Pairs  single-table pair will be used to run tests on a single table. We can enforce valid values, aggregates, formats, and uniqueness  To run single-table constraints, we must create a single table. You can select a relational table, flat file, lookup view, or SQL view as a single table
  9. 9. SQL Views  SQL views facilitate the use of more complex functionality for single tables and table pairs.  An SQL view allows to use several tables and several calculations in a query to produce a set of fields that you can use as a table in a single table or table pair.  This functionality is similar to the SQL override in PowerCenter or a view in a relational database. You can use any valid SQL statement to create an SQL view.
  10. 10. Lookup Views  Data Validation Option lookup views allows to test the validity of the lookup logic in transformation layer  One disadvantage of the SQL view is that it does not allow the use of flat files or heterogeneous database tables  Join two heterogeneous sources with a lookup view
  11. 11. Import and export of Tests  Data Validation Option allows you to export and import test metadata.  Metadata import and export allows users to share tests and allows rapid generation o scripting.  You can import and export the following metadata: • Table Pairs • Single Tables • PowerCenter Sources • SQL views • Lookup views • Join views
  12. 12. Reports Generation Data Validation Option stores all test definitions and test results in the Data Validation Option repository. We can run reports to display test definitions and results. Types of reports:  Summary of Testing Activities  Table Pair Summary  Detailed Test Results
  13. 13. Run tests through Command prompt Invoke Data Validation Option capabilities at the command line. Running tests at the command line allows you to schedule test execution. It also all embed a specific test as part of the ETL workflow or as part of another process. For can create an ETL process that moves data from source to staging, runs validation, data into the target or an error table based on the validation results. Send Emails  Data Validation Option sends an email when the tests are complete.  You can provide multiple email addresses separated by commas.  The email specifies whether the test has passed or failed and provides a link to the results.
  14. 14. END