SlideShare a Scribd company logo
Ab Initio OverviewAb Initio Overview
Co>Operating
system
EME
DTM
GDE
User
User
User
Create all
your
graphs
Graph when
deployed
generate .ksh
Run all your
graphs
Store all variables
in a repository / is
also used for
control / also
collects all
metadata about
graph developed
in GDE
Used to schedule graphs developed in
GDE. It also has capability to maintain
dependencies between graphs
Continue To Next SlideContinue To Next Slide
More Details VisitMore Details Visit
 More Details blog:http://sandyclassic.wordpress.com
 linkedin:https://www.linkedin.com/in/sandepsharma
 slideshare:
http://www.slideshare.net/SandeepSharma65
 facebook:https://facebook.com/sandeepclassic
 google+ http://google.com/+SandeepSharmaa
 Twitter: https://twitter.com/sandeeclassic
 http://thedatawarehouseclassics.wordpress.com
 http://businessintelligencetechnologytrend.wordpress.
com
About Ab InitioAbout Ab Initio
 Ab Initio is a general purpose data processing platform for enterpriseAb Initio is a general purpose data processing platform for enterprise
class, mission critical applications such as data warehousing,class, mission critical applications such as data warehousing,
clickstream processing, data movement, data transformation andclickstream processing, data movement, data transformation and
analytics.analytics.
 Supports integration of arbitrary data sources and programs, andSupports integration of arbitrary data sources and programs, and
provides complete metadata management across the enterprise.provides complete metadata management across the enterprise.
 Proven best of breed ETL solution.Proven best of breed ETL solution.
 Applications of Ab Initio:Applications of Ab Initio:
– ETL for data warehouses, data marts and operational data sources.ETL for data warehouses, data marts and operational data sources.
– Parallel data cleansing and validation.Parallel data cleansing and validation.
– Parallel data transformation and filtering.Parallel data transformation and filtering.
– High performance analyticsHigh performance analytics
– Real time, parallel data capture.Real time, parallel data capture.
Ab Initio ArchitectureAb Initio Architecture
Native Operating System
UNIX Windows NT
Ab Initio Co>Operating System
Component
Library
User-defined
Components
Third Party
Components
Application Development Environments
Graphical C ++ Shell
Applications
Ab Initio
Metadata
Repository
Co>Operating SystemCo>Operating System
 The Co>Operating System is core software that unites aThe Co>Operating System is core software that unites a
network of computing resources-CPUs, storage disks,network of computing resources-CPUs, storage disks,
programs, datasets-into a production-quality dataprograms, datasets-into a production-quality data
processing system with scalable performance andprocessing system with scalable performance and
mainframe reliabilitymainframe reliability..
 The Co>Operating System is layered on top of the nativeThe Co>Operating System is layered on top of the native
operating systems of a collection of computers. It providesoperating systems of a collection of computers. It provides
a distributed model for process execution, filea distributed model for process execution, file
management, process monitoring, check-pointing, andmanagement, process monitoring, check-pointing, and
debugging.debugging.
Co>Operating SystemCo>Operating System
 The Graphical Development Environment (GDE) providesThe Graphical Development Environment (GDE) provides
a graphical user interface into the services of thea graphical user interface into the services of the
Co>Operating System.Co>Operating System.
 Unlimited scalability : Data parallelism results in speedupsUnlimited scalability : Data parallelism results in speedups
proportional to the hardware resources provided, doubleproportional to the hardware resources provided, double
the number of CPUs and execution time is halved.the number of CPUs and execution time is halved.
 Flexibility : Provides a powerful and efficient dataFlexibility : Provides a powerful and efficient data
transformation engine and an open component model fortransformation engine and an open component model for
extending and customizing Ab Initio’s functionality.extending and customizing Ab Initio’s functionality.
 Portability : Runs heterogeneously across a huge variety ofPortability : Runs heterogeneously across a huge variety of
operating system and hardware platforms.operating system and hardware platforms.
Graphical Development EnvironmentGraphical Development Environment
(GDE)(GDE)
 GDE lets create applications by dragging and droppingGDE lets create applications by dragging and dropping
components onto a canvas configuring them withcomponents onto a canvas configuring them with
familiar, intuitive point and click operations, andfamiliar, intuitive point and click operations, and
connecting them into executable flowcharts.connecting them into executable flowcharts.
 These diagrams are architectural documents thatThese diagrams are architectural documents that
developers and managers alike can understand anddevelopers and managers alike can understand and
use, but they are not mere pictures: the co>operatinguse, but they are not mere pictures: the co>operating
system executes these flowcharts directly. This meanssystem executes these flowcharts directly. This means
that there is a seamless and solid connection betweenthat there is a seamless and solid connection between
the abstract picture of the application and the concretethe abstract picture of the application and the concrete
reality of its execution.reality of its execution.
Ab Initio S/w Versions & File ExtensionsAb Initio S/w Versions & File Extensions
 Software VersionsSoftware Versions
– Co>Operating System Version =>Co>Operating System Version =>
– GDE Version =>GDE Version =>
 File ExtensionsFile Extensions
– .mp.mp Stored Ab Initio graph or graph componentStored Ab Initio graph or graph component
– .mpc.mpc Program or custom componentProgram or custom component
– .mdc.mdc Dataset or custom dataset componentDataset or custom dataset component
– .dml.dml Data Manipulation Language file or record typeData Manipulation Language file or record type
definitiondefinition
– .xfr.xfr Transform function fileTransform function file
– .dat.dat Data file (either serial file or multifile)Data file (either serial file or multifile)
Connecting to Co>op Server from GDEConnecting to Co>op Server from GDE
Host Profile SettingHost Profile Setting
1.1. Choose settings from the run menuChoose settings from the run menu
2.2. Check the use host profile setting checkbox.Check the use host profile setting checkbox.
3.3. Click Edit button to open the Host profile dialog.Click Edit button to open the Host profile dialog.
4.4. If running Ab Initio on your local NT system, check LocalIf running Ab Initio on your local NT system, check Local
Execution (NT) checkbox and go to step 6.Execution (NT) checkbox and go to step 6.
5.5. If running Ab Initio on a Remote UNIX system, fill in theIf running Ab Initio on a Remote UNIX system, fill in the
path to the Host and Host Login and Password.path to the Host and Host Login and Password.
6.6. Type the full path of Host directory.Type the full path of Host directory.
7.7. Select the Shell Type from pull down menu.Select the Shell Type from pull down menu.
8.8. Test Login and if necessary make changes.Test Login and if necessary make changes.
Host ProfileHost Profile
Select the
Shell Type
Enter Host,
Login,
Password &
Host directory
Ab Initio ComponentsAb Initio Components
Ab Initio provided
components. Datasets,
Partition, Transform,
Sort, Database are
frequently used.
Creating GraphCreating Graph
Type the
Label
Specify the
Input .dat
file
Create Graph - DmlCreate Graph - Dml
 Propagate from Neighbors: CopyPropagate from Neighbors: Copy
record formats from connected flow.record formats from connected flow.
 Same As: Copy record format’sSame As: Copy record format’s
from a specific component’s port.from a specific component’s port.
 Path: Store record formats in aPath: Store record formats in a
Local file, Host File, or in the AbLocal file, Host File, or in the Ab
Initio repository.Initio repository.
 Embedded: Type the record formatEmbedded: Type the record format
directly in a string.directly in a string.
Specify
the .dml file
Creating Graph - dmlCreating Graph - dml
 DML is Ab Initio’s DataDML is Ab Initio’s Data
Manipulation Language.Manipulation Language.
 DML describes data in termsDML describes data in terms
ofof
– Record Formats that list theRecord Formats that list the
fields and format of input,fields and format of input,
output, and intermediateoutput, and intermediate
records.records.
– Expressions that defineExpressions that define
simple computations, forsimple computations, for
example, selection.example, selection.
– Transform Functions thatTransform Functions that
control reformatting,control reformatting,
aggregation, and other dataaggregation, and other data
transformations.transformations.
– Keys that specify groupings,Keys that specify groupings,
ordering, and partitioningordering, and partitioning
relationships betweenrelationships between
records.records.
Editing .dml file through
Record Format Editor – Grid
View
Creating Graph - TransformCreating Graph - Transform
 A transform function is either aA transform function is either a
DML file or a DML string thatDML file or a DML string that
describes how you manipulatedescribes how you manipulate
your data.your data.
 Ab Initio transform functionsAb Initio transform functions
mainly consist of a series ofmainly consist of a series of
assignment statements. Eachassignment statements. Each
statement is called a businessstatement is called a business
rule.rule.
 When Ab Initio evaluates aWhen Ab Initio evaluates a
transform function, it performstransform function, it performs
following tasks:following tasks:
– Initializes local variablesInitializes local variables
– Evaluates statementsEvaluates statements
– Evaluates rules.Evaluates rules.
 Transform function files have theTransform function files have the
xfr extension.xfr extension.
Specify the .xfr file
Creating Graph - xfrCreating Graph - xfr
 Transform functions: A setTransform functions: A set
of rules that computeof rules that compute
output values from inputoutput values from input
values.values.
 Business rule: Part of aBusiness rule: Part of a
transform function thattransform function that
describes how youdescribes how you
manipulate one field ofmanipulate one field of
your output data.your output data.
 Variable: Optional part of aVariable: Optional part of a
transform function thattransform function that
provides storage forprovides storage for
temporary values.temporary values.
 Statement: Optional part ofStatement: Optional part of
a transform function thata transform function that
assigns values of variablesassigns values of variables
in a specific order.in a specific order.
Sample ComponentsSample Components
 SortSort
 DedupDedup
 JoinJoin
 ReplicateReplicate
 RollupRollup
 Filter by ExpressionFilter by Expression
 MergeMerge
 LookupLookup
 Reformat etc.Reformat etc.
Creating Graph – Sort ComponentCreating Graph – Sort Component
 Sort: The sort componentSort: The sort component
reorders data. Itreorders data. It
comprises twocomprises two
parameters: Key andparameters: Key and
max-core.max-core.
 Key: The Key is one ofKey: The Key is one of
the parameters for Sortthe parameters for Sort
component whichcomponent which
describes the collationdescribes the collation
order.order.
 Max-core: The max-coreMax-core: The max-core
parameter controls howparameter controls how
often the sort componentoften the sort component
dumps data fromdumps data from
memory to disk.memory to disk.
Specify Key for
the Sort
Creating Graph – Dedup componentCreating Graph – Dedup component
 Dedup componentDedup component
removes duplicateremoves duplicate
records.records.
 Dedup criteria willDedup criteria will
be either unique-be either unique-
only, First or Last.only, First or Last.
Select Dedup criteria.
Creating Graph – Replicate ComponentCreating Graph – Replicate Component
 ReplicateReplicate
combines thecombines the
data records fromdata records from
the inputs intothe inputs into
one flow andone flow and
writes a copy ofwrites a copy of
that flow to eachthat flow to each
of its output ports.of its output ports.
 Use Replicate toUse Replicate to
supportsupport
componentcomponent
parallelism.parallelism.
Creating Graph – Join ComponentCreating Graph – Join Component
• Specify the key for join
• Specify Type of Join
Database Configuration (.dbc)Database Configuration (.dbc)
 A file with a .dbc extension which provides the GDE withA file with a .dbc extension which provides the GDE with
the information it needs to connect to a database. Athe information it needs to connect to a database. A
configuration file contains the following information:configuration file contains the following information:
– The name and version number of the database to which you wantThe name and version number of the database to which you want
to connect.to connect.
– The name of the computer on which the database instance orThe name of the computer on which the database instance or
server to which you want to connect runs, or on which the databaseserver to which you want to connect runs, or on which the database
remote access software is installed.remote access software is installed.
– The name of the database instance, server, or provider to whichThe name of the database instance, server, or provider to which
you want to connect.you want to connect.
– You generate a configuration file by using the Properties dialog boxYou generate a configuration file by using the Properties dialog box
for one of the Database components.for one of the Database components.
Creating Parallel ApplicationsCreating Parallel Applications
 Types of Parallel ProcessingTypes of Parallel Processing
– Component-level Parallelism: An application with multipleComponent-level Parallelism: An application with multiple
components running simultaneously on separate data usescomponents running simultaneously on separate data uses
component parallelism.component parallelism.
– Pipeline parallelism: An application with multiple componentsPipeline parallelism: An application with multiple components
running simultaneously on the same data uses pipeline parallelism.running simultaneously on the same data uses pipeline parallelism.
– Data Parallelism: An application with data divided into segmentsData Parallelism: An application with data divided into segments
that operates on each segment simultaneously uses datathat operates on each segment simultaneously uses data
parallelism.parallelism.
Partition ComponentsPartition Components
 Partition by Expression: Dividing data according to a DML expression.Partition by Expression: Dividing data according to a DML expression.
 Partition by Key: Grouping data by a key.Partition by Key: Grouping data by a key.
 Partition with Load balance: Dynamic load balancing.Partition with Load balance: Dynamic load balancing.
 Partition by Percentage: Distributing data, so the output is proportionalPartition by Percentage: Distributing data, so the output is proportional
to fractions of 100.to fractions of 100.
 Partition by Range: Dividing data evenly among nodes, based on a keyPartition by Range: Dividing data evenly among nodes, based on a key
and a set of partitioning ranges.and a set of partitioning ranges.
 Partition by Round-robin: Distributing data evenly, in blocksize chunks,Partition by Round-robin: Distributing data evenly, in blocksize chunks,
across the output partitions.across the output partitions.
Departition ComponentsDepartition Components
 Concatenate: Concatenate component produces a single output flowConcatenate: Concatenate component produces a single output flow
that contains first all the records from the first input partition, then allthat contains first all the records from the first input partition, then all
the records from the second input partition and so on.the records from the second input partition and so on.
 Gather: Gather component collects inputs from multiple partitions in anGather: Gather component collects inputs from multiple partitions in an
arbitrary manner, and produces a single output flow, does not maintainarbitrary manner, and produces a single output flow, does not maintain
sort order.sort order.
 Interleave: Interleave component collects records from many sourcesInterleave: Interleave component collects records from many sources
in round robin fashion.in round robin fashion.
 Merge: Merge component collects inputs from multiple sorted partitionsMerge: Merge component collects inputs from multiple sorted partitions
and maintains the sort order.and maintains the sort order.
Multifile systemsMultifile systems
 A multifile system is a specially created set of directories, possibly onA multifile system is a specially created set of directories, possibly on
different machines, which have identical substructure.different machines, which have identical substructure.
 Each directory is a partition of the multifile system. When a multifile isEach directory is a partition of the multifile system. When a multifile is
placed in a multifile system, its partitions are files within each of theplaced in a multifile system, its partitions are files within each of the
partitions of the multifile system.partitions of the multifile system.
 Multifile system leads to better performance than flat file systemsMultifile system leads to better performance than flat file systems
because multifile systems can divide your data among multiple disks orbecause multifile systems can divide your data among multiple disks or
CPUs.CPUs.
 Typically (SMP machine is exception) a multifile system is created withTypically (SMP machine is exception) a multifile system is created with
the control partition on one node and data partitions on other nodes tothe control partition on one node and data partitions on other nodes to
distribute the work and improve performance.distribute the work and improve performance.
 To do this use full internet URLs that specify file and directory namesTo do this use full internet URLs that specify file and directory names
and locations on remote machines.and locations on remote machines.
MultifileMultifile
SANDBOXSANDBOX
 A sandbox is a collection of graphs and related files thatA sandbox is a collection of graphs and related files that
are stored in a single directory tree, and treated as a groupare stored in a single directory tree, and treated as a group
for purposes of version control, navigation, and migration.for purposes of version control, navigation, and migration.
 A sandbox can be a file system copy of a datastore projectA sandbox can be a file system copy of a datastore project..
 In the graph, instead of specifying the entire path for anyIn the graph, instead of specifying the entire path for any
file location ,we specify only the sandbox parameterfile location ,we specify only the sandbox parameter
variable. For ex : $AI_IN_DATA/customer_info.dat. wherevariable. For ex : $AI_IN_DATA/customer_info.dat. where
$AI_IN_DATA contains the entire path with reference to$AI_IN_DATA contains the entire path with reference to
the sandbox $AI_HOME variable.the sandbox $AI_HOME variable.
 The actual in_data dir is $AI_HOME/in_data in sandboxThe actual in_data dir is $AI_HOME/in_data in sandbox
SANDBOXSANDBOX
 The sandbox provides an excellent mechanism toThe sandbox provides an excellent mechanism to
maintain uniqueness while moving frommaintain uniqueness while moving from
development to production environment by meansdevelopment to production environment by means
switch parameters.switch parameters.
 We can define parameters in sandbox those canWe can define parameters in sandbox those can
be used across all the graphs pertaining to thatbe used across all the graphs pertaining to that
sandbox.sandbox.
 The topmost variable $PROJECT_DIR containsThe topmost variable $PROJECT_DIR contains
the path of the home directorythe path of the home directory
SANDBOXSANDBOX
DeployingDeploying
 Every graph after validation and testing has to be deployedEvery graph after validation and testing has to be deployed
as .ksh file into the run directory on UNIX.as .ksh file into the run directory on UNIX.
 This .ksh file is an executable file which is the backbone forThis .ksh file is an executable file which is the backbone for
the entire automation/wrapper process.the entire automation/wrapper process.
 The wrapper automation consists of .run, .env,The wrapper automation consists of .run, .env,
dependency list ,job list etcdependency list ,job list etc
 For a detailed description on wrapper and differentFor a detailed description on wrapper and different
directories and files , Please refer the documentation ondirectories and files , Please refer the documentation on
wrapper / UNIX presentation.wrapper / UNIX presentation.
ReferencesReferences
 Ab Initio TutorialAb Initio Tutorial
 Ab Initio Online HelpAb Initio Online Help
 Website (abinitio.com)Website (abinitio.com)
 http://sandyclassic.wordpress.comhttp://sandyclassic.wordpress.com
 Data warehouse : http://datawarehouseview.wordpress.com/
 Data Science View : http://thedatascience.wordpress.com/
 Business Intelligence trends :
http://businessintelligencetechnologytrend.wordpress.com/
 Business Architect :http://businessarchitectview.wordpress.com/
 Enterprise Architect : http://enterprisearchitectview.wordpress.com/
 Project Manager :http://projectmanagerview.wordpress.com
 data Architect : https://dataarchitectview.wordpress.com/

More Related Content

What's hot

Hive(ppt)
Hive(ppt)Hive(ppt)
Hive(ppt)
Abhinav Tyagi
 
Introduction to SAP Gateway and OData
Introduction to SAP Gateway and ODataIntroduction to SAP Gateway and OData
Introduction to SAP Gateway and OData
Chris Whealy
 
Apache hive introduction
Apache hive introductionApache hive introduction
Apache hive introduction
Mahmood Reza Esmaili Zand
 
Power BI Full Course | Power BI Tutorial for Beginners | Edureka
Power BI Full Course | Power BI Tutorial for Beginners | EdurekaPower BI Full Course | Power BI Tutorial for Beginners | Edureka
Power BI Full Course | Power BI Tutorial for Beginners | Edureka
Edureka!
 
DBMS
DBMSDBMS
Creating a database
Creating a databaseCreating a database
Creating a databaseRahul Gupta
 
Power bi
Power biPower bi
Power bi
jainema23
 
asp.net Webconfiguration
asp.net Webconfigurationasp.net Webconfiguration
asp.net Webconfiguration
Ma Kik
 
BI in the Cloud - Microsoft Power BI Overview and Demo
BI in the Cloud - Microsoft Power BI Overview and DemoBI in the Cloud - Microsoft Power BI Overview and Demo
BI in the Cloud - Microsoft Power BI Overview and Demo
Christopher Foot
 
Dbms
DbmsDbms
11 Database Concepts
11 Database Concepts11 Database Concepts
11 Database Concepts
Praveen M Jigajinni
 
SAP ODATA Overview & Guidelines
SAP ODATA Overview & GuidelinesSAP ODATA Overview & Guidelines
SAP ODATA Overview & Guidelines
Ashish Saxena
 
Power BI Charts Tutorial | Counter Strike Data Analysis using Power BI | Powe...
Power BI Charts Tutorial | Counter Strike Data Analysis using Power BI | Powe...Power BI Charts Tutorial | Counter Strike Data Analysis using Power BI | Powe...
Power BI Charts Tutorial | Counter Strike Data Analysis using Power BI | Powe...
Edureka!
 
android sqlite
android sqliteandroid sqlite
android sqliteDeepa Rani
 
Map reduce prashant
Map reduce prashantMap reduce prashant
Map reduce prashant
Prashant Gupta
 
SAP CPI - DS
SAP CPI - DSSAP CPI - DS
SAP CPI - DS
Geetha Madhuri
 
Power bi introduction
Power bi introductionPower bi introduction
Power bi introduction
Bishwadeb Dey
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecturepcherukumalla
 
Introduction To Pentaho
Introduction To PentahoIntroduction To Pentaho
Introduction To Pentaho
pentaho Content
 
Power BI: From the Basics
Power BI: From the BasicsPower BI: From the Basics
Power BI: From the Basics
Nikkia Carter
 

What's hot (20)

Hive(ppt)
Hive(ppt)Hive(ppt)
Hive(ppt)
 
Introduction to SAP Gateway and OData
Introduction to SAP Gateway and ODataIntroduction to SAP Gateway and OData
Introduction to SAP Gateway and OData
 
Apache hive introduction
Apache hive introductionApache hive introduction
Apache hive introduction
 
Power BI Full Course | Power BI Tutorial for Beginners | Edureka
Power BI Full Course | Power BI Tutorial for Beginners | EdurekaPower BI Full Course | Power BI Tutorial for Beginners | Edureka
Power BI Full Course | Power BI Tutorial for Beginners | Edureka
 
DBMS
DBMSDBMS
DBMS
 
Creating a database
Creating a databaseCreating a database
Creating a database
 
Power bi
Power biPower bi
Power bi
 
asp.net Webconfiguration
asp.net Webconfigurationasp.net Webconfiguration
asp.net Webconfiguration
 
BI in the Cloud - Microsoft Power BI Overview and Demo
BI in the Cloud - Microsoft Power BI Overview and DemoBI in the Cloud - Microsoft Power BI Overview and Demo
BI in the Cloud - Microsoft Power BI Overview and Demo
 
Dbms
DbmsDbms
Dbms
 
11 Database Concepts
11 Database Concepts11 Database Concepts
11 Database Concepts
 
SAP ODATA Overview & Guidelines
SAP ODATA Overview & GuidelinesSAP ODATA Overview & Guidelines
SAP ODATA Overview & Guidelines
 
Power BI Charts Tutorial | Counter Strike Data Analysis using Power BI | Powe...
Power BI Charts Tutorial | Counter Strike Data Analysis using Power BI | Powe...Power BI Charts Tutorial | Counter Strike Data Analysis using Power BI | Powe...
Power BI Charts Tutorial | Counter Strike Data Analysis using Power BI | Powe...
 
android sqlite
android sqliteandroid sqlite
android sqlite
 
Map reduce prashant
Map reduce prashantMap reduce prashant
Map reduce prashant
 
SAP CPI - DS
SAP CPI - DSSAP CPI - DS
SAP CPI - DS
 
Power bi introduction
Power bi introductionPower bi introduction
Power bi introduction
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
Introduction To Pentaho
Introduction To PentahoIntroduction To Pentaho
Introduction To Pentaho
 
Power BI: From the Basics
Power BI: From the BasicsPower BI: From the Basics
Power BI: From the Basics
 

Similar to Ab initio training Ab-initio Architecture

ab-initio-training basics and architecture
ab-initio-training basics and architectureab-initio-training basics and architecture
ab-initio-training basics and architecture
saipriyacoool
 
Recover 30% of your day with IBM Development Tools (Smarter Mainframe Develop...
Recover 30% of your day with IBM Development Tools (Smarter Mainframe Develop...Recover 30% of your day with IBM Development Tools (Smarter Mainframe Develop...
Recover 30% of your day with IBM Development Tools (Smarter Mainframe Develop...
Susan Yoskin
 
Lect 21 components_of_database_management_system
Lect 21 components_of_database_management_systemLect 21 components_of_database_management_system
Lect 21 components_of_database_management_systemnadine016
 
FME:23 for the Enterprise - A Deep Dive into Key New Features
FME:23 for the Enterprise - A Deep Dive into Key New FeaturesFME:23 for the Enterprise - A Deep Dive into Key New Features
FME:23 for the Enterprise - A Deep Dive into Key New Features
Safe Software
 
Multi platform application deployment with urban code deploy
Multi platform application deployment with urban code deployMulti platform application deployment with urban code deploy
Multi platform application deployment with urban code deploy
Saranga Tripathy
 
Ugif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutesUgif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutesUGIF
 
Easydd program
Easydd programEasydd program
Easydd program
Taha Sochi
 
Problem Determination Tools
Problem Determination ToolsProblem Determination Tools
Problem Determination ToolsCICS ROADSHOW
 
Product Overview: The New IBM UrbanCode Deploy 6.0
Product Overview: The New IBM UrbanCode Deploy 6.0Product Overview: The New IBM UrbanCode Deploy 6.0
Product Overview: The New IBM UrbanCode Deploy 6.0IBM UrbanCode Products
 
R sharing 101
R sharing 101R sharing 101
R sharing 101
Omnia Safaan
 
C# Production Debugging Made Easy
 C# Production Debugging Made Easy C# Production Debugging Made Easy
C# Production Debugging Made Easy
Alon Fliess
 
Managing Your Runtime With P2
Managing Your Runtime With P2Managing Your Runtime With P2
Managing Your Runtime With P2Pascal Rapicault
 
Software Build processes and Git
Software Build processes and GitSoftware Build processes and Git
Software Build processes and Git
Alec Clews
 
Safetty systems intro_embedded_c
Safetty systems intro_embedded_cSafetty systems intro_embedded_c
Safetty systems intro_embedded_c
Maria Cida Rosa
 
Software and its types
Software and its typesSoftware and its types
Software and its types
Waheed Iqbal Boss
 
What's New in InTouch Machine Edition (ITME)
What's New in InTouch Machine Edition (ITME)What's New in InTouch Machine Edition (ITME)
What's New in InTouch Machine Edition (ITME)
Wonderware InTouch Machine Edition
 
Behavior Driven GUI Testing
Behavior Driven GUI TestingBehavior Driven GUI Testing
Behavior Driven GUI Testing
Reginald Stadlbauer
 
Game Studio
Game StudioGame Studio
Game Studio
MarinaOpera
 

Similar to Ab initio training Ab-initio Architecture (20)

ab-initio-training basics and architecture
ab-initio-training basics and architectureab-initio-training basics and architecture
ab-initio-training basics and architecture
 
Recover 30% of your day with IBM Development Tools (Smarter Mainframe Develop...
Recover 30% of your day with IBM Development Tools (Smarter Mainframe Develop...Recover 30% of your day with IBM Development Tools (Smarter Mainframe Develop...
Recover 30% of your day with IBM Development Tools (Smarter Mainframe Develop...
 
Lect 21 components_of_database_management_system
Lect 21 components_of_database_management_systemLect 21 components_of_database_management_system
Lect 21 components_of_database_management_system
 
FME:23 for the Enterprise - A Deep Dive into Key New Features
FME:23 for the Enterprise - A Deep Dive into Key New FeaturesFME:23 for the Enterprise - A Deep Dive into Key New Features
FME:23 for the Enterprise - A Deep Dive into Key New Features
 
Multi platform application deployment with urban code deploy
Multi platform application deployment with urban code deployMulti platform application deployment with urban code deploy
Multi platform application deployment with urban code deploy
 
Ugif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutesUgif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutes
 
Easydd program
Easydd programEasydd program
Easydd program
 
Fedora15 lovelock-pres
Fedora15 lovelock-presFedora15 lovelock-pres
Fedora15 lovelock-pres
 
Problem Determination Tools
Problem Determination ToolsProblem Determination Tools
Problem Determination Tools
 
Product Overview: The New IBM UrbanCode Deploy 6.0
Product Overview: The New IBM UrbanCode Deploy 6.0Product Overview: The New IBM UrbanCode Deploy 6.0
Product Overview: The New IBM UrbanCode Deploy 6.0
 
R sharing 101
R sharing 101R sharing 101
R sharing 101
 
C# Production Debugging Made Easy
 C# Production Debugging Made Easy C# Production Debugging Made Easy
C# Production Debugging Made Easy
 
Managing Your Runtime With P2
Managing Your Runtime With P2Managing Your Runtime With P2
Managing Your Runtime With P2
 
Software Build processes and Git
Software Build processes and GitSoftware Build processes and Git
Software Build processes and Git
 
Safetty systems intro_embedded_c
Safetty systems intro_embedded_cSafetty systems intro_embedded_c
Safetty systems intro_embedded_c
 
Software and its types
Software and its typesSoftware and its types
Software and its types
 
What's New in InTouch Machine Edition (ITME)
What's New in InTouch Machine Edition (ITME)What's New in InTouch Machine Edition (ITME)
What's New in InTouch Machine Edition (ITME)
 
Uday Resume
Uday ResumeUday Resume
Uday Resume
 
Behavior Driven GUI Testing
Behavior Driven GUI TestingBehavior Driven GUI Testing
Behavior Driven GUI Testing
 
Game Studio
Game StudioGame Studio
Game Studio
 

More from Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW

Management Consultancy Saudi Telecom Digital Transformation Design Thinking
Management Consultancy Saudi Telecom Digital Transformation Design ThinkingManagement Consultancy Saudi Telecom Digital Transformation Design Thinking
Management Consultancy Saudi Telecom Digital Transformation Design Thinking
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
Digital transformation journey Consulting
Digital transformation journey ConsultingDigital transformation journey Consulting
Digital transformation journey Consulting
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
Agile Jira Reporting
Agile Jira Reporting Agile Jira Reporting
Lnt and bbby Retail Houseare industry Case assignment sandeep sharma
Lnt and bbby Retail Houseare industry Case assignment  sandeep sharmaLnt and bbby Retail Houseare industry Case assignment  sandeep sharma
Lnt and bbby Retail Houseare industry Case assignment sandeep sharma
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
Risk management Consulting For Municipality
Risk management Consulting For MunicipalityRisk management Consulting For Municipality
Risk management Consulting For Municipality
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
GDPR And Privacy By design Consultancy
GDPR And Privacy By design ConsultancyGDPR And Privacy By design Consultancy
GDPR And Privacy By design Consultancy
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
Real implementation Blockchain Best Use Cases Examples
Real implementation Blockchain Best Use Cases ExamplesReal implementation Blockchain Best Use Cases Examples
Real implementation Blockchain Best Use Cases Examples
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
Ffd 05 2012
Ffd 05 2012Ffd 05 2012
Biztalk architecture for Configured SMS service
Biztalk architecture for Configured SMS serviceBiztalk architecture for Configured SMS service
Biztalk architecture for Configured SMS service
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
Data modelling interview question
Data modelling interview questionData modelling interview question
Pmo best practices
Pmo best practicesPmo best practices
Agile project management
Agile project managementAgile project management
Enroll hostel Business Model
Enroll hostel Business ModelEnroll hostel Business Model
Cloud manager client provisioning guideline draft 1.0
Cloud manager client provisioning guideline draft 1.0Cloud manager client provisioning guideline draft 1.0
Cloud manager client provisioning guideline draft 1.0
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
Bpm digital transformation
Bpm digital transformationBpm digital transformation
Digital transformation explained
Digital transformation explainedDigital transformation explained
Government Digital transformation trend draft 1.0
Government Digital transformation trend draft 1.0Government Digital transformation trend draft 1.0
Government Digital transformation trend draft 1.0
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
Enterprise architecture maturity rating draft 1.0
Enterprise architecture maturity rating draft 1.0Enterprise architecture maturity rating draft 1.0
Enterprise architecture maturity rating draft 1.0
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
Organisation Structure For digital Transformation Team
Organisation Structure For digital Transformation TeamOrganisation Structure For digital Transformation Team
Organisation Structure For digital Transformation Team
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 

More from Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW (20)

Management Consultancy Saudi Telecom Digital Transformation Design Thinking
Management Consultancy Saudi Telecom Digital Transformation Design ThinkingManagement Consultancy Saudi Telecom Digital Transformation Design Thinking
Management Consultancy Saudi Telecom Digital Transformation Design Thinking
 
Major new initiatives
Major new initiativesMajor new initiatives
Major new initiatives
 
Digital transformation journey Consulting
Digital transformation journey ConsultingDigital transformation journey Consulting
Digital transformation journey Consulting
 
Agile Jira Reporting
Agile Jira Reporting Agile Jira Reporting
Agile Jira Reporting
 
Lnt and bbby Retail Houseare industry Case assignment sandeep sharma
Lnt and bbby Retail Houseare industry Case assignment  sandeep sharmaLnt and bbby Retail Houseare industry Case assignment  sandeep sharma
Lnt and bbby Retail Houseare industry Case assignment sandeep sharma
 
Risk management Consulting For Municipality
Risk management Consulting For MunicipalityRisk management Consulting For Municipality
Risk management Consulting For Municipality
 
GDPR And Privacy By design Consultancy
GDPR And Privacy By design ConsultancyGDPR And Privacy By design Consultancy
GDPR And Privacy By design Consultancy
 
Real implementation Blockchain Best Use Cases Examples
Real implementation Blockchain Best Use Cases ExamplesReal implementation Blockchain Best Use Cases Examples
Real implementation Blockchain Best Use Cases Examples
 
Ffd 05 2012
Ffd 05 2012Ffd 05 2012
Ffd 05 2012
 
Biztalk architecture for Configured SMS service
Biztalk architecture for Configured SMS serviceBiztalk architecture for Configured SMS service
Biztalk architecture for Configured SMS service
 
Data modelling interview question
Data modelling interview questionData modelling interview question
Data modelling interview question
 
Pmo best practices
Pmo best practicesPmo best practices
Pmo best practices
 
Agile project management
Agile project managementAgile project management
Agile project management
 
Enroll hostel Business Model
Enroll hostel Business ModelEnroll hostel Business Model
Enroll hostel Business Model
 
Cloud manager client provisioning guideline draft 1.0
Cloud manager client provisioning guideline draft 1.0Cloud manager client provisioning guideline draft 1.0
Cloud manager client provisioning guideline draft 1.0
 
Bpm digital transformation
Bpm digital transformationBpm digital transformation
Bpm digital transformation
 
Digital transformation explained
Digital transformation explainedDigital transformation explained
Digital transformation explained
 
Government Digital transformation trend draft 1.0
Government Digital transformation trend draft 1.0Government Digital transformation trend draft 1.0
Government Digital transformation trend draft 1.0
 
Enterprise architecture maturity rating draft 1.0
Enterprise architecture maturity rating draft 1.0Enterprise architecture maturity rating draft 1.0
Enterprise architecture maturity rating draft 1.0
 
Organisation Structure For digital Transformation Team
Organisation Structure For digital Transformation TeamOrganisation Structure For digital Transformation Team
Organisation Structure For digital Transformation Team
 

Recently uploaded

Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 

Recently uploaded (20)

Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 

Ab initio training Ab-initio Architecture

  • 1. Ab Initio OverviewAb Initio Overview Co>Operating system EME DTM GDE User User User Create all your graphs Graph when deployed generate .ksh Run all your graphs Store all variables in a repository / is also used for control / also collects all metadata about graph developed in GDE Used to schedule graphs developed in GDE. It also has capability to maintain dependencies between graphs
  • 2. Continue To Next SlideContinue To Next Slide More Details VisitMore Details Visit  More Details blog:http://sandyclassic.wordpress.com  linkedin:https://www.linkedin.com/in/sandepsharma  slideshare: http://www.slideshare.net/SandeepSharma65  facebook:https://facebook.com/sandeepclassic  google+ http://google.com/+SandeepSharmaa  Twitter: https://twitter.com/sandeeclassic  http://thedatawarehouseclassics.wordpress.com  http://businessintelligencetechnologytrend.wordpress. com
  • 3. About Ab InitioAbout Ab Initio  Ab Initio is a general purpose data processing platform for enterpriseAb Initio is a general purpose data processing platform for enterprise class, mission critical applications such as data warehousing,class, mission critical applications such as data warehousing, clickstream processing, data movement, data transformation andclickstream processing, data movement, data transformation and analytics.analytics.  Supports integration of arbitrary data sources and programs, andSupports integration of arbitrary data sources and programs, and provides complete metadata management across the enterprise.provides complete metadata management across the enterprise.  Proven best of breed ETL solution.Proven best of breed ETL solution.  Applications of Ab Initio:Applications of Ab Initio: – ETL for data warehouses, data marts and operational data sources.ETL for data warehouses, data marts and operational data sources. – Parallel data cleansing and validation.Parallel data cleansing and validation. – Parallel data transformation and filtering.Parallel data transformation and filtering. – High performance analyticsHigh performance analytics – Real time, parallel data capture.Real time, parallel data capture.
  • 4. Ab Initio ArchitectureAb Initio Architecture Native Operating System UNIX Windows NT Ab Initio Co>Operating System Component Library User-defined Components Third Party Components Application Development Environments Graphical C ++ Shell Applications Ab Initio Metadata Repository
  • 5. Co>Operating SystemCo>Operating System  The Co>Operating System is core software that unites aThe Co>Operating System is core software that unites a network of computing resources-CPUs, storage disks,network of computing resources-CPUs, storage disks, programs, datasets-into a production-quality dataprograms, datasets-into a production-quality data processing system with scalable performance andprocessing system with scalable performance and mainframe reliabilitymainframe reliability..  The Co>Operating System is layered on top of the nativeThe Co>Operating System is layered on top of the native operating systems of a collection of computers. It providesoperating systems of a collection of computers. It provides a distributed model for process execution, filea distributed model for process execution, file management, process monitoring, check-pointing, andmanagement, process monitoring, check-pointing, and debugging.debugging.
  • 6. Co>Operating SystemCo>Operating System  The Graphical Development Environment (GDE) providesThe Graphical Development Environment (GDE) provides a graphical user interface into the services of thea graphical user interface into the services of the Co>Operating System.Co>Operating System.  Unlimited scalability : Data parallelism results in speedupsUnlimited scalability : Data parallelism results in speedups proportional to the hardware resources provided, doubleproportional to the hardware resources provided, double the number of CPUs and execution time is halved.the number of CPUs and execution time is halved.  Flexibility : Provides a powerful and efficient dataFlexibility : Provides a powerful and efficient data transformation engine and an open component model fortransformation engine and an open component model for extending and customizing Ab Initio’s functionality.extending and customizing Ab Initio’s functionality.  Portability : Runs heterogeneously across a huge variety ofPortability : Runs heterogeneously across a huge variety of operating system and hardware platforms.operating system and hardware platforms.
  • 7. Graphical Development EnvironmentGraphical Development Environment (GDE)(GDE)  GDE lets create applications by dragging and droppingGDE lets create applications by dragging and dropping components onto a canvas configuring them withcomponents onto a canvas configuring them with familiar, intuitive point and click operations, andfamiliar, intuitive point and click operations, and connecting them into executable flowcharts.connecting them into executable flowcharts.  These diagrams are architectural documents thatThese diagrams are architectural documents that developers and managers alike can understand anddevelopers and managers alike can understand and use, but they are not mere pictures: the co>operatinguse, but they are not mere pictures: the co>operating system executes these flowcharts directly. This meanssystem executes these flowcharts directly. This means that there is a seamless and solid connection betweenthat there is a seamless and solid connection between the abstract picture of the application and the concretethe abstract picture of the application and the concrete reality of its execution.reality of its execution.
  • 8. Ab Initio S/w Versions & File ExtensionsAb Initio S/w Versions & File Extensions  Software VersionsSoftware Versions – Co>Operating System Version =>Co>Operating System Version => – GDE Version =>GDE Version =>  File ExtensionsFile Extensions – .mp.mp Stored Ab Initio graph or graph componentStored Ab Initio graph or graph component – .mpc.mpc Program or custom componentProgram or custom component – .mdc.mdc Dataset or custom dataset componentDataset or custom dataset component – .dml.dml Data Manipulation Language file or record typeData Manipulation Language file or record type definitiondefinition – .xfr.xfr Transform function fileTransform function file – .dat.dat Data file (either serial file or multifile)Data file (either serial file or multifile)
  • 9. Connecting to Co>op Server from GDEConnecting to Co>op Server from GDE
  • 10. Host Profile SettingHost Profile Setting 1.1. Choose settings from the run menuChoose settings from the run menu 2.2. Check the use host profile setting checkbox.Check the use host profile setting checkbox. 3.3. Click Edit button to open the Host profile dialog.Click Edit button to open the Host profile dialog. 4.4. If running Ab Initio on your local NT system, check LocalIf running Ab Initio on your local NT system, check Local Execution (NT) checkbox and go to step 6.Execution (NT) checkbox and go to step 6. 5.5. If running Ab Initio on a Remote UNIX system, fill in theIf running Ab Initio on a Remote UNIX system, fill in the path to the Host and Host Login and Password.path to the Host and Host Login and Password. 6.6. Type the full path of Host directory.Type the full path of Host directory. 7.7. Select the Shell Type from pull down menu.Select the Shell Type from pull down menu. 8.8. Test Login and if necessary make changes.Test Login and if necessary make changes.
  • 11. Host ProfileHost Profile Select the Shell Type Enter Host, Login, Password & Host directory
  • 12. Ab Initio ComponentsAb Initio Components Ab Initio provided components. Datasets, Partition, Transform, Sort, Database are frequently used.
  • 13. Creating GraphCreating Graph Type the Label Specify the Input .dat file
  • 14. Create Graph - DmlCreate Graph - Dml  Propagate from Neighbors: CopyPropagate from Neighbors: Copy record formats from connected flow.record formats from connected flow.  Same As: Copy record format’sSame As: Copy record format’s from a specific component’s port.from a specific component’s port.  Path: Store record formats in aPath: Store record formats in a Local file, Host File, or in the AbLocal file, Host File, or in the Ab Initio repository.Initio repository.  Embedded: Type the record formatEmbedded: Type the record format directly in a string.directly in a string. Specify the .dml file
  • 15. Creating Graph - dmlCreating Graph - dml  DML is Ab Initio’s DataDML is Ab Initio’s Data Manipulation Language.Manipulation Language.  DML describes data in termsDML describes data in terms ofof – Record Formats that list theRecord Formats that list the fields and format of input,fields and format of input, output, and intermediateoutput, and intermediate records.records. – Expressions that defineExpressions that define simple computations, forsimple computations, for example, selection.example, selection. – Transform Functions thatTransform Functions that control reformatting,control reformatting, aggregation, and other dataaggregation, and other data transformations.transformations. – Keys that specify groupings,Keys that specify groupings, ordering, and partitioningordering, and partitioning relationships betweenrelationships between records.records. Editing .dml file through Record Format Editor – Grid View
  • 16. Creating Graph - TransformCreating Graph - Transform  A transform function is either aA transform function is either a DML file or a DML string thatDML file or a DML string that describes how you manipulatedescribes how you manipulate your data.your data.  Ab Initio transform functionsAb Initio transform functions mainly consist of a series ofmainly consist of a series of assignment statements. Eachassignment statements. Each statement is called a businessstatement is called a business rule.rule.  When Ab Initio evaluates aWhen Ab Initio evaluates a transform function, it performstransform function, it performs following tasks:following tasks: – Initializes local variablesInitializes local variables – Evaluates statementsEvaluates statements – Evaluates rules.Evaluates rules.  Transform function files have theTransform function files have the xfr extension.xfr extension. Specify the .xfr file
  • 17. Creating Graph - xfrCreating Graph - xfr  Transform functions: A setTransform functions: A set of rules that computeof rules that compute output values from inputoutput values from input values.values.  Business rule: Part of aBusiness rule: Part of a transform function thattransform function that describes how youdescribes how you manipulate one field ofmanipulate one field of your output data.your output data.  Variable: Optional part of aVariable: Optional part of a transform function thattransform function that provides storage forprovides storage for temporary values.temporary values.  Statement: Optional part ofStatement: Optional part of a transform function thata transform function that assigns values of variablesassigns values of variables in a specific order.in a specific order.
  • 18. Sample ComponentsSample Components  SortSort  DedupDedup  JoinJoin  ReplicateReplicate  RollupRollup  Filter by ExpressionFilter by Expression  MergeMerge  LookupLookup  Reformat etc.Reformat etc.
  • 19. Creating Graph – Sort ComponentCreating Graph – Sort Component  Sort: The sort componentSort: The sort component reorders data. Itreorders data. It comprises twocomprises two parameters: Key andparameters: Key and max-core.max-core.  Key: The Key is one ofKey: The Key is one of the parameters for Sortthe parameters for Sort component whichcomponent which describes the collationdescribes the collation order.order.  Max-core: The max-coreMax-core: The max-core parameter controls howparameter controls how often the sort componentoften the sort component dumps data fromdumps data from memory to disk.memory to disk. Specify Key for the Sort
  • 20. Creating Graph – Dedup componentCreating Graph – Dedup component  Dedup componentDedup component removes duplicateremoves duplicate records.records.  Dedup criteria willDedup criteria will be either unique-be either unique- only, First or Last.only, First or Last. Select Dedup criteria.
  • 21. Creating Graph – Replicate ComponentCreating Graph – Replicate Component  ReplicateReplicate combines thecombines the data records fromdata records from the inputs intothe inputs into one flow andone flow and writes a copy ofwrites a copy of that flow to eachthat flow to each of its output ports.of its output ports.  Use Replicate toUse Replicate to supportsupport componentcomponent parallelism.parallelism.
  • 22. Creating Graph – Join ComponentCreating Graph – Join Component • Specify the key for join • Specify Type of Join
  • 23. Database Configuration (.dbc)Database Configuration (.dbc)  A file with a .dbc extension which provides the GDE withA file with a .dbc extension which provides the GDE with the information it needs to connect to a database. Athe information it needs to connect to a database. A configuration file contains the following information:configuration file contains the following information: – The name and version number of the database to which you wantThe name and version number of the database to which you want to connect.to connect. – The name of the computer on which the database instance orThe name of the computer on which the database instance or server to which you want to connect runs, or on which the databaseserver to which you want to connect runs, or on which the database remote access software is installed.remote access software is installed. – The name of the database instance, server, or provider to whichThe name of the database instance, server, or provider to which you want to connect.you want to connect. – You generate a configuration file by using the Properties dialog boxYou generate a configuration file by using the Properties dialog box for one of the Database components.for one of the Database components.
  • 24. Creating Parallel ApplicationsCreating Parallel Applications  Types of Parallel ProcessingTypes of Parallel Processing – Component-level Parallelism: An application with multipleComponent-level Parallelism: An application with multiple components running simultaneously on separate data usescomponents running simultaneously on separate data uses component parallelism.component parallelism. – Pipeline parallelism: An application with multiple componentsPipeline parallelism: An application with multiple components running simultaneously on the same data uses pipeline parallelism.running simultaneously on the same data uses pipeline parallelism. – Data Parallelism: An application with data divided into segmentsData Parallelism: An application with data divided into segments that operates on each segment simultaneously uses datathat operates on each segment simultaneously uses data parallelism.parallelism.
  • 25. Partition ComponentsPartition Components  Partition by Expression: Dividing data according to a DML expression.Partition by Expression: Dividing data according to a DML expression.  Partition by Key: Grouping data by a key.Partition by Key: Grouping data by a key.  Partition with Load balance: Dynamic load balancing.Partition with Load balance: Dynamic load balancing.  Partition by Percentage: Distributing data, so the output is proportionalPartition by Percentage: Distributing data, so the output is proportional to fractions of 100.to fractions of 100.  Partition by Range: Dividing data evenly among nodes, based on a keyPartition by Range: Dividing data evenly among nodes, based on a key and a set of partitioning ranges.and a set of partitioning ranges.  Partition by Round-robin: Distributing data evenly, in blocksize chunks,Partition by Round-robin: Distributing data evenly, in blocksize chunks, across the output partitions.across the output partitions.
  • 26. Departition ComponentsDepartition Components  Concatenate: Concatenate component produces a single output flowConcatenate: Concatenate component produces a single output flow that contains first all the records from the first input partition, then allthat contains first all the records from the first input partition, then all the records from the second input partition and so on.the records from the second input partition and so on.  Gather: Gather component collects inputs from multiple partitions in anGather: Gather component collects inputs from multiple partitions in an arbitrary manner, and produces a single output flow, does not maintainarbitrary manner, and produces a single output flow, does not maintain sort order.sort order.  Interleave: Interleave component collects records from many sourcesInterleave: Interleave component collects records from many sources in round robin fashion.in round robin fashion.  Merge: Merge component collects inputs from multiple sorted partitionsMerge: Merge component collects inputs from multiple sorted partitions and maintains the sort order.and maintains the sort order.
  • 27. Multifile systemsMultifile systems  A multifile system is a specially created set of directories, possibly onA multifile system is a specially created set of directories, possibly on different machines, which have identical substructure.different machines, which have identical substructure.  Each directory is a partition of the multifile system. When a multifile isEach directory is a partition of the multifile system. When a multifile is placed in a multifile system, its partitions are files within each of theplaced in a multifile system, its partitions are files within each of the partitions of the multifile system.partitions of the multifile system.  Multifile system leads to better performance than flat file systemsMultifile system leads to better performance than flat file systems because multifile systems can divide your data among multiple disks orbecause multifile systems can divide your data among multiple disks or CPUs.CPUs.  Typically (SMP machine is exception) a multifile system is created withTypically (SMP machine is exception) a multifile system is created with the control partition on one node and data partitions on other nodes tothe control partition on one node and data partitions on other nodes to distribute the work and improve performance.distribute the work and improve performance.  To do this use full internet URLs that specify file and directory namesTo do this use full internet URLs that specify file and directory names and locations on remote machines.and locations on remote machines.
  • 29. SANDBOXSANDBOX  A sandbox is a collection of graphs and related files thatA sandbox is a collection of graphs and related files that are stored in a single directory tree, and treated as a groupare stored in a single directory tree, and treated as a group for purposes of version control, navigation, and migration.for purposes of version control, navigation, and migration.  A sandbox can be a file system copy of a datastore projectA sandbox can be a file system copy of a datastore project..  In the graph, instead of specifying the entire path for anyIn the graph, instead of specifying the entire path for any file location ,we specify only the sandbox parameterfile location ,we specify only the sandbox parameter variable. For ex : $AI_IN_DATA/customer_info.dat. wherevariable. For ex : $AI_IN_DATA/customer_info.dat. where $AI_IN_DATA contains the entire path with reference to$AI_IN_DATA contains the entire path with reference to the sandbox $AI_HOME variable.the sandbox $AI_HOME variable.  The actual in_data dir is $AI_HOME/in_data in sandboxThe actual in_data dir is $AI_HOME/in_data in sandbox
  • 30. SANDBOXSANDBOX  The sandbox provides an excellent mechanism toThe sandbox provides an excellent mechanism to maintain uniqueness while moving frommaintain uniqueness while moving from development to production environment by meansdevelopment to production environment by means switch parameters.switch parameters.  We can define parameters in sandbox those canWe can define parameters in sandbox those can be used across all the graphs pertaining to thatbe used across all the graphs pertaining to that sandbox.sandbox.  The topmost variable $PROJECT_DIR containsThe topmost variable $PROJECT_DIR contains the path of the home directorythe path of the home directory
  • 32. DeployingDeploying  Every graph after validation and testing has to be deployedEvery graph after validation and testing has to be deployed as .ksh file into the run directory on UNIX.as .ksh file into the run directory on UNIX.  This .ksh file is an executable file which is the backbone forThis .ksh file is an executable file which is the backbone for the entire automation/wrapper process.the entire automation/wrapper process.  The wrapper automation consists of .run, .env,The wrapper automation consists of .run, .env, dependency list ,job list etcdependency list ,job list etc  For a detailed description on wrapper and differentFor a detailed description on wrapper and different directories and files , Please refer the documentation ondirectories and files , Please refer the documentation on wrapper / UNIX presentation.wrapper / UNIX presentation.
  • 33. ReferencesReferences  Ab Initio TutorialAb Initio Tutorial  Ab Initio Online HelpAb Initio Online Help  Website (abinitio.com)Website (abinitio.com)  http://sandyclassic.wordpress.comhttp://sandyclassic.wordpress.com  Data warehouse : http://datawarehouseview.wordpress.com/  Data Science View : http://thedatascience.wordpress.com/  Business Intelligence trends : http://businessintelligencetechnologytrend.wordpress.com/  Business Architect :http://businessarchitectview.wordpress.com/  Enterprise Architect : http://enterprisearchitectview.wordpress.com/  Project Manager :http://projectmanagerview.wordpress.com  data Architect : https://dataarchitectview.wordpress.com/