DataStage Scenario Design5 - job1
Something about DataStage, DataStage Administration, Job Designing, Developing, DataStage Troubleshooting, DataStage Installation & Configuration, ETL, Data Warehousing, DB2, Teradata, Oracle and Scripting.
Nuts & Bolts of DataStage
Thursday, April 24, 2014
DataStage Scenario Problem5
Solution Design:
a) Job Design:
The design below produces the required output. The job reads a sequential file as input, then passes the data through a Sort stage and a Transformer stage.
b) Sort Stage Properties
In the Sort stage, we sort the data on the column “Char” in ascending order.
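For readers who want to see the effect of this step outside DataStage, here is a minimal Python sketch of the same sort. The input rows are an assumption, inferred from the output file shown in this post, and the names `rows` and `sorted_rows` are hypothetical. Python's `sorted()` is stable, so rows with the same char keep their input order; whether the Sort stage behaves the same way depends on its settings, which are not shown here.

```python
# Hypothetical input rows as (no, char) pairs, inferred from the post's output file.
rows = [(1, "a"), (2, "b"), (3, "a"), (4, "b"),
        (5, "a"), (6, "a"), (7, "b"), (8, "a")]

# Sort on the char column in ascending order.
# sorted() is stable: rows sharing a char stay in their original relative order.
sorted_rows = sorted(rows, key=lambda r: r[1])

for no, char in sorted_rows:
    print(f"{no},{char}")
```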
DataStage Scenario Design5 job1
e) Output File
In the output file, we use inline sorting to sort the data on the "Occurrence" column in ascending order.
no, char, occurrence
1,a,1
3,a,2
5,a,3
6,a,4
8,a,5
2,b,1
4,b,2
7,b,3
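The rows listed above can be reproduced with a short Python sketch. It is only an illustration of the idea, not the Transformer logic itself, which this post does not show. It assumes the input is a list of (no, char) pairs inferred from the output, and that "occurrence" is a running count that restarts at 1 for each new char. The function name `add_occurrence` is hypothetical.

```python
from itertools import groupby

def add_occurrence(rows):
    """Return (no, char, occurrence) tuples, numbering each char's rows from 1."""
    # Sort on char first so equal chars are adjacent (stable, keeps input order).
    ordered = sorted(rows, key=lambda r: r[1])
    out = []
    # groupby yields one group per run of equal chars in the sorted data.
    for _, group in groupby(ordered, key=lambda r: r[1]):
        # The counter restarts at 1 for every new char.
        for occurrence, (no, char) in enumerate(group, start=1):
            out.append((no, char, occurrence))
    return out

# Hypothetical input, inferred from the output file above.
rows = [(1, "a"), (2, "b"), (3, "a"), (4, "b"),
        (5, "a"), (6, "a"), (7, "b"), (8, "a")]

result = add_occurrence(rows)
print("no, char, occurrence")
for no, char, occurrence in result:
    print(f"{no},{char},{occurrence}")
```

Grouping after a sort mirrors the Sort-then-Transformer layout of the job: the counter only needs to compare each row's char with the previous one, so no lookup across the whole data set is required.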
By ETL DataStage at 08:05
Labels: Data, DataSet, DataStage, design, develop, duplicate, input, output, problem, sort, stages, transformer
Disclaimer
The postings on this site are my own and don't necessarily represent IBM's or other companies' positions, strategies or opinions. All content provided on this blog is for informational purposes only. The owner of this blog makes no representations as to the accuracy or completeness of any information on this site or found by following any link on this site. The owner will not be liable for any errors or omissions in this information nor for the availability of this information. The owner will not be liable for any losses, injuries, or damages from the display or use of this information.