SlideShare a Scribd company logo
1 of 29
Building a Requirements Management System for Software Requirements
Kevin Ryan
Honolulu Community College
Rob Nelson, Peter Konohia, Des Iorgova
Akimeka, LLC
• Akimeka is a software development firm that specializes in IT
solutions focused for the Department of Defense.
 Services Include:
 Enterprise Architecture
 Cloud Infrastructure
 Virtualization
 Web-Based Technologies
 Database Management
 Software Development
 Network Engineering
 Information Security Assurance
Akimeka LLC
Software Company
1305 N Holopono St #3, Kihei, HI 96753
Phone: (808) 442-7100
Akimeka, LLC
Project Context
• Akimeka for many years has maintained, improved, and developed
specialized computer systems and software for the DoD.
• Medical Record Database Directory
• User/Mobile friendly website
• The development team enhances the software per requirements
defined by the DoD.
• Requirement has to be verifiable, clear, and concise.
• Example Requirement: The input form must produce validated JSON and
send output to the name server.
EXCEL
Example Requirement
Project Context
The input form must produce validated JSON and send output to
the name server.
• That requirement needs to be properly documented in a
Requirements Management System
• The current system is a hybrid of MS Excel sheets and a MS Word document
• Difficult to analyze requirements
• Traceability is an issue
• Potential for inconsistency and duplication.
EXCEL
Example Requirement
MS Word
Example Requirement
Project Objective
• The objective of the Requirements Management System is to store
baselined software requirements from the DoD.
• By doing this we will improve on…
• Requirements analysis
• Data accuracy (No duplicates)
• Search capability (Query displays all pertaining records)
• Traceability
• Consistency
My Project
• My project was to create a Extract, Transform, & Load application
that completely and accurately moved data from MS Excel
spreadsheets into Oracle database tables.
• The ETL Process will structure the data and baseline software
requirements into the Requirements Management System.
• ETL Goals:
• All records are migrated to the database.
• Values that are “N/A” will be replaced to NULL.
• Uses SQL Scripts to compare tables so that no duplicate data will be
baselined or processed.
Custom Java vs Pentaho Data Integration
Custom Java vs Pentaho Data Integration
Criteria
Custom Java
Code
Pentaho Data
Integration
(Kettle)
The solution must be able to extract all data from the excel spreadsheet. Yes Yes
The solution must map data to the specific key value pairs in the data dictionary
for JSON output.
Yes Yes
The solution must map data to the specific column name in the data dictionary
for SQL output.
Yes Yes
The solution must be able to write output as JSON. Yes Yes
The solution must be able to write output as SQL. Yes Yes
The solution must be able to adapt to changes quickly in the event that datatypes
change.
No Yes
Can we plan design, build, implement, test, and document the solution within 5
weeks?
No Yes
Custom Java vs Pentaho Data Integration
Criteria
Custom Java
Code
Pentaho Data
Integration
(Kettle)
The solution must be able to extract all data from the excel spreadsheet. Yes Yes
The solution must map data to the specific key value pairs in the data dictionary
for JSON output.
Yes Yes
The solution must map data to the specific column name in the data dictionary
for SQL output.
Yes Yes
The solution must be able to write output as JSON. Yes Yes
The solution must be able to write output as SQL. Yes Yes
The solution must be able to adapt to changes quickly in the event that datatypes
change.
No Yes
Can we plan design, build, implement, test, and document the solution within 5
weeks?
No Yes
Custom Java vs Pentaho Data Integration
Criteria
Custom Java
Code
Pentaho Data
Integration
(Kettle)
The solution must be able to extract all data from the excel spreadsheet. Yes Yes
The solution must map data to the specific key value pairs in the data dictionary
for JSON output.
Yes Yes
The solution must map data to the specific column name in the data dictionary
for SQL output.
Yes Yes
The solution must be able to write output as JSON. Yes Yes
The solution must be able to write output as SQL. Yes Yes
The solution must be able to adapt to changes quickly in the event that datatypes
change.
No Yes
Can we plan design, build, implement, test, and document the solution within 5
weeks?
No Yes
Data Integration Development Environment
Data Integration Development Environment
Flow Chart of the Data for ETL processing
EXCEL
EXCEL
EXCEL
Flow Chart of the Data for ETL processing
EXCEL
Staging
Table
EXCEL
EXCEL
Staging
Table
Staging
Table
Flow Chart of the Data for ETL processing
EXCEL
Staging
Table
EXCEL
EXCEL
Staging
Table
Staging
Table
Data Cleanup
and
Transform
Draft
If No Management Review
Flow Chart of the Data for ETL processing
EXCEL
Staging
Table
EXCEL
EXCEL
Staging
Table
Staging
Table
Data Cleanup
and
Transform
Draft
If No Management Review
Final
If Yes
Oracle
Database
Load raw data into staging table
EXCEL
Staging
Table
EXCEL
EXCEL
Staging
Table
Staging
Table
Data Cleanup
and
Transform
Draft
If No Management Review
Final
If Yes
Oracle
Database
Pentaho Data IntegrationPass 1: Load raw data into staging table
Oracle Database: PR_STG
After Evaluating data from the Staging Table
• As expected, a SQL SELECT Query will not reference the correct
amount of records.
• SELECT * FROM PR.STG WHERE PROJECT_NAME=‘Medical Situational
Awareness In Theater (MSAT)’
• PROJECT_NAME
• Medical Situational Awareness In Theater
• Medical Situational Awareness In Theater (MSAT)
• Medical Situation Awareness In Theater (MSAT)
Create a Transform: Find and Replace Method
STAGING
TABLE
REFERENCE TABLE
(MAP_COLLECTED_
TO_TARGET)
DRAFT TABLESQL Script SQL Script
SELECTS
DISTINCT
VALUES
FIND AND
REPLACE
Executes SQL Script to collect data for
MAP_COLLECTED_TO_TARGET Table
MAP_COLLECTED_TO_TARGET Table
• Medical Situational Awareness In Theater
• Medical Situational Awareness In Theater (MSAT)
• Medical Situation Awareness In Theater (MSAT)
RCS_COLUMN_NAME COLLECTED_VALUE TARGET_VALUE
PROJECT_NAME Medical Situational Awareness In Theater Medical Situational Awareness In Theater (MSAT)
PROJECT_NAME Medical Situational Awareness In Theater
(MSAT)
Medical Situational Awareness In Theater (MSAT)
PROJECT_NAME Medical Situation Awareness In Theater Medical Situational Awareness In Theater (MSAT)
RELEASE_NUMBER One.7.5.4 1.7.5.4
Performs DATA Cleanup Utilizing the
MAP_COLLECTED_TO_TARGET Table
Flow Chart of the Data for ETL processing
EXCEL
Staging
Table
EXCEL
EXCEL
Staging
Table
Staging
Table
Data Cleanup
and
Transform
Draft
If No Management Review
Final
If Yes
Oracle
Database
Pentaho Data Integration
Results
• Ingested 5608 rows of data, each requirement is unique and baselined.
• Structured
• Accurate
• Searchable
• SELECT COUNT (*) FROM RM.REQUIREMENTS
• All records are migrated to the database.
• SELECT * FROM RM.REQUIREMENTS WHERE COLUMN_NAME=‘N/A’
• Values that were “N/A” was replaced to NULL.
• Input validation/exception handling
• Utilizes SQL Scripts to compare tables so that no duplicate data will be
baselined or processed.
Acknowledgments
Akamai is led and managed by the Institute for Scientist & Engineer Educators at the University of
California Santa Cruz, in partnership with the University of Hawaii Institute for Astronomy. Funding
for the 2015 Akamai Internship and Mentor Program is provided by: Thirty Meter Telescope
International Observatory, THINK Fund at the Hawaii Community Foundation, University of Hawaii
System, University of Hawaii at Hilo, National Science Foundation (AST#1347767), and National Solar
Observatory.
Akimeka
Rob Nelson
Peter Konohia
Des Iorgova
Akamai Workforce Initiative
Lisa Hunter
Austin Barnes
Jerome Shaw
David Harrington
Mike Nassir
ETL Dataflow

More Related Content

What's hot

BI Environment Technical Analysis
BI Environment Technical AnalysisBI Environment Technical Analysis
BI Environment Technical AnalysisRyan Casey
 
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...RTTS
 
QuerySurge for DevOps
QuerySurge for DevOpsQuerySurge for DevOps
QuerySurge for DevOpsRTTS
 
Improve the Health of Your Data
Improve the Health of Your DataImprove the Health of Your Data
Improve the Health of Your DataRTTS
 
Workload_Migration
Workload_MigrationWorkload_Migration
Workload_MigrationAditya Singh
 
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...Gabriele Baldassarre
 
Kailsh K_Resume-Aug2016
Kailsh K_Resume-Aug2016Kailsh K_Resume-Aug2016
Kailsh K_Resume-Aug2016Kailash K
 
Simplified Workload Migration to Big Data Warehouse
Simplified Workload Migration to Big Data WarehouseSimplified Workload Migration to Big Data Warehouse
Simplified Workload Migration to Big Data WarehouseAtul Sharma
 
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)Roland Bouman
 
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...Kent Graziano
 
Talend Open Studio Introduction - OSSCamp 2014
Talend Open Studio Introduction - OSSCamp 2014Talend Open Studio Introduction - OSSCamp 2014
Talend Open Studio Introduction - OSSCamp 2014OSSCube
 
SQL Server Workshop for Developers - Visual Studio Live! NY 2012
SQL Server Workshop for Developers - Visual Studio Live! NY 2012SQL Server Workshop for Developers - Visual Studio Live! NY 2012
SQL Server Workshop for Developers - Visual Studio Live! NY 2012Andrew Brust
 
Mohamed sakr Senior ETL Developer
Mohamed sakr   Senior ETL Developer Mohamed sakr   Senior ETL Developer
Mohamed sakr Senior ETL Developer Mohamed Sakr
 
Intro to Talend Open Studio for Data Integration
Intro to Talend Open Studio for Data IntegrationIntro to Talend Open Studio for Data Integration
Intro to Talend Open Studio for Data IntegrationPhilip Yurchuk
 
Etl with talend (big data)
Etl with talend (big data)Etl with talend (big data)
Etl with talend (big data)pomishra
 
Designing dashboards for performance shridhar wip 040613
Designing dashboards for performance shridhar wip 040613Designing dashboards for performance shridhar wip 040613
Designing dashboards for performance shridhar wip 040613Mrunal Shridhar
 
Agile Methods and Data Warehousing
Agile Methods and Data WarehousingAgile Methods and Data Warehousing
Agile Methods and Data WarehousingKent Graziano
 
Talend Open Studio Data Integration
Talend Open Studio Data IntegrationTalend Open Studio Data Integration
Talend Open Studio Data IntegrationRoberto Marchetto
 
Dimensional modeling in oracle sql developer
Dimensional modeling in oracle sql developerDimensional modeling in oracle sql developer
Dimensional modeling in oracle sql developerJeff Smith
 

What's hot (20)

BI Environment Technical Analysis
BI Environment Technical AnalysisBI Environment Technical Analysis
BI Environment Technical Analysis
 
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
 
QuerySurge for DevOps
QuerySurge for DevOpsQuerySurge for DevOps
QuerySurge for DevOps
 
Improve the Health of Your Data
Improve the Health of Your DataImprove the Health of Your Data
Improve the Health of Your Data
 
Workload_Migration
Workload_MigrationWorkload_Migration
Workload_Migration
 
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...
Talend Open Studio Fundamentals #1: Workspaces, Jobs, Metadata and Trips & Tr...
 
Kailsh K_Resume-Aug2016
Kailsh K_Resume-Aug2016Kailsh K_Resume-Aug2016
Kailsh K_Resume-Aug2016
 
Simplified Workload Migration to Big Data Warehouse
Simplified Workload Migration to Big Data WarehouseSimplified Workload Migration to Big Data Warehouse
Simplified Workload Migration to Big Data Warehouse
 
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
 
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
 
Talend Open Studio Introduction - OSSCamp 2014
Talend Open Studio Introduction - OSSCamp 2014Talend Open Studio Introduction - OSSCamp 2014
Talend Open Studio Introduction - OSSCamp 2014
 
SQL Server Workshop for Developers - Visual Studio Live! NY 2012
SQL Server Workshop for Developers - Visual Studio Live! NY 2012SQL Server Workshop for Developers - Visual Studio Live! NY 2012
SQL Server Workshop for Developers - Visual Studio Live! NY 2012
 
Mohamed sakr Senior ETL Developer
Mohamed sakr   Senior ETL Developer Mohamed sakr   Senior ETL Developer
Mohamed sakr Senior ETL Developer
 
Intro to Talend Open Studio for Data Integration
Intro to Talend Open Studio for Data IntegrationIntro to Talend Open Studio for Data Integration
Intro to Talend Open Studio for Data Integration
 
Oracle data integrator (odi) online training
Oracle data integrator (odi) online trainingOracle data integrator (odi) online training
Oracle data integrator (odi) online training
 
Etl with talend (big data)
Etl with talend (big data)Etl with talend (big data)
Etl with talend (big data)
 
Designing dashboards for performance shridhar wip 040613
Designing dashboards for performance shridhar wip 040613Designing dashboards for performance shridhar wip 040613
Designing dashboards for performance shridhar wip 040613
 
Agile Methods and Data Warehousing
Agile Methods and Data WarehousingAgile Methods and Data Warehousing
Agile Methods and Data Warehousing
 
Talend Open Studio Data Integration
Talend Open Studio Data IntegrationTalend Open Studio Data Integration
Talend Open Studio Data Integration
 
Dimensional modeling in oracle sql developer
Dimensional modeling in oracle sql developerDimensional modeling in oracle sql developer
Dimensional modeling in oracle sql developer
 

Viewers also liked

Allzweckwaffe für Kommunikation und Marketing - Social Media bei Scout24
Allzweckwaffe für Kommunikation und Marketing  - Social Media bei Scout24Allzweckwaffe für Kommunikation und Marketing  - Social Media bei Scout24
Allzweckwaffe für Kommunikation und Marketing - Social Media bei Scout24Scout24
 
Yes clinic flyer
Yes clinic flyerYes clinic flyer
Yes clinic flyerYesRecYale
 
Human Trafficking Global Horizons Indictment 9 2-10
Human Trafficking Global Horizons Indictment 9 2-10Human Trafficking Global Horizons Indictment 9 2-10
Human Trafficking Global Horizons Indictment 9 2-10Honolulu Civil Beat
 
Open innovation und Crowdsourcing in der Finanzdienstleistungsbranche
Open innovation und Crowdsourcing in der FinanzdienstleistungsbrancheOpen innovation und Crowdsourcing in der Finanzdienstleistungsbranche
Open innovation und Crowdsourcing in der FinanzdienstleistungsbrancheVolksbank_Buehl
 
Opettajan ja opiskelijan TVT-osaaminen
Opettajan ja opiskelijan TVT-osaaminen Opettajan ja opiskelijan TVT-osaaminen
Opettajan ja opiskelijan TVT-osaaminen KirsiViitanen
 
Letter from Ige, DHHL to Schatz, Saiki
Letter from Ige, DHHL to Schatz, SaikiLetter from Ige, DHHL to Schatz, Saiki
Letter from Ige, DHHL to Schatz, SaikiHonolulu Civil Beat
 
Opetusmateriaalit verkossa ja tekijänoikeudet opetuksessa
Opetusmateriaalit verkossa ja tekijänoikeudet opetuksessaOpetusmateriaalit verkossa ja tekijänoikeudet opetuksessa
Opetusmateriaalit verkossa ja tekijänoikeudet opetuksessaKirsiViitanen
 
Palaute verkko-oppimisessa ja -ohjaamisessa
Palaute verkko-oppimisessa ja -ohjaamisessaPalaute verkko-oppimisessa ja -ohjaamisessa
Palaute verkko-oppimisessa ja -ohjaamisessaKirsiViitanen
 
Hawaii vs. Trump motion againt TRO
Hawaii vs. Trump motion againt TROHawaii vs. Trump motion againt TRO
Hawaii vs. Trump motion againt TROHonolulu Civil Beat
 

Viewers also liked (13)

Allzweckwaffe für Kommunikation und Marketing - Social Media bei Scout24
Allzweckwaffe für Kommunikation und Marketing  - Social Media bei Scout24Allzweckwaffe für Kommunikation und Marketing  - Social Media bei Scout24
Allzweckwaffe für Kommunikation und Marketing - Social Media bei Scout24
 
2011 01-18
2011 01-182011 01-18
2011 01-18
 
Yes clinic flyer
Yes clinic flyerYes clinic flyer
Yes clinic flyer
 
Usd417045
Usd417045Usd417045
Usd417045
 
Human Trafficking Global Horizons Indictment 9 2-10
Human Trafficking Global Horizons Indictment 9 2-10Human Trafficking Global Horizons Indictment 9 2-10
Human Trafficking Global Horizons Indictment 9 2-10
 
Citizen Ecodrive
Citizen EcodriveCitizen Ecodrive
Citizen Ecodrive
 
primera unidad enef
primera unidad enef primera unidad enef
primera unidad enef
 
Open innovation und Crowdsourcing in der Finanzdienstleistungsbranche
Open innovation und Crowdsourcing in der FinanzdienstleistungsbrancheOpen innovation und Crowdsourcing in der Finanzdienstleistungsbranche
Open innovation und Crowdsourcing in der Finanzdienstleistungsbranche
 
Opettajan ja opiskelijan TVT-osaaminen
Opettajan ja opiskelijan TVT-osaaminen Opettajan ja opiskelijan TVT-osaaminen
Opettajan ja opiskelijan TVT-osaaminen
 
Letter from Ige, DHHL to Schatz, Saiki
Letter from Ige, DHHL to Schatz, SaikiLetter from Ige, DHHL to Schatz, Saiki
Letter from Ige, DHHL to Schatz, Saiki
 
Opetusmateriaalit verkossa ja tekijänoikeudet opetuksessa
Opetusmateriaalit verkossa ja tekijänoikeudet opetuksessaOpetusmateriaalit verkossa ja tekijänoikeudet opetuksessa
Opetusmateriaalit verkossa ja tekijänoikeudet opetuksessa
 
Palaute verkko-oppimisessa ja -ohjaamisessa
Palaute verkko-oppimisessa ja -ohjaamisessaPalaute verkko-oppimisessa ja -ohjaamisessa
Palaute verkko-oppimisessa ja -ohjaamisessa
 
Hawaii vs. Trump motion againt TRO
Hawaii vs. Trump motion againt TROHawaii vs. Trump motion againt TRO
Hawaii vs. Trump motion againt TRO
 

Similar to Ryan-Symposium-v5

Shane_O'Neill_CV_slim
Shane_O'Neill_CV_slimShane_O'Neill_CV_slim
Shane_O'Neill_CV_slimShane O'Neill
 
Krishna_IBM_Infosphere_Certified_Datastage_Consultant
Krishna_IBM_Infosphere_Certified_Datastage_Consultant Krishna_IBM_Infosphere_Certified_Datastage_Consultant
Krishna_IBM_Infosphere_Certified_Datastage_Consultant Krishna Kishore
 
Resume - Deepak v.s
Resume -  Deepak v.sResume -  Deepak v.s
Resume - Deepak v.sDeepak V S
 
Natural Language to SQL Query conversion using Machine Learning Techniques on...
Natural Language to SQL Query conversion using Machine Learning Techniques on...Natural Language to SQL Query conversion using Machine Learning Techniques on...
Natural Language to SQL Query conversion using Machine Learning Techniques on...HPCC Systems
 
Resume of Derek
Resume of DerekResume of Derek
Resume of DerekDerek Xu
 
Spark SQL In Depth www.syedacademy.com
Spark SQL In Depth www.syedacademy.comSpark SQL In Depth www.syedacademy.com
Spark SQL In Depth www.syedacademy.comSyed Hadoop
 
Milan jain resume
Milan jain resumeMilan jain resume
Milan jain resumeMilan Jain
 
Resume of Jay Zabinsky
Resume of Jay ZabinskyResume of Jay Zabinsky
Resume of Jay Zabinskyjzabinsky
 
Riyas_Oracle DBA_BA_ST_Resume_Latest
Riyas_Oracle DBA_BA_ST_Resume_LatestRiyas_Oracle DBA_BA_ST_Resume_Latest
Riyas_Oracle DBA_BA_ST_Resume_LatestRiyas Mohamed
 
Elizabeth Mc Rae Resume Sf
Elizabeth Mc Rae Resume SfElizabeth Mc Rae Resume Sf
Elizabeth Mc Rae Resume Sfemmcare
 
Performance Stability, Tips and Tricks and Underscores
Performance Stability, Tips and Tricks and UnderscoresPerformance Stability, Tips and Tricks and Underscores
Performance Stability, Tips and Tricks and UnderscoresJitendra Singh
 
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...Databricks
 

Similar to Ryan-Symposium-v5 (20)

OConnell Resume
OConnell ResumeOConnell Resume
OConnell Resume
 
Shane_O'Neill_CV_slim
Shane_O'Neill_CV_slimShane_O'Neill_CV_slim
Shane_O'Neill_CV_slim
 
Krishna_IBM_Infosphere_Certified_Datastage_Consultant
Krishna_IBM_Infosphere_Certified_Datastage_Consultant Krishna_IBM_Infosphere_Certified_Datastage_Consultant
Krishna_IBM_Infosphere_Certified_Datastage_Consultant
 
Resume - Deepak v.s
Resume -  Deepak v.sResume -  Deepak v.s
Resume - Deepak v.s
 
sandhya exp resume
sandhya exp resume sandhya exp resume
sandhya exp resume
 
Natural Language to SQL Query conversion using Machine Learning Techniques on...
Natural Language to SQL Query conversion using Machine Learning Techniques on...Natural Language to SQL Query conversion using Machine Learning Techniques on...
Natural Language to SQL Query conversion using Machine Learning Techniques on...
 
Resume sailaja
Resume sailajaResume sailaja
Resume sailaja
 
Resume of Derek
Resume of DerekResume of Derek
Resume of Derek
 
Taming the shrew Power BI
Taming the shrew Power BITaming the shrew Power BI
Taming the shrew Power BI
 
Nitin Paliwal
Nitin PaliwalNitin Paliwal
Nitin Paliwal
 
Spark SQL In Depth www.syedacademy.com
Spark SQL In Depth www.syedacademy.comSpark SQL In Depth www.syedacademy.com
Spark SQL In Depth www.syedacademy.com
 
Milan jain resume
Milan jain resumeMilan jain resume
Milan jain resume
 
Resume of Jay Zabinsky
Resume of Jay ZabinskyResume of Jay Zabinsky
Resume of Jay Zabinsky
 
Riyas_Oracle DBA_BA_ST_Resume_Latest
Riyas_Oracle DBA_BA_ST_Resume_LatestRiyas_Oracle DBA_BA_ST_Resume_Latest
Riyas_Oracle DBA_BA_ST_Resume_Latest
 
Resume_of_Vasudevan - Hadoop
Resume_of_Vasudevan - HadoopResume_of_Vasudevan - Hadoop
Resume_of_Vasudevan - Hadoop
 
Elizabeth Mc Rae Resume Sf
Elizabeth Mc Rae Resume SfElizabeth Mc Rae Resume Sf
Elizabeth Mc Rae Resume Sf
 
Performance Stability, Tips and Tricks and Underscores
Performance Stability, Tips and Tricks and UnderscoresPerformance Stability, Tips and Tricks and Underscores
Performance Stability, Tips and Tricks and Underscores
 
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...
 
Bhaviyaa Bhagawan Resume
Bhaviyaa Bhagawan ResumeBhaviyaa Bhagawan Resume
Bhaviyaa Bhagawan Resume
 
Resume_of_sayeed
Resume_of_sayeedResume_of_sayeed
Resume_of_sayeed
 

Ryan-Symposium-v5

  • 1. Building a Requirements Management System for Software Requirements Kevin Ryan Honolulu Community College Rob Nelson, Peter Konohia, Des Iorgova Akimeka, LLC
  • 2. • Akimeka is a software development firm that specializes in IT solutions focused for the Department of Defense.  Services Include:  Enterprise Architecture  Cloud Infrastructure  Virtualization  Web-Based Technologies  Database Management  Software Development  Network Engineering  Information Security Assurance Akimeka LLC Software Company 1305 N Holopono St #3, Kihei, HI 96753 Phone: (808) 442-7100 Akimeka, LLC
  • 3. Project Context • Akimeka for many years has maintained, improved, and developed specialized computer systems and software for the DoD. • Medical Record Database Directory • User/Mobile friendly website • The development team enhances the software per requirements defined by the DoD. • Requirement has to be verifiable, clear, and concise. • Example Requirement: The input form must produce validated JSON and send output to the name server. EXCEL Example Requirement
  • 4. Project Context The input form must produce validated JSON and send output to the name server. • That requirement needs to be properly documented in a Requirements Management System • The current system is a hybrid of MS Excel sheets and a MS Word document • Difficult to analyze requirements • Traceability is an issue • Potential for inconsistency and duplication. EXCEL Example Requirement MS Word Example Requirement
  • 5. Project Objective • The objective of the Requirements Management System is to store baselined software requirements from the DoD. • By doing this we will improve on… • Requirements analysis • Data accuracy (No duplicates) • Search capability (Query displays all pertaining records) • Traceability • Consistency
  • 6. My Project • My project was to create a Extract, Transform, & Load application that completely and accurately moved data from MS Excel spreadsheets into Oracle database tables. • The ETL Process will structure the data and baseline software requirements into the Requirements Management System. • ETL Goals: • All records are migrated to the database. • Values that are “N/A” will be replaced to NULL. • Uses SQL Scripts to compare tables so that no duplicate data will be baselined or processed.
  • 7. Custom Java vs Pentaho Data Integration
  • 8. Custom Java vs Pentaho Data Integration Criteria Custom Java Code Pentaho Data Integration (Kettle) The solution must be able to extract all data from the excel spreadsheet. Yes Yes The solution must map data to the specific key value pairs in the data dictionary for JSON output. Yes Yes The solution must map data to the specific column name in the data dictionary for SQL output. Yes Yes The solution must be able to write output as JSON. Yes Yes The solution must be able to write output as SQL. Yes Yes The solution must be able to adapt to changes quickly in the event that datatypes change. No Yes Can we plan design, build, implement, test, and document the solution within 5 weeks? No Yes
  • 9. Custom Java vs Pentaho Data Integration Criteria Custom Java Code Pentaho Data Integration (Kettle) The solution must be able to extract all data from the excel spreadsheet. Yes Yes The solution must map data to the specific key value pairs in the data dictionary for JSON output. Yes Yes The solution must map data to the specific column name in the data dictionary for SQL output. Yes Yes The solution must be able to write output as JSON. Yes Yes The solution must be able to write output as SQL. Yes Yes The solution must be able to adapt to changes quickly in the event that datatypes change. No Yes Can we plan design, build, implement, test, and document the solution within 5 weeks? No Yes
  • 10. Custom Java vs Pentaho Data Integration Criteria Custom Java Code Pentaho Data Integration (Kettle) The solution must be able to extract all data from the excel spreadsheet. Yes Yes The solution must map data to the specific key value pairs in the data dictionary for JSON output. Yes Yes The solution must map data to the specific column name in the data dictionary for SQL output. Yes Yes The solution must be able to write output as JSON. Yes Yes The solution must be able to write output as SQL. Yes Yes The solution must be able to adapt to changes quickly in the event that datatypes change. No Yes Can we plan design, build, implement, test, and document the solution within 5 weeks? No Yes
  • 13. Flow Chart of the Data for ETL processing EXCEL EXCEL EXCEL
  • 14. Flow Chart of the Data for ETL processing EXCEL Staging Table EXCEL EXCEL Staging Table Staging Table
  • 15. Flow Chart of the Data for ETL processing EXCEL Staging Table EXCEL EXCEL Staging Table Staging Table Data Cleanup and Transform Draft If No Management Review
  • 16. Flow Chart of the Data for ETL processing EXCEL Staging Table EXCEL EXCEL Staging Table Staging Table Data Cleanup and Transform Draft If No Management Review Final If Yes Oracle Database
  • 17. Load raw data into staging table EXCEL Staging Table EXCEL EXCEL Staging Table Staging Table Data Cleanup and Transform Draft If No Management Review Final If Yes Oracle Database
  • 18. Pentaho Data IntegrationPass 1: Load raw data into staging table
  • 20. After Evaluating data from the Staging Table • As expected, a SQL SELECT Query will not reference the correct amount of records. • SELECT * FROM PR.STG WHERE PROJECT_NAME=‘Medical Situational Awareness In Theater (MSAT)’ • PROJECT_NAME • Medical Situational Awareness In Theater • Medical Situational Awareness In Theater (MSAT) • Medical Situation Awareness In Theater (MSAT)
  • 21. Create a Transform: Find and Replace Method STAGING TABLE REFERENCE TABLE (MAP_COLLECTED_ TO_TARGET) DRAFT TABLESQL Script SQL Script SELECTS DISTINCT VALUES FIND AND REPLACE
  • 22. Executes SQL Script to collect data for MAP_COLLECTED_TO_TARGET Table
  • 23. MAP_COLLECTED_TO_TARGET Table • Medical Situational Awareness In Theater • Medical Situational Awareness In Theater (MSAT) • Medical Situation Awareness In Theater (MSAT) RCS_COLUMN_NAME COLLECTED_VALUE TARGET_VALUE PROJECT_NAME Medical Situational Awareness In Theater Medical Situational Awareness In Theater (MSAT) PROJECT_NAME Medical Situational Awareness In Theater (MSAT) Medical Situational Awareness In Theater (MSAT) PROJECT_NAME Medical Situation Awareness In Theater Medical Situational Awareness In Theater (MSAT) RELEASE_NUMBER One.7.5.4 1.7.5.4
  • 24. Performs DATA Cleanup Utilizing the MAP_COLLECTED_TO_TARGET Table
  • 25. Flow Chart of the Data for ETL processing EXCEL Staging Table EXCEL EXCEL Staging Table Staging Table Data Cleanup and Transform Draft If No Management Review Final If Yes Oracle Database
  • 27. Results • Ingested 5608 rows of data, each requirement is unique and baselined. • Structured • Accurate • Searchable • SELECT COUNT (*) FROM RM.REQUIREMENTS • All records are migrated to the database. • SELECT * FROM RM.REQUIREMENTS WHERE COLUMN_NAME=‘N/A’ • Values that were “N/A” was replaced to NULL. • Input validation/exception handling • Utilizes SQL Scripts to compare tables so that no duplicate data will be baselined or processed.
  • 28. Acknowledgments Akamai is led and managed by the Institute for Scientist & Engineer Educators at the University of California Santa Cruz, in partnership with the University of Hawaii Institute for Astronomy. Funding for the 2015 Akamai Internship and Mentor Program is provided by: Thirty Meter Telescope International Observatory, THINK Fund at the Hawaii Community Foundation, University of Hawaii System, University of Hawaii at Hilo, National Science Foundation (AST#1347767), and National Solar Observatory. Akimeka Rob Nelson Peter Konohia Des Iorgova Akamai Workforce Initiative Lisa Hunter Austin Barnes Jerome Shaw David Harrington Mike Nassir