SlideShare a Scribd company logo
1 of 10
EMT – EIM Migration Tool ©
Data Migration Solution Overview - Version 1.2
External Adapter Integration : Cassandra Adapter
Product names, designations, logos, and symbols may be trademarks or registered trademarks of their respective owners.
• EMT is the result of many years of experiences in complex and performance demanding data migration projects providing a Migration
Solution decoupled and independent from the migration strategies.
• EMT addresses organizations to deliver a rapid and successful project implementation due to the capability of combining a migration
software layer with a proven Process Development Methodology:
• Data Extraction, Data Validation and Data Transformation processes can be developed using EMT procedures and EMT PL-SQL
templates.
• EMT Process Development Methodology allows achieving the best performances during the parallel execution.
• Upload processes are orchestrated by EMT engine and do not require development activities.
• EMT uses migration interface tables called “EIM” to execute and distribute CRUD operations into multiple target systems.
• EIM processes are integrated with:
• Siebel Enterprise Server
• Oracle Database Server
• Apache Cassandra (*)
(*) Integrated via “EMT External Adapter Interface”: Additional Adapters with a common interface can be used to upload different target systems.
EIM Migration Tool © (EMT)
“EMT is a Software Solution designed to reduce costs and complexity of large scale data migration projects”
EMT - Architecture
EMT can be integrated in many Migration Architectures since is fully configurable.
EMT engine is developed in PL-SQL and runs on Oracle © Database
Server (Version 10.0.6.2 or above).
A single EMT instance is able to upload a maximum number of 5
Target Systems using the EIM upload component.
The usage of dedicated EMT Oracle Database Server (Staging Area
DB) is highly recommended for Operational Architecture:
• Staging database allows executing data extraction, data
validation and data transformation processes without
transactional impacts on the target databases.
Multiple EMT instances can be installed on the same Oracle
Database Server (Non Operational Architecture).
• Automatic migration process (Start, Stop, Restart Migration flow).
• High performance and scalability:
• Data Distribution
• Pipeline Process Execution
• Parallel Processing.
• Deadlock Prevention.
• Low Data Contention.
• High quality process log.
• Hierarchy structures processing (E.g. Large Account Corporation, Corporate Discount, ..)
• Error Handling and Tracking of migration execution results.
• Automatic Recovery and Rollback.
• Migration Execution Statistics, Performance and Execution Reports.
• Configuration Checks.
• Configuration Versioning and Deploy.
EMT – Main Features
Thanks to EMT parallel processing:
Migration performances are ruled only by the
migration process with lower performance rate.
EMT Cassandra Adapter
Java Class implemented using DataStax © Driver 3.6.0
• It’s executed by EMT engine as EIM process enabling Parallel Execution mode and multiple EIM tasks.
• It uses EIM tables as interface to Cassandra Tables executing the following operations:
• Insert
• Update
• Upsert (Insert or Update)
• Delete (Rollback)
• Reconcile
• Updates the execution result for each record.
EMT - Cassandra Adapter
Process: UPL_EIM_CASS_1
Operation: Insert
Parallel Degree: 3
Target: Table_Cass1
B1
EIM
Task 1
• BATCH_NUM
1-10
EIM
Task 2
• BATCH_NUM
11-20
EIM
Task 3
• BATCH_NUM
21-30
Task 1 – EIM Batch 1..10 - EMT Cassandra Adapter OP: Insert
Task 2 – EIM Batch 11..20 - EMT Cassandra Adapter OP: Insert
Task 3 – EIM Batch 21..30 - EMT Cassandra Adapter OP: Insert
EIM Task Execution
1. EMT Cassandra Adapter is executed by EIM tasks based on the EIM Process
configuration.
2. Retrieve the Default Target Configuration or specific Process Configuration (Override
Default Configuration) to build Cassandra Cluster interface:
• Connection Configuration Policy
• Authentication
• Target Keyspace
• Load Balance Policies
• Pooling Options
• Session Options
• Consistency Level
• Synchronous/Asynchronous execution
• Skip Null Values (Evict Tombstones)
• Time To Live
• Protocol and Socket
• SSL
• Compression
• Timeouts
3. Build Cassandra Cluster Interface and establish connection.
4. Read Source EIM records.
5. Prepare statement and execute Cassandra operation in Synchronous or
Asynchronous mode (Insert, Update, Upsert, Delete, Reconcile).
6. Update Execution Results on EIM Table.
EMT Cassandra Adapter – Architecture & Flow
• Translation consists in defining an ORACLE Base Table with 1:1 Cassandra Column Mapping and converting Cassandra Datatypes into ORACLE
Datatypes. Primary Key must be the same for both tables.
• EMT Cassandra Adapter uses pre-built implicit datatype Cast solving (whenever possible) the conversion between ORACLE Datatypes and
Cassandra Datatypes.
• In most of the cases, the implicit EMT datatype Cast do not require additional mapping configuration activity.
• Using EMT procedures, EIM table or Multiple EIM tables are generated to define the mapping between EMT and Cassandra.
• EIM mapping Tables are created into a dedicated Schema. Base tables are used only for EIM table creation.
EMT Cassandra Adapter – Data Mapping
Data Mapping is implemented translating Cassandra Tables into ORACLE tables.
EMT Cassandra Adapter uses additional Data Mapping Configuration to manage the following scenarios:
• Cassandra Long Names and Oracle Keywords
• Cassandra Target Tables Name or Column Identifier can exceed 35 characters or conflicts with Oracle keywords (E.g. Column Name “ID” ).
In those cases and additional configuration must be used:
• Table Long Name Mapping (Oracle Table Name  Cassandra Table Name)
• Column Long Name Mapping (Oracle Column Name  Cassandra Column Name)
• Cassandra Functions
• Target Columns updated with Cassandra functions are configured using the Cassandra column name and the Cassandra function.
Those columns are not mapped into EIM table columns.
• E.g. 1: CassandraTable.timestamp_column  toUnixTimestamp(now())
• E.g. 2: CassandraTable.counter_likes  counter_likes + 1
• EMT Cast Functions
• EMT Datatype Cast functions are used to define the datatype mapping for Columns overriding the implicit mapping:
• E.g.1 : EIMTable1.duration_column  EMT_DUR_SEC() - Cast Duration from nanoseconds (Default) to seconds.
• EMT_CQL()
• EMT_CQL() function enables CQL parse to process target columns having data types:
• Collection (List, Map, Set)
• User Defined Types
E.g.1 : EIMTable1.column_map  EMT_CQL()
EIM_TABLE1.COLUMN_MAP - Type VARCHAR2 or (N)CLOB CASSANADRA_TABLE1.COLUMN_MAP - Type map<int, text>
--------------------------------------------------------------------------------------  ------------------------------------------------------------------------------------------
{ 1000: 'Element Map N.1', 1001: 'Element Map N.2‘ } { 1000: 'Element Map N.1', 1001: 'Element Map N.2‘ }
EMT Cassandra Adapter – Data Mapping
Information in this document is subject to change without notice and may contain inaccuracies or typographical errors.
No part of this document may be reproduced, stored in, or introduced into a retrieval system, or transmitted in any form or by any
means (electronic, mechanical, photocopying, recording, or otherwise), or for any purpose, without the express written permission of
Gecob It Consulting.
Except as expressly provided in any written license agreement from Gecob It Consulting, the furnishing of this document does not give
you any license to trademarks, copyrights, or other intellectual property.
Product names, designations, logos, and symbols may be trademarks or registered trademarks of their respective owners.
For further information please contact us at the following e-mail address: emt@gecobitconsulting.com
EMT – COPYRIGHT NOTICE
EMT ©2010 Gecob It Consulting Ltd.

More Related Content

What's hot

Solving the DB2 LUW Administration Dilemma
Solving the DB2 LUW Administration DilemmaSolving the DB2 LUW Administration Dilemma
Solving the DB2 LUW Administration DilemmaRandy Goering
 
DB2 LUW Access Plan Stability
DB2 LUW Access Plan StabilityDB2 LUW Access Plan Stability
DB2 LUW Access Plan Stabilitydmcmichael
 
Xml to-object-transformer
Xml to-object-transformerXml to-object-transformer
Xml to-object-transformerSanni Kumar Pal
 
Elements in a muleflow
Elements in a muleflowElements in a muleflow
Elements in a muleflowThang Loi
 
Dynamic Load balancing Linux private Cloud (DRS)
Dynamic Load balancing Linux private Cloud (DRS)Dynamic Load balancing Linux private Cloud (DRS)
Dynamic Load balancing Linux private Cloud (DRS)kamrankausar
 
Understanding DB2 Optimizer
Understanding DB2 OptimizerUnderstanding DB2 Optimizer
Understanding DB2 Optimizerterraborealis
 
[Altibase] 10 replication part3 (system design)
[Altibase] 10 replication part3 (system design)[Altibase] 10 replication part3 (system design)
[Altibase] 10 replication part3 (system design)altistory
 
Mule debugging logging_configuration_in_mule
Mule debugging logging_configuration_in_muleMule debugging logging_configuration_in_mule
Mule debugging logging_configuration_in_mulekunal vishe
 
[Altibase] 12 replication part5 (optimization and monitoring)
[Altibase] 12 replication part5 (optimization and monitoring)[Altibase] 12 replication part5 (optimization and monitoring)
[Altibase] 12 replication part5 (optimization and monitoring)altistory
 
Oracle Database 12c features for DBA
Oracle Database 12c features for DBAOracle Database 12c features for DBA
Oracle Database 12c features for DBAKaran Kukreja
 
High Availability Options for DB2 Data Centre
High Availability Options for DB2 Data CentreHigh Availability Options for DB2 Data Centre
High Availability Options for DB2 Data Centreterraborealis
 
Optimizing Your Cloud Applications in RightScale
Optimizing Your Cloud Applications in RightScaleOptimizing Your Cloud Applications in RightScale
Optimizing Your Cloud Applications in RightScaleRightScale
 
Oracle hard and soft parsing
Oracle hard and soft parsingOracle hard and soft parsing
Oracle hard and soft parsingIshaan Guliani
 
Xml to xml transformation
Xml to xml transformationXml to xml transformation
Xml to xml transformationSon Nguyen
 
Mule transformers
Mule transformersMule transformers
Mule transformersThang Loi
 

What's hot (18)

Solving the DB2 LUW Administration Dilemma
Solving the DB2 LUW Administration DilemmaSolving the DB2 LUW Administration Dilemma
Solving the DB2 LUW Administration Dilemma
 
DB2 LUW Access Plan Stability
DB2 LUW Access Plan StabilityDB2 LUW Access Plan Stability
DB2 LUW Access Plan Stability
 
Xml to-object-transformer
Xml to-object-transformerXml to-object-transformer
Xml to-object-transformer
 
Elements in a muleflow
Elements in a muleflowElements in a muleflow
Elements in a muleflow
 
Dynamic Load balancing Linux private Cloud (DRS)
Dynamic Load balancing Linux private Cloud (DRS)Dynamic Load balancing Linux private Cloud (DRS)
Dynamic Load balancing Linux private Cloud (DRS)
 
Understanding DB2 Optimizer
Understanding DB2 OptimizerUnderstanding DB2 Optimizer
Understanding DB2 Optimizer
 
[Altibase] 10 replication part3 (system design)
[Altibase] 10 replication part3 (system design)[Altibase] 10 replication part3 (system design)
[Altibase] 10 replication part3 (system design)
 
Mule debugging logging_configuration_in_mule
Mule debugging logging_configuration_in_muleMule debugging logging_configuration_in_mule
Mule debugging logging_configuration_in_mule
 
[Altibase] 12 replication part5 (optimization and monitoring)
[Altibase] 12 replication part5 (optimization and monitoring)[Altibase] 12 replication part5 (optimization and monitoring)
[Altibase] 12 replication part5 (optimization and monitoring)
 
Oracle Database 12c features for DBA
Oracle Database 12c features for DBAOracle Database 12c features for DBA
Oracle Database 12c features for DBA
 
High Availability Options for DB2 Data Centre
High Availability Options for DB2 Data CentreHigh Availability Options for DB2 Data Centre
High Availability Options for DB2 Data Centre
 
Optimizing Your Cloud Applications in RightScale
Optimizing Your Cloud Applications in RightScaleOptimizing Your Cloud Applications in RightScale
Optimizing Your Cloud Applications in RightScale
 
Les 01 core
Les 01 coreLes 01 core
Les 01 core
 
Oracle hard and soft parsing
Oracle hard and soft parsingOracle hard and soft parsing
Oracle hard and soft parsing
 
Les 06 rec
Les 06 recLes 06 rec
Les 06 rec
 
Xml to xml transformation
Xml to xml transformationXml to xml transformation
Xml to xml transformation
 
Mule transformers
Mule transformersMule transformers
Mule transformers
 
Les 10 fl1
Les 10 fl1Les 10 fl1
Les 10 fl1
 

Similar to Cassandra Data Migration

Datastage to ODI
Datastage to ODIDatastage to ODI
Datastage to ODINagendra K
 
An introduction to Apache Camel
An introduction to Apache CamelAn introduction to Apache Camel
An introduction to Apache CamelKapil Kumar
 
Optimization SQL Server for Dynamics AX 2012 R3
Optimization SQL Server for Dynamics AX 2012 R3Optimization SQL Server for Dynamics AX 2012 R3
Optimization SQL Server for Dynamics AX 2012 R3Juan Fabian
 
AWS (Amazon Redshift) presentation
AWS (Amazon Redshift) presentationAWS (Amazon Redshift) presentation
AWS (Amazon Redshift) presentationVolodymyr Rovetskiy
 
Cloudsim & greencloud
Cloudsim & greencloud Cloudsim & greencloud
Cloudsim & greencloud nedamaleki87
 
T sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powersT sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powersShehap Elnagar
 
Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...
Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...
Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...Amazon Web Services
 
Cloudsim & Green Cloud
Cloudsim & Green CloudCloudsim & Green Cloud
Cloudsim & Green CloudNeda Maleki
 
T sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powersT sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powersShehap Elnagar
 
T sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powersT sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powersShehap Elnagar
 
Introduction to embedded System.pptx
Introduction to embedded System.pptxIntroduction to embedded System.pptx
Introduction to embedded System.pptxPratik Gohel
 
Presentación Oracle Database Migración consideraciones 10g/11g/12c
Presentación Oracle Database Migración consideraciones 10g/11g/12cPresentación Oracle Database Migración consideraciones 10g/11g/12c
Presentación Oracle Database Migración consideraciones 10g/11g/12cRonald Francisco Vargas Quesada
 
Simulating Heterogeneous Resources in CloudLightning
Simulating Heterogeneous Resources in CloudLightningSimulating Heterogeneous Resources in CloudLightning
Simulating Heterogeneous Resources in CloudLightningCloudLightning
 
All (that i know) about exadata external
All (that i know) about exadata externalAll (that i know) about exadata external
All (that i know) about exadata externalPrasad Chitta
 
Java- JDBC- Mazenet Solution
Java- JDBC- Mazenet SolutionJava- JDBC- Mazenet Solution
Java- JDBC- Mazenet SolutionMazenetsolution
 
London Redshift Meetup - July 2017
London Redshift Meetup - July 2017London Redshift Meetup - July 2017
London Redshift Meetup - July 2017Pratim Das
 
EPM Automate - Automating Enterprise Performance Management Cloud Solutions
EPM Automate - Automating Enterprise Performance Management Cloud SolutionsEPM Automate - Automating Enterprise Performance Management Cloud Solutions
EPM Automate - Automating Enterprise Performance Management Cloud SolutionsAlithya
 
Oracle Query Optimizer - An Introduction
Oracle Query Optimizer - An IntroductionOracle Query Optimizer - An Introduction
Oracle Query Optimizer - An Introductionadryanbub
 

Similar to Cassandra Data Migration (20)

Datastage to ODI
Datastage to ODIDatastage to ODI
Datastage to ODI
 
An introduction to Apache Camel
An introduction to Apache CamelAn introduction to Apache Camel
An introduction to Apache Camel
 
Teradata training
Teradata trainingTeradata training
Teradata training
 
Optimization SQL Server for Dynamics AX 2012 R3
Optimization SQL Server for Dynamics AX 2012 R3Optimization SQL Server for Dynamics AX 2012 R3
Optimization SQL Server for Dynamics AX 2012 R3
 
AWS (Amazon Redshift) presentation
AWS (Amazon Redshift) presentationAWS (Amazon Redshift) presentation
AWS (Amazon Redshift) presentation
 
Cloudsim & greencloud
Cloudsim & greencloud Cloudsim & greencloud
Cloudsim & greencloud
 
T sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powersT sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powers
 
Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...
Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...
Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...
 
Cloudsim & Green Cloud
Cloudsim & Green CloudCloudsim & Green Cloud
Cloudsim & Green Cloud
 
T sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powersT sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powers
 
T sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powersT sql performance guidelines for better db stress powers
T sql performance guidelines for better db stress powers
 
Introduction to embedded System.pptx
Introduction to embedded System.pptxIntroduction to embedded System.pptx
Introduction to embedded System.pptx
 
Presentación Oracle Database Migración consideraciones 10g/11g/12c
Presentación Oracle Database Migración consideraciones 10g/11g/12cPresentación Oracle Database Migración consideraciones 10g/11g/12c
Presentación Oracle Database Migración consideraciones 10g/11g/12c
 
Simulating Heterogeneous Resources in CloudLightning
Simulating Heterogeneous Resources in CloudLightningSimulating Heterogeneous Resources in CloudLightning
Simulating Heterogeneous Resources in CloudLightning
 
All (that i know) about exadata external
All (that i know) about exadata externalAll (that i know) about exadata external
All (that i know) about exadata external
 
2015 product java
2015 product java2015 product java
2015 product java
 
Java- JDBC- Mazenet Solution
Java- JDBC- Mazenet SolutionJava- JDBC- Mazenet Solution
Java- JDBC- Mazenet Solution
 
London Redshift Meetup - July 2017
London Redshift Meetup - July 2017London Redshift Meetup - July 2017
London Redshift Meetup - July 2017
 
EPM Automate - Automating Enterprise Performance Management Cloud Solutions
EPM Automate - Automating Enterprise Performance Management Cloud SolutionsEPM Automate - Automating Enterprise Performance Management Cloud Solutions
EPM Automate - Automating Enterprise Performance Management Cloud Solutions
 
Oracle Query Optimizer - An Introduction
Oracle Query Optimizer - An IntroductionOracle Query Optimizer - An Introduction
Oracle Query Optimizer - An Introduction
 

Recently uploaded

chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutionsmonugehlot87
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyFrank van der Linden
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 

Recently uploaded (20)

chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutions
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The Ugly
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 

Cassandra Data Migration

  • 1. EMT – EIM Migration Tool © Data Migration Solution Overview - Version 1.2 External Adapter Integration : Cassandra Adapter Product names, designations, logos, and symbols may be trademarks or registered trademarks of their respective owners.
  • 2. • EMT is the result of many years of experiences in complex and performance demanding data migration projects providing a Migration Solution decoupled and independent from the migration strategies. • EMT addresses organizations to deliver a rapid and successful project implementation due to the capability of combining a migration software layer with a proven Process Development Methodology: • Data Extraction, Data Validation and Data Transformation processes can be developed using EMT procedures and EMT PL-SQL templates. • EMT Process Development Methodology allows achieving the best performances during the parallel execution. • Upload processes are orchestrated by EMT engine and do not require development activities. • EMT uses migration interface tables called “EIM” to execute and distribute CRUD operations into multiple target systems. • EIM processes are integrated with: • Siebel Enterprise Server • Oracle Database Server • Apache Cassandra (*) (*) Integrated via “EMT External Adapter Interface”: Additional Adapters with a common interface can be used to upload different target systems. EIM Migration Tool © (EMT) “EMT is a Software Solution designed to reduce costs and complexity of large scale data migration projects”
  • 3. EMT - Architecture EMT can be integrated in many Migration Architectures since is fully configurable. EMT engine is developed in PL-SQL and runs on Oracle © Database Server (Version 10.0.6.2 or above). A single EMT instance is able to upload a maximum number of 5 Target Systems using the EIM upload component. The usage of dedicated EMT Oracle Database Server (Staging Area DB) is highly recommended for Operational Architecture: • Staging database allows executing data extraction, data validation and data transformation processes without transactional impacts on the target databases. Multiple EMT instances can be installed on the same Oracle Database Server (Non Operational Architecture).
  • 4. • Automatic migration process (Start, Stop, Restart Migration flow). • High performance and scalability: • Data Distribution • Pipeline Process Execution • Parallel Processing. • Deadlock Prevention. • Low Data Contention. • High quality process log. • Hierarchy structures processing (E.g. Large Account Corporation, Corporate Discount, ..) • Error Handling and Tracking of migration execution results. • Automatic Recovery and Rollback. • Migration Execution Statistics, Performance and Execution Reports. • Configuration Checks. • Configuration Versioning and Deploy. EMT – Main Features Thanks to EMT parallel processing: Migration performances are ruled only by the migration process with lower performance rate.
  • 6. Java Class implemented using DataStax © Driver 3.6.0 • It’s executed by EMT engine as EIM process enabling Parallel Execution mode and multiple EIM tasks. • It uses EIM tables as interface to Cassandra Tables executing the following operations: • Insert • Update • Upsert (Insert or Update) • Delete (Rollback) • Reconcile • Updates the execution result for each record. EMT - Cassandra Adapter Process: UPL_EIM_CASS_1 Operation: Insert Parallel Degree: 3 Target: Table_Cass1 B1 EIM Task 1 • BATCH_NUM 1-10 EIM Task 2 • BATCH_NUM 11-20 EIM Task 3 • BATCH_NUM 21-30 Task 1 – EIM Batch 1..10 - EMT Cassandra Adapter OP: Insert Task 2 – EIM Batch 11..20 - EMT Cassandra Adapter OP: Insert Task 3 – EIM Batch 21..30 - EMT Cassandra Adapter OP: Insert EIM Task Execution
  • 7. 1. EMT Cassandra Adapter is executed by EIM tasks based on the EIM Process configuration. 2. Retrieve the Default Target Configuration or specific Process Configuration (Override Default Configuration) to build Cassandra Cluster interface: • Connection Configuration Policy • Authentication • Target Keyspace • Load Balance Policies • Pooling Options • Session Options • Consistency Level • Synchronous/Asynchronous execution • Skip Null Values (Evict Tombstones) • Time To Live • Protocol and Socket • SSL • Compression • Timeouts 3. Build Cassandra Cluster Interface and establish connection. 4. Read Source EIM records. 5. Prepare statement and execute Cassandra operation in Synchronous or Asynchronous mode (Insert, Update, Upsert, Delete, Reconcile). 6. Update Execution Results on EIM Table. EMT Cassandra Adapter – Architecture & Flow
  • 8. • Translation consists in defining an ORACLE Base Table with 1:1 Cassandra Column Mapping and converting Cassandra Datatypes into ORACLE Datatypes. Primary Key must be the same for both tables. • EMT Cassandra Adapter uses pre-built implicit datatype Cast solving (whenever possible) the conversion between ORACLE Datatypes and Cassandra Datatypes. • In most of the cases, the implicit EMT datatype Cast do not require additional mapping configuration activity. • Using EMT procedures, EIM table or Multiple EIM tables are generated to define the mapping between EMT and Cassandra. • EIM mapping Tables are created into a dedicated Schema. Base tables are used only for EIM table creation. EMT Cassandra Adapter – Data Mapping Data Mapping is implemented translating Cassandra Tables into ORACLE tables.
  • 9. EMT Cassandra Adapter uses additional Data Mapping Configuration to manage the following scenarios: • Cassandra Long Names and Oracle Keywords • Cassandra Target Tables Name or Column Identifier can exceed 35 characters or conflicts with Oracle keywords (E.g. Column Name “ID” ). In those cases and additional configuration must be used: • Table Long Name Mapping (Oracle Table Name  Cassandra Table Name) • Column Long Name Mapping (Oracle Column Name  Cassandra Column Name) • Cassandra Functions • Target Columns updated with Cassandra functions are configured using the Cassandra column name and the Cassandra function. Those columns are not mapped into EIM table columns. • E.g. 1: CassandraTable.timestamp_column  toUnixTimestamp(now()) • E.g. 2: CassandraTable.counter_likes  counter_likes + 1 • EMT Cast Functions • EMT Datatype Cast functions are used to define the datatype mapping for Columns overriding the implicit mapping: • E.g.1 : EIMTable1.duration_column  EMT_DUR_SEC() - Cast Duration from nanoseconds (Default) to seconds. • EMT_CQL() • EMT_CQL() function enables CQL parse to process target columns having data types: • Collection (List, Map, Set) • User Defined Types E.g.1 : EIMTable1.column_map  EMT_CQL() EIM_TABLE1.COLUMN_MAP - Type VARCHAR2 or (N)CLOB CASSANADRA_TABLE1.COLUMN_MAP - Type map<int, text> --------------------------------------------------------------------------------------  ------------------------------------------------------------------------------------------ { 1000: 'Element Map N.1', 1001: 'Element Map N.2‘ } { 1000: 'Element Map N.1', 1001: 'Element Map N.2‘ } EMT Cassandra Adapter – Data Mapping
  • 10. Information in this document is subject to change without notice and may contain inaccuracies or typographical errors. No part of this document may be reproduced, stored in, or introduced into a retrieval system, or transmitted in any form or by any means (electronic, mechanical, photocopying, recording, or otherwise), or for any purpose, without the express written permission of Gecob It Consulting. Except as expressly provided in any written license agreement from Gecob It Consulting, the furnishing of this document does not give you any license to trademarks, copyrights, or other intellectual property. Product names, designations, logos, and symbols may be trademarks or registered trademarks of their respective owners. For further information please contact us at the following e-mail address: emt@gecobitconsulting.com EMT – COPYRIGHT NOTICE EMT ©2010 Gecob It Consulting Ltd.