SlideShare a Scribd company logo
The Inevitable Unicode 
Project 
CONTENTS 
The Inevitable Unicode Project ................ 1 
What is Unicode?................................... 2 
Why should I adopt Unicode? .................. 2 
Planning System Resources for Unicode 
System ................................................. 3 
What should you know about Endians in 
Unicode conversion? .............................. 4 
What resources do I need to convert my 
non-Unicode system to Unicode?............. 4 
Converting MDMP systems to Unicode 
means more complexity? ........................ 5 
Unicode conversion project phases .......... 7 
A. Planning ........................................ 7 
B. Execution....................................... 7 
C. Finalization .................................... 7 
I) Unicode Preparation steps ................ 7 
II) Unicode conversion......................... 9 
III) Post-conversion steps....................10 
Summary .............................................10 
About the Author ...................................10 
Tikkana Akurat i , UnicodeSAP.com 
Upgrade & Unicode Special ist 
The Inevitable Unicode Project Page 1 of 10
What is Unicode? 
Following description from Unicode.org best 
describes what Unicode is. 
“Computers store letters and other 
characters by assigning a number for each 
one. Unicode provides a unique number for 
every character, no matter what the platform, 
no matter what the program, no matter what 
the language. “ 
The old standard code pages have the 
following disadvantages: 
They cover only a subset of all 
characters 
Different codepages have 
incompatibilities between each other. 
Data exchange is restricted between 
code pages 
There are simply too many code pages. 
Unicode has the capability to support all the 
languages in the world in one code page! It 
supports 65,000 characters and has room to 
support an additional 1 million characters. 
Why should I adopt 
Unicode? 
Whether you like it or not Unicode is here to 
stay and you will be lured to adopt Unicode, 
because of following reasons and situations: 
ECC6 (NW70) and after, new 
installations will only be possible with 
Unicode. 
If your company is supporting multiple 
regions of the world, in most probability 
you might be using MDMP in your SAP 
system. MDMP systems deploy more 
than one system code page on the 
application server. This has the flaw that 
the data would become corrupt if a user 
logs onto one language and tries to 
work with data in another language. 
Multiple Display/Multiple Processing 
(MDMP) support is dropped by SAP 
starting with NW2004s. 
Many companies are adopting Service 
Oriented Architecture standards 
providing Web Services to enable global 
interoperability. These standards require 
Unicode. 
SAP provides full Unicode support 
starting from Web Application Server 
(Web AS) 6.20. Current releases of 
NetWeaver and mySAP Business Suite 
run Unicode-enabled. 
Unicode is mandatory for SAP systems 
deploying Java applications. 
The Inevitable Unicode Project Page 2 of 10
Planning System 
Resources for Unicode 
System 
Plan for the increased computing resources 
for your system to run Unicode converted 
system. I put together a few figures below to 
give a visual perception of the increased 
demand on resources. 
CPU: Increase CPU power by upto 
+30% due to double byte handling 
in Unicode 
Figure 1 Increased CPU requirement for 
Unicode systems 
Memory: Increase Memory 
Consumption by +50% as Unicode 
SAP application servers are based 
on UTF-16 internally. 
Figure 2 Increased Memory requirement for 
Unicode systems 
DB Size: Increase database size by 
10% to 30% depending on which 
codepage representation used by 
the Database. 
Figure 3 Increased disk size requirement for 
databases in Unicode systems 
This is because, in a Unicode system 
one character is not always equal to 1 
byte. For example in UTF16 (A fixed 
length Unicode Transformation Format, 
16 bit encoding), 1 character = 2 bytes 
and in UTF-8, 1 character = 1 to 4 bytes. 
Note: In my experience, even though it 
is recommended to plan for the 
increased database size, the demand 
for additional disk space for the 
database is not seen immediately during 
the conversion. In fact, there is usually a 
reduction of the database size between 
10% to 30% which can be attributed to 
the implicit database reorganization 
during the export / import process and 
better use of Oracle extents into new 
reduced tablespaces. 
Network: No impact on network due 
to efficient compression techniques 
employed by SAP between DB and 
application servers. 
The Inevitable Unicode Project Page 3 of 10
What should you know 
about Endians in Unicode 
conversion? 
When converting to Unicode, the export 
code page must correspond to the 
Endianness of the target system. But what is 
an Endian, and how many Endians are out 
there? Fortunately there are only two 
Endians. The word “Endian” has to do with 
where the most significant byte (MSB) 
comes first or in other words where big “end” 
comes first. If the MSB comes at the lowest 
address and the least significant byte at the 
highest address then you are dealing with 
Big Endian. Little Endian is the other type of 
arrangement of MSB ordering. That is if the 
least significant byte of the number is stored 
in memory at the lowest address and the 
most significant byte at the highest address 
then it is Little Endian. An example would be 
the following: 
0xAABBCC would be stored as follows: 
Address 
Big 
Endian 
Little 
Endian 
2 CC AA 
1 BB BB 
0 AA CC 
The export code page should match the 
Endianness of the machine that runs R3load 
to import data. 
Here is a table that helps which code page 
to use: 
Code 
page 
Endian 
type Machine Architecture 
4103 
Little 
Endian 
Alpha, Intel X86 (and 
clones),X86_64, Itanium 
(Windows+Linux),Solaris_X86_64 
4102 
Big 
Endian 
IBM 390, AS/400, PowerPC 
(AIX),Linux on zSeries (S/390), 
Linux on Power, Solaris_SPARC, 
HP PA-RISC, Itanium (HP-UX) 
What resources do I need 
to convert my non-Unicode 
system to Unicode? 
Unicode conversion projects need thorough 
planning and meticulous execution. A good 
project plan can help you immensely in the 
long run. For large systems’ conversion 
process, you need at least one more system 
that is similar to your non-Unicode system. 
Your non-Unicode system is usually called 
the Source and other system is called the 
Target. You would convert and export your 
source system to dump files on a shared 
disk space and import into the target system. 
The export and import above can happen in 
parallel if you use Unicode conversion tools 
like Migration Monitor (MigMon) and/or 
Distribution Monitor (DistMon) to speed up 
the process. Good experience using these 
tools is essential for a successful conversion. 
The Inevitable Unicode Project Page 4 of 10
Here is a quick list of the essential resources 
you need: 
1) Additional system for preparing an 
empty target Unicode system. (You will 
import your Unicode converted data into 
this system). 
2) Additional disk space for common 
export area and for running language 
scans for a source MDMP system. 
3) ABAP team to convert your non – 
Unicode programs to Unicode 
compatibility. 
4) Language team to map the vocabulary 
in different languages in your system to 
proper languages/codepages. 
5) Testing team to test interfaces and/or 
scenario based verifications. 
6) A dedicated Unicode conversion 
specialist. (If you are going to use 
CU&UC – Combined Upgrade and 
Unicode Conversion, then Knowledge of 
Upgrades is also essential). 
Converting MDMP systems 
to Unicode means more 
complexity? 
Unicode preparation of MDMP systems has 
additional steps of scanning the database 
for vocabulary that needs to be mapped to 
different languages. These scans can take a 
long time depending on the size and nature 
of your MDMP system. The assignment of 
languages to the collected vocabulary can 
be done through automatical methods and 
manual effort. SAP provides tools like 
SPUMG / SPUM4 and SUMG for such 
vocabulary work. The vocabulary mapping is 
often an underestimated task in the MDMP-Unicode 
projects. To give an idea of how 
long this manual task of vocabulary work 
may take, here is an example. 
In one MDMP – Unicode conversion project 
following were Vocabulary work related 
resource numbers: 
Database size: 2 TB 
Vocabulary collected: 183,475 unique words 
Vocabulary team size: 10 
Time taken to map all vocabulary: 4 weeks 
The Inevitable Unicode Project Page 5 of 10
Figure 4 Screenshot of vocabulary screen from SPUMG transaction in a MDMP Unicode 
conversion project. 
The Inevitable Unicode Project Page 6 of 10
Unicode conversion project 
phases 
The project mainly consists of three phases: 
A. Planning 
B. Execution or Unicode conversion 
C. Finalization 
A. Planning 
In this phase, you start building your project 
team, check the pre-requisites for Unicode 
conversion, gather requirements for system 
resources, plan the downtime for conversion 
and start enabling customer developments 
to Unicode compatibility. 
B. Execution 
This is the phase where the conversion 
happens and where the export / import of 
your non-Unicode system happen during a 
downtime. 
The project phase “B. Execution of Unicode 
conversion” also referred to as “Realization” 
phase is the longest phase in the project. 
There are three main activities that should 
happen in this phase: 
I. Unicode preparation steps 
o Vocabulary fixes 
o ABAP program fixes 
o 3rd party tools and interfaces fixes 
II. Unicode conversion 
III. Post-conversion steps 
C. Finalization 
During this phase your Unicode system is 
given to teams for functional, technical 
testing and validation. The testing can 
involve custom testing scripts focusing on 
different languages and the vocabulary that 
exists in your system. 
Test and re-test. You should validate your 
system with respect to the ABAP changes 
that happened during the project and all 
interfaces that connect to your Unicode 
system. A “scenario based” verification 
methodology can be employed to validate 
data consistency. 
I) Unicode Preparation steps 
Activities in this phase are performed on the 
source system. There are three major 
activities in this phase that take most of the 
time. 
Vocabulary fixes: You should collect the 
vocabulary in your system that needs 
determination of the codepage they 
should belong to. The language/ 
vocabulary scans are performed in this 
phase which can take up considerable 
computer time to run. See Figure 5 for 
the scans that you will need to go 
through in transaction SPUMG (or 
SPUM4 if your source system is 46C 
MDMP). Once the vocabulary is 
scanned the vocabulary work is 
performed by your language/vocabulary 
team in this phase. 
The Inevitable Unicode Project Page 7 of 10
Basis Tip: Plan for extra space to increase 
PSAPTEMP and PSAPUNDO tablespaces 
for the scans to complete without fail. 
ABAP program fixes: you should also 
enable all your ABAP programs to 
Unicode compatible syntax in this phase. 
The ABAP enablement of your 
programs can take considerable time as 
well depending on how many custom 
programs you have in the system. SAP 
provides the transaction UCCHECK to 
identify all the programs that need to be 
adjusted for Unicode compatibility. 
3rd party tools and interfaces fixes: you 
should also address the compatibility of 
your future Unicode system with the 
existing interfaces and third party tools. 
Thorough testing should be done for 
compatibility issues with a test Unicode 
system in your environment. Some of 
the often found third party tools are 
Vertex, BSI, RWD, BMC and IXOS. Find 
Unicode compatible releases of these to 
install in your Unicode system. 
Figure 5 SPUMG transaction screenshot showing all the tabs for the 
language scans in a Unicode conversion of an MDMP system. 
The Inevitable Unicode Project Page 8 of 10
II) Unicode conversion 
This phase involves export and import of 
your non-Unicode system to the target 
Unicode system; hence the system 
downtime begins in this phase. 
It is best to use tools like Distribution Monitor, 
Migration Monitor, Table Splitter, OrderBy 
and Package Splitter to speed up the 
conversion process. Please note that the 
actual Unicode conversion of your data 
happens during the export by R3load. Plan 
for extra disk space for the export dump files 
and other Unicode tools to be stored. 
Figure 6 Depiction of two application servers performing parallel export/ 
import from Source DB to Target DB 
The Inevitable Unicode Project Page 9 of 10
III) Post-conversion steps 
This phase involves steps after the import of 
the Unicode data into your target system. 
The steps are performed in the target 
system. If your source system is MDMP, 
then your language team will need to 
perform final vocabulary adjustments in a 
transaction called SUMG on the target 
system. 
Summary 
Unicode conversion projects are often 
under-estimated. A Unicode conversion 
project should be treated the same if not 
more complex than a typical Upgrade 
project. A high level view of the Unicode 
project and some key project steps were 
given in this article. There are many topics 
that you will come across during an Unicode 
Conversion project (for example archiving, 
data reduction, performance tuning for 
reduced conversion downtime, adoption of 
new tablespace naming convention and/or 
custom tablespace names etc.,). The side 
benefits of a Unicode project are reduced 
number of tablespaces and a defragmented 
database. A unicode conversion enables 
your company to benefit from new 
technologies and support all languages in 
the world. 
Resources 
> https://service.sap.com/Unicode 
> https://service.sap.com/globalization 
About the Author 
Tikkana Akurati holds a Masters degree in Computer and Information Sciences. He has 20 years of 
experience in IT. Tikkana has played many roles in the area of SAP Basis for more than 11 years. His 
current focus is on SAP upgrades and Unicode conversions. He has completed numerous Unicode 
conversion projects as an independent consultant and for SAP America. 
He can be reached at Ph:847-804-2970 or via email to Tikkana@yahoo.com or 
Tikkana@UnicodeSAP.com 
The Inevitable Unicode Project Page 10 of 10

More Related Content

What's hot

What is sap client
What is sap clientWhat is sap client
What is sap client
nanda nanda
 
How to use abap cds for data provisioning in bw
How to use abap cds for data provisioning in bwHow to use abap cds for data provisioning in bw
How to use abap cds for data provisioning in bw
Luc Vanrobays
 
Sap basis made easy
Sap basis made easySap basis made easy
Sap basis made easy
Durga Balaji M
 
62620940 charm-configuration-procedures
62620940 charm-configuration-procedures62620940 charm-configuration-procedures
62620940 charm-configuration-procedures
narendar99
 
SAP S4HANA : Learn From Our Implementation Journey
SAP S4HANA : Learn From Our Implementation JourneySAP S4HANA : Learn From Our Implementation Journey
SAP S4HANA : Learn From Our Implementation Journey
Anup Lakra
 
SAP Document Management System Integration with Content Servers
SAP Document Management System Integration with Content Servers SAP Document Management System Integration with Content Servers
SAP Document Management System Integration with Content Servers
Verbella CMG
 
BW Migration to HANA Part 3 - Post-processing on the Migrated System
BW Migration to HANA Part 3 - Post-processing on the Migrated SystemBW Migration to HANA Part 3 - Post-processing on the Migrated System
BW Migration to HANA Part 3 - Post-processing on the Migrated System
Linh Nguyen
 
Migration scenarios RISE with SAP S4HANA Cloud, Private Edition - Version #1....
Migration scenarios RISE with SAP S4HANA Cloud, Private Edition - Version #1....Migration scenarios RISE with SAP S4HANA Cloud, Private Edition - Version #1....
Migration scenarios RISE with SAP S4HANA Cloud, Private Edition - Version #1....
Yevilina Rizka
 
Fut1441 SUSE Linux Enterprise Server for SAP Solutions
Fut1441  SUSE Linux Enterprise Server for SAP SolutionsFut1441  SUSE Linux Enterprise Server for SAP Solutions
Fut1441 SUSE Linux Enterprise Server for SAP Solutions
Christian Holsing
 
Ragavendiran's Resume
Ragavendiran's ResumeRagavendiran's Resume
Ragavendiran's Resume
Ragavendiran Kulothungan
 
4G troubleshooting
4G troubleshooting4G troubleshooting
4G troubleshooting
Thang Dang
 
Sap Change And Transport Management
Sap Change And Transport ManagementSap Change And Transport Management
Sap Change And Transport Management
Ravi Jain
 
Gsm rf-optimization
Gsm rf-optimizationGsm rf-optimization
Gsm rf-optimization
karimfeel
 
ORACLE HA NFS over Oracle ASM
ORACLE HA NFS over Oracle ASMORACLE HA NFS over Oracle ASM
ORACLE HA NFS over Oracle ASM
Pierrick Guicheteau
 
SAP System copy
SAP System copySAP System copy
SAP System copy
ashish_bbd
 
Implementing parallel processing in abap
Implementing parallel processing in abapImplementing parallel processing in abap
Implementing parallel processing in abap
Noman Mohamed Hanif
 
Introductionto SDN
Introductionto SDN Introductionto SDN
Introductionto SDN
Md. Shariful Islam Robin
 
SUCCESSFUL CHARM IMPLEMENTATION IN A VALIDATED ENVIRONMENT
SUCCESSFUL CHARM IMPLEMENTATION IN A VALIDATED ENVIRONMENTSUCCESSFUL CHARM IMPLEMENTATION IN A VALIDATED ENVIRONMENT
SUCCESSFUL CHARM IMPLEMENTATION IN A VALIDATED ENVIRONMENT
Alpha Sirius
 
Sap hana client_installation_update_guide_en
Sap hana client_installation_update_guide_enSap hana client_installation_update_guide_en
Sap hana client_installation_update_guide_en
Shobha Nand Kumar
 
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...Overview and Walkthrough of the Application Programming Model with SAP Cloud ...
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...
SAP Cloud Platform
 

What's hot (20)

What is sap client
What is sap clientWhat is sap client
What is sap client
 
How to use abap cds for data provisioning in bw
How to use abap cds for data provisioning in bwHow to use abap cds for data provisioning in bw
How to use abap cds for data provisioning in bw
 
Sap basis made easy
Sap basis made easySap basis made easy
Sap basis made easy
 
62620940 charm-configuration-procedures
62620940 charm-configuration-procedures62620940 charm-configuration-procedures
62620940 charm-configuration-procedures
 
SAP S4HANA : Learn From Our Implementation Journey
SAP S4HANA : Learn From Our Implementation JourneySAP S4HANA : Learn From Our Implementation Journey
SAP S4HANA : Learn From Our Implementation Journey
 
SAP Document Management System Integration with Content Servers
SAP Document Management System Integration with Content Servers SAP Document Management System Integration with Content Servers
SAP Document Management System Integration with Content Servers
 
BW Migration to HANA Part 3 - Post-processing on the Migrated System
BW Migration to HANA Part 3 - Post-processing on the Migrated SystemBW Migration to HANA Part 3 - Post-processing on the Migrated System
BW Migration to HANA Part 3 - Post-processing on the Migrated System
 
Migration scenarios RISE with SAP S4HANA Cloud, Private Edition - Version #1....
Migration scenarios RISE with SAP S4HANA Cloud, Private Edition - Version #1....Migration scenarios RISE with SAP S4HANA Cloud, Private Edition - Version #1....
Migration scenarios RISE with SAP S4HANA Cloud, Private Edition - Version #1....
 
Fut1441 SUSE Linux Enterprise Server for SAP Solutions
Fut1441  SUSE Linux Enterprise Server for SAP SolutionsFut1441  SUSE Linux Enterprise Server for SAP Solutions
Fut1441 SUSE Linux Enterprise Server for SAP Solutions
 
Ragavendiran's Resume
Ragavendiran's ResumeRagavendiran's Resume
Ragavendiran's Resume
 
4G troubleshooting
4G troubleshooting4G troubleshooting
4G troubleshooting
 
Sap Change And Transport Management
Sap Change And Transport ManagementSap Change And Transport Management
Sap Change And Transport Management
 
Gsm rf-optimization
Gsm rf-optimizationGsm rf-optimization
Gsm rf-optimization
 
ORACLE HA NFS over Oracle ASM
ORACLE HA NFS over Oracle ASMORACLE HA NFS over Oracle ASM
ORACLE HA NFS over Oracle ASM
 
SAP System copy
SAP System copySAP System copy
SAP System copy
 
Implementing parallel processing in abap
Implementing parallel processing in abapImplementing parallel processing in abap
Implementing parallel processing in abap
 
Introductionto SDN
Introductionto SDN Introductionto SDN
Introductionto SDN
 
SUCCESSFUL CHARM IMPLEMENTATION IN A VALIDATED ENVIRONMENT
SUCCESSFUL CHARM IMPLEMENTATION IN A VALIDATED ENVIRONMENTSUCCESSFUL CHARM IMPLEMENTATION IN A VALIDATED ENVIRONMENT
SUCCESSFUL CHARM IMPLEMENTATION IN A VALIDATED ENVIRONMENT
 
Sap hana client_installation_update_guide_en
Sap hana client_installation_update_guide_enSap hana client_installation_update_guide_en
Sap hana client_installation_update_guide_en
 
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...Overview and Walkthrough of the Application Programming Model with SAP Cloud ...
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...
 

Similar to The inevitable unicode conversion project white paper by tikkana

test4
test4test4
test4
test4test4
test3
test3test3
Web
WebWeb
ELC-E 2016 Neil Armstrong - No, it's never too late to upstream your legacy l...
ELC-E 2016 Neil Armstrong - No, it's never too late to upstream your legacy l...ELC-E 2016 Neil Armstrong - No, it's never too late to upstream your legacy l...
ELC-E 2016 Neil Armstrong - No, it's never too late to upstream your legacy l...
Neil Armstrong
 
Database Migration Assistant for Unicode (DMU)
Database Migration Assistant for Unicode (DMU)Database Migration Assistant for Unicode (DMU)
Database Migration Assistant for Unicode (DMU)
Ludovico Caldara
 
Linux-Based Data Acquisition and Processing On Palmtop Computer
Linux-Based Data Acquisition and Processing On Palmtop ComputerLinux-Based Data Acquisition and Processing On Palmtop Computer
Linux-Based Data Acquisition and Processing On Palmtop Computer
IOSR Journals
 
Linux-Based Data Acquisition and Processing On Palmtop Computer
Linux-Based Data Acquisition and Processing On Palmtop ComputerLinux-Based Data Acquisition and Processing On Palmtop Computer
Linux-Based Data Acquisition and Processing On Palmtop Computer
IOSR Journals
 
Infrastructure student
Infrastructure studentInfrastructure student
Infrastructure student
John Scrugham
 
Chap 4 hardware & software
Chap 4 hardware & softwareChap 4 hardware & software
Chap 4 hardware & software
UMaine
 
Making Networking Apps Scream on Windows with DPDK
Making Networking Apps Scream on Windows with DPDKMaking Networking Apps Scream on Windows with DPDK
Making Networking Apps Scream on Windows with DPDK
Michelle Holley
 
Custom Software Development Checklist by Michael Cordova
Custom Software Development Checklist by Michael CordovaCustom Software Development Checklist by Michael Cordova
Custom Software Development Checklist by Michael Cordova
hoolikar77
 
LKGtoPG - Basics of C Language
LKGtoPG - Basics of  C LanguageLKGtoPG - Basics of  C Language
LKGtoPG - Basics of C Language
lkgtopg jobs
 
Core java kvr - satya
Core  java kvr - satyaCore  java kvr - satya
Core java kvr - satya
Satya Johnny
 
Overcoming software development challenges by using an integrated software fr...
Overcoming software development challenges by using an integrated software fr...Overcoming software development challenges by using an integrated software fr...
Overcoming software development challenges by using an integrated software fr...
Design World
 
Abinaya-Resume
Abinaya-ResumeAbinaya-Resume
Abinaya-Resume
Abinaya Devarasu
 
SLC ASP.NET Framework and BPM (Eng)
SLC ASP.NET Framework and BPM (Eng)SLC ASP.NET Framework and BPM (Eng)
SLC ASP.NET Framework and BPM (Eng)
Selcuk Celik
 
Ovp Introduction Presentation (04 Feb 10)
Ovp Introduction Presentation (04 Feb 10)Ovp Introduction Presentation (04 Feb 10)
Ovp Introduction Presentation (04 Feb 10)
simon56
 
Report hospital
Report hospitalReport hospital
Report hospital
Rachit Gaur
 
Nanaji_Jonnadula
Nanaji_JonnadulaNanaji_Jonnadula
Nanaji_Jonnadula
Nanaji Jonnadula
 

Similar to The inevitable unicode conversion project white paper by tikkana (20)

test4
test4test4
test4
 
test4
test4test4
test4
 
test3
test3test3
test3
 
Web
WebWeb
Web
 
ELC-E 2016 Neil Armstrong - No, it's never too late to upstream your legacy l...
ELC-E 2016 Neil Armstrong - No, it's never too late to upstream your legacy l...ELC-E 2016 Neil Armstrong - No, it's never too late to upstream your legacy l...
ELC-E 2016 Neil Armstrong - No, it's never too late to upstream your legacy l...
 
Database Migration Assistant for Unicode (DMU)
Database Migration Assistant for Unicode (DMU)Database Migration Assistant for Unicode (DMU)
Database Migration Assistant for Unicode (DMU)
 
Linux-Based Data Acquisition and Processing On Palmtop Computer
Linux-Based Data Acquisition and Processing On Palmtop ComputerLinux-Based Data Acquisition and Processing On Palmtop Computer
Linux-Based Data Acquisition and Processing On Palmtop Computer
 
Linux-Based Data Acquisition and Processing On Palmtop Computer
Linux-Based Data Acquisition and Processing On Palmtop ComputerLinux-Based Data Acquisition and Processing On Palmtop Computer
Linux-Based Data Acquisition and Processing On Palmtop Computer
 
Infrastructure student
Infrastructure studentInfrastructure student
Infrastructure student
 
Chap 4 hardware & software
Chap 4 hardware & softwareChap 4 hardware & software
Chap 4 hardware & software
 
Making Networking Apps Scream on Windows with DPDK
Making Networking Apps Scream on Windows with DPDKMaking Networking Apps Scream on Windows with DPDK
Making Networking Apps Scream on Windows with DPDK
 
Custom Software Development Checklist by Michael Cordova
Custom Software Development Checklist by Michael CordovaCustom Software Development Checklist by Michael Cordova
Custom Software Development Checklist by Michael Cordova
 
LKGtoPG - Basics of C Language
LKGtoPG - Basics of  C LanguageLKGtoPG - Basics of  C Language
LKGtoPG - Basics of C Language
 
Core java kvr - satya
Core  java kvr - satyaCore  java kvr - satya
Core java kvr - satya
 
Overcoming software development challenges by using an integrated software fr...
Overcoming software development challenges by using an integrated software fr...Overcoming software development challenges by using an integrated software fr...
Overcoming software development challenges by using an integrated software fr...
 
Abinaya-Resume
Abinaya-ResumeAbinaya-Resume
Abinaya-Resume
 
SLC ASP.NET Framework and BPM (Eng)
SLC ASP.NET Framework and BPM (Eng)SLC ASP.NET Framework and BPM (Eng)
SLC ASP.NET Framework and BPM (Eng)
 
Ovp Introduction Presentation (04 Feb 10)
Ovp Introduction Presentation (04 Feb 10)Ovp Introduction Presentation (04 Feb 10)
Ovp Introduction Presentation (04 Feb 10)
 
Report hospital
Report hospitalReport hospital
Report hospital
 
Nanaji_Jonnadula
Nanaji_JonnadulaNanaji_Jonnadula
Nanaji_Jonnadula
 

Recently uploaded

Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
kuntobimo2016
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
zsjl4mimo
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
AlessioFois2
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
74nqk8xf
 

Recently uploaded (20)

Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 

The inevitable unicode conversion project white paper by tikkana

  • 1. The Inevitable Unicode Project CONTENTS The Inevitable Unicode Project ................ 1 What is Unicode?................................... 2 Why should I adopt Unicode? .................. 2 Planning System Resources for Unicode System ................................................. 3 What should you know about Endians in Unicode conversion? .............................. 4 What resources do I need to convert my non-Unicode system to Unicode?............. 4 Converting MDMP systems to Unicode means more complexity? ........................ 5 Unicode conversion project phases .......... 7 A. Planning ........................................ 7 B. Execution....................................... 7 C. Finalization .................................... 7 I) Unicode Preparation steps ................ 7 II) Unicode conversion......................... 9 III) Post-conversion steps....................10 Summary .............................................10 About the Author ...................................10 Tikkana Akurat i , UnicodeSAP.com Upgrade & Unicode Special ist The Inevitable Unicode Project Page 1 of 10
  • 2. What is Unicode? Following description from Unicode.org best describes what Unicode is. “Computers store letters and other characters by assigning a number for each one. Unicode provides a unique number for every character, no matter what the platform, no matter what the program, no matter what the language. “ The old standard code pages have the following disadvantages: They cover only a subset of all characters Different codepages have incompatibilities between each other. Data exchange is restricted between code pages There are simply too many code pages. Unicode has the capability to support all the languages in the world in one code page! It supports 65,000 characters and has room to support an additional 1 million characters. Why should I adopt Unicode? Whether you like it or not Unicode is here to stay and you will be lured to adopt Unicode, because of following reasons and situations: ECC6 (NW70) and after, new installations will only be possible with Unicode. If your company is supporting multiple regions of the world, in most probability you might be using MDMP in your SAP system. MDMP systems deploy more than one system code page on the application server. This has the flaw that the data would become corrupt if a user logs onto one language and tries to work with data in another language. Multiple Display/Multiple Processing (MDMP) support is dropped by SAP starting with NW2004s. Many companies are adopting Service Oriented Architecture standards providing Web Services to enable global interoperability. These standards require Unicode. SAP provides full Unicode support starting from Web Application Server (Web AS) 6.20. Current releases of NetWeaver and mySAP Business Suite run Unicode-enabled. Unicode is mandatory for SAP systems deploying Java applications. The Inevitable Unicode Project Page 2 of 10
  • 3. Planning System Resources for Unicode System Plan for the increased computing resources for your system to run Unicode converted system. I put together a few figures below to give a visual perception of the increased demand on resources. CPU: Increase CPU power by upto +30% due to double byte handling in Unicode Figure 1 Increased CPU requirement for Unicode systems Memory: Increase Memory Consumption by +50% as Unicode SAP application servers are based on UTF-16 internally. Figure 2 Increased Memory requirement for Unicode systems DB Size: Increase database size by 10% to 30% depending on which codepage representation used by the Database. Figure 3 Increased disk size requirement for databases in Unicode systems This is because, in a Unicode system one character is not always equal to 1 byte. For example in UTF16 (A fixed length Unicode Transformation Format, 16 bit encoding), 1 character = 2 bytes and in UTF-8, 1 character = 1 to 4 bytes. Note: In my experience, even though it is recommended to plan for the increased database size, the demand for additional disk space for the database is not seen immediately during the conversion. In fact, there is usually a reduction of the database size between 10% to 30% which can be attributed to the implicit database reorganization during the export / import process and better use of Oracle extents into new reduced tablespaces. Network: No impact on network due to efficient compression techniques employed by SAP between DB and application servers. The Inevitable Unicode Project Page 3 of 10
  • 4. What should you know about Endians in Unicode conversion? When converting to Unicode, the export code page must correspond to the Endianness of the target system. But what is an Endian, and how many Endians are out there? Fortunately there are only two Endians. The word “Endian” has to do with where the most significant byte (MSB) comes first or in other words where big “end” comes first. If the MSB comes at the lowest address and the least significant byte at the highest address then you are dealing with Big Endian. Little Endian is the other type of arrangement of MSB ordering. That is if the least significant byte of the number is stored in memory at the lowest address and the most significant byte at the highest address then it is Little Endian. An example would be the following: 0xAABBCC would be stored as follows: Address Big Endian Little Endian 2 CC AA 1 BB BB 0 AA CC The export code page should match the Endianness of the machine that runs R3load to import data. Here is a table that helps which code page to use: Code page Endian type Machine Architecture 4103 Little Endian Alpha, Intel X86 (and clones),X86_64, Itanium (Windows+Linux),Solaris_X86_64 4102 Big Endian IBM 390, AS/400, PowerPC (AIX),Linux on zSeries (S/390), Linux on Power, Solaris_SPARC, HP PA-RISC, Itanium (HP-UX) What resources do I need to convert my non-Unicode system to Unicode? Unicode conversion projects need thorough planning and meticulous execution. A good project plan can help you immensely in the long run. For large systems’ conversion process, you need at least one more system that is similar to your non-Unicode system. Your non-Unicode system is usually called the Source and other system is called the Target. You would convert and export your source system to dump files on a shared disk space and import into the target system. The export and import above can happen in parallel if you use Unicode conversion tools like Migration Monitor (MigMon) and/or Distribution Monitor (DistMon) to speed up the process. Good experience using these tools is essential for a successful conversion. The Inevitable Unicode Project Page 4 of 10
  • 5. Here is a quick list of the essential resources you need: 1) Additional system for preparing an empty target Unicode system. (You will import your Unicode converted data into this system). 2) Additional disk space for common export area and for running language scans for a source MDMP system. 3) ABAP team to convert your non – Unicode programs to Unicode compatibility. 4) Language team to map the vocabulary in different languages in your system to proper languages/codepages. 5) Testing team to test interfaces and/or scenario based verifications. 6) A dedicated Unicode conversion specialist. (If you are going to use CU&UC – Combined Upgrade and Unicode Conversion, then Knowledge of Upgrades is also essential). Converting MDMP systems to Unicode means more complexity? Unicode preparation of MDMP systems has additional steps of scanning the database for vocabulary that needs to be mapped to different languages. These scans can take a long time depending on the size and nature of your MDMP system. The assignment of languages to the collected vocabulary can be done through automatical methods and manual effort. SAP provides tools like SPUMG / SPUM4 and SUMG for such vocabulary work. The vocabulary mapping is often an underestimated task in the MDMP-Unicode projects. To give an idea of how long this manual task of vocabulary work may take, here is an example. In one MDMP – Unicode conversion project following were Vocabulary work related resource numbers: Database size: 2 TB Vocabulary collected: 183,475 unique words Vocabulary team size: 10 Time taken to map all vocabulary: 4 weeks The Inevitable Unicode Project Page 5 of 10
  • 6. Figure 4 Screenshot of vocabulary screen from SPUMG transaction in a MDMP Unicode conversion project. The Inevitable Unicode Project Page 6 of 10
  • 7. Unicode conversion project phases The project mainly consists of three phases: A. Planning B. Execution or Unicode conversion C. Finalization A. Planning In this phase, you start building your project team, check the pre-requisites for Unicode conversion, gather requirements for system resources, plan the downtime for conversion and start enabling customer developments to Unicode compatibility. B. Execution This is the phase where the conversion happens and where the export / import of your non-Unicode system happen during a downtime. The project phase “B. Execution of Unicode conversion” also referred to as “Realization” phase is the longest phase in the project. There are three main activities that should happen in this phase: I. Unicode preparation steps o Vocabulary fixes o ABAP program fixes o 3rd party tools and interfaces fixes II. Unicode conversion III. Post-conversion steps C. Finalization During this phase your Unicode system is given to teams for functional, technical testing and validation. The testing can involve custom testing scripts focusing on different languages and the vocabulary that exists in your system. Test and re-test. You should validate your system with respect to the ABAP changes that happened during the project and all interfaces that connect to your Unicode system. A “scenario based” verification methodology can be employed to validate data consistency. I) Unicode Preparation steps Activities in this phase are performed on the source system. There are three major activities in this phase that take most of the time. Vocabulary fixes: You should collect the vocabulary in your system that needs determination of the codepage they should belong to. The language/ vocabulary scans are performed in this phase which can take up considerable computer time to run. See Figure 5 for the scans that you will need to go through in transaction SPUMG (or SPUM4 if your source system is 46C MDMP). Once the vocabulary is scanned the vocabulary work is performed by your language/vocabulary team in this phase. The Inevitable Unicode Project Page 7 of 10
  • 8. Basis Tip: Plan for extra space to increase PSAPTEMP and PSAPUNDO tablespaces for the scans to complete without fail. ABAP program fixes: you should also enable all your ABAP programs to Unicode compatible syntax in this phase. The ABAP enablement of your programs can take considerable time as well depending on how many custom programs you have in the system. SAP provides the transaction UCCHECK to identify all the programs that need to be adjusted for Unicode compatibility. 3rd party tools and interfaces fixes: you should also address the compatibility of your future Unicode system with the existing interfaces and third party tools. Thorough testing should be done for compatibility issues with a test Unicode system in your environment. Some of the often found third party tools are Vertex, BSI, RWD, BMC and IXOS. Find Unicode compatible releases of these to install in your Unicode system. Figure 5 SPUMG transaction screenshot showing all the tabs for the language scans in a Unicode conversion of an MDMP system. The Inevitable Unicode Project Page 8 of 10
  • 9. II) Unicode conversion This phase involves export and import of your non-Unicode system to the target Unicode system; hence the system downtime begins in this phase. It is best to use tools like Distribution Monitor, Migration Monitor, Table Splitter, OrderBy and Package Splitter to speed up the conversion process. Please note that the actual Unicode conversion of your data happens during the export by R3load. Plan for extra disk space for the export dump files and other Unicode tools to be stored. Figure 6 Depiction of two application servers performing parallel export/ import from Source DB to Target DB The Inevitable Unicode Project Page 9 of 10
  • 10. III) Post-conversion steps This phase involves steps after the import of the Unicode data into your target system. The steps are performed in the target system. If your source system is MDMP, then your language team will need to perform final vocabulary adjustments in a transaction called SUMG on the target system. Summary Unicode conversion projects are often under-estimated. A Unicode conversion project should be treated the same if not more complex than a typical Upgrade project. A high level view of the Unicode project and some key project steps were given in this article. There are many topics that you will come across during an Unicode Conversion project (for example archiving, data reduction, performance tuning for reduced conversion downtime, adoption of new tablespace naming convention and/or custom tablespace names etc.,). The side benefits of a Unicode project are reduced number of tablespaces and a defragmented database. A unicode conversion enables your company to benefit from new technologies and support all languages in the world. Resources > https://service.sap.com/Unicode > https://service.sap.com/globalization About the Author Tikkana Akurati holds a Masters degree in Computer and Information Sciences. He has 20 years of experience in IT. Tikkana has played many roles in the area of SAP Basis for more than 11 years. His current focus is on SAP upgrades and Unicode conversions. He has completed numerous Unicode conversion projects as an independent consultant and for SAP America. He can be reached at Ph:847-804-2970 or via email to Tikkana@yahoo.com or Tikkana@UnicodeSAP.com The Inevitable Unicode Project Page 10 of 10