This project optimized the Alcatel-Lucent transmission network for China Unicom Beijing to improve service protection capabilities. The engineer led a team that analyzed the original network configuration, formulated optimization plans, adjusted circuit routing over one year, and upgraded software. The key activities included developing network and routing adjustment plans, implementing optical path adjustments in stages, and testing the upgraded network's protection functions. The project enhanced the network to meet future service demands and provide automatic protection for all private lines.
1. CAREER EPISODE 3
Project Name: Beijing Unicom Alcatel-Lucent Transmission Network Optimization
Project Duration: From 10 May, 2010 to 17 May, 2011
Employer’s Name: China Unicom Beijing Long-haul Trunks Maintenance Center
Position: Engineer
INTRODUCTION
The aim of this project was to solve the service protection defect problems in the Alcatel-Lucent
Transmission Network. As the leader of this project, I was responsible for coordinating the
Alcatel-Lucent engineers and the members in Engineering Departments to complete this project.
Because of the large amount of circuit adjustment as well as the frequent turnover in my company,
the project had once suspended for a long time. It took us approximately one year to complete this
project, which involved optimization scheme formulation, circuit route adjustment, and system
acceptance. In this project, I had solved a lot of technical difficulties and submitted many
optimization schemes.
PROJECT BACKGROUND
In May 2010, Beijing Unicom Alcatel-Lucent Network had been running for three years. During
that period, there were frequent accidents and serious fault which affected almost all services in
the equipment occurred many times. Further, the locating of network problems was extremely
difficult. Because the network was constructed in tree structure, the services running on the system
could not be protected automatically and the subordinate service would be fully affected when a
fault occurred in a root node device. Meanwhile, all the new services should use SNCP (Sub
Network Connection Protection) mechanism; however, both the software and the hardware of the
original network could not provide it. Therefore, to solve the problem, the Maintenance Center
gave three requirements towards this project:
(1) Improve the band width of Alcatel-Lucent Transmission Network to meet the need of service
requirement in the future;
(2) Conduct optimization and reconstruction towards Alcatel-Lucent Transmission Network and
improve the protection ability as well as provide automatic protection for all users’ private lines
running in the system;
(3) Upgrade the software in Alcatel-Lucent equipment and install new management function.
In accordance with the requirements of the Maintenance Center, I conducted optimization towards
the original network together with employees in other departments and in other the companies.
More than ten engineers had participated in this project.
2. Picture 3-1: Project team chart
Personal Engineering Activity
1. Plan Making
As a team leader, I need make sure that the project was completed before the deadline. I made this
implementation plan so that my team members could prepare the relevant work in time.According
to the requirement of this project, I divided the working content into three parts:
Collect materials and regulate network designing method and service protection mode;
Implement optimization;
Upgrade the software in Alcatel-Lucent equipment and ensure the protection function could
be activated normally.
2.Content of Implementation
2.1 Present Network Status Analysis and Optimization Schemes drawing up
2.1.1 Present Network Status Analysis
Firstly, I collected network topology data. The original network had used Alcatel-Lucent 1678
Equipment to form the core network. The 1660R5 Equipment was connected to the optical
interface of 1678 Equipment. The transmission resources had accounted for more than 80%. When
a problem occurred to some 1678 Equipment, all the services running in 1660R5 Equipment
would be interrupted. That is to say, the potential trouble was even serious.
Reply Construction Progress
Assistance
Investment
Project Draft Superior
Department
Network Design
Project Requirement
Equipment Installation Department
Alcatel-Lucent Manufacturer
Team Leader
3. Picture 3-2: ALCATEL-LUCENT NETWORK TOPOLOGY BEFORE IMPLEMENTATION
2.1.2 Network and Synchronous Clock Source Transformation Optimization Plan
After confirming the network structure, I began to formulate optimization schemes. I decided that
the best way to solve the problem that the services could not be protected automatically was that
the network topology should be altered and the connection between different layers in the network
would have two paths to share the risk caused by the faults of 1678 Equipment. Compared with
several plans I had made, I finally chose to introduce six optical paths, A, B, C, D, J, and K
between the Alcatel-Lucent 1660R5 Equipment, whose connection was completed by other
transmission machine rooms of the Maintenance Center. All 1660R5 Equipment should be
connected with 1678 Equipment by two paths, so that the problem that the single-path connection
could not provide protection would be solved. In addition, I conducted optimization and
adjustment towards the Synchronous Clock Source of the original network.
This method could avoid interrupting users’ services and could avoid a second time cut over after
implementation. Further, it could reduce the consumption of network resource. As a result, It was
the final implementation plan.
4. Picture 3-3: ALCATEL-LUCENT NETWORK TOPOLOGY AFTER IMPLEMENTATION
2.1.3 Circuit Routing Adjustment Plan Formulation
After confirming the networking and protection mode, I began to make comparison with circuit
routing adjustment plans.
The original configuration could not provide service protection when the fault occurred to the
Alcatel-Lucent 1678 Equipment because each route of the same private line could not avoid
passing through the similar transmission equipment.
Picture 3-4: SERVICE CONFIGURATION IN ALCATEL-LUCENT NETWORK BEFORE
IMPLEMENTATION
5. In accordance with the network usage situation, I decided to share the service equally on each link
in order to reduce the service blocking probability. I distributed services to the core layer
equipment (Alcatel-Lucent 1678) in other central offices and planned the roundabout transmission
paths to activate automatic protection for some circuits.
Picture 3-5: SERVICE CONFIGURATION IN ALCATEL-LUCENT NETWORK AFTER
IMPLEMENTATION
All the changes undertaken in the network needed to conduct reconfiguration after
implementation, and followed the rule that the same service could not pass the same Alcatel-
Lucent 1678 equipment. As a consequence, the problems caused by single node failure of Alcatel-
6. Lucent 1678 equipment could be completely avoided.
As the network load had exceeded 80%, there were not enough resources that could complete
routing adjustment at one time. The adjustment needed to be completed divided into several steps.
Moreover, before implementation, there was no certain circuit configuration principle, the
configuration mode was very complex and the load of each device could not be seen clearly
through the existing data, which caused great difficulty to the carrying out of this project.
Compared with the configuration data in the network element manager of Alcatel-Lucent
Equipment and in accordance with the bearing service situation, I drew a statistical table of
routing adjustment information by myself. Later, it was completed by the factory engineers of
Alcatel-Lucent Equipment and me together. The table had recorded all the data of each running
service in the network at present, idle resources as well as the order of routing adjustment of each
step.
Table 3-1 SERVICE ROUTING
2.2 Alcatel-Lucent Network Optimization Plan Implementation
After completing service routing adjusting plan, I formulated a set of detailed implementation
plans together with Alcatel-Lucent engineers.
2.2.1 Optical Path Adjustment
Completed the core layer adjustment in Alcatel-Lucent Equipment Network and test whether it is
qualified for further routing changes.
2.2.1.1 Check before Adjustment
To make sure that the operation could be implemented, we checked the network status at the
beginning.
Examine whether there were alarms that indicated abnormalities in Alcatel-Lucent 1353NM
and 1354RM Network Manager;
Confirm the status of Synchronous Clock Source;
Prepare alternate routing resources to deal with emergency.
7. 2.2.1.2 Process of Adjustment
Conduct the replacement of the circuit boards and the connection of optical fibers as
designed;
Establish physical connection and add spare routing to the optical path to provide protection
for services;
Delete the former circuit routing in the cut over optical paths, adjust the protection routing to
the new physical connection, and complete the routing adjustment.
Because this routing adjustment involved Alcatel-Lucent 1678 Equipment, in which the optical
interface resources were insufficient and some equipment had no idle ports, I planned a way to
carry out the project. Firstly, I deleted the original circuit routing. Then the employees in
construction department adjusted the optical path. After the adjustment of the optical path, the
Alcatel-Lucent engineers and I added spare routing to the circuits. Because other equipment used
new optical interfaces after adjustment, the employees in construction department connected the
new optical paths and then Alcatel-Lucent engineers deleted the former routing of the circuits in
the cut over optical path, finally adjusted the protection routing to the new physical connection.
Specific Operation:
For the First Time
Add connection: ZAO1678-2-4---FXM1660-34
Add connection: FZ 1678-2-4---LU1660-34
Add connection: ZAO1678-3-4---FZ1678-3-4
We completed the circuit routing transformation between FXM and LU. Because the routing was
not dismantled, it would not affect the users’ services in the network.
For the Second Time
Dismantle FXM1678-3-3---FXM1660-28
Dismantle FZ1678-3-2---FZ1660-28
Dismantle ZAO1678-3-2---ZAO1660-28
Dismantle LU1678-3-3---LU1660-28
Add connection: FXM1678-3-3---FZ1660-34
Add connection: ZAO1678-3-3---FZ1678-3-2
Add connection: ZAO1660-34-LU1678-2-4
Complete the routing adjustment of the remaining circuit
2.2.1.3 Emergency Plan
According to my experience, although the procedure had been thoroughly discussed, failure would
still be possible. So I should take all scenarios into consideration. I investigated maintenance
documents and discussed with team members and finally worked out an emergency plan to
address all backout scenarios.
If the circuit interruption occurred in the process of cut over, the implementation shall be
stopped immediately and the interrupted circuit shall be recovered preferentially;
8. After the occurrence of fault, the Alcatel-Lucent Equipment engineers and the relevant
employees in other central offices of the Maintenance Center should decide whether to
continue cut over or analyze the alarm and handle the fault.
2.2.1.4 Risk Analysis
I made a risk analysis to see if the operation was feasible.
Conduct the replacement of the circuit boards and the connection of optical fibers
Operation analysis: No risks.
Establish physical connection and added the protection routing to the services which had no
protection
Operation analysis: No risks.
Delete the routing of the circuits in the original optical path and adjust the protection routing
to the new physical connections to complete the routing adjustment.
Operation analysis: No risks.
After all optimization plans were completed, I successfully completed the first stage of network
transformation project. Then the Alcatel-Lucent engineers upgraded the software in 1678
equipment to guarantee the activation of the protection function.
2.2.2 Upgrade the Software of the Equipment
There were still certain risks because the operation was taken on circuit boards that bearing
services. As a consequence, the Alcatel-Lucent engineers and I formulated upgrade plans and
carried them out together.
2.2.2.1 Update Operation Steps
a)Conduct pre-check, back up equipment data and download the software to all FLC
The new software (rpm packages) shall be downloaded to the FLC of all nodes in advance. The
core nodes shall be downloaded from NM and non-core nodes shall be downloaded from the core
nodes, which could be carried out at the same time.
b ) Start upgrade, set TE link as: Maintenance Status (The service could not start ASON
protection under the status.)
c)Second FLC installation
Stop the application software of the alternate FLC (there was no FLC redundancy at the moment)
and upgrade the second FLC at the same time. After the new software is installed, the application
software would automatically restart and the system would recover to the redundancy status. In
the process, FLC could not be protected. If the other FLC were damaged, it would need to be
replaced, and restore installation and backup (synchrony).
d)FLC Rearrangement
9. Rearrange FLC and activate new software. In the process of rearranging FLC, NM could not
monitor the network state in a short time. When SLC-A/B Configuration Reload is restarted,
FPGA would be rewritten and the time of GBE port interruption would not be more than five
minutes at most. If any faults occurred, we should reactivate the software of the old version, and
download the old hardware configuration to FLC-CONGI.
e)Installation of the Second FLC
Install a second FLC in the equipment. After the installation, the new software would
automatically restart and the system had two pieces of FLC redundancy of the new software
version. In that process, FLC could not be protected. If the other FLC were damaged, it would
need to be replaced, and restore installation and backup (synchrony).
In the whole upgrade process, the time of GBE port interruption would not last more than five
minutes at most.
In the whole upgrade process, the network manager (RM and NM) could conduct monitoring, but
any configuration changing (such as Cross Connect and TP Configuration, etc) would not be
allowed.
2.2.2.2 Probable Effects and Countermeasures
If any emergency happened, the following procedure would be useful.
In the process of 1678MCC Equipment upgrade, the network monitoring management would
not be affected, but the operation of the network element should be suspended. Meanwhile,
ASON protection would be disabled.
In the process of 1678MCC Equipment upgrade, FPGA would be rewritten; the time of GBE
port interruption would not last more than five minutes at most, which would not affect other
ports and business;
The installation and activation of the new software need ISU Software, and be conducted
according to the plans. To make sure the process would be reliable, the steps and the software
had been tested repeatedly. If any abnormality was found, Alcatel-Lucent engineers and the
employees in Maintenance Center have the authority to decide whether suspend the upgrade
or not;
In the process of upgrade, two sets of spare parts should be prepared.
After this plan was completed, I successfully completed the second stage of network
transformation project.
2.3 Alcatel-Lucent Equipment Network Protection Test
After completing the upgrade of software, Alcatel-Lucent engineers and I jointly conducted
verification towards protection capability of the network: verified the automatic protection
function of MATRIXN Circuit Packs(Cross-Connect Packs) and Service Packs with Optical
Interfaces towards system faults.
2.3.1 MATRIXN Circuit Packs Test
10. a)Observe the status of MATRIXN Circuit Packs of Alcatel-Lucent Equipment through the
network terminal. The primary state should be normal. The light on the circuit pack should be
green and the standby circuit pack should be orange.
b)MATRIXN Circuit Packs in the equipment are located at slot 23 and slot 40. Before switching
MATRIXN Circuit Packs, make records of the locations of the primary state and the standby state,
as well as the number of the equipment alarms.
c)Then switch the matrix card from main to standby by the network manager. After the switchover
between main and standby boards, observe it for 15 to 25 minutes, then switch MATRIXN Circuit
Packs again and concretely checked their state, as well as the number change of the alarms.
d)In the process of the switchover, if new alarms appeared, check and deal with them. After the
switchover, verify the alarms of the equipment and compare the number of alarms before and after
the switchover.
2.3.2 Service Packs Test:
a)Check the configuration information of Cross-Connect Packs and Service Packs. Remark the
number of the normal circuits, the number of unused circuit as well as the number of the circuit
alarms.
b)Take advantage of the laser shutoff function in the network manager to test if the SNCP
protection would function normally.
c)Monitor the services in the tested boards and observed the switching state of them and whether
the transmission was normal.
d)Wwitch on the laser in testing boards and the services switched to main path automatically, then
observe the status of the services and confirm if the users’ private lines are under normal state
and could be used normally.
2.3.3 Emergency Plan
According to my experience, although the procedure had been thoroughly discussed, failure would
still be possible. So I should take all scenarios into consideration. I investigated maintenance
documents and discussed with team members and finally worked out an emergency plan to
address all backout scenarios.
Conduct data backup of the network manager well before the circuit boards replacement;
Check equipment alarm and record it; if there was any abnormal alarm in the process of
switching circuit boards, the switching process should be suspended immediately; after
eliminating the abnormal alarm, the process could be continued;
After switching the machine panel, if there was a loss of data, Alcatel-Lucent engineers and I
would decide to start data recovery steps, and rapidly restore the system with the back-up
data to ensure the normal operation of the network during the whole process;
11. After completing the switching, check whether there are new alarms generated. If there were
any new alarm, it should be solved immediately.
Through the test, all MATRIXN Circuit Packs and Service Packs worked normally and could
complete protection action as required.
In addition, in order to ensure the operation of the network and the state of the services undertaken
by the network were normal, I configured three trial services and tested them with special
instrument. If the meter showed that there was no error code in 24 hours, tested the next
equipment; if there was error code, I would handle the problem till it could fulfill the requirement.
The test lasted almost one week and finally the network fulfilled the opening standards and the
project was completed on 17 May.
SUMMARY
In this project,I worked closely with members in other departments in my company and Alcatel-
Lucent engineers. I had given solutions to the problems encountered in the project and carried
them out, found the best way to reduce or avoid the happening of problems in Alcatel
Transmission Network, and improved the communication quality. Besides, I formulated the
planning of equipment replacement, performance improvement and system optimization for the
network. Furthermore, I completed the upgrade procedure of the software in the equipment so that
the network protection function was enhanced. Beijing Unicom praised my work in this project
and granted me 2010 Network Optimization Reward.