The document describes NTT's strategy for performing live upgrades of OpenStack without downtime. It discusses pre-upgrade investigation, considerations for the upgrade procedure, testing the upgrade process, and results. The key aspects covered are migrating user resources, upgrading components in a specific order while blocking requests, and evaluating the upgrade to ensure no impact on users or their API calls. Some issues identified included errors from Active/Standby switches and RPC API version mismatches between components.
Rolta’s application testing services for handling ever changing environment. Rolta
There are many changes take place every day every minute. The changes in form of updates, upgrades, patches and many more, to handle these everyday changes and alleviate testing pressure, Rolta introduces Real Application Testing (RAT). RAT comes in real handy when Oracle’s applications like SQL performance analyzer and Database Replay are in use. Presentation also gives examples of couple of test cases.
SAP HANA 2 SPS02 Upgrade and Dynamic Tiering ImplementationLinh Nguyen
This is a compilation of notes taken while doing the upgrade of HANA 2 SPS00 server and COCKPIT to SPS02, followed by installation/implementation of Dynamic Tiering.
with Enhanced SAP HANA Monitoring by IT-Conductor
Rolta’s application testing services for handling ever changing environment. Rolta
There are many changes take place every day every minute. The changes in form of updates, upgrades, patches and many more, to handle these everyday changes and alleviate testing pressure, Rolta introduces Real Application Testing (RAT). RAT comes in real handy when Oracle’s applications like SQL performance analyzer and Database Replay are in use. Presentation also gives examples of couple of test cases.
SAP HANA 2 SPS02 Upgrade and Dynamic Tiering ImplementationLinh Nguyen
This is a compilation of notes taken while doing the upgrade of HANA 2 SPS00 server and COCKPIT to SPS02, followed by installation/implementation of Dynamic Tiering.
with Enhanced SAP HANA Monitoring by IT-Conductor
SQL Tuning made easier with SQLTXPLAIN (SQLT)Carlos Sierra
If you have experienced the frustration of dealing with an application transaction that takes too long to execute, and which may be affecting your company's business, then it is possible you have one or more SQL statements that may need some performance tuning. This session covers capabilities, installation and use of the free SQL Tuning tool: SQLTXPLAIN.
This session is an introduction to SQLTXPLAIN (SQLT) as a tool to aid in the complex task of SQL Tuning. It assumes nothing. From what is SQLT and its features, to how it is implemented and used, this presentation gives you a good idea how you can benefit of this free tool. As an additional value, since Oracle Support has been requesting the output of this tool on most Query Tuning Service Requests (SR), by learning how to use the right SQLT method, you may help to expedite the resolution of your own SR! During this session you will see how SQLT is installed and how the most common methods - XTRACT and XECUTE are used. A navigation through the main output is also part of this SQLT introductory session.
If you are a DBA that besides many other things is also responsible of Performance issues, or you are a Developer that has to find answers when a SQL performs poorly, you may consider attending this session. SQL Tuning is complex, so simply by collecting the right information on a timely manner is a big help. Nothing to lose when a free tool is available for you to try and use. No strings attached!
Hitchhiker's Guide to free Oracle tuning toolsBjoern Rost
Instance and SQL tuning with EM12c Cloud Control is so easy, it is not even much fun
anymore. Also, not every customer may have the appropriate license or database
edition, or all you have available remotely is a command-line login to a database.
This presentation showcases a few open-source database tuning tools such as Snapper
and ASH replacements that DBAs can use to gather and review metrics and wait events
from the command line and even in standard edition.
My presentation and Live DEMO at "Oracle Hardware Solutions Event". Demonstrated live Java Server Loader (JSL) on Solaris Sparc T5-2 Server and did performance tests (CPU, Memory, Network Socket) instantly at session duration.
Artifacts:
Java Server Loader (JSL)
WLSDM (WebLogic Smart Dashboard and Monitoring)
Oracle WebLogic (12.2.1)
Oracle Solaris 11.3
Java 1.8_071
Configuring and Monitoring HANA Extension NodeLinh Nguyen
Continuing the warm data management exploration, we implemented and tested another feature – the HANA Extension Node, which exists in scale-out landscape as a slave node, for use in either Native HANA or in BW scenarios
Smart monitoring how does oracle rac manage resource, state ukoug19Anil Nair
An important requirement for HA and to provide scalability is to detect problems and resolve them quickly before the user sessions get affected. Oracle RAC along with its Family of Solutions work together cohesively to detect conditions such as "Un-responsive Instances", Network issues quickly and resolve them by either redirecting the work to other instances or redundant network paths
How a Developer can Troubleshoot a SQL performing poorly on a Production DBCarlos Sierra
This session is about some free small scripts you can execute from SQL*Plus, which provide you with some basics about a SQL statement, like the Execution Plan from multiple child cursors, their SQL Monitor report if your site has the Tuning Pack, and some useful Active Session History (ASH) summaries for your SQL if your site has the Diagnostics Pack. And if you have neither the Tuning nor the Diagnostics Pack then you may want to learn about some alternatives to collect important performance metrics.
Haj 4308-open jpa, eclipselink, and the migration toolkitKevin Sutter
Our InterConnect 2017 session on OpenJPA, EclipseLink, and the WebSphere Migration Toolkit. WebSphere has extended it's support for JPA by including the Reference Implementation (EclipseLink) in support of the JPA 2.1 specification. Learn about the gotchas with migrating from OpenJPA to EclipseLink.
Oracle SQL tuning with SQL Plan ManagementBjoern Rost
Regression in SQL plans are a frequent cause for performance related incidents when the cost-based optimizer comes up with a new plan due to changes in data distribution, statistics, or binds. While most organizations have very strict processes for changes to applications or infrastructure, the CBO is most often left alone, accepting that SQL execution performance could change at any time. But with SQL Plan Management it does not take much effort to implement a process that makes changes to SQL plans manageable. It starts with monitoring regression in execution times, capturing baselines, auto pre-evaluating potentially better plans, and documenting information needed to accept the change. We will not only cover how SPM works, but also how you can start using it in your organization today.
This joint webinar for DBmaestro (www.dbmaestro.com)and Delphix discuss the synergy between Delphix’s Database Virtualiztion and DBmaestro’s Database Enforced Change Management solutions.
The session discuss the challenges in database development and show in practice how Database Enforced
Change Management and Database Virtualization work together to create a version control, branching and merging method that addresses these challenges.
Any DBA from beginner to advanced level, who wants to fill in some gaps in his/her knowledge about Performance Tuning on an Oracle Database, will benefit from this workshop.
In this talk from the OpenStack Summit in Barcelona (October 2016), we explain the appetite for skipping OpenStack upstream releases in production deployments, talk about when skipping releases might or might not be right for your cloud deployment, and demonstrate how we successfully upgraded OpenStack clouds from IceHouse to Kilo (skipping Juno) and Kilo to Mitaka (skipping Liberty).
SQL Tuning made easier with SQLTXPLAIN (SQLT)Carlos Sierra
If you have experienced the frustration of dealing with an application transaction that takes too long to execute, and which may be affecting your company's business, then it is possible you have one or more SQL statements that may need some performance tuning. This session covers capabilities, installation and use of the free SQL Tuning tool: SQLTXPLAIN.
This session is an introduction to SQLTXPLAIN (SQLT) as a tool to aid in the complex task of SQL Tuning. It assumes nothing. From what is SQLT and its features, to how it is implemented and used, this presentation gives you a good idea how you can benefit of this free tool. As an additional value, since Oracle Support has been requesting the output of this tool on most Query Tuning Service Requests (SR), by learning how to use the right SQLT method, you may help to expedite the resolution of your own SR! During this session you will see how SQLT is installed and how the most common methods - XTRACT and XECUTE are used. A navigation through the main output is also part of this SQLT introductory session.
If you are a DBA that besides many other things is also responsible of Performance issues, or you are a Developer that has to find answers when a SQL performs poorly, you may consider attending this session. SQL Tuning is complex, so simply by collecting the right information on a timely manner is a big help. Nothing to lose when a free tool is available for you to try and use. No strings attached!
Hitchhiker's Guide to free Oracle tuning toolsBjoern Rost
Instance and SQL tuning with EM12c Cloud Control is so easy, it is not even much fun
anymore. Also, not every customer may have the appropriate license or database
edition, or all you have available remotely is a command-line login to a database.
This presentation showcases a few open-source database tuning tools such as Snapper
and ASH replacements that DBAs can use to gather and review metrics and wait events
from the command line and even in standard edition.
My presentation and Live DEMO at "Oracle Hardware Solutions Event". Demonstrated live Java Server Loader (JSL) on Solaris Sparc T5-2 Server and did performance tests (CPU, Memory, Network Socket) instantly at session duration.
Artifacts:
Java Server Loader (JSL)
WLSDM (WebLogic Smart Dashboard and Monitoring)
Oracle WebLogic (12.2.1)
Oracle Solaris 11.3
Java 1.8_071
Configuring and Monitoring HANA Extension NodeLinh Nguyen
Continuing the warm data management exploration, we implemented and tested another feature – the HANA Extension Node, which exists in scale-out landscape as a slave node, for use in either Native HANA or in BW scenarios
Smart monitoring how does oracle rac manage resource, state ukoug19Anil Nair
An important requirement for HA and to provide scalability is to detect problems and resolve them quickly before the user sessions get affected. Oracle RAC along with its Family of Solutions work together cohesively to detect conditions such as "Un-responsive Instances", Network issues quickly and resolve them by either redirecting the work to other instances or redundant network paths
How a Developer can Troubleshoot a SQL performing poorly on a Production DBCarlos Sierra
This session is about some free small scripts you can execute from SQL*Plus, which provide you with some basics about a SQL statement, like the Execution Plan from multiple child cursors, their SQL Monitor report if your site has the Tuning Pack, and some useful Active Session History (ASH) summaries for your SQL if your site has the Diagnostics Pack. And if you have neither the Tuning nor the Diagnostics Pack then you may want to learn about some alternatives to collect important performance metrics.
Haj 4308-open jpa, eclipselink, and the migration toolkitKevin Sutter
Our InterConnect 2017 session on OpenJPA, EclipseLink, and the WebSphere Migration Toolkit. WebSphere has extended it's support for JPA by including the Reference Implementation (EclipseLink) in support of the JPA 2.1 specification. Learn about the gotchas with migrating from OpenJPA to EclipseLink.
Oracle SQL tuning with SQL Plan ManagementBjoern Rost
Regression in SQL plans are a frequent cause for performance related incidents when the cost-based optimizer comes up with a new plan due to changes in data distribution, statistics, or binds. While most organizations have very strict processes for changes to applications or infrastructure, the CBO is most often left alone, accepting that SQL execution performance could change at any time. But with SQL Plan Management it does not take much effort to implement a process that makes changes to SQL plans manageable. It starts with monitoring regression in execution times, capturing baselines, auto pre-evaluating potentially better plans, and documenting information needed to accept the change. We will not only cover how SPM works, but also how you can start using it in your organization today.
This joint webinar for DBmaestro (www.dbmaestro.com)and Delphix discuss the synergy between Delphix’s Database Virtualiztion and DBmaestro’s Database Enforced Change Management solutions.
The session discuss the challenges in database development and show in practice how Database Enforced
Change Management and Database Virtualization work together to create a version control, branching and merging method that addresses these challenges.
Any DBA from beginner to advanced level, who wants to fill in some gaps in his/her knowledge about Performance Tuning on an Oracle Database, will benefit from this workshop.
In this talk from the OpenStack Summit in Barcelona (October 2016), we explain the appetite for skipping OpenStack upstream releases in production deployments, talk about when skipping releases might or might not be right for your cloud deployment, and demonstrate how we successfully upgraded OpenStack clouds from IceHouse to Kilo (skipping Juno) and Kilo to Mitaka (skipping Liberty).
In our recent webinar hosted by Mike Current, a member of the Hyland Upgrade Council, and Mark Hamilton, DataBank's Infrastructure Engineer, we expanded on how upgrading OnBase offers the ability to not only gain enhancements and fixes, but also radically improve the security, stability and architecture of your entire OnBase environment.
In this presentation you will...
1. Learn the formula for upgrade success with actionable items to work through right away
2. Understand the team needed to get the job done and how DataBank can step in to help
3. The importance of establishing a test environment and more
You can also watch the full webinar here: http://info.databankimx.com/Upgrade-Webinar-RCD.html
Download the Hyland 3rd Part Compatibility Matrix from slide #25 here: http://info.databankimx.com/rs/167-SSD-475/images/Third%20Party%20Product%20Compatibility%20Matrix.pdf
The attached presentation is meant to serve as a readiness material for the SQL Server DBAs who are preparing to upgrade their SQL Server instance to SQL 2008 R2 or SQL 2012
A DevOps Journey - An experience report after 6 years of implementing DevOps and Continuous Delivery in Frende Forsikring, a small insurance company in Norway.
As a service provider, Rackspace is constantly bringing new OpenStack capacity online. In this session, we will detail a myriad of challenges around adding new compute capacity. These include: planning, automation, organizational, quality assurance, monitoring, security, networking, integration, and more.
Load testing is an important part of the performance engineering process. It remains the main way to ensure appropriate performance and reliability in production. It is important to see a bigger picture beyond stereotypical, last-moment load testing. There are multiple dimensions of load testing: environment, load generation, testing approach, life-cycle integration, feedback and analysis. This paper discusses these dimensions and how load testing tools support them.
In these slides you will be able to learn about:
1. Traditional Network Upgrades
2. Controller Upgrade CI/CD Toolsets
3. Data and Control Layer Separation
4. Challenges with OpenFlow Hitless Upgrade
5. Controller APP Change
6. Controller Infrastructure
7. No pipeline change
8. Node Upgrades
9. Controller & Application Upgrades
10. Multi Site Cluster/Controller groups
Guide to continuous delivery and the journey wix.com had made transitioning to DevOps and continuous delivery culture making ~100 production changes daily
The DevOps model is rapidly transforming IT operations and development practices. But what are the precursors necessary to implement DevOps? To achieve an agile, virtualized, and highly automated IT environment, what technological requirements need to be in place? OpenStack has the potential to facilitate DevOps implementation and practices at several different layers in the data center. In this session we'll quickly discuss what DevOps is, then discuss many components that are logically required to move towards DevOps in your environment. Finally we'll explore in depth several ways OpenStack can provide these baseline components.
Watch the DevNet 1104 replay from the Cisco Live On-Demand Library at: https://www.ciscolive.com/online/connect/sessionDetail.ww?SESSION_ID=92695&backBtn=true
Check out more and register for Cisco DevNet: http://ow.ly/jCNV3030OfS
Improving DevOps through Cloud Automation and Management - Real-World Rocket ...Ostrato
Explore how DevOps processes can be made more efficient through improved service delivery and cloud automation. Check out this real-world example to see how Chef and Ostrato helped OpenWhere, a geospatial analytics startup, compete in the hyper-competitive defense marketplace.
Chef allows enterprises like OpenWhere to automate infrastructure deployments to accelerate and simplify the development process. Ostrato’s cloud management platform enables enterprises to control costs and institute governance in hybrid cloud environments.
This talk provides an introduction to the OpenStack Interop Working Group, what it does, and how it works. We'll also look into some upcoming new work, such as the development of vertical programs (e.g. for clouds being built for NFV or other specific use cases).
Similar to Openstack upgrade without_down_time_20141103r1 (20)
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...Orkestra
UIIN Conference, Madrid, 27-29 May 2024
James Wilson, Orkestra and Deusto Business School
Emily Wise, Lund University
Madeline Smith, The Glasgow School of Art
Acorn Recovery: Restore IT infra within minutesIP ServerOne
Introducing Acorn Recovery as a Service, a simple, fast, and secure managed disaster recovery (DRaaS) by IP ServerOne. A DR solution that helps restore your IT infra within minutes.
0x01 - Newton's Third Law: Static vs. Dynamic AbusersOWASP Beja
f you offer a service on the web, odds are that someone will abuse it. Be it an API, a SaaS, a PaaS, or even a static website, someone somewhere will try to figure out a way to use it to their own needs. In this talk we'll compare measures that are effective against static attackers and how to battle a dynamic attacker who adapts to your counter-measures.
About the Speaker
===============
Diogo Sousa, Engineering Manager @ Canonical
An opinionated individual with an interest in cryptography and its intersection with secure software development.
Have you ever wondered how search works while visiting an e-commerce site, internal website, or searching through other types of online resources? Look no further than this informative session on the ways that taxonomies help end-users navigate the internet! Hear from taxonomists and other information professionals who have first-hand experience creating and working with taxonomies that aid in navigation, search, and discovery across a range of disciplines.
This presentation, created by Syed Faiz ul Hassan, explores the profound influence of media on public perception and behavior. It delves into the evolution of media from oral traditions to modern digital and social media platforms. Key topics include the role of media in information propagation, socialization, crisis awareness, globalization, and education. The presentation also examines media influence through agenda setting, propaganda, and manipulative techniques used by advertisers and marketers. Furthermore, it highlights the impact of surveillance enabled by media technologies on personal behavior and preferences. Through this comprehensive overview, the presentation aims to shed light on how media shapes collective consciousness and public opinion.
This presentation by Morris Kleiner (University of Minnesota), was made during the discussion “Competition and Regulation in Professions and Occupations” held at the Working Party No. 2 on Competition and Regulation on 10 June 2024. More papers and presentations on the topic can be found out at oe.cd/crps.
This presentation was uploaded with the author’s consent.
Competition and Regulation in Professional Services – KLEINER – June 2024 OEC...
Openstack upgrade without_down_time_20141103r1
1. Openstack Upgrade Without Down Time
November 5, 2014
Takashi Natsume, Software Innovation Center, NTT
natsume.takashi@lab.ntt.co.jp
Yankai Liu, Canonical
yankai.liu@canonical.com
2. Agenda
● Introduction
● Live Upgrade Test Strategy and Plan
○ Pre-upgrade Investigation
○ Considerations in Creating Upgrade Procedure
○ Concrete Upgrade Procedure
○ Testing
○ Upgrade Test Results and Issues
● Summary
2
4. Introduction
Who We Are:
Takashi Natsume
Takashi Natsume has been working for NTT corporation since April,
2013.I am engaged in system design of public cloud systems based on
OpenStack and functional verification of OpenStack.
Before I was engaged in performance analysis and performance
troubleshooting for systems.
Yankai Liu
Yankai Liu is the Cloud Architect at Canonical being responsible for
cloud architecture design and delivery.
I worked with NTT team to provide consultancy on the upgrade test
project.
4
5. Openstack Upgrade Overview
With the fast openstack releases rolling out, openstack upgrade becomes one
of the key operation factors for the deployments, which can be performed off-line
or live-upgrade.
For the production deployments, live upgrade is desired to achieve these
goals:
● Minimal or no down time
● Catch up the short release cycle of Openstack [1]
● Ensure the maintenance support(because of short maintenance period[2])
● Reduce the cost comparing to off-line upgrade
In this session, we will introduce how NTT designed and tested the live
upgrade from Havana to Icehouse service by service.
5
6. The Goal of NTT Cloud Live Upgrade
No impact on users’ resources usage
● Users can utilize their resources(VMs, virtual volumes,virtual networks)
that have already created or are running without any interruption during
live upgrade.
For example, VM stop and network communication interruption
● No performance problem that affects users’ resource utilization
significantly.
No impact on users’ API calls
● During live upgrade, users can use the openstack API services as usual
with:
No errors or fails
No incorrect results
No performance problem that affects users’ operations
significantly.
6
7. Upgrade environment and components
•System environment
• Built a test environment based on NTT production public cloud
system architecture
(See the figure in the next page.)
•Upgrade components
• OpenStack components
• Nova, Cinder, Glance,Neutron,Keystone,Heat
• Non-openstack components such as MySQL, RabbitMQ、Load
balancer(ldirector) and OS were NOT included.
•Upgrade version
• Stable/havana(2014.2.2) to icehouse-1(nova, icehouse-3)
7
8. System Architecture Built for Upgrade Testing
Active/Active: processes that do not retain their state
Active/Standby: processes that retain their state No HA(single): hypervisor hosts
Processes that receive REST API requests can be blocked by deploying load balancers in front of them.
OS: Ubuntu Server 12.04 LTS
8
10. NTT Cloud Live Upgrade Test Strategy and Plan
Overall Strategy
● Step-by-step(Rolling) upgrade is needed for live upgrade
● Openstack components co-exist on different versions
Live Upgrade Test Plan
1. Pre-upgrade investigation: items that should be considered in
advance
2. Considerations in creating details procedure
3. Concrete upgrade procedure
4. Testing
5. Upgrade Test results and issues
10
12. Pre-Investigation for Live Upgrade
A) Database schema
• Some cases that OpenStack database schemas are different
between new version and old version.
• Investigate on the DB schema changes before creating the
upgrade plans
B) Consistency of APIs between components
C) Consistency of APIs in each component.
• REST API
• RPC API
12
13. Live Upgrade Test Strategy and Plan
- Considerations in Creating Upgrade Procedure -
14. Considerations in Creating Upgrade Procedure
•User resources
• User resources that are on hosts to upgrade need be migrated to
another host.
14
15. The order of upgrade
Decide the upgrade order based on RPC API version compatibility
in the component
Process C Process B Process A
Legends:
RPC call
Server
Process
A caller is upgraded after a callee upgrade.
In this case, upgrade is performed in the
order of process A, process B and process C.
15
16. Operations Required for Step by Step Upgrade
•Blockade(Blocking requests)
• load balancer (ldirectord(LVS))
• Disable Service(nova-compute, cinder-volume)
•Check processings in progress
• Check connections at the load balancer
• e.g. glance-api
• Check child processes
• e.g. nova-novncproxy
•If a graceful shutdown function can be used, it had
better be used.
• Nova: icehouse-1 or later
• Cinder: icehouse-1 or later
• Neutron: icehouse-2 or later
• Heat: havana-3 or later(We fixed a bug in juno-1)
• Glance: No need in our environment
• Keystone: No need in our environment
16
17. Database Schema
• Change database schema at the beginning of
procedure and the end of procedure
• The beginning
• Add tables, add columns and add indexes
• The end
• Drop tables, delete columns and delete indexes
• In current nova live upgrade procedure(community), nova-conductors
are upgraded at the same time.
(New version and old version nova-conductors don’t run at the same
time.)
• Conversion of data format should be considered
• We need not convert the data format in our trial. There is no problem.
• Check codes that define the database schema sufficiently.
• For example, in nova
• nova/db/sqlalchemy/migrate_repo/versions/*
• Data conversion may be needed in some cases.
• Adding 'triggers' in database tables?
17
18. Database Schema (cont’d)
• Avoid database lock for a long time
• We can use some tools
• pt-online-schema-change[3]
• oak-online-alter-table[4]
18
19. HA Configuration
• In the point of view of live upgrade, Active/Active
configuration is better.
• But there are some cases that Active/Active cannot
be configured, so Active/Standby is forced.
• cinder-volume(depends on backends)
• Active/Active can be configured by using Ceph
(Refer to the discussion https://bugs.launchpad.
net/cinder/+bug/1280367)
• While Active/Active setup can’t be supported by all the drivers.
https://bugs.launchpad.net/cinder/+bug/1322190
• neutron-server(depends on plugin)
• neutron-l3-agent/neutron-dhcp-agent
• nova-consoleauth
• heat-engine(but multiple engine function has been implemented in
icehouse-2.)
19
20. HA Configuration (cont’d)
•In Active/Active case(controller)
• At Load balancer, blocking the node which is in the upgrade process
•In Active/Standby case
• When switching Active/Standby, there is service down time of the
component as expected.
20
21. Upgrade Procedure by HA Configuration
Active/Active configuration
Block
requests/connections
to target host
Migrate users’
reources
Upgrade host
Unblock
Repeat on each target hosts
No HA(Single)
Block requests
to target host
Migrate users’
reources
Upgrade host
Unblock
Active/Standby configuration
Upgrade
‘Standby’ host
Block requests
to ‘Active’ host
(if possible)
Switch
Active/Standby
Unblock
Repeat on each target hosts Repeat on each target hosts
21
23. System Architecture Built for Upgrade Testing
Active/Active: processes that do not retain their state
Active/Standby: processes that retain their state No HA(single): hypervisor hosts
Processes that receive REST API requests can be blocked by deploying load balancers in front of them.
OS: Ubuntu Server 12.04 LTS
23
26. Create test plans, test tools and test data
•Background workload during upgrade test
• Background workload(API requests) covered patterns of calls
between components and between processes in components in our
use case.
• Network communication(ping)
• North-South
• East-West
• Remain VNC console connected during upgrade test
26
27. Build a test environment
•Build a test environment
• Same configurations as a production environment
• HA configuration(Active/Active, Active/Standby) required.
• In order to repeat upgrade testing, we constructed the
environment to get back easily by using chef.
27
28. Execute(Test) the procedure
•Evaluation criteria
• No impact on users’ resources
• Users can utilize their resources(VMs, virtual volumes,virtual
networks) that have already created or are running without any
interruption.
• No performance problem that affects users’ resource utilization
significantly.
• No impact on users’ API calls
• No error
• No ‘wrong’ results
• No performance problem that affects users’ operations significantly
• Operation step does not need a lot of time
• Consistency between records that OpenStack manages and actual
resources.
28
29. Live Upgrade Test Strategy and Plan
- Upgrade Test results and issues -
30. Identify issues
•Solved issues
• Heat Graceful shutdown issue
• NTT team fixed it in juno-1
• https://bugs.launchpad.net/heat/+bug/1304244
•Remaining issues
• Errors due to Active/Standby switchover
• Volume Resource creation failure(ERROR state)
• Errors due to mismatch of RPC API major versions
• From nova-compute to nova-consoleauth
• From nova-novncproxy to nova-consoleauth
Communication interruption (expected to be resolved in Juno)
• Neutron-l3-agent
• Changing ‘admin_state_up’ of neutron-l3-agent to False solves
‘scheduling’ issue, but communication interruption occurred.
• Interruption of the console connection
• VM live migration/nova-novncproxy upgrade
• Impossible to fallback after changing DB schema at the beginning
30
31. Lesson learns
•Clean install
• Some source code directories/files should be removed during the
upgrade and fallback. Otherwise it will cause errors and issues.
• When overwriting openstack components’ files, errors occurred.
• AttributeError: type object 'foo' has no attribute 'bar'
31
33. Summary
● The goal of the upgrade test is to achieve the upgrade without down
time.But there were some issues to prevent us from achieving
upgrade openstack without down time.
● During our upgrade test, the down time of the services including:
○ Network downtime
■ neutron-l3-agent (expected to be fixed in Juno)
● Trade-off between the new vRouter creation failure and VM
communication, e.g. a few of minutes downtime to schedule the
new vRouter creation OR a few of minutes communication
interruption for some VMs communication
○ Some API requests downtime during the Active/Standby switchover
● Neutron server
● Heat engine
● Cinder volume
○ Nova instance console connection interruption
■ Need reconnect or Need getting console url again.
33
34. Suggestions for communities
• Cinder-volume drivers Active/Active HA support
• Presently some drivers for commercial products prevent from configuring
Active/Active
• Consistency of RPC API major versions
• 1 version rolling upgrade is (limited) supported in Nova.
• It should be considered in all core projects.
• If OpenStack components utilize oslo.messaging, errors caused by RPC API
major version difference might occur during live upgrade.
• Seamless console connection
• There is a discussion In Juno summit for console seamless migration [5]
• Consider live upgrade in REST API versions deprecation
• SDN controller Active/Active HA support should be considered when
integrating into Neutron as a plugin
• Although Ceilometer is not in the test scope, there are still gaps to support
Active/Active HA
• Graceful shutdown of all services
34