SlideShare a Scribd company logo
August 4th
Objective : This project is a TECHNICAL INSIGHT
of the Improvement Opportunities in the
BackUp and Restore Area for Century Link .
 Accomplishments.
 New Findings –
 Help Needed.
Initiatives
1) Developed Health Check Script & implemented health
checks of TSM SERVERS as a process every six hours.
2) TSM STORAGEPOOL Script implemented
3) Implemented Journal Based Backup to resolve the
issue on server "QTCMHPNTIAP01".
Benefits
1) Pro-active monitoring of TSM Servers Every Six Hours
– Has Reduced the Downtime Risk of TSM Servers and
improved the Back up Success Rate from
91% to 97% .
2) This script has helped in Evaluating the storage Pool
space to accommodate fresh back ups from oracle with
out having them to Fail – This has resulted in Improving
the Oracle Backup Success Rate.
3) This technic can be implemented on servers which
carry more numbers files.
This helps in reducing the Backup Window from
For Eg 2 Million Files which take 5 working days can be
backed up in approximately 2 hours.
New Findings - Streamline of TSM TAPE StorageNew Findings - Streamline of TSM TAPE Storage
UtilizationUtilization
◦ .
NODENAME
110330ELXDNP49EL
110330ELXDNP50EL
110330ELXDNT47EL
110330ELXDNT48EL
110413ELXDENVMTC057L
110413EQTDENVMDT143L
110417EQTDENVD188N
110417EQTDENVD189N
110509ERASCALSS
 We have around 573 Expired Nodes to be
removed from TSM SERVERS
 Removing expired nodes data from TSM
SERVERS will release 1000 Tapes
approximately .
 Can benefit the account from XX $$ Value.
Tape Data associated to
Decommissioned servers is not
managed (Deleted)– Resulting into
More Tapes Consumption.
Note : Not aware of any Hidden Associated Risk to delete
the nodes data
New – Findings Scope (Intel) : Configuration Issues resulting in
to Data Duplication
NTOMT78CS NTOMT78A NCLUNTOMT78AN NCLUNTOMT78SQL1N No'S'Drive AIOMB52J
QTOMASQLDB4CS QTOMASQLDB4A BCLUQTOMASQLDB4AN BCLUQTOMASQLDB4SQL1N No'S'Drive AIOMB53J
QTOMASQLDB4CS QTOMASQLDB4B BCLUQTOMASQLDB4BN No'R'Drive BCLUQTOMASQLDB4SQL2N AIOMB53J
qtomasql16acs QTOMASQL16A DCLUQTOMASQL16AN DCLUQTOMASQL16SQL1N DCLUQTOMASQL16SQL2N AIOMB54J
QTOMASQL14CS QTOMASQL14A MCLUQTOMASQL14AN MCLUQTOMASQL14SQL1N No'S'Drive AIOMB55J
QTOMASQL11CS QTOMASQL11A DCLUQTOMASQL11AN DCLUQTOMASQL11SQL1N No'S'Drive AIOMB54J
QTDENSQL1CS QTDENSQL1B HQTDENSQL1BN No'R'Drive HCLUQTDENSQL1SQL2N AIDNB866
QTDENSQL2CS QTDENSQL2A HQTDENSQL2AN HCLUQTDENSQL2SQL1N No'S'Drive AIDNB866
QTDENSQL2CS QTDENSQL2B HQTDENSQL2BN No'R'Drive HCLUQTDENSQL2SQL2N AIDNB866
QTDENSQL3CS QTDENSQL3A ACLUQTDENSQL3AN ACLUQTDENSQL3SQL1N No'S'Drive AIDNB864
Identified few Cluster
nodes with TB’s of
duplicated Data :Getting
Rid of this data will save
around 500 Tapes
Approximately.
SCHEDULE_NAME COMMAND ACTIVE IssuesinthisTSMSERVER
BACKUP_DEVCONF badevconf YES
BACKUP_STG_ARCHIVE_DISKbastgarchive_diskoffsite_tapeYES
BACKUP_STG_ARCHIVE_TAPEbastgarchive_tapeoffsite_tapeYES
BACKUP_STG_DBA_DISK bastgdba_diskdba_offsiteYES
BACKUP_STG_DBA_FILE bastgdba_filedba_offsiteYES
BACKUP_STG_DBA_TAPE bastgdba_tape_t10kdba_offsiteYES
BACKUP_STG_GOLD_DISK bastggold_diskgold_offsiteYES
BACKUP_STG_GOLD_FILE bastggold_filegold_offsiteYES
BACKUP_STG_GOLD_TAPE bastggold_tape_t10kgold_offsiteYES
AIDNB866-2
Utilization 0%
Utilization 0%
NoBackupStgforDBA_TAPE
NoBackupStgforDBA_VTL
NoBackupStgfor
DBA_XIV_VTL
New – Findings : Configuration Issues resulting in to TSM SERVERTSM SERVER
Storage pools having no offsite copy.Storage pools having no offsite copy.
Approximately 40% of the
Data doesn’t have an
offsite Copy.
 What all new tools/technologies can be implemented to resolve few issues?
 Tivoli Monitoring to be in place.
 Use Journal Based Backup wherever required.
 Implementing LANFREE for the servers which has 500GB and above data.
 What all changes needed in DESIGN to have better environment.
 TSM SERVER Configuration needs to be thoroughly checked.
 TEST/DEV to be separated with different Storagepools.
 Password Expiration.
 Help Needed
 Approval for removing the expired & Duplicate data .
 Once Approved, the plan of action will be submitted after working with the US Counter part.
 S3
– Save Storage Space [YES CUBE]
 Team Responsibilities
 Weekend Tasks
 Technical KT
 Finding the nodes which has more backup data. [List Collected]
 Through check on that host’s each object and checking with customer or AIP whether
these files/objects really need for backup.
 Ex:
 AIOMB08R
◦ NSUOMT102S 326.3 GB
◦ NSUOMT800S 2.8 TB
◦ NECOMT106L 379.6 GB
 AIOMB09R
◦ BHPOMP267H 372.5 GB
 AIOMB54J
◦ DSUOMP34QS 187.5 GB
◦ DSUOMP34QS 206.2 GB
 Taking individual responsibility on the remedy tickets and resolve them with in the SLA.
Ex:

 More attention on monitoring the TSM Storage Pool migrations to reduce the production
database backup failures count.
◦ Indentified File systems which has no backup from long time and working with Server
Owner’s/AIP’s to get confirmation to remove the data from TSM Storage pool which has
occupied TB’S of space.[Report will be generated by Friday and Weekend Shift Team is
working on this ]
◦ Irrespective of daily missed/failed backups; we have initiated digging in to servers which has
no backup in last 1-5 Days. [This is to ensure that we should not have any of missed/failed
backups repeated on Monday.]
 We have started Knowledge Share within the team members on the topics
raised by team
◦ Duration : 1 Hour
◦ Frequency : 2/Week
◦ Topics : Everything related to BACKUP/SAN/AIX
 Topics Covered
◦ LANFREE BACKUPS
◦ NAS BACKUPS
PROJECT GREEN

More Related Content

What's hot

Web TCard - Speed optimization
Web TCard - Speed optimizationWeb TCard - Speed optimization
Web TCard - Speed optimization
Eric Guo
 
Accumulo Summit 2015: Reactive programming in Accumulo: The Observable WAL [I...
Accumulo Summit 2015: Reactive programming in Accumulo: The Observable WAL [I...Accumulo Summit 2015: Reactive programming in Accumulo: The Observable WAL [I...
Accumulo Summit 2015: Reactive programming in Accumulo: The Observable WAL [I...
Accumulo Summit
 
A Deeper Dive into EXPLAIN
A Deeper Dive into EXPLAINA Deeper Dive into EXPLAIN
A Deeper Dive into EXPLAIN
EDB
 
Accumulo Summit 2015: Using Fluo to incrementally process data in Accumulo [API]
Accumulo Summit 2015: Using Fluo to incrementally process data in Accumulo [API]Accumulo Summit 2015: Using Fluo to incrementally process data in Accumulo [API]
Accumulo Summit 2015: Using Fluo to incrementally process data in Accumulo [API]
Accumulo Summit
 
Scylla’s Journey Towards Being an Elastic Cloud Native Database
Scylla’s Journey Towards Being an Elastic Cloud Native DatabaseScylla’s Journey Towards Being an Elastic Cloud Native Database
Scylla’s Journey Towards Being an Elastic Cloud Native Database
ScyllaDB
 
Real World Tales of Repair (Alexander Dejanovski, The Last Pickle) | Cassandr...
Real World Tales of Repair (Alexander Dejanovski, The Last Pickle) | Cassandr...Real World Tales of Repair (Alexander Dejanovski, The Last Pickle) | Cassandr...
Real World Tales of Repair (Alexander Dejanovski, The Last Pickle) | Cassandr...
DataStax
 
CI, CD, CT, Deploy, IaaS, DevOps, Stage
CI, CD, CT, Deploy, IaaS, DevOps, StageCI, CD, CT, Deploy, IaaS, DevOps, Stage
CI, CD, CT, Deploy, IaaS, DevOps, Stage
Artur Basak
 
AppOS: PostgreSQL Extension for Scalable File I/O @ PGConf.Asia 2019
AppOS: PostgreSQL Extension for Scalable File I/O @ PGConf.Asia 2019AppOS: PostgreSQL Extension for Scalable File I/O @ PGConf.Asia 2019
AppOS: PostgreSQL Extension for Scalable File I/O @ PGConf.Asia 2019
Sangwook Kim
 
Unlimited Virtual Computing Capacity using the Cloud for Automated Parameter ...
Unlimited Virtual Computing Capacity using the Cloud for Automated Parameter ...Unlimited Virtual Computing Capacity using the Cloud for Automated Parameter ...
Unlimited Virtual Computing Capacity using the Cloud for Automated Parameter ...
Joseph Luchette
 
(SDD409) Amazon RDS for PostgreSQL Deep Dive | AWS re:Invent 2014
(SDD409) Amazon RDS for PostgreSQL Deep Dive | AWS re:Invent 2014(SDD409) Amazon RDS for PostgreSQL Deep Dive | AWS re:Invent 2014
(SDD409) Amazon RDS for PostgreSQL Deep Dive | AWS re:Invent 2014
Amazon Web Services
 
IBM Spectrum Control creating a heat map with Cognos
IBM Spectrum Control creating a heat map with CognosIBM Spectrum Control creating a heat map with Cognos
IBM Spectrum Control creating a heat map with Cognos
Johan van Arendonk
 
Using Time Window Compaction Strategy For Time Series Workloads
Using Time Window Compaction Strategy For Time Series WorkloadsUsing Time Window Compaction Strategy For Time Series Workloads
Using Time Window Compaction Strategy For Time Series Workloads
Jeff Jirsa
 
Odoo Performance Limits
Odoo Performance LimitsOdoo Performance Limits
Odoo Performance Limits
Odoo
 
How We Made Scylla Maintenance Easier, Safer and Faster
How We Made Scylla Maintenance Easier, Safer and FasterHow We Made Scylla Maintenance Easier, Safer and Faster
How We Made Scylla Maintenance Easier, Safer and Faster
ScyllaDB
 
Lab: JVM Production Debugging 101
Lab: JVM Production Debugging 101Lab: JVM Production Debugging 101
Lab: JVM Production Debugging 101
Tomer Gabel
 
Aerospike & GCE (LSPE Talk)
Aerospike & GCE (LSPE Talk)Aerospike & GCE (LSPE Talk)
Aerospike & GCE (LSPE Talk)
Sayyaparaju Sunil
 
V781 throttling
V781 throttlingV781 throttling
V781 throttling
Johan van Arendonk
 
AgileMidwest2018-Becker-DatabasesAndCattle
AgileMidwest2018-Becker-DatabasesAndCattleAgileMidwest2018-Becker-DatabasesAndCattle
AgileMidwest2018-Becker-DatabasesAndCattle
Jason Tice
 
Advanced kapacitor
Advanced kapacitorAdvanced kapacitor
Advanced kapacitor
InfluxData
 
Looking towards an official cassandra sidecar netflix
Looking towards an official cassandra sidecar   netflixLooking towards an official cassandra sidecar   netflix
Looking towards an official cassandra sidecar netflix
Vinay Kumar Chella
 

What's hot (20)

Web TCard - Speed optimization
Web TCard - Speed optimizationWeb TCard - Speed optimization
Web TCard - Speed optimization
 
Accumulo Summit 2015: Reactive programming in Accumulo: The Observable WAL [I...
Accumulo Summit 2015: Reactive programming in Accumulo: The Observable WAL [I...Accumulo Summit 2015: Reactive programming in Accumulo: The Observable WAL [I...
Accumulo Summit 2015: Reactive programming in Accumulo: The Observable WAL [I...
 
A Deeper Dive into EXPLAIN
A Deeper Dive into EXPLAINA Deeper Dive into EXPLAIN
A Deeper Dive into EXPLAIN
 
Accumulo Summit 2015: Using Fluo to incrementally process data in Accumulo [API]
Accumulo Summit 2015: Using Fluo to incrementally process data in Accumulo [API]Accumulo Summit 2015: Using Fluo to incrementally process data in Accumulo [API]
Accumulo Summit 2015: Using Fluo to incrementally process data in Accumulo [API]
 
Scylla’s Journey Towards Being an Elastic Cloud Native Database
Scylla’s Journey Towards Being an Elastic Cloud Native DatabaseScylla’s Journey Towards Being an Elastic Cloud Native Database
Scylla’s Journey Towards Being an Elastic Cloud Native Database
 
Real World Tales of Repair (Alexander Dejanovski, The Last Pickle) | Cassandr...
Real World Tales of Repair (Alexander Dejanovski, The Last Pickle) | Cassandr...Real World Tales of Repair (Alexander Dejanovski, The Last Pickle) | Cassandr...
Real World Tales of Repair (Alexander Dejanovski, The Last Pickle) | Cassandr...
 
CI, CD, CT, Deploy, IaaS, DevOps, Stage
CI, CD, CT, Deploy, IaaS, DevOps, StageCI, CD, CT, Deploy, IaaS, DevOps, Stage
CI, CD, CT, Deploy, IaaS, DevOps, Stage
 
AppOS: PostgreSQL Extension for Scalable File I/O @ PGConf.Asia 2019
AppOS: PostgreSQL Extension for Scalable File I/O @ PGConf.Asia 2019AppOS: PostgreSQL Extension for Scalable File I/O @ PGConf.Asia 2019
AppOS: PostgreSQL Extension for Scalable File I/O @ PGConf.Asia 2019
 
Unlimited Virtual Computing Capacity using the Cloud for Automated Parameter ...
Unlimited Virtual Computing Capacity using the Cloud for Automated Parameter ...Unlimited Virtual Computing Capacity using the Cloud for Automated Parameter ...
Unlimited Virtual Computing Capacity using the Cloud for Automated Parameter ...
 
(SDD409) Amazon RDS for PostgreSQL Deep Dive | AWS re:Invent 2014
(SDD409) Amazon RDS for PostgreSQL Deep Dive | AWS re:Invent 2014(SDD409) Amazon RDS for PostgreSQL Deep Dive | AWS re:Invent 2014
(SDD409) Amazon RDS for PostgreSQL Deep Dive | AWS re:Invent 2014
 
IBM Spectrum Control creating a heat map with Cognos
IBM Spectrum Control creating a heat map with CognosIBM Spectrum Control creating a heat map with Cognos
IBM Spectrum Control creating a heat map with Cognos
 
Using Time Window Compaction Strategy For Time Series Workloads
Using Time Window Compaction Strategy For Time Series WorkloadsUsing Time Window Compaction Strategy For Time Series Workloads
Using Time Window Compaction Strategy For Time Series Workloads
 
Odoo Performance Limits
Odoo Performance LimitsOdoo Performance Limits
Odoo Performance Limits
 
How We Made Scylla Maintenance Easier, Safer and Faster
How We Made Scylla Maintenance Easier, Safer and FasterHow We Made Scylla Maintenance Easier, Safer and Faster
How We Made Scylla Maintenance Easier, Safer and Faster
 
Lab: JVM Production Debugging 101
Lab: JVM Production Debugging 101Lab: JVM Production Debugging 101
Lab: JVM Production Debugging 101
 
Aerospike & GCE (LSPE Talk)
Aerospike & GCE (LSPE Talk)Aerospike & GCE (LSPE Talk)
Aerospike & GCE (LSPE Talk)
 
V781 throttling
V781 throttlingV781 throttling
V781 throttling
 
AgileMidwest2018-Becker-DatabasesAndCattle
AgileMidwest2018-Becker-DatabasesAndCattleAgileMidwest2018-Becker-DatabasesAndCattle
AgileMidwest2018-Becker-DatabasesAndCattle
 
Advanced kapacitor
Advanced kapacitorAdvanced kapacitor
Advanced kapacitor
 
Looking towards an official cassandra sidecar netflix
Looking towards an official cassandra sidecar   netflixLooking towards an official cassandra sidecar   netflix
Looking towards an official cassandra sidecar netflix
 

Viewers also liked

GWRRA April Quebec Article GLB 032215
GWRRA April Quebec Article GLB 032215GWRRA April Quebec Article GLB 032215
GWRRA April Quebec Article GLB 032215Gary Burgess - PE
 
Escuelas deportivas 2016 17
Escuelas deportivas 2016 17Escuelas deportivas 2016 17
Escuelas deportivas 2016 17
Francisco Manuel González Pérez
 
Royce Gibson Resume, 11-14
Royce Gibson Resume, 11-14Royce Gibson Resume, 11-14
Royce Gibson Resume, 11-14Royce Gibson
 
Ingeniería civil
Ingeniería civil Ingeniería civil
Ingeniería civil
paulybc26
 
Taller partículas atómicas y propiedades periódicas001 (2)
Taller partículas atómicas y propiedades periódicas001 (2)Taller partículas atómicas y propiedades periódicas001 (2)
Taller partículas atómicas y propiedades periódicas001 (2)
karenina25
 
diapositivas
diapositivasdiapositivas
diapositivas
paulybc26
 
Diapositivas tics
Diapositivas ticsDiapositivas tics
Diapositivas ticspaulybc26
 
Clasificación 1ª y 2ª SEMANA 1
Clasificación 1ª y 2ª SEMANA 1Clasificación 1ª y 2ª SEMANA 1
Clasificación 1ª y 2ª SEMANA 1
Francisco Manuel González Pérez
 
Mini Infographic- Cost-Insitu vs extractive
Mini Infographic- Cost-Insitu vs extractive Mini Infographic- Cost-Insitu vs extractive
Mini Infographic- Cost-Insitu vs extractive
jennie421
 
tecnología
tecnologíatecnología
tecnología
paulybc26
 
automotive emission and control
automotive emission and controlautomotive emission and control
automotive emission and controlLokendra singh
 

Viewers also liked (17)

GWRRA April Quebec Article GLB 032215
GWRRA April Quebec Article GLB 032215GWRRA April Quebec Article GLB 032215
GWRRA April Quebec Article GLB 032215
 
Escuelas deportivas 2016 17
Escuelas deportivas 2016 17Escuelas deportivas 2016 17
Escuelas deportivas 2016 17
 
Cartel semanal 2013 14
Cartel semanal 2013 14Cartel semanal 2013 14
Cartel semanal 2013 14
 
Royce Gibson Resume, 11-14
Royce Gibson Resume, 11-14Royce Gibson Resume, 11-14
Royce Gibson Resume, 11-14
 
Ingeniería civil
Ingeniería civil Ingeniería civil
Ingeniería civil
 
Karnak
KarnakKarnak
Karnak
 
Burgos
BurgosBurgos
Burgos
 
Ppt
PptPpt
Ppt
 
Taller partículas atómicas y propiedades periódicas001 (2)
Taller partículas atómicas y propiedades periódicas001 (2)Taller partículas atómicas y propiedades periódicas001 (2)
Taller partículas atómicas y propiedades periódicas001 (2)
 
diapositivas
diapositivasdiapositivas
diapositivas
 
Polis
PolisPolis
Polis
 
Diapositivas tics
Diapositivas ticsDiapositivas tics
Diapositivas tics
 
Clasificación 1ª y 2ª SEMANA 1
Clasificación 1ª y 2ª SEMANA 1Clasificación 1ª y 2ª SEMANA 1
Clasificación 1ª y 2ª SEMANA 1
 
Mini Infographic- Cost-Insitu vs extractive
Mini Infographic- Cost-Insitu vs extractive Mini Infographic- Cost-Insitu vs extractive
Mini Infographic- Cost-Insitu vs extractive
 
tecnología
tecnologíatecnología
tecnología
 
automotive emission and control
automotive emission and controlautomotive emission and control
automotive emission and control
 
resume
resumeresume
resume
 

Similar to PROJECT GREEN

Ensuring Kubernetes Cost Efficiency across (many) Clusters - DevOps Gathering...
Ensuring Kubernetes Cost Efficiency across (many) Clusters - DevOps Gathering...Ensuring Kubernetes Cost Efficiency across (many) Clusters - DevOps Gathering...
Ensuring Kubernetes Cost Efficiency across (many) Clusters - DevOps Gathering...
Henning Jacobs
 
Datadog: a Real-Time Metrics Database for One Quadrillion Points/Day
Datadog: a Real-Time Metrics Database for One Quadrillion Points/DayDatadog: a Real-Time Metrics Database for One Quadrillion Points/Day
Datadog: a Real-Time Metrics Database for One Quadrillion Points/Day
C4Media
 
Scylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
Scylla Summit 2019 Keynote - Dor Laor - Beyond CassandraScylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
Scylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
ScyllaDB
 
NYC Java Meetup - Profiling and Performance
NYC Java Meetup - Profiling and PerformanceNYC Java Meetup - Profiling and Performance
NYC Java Meetup - Profiling and Performance
Jason Shao
 
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latency
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and LatencyOptimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latency
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latency
Henning Jacobs
 
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Henning Jacobs
 
Webinar slides: An Introduction to Performance Monitoring for PostgreSQL
Webinar slides: An Introduction to Performance Monitoring for PostgreSQLWebinar slides: An Introduction to Performance Monitoring for PostgreSQL
Webinar slides: An Introduction to Performance Monitoring for PostgreSQL
Severalnines
 
Java one2013 con4540-keenan
Java one2013 con4540-keenanJava one2013 con4540-keenan
Java one2013 con4540-keenan
ddkeenan
 
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...
VMware Tanzu
 
Instaclustr introduction to managing cassandra
Instaclustr introduction to managing cassandraInstaclustr introduction to managing cassandra
Instaclustr introduction to managing cassandra
Instaclustr
 
Lessons Learned From Running 1800 Clusters (Brooke Jensen, Instaclustr) | Cas...
Lessons Learned From Running 1800 Clusters (Brooke Jensen, Instaclustr) | Cas...Lessons Learned From Running 1800 Clusters (Brooke Jensen, Instaclustr) | Cas...
Lessons Learned From Running 1800 Clusters (Brooke Jensen, Instaclustr) | Cas...
DataStax
 
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Henning Jacobs
 
(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features
(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features
(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features
Amazon Web Services
 
Deep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instancesDeep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instances
Amazon Web Services
 
Truly non-intrusive OpenStack Cinder backup for mission critical systems
Truly non-intrusive OpenStack Cinder backup for mission critical systemsTruly non-intrusive OpenStack Cinder backup for mission critical systems
Truly non-intrusive OpenStack Cinder backup for mission critical systems
Dipak Kumar Singh
 
Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive


Cloudera, Inc.
 
Using AWR for IO Subsystem Analysis
Using AWR for IO Subsystem AnalysisUsing AWR for IO Subsystem Analysis
Using AWR for IO Subsystem Analysis
Texas Memory Systems, and IBM Company
 
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
Amazon Web Services
 
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
Amazon Web Services
 
Cloud-based Virtualization for Test Automation
Cloud-based Virtualization for Test AutomationCloud-based Virtualization for Test Automation
Cloud-based Virtualization for Test Automation
Vikram G Hosakote
 

Similar to PROJECT GREEN (20)

Ensuring Kubernetes Cost Efficiency across (many) Clusters - DevOps Gathering...
Ensuring Kubernetes Cost Efficiency across (many) Clusters - DevOps Gathering...Ensuring Kubernetes Cost Efficiency across (many) Clusters - DevOps Gathering...
Ensuring Kubernetes Cost Efficiency across (many) Clusters - DevOps Gathering...
 
Datadog: a Real-Time Metrics Database for One Quadrillion Points/Day
Datadog: a Real-Time Metrics Database for One Quadrillion Points/DayDatadog: a Real-Time Metrics Database for One Quadrillion Points/Day
Datadog: a Real-Time Metrics Database for One Quadrillion Points/Day
 
Scylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
Scylla Summit 2019 Keynote - Dor Laor - Beyond CassandraScylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
Scylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
 
NYC Java Meetup - Profiling and Performance
NYC Java Meetup - Profiling and PerformanceNYC Java Meetup - Profiling and Performance
NYC Java Meetup - Profiling and Performance
 
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latency
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and LatencyOptimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latency
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latency
 
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
 
Webinar slides: An Introduction to Performance Monitoring for PostgreSQL
Webinar slides: An Introduction to Performance Monitoring for PostgreSQLWebinar slides: An Introduction to Performance Monitoring for PostgreSQL
Webinar slides: An Introduction to Performance Monitoring for PostgreSQL
 
Java one2013 con4540-keenan
Java one2013 con4540-keenanJava one2013 con4540-keenan
Java one2013 con4540-keenan
 
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...
 
Instaclustr introduction to managing cassandra
Instaclustr introduction to managing cassandraInstaclustr introduction to managing cassandra
Instaclustr introduction to managing cassandra
 
Lessons Learned From Running 1800 Clusters (Brooke Jensen, Instaclustr) | Cas...
Lessons Learned From Running 1800 Clusters (Brooke Jensen, Instaclustr) | Cas...Lessons Learned From Running 1800 Clusters (Brooke Jensen, Instaclustr) | Cas...
Lessons Learned From Running 1800 Clusters (Brooke Jensen, Instaclustr) | Cas...
 
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
 
(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features
(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features
(DAT402) Amazon RDS PostgreSQL:Lessons Learned & New Features
 
Deep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instancesDeep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instances
 
Truly non-intrusive OpenStack Cinder backup for mission critical systems
Truly non-intrusive OpenStack Cinder backup for mission critical systemsTruly non-intrusive OpenStack Cinder backup for mission critical systems
Truly non-intrusive OpenStack Cinder backup for mission critical systems
 
Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive


 
Using AWR for IO Subsystem Analysis
Using AWR for IO Subsystem AnalysisUsing AWR for IO Subsystem Analysis
Using AWR for IO Subsystem Analysis
 
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
 
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
SRV402 Deep Dive on Amazon EC2 Instances, Featuring Performance Optimization ...
 
Cloud-based Virtualization for Test Automation
Cloud-based Virtualization for Test AutomationCloud-based Virtualization for Test Automation
Cloud-based Virtualization for Test Automation
 

PROJECT GREEN

  • 2. Objective : This project is a TECHNICAL INSIGHT of the Improvement Opportunities in the BackUp and Restore Area for Century Link .  Accomplishments.  New Findings –  Help Needed.
  • 3. Initiatives 1) Developed Health Check Script & implemented health checks of TSM SERVERS as a process every six hours. 2) TSM STORAGEPOOL Script implemented 3) Implemented Journal Based Backup to resolve the issue on server "QTCMHPNTIAP01". Benefits 1) Pro-active monitoring of TSM Servers Every Six Hours – Has Reduced the Downtime Risk of TSM Servers and improved the Back up Success Rate from 91% to 97% . 2) This script has helped in Evaluating the storage Pool space to accommodate fresh back ups from oracle with out having them to Fail – This has resulted in Improving the Oracle Backup Success Rate. 3) This technic can be implemented on servers which carry more numbers files. This helps in reducing the Backup Window from For Eg 2 Million Files which take 5 working days can be backed up in approximately 2 hours.
  • 4. New Findings - Streamline of TSM TAPE StorageNew Findings - Streamline of TSM TAPE Storage UtilizationUtilization ◦ . NODENAME 110330ELXDNP49EL 110330ELXDNP50EL 110330ELXDNT47EL 110330ELXDNT48EL 110413ELXDENVMTC057L 110413EQTDENVMDT143L 110417EQTDENVD188N 110417EQTDENVD189N 110509ERASCALSS  We have around 573 Expired Nodes to be removed from TSM SERVERS  Removing expired nodes data from TSM SERVERS will release 1000 Tapes approximately .  Can benefit the account from XX $$ Value. Tape Data associated to Decommissioned servers is not managed (Deleted)– Resulting into More Tapes Consumption. Note : Not aware of any Hidden Associated Risk to delete the nodes data
  • 5. New – Findings Scope (Intel) : Configuration Issues resulting in to Data Duplication NTOMT78CS NTOMT78A NCLUNTOMT78AN NCLUNTOMT78SQL1N No'S'Drive AIOMB52J QTOMASQLDB4CS QTOMASQLDB4A BCLUQTOMASQLDB4AN BCLUQTOMASQLDB4SQL1N No'S'Drive AIOMB53J QTOMASQLDB4CS QTOMASQLDB4B BCLUQTOMASQLDB4BN No'R'Drive BCLUQTOMASQLDB4SQL2N AIOMB53J qtomasql16acs QTOMASQL16A DCLUQTOMASQL16AN DCLUQTOMASQL16SQL1N DCLUQTOMASQL16SQL2N AIOMB54J QTOMASQL14CS QTOMASQL14A MCLUQTOMASQL14AN MCLUQTOMASQL14SQL1N No'S'Drive AIOMB55J QTOMASQL11CS QTOMASQL11A DCLUQTOMASQL11AN DCLUQTOMASQL11SQL1N No'S'Drive AIOMB54J QTDENSQL1CS QTDENSQL1B HQTDENSQL1BN No'R'Drive HCLUQTDENSQL1SQL2N AIDNB866 QTDENSQL2CS QTDENSQL2A HQTDENSQL2AN HCLUQTDENSQL2SQL1N No'S'Drive AIDNB866 QTDENSQL2CS QTDENSQL2B HQTDENSQL2BN No'R'Drive HCLUQTDENSQL2SQL2N AIDNB866 QTDENSQL3CS QTDENSQL3A ACLUQTDENSQL3AN ACLUQTDENSQL3SQL1N No'S'Drive AIDNB864 Identified few Cluster nodes with TB’s of duplicated Data :Getting Rid of this data will save around 500 Tapes Approximately.
  • 6. SCHEDULE_NAME COMMAND ACTIVE IssuesinthisTSMSERVER BACKUP_DEVCONF badevconf YES BACKUP_STG_ARCHIVE_DISKbastgarchive_diskoffsite_tapeYES BACKUP_STG_ARCHIVE_TAPEbastgarchive_tapeoffsite_tapeYES BACKUP_STG_DBA_DISK bastgdba_diskdba_offsiteYES BACKUP_STG_DBA_FILE bastgdba_filedba_offsiteYES BACKUP_STG_DBA_TAPE bastgdba_tape_t10kdba_offsiteYES BACKUP_STG_GOLD_DISK bastggold_diskgold_offsiteYES BACKUP_STG_GOLD_FILE bastggold_filegold_offsiteYES BACKUP_STG_GOLD_TAPE bastggold_tape_t10kgold_offsiteYES AIDNB866-2 Utilization 0% Utilization 0% NoBackupStgforDBA_TAPE NoBackupStgforDBA_VTL NoBackupStgfor DBA_XIV_VTL New – Findings : Configuration Issues resulting in to TSM SERVERTSM SERVER Storage pools having no offsite copy.Storage pools having no offsite copy. Approximately 40% of the Data doesn’t have an offsite Copy.
  • 7.  What all new tools/technologies can be implemented to resolve few issues?  Tivoli Monitoring to be in place.  Use Journal Based Backup wherever required.  Implementing LANFREE for the servers which has 500GB and above data.  What all changes needed in DESIGN to have better environment.  TSM SERVER Configuration needs to be thoroughly checked.  TEST/DEV to be separated with different Storagepools.  Password Expiration.  Help Needed  Approval for removing the expired & Duplicate data .  Once Approved, the plan of action will be submitted after working with the US Counter part.
  • 8.  S3 – Save Storage Space [YES CUBE]  Team Responsibilities  Weekend Tasks  Technical KT
  • 9.  Finding the nodes which has more backup data. [List Collected]  Through check on that host’s each object and checking with customer or AIP whether these files/objects really need for backup.  Ex:
  • 10.  AIOMB08R ◦ NSUOMT102S 326.3 GB ◦ NSUOMT800S 2.8 TB ◦ NECOMT106L 379.6 GB  AIOMB09R ◦ BHPOMP267H 372.5 GB  AIOMB54J ◦ DSUOMP34QS 187.5 GB ◦ DSUOMP34QS 206.2 GB
  • 11.  Taking individual responsibility on the remedy tickets and resolve them with in the SLA. Ex:   More attention on monitoring the TSM Storage Pool migrations to reduce the production database backup failures count.
  • 12. ◦ Indentified File systems which has no backup from long time and working with Server Owner’s/AIP’s to get confirmation to remove the data from TSM Storage pool which has occupied TB’S of space.[Report will be generated by Friday and Weekend Shift Team is working on this ] ◦ Irrespective of daily missed/failed backups; we have initiated digging in to servers which has no backup in last 1-5 Days. [This is to ensure that we should not have any of missed/failed backups repeated on Monday.]
  • 13.  We have started Knowledge Share within the team members on the topics raised by team ◦ Duration : 1 Hour ◦ Frequency : 2/Week ◦ Topics : Everything related to BACKUP/SAN/AIX  Topics Covered ◦ LANFREE BACKUPS ◦ NAS BACKUPS