SlideShare a Scribd company logo
1 of 27
Boris A. Velikovich
December 7, 2013


Lead Developer/Architect
Email: boris.velikovich@exostar.com
 LinkedIn: www.linkedin.com/in/bvelikovich/
 Blog: http://kiwiboris.blogspot.com
 Twitter: @BVelikovich




Working for Exostar since 2007


Involved in A&D and pharmaceutical projects








Leader in secure cloud-based solutions that improve
collaboration, information sharing, and supply chain management for
over 100,000 companies worldwide, including some of the largest players
in aerospace and defense, life sciences, pharmaceuticals, and financial
services.
Exostar’s solutions are accessed with our award winning Identity &
Access Management services that enable speedy adoption of your trusted
external partners.
Exostar’s ForumPass is a cloud-based, enterprise-class, collaboration
platform that addresses the needs of users sharing sensitive data with
external partners. The Cloud/SaaS “ready infrastructure” delivery
provides real time scalability for 20-30 people collaborative teams or
larger teams with 1,000s of users. The service is delivered on a per seat
month basis which helps our customers avoid costly startup costs for a
robust, secure service.
ForumPass is built on the SharePoint platform, currently utilizing the
SharePoint 2010 Enterprise version.




Per Microsoft, there are about 9,000 mediumsize businesses that have SharePoint data
storages ranging from 2 TB to 100 TB
A lot of these storages eventually need to be
migrated






To a new version of SharePoint
To a new location
Etc.

By the time of migration, some of the site
collections can grow quite large


One of the ForumPass site collections is 8 TB
This is twice as large as the recommended maximum for
any content database, and 81.92 times larger than the
recommended maximum for a site collection
 More than 30,000 users
 Content is added/modified very frequently




The farm is migrated to a new version of
SharePoint
 The maintenance window is only 4 days (96 hours)
 At least 99% of data must be preserved during the migration
 The big question: How to migrate the 8TB site collection?


Not feasible to migrate without splitting the
site collection





Copying would take much more time than the
maintenance window given
Even if somehow it would be feasible to finish the
migration within the maintenance window, it would
probably be wiser to find a long-term solution that
would conform to the Microsoft best practices

None of the out-of-the-box methods offered by
Microsoft is able to split a site collection


We chose Metalogix Content Matrix as our
migration software




Allows read-only direct connection to the source database
- important for performance reasons

Metalogix allows scripting migration activities
Provides PowerShell cmdlets
 Allows running several migration activities
simultaneously, thus speeding up the process
 Allows full and incremental copies


 Important because incremental copies take less time than full

copies – thus a final incremental copy is more likely than a
full copy to fit into the maintenance window



Each script can take parameters
• SharePoint farm installation
• Web application configuration
• Service application configuration
The new
• Firewalls configured
environment has to
be fully functional • Etc.

Code has to be
migrated

PowerShell has to be
prepared

• Feature IDs need to be preserved
• If migrating from MOSS 2007, code has to be compatible with SharePoint 2010
• In particular, code that refers to user profiles or search
• All the solutions need to be deployed

• Use Content Matrix PowerShell Console
• Make sure your powershell.exe.config file contains the settings necessary to initialize features
Each first-level subsite is promoted to a site
collection
Some but not all second-level subsites are promoted
to site collections

No other subsites are promoted to site collections
(for complexity reasons)
The content of the top-level site of the site collection
(libraries, lists, images, etc.) is NOT migrated
For
each
firstlevel
subsite

• Create a new content database
• In this content database, create a new site
collection based on the standard template
• Then, two options:
• 1) copy the content of the subsite to the toplevel site of this new site collection
• Since some second-level subsites are
promoted to their own site collection, a site
filter is required
• or
• 2) copy the subsite to this new site collection
Copy-MLAllSharePointSiteContent or
Copy-MLSharePointSite
 The specific parameters depend on the choice of the
cmdlet, as well as your migration requirements
 E.g., you don’t want to migrate themes if you are

migrating from MOSS 2007 to SharePoint 2010

Make sure that the SiteFilterExpression is present if
you plan to promote certain subsites to their own
site collections
 Certain parameters might affect performance
 Sometimes it is worth to prototype the migration
operation in the GUI



Use Copy-MLAllSharePointSiteContent when





The URL of the new site collection has to stay exactly
the same as in the first-level subsite, or
You want the first-level subsite content on the root
level of the newly-created site collection, and the site
template of that subsite does not interfere with the
site template of the root subsite

In all other cases, use Copy-MLSharePointSite
1) Input CSV files
2) Exclusion CSV file

3) Script configuration


At the very least, it should include:


Server-relative source url
 E.g., /sites/mycompany/SomeCoolSite



Managed path
 E.g., /customers/ or /sites/mycompany



Site Name
 E.g., SomeCoolSite



Site Description
 E.g., Some Cool Site



Whether migration is full or incremental


At the very least, it should contain the sitecollection-relative URLs of excluded subsites
Should
contain:

• Input CSV file path
• Exclusion CSV file path
• Source information
• DB Server, content DB, root
URL, template path, etc.
• Target information
• DB Server, farm
administrator, root url
• Metalogix job history path











Some second-level subsites are promoted to site
collections
These site collections’ URLs are new
A separate script is needed
Script configuration similar to what we’ve seen
Input CSV should include the URL of the new site
collection, as well as the web template of the site
copied
The Copy-MLSharePointSite cmdlet is used in the
script
New site collections are created in new content
databases



Be careful with Team Sites
-MergeSiteFeatures parameter




If it is true and you migrate from MOSS 2007 to
SharePoint 2010, then the web parts from
default.aspx will move to SitePages/Home.aspx and
default.aspx will be empty - causes great confusion
for users
If it is false and you used the CopyMLAllSharePointContent cmdlet, you need to make
sure that all necessary site collection features are
activated


Full copy: Workflow associations are copied, workflow
instances are NOT




Incremental copy: Workflow associations are NOT
copied




Possible to copy Nintex or SharePoint Designer workflow
associations

Thus, the users should NOT create new workflow
associations after the full copy ran

LegacyWorkflows feature needs to be activated on
newly-created site collections




Make sure you add site collection admins to
the newly-created site collections
Involve users (CFT)



Their feedback will identify the problem areas
Run incremental migrations as needed
 We needed several incremental migrations, plus a final

one to be run during the maintenance window





Metalogix allows comparison reports to verify
completeness of the migration job
Also, Metalogix provides logs for each job
When your testers identify a migration
issue, the reports and logs will help you
troubleshoot


Sometimes, an additional incremental copy might be
needed


The hardest thing to troubleshoot





Migrating a 8 TB site collection may well take more than 1024
times than migrating a 8 GB site collection
Migration rate can go down with time

C:UsersSomeUserAppDataRoamingMetalogix
Content Matrix Console – SharePoint
EditionApplicationSettings.xml


PerActionResourceUse - Controls how many migration
activities are run in parallel

 Trade-off - Higher value means more parallelism but less

predictability
 Since parallelism is available where possible, the variance of
load within a job is less predictable.




SQLQueryTimeoutTime – You can also lose data if the timeout
time is too low

Disable verbose logging






Accomplished a 96-hour go-live event
Migrated more than three million documents
Achieved 99.6% quality metric with customer
concurrence
Achieved major improvements in Metalogix
Content Matrix migration tool, in direct
collaboration with Metalogix


Migrating a very large site collection:
Typically involves splits, which means that a thirdparty product such as Metalogix Content Matrix will
be needed
 Can be scripted, with scripts running in parallel
 Requires comparison reports to ensure completeness
 Presents performance challenges as the migration
rate tends to go down

Migrating Very Large Site Collections (SPSDC)

More Related Content

What's hot

Mime Magic With Apache Tika
Mime Magic With Apache TikaMime Magic With Apache Tika
Mime Magic With Apache TikaJukka Zitting
 
(ATS6-PLAT04) Query service
(ATS6-PLAT04) Query service (ATS6-PLAT04) Query service
(ATS6-PLAT04) Query service BIOVIA
 
2012.10 Liferay Europe Symposium, Alistair Oldfield
2012.10 Liferay Europe Symposium, Alistair Oldfield2012.10 Liferay Europe Symposium, Alistair Oldfield
2012.10 Liferay Europe Symposium, Alistair OldfieldEmeldi Group
 
Sitecore9 key features by jitendra soni - Presented in Sitecore User Group UK
Sitecore9 key features by jitendra soni - Presented in Sitecore User Group UKSitecore9 key features by jitendra soni - Presented in Sitecore User Group UK
Sitecore9 key features by jitendra soni - Presented in Sitecore User Group UKJitendra Soni
 
Alfresco Tech Talk Live (Episode 70): Customizing Alfresco Share 4.2
Alfresco Tech Talk Live (Episode 70): Customizing Alfresco Share 4.2Alfresco Tech Talk Live (Episode 70): Customizing Alfresco Share 4.2
Alfresco Tech Talk Live (Episode 70): Customizing Alfresco Share 4.2Richard Esplin
 
Web Server - Internet Applications
Web Server - Internet ApplicationsWeb Server - Internet Applications
Web Server - Internet Applicationssandra sukarieh
 
Module Owb Source Metadata
Module Owb Source MetadataModule Owb Source Metadata
Module Owb Source MetadataNicholas Goodman
 
Developing, Debugging and Administrating Your Integration Scenarios with WSO2...
Developing, Debugging and Administrating Your Integration Scenarios with WSO2...Developing, Debugging and Administrating Your Integration Scenarios with WSO2...
Developing, Debugging and Administrating Your Integration Scenarios with WSO2...WSO2
 
Troubleshooting and Best Practices with WSO2 Enterprise Integrator
Troubleshooting and Best Practices with WSO2 Enterprise IntegratorTroubleshooting and Best Practices with WSO2 Enterprise Integrator
Troubleshooting and Best Practices with WSO2 Enterprise IntegratorWSO2
 
Introduction to Kafka with Spring Integration
Introduction to Kafka with Spring IntegrationIntroduction to Kafka with Spring Integration
Introduction to Kafka with Spring IntegrationBorislav Markov
 
Kafka Connect by Datio
Kafka Connect by DatioKafka Connect by Datio
Kafka Connect by DatioDatio Big Data
 
Biztalk 2013 Launch Briefing
Biztalk 2013 Launch BriefingBiztalk 2013 Launch Briefing
Biztalk 2013 Launch BriefingNikolai Blackie
 
Kafka Connect: Real-time Data Integration at Scale with Apache Kafka, Ewen Ch...
Kafka Connect: Real-time Data Integration at Scale with Apache Kafka, Ewen Ch...Kafka Connect: Real-time Data Integration at Scale with Apache Kafka, Ewen Ch...
Kafka Connect: Real-time Data Integration at Scale with Apache Kafka, Ewen Ch...confluent
 
Introduction to the Client OM in SharePoint 2010
Introduction to the Client OM in SharePoint 2010Introduction to the Client OM in SharePoint 2010
Introduction to the Client OM in SharePoint 2010Ben Robb
 
Spring Web Service, Spring Integration and Spring Batch
Spring Web Service, Spring Integration and Spring BatchSpring Web Service, Spring Integration and Spring Batch
Spring Web Service, Spring Integration and Spring BatchEberhard Wolff
 
Enterprise Integration Patterns - Spring way
Enterprise Integration Patterns - Spring wayEnterprise Integration Patterns - Spring way
Enterprise Integration Patterns - Spring wayDragan Gajic
 
Syntergy upgrade open text content server with replicator - 7-3-2016
Syntergy   upgrade open text content server with replicator - 7-3-2016Syntergy   upgrade open text content server with replicator - 7-3-2016
Syntergy upgrade open text content server with replicator - 7-3-2016Vijay Sharma
 

What's hot (20)

Mime Magic With Apache Tika
Mime Magic With Apache TikaMime Magic With Apache Tika
Mime Magic With Apache Tika
 
(ATS6-PLAT04) Query service
(ATS6-PLAT04) Query service (ATS6-PLAT04) Query service
(ATS6-PLAT04) Query service
 
2012.10 Liferay Europe Symposium, Alistair Oldfield
2012.10 Liferay Europe Symposium, Alistair Oldfield2012.10 Liferay Europe Symposium, Alistair Oldfield
2012.10 Liferay Europe Symposium, Alistair Oldfield
 
Sitecore9 key features by jitendra soni - Presented in Sitecore User Group UK
Sitecore9 key features by jitendra soni - Presented in Sitecore User Group UKSitecore9 key features by jitendra soni - Presented in Sitecore User Group UK
Sitecore9 key features by jitendra soni - Presented in Sitecore User Group UK
 
Alfresco Tech Talk Live (Episode 70): Customizing Alfresco Share 4.2
Alfresco Tech Talk Live (Episode 70): Customizing Alfresco Share 4.2Alfresco Tech Talk Live (Episode 70): Customizing Alfresco Share 4.2
Alfresco Tech Talk Live (Episode 70): Customizing Alfresco Share 4.2
 
Web Server - Internet Applications
Web Server - Internet ApplicationsWeb Server - Internet Applications
Web Server - Internet Applications
 
2012.10 Oldfield
2012.10 Oldfield2012.10 Oldfield
2012.10 Oldfield
 
Module Owb Source Metadata
Module Owb Source MetadataModule Owb Source Metadata
Module Owb Source Metadata
 
Developing, Debugging and Administrating Your Integration Scenarios with WSO2...
Developing, Debugging and Administrating Your Integration Scenarios with WSO2...Developing, Debugging and Administrating Your Integration Scenarios with WSO2...
Developing, Debugging and Administrating Your Integration Scenarios with WSO2...
 
Troubleshooting and Best Practices with WSO2 Enterprise Integrator
Troubleshooting and Best Practices with WSO2 Enterprise IntegratorTroubleshooting and Best Practices with WSO2 Enterprise Integrator
Troubleshooting and Best Practices with WSO2 Enterprise Integrator
 
Introduction to Kafka with Spring Integration
Introduction to Kafka with Spring IntegrationIntroduction to Kafka with Spring Integration
Introduction to Kafka with Spring Integration
 
Web Servers (ppt)
Web Servers (ppt)Web Servers (ppt)
Web Servers (ppt)
 
Kafka Connect by Datio
Kafka Connect by DatioKafka Connect by Datio
Kafka Connect by Datio
 
Biztalk 2013 Launch Briefing
Biztalk 2013 Launch BriefingBiztalk 2013 Launch Briefing
Biztalk 2013 Launch Briefing
 
Module Owb Repositories
Module Owb RepositoriesModule Owb Repositories
Module Owb Repositories
 
Kafka Connect: Real-time Data Integration at Scale with Apache Kafka, Ewen Ch...
Kafka Connect: Real-time Data Integration at Scale with Apache Kafka, Ewen Ch...Kafka Connect: Real-time Data Integration at Scale with Apache Kafka, Ewen Ch...
Kafka Connect: Real-time Data Integration at Scale with Apache Kafka, Ewen Ch...
 
Introduction to the Client OM in SharePoint 2010
Introduction to the Client OM in SharePoint 2010Introduction to the Client OM in SharePoint 2010
Introduction to the Client OM in SharePoint 2010
 
Spring Web Service, Spring Integration and Spring Batch
Spring Web Service, Spring Integration and Spring BatchSpring Web Service, Spring Integration and Spring Batch
Spring Web Service, Spring Integration and Spring Batch
 
Enterprise Integration Patterns - Spring way
Enterprise Integration Patterns - Spring wayEnterprise Integration Patterns - Spring way
Enterprise Integration Patterns - Spring way
 
Syntergy upgrade open text content server with replicator - 7-3-2016
Syntergy   upgrade open text content server with replicator - 7-3-2016Syntergy   upgrade open text content server with replicator - 7-3-2016
Syntergy upgrade open text content server with replicator - 7-3-2016
 

Similar to Migrating Very Large Site Collections (SPSDC)

Migrating very large site collections
Migrating very large site collectionsMigrating very large site collections
Migrating very large site collectionskiwiboris
 
Adopting AnswerModules ModuleSuite
Adopting AnswerModules ModuleSuiteAdopting AnswerModules ModuleSuite
Adopting AnswerModules ModuleSuiteAnswerModules
 
What Makes Migrating to the Cloud Different Than On-Premises
What Makes Migrating to the Cloud Different Than On-PremisesWhat Makes Migrating to the Cloud Different Than On-Premises
What Makes Migrating to the Cloud Different Than On-PremisesChristian Buckley
 
SharePoint Saturday Toronto 2015 - Inside the mind of a SharePoint Architect
SharePoint Saturday Toronto 2015 - Inside the mind of a SharePoint ArchitectSharePoint Saturday Toronto 2015 - Inside the mind of a SharePoint Architect
SharePoint Saturday Toronto 2015 - Inside the mind of a SharePoint ArchitectNoorez Khamis
 
Whats new in Enterprise 5.0 Product Suite
Whats new in Enterprise 5.0 Product SuiteWhats new in Enterprise 5.0 Product Suite
Whats new in Enterprise 5.0 Product SuiteMicro Focus
 
Nuts and bolts of running a popular site in the aws cloud
Nuts and bolts of running a popular site in the aws cloudNuts and bolts of running a popular site in the aws cloud
Nuts and bolts of running a popular site in the aws cloudDavid Veksler
 
Planning Your Migration to SharePoint Online #SPBiz60
Planning Your Migration to SharePoint Online #SPBiz60Planning Your Migration to SharePoint Online #SPBiz60
Planning Your Migration to SharePoint Online #SPBiz60Christian Buckley
 
Datasheet foldermanagementpluginforrd
Datasheet foldermanagementpluginforrdDatasheet foldermanagementpluginforrd
Datasheet foldermanagementpluginforrdMidVision
 
WebRadar
WebRadarWebRadar
WebRadarTiny
 
How to – wrap soap web service around a database
How to – wrap soap web service around a databaseHow to – wrap soap web service around a database
How to – wrap soap web service around a databaseSon Nguyen
 
Building the Perfect SharePoint 2010 Farm
Building the Perfect SharePoint 2010 FarmBuilding the Perfect SharePoint 2010 Farm
Building the Perfect SharePoint 2010 FarmMichael Noel
 
Advanced web application architecture - Talk
Advanced web application architecture - TalkAdvanced web application architecture - Talk
Advanced web application architecture - TalkMatthias Noback
 
Datasheet weblogic midvisionextensionforibmraf
Datasheet weblogic midvisionextensionforibmrafDatasheet weblogic midvisionextensionforibmraf
Datasheet weblogic midvisionextensionforibmrafMidVision
 
Datasheet was pluginforrd
Datasheet was pluginforrdDatasheet was pluginforrd
Datasheet was pluginforrdMidVision
 
Datasheet weblogicpluginforrd
Datasheet weblogicpluginforrdDatasheet weblogicpluginforrd
Datasheet weblogicpluginforrdMidVision
 
2011.10 Liferay European Symposium. Alistair Oldfield
2011.10 Liferay European Symposium. Alistair Oldfield2011.10 Liferay European Symposium. Alistair Oldfield
2011.10 Liferay European Symposium. Alistair OldfieldEmeldi Group
 
Nuxeo Platform LTS 2015 Highlights
Nuxeo Platform LTS 2015 HighlightsNuxeo Platform LTS 2015 Highlights
Nuxeo Platform LTS 2015 HighlightsNuxeo
 
Datasheet apachepluginforrd
Datasheet apachepluginforrdDatasheet apachepluginforrd
Datasheet apachepluginforrdMidVision
 

Similar to Migrating Very Large Site Collections (SPSDC) (20)

Migrating very large site collections
Migrating very large site collectionsMigrating very large site collections
Migrating very large site collections
 
Adopting AnswerModules ModuleSuite
Adopting AnswerModules ModuleSuiteAdopting AnswerModules ModuleSuite
Adopting AnswerModules ModuleSuite
 
What Makes Migrating to the Cloud Different Than On-Premises
What Makes Migrating to the Cloud Different Than On-PremisesWhat Makes Migrating to the Cloud Different Than On-Premises
What Makes Migrating to the Cloud Different Than On-Premises
 
SharePoint Saturday Toronto 2015 - Inside the mind of a SharePoint Architect
SharePoint Saturday Toronto 2015 - Inside the mind of a SharePoint ArchitectSharePoint Saturday Toronto 2015 - Inside the mind of a SharePoint Architect
SharePoint Saturday Toronto 2015 - Inside the mind of a SharePoint Architect
 
Whats new in Enterprise 5.0 Product Suite
Whats new in Enterprise 5.0 Product SuiteWhats new in Enterprise 5.0 Product Suite
Whats new in Enterprise 5.0 Product Suite
 
Nuts and bolts of running a popular site in the aws cloud
Nuts and bolts of running a popular site in the aws cloudNuts and bolts of running a popular site in the aws cloud
Nuts and bolts of running a popular site in the aws cloud
 
Planning Your Migration to SharePoint Online #SPBiz60
Planning Your Migration to SharePoint Online #SPBiz60Planning Your Migration to SharePoint Online #SPBiz60
Planning Your Migration to SharePoint Online #SPBiz60
 
Datasheet foldermanagementpluginforrd
Datasheet foldermanagementpluginforrdDatasheet foldermanagementpluginforrd
Datasheet foldermanagementpluginforrd
 
Webcast Wcm
Webcast WcmWebcast Wcm
Webcast Wcm
 
WebRadar
WebRadarWebRadar
WebRadar
 
How to – wrap soap web service around a database
How to – wrap soap web service around a databaseHow to – wrap soap web service around a database
How to – wrap soap web service around a database
 
Building the Perfect SharePoint 2010 Farm
Building the Perfect SharePoint 2010 FarmBuilding the Perfect SharePoint 2010 Farm
Building the Perfect SharePoint 2010 Farm
 
Advanced web application architecture - Talk
Advanced web application architecture - TalkAdvanced web application architecture - Talk
Advanced web application architecture - Talk
 
Datasheet weblogic midvisionextensionforibmraf
Datasheet weblogic midvisionextensionforibmrafDatasheet weblogic midvisionextensionforibmraf
Datasheet weblogic midvisionextensionforibmraf
 
Datasheet was pluginforrd
Datasheet was pluginforrdDatasheet was pluginforrd
Datasheet was pluginforrd
 
Datasheet weblogicpluginforrd
Datasheet weblogicpluginforrdDatasheet weblogicpluginforrd
Datasheet weblogicpluginforrd
 
2011.10 Liferay European Symposium. Alistair Oldfield
2011.10 Liferay European Symposium. Alistair Oldfield2011.10 Liferay European Symposium. Alistair Oldfield
2011.10 Liferay European Symposium. Alistair Oldfield
 
Nuxeo Platform LTS 2015 Highlights
Nuxeo Platform LTS 2015 HighlightsNuxeo Platform LTS 2015 Highlights
Nuxeo Platform LTS 2015 Highlights
 
Datasheet apachepluginforrd
Datasheet apachepluginforrdDatasheet apachepluginforrd
Datasheet apachepluginforrd
 
New cms final
New cms finalNew cms final
New cms final
 

Recently uploaded

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

Migrating Very Large Site Collections (SPSDC)

  • 2.  Lead Developer/Architect Email: boris.velikovich@exostar.com  LinkedIn: www.linkedin.com/in/bvelikovich/  Blog: http://kiwiboris.blogspot.com  Twitter: @BVelikovich   Working for Exostar since 2007  Involved in A&D and pharmaceutical projects
  • 3.     Leader in secure cloud-based solutions that improve collaboration, information sharing, and supply chain management for over 100,000 companies worldwide, including some of the largest players in aerospace and defense, life sciences, pharmaceuticals, and financial services. Exostar’s solutions are accessed with our award winning Identity & Access Management services that enable speedy adoption of your trusted external partners. Exostar’s ForumPass is a cloud-based, enterprise-class, collaboration platform that addresses the needs of users sharing sensitive data with external partners. The Cloud/SaaS “ready infrastructure” delivery provides real time scalability for 20-30 people collaborative teams or larger teams with 1,000s of users. The service is delivered on a per seat month basis which helps our customers avoid costly startup costs for a robust, secure service. ForumPass is built on the SharePoint platform, currently utilizing the SharePoint 2010 Enterprise version.
  • 4.   Per Microsoft, there are about 9,000 mediumsize businesses that have SharePoint data storages ranging from 2 TB to 100 TB A lot of these storages eventually need to be migrated     To a new version of SharePoint To a new location Etc. By the time of migration, some of the site collections can grow quite large
  • 5.  One of the ForumPass site collections is 8 TB This is twice as large as the recommended maximum for any content database, and 81.92 times larger than the recommended maximum for a site collection  More than 30,000 users  Content is added/modified very frequently   The farm is migrated to a new version of SharePoint  The maintenance window is only 4 days (96 hours)  At least 99% of data must be preserved during the migration  The big question: How to migrate the 8TB site collection?
  • 6.  Not feasible to migrate without splitting the site collection    Copying would take much more time than the maintenance window given Even if somehow it would be feasible to finish the migration within the maintenance window, it would probably be wiser to find a long-term solution that would conform to the Microsoft best practices None of the out-of-the-box methods offered by Microsoft is able to split a site collection
  • 7.  We chose Metalogix Content Matrix as our migration software   Allows read-only direct connection to the source database - important for performance reasons Metalogix allows scripting migration activities Provides PowerShell cmdlets  Allows running several migration activities simultaneously, thus speeding up the process  Allows full and incremental copies   Important because incremental copies take less time than full copies – thus a final incremental copy is more likely than a full copy to fit into the maintenance window  Each script can take parameters
  • 8. • SharePoint farm installation • Web application configuration • Service application configuration The new • Firewalls configured environment has to be fully functional • Etc. Code has to be migrated PowerShell has to be prepared • Feature IDs need to be preserved • If migrating from MOSS 2007, code has to be compatible with SharePoint 2010 • In particular, code that refers to user profiles or search • All the solutions need to be deployed • Use Content Matrix PowerShell Console • Make sure your powershell.exe.config file contains the settings necessary to initialize features
  • 9. Each first-level subsite is promoted to a site collection Some but not all second-level subsites are promoted to site collections No other subsites are promoted to site collections (for complexity reasons) The content of the top-level site of the site collection (libraries, lists, images, etc.) is NOT migrated
  • 10. For each firstlevel subsite • Create a new content database • In this content database, create a new site collection based on the standard template • Then, two options: • 1) copy the content of the subsite to the toplevel site of this new site collection • Since some second-level subsites are promoted to their own site collection, a site filter is required • or • 2) copy the subsite to this new site collection
  • 11. Copy-MLAllSharePointSiteContent or Copy-MLSharePointSite  The specific parameters depend on the choice of the cmdlet, as well as your migration requirements  E.g., you don’t want to migrate themes if you are migrating from MOSS 2007 to SharePoint 2010 Make sure that the SiteFilterExpression is present if you plan to promote certain subsites to their own site collections  Certain parameters might affect performance  Sometimes it is worth to prototype the migration operation in the GUI 
  • 12.  Use Copy-MLAllSharePointSiteContent when    The URL of the new site collection has to stay exactly the same as in the first-level subsite, or You want the first-level subsite content on the root level of the newly-created site collection, and the site template of that subsite does not interfere with the site template of the root subsite In all other cases, use Copy-MLSharePointSite
  • 13. 1) Input CSV files 2) Exclusion CSV file 3) Script configuration
  • 14.  At the very least, it should include:  Server-relative source url  E.g., /sites/mycompany/SomeCoolSite  Managed path  E.g., /customers/ or /sites/mycompany  Site Name  E.g., SomeCoolSite  Site Description  E.g., Some Cool Site  Whether migration is full or incremental
  • 15.  At the very least, it should contain the sitecollection-relative URLs of excluded subsites
  • 16. Should contain: • Input CSV file path • Exclusion CSV file path • Source information • DB Server, content DB, root URL, template path, etc. • Target information • DB Server, farm administrator, root url • Metalogix job history path
  • 17.
  • 18.        Some second-level subsites are promoted to site collections These site collections’ URLs are new A separate script is needed Script configuration similar to what we’ve seen Input CSV should include the URL of the new site collection, as well as the web template of the site copied The Copy-MLSharePointSite cmdlet is used in the script New site collections are created in new content databases
  • 19.
  • 20.   Be careful with Team Sites -MergeSiteFeatures parameter   If it is true and you migrate from MOSS 2007 to SharePoint 2010, then the web parts from default.aspx will move to SitePages/Home.aspx and default.aspx will be empty - causes great confusion for users If it is false and you used the CopyMLAllSharePointContent cmdlet, you need to make sure that all necessary site collection features are activated
  • 21.  Full copy: Workflow associations are copied, workflow instances are NOT   Incremental copy: Workflow associations are NOT copied   Possible to copy Nintex or SharePoint Designer workflow associations Thus, the users should NOT create new workflow associations after the full copy ran LegacyWorkflows feature needs to be activated on newly-created site collections
  • 22.   Make sure you add site collection admins to the newly-created site collections Involve users (CFT)   Their feedback will identify the problem areas Run incremental migrations as needed  We needed several incremental migrations, plus a final one to be run during the maintenance window
  • 23.    Metalogix allows comparison reports to verify completeness of the migration job Also, Metalogix provides logs for each job When your testers identify a migration issue, the reports and logs will help you troubleshoot  Sometimes, an additional incremental copy might be needed
  • 24.  The hardest thing to troubleshoot    Migrating a 8 TB site collection may well take more than 1024 times than migrating a 8 GB site collection Migration rate can go down with time C:UsersSomeUserAppDataRoamingMetalogix Content Matrix Console – SharePoint EditionApplicationSettings.xml  PerActionResourceUse - Controls how many migration activities are run in parallel  Trade-off - Higher value means more parallelism but less predictability  Since parallelism is available where possible, the variance of load within a job is less predictable.   SQLQueryTimeoutTime – You can also lose data if the timeout time is too low Disable verbose logging
  • 25.     Accomplished a 96-hour go-live event Migrated more than three million documents Achieved 99.6% quality metric with customer concurrence Achieved major improvements in Metalogix Content Matrix migration tool, in direct collaboration with Metalogix
  • 26.  Migrating a very large site collection: Typically involves splits, which means that a thirdparty product such as Metalogix Content Matrix will be needed  Can be scripted, with scripts running in parallel  Requires comparison reports to ensure completeness  Presents performance challenges as the migration rate tends to go down 