SlideShare a Scribd company logo
1 of 22
Repository Scalability: Comparing
 Microsoft SharePoint 2010 and
        Oracle UCM 11g


    Raoul Miller and Brent Seaman,
       TEAM Informatics, inc.
Outline
• Our ingestion rate experiments
     • Hardware and Software setup
     • Experimental design
• Observations and Conclusions from these tests
• Implications for Repository Sizing and
  Organization in SharePoint and UCM
• Lessons Learned and Recommendations
• Q and A
Overall aims of this research
• Apply real-world scenarios to ingestion testing
     • Rather than ultra high performance / ultra high cost
• Determine actual ingestion rates for different
  scenarios on identical hardware
• Expose weaknesses / issues in large imports
• Derive recommendations for best practices in
  importing existing content into new CMS
  repositories
Experimental Approach
• Import existing files from file system into newly-installed
  CMS
   –   Standard configurations
   –   Commodity hardware
   –   No specialized tuning or optimizations
   –   Vendor recommended OS and databases


• Four scenarios
   –   20,000 files @ 40kB
   –   20,000 files @ 100kB
   –   1,000,000 files @ 40kB
   –   1,000,000 files @ 100kB
Are these Scenarios Realistic?
• >80% of single instance CMS repositories
  contain 50-200,000+ items
• Average “document” size in most
  industries is ~100kB.
• Most projects need to import existing
  content from file shares or other systems
Commodity Hardware
• Dell PowerEdge R710s server
• Dual Intel Xeon 5560 CPUs (@ quad core)
  running at 2.8Ghz
• 16GB RAM
• Eight 146GB 10K RPM SAS drives
UCM Installation
• Operating System:         RedHat Enterprise 5 (specifically
  theCentOS5 build)
• Database:                 Oracle 11g Standard Edition database
• Web / Application Server: Weblogic 11gR1 (10.3.3)
• Content ManagementUCM 11gR1 (11.1.1.4.0)
System:
• Java Runtime Environment:Sun Hotspot SDK (1.6.0_11) & JRockit
                            R28
• File storage:             File system (default) and JDBC
                            (SecureFiles)
SharePoint Installation
• Operating System:         Windows Server 2008 Std Edition for
                            Partners
•   Database:               Microsoft SQL Server 2008 R2
                            Enterprise
•   Web Server:             IIS7 (Standard with Windows Server
                            2008 - specifically v 7.5.76)
•   Content ManagementSharePoint Server 2010 Enterprise for
    System:                 Partners
•   File storage:           Database Storage in SQL server
Ingestion Approaches
• UCM
  – used BatchBuilder and BatchLoader
• SharePoint -
  – had to use third party tool (UploadZen by
    Roxority)
  – Need to organize content before import
  – Limited flexibility in directory size
Supported SharePoint 2010 bulk
         import strategies
• Multiple file upload applet
   – Silverlight; supports up to 100 docs, does not support
     subdirectories
• Windows Explorer view
   – Extension of webDAV
   – Limited performance
• SharePoint Workspace
   – Client integration
   – Only supports up to 500 documents
Differences between Import Strategies
• BatchLoader
  – Supported system tool
  – Allows automated file system crawl (BatchBuilder)
  – Storage / browse location in repository unrelated to source
    location
  – Supports high volume
• UploadZen
  –   Third-party application
  –   Requires organization and sizing of import directories
  –   Organization within repository reflects import location
  –   Major challenges with high volume imports
Considerations for Repository Sizing
1. Should be primarily driven by business / infosec needs
2. Practicality
   – Import / migrate
   – Search / organize
   – Backup / DR
3. Flexibility
   – Growth in content volume / size
   – Leverage HSM / partitioning
   – Provide options for storage strategies
Ingestion Rate Testing
• Major things to test:
  – Overall rate of ingestion with different sized
    files and different sized collections
  – Ease of use of import tools
  – Flexibility in organization of content during /
    after import
20,000 files – each 40kB
• First set of tests
• Single directory for SharePoint source

• UCM – File System storage – 198,000 docs/hr
• UCM – JDBC storage – 156,000 docs/hr
• SharePoint – 153,000 docs/hr
20,000 files – each 100kB
• UCM – File System storage – 171,000 docs/hr
• SharePoint – 138,000 docs/hr

• Ingestion rates fell 10-15% for larger file size
• SharePoint RAM usage higher, primarily in
  database
1,000,000 files – each 40kB
• Need to organize files in directories for SharePoint
    – 50 folders each with 20,000 items - failed
    – 2,000 folders each with 500 items – succeeded


•   UCM – FS storage & Sun JRE                205,000 docs/hr
•   UCM – FS storage & JRockit JRE            212,000 docs/hr
•   UCM – JDBC storage & Sun JRE              171,000 docs/hr
•   SharePoint w/ 50 import folders           failed
•   SharePoint w/ 2,000 import folders        217,000 docs/hr
1,000,000 files – each 40kB
             (contd.)

• Substantial work to organize content for
  SharePoint import
• SharePoint much more RAM intensive
  – Primarily with database process
• UCM more CPU intensive
  – Much more linear response
1,000,000 files – each 100kB
• UCM – FS storage & Sun JRE 179,000 docs/hr
  – 15% decrease in rate due to file size

• Unable to complete test with SharePoint
Conclusions
• SharePoint requires 3rd party tools and substantial work
  before import
• SharePoint has limited flexibility in terms of repository
  sizing, content organization, and import strategies
• With optimized import, SharePoint ingestion rates are
  comparable to UCM
• UCM has much more flexibility in import strategies
• UCM has consistent import rates between 156,000 and
  212,000 docs/hr (OOTB)
Conclusions (contd.)
• Ingestion rates are dependant on average file size (10-
  15% decrease in rate between 40kB and 100kB file size)
• UCM can be deployed on commodity hardware for
  repositories of 1,000,000 items
• SharePoint has challenges importing 1,000,000 files on
  commodity hardware
• Both systems function well on this hardware after import.
• SharePoint import is much more RAM intensive whereas
  UCM import is CPU intensive
Q&A
Comments / Questions /
         Feedback
Contact us:
  Raoul Miller
    Raoul.miller@teaminformatics.com


  Brent Seaman
    Brent.seaman@teaminformatics.com

More Related Content

What's hot

SharePoint 2010 upgrades!
SharePoint 2010 upgrades!SharePoint 2010 upgrades!
SharePoint 2010 upgrades!Eric VanRoy
 
Sql server 2014 y los datos
Sql server 2014 y los datosSql server 2014 y los datos
Sql server 2014 y los datosKenneth Urena
 
An Expert Guide to Migrating Legacy Databases to PostgreSQL
An Expert Guide to Migrating Legacy Databases to PostgreSQLAn Expert Guide to Migrating Legacy Databases to PostgreSQL
An Expert Guide to Migrating Legacy Databases to PostgreSQLEDB
 
Alfresco Day Roma 2015: Full Stack Load Testing
Alfresco Day Roma 2015: Full Stack Load TestingAlfresco Day Roma 2015: Full Stack Load Testing
Alfresco Day Roma 2015: Full Stack Load TestingAlfresco Software
 
WiredTiger MongoDB Integration
WiredTiger MongoDB Integration WiredTiger MongoDB Integration
WiredTiger MongoDB Integration MongoDB
 
Webinar: Introduction to MongoDB 3.0
Webinar: Introduction to MongoDB 3.0Webinar: Introduction to MongoDB 3.0
Webinar: Introduction to MongoDB 3.0MongoDB
 
SPSSac2014 - SharePoint Infrastructure Tips and Tricks for On-Premises and Hy...
SPSSac2014 - SharePoint Infrastructure Tips and Tricks for On-Premises and Hy...SPSSac2014 - SharePoint Infrastructure Tips and Tricks for On-Premises and Hy...
SPSSac2014 - SharePoint Infrastructure Tips and Tricks for On-Premises and Hy...Michael Noel
 
Managing storage on Prem and in Cloud
Managing storage on Prem and in CloudManaging storage on Prem and in Cloud
Managing storage on Prem and in CloudHoward Marks
 
SPSUtah 2014 SharePoint 2013 Performance (Admin)
SPSUtah 2014 SharePoint 2013 Performance (Admin)SPSUtah 2014 SharePoint 2013 Performance (Admin)
SPSUtah 2014 SharePoint 2013 Performance (Admin)Brian Culver
 
IBM #Softlayer infographic 2016
IBM #Softlayer infographic 2016IBM #Softlayer infographic 2016
IBM #Softlayer infographic 2016Patrick Bouillaud
 
Making Postgres Central in Your Data Center
Making Postgres Central in Your Data CenterMaking Postgres Central in Your Data Center
Making Postgres Central in Your Data CenterEDB
 
(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...
(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...
(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...BIOVIA
 
Fontys Lecture - The Evolution of the Oracle Database 2016
Fontys Lecture -  The Evolution of the Oracle Database 2016Fontys Lecture -  The Evolution of the Oracle Database 2016
Fontys Lecture - The Evolution of the Oracle Database 2016Lucas Jellema
 
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...NetApp
 
SQL Server 2019 CTP2.4
SQL Server 2019 CTP2.4SQL Server 2019 CTP2.4
SQL Server 2019 CTP2.4Gianluca Hotz
 
SQL 2012 AlwaysOn Availability Groups for SharePoint 2013 - SharePoint Connec...
SQL 2012 AlwaysOn Availability Groups for SharePoint 2013 - SharePoint Connec...SQL 2012 AlwaysOn Availability Groups for SharePoint 2013 - SharePoint Connec...
SQL 2012 AlwaysOn Availability Groups for SharePoint 2013 - SharePoint Connec...Michael Noel
 

What's hot (20)

SharePoint 2010 upgrades!
SharePoint 2010 upgrades!SharePoint 2010 upgrades!
SharePoint 2010 upgrades!
 
Sql server 2014 y los datos
Sql server 2014 y los datosSql server 2014 y los datos
Sql server 2014 y los datos
 
An Expert Guide to Migrating Legacy Databases to PostgreSQL
An Expert Guide to Migrating Legacy Databases to PostgreSQLAn Expert Guide to Migrating Legacy Databases to PostgreSQL
An Expert Guide to Migrating Legacy Databases to PostgreSQL
 
Alfresco Day Roma 2015: Full Stack Load Testing
Alfresco Day Roma 2015: Full Stack Load TestingAlfresco Day Roma 2015: Full Stack Load Testing
Alfresco Day Roma 2015: Full Stack Load Testing
 
WiredTiger MongoDB Integration
WiredTiger MongoDB Integration WiredTiger MongoDB Integration
WiredTiger MongoDB Integration
 
Webinar: Introduction to MongoDB 3.0
Webinar: Introduction to MongoDB 3.0Webinar: Introduction to MongoDB 3.0
Webinar: Introduction to MongoDB 3.0
 
IaaS for DBAs in Azure
IaaS for DBAs in AzureIaaS for DBAs in Azure
IaaS for DBAs in Azure
 
XPages Performance Master Class - Survive in the fast lane on the Autobahn (E...
XPages Performance Master Class - Survive in the fast lane on the Autobahn (E...XPages Performance Master Class - Survive in the fast lane on the Autobahn (E...
XPages Performance Master Class - Survive in the fast lane on the Autobahn (E...
 
Alfresco in an hour
Alfresco in an hourAlfresco in an hour
Alfresco in an hour
 
SPSSac2014 - SharePoint Infrastructure Tips and Tricks for On-Premises and Hy...
SPSSac2014 - SharePoint Infrastructure Tips and Tricks for On-Premises and Hy...SPSSac2014 - SharePoint Infrastructure Tips and Tricks for On-Premises and Hy...
SPSSac2014 - SharePoint Infrastructure Tips and Tricks for On-Premises and Hy...
 
MySQL 5.7 what's new
MySQL 5.7 what's newMySQL 5.7 what's new
MySQL 5.7 what's new
 
Managing storage on Prem and in Cloud
Managing storage on Prem and in CloudManaging storage on Prem and in Cloud
Managing storage on Prem and in Cloud
 
SPSUtah 2014 SharePoint 2013 Performance (Admin)
SPSUtah 2014 SharePoint 2013 Performance (Admin)SPSUtah 2014 SharePoint 2013 Performance (Admin)
SPSUtah 2014 SharePoint 2013 Performance (Admin)
 
IBM #Softlayer infographic 2016
IBM #Softlayer infographic 2016IBM #Softlayer infographic 2016
IBM #Softlayer infographic 2016
 
Making Postgres Central in Your Data Center
Making Postgres Central in Your Data CenterMaking Postgres Central in Your Data Center
Making Postgres Central in Your Data Center
 
(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...
(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...
(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...
 
Fontys Lecture - The Evolution of the Oracle Database 2016
Fontys Lecture -  The Evolution of the Oracle Database 2016Fontys Lecture -  The Evolution of the Oracle Database 2016
Fontys Lecture - The Evolution of the Oracle Database 2016
 
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
 
SQL Server 2019 CTP2.4
SQL Server 2019 CTP2.4SQL Server 2019 CTP2.4
SQL Server 2019 CTP2.4
 
SQL 2012 AlwaysOn Availability Groups for SharePoint 2013 - SharePoint Connec...
SQL 2012 AlwaysOn Availability Groups for SharePoint 2013 - SharePoint Connec...SQL 2012 AlwaysOn Availability Groups for SharePoint 2013 - SharePoint Connec...
SQL 2012 AlwaysOn Availability Groups for SharePoint 2013 - SharePoint Connec...
 

Similar to Repository Scalability - comparing SharePoint 2010 with Oracle UCM 11g

05. performance-concepts-26-slides
05. performance-concepts-26-slides05. performance-concepts-26-slides
05. performance-concepts-26-slidesMuhammad Ahad
 
05. performance-concepts
05. performance-concepts05. performance-concepts
05. performance-conceptsMuhammad Ahad
 
Building & Testing Scalable Rails Applications
Building & Testing Scalable Rails ApplicationsBuilding & Testing Scalable Rails Applications
Building & Testing Scalable Rails Applicationsevilmike
 
Architecture Patterns - Open Discussion
Architecture Patterns - Open DiscussionArchitecture Patterns - Open Discussion
Architecture Patterns - Open DiscussionNguyen Tung
 
SQL Server and SharePoint - Best Practices presented by Steffen Krause, Micro...
SQL Server and SharePoint - Best Practices presented by Steffen Krause, Micro...SQL Server and SharePoint - Best Practices presented by Steffen Krause, Micro...
SQL Server and SharePoint - Best Practices presented by Steffen Krause, Micro...European SharePoint Conference
 
Data Vault Automation at the Bijenkorf
Data Vault Automation at the BijenkorfData Vault Automation at the Bijenkorf
Data Vault Automation at the BijenkorfRob Winters
 
9.6_Course Material-Postgresql_002.pdf
9.6_Course Material-Postgresql_002.pdf9.6_Course Material-Postgresql_002.pdf
9.6_Course Material-Postgresql_002.pdfsreedb2
 
Teradata Partners 2011 - Utilizing Teradata Express For Development And Sandb...
Teradata Partners 2011 - Utilizing Teradata Express For Development And Sandb...Teradata Partners 2011 - Utilizing Teradata Express For Development And Sandb...
Teradata Partners 2011 - Utilizing Teradata Express For Development And Sandb...monsonc
 
Implementation of Oracle ExaData and OFM 11g with Banner in HCT
Implementation of Oracle ExaData and OFM 11g with Banner in HCTImplementation of Oracle ExaData and OFM 11g with Banner in HCT
Implementation of Oracle ExaData and OFM 11g with Banner in HCTKhalid Tariq
 
Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...Alluxio, Inc.
 
(ATS4-PLAT08) Server Pool Management
(ATS4-PLAT08) Server Pool Management(ATS4-PLAT08) Server Pool Management
(ATS4-PLAT08) Server Pool ManagementBIOVIA
 
How_To_Soup_Up_Your_Farm
How_To_Soup_Up_Your_FarmHow_To_Soup_Up_Your_Farm
How_To_Soup_Up_Your_FarmNigel Price
 
Docker for the enterprise
Docker for the enterpriseDocker for the enterprise
Docker for the enterpriseBert Poller
 
Oracle database 12c introduction- Satyendra Pasalapudi
Oracle database 12c introduction- Satyendra PasalapudiOracle database 12c introduction- Satyendra Pasalapudi
Oracle database 12c introduction- Satyendra Pasalapudipasalapudi123
 
Alluxio - Scalable Filesystem Metadata Services
Alluxio - Scalable Filesystem Metadata ServicesAlluxio - Scalable Filesystem Metadata Services
Alluxio - Scalable Filesystem Metadata ServicesAlluxio, Inc.
 
Postgresql in Education
Postgresql in EducationPostgresql in Education
Postgresql in Educationdostatni
 
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...Global Business Events
 
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...Dirk Petersen
 

Similar to Repository Scalability - comparing SharePoint 2010 with Oracle UCM 11g (20)

05. performance-concepts-26-slides
05. performance-concepts-26-slides05. performance-concepts-26-slides
05. performance-concepts-26-slides
 
05. performance-concepts
05. performance-concepts05. performance-concepts
05. performance-concepts
 
Tech4Africa 2014
Tech4Africa 2014Tech4Africa 2014
Tech4Africa 2014
 
Building & Testing Scalable Rails Applications
Building & Testing Scalable Rails ApplicationsBuilding & Testing Scalable Rails Applications
Building & Testing Scalable Rails Applications
 
Architecture Patterns - Open Discussion
Architecture Patterns - Open DiscussionArchitecture Patterns - Open Discussion
Architecture Patterns - Open Discussion
 
SQL Server and SharePoint - Best Practices presented by Steffen Krause, Micro...
SQL Server and SharePoint - Best Practices presented by Steffen Krause, Micro...SQL Server and SharePoint - Best Practices presented by Steffen Krause, Micro...
SQL Server and SharePoint - Best Practices presented by Steffen Krause, Micro...
 
Data Vault Automation at the Bijenkorf
Data Vault Automation at the BijenkorfData Vault Automation at the Bijenkorf
Data Vault Automation at the Bijenkorf
 
9.6_Course Material-Postgresql_002.pdf
9.6_Course Material-Postgresql_002.pdf9.6_Course Material-Postgresql_002.pdf
9.6_Course Material-Postgresql_002.pdf
 
Teradata Partners 2011 - Utilizing Teradata Express For Development And Sandb...
Teradata Partners 2011 - Utilizing Teradata Express For Development And Sandb...Teradata Partners 2011 - Utilizing Teradata Express For Development And Sandb...
Teradata Partners 2011 - Utilizing Teradata Express For Development And Sandb...
 
Implementation of Oracle ExaData and OFM 11g with Banner in HCT
Implementation of Oracle ExaData and OFM 11g with Banner in HCTImplementation of Oracle ExaData and OFM 11g with Banner in HCT
Implementation of Oracle ExaData and OFM 11g with Banner in HCT
 
Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...
 
(ATS4-PLAT08) Server Pool Management
(ATS4-PLAT08) Server Pool Management(ATS4-PLAT08) Server Pool Management
(ATS4-PLAT08) Server Pool Management
 
How_To_Soup_Up_Your_Farm
How_To_Soup_Up_Your_FarmHow_To_Soup_Up_Your_Farm
How_To_Soup_Up_Your_Farm
 
Docker for the enterprise
Docker for the enterpriseDocker for the enterprise
Docker for the enterprise
 
Oracle database 12c introduction- Satyendra Pasalapudi
Oracle database 12c introduction- Satyendra PasalapudiOracle database 12c introduction- Satyendra Pasalapudi
Oracle database 12c introduction- Satyendra Pasalapudi
 
Alluxio - Scalable Filesystem Metadata Services
Alluxio - Scalable Filesystem Metadata ServicesAlluxio - Scalable Filesystem Metadata Services
Alluxio - Scalable Filesystem Metadata Services
 
Operational-Analytics
Operational-AnalyticsOperational-Analytics
Operational-Analytics
 
Postgresql in Education
Postgresql in EducationPostgresql in Education
Postgresql in Education
 
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...
 
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...
 

More from Raoul Miller

Multitenancy on OCI - FinalRM.pptx
Multitenancy on OCI - FinalRM.pptxMultitenancy on OCI - FinalRM.pptx
Multitenancy on OCI - FinalRM.pptxRaoul Miller
 
GSA Presentation - MILLER 251-4.pdf
GSA Presentation - MILLER 251-4.pdfGSA Presentation - MILLER 251-4.pdf
GSA Presentation - MILLER 251-4.pdfRaoul Miller
 
Choosing a new platform for records or document management
Choosing a new platform for records or document managementChoosing a new platform for records or document management
Choosing a new platform for records or document managementRaoul Miller
 
Developing a privacy compliance program
Developing a privacy compliance programDeveloping a privacy compliance program
Developing a privacy compliance programRaoul Miller
 
A simple guide for moving your content systems to the cloud
A simple guide for moving your content systems to the cloudA simple guide for moving your content systems to the cloud
A simple guide for moving your content systems to the cloudRaoul Miller
 
Boost privacy protections with attribute-based access control
Boost privacy protections with attribute-based access control Boost privacy protections with attribute-based access control
Boost privacy protections with attribute-based access control Raoul Miller
 
WebCenter as a Cloud App on Exalogic
WebCenter as a Cloud App on ExalogicWebCenter as a Cloud App on Exalogic
WebCenter as a Cloud App on ExalogicRaoul Miller
 
Email Management Using Oracle WebCenter Content Records
Email Management Using Oracle WebCenter Content RecordsEmail Management Using Oracle WebCenter Content Records
Email Management Using Oracle WebCenter Content RecordsRaoul Miller
 

More from Raoul Miller (8)

Multitenancy on OCI - FinalRM.pptx
Multitenancy on OCI - FinalRM.pptxMultitenancy on OCI - FinalRM.pptx
Multitenancy on OCI - FinalRM.pptx
 
GSA Presentation - MILLER 251-4.pdf
GSA Presentation - MILLER 251-4.pdfGSA Presentation - MILLER 251-4.pdf
GSA Presentation - MILLER 251-4.pdf
 
Choosing a new platform for records or document management
Choosing a new platform for records or document managementChoosing a new platform for records or document management
Choosing a new platform for records or document management
 
Developing a privacy compliance program
Developing a privacy compliance programDeveloping a privacy compliance program
Developing a privacy compliance program
 
A simple guide for moving your content systems to the cloud
A simple guide for moving your content systems to the cloudA simple guide for moving your content systems to the cloud
A simple guide for moving your content systems to the cloud
 
Boost privacy protections with attribute-based access control
Boost privacy protections with attribute-based access control Boost privacy protections with attribute-based access control
Boost privacy protections with attribute-based access control
 
WebCenter as a Cloud App on Exalogic
WebCenter as a Cloud App on ExalogicWebCenter as a Cloud App on Exalogic
WebCenter as a Cloud App on Exalogic
 
Email Management Using Oracle WebCenter Content Records
Email Management Using Oracle WebCenter Content RecordsEmail Management Using Oracle WebCenter Content Records
Email Management Using Oracle WebCenter Content Records
 

Recently uploaded

Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 

Recently uploaded (20)

Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 

Repository Scalability - comparing SharePoint 2010 with Oracle UCM 11g

  • 1. Repository Scalability: Comparing Microsoft SharePoint 2010 and Oracle UCM 11g Raoul Miller and Brent Seaman, TEAM Informatics, inc.
  • 2. Outline • Our ingestion rate experiments • Hardware and Software setup • Experimental design • Observations and Conclusions from these tests • Implications for Repository Sizing and Organization in SharePoint and UCM • Lessons Learned and Recommendations • Q and A
  • 3. Overall aims of this research • Apply real-world scenarios to ingestion testing • Rather than ultra high performance / ultra high cost • Determine actual ingestion rates for different scenarios on identical hardware • Expose weaknesses / issues in large imports • Derive recommendations for best practices in importing existing content into new CMS repositories
  • 4. Experimental Approach • Import existing files from file system into newly-installed CMS – Standard configurations – Commodity hardware – No specialized tuning or optimizations – Vendor recommended OS and databases • Four scenarios – 20,000 files @ 40kB – 20,000 files @ 100kB – 1,000,000 files @ 40kB – 1,000,000 files @ 100kB
  • 5. Are these Scenarios Realistic? • >80% of single instance CMS repositories contain 50-200,000+ items • Average “document” size in most industries is ~100kB. • Most projects need to import existing content from file shares or other systems
  • 6. Commodity Hardware • Dell PowerEdge R710s server • Dual Intel Xeon 5560 CPUs (@ quad core) running at 2.8Ghz • 16GB RAM • Eight 146GB 10K RPM SAS drives
  • 7. UCM Installation • Operating System: RedHat Enterprise 5 (specifically theCentOS5 build) • Database: Oracle 11g Standard Edition database • Web / Application Server: Weblogic 11gR1 (10.3.3) • Content ManagementUCM 11gR1 (11.1.1.4.0) System: • Java Runtime Environment:Sun Hotspot SDK (1.6.0_11) & JRockit R28 • File storage: File system (default) and JDBC (SecureFiles)
  • 8. SharePoint Installation • Operating System: Windows Server 2008 Std Edition for Partners • Database: Microsoft SQL Server 2008 R2 Enterprise • Web Server: IIS7 (Standard with Windows Server 2008 - specifically v 7.5.76) • Content ManagementSharePoint Server 2010 Enterprise for System: Partners • File storage: Database Storage in SQL server
  • 9. Ingestion Approaches • UCM – used BatchBuilder and BatchLoader • SharePoint - – had to use third party tool (UploadZen by Roxority) – Need to organize content before import – Limited flexibility in directory size
  • 10. Supported SharePoint 2010 bulk import strategies • Multiple file upload applet – Silverlight; supports up to 100 docs, does not support subdirectories • Windows Explorer view – Extension of webDAV – Limited performance • SharePoint Workspace – Client integration – Only supports up to 500 documents
  • 11. Differences between Import Strategies • BatchLoader – Supported system tool – Allows automated file system crawl (BatchBuilder) – Storage / browse location in repository unrelated to source location – Supports high volume • UploadZen – Third-party application – Requires organization and sizing of import directories – Organization within repository reflects import location – Major challenges with high volume imports
  • 12. Considerations for Repository Sizing 1. Should be primarily driven by business / infosec needs 2. Practicality – Import / migrate – Search / organize – Backup / DR 3. Flexibility – Growth in content volume / size – Leverage HSM / partitioning – Provide options for storage strategies
  • 13. Ingestion Rate Testing • Major things to test: – Overall rate of ingestion with different sized files and different sized collections – Ease of use of import tools – Flexibility in organization of content during / after import
  • 14. 20,000 files – each 40kB • First set of tests • Single directory for SharePoint source • UCM – File System storage – 198,000 docs/hr • UCM – JDBC storage – 156,000 docs/hr • SharePoint – 153,000 docs/hr
  • 15. 20,000 files – each 100kB • UCM – File System storage – 171,000 docs/hr • SharePoint – 138,000 docs/hr • Ingestion rates fell 10-15% for larger file size • SharePoint RAM usage higher, primarily in database
  • 16. 1,000,000 files – each 40kB • Need to organize files in directories for SharePoint – 50 folders each with 20,000 items - failed – 2,000 folders each with 500 items – succeeded • UCM – FS storage & Sun JRE 205,000 docs/hr • UCM – FS storage & JRockit JRE 212,000 docs/hr • UCM – JDBC storage & Sun JRE 171,000 docs/hr • SharePoint w/ 50 import folders failed • SharePoint w/ 2,000 import folders 217,000 docs/hr
  • 17. 1,000,000 files – each 40kB (contd.) • Substantial work to organize content for SharePoint import • SharePoint much more RAM intensive – Primarily with database process • UCM more CPU intensive – Much more linear response
  • 18. 1,000,000 files – each 100kB • UCM – FS storage & Sun JRE 179,000 docs/hr – 15% decrease in rate due to file size • Unable to complete test with SharePoint
  • 19. Conclusions • SharePoint requires 3rd party tools and substantial work before import • SharePoint has limited flexibility in terms of repository sizing, content organization, and import strategies • With optimized import, SharePoint ingestion rates are comparable to UCM • UCM has much more flexibility in import strategies • UCM has consistent import rates between 156,000 and 212,000 docs/hr (OOTB)
  • 20. Conclusions (contd.) • Ingestion rates are dependant on average file size (10- 15% decrease in rate between 40kB and 100kB file size) • UCM can be deployed on commodity hardware for repositories of 1,000,000 items • SharePoint has challenges importing 1,000,000 files on commodity hardware • Both systems function well on this hardware after import. • SharePoint import is much more RAM intensive whereas UCM import is CPU intensive
  • 21. Q&A
  • 22. Comments / Questions / Feedback Contact us: Raoul Miller Raoul.miller@teaminformatics.com Brent Seaman Brent.seaman@teaminformatics.com