When Smalltalk images get large

ESUG 2017

When Pharo images get large
• A 7-gigabyte Pharo image is “the largest comfortable
Pharo image” [1].
• The largest GemStone production image is 1.5
terabytes.
[1] https://clementbera.wordpress.com/2017/03/12/tuning-the-pharo-garbage-collector/

GemStone Features
• Object Table
• Object Faulting
• Transactions
• Garbage Collection
• Multiple VMs
• Identity-based collections
• Indexed collections

Object Table
• every object has an object id
• Object Table maps an object id to a data page
• data pages are the unit of disk I/O
• between 1 and 2000 objects can fit on a data page
• an object larger than a page is broken up into
page-sized chunks
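
The mapping described above can be pictured with a small toy model. This is only a sketch, not GemStone's implementation: the names `ToyObjectTable` and `split_into_chunks` are hypothetical, and the 16 KB page size is an assumption chosen for illustration.

```python
PAGE_SIZE = 16 * 1024  # assumed page size in bytes; illustration only


def split_into_chunks(payload, page_size=PAGE_SIZE):
    """Break a payload into page-sized chunks (the last one may be shorter)."""
    return [payload[i:i + page_size] for i in range(0, len(payload), page_size)]


class ToyObjectTable:
    """Maps an object id to the id of the data page that holds the object."""

    def __init__(self):
        self._page_of = {}

    def record(self, oid, page_id):
        self._page_of[oid] = page_id

    def page_for(self, oid):
        return self._page_of[oid]


table = ToyObjectTable()
table.record(42, 7)                 # hypothetical: object 42 lives on page 7
big_object = b"x" * (2 * PAGE_SIZE + 1)
chunks = split_into_chunks(big_object)  # one byte too large for two pages
```

In this model an object one byte larger than two pages occupies three chunks, the last holding a single byte.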

Object Faulting
• When an object is referenced, its page id is looked
up in the Object Table
• the data page is loaded into Shared Page Cache
(SPC)
• objects are copied from SPC to vm memory
• object ids are converted to direct memory pointers
• stub objects in vm represent objects not yet loaded
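
The faulting sequence above can be sketched as a toy model. Everything here is hypothetical (`ToyVM`, `Stub`, the dictionaries standing in for disk pages and the SPC); it only mirrors the order of the steps on the slide and ignores pointer conversion.

```python
class Stub:
    """Placeholder for an object that has not been loaded into VM memory."""

    def __init__(self, oid):
        self.oid = oid


class ToyVM:
    def __init__(self, object_table, disk_pages):
        self.object_table = object_table  # object id -> page id
        self.disk_pages = disk_pages      # page id -> {object id: value}
        self.spc = {}                     # stand-in for the Shared Page Cache
        self.memory = {}                  # object id -> value or Stub
        self.page_reads = 0               # how many pages were read from disk

    def reference(self, oid):
        if oid in self.memory and not isinstance(self.memory[oid], Stub):
            return self.memory[oid]       # already loaded, no fault needed
        page_id = self.object_table[oid]  # 1. look up the page id
        if page_id not in self.spc:       # 2. load the data page into the SPC
            self.spc[page_id] = self.disk_pages[page_id]
            self.page_reads += 1
        value = self.spc[page_id][oid]    # 3. copy the object into VM memory
        self.memory[oid] = value          #    (this replaces any stub)
        return value


vm = ToyVM({1: 10, 2: 10}, {10: {1: "a", 2: "b"}})
vm.memory[1] = Stub(1)   # object 1 is known to the VM but not yet loaded
first = vm.reference(1)  # faults: reads page 10 into the SPC
second = vm.reference(2) # same page, so no further disk read
```

Because both objects sit on the same data page in this example, the second reference is served from the cache without another disk read.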

Transactions
• On commit, only modified objects in vm are copied
to new pages in the SPC.
• A record of the object modifications is written to the
transaction log.
• The commit is complete when the transaction log is
successfully written to disk.
• On an abort, all modified objects in vm are converted
to stubs, as well as those changed by other sessions.
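
A minimal sketch of the commit and abort behaviour described above, under heavy simplification. `ToySession`, `Stub`, and the list standing in for the transaction log are hypothetical names; a real system also has to flush the log to disk before reporting the commit as complete.

```python
class Stub:
    """Placeholder for an object whose state must be reloaded on next use."""

    def __init__(self, oid):
        self.oid = oid


class ToySession:
    def __init__(self, memory):
        self.memory = dict(memory)  # object id -> value or Stub
        self.dirty = set()          # ids of objects modified in this session
        self.new_pages = []         # stand-in for new pages in the SPC
        self.tranlog = []           # stand-in for the transaction log

    def write(self, oid, value):
        self.memory[oid] = value
        self.dirty.add(oid)

    def commit(self):
        changed = {oid: self.memory[oid] for oid in self.dirty}
        self.new_pages.append(changed)        # only modified objects are copied
        self.tranlog.append(sorted(changed))  # record of the modifications
        self.dirty.clear()

    def abort(self, changed_elsewhere=()):
        # Modified objects, and objects changed by other sessions, become stubs.
        for oid in self.dirty | set(changed_elsewhere):
            self.memory[oid] = Stub(oid)
        self.dirty.clear()


session = ToySession({1: "a", 2: "b", 3: "c"})
session.write(2, "B")
session.commit()                       # copies only object 2
session.write(1, "A")
session.abort(changed_elsewhere=[3])   # objects 1 and 3 become stubs
```

After the abort, object 2 keeps its committed value while objects 1 and 3 would be faulted in again on their next reference.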

Garbage Collection
• GC is run as a separate (multi-threaded) operating
system process under your control.
• GC is designed to be run while the system is
actively committing.
• Schedule GC to minimize impacts on production
performance.

Multiple VMs
• An application can arrange to run multiple Smalltalk
vms to perform concurrent operations.
• Spread the work out over multiple CPUs and even
multiple machines.
• Concurrent commits are allowed as long as two
vms do not modify the same object during
overlapping commits.
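
The last rule can be illustrated with a toy conflict check. This sketch models only the write-write case stated on the slide; `ToyRepository` and its methods are hypothetical, and a real repository's conflict rules may be richer than this.

```python
class ToyRepository:
    def __init__(self):
        self.commits = []  # write sets (frozensets of object ids), in order

    def begin(self):
        """Return a marker for the repository state a transaction starts from."""
        return len(self.commits)

    def try_commit(self, started_at, write_set):
        """Commit unless an overlapping commit modified one of the same objects."""
        write_set = frozenset(write_set)
        for other in self.commits[started_at:]:
            if other & write_set:
                return False  # same object modified during overlapping commits
        self.commits.append(write_set)
        return True


repo = ToyRepository()
t1, t2, t3 = repo.begin(), repo.begin(), repo.begin()
ok1 = repo.try_commit(t1, {1, 2})  # succeeds
ok2 = repo.try_commit(t2, {3})     # succeeds: disjoint from t1's writes
ok3 = repo.try_commit(t3, {2, 4})  # fails: object 2 was modified by t1
```

A vm whose commit fails in this model would start a fresh transaction and retry, at which point the same write set can succeed.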

Large Collections
• Excessive object faulting can occur, especially when
the objects in a large collection do not all fit in the
object memory for a vm (not enough object
memory for your working set)
• Identity-based Collections
• Indexed Collections

Identity-based Collections (IdentityBag/Set)
• The vm performs identity comparisons within
primitives by comparing object ids, instead of
sending messages, so an object fault is not
required
• #includes: (implemented as a primitive) can be
performed without faulting in the elements of the
collection
• very fast even for large collections
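
The difference between identity and equality lookups can be sketched with a fault counter. The names below (`FaultCounter`, `includes_by_identity`, `includes_by_equality`) are hypothetical and the model is deliberately crude: it only shows that comparing object ids never has to load an element.

```python
class FaultCounter:
    """Counts how often an element has to be loaded to be inspected."""

    def __init__(self, store):
        self.store = store  # object id -> value
        self.faults = 0

    def load(self, oid):
        self.faults += 1
        return self.store[oid]


def includes_by_identity(member_oids, target_oid):
    # Compares object ids only, so no element is ever loaded.
    return target_oid in member_oids


def includes_by_equality(member_oids, target_value, loader):
    # Must load each element to compare its value.
    return any(loader.load(oid) == target_value for oid in member_oids)


loader = FaultCounter({1: "a", 2: "b", 3: "c"})
found_by_id = includes_by_identity([1, 2, 3], 3)              # no loads
found_by_value = includes_by_equality([1, 2, 3], "c", loader)  # three loads
```

In this example the equality search loads all three elements before it finds a match, while the identity search loads none.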

Indexed Collections
• the query result for an indexed collection is created
by copying the object ids directly from the B-tree
nodes into the result set, without faulting in the
objects
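
A rough sketch of the idea, assuming an index over a numeric key. A sorted list searched with `bisect` stands in for the B-tree leaf nodes here; `ToyIndex` and `range_query` are hypothetical names, not GemStone's query API.

```python
import bisect


class ToyIndex:
    """Sorted (key, object id) pairs standing in for B-tree leaf entries."""

    def __init__(self, entries):
        self._entries = sorted(entries)
        self._keys = [key for key, _ in self._entries]

    def range_query(self, low, high):
        """Return the ids of objects whose key lies in [low, high]."""
        start = bisect.bisect_left(self._keys, low)
        stop = bisect.bisect_right(self._keys, high)
        # Only object ids are copied; the objects themselves are never loaded.
        return [oid for _, oid in self._entries[start:stop]]


# Hypothetical data: (age, object id) pairs for four person objects.
index = ToyIndex([(30, 101), (25, 102), (41, 103), (30, 104)])
result = index.range_query(26, 40)  # ids of the two objects with key 30
```

The result set holds only ids, so the matching objects are faulted in later, and only if the application actually touches them.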

Develop in Pharo, Deploy in GemStone (DiPDiG)
• Not yet a formalized technique [1]
• port the application to GemStone/S, adding GemStone-specific
packages to your BaselineOf if needed
• production is deployed in a GemStone/S installation
• ongoing development happens in Pharo
• With gt4gemstone [2] the door is now open for expanding the
GemStone toolset to include direct support for DiPDiG
• PharoGs (future) should reduce the “porting requirement”
[1] http://forum.world.st/How-do-you-develop-for-gemstone-in-open-source-tools-pharo-td4952364.html
[2] https://github.com/feenkcom/gt4gemstone

Dale Henrichs, GemTalk Systems, ESUG 2017
