Improving Performance Through Object Lifetime Profiling: the DataFrame Case

•

0 likes•47 views

This document discusses improving garbage collection performance in Pharo through object lifetime profiling. It presents Illimani, a lifetime profiler developed for Pharo. Illimani was used to profile the lifetimes of objects created when loading a large DataFrame. The profiling revealed that most objects had short lifetimes, suggesting the garbage collector could be tuned. Tuning the garbage collector parameters based on the lifetime profiles improved the performance of loading the DataFrame.

Improving Performance Through
Object Lifetime Pro
fi
ling: the
DataFrame Case
Sebastian JORDAN MONTAÑO, Nahuel PALUMBO, Guillermo POLITO,
Stéphane DUCASSE and Pablo TESONE
Inria, Univ. Lille, CNRS, Centrale Lille, UMR 9189 - CRIStAL
August 2023 Evref
fervE

Memory management in software
int* ptr = (int*) malloc(sizeof(int));
free(ptr);
2

Pharo’s garbage collector
Eden S1 S2 Tenured
+
Young generation Old generation
3

Research question
How does approximate object lifetimes lead to GC performance
improvements?
6

An object’s lifetime
Object’s allocation
Object becomes
unreachable
GC collects
the object
Actual lifetime
Lifetime that we capture?
7

An object’s approximated lifetime
Object’s allocation
Object becomes
unreachable
GC prepares
the object
Actual lifetime
Lifetime that we capture
Object is
fi
nalized
(we take the measurement)
8

Capturing the allocations
Array new: 7
9

Capturing the allocations
Array new: 7
10
Capture the allocation
Register the
fi
nalization
MethodProxies [1]
Ephemerons [2]
[1] github.com/pharo-contributions/MethodProxies
[2] github.com/pharo-project/pheps/blob/main/phep-0003.md

An object’s finalization at a time m
11
Ephemeron
#111
Object: Array
#123
Model
allocationTime: n
finalizationTime: m
m
o
u
r
n
finalize
E
p
h
e
m
e
r
o
n
i
s
c
o
n
s
u
m
e
d
1 2
3
key
Finalization Queue
Object #123 is
garbage collected
4

An object’s allocation at a time n
12
OrderedCollection class >> new: anInteger
^ self basicNew setCollection:
(self arrayType new: anInteger)
Behavior >> basicNew
<primitive: 70>
Array class >> basicNew: size
lines := OrderedCollection new
...

Paper’s contributions
Challenges of lifetime pro
fi
ling
Illimani: a lifetime pro
fi
ler on stock Pharo VM
13

The target application
https://github.com/PolyMathOrg/DataFrame
15

Object lifetimes for a 500MB DataFrame (memory)
-500 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000
0 B
9 B
116 B
1 KB
13 KB
145 KB
1 MB
16 MB
180 MB
1 GB
Lifetime in seconds
Memory
(log
scale)
60%
18

-500 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000
0
5
35
217
1,311
7,900
47,557
286,265
1,723,096
10,371,664
Lifetime in seconds
Number
of
objects
(log
scale)
75%
Object lifetimes for a 500MB DataFrame (# objects)
19

Common object lifetime distribution
20
Source: oracle.com

Pharo’s garbage collector
Eden S1 S2 Tenured
+
Young generation Old generation
22

Future work
Measure the precision of our approximate object lifetimes
Pro
fi
ling at VM level to reduce the overhead
Pre-tenuring
24

Summary
We developed a lifetime profiler
We profiled the object lifetimes and we validated our solution by
observing how lifetimes relate to performance improvements when
tuning the GC.
25
github.com/jordanmontt/illimani-memory-pro
fi
ler
Sebastian JORDAN MONTAÑO
sebastian.jordan@inria.fr

Plenary talk at the international Synchrotron Radiation Instrumentation conference in Taiwan, on work with great colleagues Ben Blaiszik, Ryan Chard, Logan Ward, and others. Rapidly growing data volumes at light sources demand increasingly automated data collection, distribution, and analysis processes, in order to enable new scientific discoveries while not overwhelming finite human capabilities. I present here three projects that use cloud-hosted data automation and enrichment services, institutional computing resources, and high- performance computing facilities to provide cost-effective, scalable, and reliable implementations of such processes. In the first, Globus cloud-hosted data automation services are used to implement data capture, distribution, and analysis workflows for Advanced Photon Source and Advanced Light Source beamlines, leveraging institutional storage and computing. In the second, such services are combined with cloud-hosted data indexing and institutional storage to create a collaborative data publication, indexing, and discovery service, the Materials Data Facility (MDF), built to support a host of informatics applications in materials science. The third integrates components of the previous two projects with machine learning capabilities provided by the Data and Learning Hub for science (DLHub) to enable on-demand access to machine learning models from light source data capture and analysis workflows, and provides simplified interfaces to train new models on data from sources such as MDF on leadership scale computing resources. I draw conclusions about best practices for building next-generation data automation systems for future light sources.

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

Safe Software

Following the popularity of “Cloud Revolution: Exploring the New Wave of Serverless Spatial Data,” we’re thrilled to announce this much-anticipated encore webinar. In this sequel, we’ll dive deeper into the Cloud-Native realm by uncovering practical applications and FME support for these new formats, including COGs, COPC, FlatGeoBuf, GeoParquet, STAC, and ZARR. Building on the foundation laid by industry leaders Michelle Roby of Radiant Earth and Chris Holmes of Planet in the first webinar, this second part offers an in-depth look at the real-world application and behind-the-scenes dynamics of these cutting-edge formats. We will spotlight specific use-cases and workflows, showcasing their efficiency and relevance in practical scenarios. Discover the vast possibilities each format holds, highlighted through detailed discussions and demonstrations. Our expert speakers will dissect the key aspects and provide critical takeaways for effective use, ensuring attendees leave with a thorough understanding of how to apply these formats in their own projects. Elevate your understanding of how FME supports these cutting-edge technologies, enhancing your ability to manage, share, and analyze spatial data. Whether you’re building on knowledge from our initial session or are new to the serverless spatial data landscape, this webinar is your gateway to mastering cloud-native formats in your workflows.

Computing Outside The Box June 2009

Ian Foster

Keynote talk at the International Conference on Supercoming 2009, at IBM Yorktown in New York. This is a major update of a talk first given in New Zealand last January. The abstract follows. The past decade has seen increasingly ambitious and successful methods for outsourcing computing. Approaches such as utility computing, on-demand computing, grid computing, software as a service, and cloud computing all seek to free computer applications from the limiting confines of a single computer. Software that thus runs "outside the box" can be more powerful (think Google, TeraGrid), dynamic (think Animoto, caBIG), and collaborative (think FaceBook, myExperiment). It can also be cheaper, due to economies of scale in hardware and software. The combination of new functionality and new economics inspires new applications, reduces barriers to entry for application providers, and in general disrupts the computing ecosystem. I discuss the new applications that outside-the-box computing enables, in both business and science, and the hardware and software architectures that make these new applications possible.

Big data at experimental facilities

Ian Foster

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

Safe Software

Following the popularity of "Cloud Revolution: Exploring the New Wave of Serverless Spatial Data," we're thrilled to announce this much-anticipated encore webinar. In this sequel, we'll dive deeper into the Cloud-Native realm by uncovering practical applications and FME support for these new formats, including COGs, COPC, FlatGeoBuf, GeoParquet, STAC, and ZARR. Building on the foundation laid by industry leaders Michelle Roby of Radiant Earth and Chris Holmes of Planet in the first webinar, this second part offers an in-depth look at the real-world application and behind-the-scenes dynamics of these cutting-edge formats. We will spotlight specific use-cases and workflows, showcasing their efficiency and relevance in practical scenarios. Discover the vast possibilities each format holds, highlighted through detailed discussions and demonstrations. Our expert speakers will dissect the key aspects and provide critical takeaways for effective use, ensuring attendees leave with a thorough understanding of how to apply these formats in their own projects. Elevate your understanding of how FME supports these cutting-edge technologies, enhancing your ability to manage, share, and analyze spatial data. Whether you're building on knowledge from our initial session or are new to the serverless spatial data landscape, this webinar is your gateway to mastering cloud-native formats in your workflows.

Micrometrics to forecast performance tsunamis

Tier1app

Tsunami waves travel at the speed of 500 - 600 miles/hr. Normal waves travel at the speed of 5 - 60 miles/hr. Due to technical limitations, even massive Tsunamis are hard to forecast and detect beforehand. In recent times, hyper sensitive micro-metrics measuring technologies are employed to forecast Tsunamis. Similarly, it’s hard to forecast production performance problems beforehand. In this session you will learn the micro-metrics to be measured in dev/test environments that can forecast production performance problems with a fair level of accuracy.

Key projects Data Science and Engineering

Vijayananda Mohire

This is our contributions to the Data Science projects, as developed in our startup. These are part of partner trainings and in-house design and development and testing of the course material and concepts in Data Science and Engineering. It covers Data ingestion, data wrangling, feature engineering, data analysis, data storage, data extraction, querying data, formatting and visualizing data for various dashboards.Data is prepared for accurate ML model predictions and Generative AI apps

A long time ago, there was Caffe and Theano, then came Torch and CNTK and Tensorflow, Keras and MXNet and Pytorch and Caffe2….a sea of Deep learning tools but none for Spark developers to dip into. Finally, there was BigDL, a deep learning library for Apache Spark. While BigDL is integrated into Spark and extends its capabilities to address the challenges of Big Data developers, will a library alone be enough to simplify and accelerate the deployment of ML/DL workloads on production clusters? From high level pipeline API support to feature transformers to pre-defined models and reference use cases, a rich repository of easy to use tools are now available with the ‘Analytics Zoo’. We’ll unpack the production challenges and opportunities with ML/DL on Spark and what the Zoo can do

DA 592 - Term Project Report - Berker Kozan Can KokluCan Köklü

kanimozhi2019.pdf

AshrafDabbas1

Fast object re-detection and localization in video for spatio-temporal fragme...

LinkedTV

Performance Optimization of CGYRO for Multiscale Turbulence Simulations

Igor Sfiligoi

ADS Team 8 Final Presentation

Pranay Mankad

PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems

NECST Lab @ Politecnico di Milano

Machine Learning (ML) models are often composed as pipelines of operators, from “classical” ML operators to pre-processing and featurization operators. Current systems deploy pipelines as "black boxes”, where the same implementation of training is run for inference. This solution is convenient but leaves large room to improve performance and resource usage. This talk presents Pretzel, a framework for deployment of ML pipelines that is inspired to Database Systems: Pretzel inspects and optimizes pipelines end-to-end much like queries, and manages resources common to multiple pipelines such as operators' state. Pretzel is joint work with University of Seoul and Microsoft Research and has recently been presented at OSDI ’18. After the overview, this talk also shows experimental results of Pretzel against state-of-art ML solutions and discusses limitations and extensions.

Interactive Data Analysis for End Users on HN Science Cloud

Helix Nebula The Science Cloud

The next generation of the Montage image mosaic engine

G. Bruce Berriman

GlobusWorld 2020 Keynote

Globus

Druid at naver.com - part 1

Jungsu Heo

Object extraction from satellite imagery using deep learning

Aly Abdelkareem

YOLOv4: optimal speed and accuracy of object detection review

LEE HOSEONG

PhD Thesis Proposal

Ziqiang Feng

"Optimizing SSD Object Detection for Low-power Devices," a Presentation from ...

Edge AI and Vision Alliance

For the full video of this presentation, please visit: https://www.embedded-vision.com/platinum-members/embedded-vision-alliance/embedded-vision-training/videos/pages/may-2019-embedded-vision-summit-guttmann For more information about embedded vision, please visit: http://www.embedded-vision.com Moses Guttmann, CTO and founder of Allegro, presents the "Optimizing SSD Object Detection for Low-power Devices" tutorial at the May 2019 Embedded Vision Summit. Deep learning-based computer vision models have gained traction in applications requiring object detection, thanks to their accuracy and flexibility. For deployment on low-power hardware, single-shot detection (SSD) models are attractive due to their speed when operating on inputs with small spatial dimensions. The key challenge in creating efficient embedded implementations of SSD is not in the feature extraction module, but rather is due to the non-linear bottleneck in the detection stage, which does not lend itself to parallelization. This hinders the ability to lower the processing time per frame, even with custom hardware. Guttmann describes in detail a data-centric optimization approach to SSD. The approach drastically lowers the number of priors (“anchors”) needed for the detection, and thus linearly decreases time spent on this costly part of the computation. Thus, specialized processors and custom hardware may be better utilized, yielding higher performance and lower latency regardless of the specific hardware used.

Chronix Poster for the Poster Session FAST 2017

Florian Lautenschlager

Virtual Science in the Cloudthetfoot

Workshop: Identifying concept inventories in agile programming

ESUG

Technical documentation support in Pharo

ESUG

Similar to Improving Performance Through Object Lifetime Profiling: the DataFrame Case

Key projects Data Science and Engineering

Vijayananda Mohire

Computing Outside The Box September 2009

Ian Foster

Fast object re detection and localization in video for spatio-temporal fragme...

MediaMixerCommunity

Analytics Zoo: Building Analytics and AI Pipeline for Apache Spark and BigDL ...

Databricks

DA 592 - Term Project Report - Berker Kozan Can KokluCan Köklü

kanimozhi2019.pdf

AshrafDabbas1

Fast object re-detection and localization in video for spatio-temporal fragme...

LinkedTV

Performance Optimization of CGYRO for Multiscale Turbulence Simulations

Igor Sfiligoi

ADS Team 8 Final Presentation

Pranay Mankad

PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems

NECST Lab @ Politecnico di Milano

Interactive Data Analysis for End Users on HN Science Cloud

Helix Nebula The Science Cloud

The next generation of the Montage image mosaic engine

G. Bruce Berriman

GlobusWorld 2020 Keynote

Globus

Druid at naver.com - part 1

Jungsu Heo

Object extraction from satellite imagery using deep learning

Aly Abdelkareem

YOLOv4: optimal speed and accuracy of object detection review

LEE HOSEONG

PhD Thesis Proposal

Ziqiang Feng

"Optimizing SSD Object Detection for Low-power Devices," a Presentation from ...

Edge AI and Vision Alliance

Chronix Poster for the Poster Session FAST 2017

Florian Lautenschlager

Virtual Science in the Cloudthetfoot

Similar to Improving Performance Through Object Lifetime Profiling: the DataFrame Case (20)

Key projects Data Science and Engineering

Computing Outside The Box September 2009

Fast object re detection and localization in video for spatio-temporal fragme...

Analytics Zoo: Building Analytics and AI Pipeline for Apache Spark and BigDL ...

DA 592 - Term Project Report - Berker Kozan Can Koklu

kanimozhi2019.pdf

Fast object re-detection and localization in video for spatio-temporal fragme...

Performance Optimization of CGYRO for Multiscale Turbulence Simulations

ADS Team 8 Final Presentation

PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems

Interactive Data Analysis for End Users on HN Science Cloud

The next generation of the Montage image mosaic engine

GlobusWorld 2020 Keynote

Druid at naver.com - part 1

Object extraction from satellite imagery using deep learning

YOLOv4: optimal speed and accuracy of object detection review

PhD Thesis Proposal

"Optimizing SSD Object Detection for Low-power Devices," a Presentation from ...

Chronix Poster for the Poster Session FAST 2017

Virtual Science in the Cloud

Recently uploaded

Orion Context Broker introduction 20240604

Fermin Galan

RISE with SAP and Journey to the Intelligent Enterprise

Srikant77

Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...

informapgpstrackings

May Marketo Masterclass, London MUG May 22 2024.pdf

Adele Miller

Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...

Globus

Large Language Models (LLMs) are currently the center of attention in the tech world, particularly for their potential to advance research. In this presentation, we'll explore a straightforward and effective method for quickly initiating inference runs on supercomputers using the vLLM tool with Globus Compute, specifically on the Polaris system at ALCF. We'll begin by briefly discussing the popularity and applications of LLMs in various fields. Following this, we will introduce the vLLM tool, and explain how it integrates with Globus Compute to efficiently manage LLM operations on Polaris. Attendees will learn the practical aspects of setting up and remotely triggering LLMs from local machines, focusing on ease of use and efficiency. This talk is ideal for researchers and practitioners looking to leverage the power of LLMs in their work, offering a clear guide to harnessing supercomputing resources for quick and effective LLM inference.

BoxLang: Review our Visionary Licenses of 2024

Ortus Solutions, Corp

Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx

rickgrimesss22

Vitthal Shirke Microservices Resume Montevideo

Vitthal Shirke

A Comprehensive Look at Generative AI in Retail App Testing.pdf

kalichargn70th171

A Sighting of filterA in Typelevel Rite of Passage

Philip Schwarz

Cyaniclab : Software Development Agency Portfolio.pdf

Cyanic lab

CyanicLab, an offshore custom software development company based in Sweden,India, Finland, is your go-to partner for startup development and innovative web design solutions. Our expert team specializes in crafting cutting-edge software tailored to meet the unique needs of startups and established enterprises alike. From conceptualization to execution, we offer comprehensive services including web and mobile app development, UI/UX design, and ongoing software maintenance. Ready to elevate your business? Contact CyanicLab today and let us propel your vision to success with our top-notch IT solutions.

2024 RoOUG Security model for the cloud.pptx

Georgi Kodinov

Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL

Natan Silnitsky

In software engineering, the right architecture is essential for robust, scalable platforms. Wix has undergone a pivotal shift from event sourcing to a CRUD-based model for its microservices. This talk will chart the course of this pivotal journey. Event sourcing, which records state changes as immutable events, provided robust auditing and "time travel" debugging for Wix Stores' microservices. Despite its benefits, the complexity it introduced in state management slowed development. Wix responded by adopting a simpler, unified CRUD model. This talk will explore the challenges of event sourcing and the advantages of Wix's new "CRUD on steroids" approach, which streamlines API integration and domain event management while preserving data integrity and system resilience. Participants will gain valuable insights into Wix's strategies for ensuring atomicity in database updates and event production, as well as caching, materialization, and performance optimization techniques within a distributed system. Join us to discover how Wix has mastered the art of balancing simplicity and extensibility, and learn how the re-adoption of the modest CRUD has turbocharged their development velocity, resilience, and scalability in a high-growth environment.

Developing Distributed High-performance Computing Capabilities of an Open Sci...

Globus

COVID-19 had an unprecedented impact on scientific collaboration. The pandemic and its broad response from the scientific community has forged new relationships among public health practitioners, mathematical modelers, and scientific computing specialists, while revealing critical gaps in exploiting advanced computing systems to support urgent decision making. Informed by our team’s work in applying high-performance computing in support of public health decision makers during the COVID-19 pandemic, we present how Globus technologies are enabling the development of an open science platform for robust epidemic analysis, with the goal of collaborative, secure, distributed, on-demand, and fast time-to-solution analyses to support public health.

OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam

takuyayamamoto1800

Corporate Management | Session 3 of 3 | Tendenci AMS

Tendenci - The Open Source AMS (Association Management Software)

Experience our free, in-depth three-part Tendenci Platform Corporate Membership Management workshop series! In Session 1 on May 14th, 2024, we began with an Introduction and Setup, mastering the configuration of your Corporate Membership Module settings to establish membership types, applications, and more. Then, on May 16th, 2024, in Session 2, we focused on binding individual members to a Corporate Membership and Corporate Reps, teaching you how to add individual members and assign Corporate Representatives to manage dues, renewals, and associated members. Finally, on May 28th, 2024, in Session 3, we covered questions and concerns, addressing any queries or issues you may have. For more Tendenci AMS events, check out www.tendenci.com/events

Graphic Design Crash Course for beginners

e20449

Accelerate Enterprise Software Engineering with Platformless

WSO2

Key takeaways: Challenges of building platforms and the benefits of platformless. Key principles of platformless, including API-first, cloud-native middleware, platform engineering, and developer experience. How Choreo enables the platformless experience. How key concepts like application architecture, domain-driven design, zero trust, and cell-based architecture are inherently a part of Choreo. Demo of an end-to-end app built and deployed on Choreo.

Using IESVE for Room Loads Analysis - Australia & New Zealand

IES VE

Globus Compute Introduction - GlobusWorld 2024

Globus

Recently uploaded (20)

Orion Context Broker introduction 20240604

RISE with SAP and Journey to the Intelligent Enterprise

Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...

May Marketo Masterclass, London MUG May 22 2024.pdf

Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...

BoxLang: Review our Visionary Licenses of 2024

Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx

Vitthal Shirke Microservices Resume Montevideo

A Comprehensive Look at Generative AI in Retail App Testing.pdf

A Sighting of filterA in Typelevel Rite of Passage

Cyaniclab : Software Development Agency Portfolio.pdf

2024 RoOUG Security model for the cloud.pptx

Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL

Developing Distributed High-performance Computing Capabilities of an Open Sci...

OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam

Corporate Management | Session 3 of 3 | Tendenci AMS

Graphic Design Crash Course for beginners

Accelerate Enterprise Software Engineering with Platformless

Using IESVE for Room Loads Analysis - Australia & New Zealand

Globus Compute Introduction - GlobusWorld 2024

Improving Performance Through Object Lifetime Profiling: the DataFrame Case

1. Improving Performance Through Object Lifetime Pro fi ling: the DataFrame Case Sebastian JORDAN MONTAÑO, Nahuel PALUMBO, Guillermo POLITO, Stéphane DUCASSE and Pablo TESONE Inria, Univ. Lille, CNRS, Centrale Lille, UMR 9189 - CRIStAL August 2023 Evref fervE

2. Memory management in software int* ptr = (int*) malloc(sizeof(int)); free(ptr); 2

3. Pharo’s garbage collector Eden S1 S2 Tenured + Young generation Old generation 3

4. GC parameters 4

5. Time spent on garbage collecting 5

6. Research question How does approximate object lifetimes lead to GC performance improvements? 6

7. An object’s lifetime Object’s allocation Object becomes unreachable GC collects the object Actual lifetime Lifetime that we capture? 7

8. An object’s approximated lifetime Object’s allocation Object becomes unreachable GC prepares the object Actual lifetime Lifetime that we capture Object is fi nalized (we take the measurement) 8

9. Capturing the allocations Array new: 7 9

10. Capturing the allocations Array new: 7 10 Capture the allocation Register the fi nalization MethodProxies [1] Ephemerons [2] [1] github.com/pharo-contributions/MethodProxies [2] github.com/pharo-project/pheps/blob/main/phep-0003.md

11. An object’s finalization at a time m 11 Ephemeron #111 Object: Array #123 Model allocationTime: n finalizationTime: m m o u r n finalize E p h e m e r o n i s c o n s u m e d 1 2 3 key Finalization Queue Object #123 is garbage collected 4

12. An object’s allocation at a time n 12 OrderedCollection class >> new: anInteger ^ self basicNew setCollection: (self arrayType new: anInteger) Behavior >> basicNew <primitive: 70> Array class >> basicNew: size lines := OrderedCollection new ...

13. Paper’s contributions Challenges of lifetime pro fi ling Illimani: a lifetime pro fi ler on stock Pharo VM 13

14. Methodology 14 Application’s Lifetime Profile P Application to Profile 3. Chose GC tuned parameters based on object lifetimes and benchmark information 5. Did the performance improved? GC tuned parameters 2. Benchmark it with the default GC parameters 4. Benchmark the application again with the tuned GC parameters 1. Profile the application Default GC parameters benchmark Tuned GC parameters benchmark

15. The target application https://github.com/PolyMathOrg/DataFrame 15

16. Benchmark the loading of DataFrame 16

17. Benchmark the loading of DataFrame 17

18. Object lifetimes for a 500MB DataFrame (memory) -500 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 0 B 9 B 116 B 1 KB 13 KB 145 KB 1 MB 16 MB 180 MB 1 GB Lifetime in seconds Memory (log scale) 60% 18

19. -500 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 0 5 35 217 1,311 7,900 47,557 286,265 1,723,096 10,371,664 Lifetime in seconds Number of objects (log scale) 75% Object lifetimes for a 500MB DataFrame (# objects) 19

20. Common object lifetime distribution 20 Source: oracle.com

21. GC custom parameters 21

22. Pharo’s garbage collector Eden S1 S2 Tenured + Young generation Old generation 22

23. Benchmarks results 23

24. Future work Measure the precision of our approximate object lifetimes Pro fi ling at VM level to reduce the overhead Pre-tenuring 24

25. Summary We developed a lifetime profiler We profiled the object lifetimes and we validated our solution by observing how lifetimes relate to performance improvements when tuning the GC. 25 github.com/jordanmontt/illimani-memory-pro fi ler Sebastian JORDAN MONTAÑO sebastian.jordan@inria.fr

Improving Performance Through Object Lifetime Profiling: the DataFrame Case

Recommended

Recommended

More Related Content

Similar to Improving Performance Through Object Lifetime Profiling: the DataFrame Case

Similar to Improving Performance Through Object Lifetime Profiling: the DataFrame Case (20)

More from ESUG

More from ESUG (20)

Recently uploaded

Recently uploaded (20)

Improving Performance Through Object Lifetime Profiling: the DataFrame Case