PYTHON:
THANKS FOR THE
MEMORIES
Danil Ineev
Python Dublin Meetup
September 2019
Agenda
■ Managed vs Unmanaged memory
■ Memory Allocation in CPython
■ Garbage Collection in CPython
■ How other Python implementations handle memory
■ Tips & Tricks
■ Q&A
About the author
■ Cursed by a witch in 2008 and since then can only use programming languages with
managed memory
■ ~7 years of professional experience with Python
■ Still constantly learning something new
■ Currently developing software here at Tenable (we’re hiring)
■ Quite passionate about photography and filmmaking, please visit my website
https://notreally.media/ (even if you don’t know Russian, you can always take a look at
pretty pictures)
MANAGED VS
UNMANAGED
Unmanaged Memory
■ The developer needs to manually allocate and release memory (see the sketch below)
Code from https://www.codingunit.com/c-tutorial-the-functions-malloc-and-free
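The C snippet from the original slide isn’t reproduced here; as a stand-in, here is a rough sketch of the same manual allocate/release dance driven from Python through ctypes (the library lookup is an assumption and varies by platform):

```python
import ctypes
import ctypes.util

# Illustration only: calling the C allocator by hand via ctypes.
libc = ctypes.CDLL(ctypes.util.find_library("c"))
libc.malloc.restype = ctypes.c_void_p
libc.free.argtypes = [ctypes.c_void_p]

buf = libc.malloc(64)        # ask the C heap for 64 bytes
if not buf:
    raise MemoryError("malloc failed")
ctypes.memset(buf, 0, 64)    # use the memory
libc.free(buf)               # forget this line and the memory leaks
```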
Managed memory
■ The underlying VM or runtime takes care of all memory operations
Behold the power of Python
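For contrast, a minimal sketch of the managed side: allocation and release are entirely the runtime’s problem.

```python
# No malloc, no free: CPython allocates storage for the list and its elements,
# and reclaims it once the object becomes unreachable.
squares = [x * x for x in range(1_000_000)]
del squares   # drop the last reference; the interpreter frees the memory for us
```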
DEEP C
Memory allocation in CPython
Deep C
■ Inside the actual VM, someone still needs to implement memory allocation
■ CPython has a few special allocators for both objects and non-objects
■ Which allocator is used also depends on the type of build
From https://docs.python.org/3/c-api/memory.html#default-memory-allocators
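One CPython-specific way to peek at those allocators (sys._debugmallocstats is an internal helper, so its exact output isn’t guaranteed):

```python
import sys

# Dump pymalloc statistics (arenas, pools, blocks per size class) to stderr.
# Running the interpreter with the env var PYTHONMALLOC=malloc bypasses
# pymalloc in favour of the raw C allocator, which changes what you see here.
sys._debugmallocstats()
```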
Memory Allocation Layers
■ +3: object-specific allocators for [ int ] [ dict ] [ list ] [ string ] (object-specific memory; non-object memory is used by the Python core)
■ +2: Python’s object allocator (object memory and internal buffers)
■ +1: Python’s raw memory allocator (PyMem API) managing Python memory
■ 0: general-purpose allocator (e.g. malloc) over the virtual memory allocated for the Python process
■ -1: OS-specific virtual memory manager: kernel dynamic storage allocation & management (page-based)
■ -2: physical memory and swap
Memory Layout
■ Arenas contain pools, and pools contain blocks, where objects actually live
■ (Diagram: Arena 1 with Pool 1–3 and Arena 2 with Pool 1–2; one pool shown holding Block 1–5)
”Simple segregated storage based on array of free lists”
But that’s not all
■ Here you may start to think, “this is completely irrelevant to what I do in my day-to-day job”
■ It is
■ That’s also why I won’t go into the difference between untouched and free memory blocks, for example
GARBAGE
COLLECTION
GIL
■ The Global Interpreter Lock (GIL) is a lock (!) in the interpreter (!!) that allows only a single thread to execute at a time
■ It’s quite a controversial topic
■ Because of the GIL, CPython can’t do real multithreading
■ But why do we even need the GIL if everyone hates it?
■ Also, why are we talking about the GIL today?
Garbage Collection
■ A garbage collector is a mechanism that automatically deletes unused objects
■ The Python language specification doesn’t force anyone to implement a particular type of GC
■ In CPython, the primary garbage collection mechanism is reference counting
Reference counting
■ What is a variable?
■ A variable is just a label and a reference to some object in memory
■ An object in memory can also be referenced without any label: think of elements of lists, tuples, etc.
■ If there are no references to an object – it can be deleted
Reference counting
■ When does the number of references increase?
■ Storing the object in a new variable
■ Adding the object to a collection
■ Passing the object to a function (see the example below)
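A quick way to watch the count grow is sys.getrefcount; note that the call itself adds one temporary reference:

```python
import sys

obj = object()
print(sys.getrefcount(obj))   # typically 2: the name `obj` plus the call's temporary reference

alias = obj                   # stored in a new variable
container = [obj]             # added to a collection
print(sys.getrefcount(obj))   # two higher than before
```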
Reference counting
■ When does the number of references decrease?
■ Reassigning the variable to another value
■ Execution leaves a scope
■ An explicit `del var_name`
■ Removing the object from a collection
■ The refcount of global objects never drops to 0
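And the same counter going back down:

```python
import sys

obj = object()
alias = obj
container = [obj]
print(sys.getrefcount(obj))   # e.g. 4

del alias                     # explicit `del`
container.clear()             # removed from the collection
print(sys.getrefcount(obj))   # back to 2; at zero the object would be freed immediately
```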
Reference counting
■ Reference counting caveats
■ Can’t handle reference cycles (see the example below)
■ Not really thread-safe (that’s why CPython has the GIL)
■ String constants and small integers can be cached
■ Can’t be turned off
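A minimal reference cycle that reference counting alone can never free; it’s the cyclic collector from the next slides that picks it up:

```python
import gc

class Node:
    def __init__(self):
        self.other = None

a, b = Node(), Node()
a.other, b.other = b, a   # the two nodes now reference each other

del a, b                  # refcounts never reach zero: the cycle keeps itself alive
print(gc.collect())       # the generational GC finds and frees the unreachable pair
```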
Generational GC
■ The generational GC is a built-in module (gc) that cleans up everything reference counting can’t handle
■ Based on the principle that most objects die young
■ Objects are tracked in special lists called generations
Generational GC
■ There are only 3 generations
■ New objects are placed in generation 0
■ During a collection, the GC determines whether each object in the generation is reachable from the root set of objects
■ If an object is unreachable, it is deleted
■ If an object survives a collection, it is promoted to the next generation
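The gc module exposes the generations directly:

```python
import gc

print(gc.get_count())   # current collection counters per generation, e.g. (679, 8, 0)
gc.collect(0)           # collect only the youngest generation
gc.collect()            # or everything; survivors get promoted
```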
Generational GC
■ Each generation has a threshold
■ If the number of objects in a generation exceeds its threshold, garbage collection is triggered automatically
■ Unlike reference counting, the generational GC can be configured
■ Users can set the threshold levels
■ Users can trigger a collection manually (see below)
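Reading the thresholds and triggering a collection by hand:

```python
import gc

print(gc.get_threshold())    # (700, 10, 10) on the CPython versions current at the time of this talk
unreachable = gc.collect()   # manual full collection
print(unreachable)           # number of unreachable objects that were found
```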
OTHER PYTHON
IMPLEMENTATIONS
Other Python implementations
■ Jython and IronPython use whatever GC the underlying VM provides
■ PyPy has a pluggable GC architecture and a list of ready-made garbage collectors
■ The default GC is called Minimark, a super smart generational GC
■ There is also an ongoing experiment to implement STM
Software Transactional Memory
■ Instead of locking at a global level, it uses transactions for more granular memory operations
■ This means that with STM we can have real multithreading in Python
■ Huge performance gain if the program uses a lot of simultaneous threads
■ Significant performance loss for single-threaded programs
■ Still not ready for production after many years of development
TIPS & TRICKS
How to improve memory usage in your project?
How to improve memory usage in your project?
■ General rule: create benchmarks before you start optimizing
■ Useful tools for profiling: memory-profiler, pympler, objgraph (sketch below)
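A rough sketch with memory-profiler (the function and the workload are made up for illustration):

```python
# pip install memory-profiler
from memory_profiler import profile

@profile                 # prints a line-by-line memory report when the function runs
def build_payload():
    chunks = [bytes(1024) for _ in range(100_000)]   # roughly 100 MB of small buffers
    return b"".join(chunks)

if __name__ == "__main__":
    build_payload()
```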
array
■ Memory-efficient dynamic arrays (the array module in the standard library)
■ Can only store simple data types
■ Homogeneous
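For example, a million integers stored as a list versus as a typed array (note that sys.getsizeof on the list doesn’t even count the int objects it points to):

```python
import array
import sys

as_list = list(range(1_000_000))
as_array = array.array("i", range(1_000_000))   # "i" = signed 4-byte integers

print(sys.getsizeof(as_list))    # ~8 MB of pointers, plus a separate int object per element
print(sys.getsizeof(as_array))   # ~4 MB total: the values are stored inline
```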
External modules
■ Lots of external Python modules are written in C in a very efficient way
■ The most commonly used is NumPy
■ You can write your own C/C++/Rust extension using FFI
■ Cython is also a good choice for memory- and/or CPU-heavy modules
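A similarly rough comparison with NumPy (sizes are approximate and platform-dependent):

```python
import sys
import numpy as np

py_list = list(range(1_000_000))
np_array = np.arange(1_000_000, dtype=np.int64)

print(sys.getsizeof(py_list))   # pointer array only; each Python int costs extra on top
print(np_array.nbytes)          # 8_000_000 bytes of contiguous C-level storage
```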
Generators
■ Very often you don’t really need to store the whole list/dict/etc.
■ Passing a generator to a function looks cleaner and is usually more memory-efficient
foo([x.bar for x in arr])
vs
foo(x.bar for x in arr)
■ Make generators the first-choice option for every new method that returns a collection
■ Side effect: it will be easier to transform your code to be asynchronous
Generators
■ The original slide compared a function that returns a list with one that returns a generator (see the sketch below)
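Since the screenshot isn’t reproduced, here is a hypothetical pair along the same lines:

```python
def line_lengths_list(path):
    with open(path) as f:
        return [len(line) for line in f]      # whole result built up in memory

def line_lengths_gen(path):
    with open(path) as f:
        for line in f:
            yield len(line)                   # one value at a time, file read lazily

total = sum(line_lengths_gen("access.log"))   # hypothetical file name
```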
Tail call optimization
■ Recursion in Python is not only depth-limited, it also consumes a lot of memory
■ Certain recursive methods can be rewritten to be tail-recursive
■ That means the recursive call is the last thing the function does, so nothing from the current frame is needed afterwards
■ By default, Python doesn’t optimize tail recursion
■ It can be worked around with a simple decorator
Tail call optimization
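The decorator from the original slide isn’t reproduced here; below is a minimal trampoline-style sketch of the same idea, with illustrative function names:

```python
def trampoline(func, *args):
    """Keep calling the thunks a tail-recursive function returns instead of recursing."""
    result = func(*args)
    while callable(result):
        result = result()
    return result

def factorial(n, acc=1):
    if n <= 1:
        return acc
    return lambda: factorial(n - 1, acc * n)   # the tail call, packaged as a thunk

big = trampoline(factorial, 10_000)   # plain recursion would hit RecursionError long before this
print(len(str(big)))                  # tens of thousands of digits, computed at constant stack depth
```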
Migrate to Python 3.7+
■ The latest versions have many memory optimizations
■ Less need to think about __slots__ (compact, key-sharing instance dicts)
■ Better handling of finalizers
■ Python 2 sunsets on 1 January 2020
GC tuning
■ The generational GC can be tuned
■ This should only be done if you really have no other options
■ Threshold values can be optimized for your use cases
■ The generational GC can be turned off completely (see below)
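What those knobs look like in the gc module (the values are illustrative, not recommendations):

```python
import gc

gc.set_threshold(50_000, 20, 20)   # make generation-0 collections much rarer
gc.disable()                       # switch the cyclic GC off entirely; refcounting still runs
gc.freeze()                        # Python 3.7+: move existing objects out of GC tracking,
                                   # handy before fork() (see the Instagram articles below)
```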
What to read next?
■ CPython source code. Its documentation is really great:
– cpython/Objects/obmalloc.c
– cpython/Modules/gcmodule.c
■ PyPy documentation on GC and STM:
– https://doc.pypy.org/en/latest/gc_info.html
– https://doc.pypy.org/en/latest/stm.html
■ Various talks over the last few years at PyCon
■ Instagram Engineering blog, especially these two articles:
– https://instagram-engineering.com/dismissing-python-garbage-collection-at-instagram-4dca40b29172
– https://instagram-engineering.com/copy-on-write-friendly-python-garbage-collection-ad6ed5233ddf
Wrap up
■ If you don’t know anything about Python’s internal memory management – that’s fine
■ Memory usage can be reduced, but the optimization techniques are limited
■ If you need fine control over memory – don’t use Python
ANY
QUESTIONS?
THANK YOU FOR
LISTENING!

Editor's Notes

  • #2 A quick guide for Python developers who don’t want to know anything about memory management
  • #3 By the end of this talk I hope that you will be glad that you don’t need to think about memory management in Python
  • #6 Basically, the term "unmanaged memory" was only made up to reflect the existence of "managed memory". Before the ’90s it was just called "memory". In programming languages like C and C++, the developer needs to manually allocate and deallocate memory on the heap for most non-trivial use cases. Here you can see a dramatization of this approach.
  • #7 By comparison, in languages like Python you don’t need to think about memory allocation at all. The underlying virtual machine handles all this boring stuff for you. As you can see in this example, Python is clearly superior to C; we don’t need any more evidence.
  • #8 However, someone still needs to write the memory allocation for Python. It’s well hidden from the common developer, and for good reason. It’s time to dive into the Deep C.
  • #9 CPython memory allocation is a rather complicated process. It involves multiple different allocators and a number of heuristics to optimize memory usage. The allocator may vary depending on the size of the requested memory, the type of object, or even the type of build.
  • #10 Let’s talk about the general process of memory allocation in CPython. First of all, the interpreter gets a huge chunk of memory from the host operating system. Some of it will be used for internal needs, some for the program’s needs.
  • #11 Now, to allocate something for an object, the interpreter needs to think hard. It can’t just grab the leftmost empty chunk of memory. User-space memory is organized as follows: the biggest areas are called arenas. They are aligned with virtual memory pages, but are usually larger than a single page. Inside each arena there are several pools, each the size of a virtual memory page (4 KB). Memory for objects lives in these pools, in subdivisions called blocks. The size of those blocks depends on the so-called "size class" of the object: usually its size in bytes rounded up to the nearest multiple of 8. As the documentation puts it, this strategy is a variant of "simple segregated storage based on array of free lists". Emphasis on simple. To me, it looks more like a Russian doll.
  • #13 By this point you should be thoroughly disappointed in this talk. If you think that all this memory-layout stuff has nothing to do with actual Python development work, you’re right. I don’t even want to go deeper into this topic, because then I’d have to explain the difference between untouched and free memory blocks, or how the allocator chooses which pool to use. If you’re not a CPython core developer, you don’t need to know that. I just wanted to show you that the deeper you go into memory allocation, the more you love the fact that you don’t need to write it yourself.
  • #14 Let’s talk about something that really matters in the context of memory management and is actually useful: garbage collection.
  • #15 And we’ll start with the Global Interpreter Lock. If you don’t know what it is, first of all – shame on you. But I’ll give a brief definition. The Global Interpreter Lock is an interpreter-level lock that allows only a single thread to execute at a time. Whenever people talk about the GIL, there are usually pretty heated debates. Basically, because of the GIL CPython doesn’t have real multithreading, and developers in other languages laugh about it. So, why do we even need the GIL in the first place, and why are we talking about it today? The main reason for the GIL’s existence is hidden in CPython’s garbage collector implementation.
  • #16 Let’s start with the basics. What’s a garbage collector? To cut a long story short, it’s a mechanism that automatically deletes unused objects. If you go to the Python documentation, you won’t find any specifics on which garbage collection algorithm to use. Basically, memory should be freed at some point in time. For example, it could be freed by the host operating system when the interpreter process terminates. Nobody would actually do it that way, but the possibility exists.
  • #17 Before we start talking about reference counting, let’s talk about variables. In short, a variable in Python is just a label and a reference to a certain object. Each object stores the number of such references to it. However, some references don’t require any label variables at all. For example, you can create a list without ever creating a variable. It’s quite self-evident that if there are no references to an object, it can be safely deleted.
  • #20 There are a few caveats with reference counting. First of all, it can’t handle reference cycles. If you look at this simple example, you’ll notice that even after deleting the variable l from the namespace, each node in the linked list still has at least one reference to it. Second, to be safe and fast, reference counting has to be single-threaded. That means we need some kind of global lock that allows only one thread per process to execute. Hmmm… Also, reference counting can be tricky when the interpreter has special mechanisms like caching of small strings and integers. And you can’t turn it off at all.
  • #21 To solve the problem of cyclic references, modern CPython has a special mechanism called the generational garbage collector. It lives in a built-in module called gc. It’s based on the principle that most objects die young, or as they say, "objects have high infant mortality". Most objects are tracked by the GC using special lists called generations. By the way, to continue the previous analogy, the first generation is usually called the nursery.
  • #24 Now let’s briefly talk about other Python implementations.
  • #25 Jython and IronPython use whatever GC the underlying VM provides. I don’t really remember any details, but one thing is important for sure: there is no Global Interpreter Lock in those implementations.