4. Arenas

An arena is an object that encapsulates the state of the Memory Pool System, and tells it where to get the memory it manages. You typically start a session with the MPS by creating an arena with mps_arena_create_k() and end the session by destroying it with mps_arena_destroy(). The only function you might need to call before making an arena is mps_telemetry_control().

Before destroying an arena, you must first destroy all objects and data in it, as usual for abstract data types in the MPS. If you can’t destroy the arena properly (for example, because your program has crashed and you are at the debugger prompt), you can still call mps_telemetry_flush() explicitly.

Other types of objects in the MPS are created “in the arena”. They are part of the world within the arena, and may interact and affect each other.

Note

The MPS allows creation of multiple arenas, but you would only do this in unusual circumstances. It might be useful to have two active arenas and to try different things out in them, or you might be in the process of integrating two pieces of software that each independently uses the MPS.

Arenas do not normally interact, but they compete with each other for resources, and references from one arena to another are not traced, though you can declare roots pointing from one arena to another. It is not efficient to have multiple arenas containing automatically managed pools: if you find yourself in this situation it’s best to find a way to move all the automatically managed pools to one arena.

The open source MPS comes with two classes of arena, Client arenas and Virtual memory arenas. These differ in the way that they acquire the memory to be managed.

Note

The MPS is designed to be extensible with new arena classes. If you need features that are not provided by any of the open source arena classes, contact us.

mps_arena_t

The type of arenas.

An arena is responsible for requesting memory (3) from the operating system, making it available to pools, and for garbage collection.

mps_arena_class_t

The type of arena classes.

mps_res_t mps_arena_create_k(mps_arena_t *arena_o, mps_arena_class_t arena_class, mps_arg_s args[])

Create an arena.

arena_o points to a location that will hold a pointer to the new arena.

arena_class is the arena class.

args are keyword arguments specific to the arena class. See the documentation for the arena class.

Returns MPS_RES_OK if the arena is created successfully, or another result code otherwise.

The arena persists until it is destroyed by calling mps_arena_destroy().

mps_res_t mps_arena_create(mps_arena_t *arena_o, mps_arena_class_t arena_class, ...)

Deprecated

starting with version 1.112.

Use mps_arena_create_k() instead: the keyword arguments interface is more reliable and produces better error messages.

An alternative to mps_arena_create_k() that takes its extra arguments using the standard C variable argument list mechanism.

mps_res_t mps_arena_create_v(mps_arena_t *arena_o, mps_arena_class_t arena_class, va_list args)

Deprecated

starting with version 1.112.

Use mps_arena_create_k() instead: the keyword arguments interface is more reliable and produces better error messages.

An alternative to mps_arena_create_k() that takes its extra arguments using the standard C va_list mechanism.

void mps_arena_destroy(mps_arena_t arena)

Destroy an arena.

arena is the arena to destroy.

This function checks the consistency of the arena, flushes the telemetry stream and destroys the arena’s internal control structures. Additionally, virtual memory arenas return their reserved address space to the operating system if possible.

It is an error to destroy an arena without first destroying all generation chains, object formats, pools and roots created in the arena, and deregistering all threads registered with the arena.

4.1. Client arenas

#include "mpsacl.h"
mps_arena_class_t mps_arena_class_cl(void)

Return the arena class for a client arena.

A client arena gets its managed memory from the client program. This memory chunk is passed when the arena is created.

When creating a client arena, mps_arena_create_k() requires two keyword arguments:

  • MPS_KEY_ARENA_CL_BASE (type mps_addr_t) is the address of the chunk of memory that will be managed by the arena.

  • MPS_KEY_ARENA_SIZE (type size_t) is its size.

For example:

MPS_ARGS_BEGIN(args) {
    MPS_ARGS_ADD(args, MPS_KEY_ARENA_CL_BASE, base);
    MPS_ARGS_ADD(args, MPS_KEY_ARENA_SIZE, size);
    res = mps_arena_create_k(&arena, mps_arena_class_cl(), args);
} MPS_ARGS_END(args);

If the chunk is too small to hold the internal arena structures, mps_arena_create_k() returns MPS_RES_MEMORY. In this case, you need to use a (much) larger chunk.

Note

You don’t have to provide all the memory up front: you can call mps_arena_extend() later on.

Client arenas have no mechanism for returning unused memory.

Deprecated

starting with version 1.112.

When using mps_arena_create(), pass the size and base address like this:

mps_res_t mps_arena_create(mps_arena_t *arena_o,
                           mps_arena_class_t mps_arena_class_cl,
                           size_t size, mps_addr_t base)
mps_res_t mps_arena_extend(mps_arena_t arena, mps_addr_t base, size_t size)

Extend a client arena with another block of memory.

base is the address of the block of memory that will be managed by the arena.

size is its size.

Return MPS_RES_OK if successful, or another result code if it fails.

4.2. Virtual memory arenas

#include "mpsavm.h"
mps_arena_class_t mps_arena_class_vm(void)

Return the arena class for a virtual memory arena.

A virtual memory arena uses the operating system’s virtual memory interface to allocate memory. The chief consequence of this is that the arena can manage many more virtual addresses than it needs to commit memory to. This gives it flexibility as to where to place blocks, which reduces fragmentation and helps make garbage collection more efficient.

When creating a virtual memory arena, mps_arena_create_k() requires one keyword argument:

  • MPS_KEY_ARENA_SIZE (type size_t). is the initial amount of virtual address space, in bytes (1), that the arena will reserve (this space is initially reserved so that the arena can subsequently use it without interference from other parts of the program, but most of it is not committed, so it doesn’t require any RAM or backing store). The arena may allocate more virtual address space beyond this initial reservation as and when it deems it necessary. The MPS is most efficient if you reserve an address space that is several times larger than your peak memory usage.

    Note

    The MPS asks for more address space if it runs out, but the more times it has to extend its address space, the less efficient garbage collection will become.

An optional keyword argument may be passed, but is only used on the Windows operating system:

  • MPS_KEY_VMW3_TOP_DOWN (type mps_bool_t). If true, the arena will allocate address space starting at the highest possible address and working downwards through memory.

    Note

    This causes the arena to pass the MEM_TOP_DOWN flag to VirtualAlloc.

If the MPS fails to reserve adequate address space to place the arena in, mps_arena_create_k() returns MPS_RES_RESOURCE. Possibly this means that other parts of the program are reserving too much virtual memory.

If the MPS fails to allocate memory for the internal arena structures, mps_arena_create_k() returns MPS_RES_MEMORY. Either size was far too small or the operating system refused to provide enough memory.

For example:

MPS_ARGS_BEGIN(args) {
    MPS_ARGS_ADD(args, MPS_KEY_ARENA_SIZE, size);
    res = mps_arena_create_k(&arena, mps_arena_class_cl(), args);
} MPS_ARGS_END(args);

Deprecated

starting with version 1.112.

When using mps_arena_create(), pass the size like this:

mps_res_t mps_arena_create(mps_arena_t *arena_o,
                           mps_arena_class_t arena_class_vm(),
                           size_t size)

4.3. Arena properties

mps_word_t mps_collections(mps_arena_t arena)

Return the number of flips that have taken place in an arena since it was created.

arena is the arena.

size_t mps_arena_commit_limit(mps_arena_t arena)

Return the current commit limit for an arena.

arena is the arena to return the commit limit for.

Returns the commit limit in bytes (1). The commit limit controls how much memory the MPS can obtain from the operating system, and can be changed by calling mps_arena_commit_limit_set().

mps_res_t mps_arena_commit_limit_set(mps_arena_t arena, size_t limit)

Change the commit limit for an arena.

arena is the arena to change the commit limit for.

limit is the new commit limit in bytes (1).

Returns MPS_RES_OK if successful, or another result code if not.

If successful, the commit limit for arena is set to limit. The commit limit controls how much memory the MPS will obtain from the operating system. The commit limit cannot be set to a value that is lower than the number of bytes that the MPS is using. If an attempt is made to set the commit limit to a value greater than or equal to that returned by mps_arena_committed() then it will succeed. If an attempt is made to set the commit limit to a value less than that returned by mps_arena_committed() then it will succeed only if the amount committed by the MPS can be reduced by reducing the amount of spare committed memory; in such a case the spare committed memory will be reduced appropriately and the attempt will succeed.

Note

mps_arena_commit_limit_set() puts a limit on all memory committed by the MPS. The spare committed memory can be limited separately with mps_arena_spare_commit_limit_set(). Note that “spare committed” memory is subject to both limits; there cannot be more spare committed memory than the spare commit limit, and there can’t be so much spare committed memory that there is more committed memory than the commit limit.

size_t mps_arena_committed(mps_arena_t arena)

Return the total committed memory for an arena.

arena is the arena.

Returns the total amount of memory that has been committed to RAM by the MPS, in bytes (1).

The committed memory is generally larger than the sum of the sizes of the allocated blocks. The reasons for this are:

  • some memory is used internally by the MPS to manage its own data structures and to record information about allocated blocks (such as free lists, page tables, colour tables, statistics, and so on);

  • operating systems (and hardware) typically restrict programs to requesting and releasing memory with a certain granularity (for example, pages), so extra memory is committed when this rounding is necessary;

  • there might also be spare committed memory: see mps_arena_spare_committed().

The amount of committed memory is a good measure of how much virtual memory resource (“swap space”) the MPS is using from the operating system.

The function mps_arena_committed() may be called whatever state the the arena is in (unclamped, clamped, or parked). If it is called when the arena is in the unclamped state then the value may change after this function returns. A possible use might be to call it just after mps_arena_collect() to (over-)estimate the size of the heap.

If you want to know how much memory the MPS is using then you’re probably interested in the value mps_arena_committed() - mps_arena_spare_committed().

The amount of committed memory can be limited with the function mps_arena_commit_limit().

size_t mps_arena_reserved(mps_arena_t arena)

Return the total address space reserved by an arena, in bytes (1).

arena is the arena.

For a virtual memory arena, this is the total address space reserved via the operating system’s virtual memory interface.

For a client arena, this is the sum of the usable portions of the chunks of memory passed to the arena by the client program via mps_arena_create_k() and mps_arena_extend().

Note

For a client arena, the reserved address may be lower than the sum of the MPS_KEY_ARENA_SIZE keyword argument passed to mps_arena_create_k() and the size arguments passed to mps_arena_extend(), because the arena may be unable to use the whole of each chunk for reasons of alignment.

size_t mps_arena_spare_commit_limit(mps_arena_t arena)

Return the current spare commit limit for an arena.

arena is the arena to return the spare commit limit for.

Returns the spare commit limit in bytes (1). The spare commit limit can be changed by calling mps_arena_spare_commit_limit_set().

void mps_arena_spare_commit_limit_set(mps_arena_t arena, size_t limit)

Change the spare commit limit for an arena.

arena is the arena to change the spare commit limit for.

limit is the new spare commit limit in bytes (1).

The spare commit limit is the maximum amount of spare committed memory the MPS is allowed to have. Setting it to a value lower than the current amount of spare committed memory causes spare committed memory to be uncommitted so as to bring the value under the limit. In particular, setting it to 0 will mean that the MPS will have no spare committed memory.

Non-virtual-memory arena classes (for example, a client arena) do not have spare committed memory. For these arenas, this function functions sets a value but has no other effect.

Initially the spare commit limit is a configuration-dependent value. The value of the limit can be retrieved by the function mps_arena_spare_commit_limit().

size_t mps_arena_spare_committed(mps_arena_t arena)

Return the total spare committed memory for an arena.

arena is the arena.

Returns the number of bytes of spare committed memory.

Spare committed memory is memory which the arena is managing as free memory (not in use by any pool and not otherwise in use for internal reasons) but which remains committed (mapped to RAM by the operating system). It is used by the arena to (attempt to) avoid calling the operating system to repeatedly map and unmap areas of virtual memory as the amount of memory in use goes up and down. Spare committed memory is counted as committed memory by mps_arena_committed() and is restricted by mps_arena_commit_limit().

The amount of “spare committed” memory can be limited by calling mps_arena_spare_commit_limit_set(), and the value of that limit can be retrieved with mps_arena_spare_commit_limit(). This is analogous to the functions for limiting the amount of committed memory.

4.4. Arena states

An arena is always in one of three states.

  1. In the unclamped state, garbage collection may take place, objects may move in memory, references may be updated, location dependencies may become stale, virtual memory may be requested from or return to the operating system, and other kinds of background activity may occur. This is the normal state.

  2. In the clamped state, objects do not move in memory, references do not change, the staleness of location dependencies does not change, and memory occupied by unreachable objects is not recycled.

    However, a garbage collection may be in progress and incremental collection may still occur, but it will not be visible to the client program and no new collections will begin.

  3. The parked state is the same as the clamped state, with the additional constraint that no garbage collections are in progress.

Here’s a summary:

State

unclamped

clamped

parked

Collections may be running?

yes

yes

no

New collections may start?

yes

no

no

Objects may move?

yes

no

no

Location dependencies may become stale?

yes

no

no

Memory may be returned to the OS?

yes

no

no

Functions that leave arena in this state

mps_arena_create(), mps_arena_release(), mps_arena_start_collect(), mps_arena_step()

mps_arena_clamp(), mps_arena_step()

mps_arena_park(), mps_arena_collect()

The clamped and parked states are important when introspecting and debugging. If you are examining the contents of the heap, you don’t want data moving under your feet. So for example, if your program is stopped in GDB you might type:

(gdb) print mps_arena_clamp(arena)

before inspecting memory, and:

(gdb) print mps_arena_release(arena)

afterward.

The results of introspection functions like mps_arena_has_addr() only remain valid while the arena remains in the parked state, and functions like mps_arena_roots_walk() can only be called in this state.

void mps_arena_clamp(mps_arena_t arena)

Put an arena into the clamped state.

arena is the arena to clamp.

In the clamped state, no object motion will occur and the staleness of location dependencies will not change. All references to objects loaded while the arena is clamped will keep the same binary representation until after it is released by calling mps_arena_release().

In a clamped arena, incremental collection may still occur, but it will not be visible to the mutator and no new collections will begin. Space used by unreachable objects will not be recycled until the arena is unclamped.

void mps_arena_park(mps_arena_t arena)

Put an arena into the parked state.

arena is the arena to park.

While an arena is parked, no object motion will occur and the staleness of location dependencies will not change. All references to objects loaded while the arena is parked will keep the same binary representation until after it is released.

Any current collection is run to completion before the arena is parked, and no new collections will start. When an arena is in the parked state, it is necessarily not in the middle of a collection.

void mps_arena_release(mps_arena_t arena)

Puts an arena into the unclamped state.

arena is the arena to unclamp.

While an arena is unclamped, garbage collection, object motion, and other background activity can take place.

4.5. Running garbage collections

The Memory Pool System’s garbage collector runs asynchronously and incrementally. This means that it is not normally necessary to tell it when to start garbage collections, or to wait until it has finished collecting. (But if your program has idle time that could be productively spent by the MPS, see Using idle time for collection below.)

However, during development and testing it is useful to be able to request that MPS run a full collection cycle. For example, you might run frequent collections in an attempt to detect bugs in your allocation and scanning code.

void mps_arena_collect(mps_arena_t arena)

Collect an arena and put it into the parked state.

arena is the arena to collect.

The collector attempts to recycle as many unreachable objects as possible and reduce the size of the arena as much as possible (though in some cases it may increase because it becomes more fragmented). Note that the collector may not be able to recycle some objects (such as those near the destination of ambiguous references) even though they are not reachable.

If you do not want the arena to remain in the parked state, you must explicitly call mps_arena_release() afterwards.

Note

It is not normally necessary to call this function: in the unclamped state, collections start automatically. However, it may be useful during development and debugging: the more frequently the collector runs, the sooner and more reliably errors are discovered. See General debugging advice.

mps_res_t mps_arena_start_collect(mps_arena_t arena)

Request an arena to start a full collection cycle.

arena is the arena.

Returns MPS_RES_OK if a collection is started, or another result code if not.

This function puts arena into the unclamped state and requests that it start a full collection cycle. The call to mps_arena_start_collect() returns quickly, leaving the collection to proceed incrementally (as for a collection that is scheduled automatically).

Note

Contrast with mps_arena_collect(), which does not return until the collection has completed.

4.6. Using idle time for collection

Some types of program have “idle time” in which they are waiting for an external event such as user input or network activity. The MPS provides a function, mps_arena_step(), for making use of idle time to make memory management progress.

Here’s an example illustrating the use of this function in a program’s event loop. When the program is idle (there are no client actions to perform), it requests that the MPS spend up to 10 milliseconds on incremental work, by calling mps_arena_step(arena, 0.010, 0.0). When this returns false to indicate that there is no more work to do, the program blocks on the client for two seconds: if this times out, it predicts that the user will remain idle for at least a further second, so it calls mps_arena_step(arena, 0.010, 100.0) to tell that it’s a good time to start a collection taking up to 10 ms × 100 = 1 second, but not to pause for more than 10 ms.

The program remains responsive: the MPS doesn’t take control for more than a few milliseconds at a time (at most 10). But at the same time, major collection work can get done at times when the program would otherwise be idle. Of course the numbers here are only for illustration and should be chosen based on the requirements of the application.

for (;;) { /* event loop */
    for (;;) {
        if (client_is_waiting()) {
            perform_client_action();
        } else if (!mps_arena_step(arena, 0.010, 0.0)) {
            /* no incremental MPS work remaining */
            break;
        }
    }

    if (!block_on_client_with_timeout(2.0)) {
        /* Perhaps the user has gone for a cup of coffee? Allow the
         * MPS to start a big piece of work, but don't actually pause
         * for more than 10 ms. */
        mps_arena_step(arena, 0.010, 100.0);
    }
}
mps_bool_t mps_arena_step(mps_arena_t arena, double interval, double multiplier)

Request an arena to do some work during a period where the client program is idle.

arena is the arena.

interval is the time, in seconds, the MPS is permitted to take. It must not be negative, but may be 0.0.

multiplier is the number of further similar calls that the client program expects to make during this idle period.

Returns true if there was work for the MPS to do in arena (regardless of whether or not it did any) or false if there was nothing to do.

mps_arena_step() allows the client program to make use of idle time to do some garbage collection, for example when it is waiting for interactive input. The MPS makes every effort to return from this function within interval seconds, but cannot guarantee to do so, as it may need to call your own scanning code. It uses multiplier to decide whether to commence long-duration operations that consume CPU (such as a full collection): it will only start such an operation if it is expected to be completed within multiplier * interval seconds.

If the arena was in the parked state or the clamped state before mps_arena_step() was called, it is in the clamped state afterwards. It it was in the unclamped state, it remains there.

4.7. Arena introspection

Note

Introspection functions covered in other chapters are:

mps_bool_t mps_arena_has_addr(mps_arena_t arena, mps_addr_t addr)

Test whether an address is managed by an arena.

arena is an arena.

addr is an address.

Returns true if addr is managed by arena; false otherwise.

An arena manages a portion of address space. No two arenas overlap, so for any particular address this function will return true for at most one arena.

In general, not all addresses are managed by any arena. This is what allows the MPS to cooperate with other memory managers, shared object loaders, memory mapped file input/ouput, and so on: it does not steal the whole address space.

Note

The result from this function is valid only at the instant at which the function returned. In some circumstances the result may immediately become invalidated (for example, a garbage collection may occur, the address in question may become free, the arena may choose to unmap the address and return storage to the operating system). For reliable results call this function and interpret the result while the arena is in the parked state.

4.8. Protection interface

void mps_arena_expose(mps_arena_t arena)

Deprecated

starting with version 1.111.

Ensure that the MPS is not protecting any page in the arena with a read barrier or write barrier.

mps_arena is the arena to expose.

This is expected to only be useful for debugging. The arena is left in the clamped state.

Since barriers are used during a collection, calling this function has the same effect as calling mps_arena_park(): all collections are run to completion, and the arena is clamped so that no new collections begin. The MPS also uses barriers to maintain remembered sets, so calling this function will effectively destroy the remembered sets and any optimization gains from them.

Calling this function is time-consuming: any active collections will be run to completion; and the next collection will have to recompute all the remembered sets by scanning the entire arena.

The recomputation of the remembered sets can be avoided by calling mps_arena_unsafe_expose_remember_protection() instead of mps_arena_expose(), and by calling mps_arena_unsafe_restore_protection() before calling mps_arena_release(). Those functions have unsafe aspects and place restrictions on what the client program can do (basically no exposed data can be changed).

void mps_arena_unsafe_expose_remember_protection(mps_arena_t arena)

Deprecated

starting with version 1.111.

Ensure that the MPS is not protecting any page in the arena with a read barrier or write barrier. In addition, request the MPS to remember some parts of its internal state so that they can be restored later.

mps_arena is the arena to expose.

This function is the same as mps_arena_expose(), but additionally causes the MPS to remember its protection state. The remembered protection state can optionally be restored later by calling the mps_arena_unsafe_restore_protection() function. This is an optimization that avoids the MPS having to recompute all the remembered sets by scanning the entire arena.

However, restoring the remembered protections is only safe if the contents of the exposed pages have not been changed; therefore this function should only be used if you do not intend to change the pages, and the remembered protection must only be restored if the pages have not been changed.

The MPS will only remember the protection state if resources (memory) are available. If memory is low then only some or possibly none of the protection state will be remembered, with a corresponding necessity to recompute it later. The MPS provides no mechanism for the client program to determine whether the MPS has in fact remembered the protection state.

The remembered protection state, if any, is discarded after calling mps_arena_unsafe_restore_protection(), or as soon as the arena leaves the clamped state by calling mps_arena_release().

void mps_arena_unsafe_restore_protection(mps_arena_t arena)

Deprecated

starting with version 1.111.

Restore the remembered protection state for an arena.

mps_arena is the arena to restore the protection state for.

This function restores the protection state that the MPS has remembered when the client program called mps_arena_unsafe_expose_remember_protection(). The purpose of remembering and restoring the protection state is to avoid the need for the MPS to recompute all the remembered sets by scanning the entire arena, that occurs when mps_arena_expose() is used, and which causes the next garbage collection to be slow.

The client program must not change the exposed data between the call to mps_arena_unsafe_expose_remember_protection() and mps_arena_unsafe_restore_protection(). If the client program has changed the exposed data then mps_arena_unsafe_restore_protection() must not be called: in this case simply call mps_arena_release().

Calling this function does not release the arena from the clamped state: mps_arena_release() must be called to continue normal collections.

Calling this function causes the MPS to forget the remember protection state; as a consequence the same remembered state cannot be restored more than once.