#allocator #no-std #binned-allocator

no-std rsbmalloc

A simple but fast binned allocator for Rust. Single-threaded for no_std, multi-threaded otherwise

6 releases

new 0.4.4 Apr 23, 2024
0.4.3 Aug 29, 2023
0.4.0 Jun 7, 2023
0.3.1 Apr 3, 2023

#167 in Memory management

Download history 6/week @ 2024-02-25 21/week @ 2024-03-03 13/week @ 2024-03-10 1/week @ 2024-03-17 4/week @ 2024-03-31 129/week @ 2024-04-21

133 downloads per month

MIT/Apache

29KB
698 lines

rsbmalloc

A binned allocator for Rust. It’s quite simple, but reasonably fast single and multi-threaded. Single-threaded, it generally similar to the built-in allocator, sometimes faster, but sometimes with higher memory usage. Multi-threaded, it ranges from similar speed to quite a bit slower. It’s pure Rust, so it should work smoothly on any platform that provides standard mmap and munmap functions, and also Windows (though Windows support isn’t tested).

Relies exclusively on thread-local caches for multi-threaded support. 4 times the number of cpus are created on the first allocation and no more are created after that, so each ‘thread-local’ cache is fully thread-safe in case it is reused between threads.

rsbmalloc is entirely a binned allocator, with bins ranging from 4 bytes to 16 KiB (some ARM pages sizes are 16 KiB). If an allocation is larger than 16 KiB, it gets counted as a large allocation and goes straight to mmap and munmap. So, when freed in Rust, it gets munmap-ed. Bins, however, are allocated a page at a time as necessary and are never released back to the OS. Freed slots just act as a linked list that can be reused by the same thread (or another thread that scores the same thread cache).

It implements the GlobalAllocator trait, and comes with a single-threaded no_std version. The no_std version still requires a libc with mmap and munmap or Windows, but it doesn’t depend on the Rust standard library. Note that the no_std version is still thead-safe, it just doesn’t use the thread-local caches, so it’s a lot slower because it relies of spinlocks when operating multi-threaded. On the other hand, it uses less memory and would be a similar speed if there’s no lock contention. Once the allocator-api is stable, it should be a fairly easy port to that.

rsbmallocc provides a slightly slower C interface to rsbmalloc, including standard names (malloc, free) and prefixed names (rsbmalloc, rsbfree).

rsbmalloc also exposes the page-only allocator it uses under the hood.

A Broch Web Solutions project.

Check out the blog post for more info.

Dependencies

~230KB