GC proposal #50

craff · 2024-11-29T18:12:09Z

No description provided.

lpw25 · 2024-12-01T09:49:19Z

@damiendoligez Implemented various approaches to aging in the past, which is what your first suggestion is. They always turned out to perform worse than the current approach because the extra book-keeping dominated the improvement in promotion rate. So eventually we gave up on them.

stedolan · 2024-12-02T10:01:30Z

This proposal was harder to read than it could have been because some terminology is used in a non-standard way. What's called the "grayval list" here is usually known as the remembered set. The grey values or mark stack is a different data structure, used in major GC marking and not relevant for the minor GC. There's an excellent glossary at memorymanagement.org which defines all the relevant terms.

For the actual second suggestion, I'm afraid that incrementing and decrementing a ref counter in the minor heap is likely to be more expensive than maintaining a remembered set in the way it is currently done, at least with multiple domains. The remembered set is per-domain so can be accessed directly, while the counters are shared so must use slower atomic accesses and will cause cache contention between domains.

craff · 2024-12-03T17:26:48Z

This proposal was harder to read than it could have been because some terminology is used in a non-standard way. What's called the "grayval list" here is usually known as the remembered set. The grey values or mark stack is a different data structure, used in major GC marking and not relevant for the minor GC. There's an excellent glossary at memorymanagement.org which defines all the relevant terms.

For the actual second suggestion, I'm afraid that incrementing and decrementing a ref counter in the minor heap is likely to be more expensive than maintaining a remembered set in the way it is currently done, at least with multiple domains. The remembered set is per-domain so can be accessed directly, while the counters are shared so must use slower atomic accesses and will cause cache contention between domains.

Thanks for the remark, I updated the term.

In my mind the ref counter is next to the block in the minor heap itself. And incrementing/decrementing
does not require a mutex, if it is done atomically. Last time I looked, reference from major head to minor heap where kept in a doubly linked list ? Was this changed ?

craff · 2024-12-03T17:29:53Z

@damiendoligez Implemented various approaches to aging in the past, which is what your first suggestion is. They always turned out to perform worse than the current approach because the extra book-keeping dominated the improvement in promotion rate. So eventually we gave up on them.

It is not only a problem a speed when we compare equal minor heap size in the two approach. I would like to have a 16Mb (even more) minor heap in simple_httpd with 64Kb-256Kb slices to keep good latencies. When I see the memory consumption of simple_httpd compared to apache, I can afford very large minor heaps except in the current version latency is too much impacted.

I think this approach could work.

stedolan · 2024-12-16T11:43:28Z

In my mind the ref counter is next to the block in the minor heap itself. And incrementing/decrementing
does not require a mutex, if it is done atomically. Last time I looked, reference from major head to minor heap where kept in a doubly linked list ? Was this changed ?

No, I don't think it was ever a doubly linked list? It's a flat per-domain array, which is cheaper than atomic operations that may contend.

It is not only a problem a speed when we compare equal minor heap size in the two approach. I would like to have a 16Mb (even more) minor heap in simple_httpd with 64Kb-256Kb slices to keep good latencies. When I see the memory consumption of simple_httpd compared to apache, I can afford very large minor heaps except in the current version latency is too much impacted.

I think this approach could work.

You'll need to explain why you think this approach could work. The description so far sounds similar to things that have been tried before and did not improve performance.

In particular, your idea for representing the remembered set sounds like it involves scanning the whole minor heap at collection time to find the objects that must be preserved. That alone seems more expensive than the current GC, which touches only the live data.

GC proposal

adb39a7

Update gc-proposal.md

21099f9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GC proposal #50

GC proposal #50

craff commented Nov 29, 2024

lpw25 commented Dec 1, 2024

stedolan commented Dec 2, 2024

craff commented Dec 3, 2024

craff commented Dec 3, 2024

stedolan commented Dec 16, 2024

GC proposal #50

Are you sure you want to change the base?

GC proposal #50

Conversation

craff commented Nov 29, 2024

lpw25 commented Dec 1, 2024

stedolan commented Dec 2, 2024

craff commented Dec 3, 2024

craff commented Dec 3, 2024

stedolan commented Dec 16, 2024