Separate the mechanisms and APIs for dependent memory and custom blocks #3597

stedolan · 2025-02-19T14:59:34Z

Previously, allocating a block with caml_alloc_custom or caml_alloc_custom_mem would both register a custom block finaliser and accelerate the GC.

Now, caml_alloc_custom(_mem) has no effect on the GC, and the functions caml_adjust_gc_speed and caml_adjust_minor_gc_speed become no-ops. The GC speed increase for dependent memory happens only through caml_alloc_dependent_memory and caml_free_dependent_memory.

The function caml_alloc_custom_dep is available as a shorthand that calls both caml_alloc_custom and caml_alloc_dependent_memory. However, the user must make sure to call caml_free_dependent_memory in their finaliser. (Bigarrays were already updated to use this API in a previous patch.)
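To make the pairing rule concrete, here is a minimal, self-contained model of the accounting in plain C. It does not use the runtime headers. Every `model_*` name, the `model_block` struct, and `buffer_finalize` are hypothetical stand-ins invented for illustration, and the real runtime signatures may differ. It only shows that a plain custom block leaves the dependent-memory counter untouched, while the combined allocator bumps it and relies on the finaliser to undo that.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Model of the runtime's dependent-memory counter (hypothetical name). */
static size_t model_dependent_bytes = 0;

/* Stand-in for caml_alloc_dependent_memory: the only path that
   increases the amount of external memory the GC is told about. */
static void model_alloc_dependent_memory(size_t nbytes) {
  model_dependent_bytes += nbytes;
}

/* Stand-in for caml_free_dependent_memory: must mirror an earlier alloc. */
static void model_free_dependent_memory(size_t nbytes) {
  assert(nbytes <= model_dependent_bytes);
  model_dependent_bytes -= nbytes;
}

typedef struct model_block model_block;
struct model_block {
  void (*finalize)(model_block *); /* user finaliser, may be NULL */
  void *data;                      /* externally allocated payload */
  size_t data_bytes;               /* size reported as dependent memory */
};

/* Stand-in for caml_alloc_custom: registers the finaliser and does
   no dependent-memory accounting at all. */
static model_block *model_alloc_custom(void (*finalize)(model_block *)) {
  model_block *b = malloc(sizeof *b);
  if (b == NULL) return NULL;
  b->finalize = finalize;
  b->data = NULL;
  b->data_bytes = 0;
  return b;
}

/* Stand-in for caml_alloc_custom_dep: custom block plus accounting.
   The model stores nbytes in the block so the finaliser can find it. */
static model_block *model_alloc_custom_dep(void (*finalize)(model_block *),
                                           size_t nbytes) {
  model_block *b = model_alloc_custom(finalize);
  if (b == NULL) return NULL;
  b->data_bytes = nbytes;
  model_alloc_dependent_memory(nbytes);
  return b;
}

/* User-written finaliser: releases the payload and reports the same
   number of bytes as freed, keeping the counter balanced. */
static void buffer_finalize(model_block *b) {
  free(b->data);
  model_free_dependent_memory(b->data_bytes);
}

/* What the model collector does when a block becomes unreachable. */
static void model_collect(model_block *b) {
  if (b->finalize != NULL) b->finalize(b);
  free(b);
}
```

In this model, a finaliser that forgets the `model_free_dependent_memory` call leaves the counter permanently inflated, which is the failure mode the description warns about.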

This is to pave the way for a new GC pacing policy. However, this patch makes no change to pacing: the GC should perform as before on, e.g., bigarray values, which have been ported to use the new API.

(Most of the code in this patch was written by @damiendoligez.)


NickBarnes

LGTM, modulo the pacing code which is about to change. One comment is just plain wrong, and the FIXME asks the important question (which I'm sure will be answered by the next PR).

NickBarnes · 2025-02-19T16:29:19Z

runtime/custom.c

 static value alloc_custom_gen (const struct custom_operations * ops,
                               uintnat bsz,
-                               mlsize_t mem,
-                               mlsize_t max_major,
-                               mlsize_t max_minor,
                               int minor_ok,


Any chance of making these bool while we're here?

NickBarnes · 2025-02-19T16:30:53Z

runtime/custom.c

 CAMLexport value caml_alloc_custom(const struct custom_operations * ops,
                                   uintnat bsz,
                                   mlsize_t mem,
                                   mlsize_t max)
 {
-  return caml_alloc_custom0(ops, bsz, mem, max, 0);
+  return alloc_custom_gen(ops, bsz, /* minor_ok: */ 1, /* local: */ 0);


Should this call caml_memprof_sample_block?

Ah, no. The proxy block is already sampled, and this is now the path for custom blocks without foreign memory.

NickBarnes · 2025-02-20T15:34:57Z

runtime/caml/domain_state.tbl

-
-DOMAIN_STATE(uintnat, dependent_size)
-DOMAIN_STATE(uintnat, dependent_allocated)
+/* How much external memory is currenty held by the minor and major heap. */


s/and major//

NickBarnes · 2025-02-20T15:40:45Z

runtime/major_gc.c

-  intnat alloc_work, dependent_work, extra_work, new_work;
-  intnat my_alloc_count, my_alloc_direct_count, my_dependent_count;
+  intnat alloc_work, extra_work, new_work;
+  intnat my_alloc_count, my_alloc_direct_count;


I'm only skimming this function as I review this PR because I know it's about to change radically (in the next week or so).

NickBarnes · 2025-02-20T15:42:50Z

runtime/memory.c

+          / 100 * caml_custom_minor_ratio){
+      caml_request_minor_gc ();
+    }
+  }else{


Horrible whitespace in this function.

NickBarnes · 2025-02-20T15:52:14Z

runtime/memory.c

+  }else{
+    caml_add_dependent_bytes (Caml_state->shared_heap, nbytes);
+    Caml_state->allocated_dependent_bytes += nbytes;
+    /* FIXME sdolan: what's the right condition here? */


An excellent question. I don't really believe in any of this caml_custom_get_max_major() stuff. "2/3 of 44% of the heap size, divided by 5". I keep expecting to subtract the number I first thought of and tada: it's the playing card hidden in the orange.

stedolan requested a review from NickBarnes February 19, 2025 14:59

stedolan added the runtime label Feb 19, 2025

NickBarnes requested changes Feb 20, 2025
