feat: Nx.elixir_call/3 #1627

polvalente · 2025-08-09T17:43:51Z

Allows calling Elixir functions inside defn expressions.

Currently limited to Nx.Defn.Evaluator. EXLA will be able to support this in a future PR by making use of Nx.Defn.Graph, splitting the expression before and after the elixir_call node, creating an isolated stage for the elixir call.

josevalim · 2025-08-10T10:32:11Z

Great! Although I'm not sure if it elixir_call is the best name? It also seems this relates to optional callbacks somehow? For example, optional callbacks require a default implementation in Elixir to be given, so they have similar dispatch mechanisms. On the other hand, we may also want to allow what is defined as an Elixir call to go through grad or be optimised in EXLA. So I'm thinking there is an overall unified mechanism where they are specified the same, but the compiler decides if it is a split or compiled, based on its structure at compile time. Glad to chat about it later!

…-feat/elixir-call

exla/c_src/exla/exla_nif_util.h

exla/c_src/exla/exla.cc

jonatanklosko · 2025-11-25T12:48:06Z

exla/c_src/exla/exla.cc

+ElixirCallbackBridgeState *GetElixirCallbackBridgeState() {
+  static ElixirCallbackBridgeState *state = new ElixirCallbackBridgeState();
+  return state;


Is this supposed to allocate the state every time, or rather be a global? Currently the NIFs call this function and don't deallocate the state.

Should be "global". Although I should probably be attaching the lifecycle to the handler process' lifetime.

exla/c_src/exla/exla.cc

polvalente · 2025-11-26T08:32:15Z

exla/c_src/exla/custom_calls/elixir_callback.cc

@@ -0,0 +1,107 @@
+#include "elixir_callback_bridge.h"


@jonatanklosko I've moved things around a bit and complied to basically all of your reviews. The only one I was unable to do is use named processes.

exla/c_src/exla/custom_calls/elixir_callback_bridge.h

exla/c_src/exla/custom_calls/elixir_callback_bridge.cc

exla/c_src/exla/exla.cc

exla/c_src/exla/custom_calls/elixir_callback_bridge.cc

exla/lib/exla/callback_server.ex

exla/c_src/exla/custom_calls/elixir_callback_bridge.cc

josevalim · 2025-11-27T13:22:02Z

exla/lib/exla/callback_server.ex

+  end
+
+  defp ensure_compatible(%Nx.Tensor{} = left, %Nx.Tensor{} = right) do
+    if left.shape == right.shape and left.type == right.type and left.names == right.names do


Don't we have Nx.compatible? or something?

josevalim · 2025-11-27T13:23:09Z

nx/lib/nx.ex

+  """
+  @doc type: :backend
+  def elixir_call(output, args, fun) when is_list(args) and is_function(fun) do
+    {:arity, arity} = Function.info(fun, :arity)


Dynamic arities will be a pain to type, my suggestion is to force either tuples or maps.

Maybe two arguments: tensors and options.

Let's force 1 or 2 arguments. First is a tensor or tensor container, second is the options.

I went with one tnesor-container argument and another for opts :)

josevalim · 2025-11-27T13:23:58Z

nx/lib/nx/backend.ex

+  to a custom_call that interacts with Erlang/Elixir via C; pure CPU
+  backends may call the function directly.
+  """
+  @callback elixir_call(out :: tensor | tuple, [term], fun) :: tensor


This is a compiler function, not a backend one? 🤔

Not sure what you mean by that. This is a backend callback at least because we need Nx.Defn.Expr to have this defined, but when running Nx.Defn.Evaluator we can have backends call the function directly and so on.

josevalim · 2025-11-27T13:24:42Z

nx/lib/nx.ex

+  any list arguments. Lists (including keyword lists) are treated as static
+  Elixir data that is appended to the callback at runtime, while the leading
+  non-list arguments are compiled as tensors and shipped to the target
+  backend. Passing a tensor after a list argument raises an error.


We need examples.

josevalim · 2025-11-27T13:29:15Z

torchx/test/torchx/defn/elixir_call_test.exs

+    expected = Nx.add(Nx.multiply(fx, 2.0), Nx.add(fx, 1.0))
+    assert_equal(y, expected)
+  end
+end


IMO these tests are not necessary as there is nothing torch specific!

josevalim · 2025-11-27T13:30:07Z

nx/test/nx/defn/elixir_call_evaluator_test.exs

+    fx = Nx.as_type(x, :f32)
+    expected = Nx.add(Nx.multiply(fx, 2.0), Nx.add(fx, 1.0))
+    assert expected == y
+  end


We should check map output too and potentially nesting? We have the foundation for this already inside Nx.Defn anyway.

polvalente added 5 commits August 8, 2025 12:12

feat: add initial draft

d72680a

evaluator mode working

da7d7e4

test: add tests

fc9c28c

fix grad

25300b7

Merge remote-tracking branch 'origin/main' into pv-feat/elixir-call

a91c42b

polvalente added 9 commits November 22, 2025 01:29

feat(exla): initial Nx.elixir_call/3 CPU wiring

aa35431

feat: seemingly working mvp

37a15af

feat: first working version

7127a8c

wip: step through code review

bc52205

finish changes code review

95f7860

Merge branch 'main' into pv-feat/elixir-call

2a1e627

chore: remove unused files

c9aa9bd

Merge branch 'pv-feat/elixir-call' of github.com:elixir-nx/nx into pv…

c36a4c6

…-feat/elixir-call

docs: document the lock issue

c7c4871