These frameworks allow writing Python code, sometimes fairly idiomatic, sometimes constrained, that can be translated and/or compiled to run natively and/or in GPU or other. (See, for example: https://www.tensorflow.org/xla) Might not be in the realm of what you're looking for, but maybe worth considering, at least.