We need to run something like the UTM example case over a zillion points, and see how it compares to the C code.