We left quite a few messes behind in our evaluator - let's get rid of some ref
s, stop returning multiple values in our test code, and handle blocks similarly to other code.
More info: https://artificialworlds.net/blog/2025/06/27/mini-rust-in-rust-035-tidying-evaluator