Tidy up the tests with a boilerplate relative error calculation. Fix eval_ldexp for mpf_t. Fix the power and log calculations. Add acos. Get all the tests passing.