Added SSE2 optimised Lanczos approximations. Reordered the tgamma function to reduce the number of comparisons needed.