Thanks Vilem, but I have to code for platform independence at a binary level, i.e. no compile-time flags.
SIMD is not an option, and as Reed has already noticed, it's counterproductive on a hell of a lot of platforms.
My stuff used to be very efficient, but I have noticed that some of the methods are invalidating the cache now when they didn't use to, and they seem to have got bigger as well.
I really don't have the time to investigate why and fix it. Usual problem: too much work, not enough time.
Now I am doing more OpenGL ES 2 stuff, and an efficient maths library is much more important than it used to be.
Before, I could just use glTranslatef, glRotatef, glPushMatrix, etc.
Now I have to code all of those myself. Not a complex task at all; in fact I just took my old code from my software rasteriser and dropped it in.
TheNut, I think you are right. I'm just going to have to find some time, aren't I? :wacko: