amd with ssse3 (bulldozer and fusion) has serious performance problems with the vpaes code. (-evp is 40% slower)