Nice comments in the code, spotted a typo while reading them, might as well report it.

siphash.c line 25

why the 2-4 variant slowed

Should read:

while the 2-4 variant slowed