Hello!

As discussed, I'm sending patches that reimplement our SSE4.2 search_line_fast helper with SSSE3, and then add the corresponding AVX2 helper. They apply on top of Andi's "Remove MMX code path in lexer" patch, which has been approved but not committed yet (Andi, can you push your own patch?).
Apparently the branch where we find a possible EOL and return from the function is poorly predictable, hence a small win from using AVX2 (wider vectors => fewer mispredicts).

I'm also attaching a microbenchmark for testing all variants in isolation.

Alexander
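
To illustrate the "wider vectors => fewer mispredicts" point, here is a minimal portable sketch. It is not the code from the patches. It models a vector search with a scalar loop that builds one match mask per chunk and then tests it with a single exit branch. The names search_line_chunked and is_interesting, the width parameter, and the exit_checks counter are all hypothetical and exist only for this illustration. The stop set ('\n', '\r', '\\', '?') is assumed to match what search_line_fast looks for.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: bytes assumed to end a fast line scan. */
static int is_interesting(unsigned char c)
{
    return c == '\n' || c == '\r' || c == '\\' || c == '?';
}

/* Scan [s, end) in chunks of `width` bytes; width 16 stands in for an
   SSE-sized register and width 32 for an AVX2-sized one.  Returns a
   pointer to the first interesting byte, or `end` if there is none.
   *exit_checks receives the number of times the "found something?"
   branch was evaluated, i.e. the number of chances to mispredict.  */
static const unsigned char *
search_line_chunked(const unsigned char *s, const unsigned char *end,
                    size_t width, size_t *exit_checks)
{
    assert(width > 0 && width <= 64);
    *exit_checks = 0;

    while (s < end) {
        size_t left = (size_t)(end - s);
        size_t n = left < width ? left : width;

        /* Emulates compare + movemask: bit i is set if s[i] matches. */
        uint64_t mask = 0;
        for (size_t i = 0; i < n; i++)
            mask |= (uint64_t)is_interesting(s[i]) << i;

        /* The data-dependent exit branch: evaluated once per chunk. */
        ++*exit_checks;
        if (mask != 0) {
            size_t i = 0;
            while ((mask & 1) == 0) {   /* index of lowest set bit */
                mask >>= 1;
                i++;
            }
            return s + i;
        }
        s += n;
    }
    return end;
}
```

For a line of 40 ordinary characters followed by '\n', the sketch evaluates the exit branch three times with width 16 and twice with width 32. Fewer evaluations of a hard-to-predict branch per line is the effect described above, though the actual gain depends on the input and the CPU.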

Attachment: search-line-bench.tgz (application/gzip)