c: use AVX and SSE to bulk scan by rmg · Pull Request #1 · rmg/hexgrep

rmg · 2019-08-03T23:56:01Z

Use SSE and AVX instructions to scan 16, 32, and 64 byte ranges instead of
only inspecting a single byte at a time.
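
As a rough illustration of the bulk approach (a sketch, not the code in this PR): with SSE2, one unaligned load plus a handful of compares can classify 16 bytes at once. The helper name `hex_mask16` and the restriction to lowercase hex digits are assumptions made for this example, and a scalar fallback is included so it still builds on non-x86 targets.

```c
#if defined(__SSE2__)
#include <emmintrin.h> /* SSE2 intrinsics */
#endif

/* Hypothetical helper: returns a 16-bit mask where bit i is set
 * when p[i] is a lowercase hex digit (0-9 or a-f).
 * The caller must guarantee that 16 bytes are readable at p. */
static unsigned hex_mask16(const unsigned char *p)
{
#if defined(__SSE2__)
    __m128i v = _mm_loadu_si128((const __m128i *)p);
    /* Signed compares are fine here: bytes >= 0x80 are negative
     * and therefore fail both "greater than" tests. */
    __m128i is_digit = _mm_and_si128(
        _mm_cmpgt_epi8(v, _mm_set1_epi8('0' - 1)),
        _mm_cmplt_epi8(v, _mm_set1_epi8('9' + 1)));
    __m128i is_alpha = _mm_and_si128(
        _mm_cmpgt_epi8(v, _mm_set1_epi8('a' - 1)),
        _mm_cmplt_epi8(v, _mm_set1_epi8('f' + 1)));
    /* Collapse the per-byte 0x00/0xFF results into one bit per byte. */
    return (unsigned)_mm_movemask_epi8(_mm_or_si128(is_digit, is_alpha));
#else
    /* Portable fallback: same result, one byte at a time. */
    unsigned mask = 0;
    for (int i = 0; i < 16; i++) {
        unsigned char c = p[i];
        if ((c >= '0' && c <= '9') || (c >= 'a' && c <= 'f'))
            mask |= 1u << i;
    }
    return mask;
#endif
}
```

A mask of `0xFFFF` means all 16 bytes are hex digits. Any zero bit marks a position where a run of hex digits is broken.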

While this was a lot of fun to do, it turns out not to be as efficient
as being clever about avoiding comparisons whenever possible. That is,
reading only 1 byte in 20 is better than reading 32 at once, even if both
take the same number of instructions. This is because there is overhead in
loading the 128-bit and 256-bit registers, and that overhead reduces the
gains enough to make the result slightly slower overall.
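
The comparison-avoiding idea can be sketched as follows. This is an illustration under assumptions, not hexgrep's actual code: `is_hex` and `find_hex_run` are hypothetical names, only lowercase hex digits are accepted, and the probe stride is simply the minimum run length `n` (the stride hexgrep itself uses may differ). Any run of `n` consecutive hex digits must cover one of the positions `n-1`, `2n-1`, `3n-1`, ..., so only those bytes need to be read until one of them turns out to be a hex digit.

```c
#include <stddef.h>

/* Hypothetical helper: lowercase hex digits only. */
static int is_hex(unsigned char c)
{
    return (c >= '0' && c <= '9') || (c >= 'a' && c <= 'f');
}

/* Returns the offset of the first run of at least n hex digits in
 * buf[0..len), or -1 if there is none. */
static ptrdiff_t find_hex_run(const unsigned char *buf, size_t len, size_t n)
{
    if (n == 0 || len < n)
        return -1;

    size_t floor = 0; /* lowest index that could still start a run */
    for (size_t i = n - 1; i < len; i += n) {
        if (!is_hex(buf[i])) {
            /* No run of n digits can cover i: skip n bytes ahead. */
            floor = i + 1;
            continue;
        }
        /* Probe hit a hex digit: measure the run around it. */
        size_t start = i;
        while (start > floor && is_hex(buf[start - 1]))
            start--;
        size_t end = i + 1;
        while (end < len && is_hex(buf[end]))
            end++;
        if (end - start >= n)
            return (ptrdiff_t)start;
        floor = end; /* run too short; nothing before end can match */
    }
    return -1;
}
```

For example, `find_hex_run((const unsigned char *)"zzzz0123456789zz", 16, 8)` probes index 7, finds a digit, and expands to the run that starts at offset 4. On input with few hex digits, most iterations touch a single byte and move on, which is the case where skipping beats loading wide registers.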


rmg force-pushed the fun-with-avx branch from 201780f to 873323b · August 6, 2019 20:21

rmg force-pushed the fun-with-avx branch from 873323b to 1a8d416 · August 8, 2019 00:25

rmg wants to merge 1 commit into master from fun-with-avx
