Record evaluation speed for all three datasets vs reference implementations.
Record evaluation speed for all three datasets vs reference implementations.