skip_whitespace is inefficient for non-whitespace tokens #209

jdm · 2017-12-08T19:33:42Z

Depending on how often we call skip_whitespace when non-whitespace tokens are expected (which is actually quite common, IIRC) we can probably make it more efficient by adding a case to the top of the match at https://github.com/servo/rust-cssparser/blob/master/src/tokenizer.rs#L459 that catches any byte after / and immediately returns.


SimonSapin · 2017-12-08T19:40:29Z

Line 459 is currently b' ' | b'\t' => {, I don’t understand what you mean by "any byte after /".

jdm · 2017-12-08T19:45:09Z

I mean that / has the highest ASCII value of any byte the match handles, so any byte value greater than it is clearly neither whitespace nor the start of a comment. Adding a check for that case before any other might be more efficient.

SimonSapin · 2017-12-08T20:02:32Z

Oh I see. So something like byte if byte > b'/' => return, ?

jdm · 2017-12-08T21:38:48Z

Yeah.
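
For illustration, the guard discussed above could look roughly like the sketch below. This is not the actual tokenizer code: the function takes a plain byte slice and a position rather than the real Tokenizer type, and the comment handling is a simplified assumption. The whitespace bytes are tab (0x09), line feed (0x0A), form feed (0x0C), carriage return (0x0D) and space (0x20); / is 0x2F, so any byte above it can return immediately.

```rust
/// Hypothetical stand-in for the tokenizer's whitespace skipper (not the
/// real cssparser code). Returns the index of the first byte at or after
/// `pos` that is neither whitespace nor part of a /* ... */ comment.
fn skip_whitespace(input: &[u8], mut pos: usize) -> usize {
    while pos < input.len() {
        match input[pos] {
            // Fast path: every byte handled below is <= b'/' (0x2F), so a
            // larger byte can be neither whitespace nor a comment start.
            byte if byte > b'/' => return pos,
            b' ' | b'\t' | b'\n' | b'\x0C' | b'\r' => pos += 1,
            b'/' if input.get(pos + 1) == Some(&b'*') => {
                pos += 2;
                // Advance past the closing "*/", or to the end of input
                // if the comment is never closed.
                while pos < input.len() {
                    if input[pos] == b'*' && input.get(pos + 1) == Some(&b'/') {
                        pos += 2;
                        break;
                    }
                    pos += 1;
                }
            }
            _ => return pos,
        }
    }
    pos
}
```

Whether the extra arm is a net win depends on how the compiler lowers the match, so it would need benchmarking before being adopted.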

emilio · 2023-08-02T13:06:17Z

We use match_byte for this now. It uses a lookup table, which seems to perform well.
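
As a rough picture of the lookup-table idea, the sketch below classifies each byte with a single indexed load from a 256-entry table. It is hand-written for illustration and is not what match_byte actually generates; the names Class, build_table, TABLE, classify and skip_whitespace_with_table are all hypothetical, and comment handling is left out.

```rust
/// Byte classes a whitespace skipper needs to tell apart (hypothetical).
#[derive(Clone, Copy, PartialEq, Debug)]
enum Class {
    Other,
    Whitespace,
    Slash,
}

/// Build the 256-entry table at compile time.
const fn build_table() -> [Class; 256] {
    let mut table = [Class::Other; 256];
    table[b' ' as usize] = Class::Whitespace;
    table[b'\t' as usize] = Class::Whitespace;
    table[b'\n' as usize] = Class::Whitespace;
    table[b'\x0C' as usize] = Class::Whitespace;
    table[b'\r' as usize] = Class::Whitespace;
    table[b'/' as usize] = Class::Slash;
    table
}

static TABLE: [Class; 256] = build_table();

/// One indexed load per byte; a u8 index can never be out of range for a
/// 256-entry array.
fn classify(byte: u8) -> Class {
    TABLE[byte as usize]
}

/// Skip plain whitespace only; stops at the first non-whitespace byte,
/// including b'/', which a real tokenizer would inspect for a comment.
fn skip_whitespace_with_table(input: &[u8], mut pos: usize) -> usize {
    while pos < input.len() && classify(input[pos]) == Class::Whitespace {
        pos += 1;
    }
    pos
}
```

With a table, a non-whitespace byte costs the same single lookup as any other byte, so a separate byte > b'/' guard adds little.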

emilio closed this as completed Aug 2, 2023
