Ensure the oxide parser has feature parity with the stable RegEx parser #11389

RobinMalfait · 2023-06-07T14:11:38Z

This PR ensures that the Rust-based parser extracts every candidate that the RegEx-based parser extracts.

This will allow us to eventually switch to the Rust-based parser by default.

There are a few caveats:

- If a custom transformer or extractor is required for a file, the RegEx-based parser is used for that file.
- If a custom separator or prefix is used, we fall back to the RegEx-based parser.
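The two caveats above amount to a per-file decision about which parser to use. A minimal sketch of that decision follows. The `FileParseOptions` shape, the `chooseParser` name, and the default constants are assumptions made for illustration; they are not taken from the Tailwind CSS source.

```typescript
// Hypothetical description of what is configured for one file.
// None of these names come from the actual codebase.
interface FileParseOptions {
  hasCustomTransformer: boolean
  hasCustomExtractor: boolean
  separator: string
  prefix: string
}

type ParserKind = 'rust' | 'regex'

// Assumed defaults: ':' as the variant separator and no prefix.
const DEFAULT_SEPARATOR = ':'
const DEFAULT_PREFIX = ''

// Pick the parser for a single file, following the two caveats.
function chooseParser(options: FileParseOptions): ParserKind {
  // Caveat 1: custom transformers or extractors need the RegEx parser.
  if (options.hasCustomTransformer || options.hasCustomExtractor) {
    return 'regex'
  }
  // Caveat 2: a custom separator or prefix also falls back to RegEx.
  if (options.separator !== DEFAULT_SEPARATOR || options.prefix !== DEFAULT_PREFIX) {
    return 'regex'
  }
  return 'rust'
}
```

With this sketch, a file with no custom transformer or extractor, the `:` separator, and an empty prefix is the only combination that selects the Rust-based parser.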

These tests run against both the `Regex` and `Rust` based parsers. They cover groups of classes of various shapes and forms, with and without variants, rendered in various template situations (plain, HTML, Vue, ...). All previously skipped tests are enabled as well.

The classes with variants are built in the `templateTable` function, so we get them out again by using the positional arguments of the `test.each` callback function.
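To illustrate how a table of this kind could carry both the rendered template and the classes it was built from, here is a rough sketch. `buildTemplateTable`, `VARIANTS`, and `TEMPLATES` are hypothetical names and values; the real `templateTable` function may look quite different.

```typescript
// One row: the rendered template, then the classes (with variants) it contains.
type Row = [string, string[]]

// Illustrative variants and template shapes, not the real test data.
const VARIANTS: string[] = ['', 'hover:', 'md:hover:']
const TEMPLATES: Array<(classes: string) => string> = [
  (classes) => classes, // plain
  (classes) => `<div class="${classes}"></div>`, // HTML
  (classes) => `<div :class="'${classes}'"></div>`, // Vue-like binding
]

// Build one row per (variant, template) pair.
function buildTemplateTable(classes: string[]): Row[] {
  const rows: Row[] = []
  for (const variant of VARIANTS) {
    const withVariant = classes.map((c) => variant + c)
    for (const render of TEMPLATES) {
      rows.push([render(withVariant.join(' ')), withVariant])
    }
  }
  return rows
}

// In Jest, a table like this can be passed to `test.each(rows)`, which hands
// each row's elements to the callback as positional arguments, for example
// `(template, classes) => { ... }`, so the expected classes travel with the input.
```

Keeping the expected classes in the same row as the template means the assertion does not need to recompute which variants were applied.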

The "anti-test" tests make sure that we are _not_ parsing out certain values given a certain input.

The RegEx parser does extract `underline` from

```html
<div class="peer-aria-[labelledby='a_b']:underline"></div>
```

but that is not needed, and it does not happen in the oxide parser. This means the output checks have to differ slightly between the two parsers, but they are explicit based on the feature flag.
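One way to keep such a check explicit per parser is to list the forbidden candidates once and record which of them the RegEx parser is known to emit anyway. The sketch below is only an illustration under that assumption; `AntiTestCase`, `forbiddenFor`, and `violations` are hypothetical names, not helpers from the test suite.

```typescript
type Parser = 'regex' | 'oxide'

// Hypothetical shape of one anti-test case.
interface AntiTestCase {
  input: string
  // Candidates that should not be extracted from `input`.
  forbidden: string[]
  // Forbidden candidates that the RegEx parser still extracts, and that we tolerate.
  toleratedByRegex: string[]
}

const ariaCase: AntiTestCase = {
  input: `<div class="peer-aria-[labelledby='a_b']:underline"></div>`,
  forbidden: ['underline'],
  toleratedByRegex: ['underline'],
}

// The forbidden list that applies to the parser under test.
function forbiddenFor(testCase: AntiTestCase, parser: Parser): string[] {
  if (parser === 'oxide') return testCase.forbidden
  return testCase.forbidden.filter((c) => testCase.toleratedByRegex.indexOf(c) === -1)
}

// Extracted candidates that should not have been extracted.
function violations(extracted: string[], forbidden: string[]): string[] {
  return extracted.filter((c) => forbidden.indexOf(c) !== -1)
}
```

A test would then run the parser selected by the feature flag on `ariaCase.input` and assert that `violations(extracted, forbiddenFor(ariaCase, parser))` is empty.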

Moving the code around makes sure all the fancy SIMD stuff happens as early as possible. This results in an extremely minor perf increase.

One tweak was undone because it made no meaningful perf difference in real-world scenarios.

Fast skipping is disabled for now: it needs to be done in a different spot so it doesn't affect how things are returned.

adamwathan and others added 30 commits June 7, 2023 15:56

WIP

b8bdcdc

use parse instead of defaultExtractor

a898328

skip Vue describe block

7f318a7

add a few more dedicated arbitrary values/properties tests

baa9afe

use parallel parsing

831fc88

splitup Vue tests

fbb9375

add some Rust specific tests

a0d1116

setup parse candidate strings test system

d1c039f

ensure we also validate the classes with variants

1e15887

cleanup test suite

a3c3c37

add "anti-test" tests

3b2055a

Add ParseAction enum

9dad5a0

Restart parsing following an arbitrary parse failure

0bc7ef5

Split variants off before validating the utility part

3d5c29c

Collapse candidate from the end when validation fails

d2f34b4

Support <, and > in variant position

d6352b2

fix error

7372973

format parser.rs

df36a77

Refactor

e980e99

Update editorconfig

dc2cadd

wip

0f1d397

wip

1c9b2cd

Refactor

2e3f1be

Refactor

4aef1a7

Simplify

f796c13

wip

8281db4

wip

27e2f1e

wip

efe0f82

wip

d47bfdc

wip

478743a

thecrypticace and others added 28 commits June 7, 2023 15:56

fmt

82fa69f

Simplify

f0a09f1

Add cursor details to trace

80aa8b8

cargo fmt

364b1a9

use preferred zoom-0.5 name instead of zoom-.5

d2c81a1

allow extracting variants+utilities inside {} for the oxide parser

c0a71c1

characters in candidates such as group-${id} should not be allowed

42f035d

do not extract any of the following candidate w-[foo-bar]w-[bar-baz]

4043815

ensure we can consume the full candidate and discard it

6067112

Add fast skipping of whitespace

0a9369f

Use fast skipping whenever possible

b23bc76

Add fast skipping to benchmark

d3fa21f

Hand-tune to generate more optimized assembly

ab8a087

Move code around a bit

b811d06

Undo tweak

18e02c0

Disable fast skipping for now

49a0801

Change test names

f216989

Fix normalize config error

5a7f703

cleanup a bit

bf848dd

Cleanup

d9a9228

Extract validation result enum

6d74c8f

Cleanup comments

d139046

Simplify

4f46796

Fix formatting

099f3c6

Run clippy

7f849c1

wip

6af1934

add md> under the special characters test set

dd1c659

RobinMalfait merged commit 55daf8e into master Jun 7, 2023

RobinMalfait deleted the test-both-parsers branch June 7, 2023 15:44

4 participants