Evaluate zlib-cloudflare for 15% performance speedup of WarcRecordWriter #22

@tfmorris

Description

According to this 2019 analysis, a full third of WarcRecordWriter's time is spent in zlib.so. Cloudflare maintains a performance-enhanced, drop-in-compatible fork of zlib, zlib-cloudflare, which is claimed to be almost twice as fast at gzip compression.

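This could provide a significant speedup (~15% overall) for minimal implementation cost: if a third of the runtime is compression and that part becomes almost twice as fast, roughly a sixth of the total runtime goes away. There is documentation available which describes how to set it up. Although it is a Graviton page, the instructions apply to all architectures.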
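The ~15% figure can be sanity-checked with Amdahl's law. The sketch below is illustrative only: the 1/3 share comes from the analysis cited above, while the 1.8x and 2.0x compression speedups are assumed stand-ins for "almost twice as fast" and are not measurements. The function names are hypothetical.

```python
def runtime_saved(fraction: float, component_speedup: float) -> float:
    """Fraction of total runtime removed when `fraction` of the runtime
    becomes `component_speedup` times faster (Amdahl's law)."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    if component_speedup <= 0.0:
        raise ValueError("component_speedup must be positive")
    return fraction * (1.0 - 1.0 / component_speedup)


def overall_speedup(fraction: float, component_speedup: float) -> float:
    """Overall speedup factor implied by the same inputs."""
    return 1.0 / (1.0 - runtime_saved(fraction, component_speedup))


# Share of WarcRecordWriter time spent in zlib, per the cited analysis.
ZLIB_SHARE = 1.0 / 3.0

# Assumed compression speedups; replace with measured values once available.
for assumed in (1.8, 2.0):
    saved = runtime_saved(ZLIB_SHARE, assumed)
    total = overall_speedup(ZLIB_SHARE, assumed)
    print(f"zlib {assumed:.1f}x faster: {saved:.1%} runtime saved, {total:.2f}x overall")
```

Under these assumptions the saving lands between roughly 15% and 17% of total runtime, so the estimate is only as good as the zlib share and the claimed compression speedup; both should be re-measured on a current workload.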

Switching to a different compression algorithm is also possible, but it would be much more disruptive to the ecosystem than a drop-in replacement that implements the same algorithm.
