Skip to content

A different dictionary can trivially improve g-zip compress ratio #8

@qeleb

Description

@qeleb

Currently, the default dictionary is: _-abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789

Based on a character frequency analysis of the code in a few of my projects it should be re-ordered to this: etionraldfps0gx-1chbum4v6w25k9y873zjHCONADLYqBEFGIJKMPQRSTUVWXZ_

For me this measurably improved g-zip compression ratio without any cost. Perhaps a deeper analysis could be done for the most optimal default dictionary, but this is at least a step in the right direction.

Metadata

Metadata

Assignees

No one assigned

    Labels

    improvementSomething existing could be improved

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions