[gguf] Add descriptions to quantization types #615

mishig25 · 2024-04-09T13:00:03Z

I have not found a single place where all different data/quant types of gguf is documented. Therefore, creating this description object that would be useful to the community for understanding different data/quant types.

Afterwards, I plan to make the description available at:

hf.co/docs/hub/gguf
GGUF tensor inspector
More importantly, community can have a source of information that can be used in their projects

mishig25 · 2024-04-09T13:03:46Z

packages/gguf/src/quant_descriptions.ts

+	[GGMLQuantizationType.Q5_K]: `"type-1" 5-bit quantization. Same super-block structure as Q4_K resulting in 5.5 bpw. In "type-1", weights are given by w = d * q + m, where m is the block minimum.`, // src: https://github.com/ggerganov/llama.cpp/pull/1684#issue-1739619305
+	[GGMLQuantizationType.Q6_K]: `"type-0" 6-bit quantization. Super-blocks with 16 blocks, each block having 16 weights. Scales are quantized with 8 bits. This ends up using 6.5625 bpw. In "type-0", weights w are obtained from quants q using w = d * q, where d is the block scale.`, // src: https://github.com/ggerganov/llama.cpp/pull/1684#issue-1739619305
+	[GGMLQuantizationType.Q8_K]: `"type-0" 8-bit quantization. Only used for quantizing intermediate results. The difference to the existing Q8_0 is that the block size is 256. All 2-6 bit dot products are implemented for this quantization type. In "type-0", weights w are obtained from quants q using w = d * q, where d is the block scale.`, // src: https://github.com/ggerganov/llama.cpp/pull/1684#issue-1739619305
+	[GGMLQuantizationType.IQ2_XXS]: "", // todo: add description


@ikawrakow @ggerganov @younesbelkada @FL33TW00D or anyone, I'd greatly appreciate if you can supply any of the the missing descriptions.

You can just post as a comment and I can add/commit it to the file

According to ggerganov/llama.cpp#5063 + offline discussion with @FL33TW00D I would say:

Q4_0: Round-to-Nearest group-wise quantization with a blocksize of 32 and 4-bit quantized weights. Block weights are simply given by w = q * s. Legacy quantization method, and not really used by the community as of today.

I would say Q5_0 / Q8_0 is also RTN but for 5 / 8-bit, not sure yet what _1 stands for Q4_1 - I will let others comment on this

i might got it right for QK_1:

Q4_1: Round-to-Nearest group-wise quantization with a blocksize of 32 and 4-bit quantized weights with an additional term that is added after the de-quantization step. Block weights are simply given by w = q * s + m with m being the minimum of the block. Legacy quantization method, and not really used by the community as of today.

Same comment applies for Q5_1 and Q8_1 I think

julien-c · 2024-04-09T19:17:53Z

packages/gguf/src/quant_descriptions.ts

+	[GGMLQuantizationType.Q5_1]: "", // todo: add description
+	[GGMLQuantizationType.Q8_0]: "", // todo: add description
+	[GGMLQuantizationType.Q8_1]: "", // todo: add description
+	[GGMLQuantizationType.Q2_K]: `"type-1" 2-bit quantization in super-blocks containing 16 blocks, each block having 16 weight. Block scales and mins are quantized with 4 bits. This ends up effectively using 2.5625 bits per weight (bpw). In "type-1", weights are given by w = d * q + m, where m is the block minimum.`, // src: https://github.com/ggerganov/llama.cpp/pull/1684#issue-1739619305


should you encode the src link in the code itself (so a Record<GGMLQuantizationType, { txt: string; url: string }> to be able to link to a reference from the UI?

Another potential idea: indicate a few of them with a featured or popular flag so we can showcase in a UI or something

handled in 240f0df

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: FL33TW00D <chris@fleetwood.dev>

[gguf] rename QUANT_DESCRIPTIONS -> GGUF_QUANT_DESCRIPTIONS follow up to #615

mishig25 requested review from julien-c and coyotte508 as code owners April 9, 2024 13:00

mishig25 commented Apr 9, 2024

View reviewed changes

[gguf] Describe data & quant types

f2690a9

mishig25 force-pushed the gguf_desc branch from 60564c0 to f2690a9 Compare April 9, 2024 13:07

julien-c reviewed Apr 9, 2024

View reviewed changes

mishig25 changed the title ~~[gguf] Add descriptions~~ [gguf] Add descriptions to quantization types Apr 9, 2024

mishig25 added 2 commits April 10, 2024 11:45

cleaner & shorter description

fdb78f1

chore

7e83f0b

mishig25 force-pushed the gguf_desc branch from 2afbef6 to abe0d33 Compare April 10, 2024 10:03

Q4_0, Q4_1, Q5_0, Q5_1, Q8_0, Q8_1

b1aeff5

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: FL33TW00D <chris@fleetwood.dev>

mishig25 force-pushed the gguf_desc branch from abe0d33 to b1aeff5 Compare April 10, 2024 10:05

mishig25 added 5 commits April 10, 2024 12:16

cleaner & shorter

94e0aed

src comments

1c1ef0e

add src comments

fc27b4c

better js object

240f0df

re-export

0d425f9

mishig25 merged commit 0ed8d60 into main Apr 10, 2024
2 checks passed

mishig25 deleted the gguf_desc branch April 10, 2024 14:33

mishig25 mentioned this pull request Apr 10, 2024

[gguf] rename QUANT_DESCRIPTIONS -> GGUF_QUANT_DESCRIPTIONS #618

Merged

mishig25 pushed a commit that referenced this pull request Apr 10, 2024

[gguf] rename QUANT_DESCRIPTIONS -> GGUF_QUANT_DESCRIPTIONS (#618)

1f49e2a

[gguf] rename QUANT_DESCRIPTIONS -> GGUF_QUANT_DESCRIPTIONS follow up to #615

mishig25 mentioned this pull request Apr 10, 2024

update gguf docs huggingface/hub-docs#1268

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[gguf] Add descriptions to quantization types #615

[gguf] Add descriptions to quantization types #615

mishig25 commented Apr 9, 2024 •

edited

Loading

mishig25 Apr 9, 2024 •

edited

Loading

younesbelkada Apr 9, 2024 •

edited

Loading

younesbelkada Apr 9, 2024

julien-c Apr 9, 2024

julien-c Apr 9, 2024

mishig25 Apr 10, 2024

[gguf] Add descriptions to quantization types #615

[gguf] Add descriptions to quantization types #615

Conversation

mishig25 commented Apr 9, 2024 • edited Loading

mishig25 Apr 9, 2024 • edited Loading

Choose a reason for hiding this comment

younesbelkada Apr 9, 2024 • edited Loading

Choose a reason for hiding this comment

younesbelkada Apr 9, 2024

Choose a reason for hiding this comment

julien-c Apr 9, 2024

Choose a reason for hiding this comment

julien-c Apr 9, 2024

Choose a reason for hiding this comment

mishig25 Apr 10, 2024

Choose a reason for hiding this comment

mishig25 commented Apr 9, 2024 •

edited

Loading

mishig25 Apr 9, 2024 •

edited

Loading

younesbelkada Apr 9, 2024 •

edited

Loading