Sub 4-bit Power-of-Two-Based Mixed-Precision Quantization for Efficient LLM Compression and Acceleration - researchr publication references

researchr

You are not signed in
Sign in
Sign up

Han Cho, Apurba Prasad Padhy, Fernando Camacho, Saibal Mukhopadhyay. Sub 4-bit Power-of-Two-Based Mixed-Precision Quantization for Efficient LLM Compression and Acceleration. IEEE Access, 13:209356-209367, 2025. [doi]

No references recorded for this publication.

No citations of this publication recorded.

runs on WebDSL