Sub 4-bit Power-of-Two-Based Mixed-Precision Quantization for Efficient LLM Compression and Acceleration

Han Cho, Apurba Prasad Padhy, Fernando Camacho, Saibal Mukhopadhyay. Sub 4-bit Power-of-Two-Based Mixed-Precision Quantization for Efficient LLM Compression and Acceleration. IEEE Access, 13:209356-209367, 2025.

Authors

Han Cho

Apurba Prasad Padhy

Fernando Camacho

Saibal Mukhopadhyay
