Blogs
首页
归档
分类
标签
关于
首页
归档
分类
标签
关于
论文阅读
论文阅读习惯
2024-11-11
科研
论文阅读
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models
2024-11-09
科研
论文阅读
OneBit: Towards Extremely Low-bit Large Language Models
2024-11-08
科研
论文阅读
深度学习
,
量化
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
2024-10-27
科研
论文阅读
深度学习
,
量化
大模型量化~GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
2024-10-24
科研
论文阅读
深度学习
,
量化
llm.int8
2024-10-24
科研
论文阅读
深度学习
,
量化
论文阅读:A_Survey_of_Quantization_Methods_for_Efficient_Neural_Network_Inference
2023-10-23
科研
论文阅读
量化
论文阅读:BitNet_Scaling_1-bit_Transformers_for_Large_Language_Models
2023-10-23
科研
论文阅读
量化