Merge pull request #969 from youkaichao/rmsnorm
fix rmsnorm and act_quant_kernel
fix act_quant_kernel (#968) Signed-off-by: youkaichao
support scale_fmt=ue8m0 (#964)
* support scale_fmt=ue8m0
* keep improving Signed-off-by: youkaichao
* keep improving Signed-off-by: youkaichao
* add clamp min of 1e-4 Signed-off-by: youkaichao
* rename…
Merge pull request #903 from yixing1992/main Update README.md for Huawei Ascend NPU support modes
Update README.md for Huawei Ascend NPU support modes
Merge pull request #666 from codinglover222/deepseek-doc-fix fix an args description.
Merge pull request #736 from shihaobai/main Docs: add LightLLM as supported engine
Merge pull request #816 from KPCOFGS/main Update README.md
Merge pull request #720 from xiaokongkong/main modify the explanation of MLA