返回
RCreddit.com
8
·开发者社区 · RSS

HIP: use hipBLAS for dense prefill on gfx900, keep MMQ for MoE by DEV-DUFORD · Pull Request #24588 · ggml-org/llama.cpp

查看原文

Vega GPU, codename vega10, including Radeon Vega Frontier Edition, Radeon RX Vega 56/64, Radeon RX Vega 64 Liquid, Radeon Pro Vega 48/56/64/64X, Radeon Pro WX 8200/9100, Radeon Pro V320/V340/SSG, Radeon Instinct MI25

Those are really great numbers for such old architecture & cards. Great for those card holders.

主题标签Llama
原始关键词#hipblas#prefill#request#duford#gfx900#24588
查看原文reddit.com
单一来源,暂无交叉验证
HIP: use hipBLAS for dense prefill on gfx900, keep MMQ for MoE by DEV-DUFORD · Pull Request #24588 · ggml-org/llama.cpp · BuzzRadr