HIP: use hipBLAS for dense prefill on gfx900, keep MMQ for MoE by DEV-DUFORD · Pull Request #24588 · ggml-org/llama.cpp
The gfx900 target covers Vega GPUs (codename vega10), including Radeon Vega Frontier Edition, Radeon RX Vega 56/64, Radeon RX Vega 64 Liquid, Radeon Pro Vega 48/56/64/64X, Radeon Pro WX 8200/9100, Radeon Pro V320/V340/SSG, and Radeon Instinct MI25.
Those are really great numbers for such an old architecture and such old cards. Great news for anyone who still owns one.