Qwen 27B

推荐理由

这条记录已有公开讨论或多来源信号，适合验证热度、争议点和后续影响。

Just a datapoint I wanted to share.Qwen 27b, at q6kxl, with multi-token prediction, on a 4090+3090 system, using lcpp, puts out 50-90 tokens/s decode and 1500-2200 token/s pre-fill. Regardless of harness, it reliably interfaces with every API I have asked it to as long as I can link it to the docs. It generates code that works, all the way from single-page apps, LaTeX docs, parsers, crawlers, and most importantly for my use is that it can reliably ingest a decent-size codebase and keep the existing schema for updates. Overall, I think I just want to highlight that this is the first local model I’ve used on my 96GB VRAM system that is reliably coherent, fast, and hasn’t just buried me in added tasks of tuning tools, skills, harnesses, etc.

Dspark looks very exciting. Anybody got insight into whether it can be added to qwen 27b?

主题标签Qwen

原始关键词#dspark#27b

查看原文reddit.com

单一来源，暂无交叉验证