Source: reddit.com

Gemma Avatar: Talk to Gemma 4-31B face to face


This is a voice chat with Gemma 4 31B in which you talk to a 3D avatar. It listens while you speak and answers with both a voice and a face. The avatar is exposed to the LLM as function tools (set_mood, make_hand_gesture, make_facial_expression), and Gemma decides on its own which expressions to use.
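To make the tool setup concrete, here is a rough sketch of how the three avatar tools could be declared and dispatched. Only the tool names (set_mood, make_hand_gesture, make_facial_expression) come from the post. The OpenAI-style schema layout, the parameter names, the enum values, and the dispatch_tool_call helper are all assumptions for illustration, not the project's actual code.

```python
import json

# Hypothetical tool schemas in the widely used OpenAI-style "function" format.
# Tool names are from the post; parameters and enum values are assumptions.
AVATAR_TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "set_mood",
            "description": "Set the avatar's overall mood.",
            "parameters": {
                "type": "object",
                "properties": {
                    "mood": {"type": "string",
                             "enum": ["neutral", "happy", "sad", "angry"]},
                },
                "required": ["mood"],
            },
        },
    },
    {
        "type": "function",
        "function": {
            "name": "make_hand_gesture",
            "description": "Play a one-off hand gesture.",
            "parameters": {
                "type": "object",
                "properties": {
                    "gesture": {"type": "string",
                                "enum": ["wave", "thumbs_up", "shrug"]},
                },
                "required": ["gesture"],
            },
        },
    },
    {
        "type": "function",
        "function": {
            "name": "make_facial_expression",
            "description": "Show a short facial expression.",
            "parameters": {
                "type": "object",
                "properties": {
                    "expression": {"type": "string",
                                   "enum": ["smile", "frown", "surprise"]},
                },
                "required": ["expression"],
            },
        },
    },
]


def dispatch_tool_call(name, arguments_json):
    """Validate one tool call from the model and build a front-end command."""
    known = {t["function"]["name"]: t["function"] for t in AVATAR_TOOLS}
    if name not in known:
        raise ValueError(f"unknown tool: {name}")
    args = json.loads(arguments_json)
    schema = known[name]["parameters"]
    for key in schema["required"]:
        if key not in args:
            raise ValueError(f"missing argument: {key}")
    for key, value in args.items():
        prop = schema["properties"].get(key)
        if prop is None:
            raise ValueError(f"unexpected argument: {key}")
        if "enum" in prop and value not in prop["enum"]:
            raise ValueError(f"invalid value for {key}: {value}")
    # The message shape sent to the browser is also an assumption.
    return {"type": "avatar_command", "tool": name, "args": args}
```

Validating against the declared enums before forwarding means a hallucinated gesture name gets rejected on the server instead of reaching the avatar renderer.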

The stack is all open models: Silero VAD for voice activity detection, Parakeet for STT, Gemma 4 31B (served by Cerebras, which is why replies come back fast), and Qwen3-TTS for speech. Audio is sent as raw PCM over a plain WebSocket.
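For anyone unfamiliar with what "raw PCM" means on the wire, here is a minimal sketch of packing audio samples into fixed-size binary frames of the kind you would send as WebSocket binary messages. The post does not state the audio format, so the 16 kHz sample rate, 16-bit mono little-endian encoding, and 20 ms frame size are all assumptions, as are the helper names.

```python
import struct

SAMPLE_RATE = 16000  # Hz; assumption, not stated in the post
FRAME_MS = 20        # frame duration in ms; assumption


def floats_to_pcm16(samples):
    """Convert float samples in [-1.0, 1.0] to 16-bit little-endian PCM bytes."""
    ints = []
    for s in samples:
        clipped = max(-1.0, min(1.0, s))  # clip to avoid integer overflow
        ints.append(int(round(clipped * 32767)))
    return struct.pack("<%dh" % len(ints), *ints)


def split_frames(pcm, sample_rate=SAMPLE_RATE, frame_ms=FRAME_MS):
    """Split PCM bytes into frames; each frame would be one binary message."""
    samples_per_frame = sample_rate * frame_ms // 1000
    frame_bytes = samples_per_frame * 2  # 2 bytes per 16-bit mono sample
    return [pcm[i:i + frame_bytes] for i in range(0, len(pcm), frame_bytes)]


# Usage: one second of silence becomes fifty 640-byte frames.
frames = split_frames(floats_to_pcm16([0.0] * SAMPLE_RATE))
```

Small fixed-size frames keep latency low and line up with how VAD models typically consume audio in short windows, though the real frame size here is unknown.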

For the avatar and lip-syncing it uses met4citizen's TalkingHead (https://github.com/met4citizen/TalkingHead) and HeadAudio (https://github.com/met4citizen/HeadAudio).
