Google has released Gemma 4 this time (III)
While browsing the forum this time, what struck me most wasn't which company topped yet another leaderboard, but a very basic statement: "Without enough VRAM, no matter how large the model is, it's useless."
I used to understand "slow model" purely as a compute problem. But the more I read, the clearer it became that often the problem isn't that the GPU can't do the math; it's that the data isn't resident in the right place. Change the memory path, and token speed doesn't just slow down: it falls off a cliff.
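The "memory path" point can be made concrete with a back-of-the-envelope estimate: during autoregressive decoding, generating each token streams (roughly) the full set of weights through memory once, so memory bandwidth, not FLOPs, sets the speed ceiling. The sketch below illustrates this; the bandwidth and model-size numbers are assumptions for illustration, not measurements of any particular GPU or model.

```python
# Back-of-the-envelope decode-speed ceiling:
# each generated token must read all model weights once, so
#   tokens/s <= memory_bandwidth / model_size_in_bytes.
# All concrete numbers below are illustrative assumptions.

def decode_ceiling_tokens_per_s(model_bytes: float, bandwidth_bytes_per_s: float) -> float:
    """Upper bound on decode throughput set by memory bandwidth alone."""
    return bandwidth_bytes_per_s / model_bytes

GB = 1e9
model_bytes = 7 * GB  # e.g. a ~7B-parameter model at 8-bit weights (assumed)

# Hypothetical bandwidths: fast GPU VRAM vs. ordinary CPU DRAM.
vram_ceiling = decode_ceiling_tokens_per_s(model_bytes, 900 * GB)  # ~900 GB/s VRAM
dram_ceiling = decode_ceiling_tokens_per_s(model_bytes, 60 * GB)   # ~60 GB/s DRAM

print(f"VRAM-resident ceiling: ~{vram_ceiling:.0f} tok/s")
print(f"DRAM-resident ceiling: ~{dram_ceiling:.1f} tok/s")
```

With these assumed numbers, the same model goes from a ceiling of roughly 130 tok/s when the weights sit in VRAM to under 10 tok/s when they spill to system RAM, which is exactly the "drops drastically" effect: the GPU's compute is untouched, only the memory path changed.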