[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"news-5ef6f8fe-9877-4632-bb4a-690c7e73975e":3},{"id":4,"title":5,"summary":6,"original_url":7,"source_id":8,"tags":9,"published_at":23,"created_at":24,"modified_at":25,"is_published":26,"publish_type":27,"image_url":13,"view_count":28},"5ef6f8fe-9877-4632-bb4a-690c7e73975e","Gemini 3.5 Flash 内置 Computer Use：OSWorld 78.4 把屏幕操控推成工程能力","6 月 24 日，Google 把 Computer Use 直接焊进 Gemini 3.5 Flash。开发者只需在 API 启用 `computer_use` 工具，模型就能截屏看屏幕、以鼠标键盘动作，浏览器、移动端、桌面共用一套接口。这是 Google 首次把这个能力下放到 Flash 级别，而非仅 Pro 或独立 preview。\n\nOSWorld-Verified 上 Gemini 3.5 Flash 拿到 78.4，比 Gemini 3 Flash（65.1）提升 13.3 分，超过 GPT-5.4 mini（72.1），与 Sonnet 4.6 持平（78.4），仅落后 GPT-5.5（78.7）和 Opus 4.8（83.4）几个点。Flash 级的推理成本拿到这个分数，意味着 Computer Use 走出实验室 demo，进入工程现实。\n\n更值得关注的是安全设计。Computer Use 最大隐患是 prompt injection——恶意网页指令就可能劫持 agent 行为。Google 给出三道防线：对抗训练打底、敏感动作需用户确认的开关、检测到间接 prompt injection 时自动中止。配合文档反复强调的沙箱、人审、最小权限，构成工程级防御姿态。\n\nGitHub 同步开源参考实现（google-gemini\u002Fcomputer-use-preview），Browserbase 给出在线 demo。Flash 级模型能直接操作屏幕后，每个跑 SaaS 自动化的团队都可以问一句：我们还有多少 RPA 脚本是非必要的？","https:\u002F\u002Fblog.google\u002Finnovation-and-ai\u002Fmodels-and-research\u002Fgemini-models\u002Fintroducing-computer-use-gemini-3-5-flash\u002F","4d11edad-2df6-45f6-b71f-70f65de7f7fd",[10,14,17,20],{"id":11,"name":12,"slug":12,"description":13,"color":13},"a9524a82-a7c5-4daa-bb4b-a7ee77bb0b94","gemini",null,{"id":15,"name":16,"slug":16,"description":13,"color":13},"8cf7490f-2449-4ba7-be19-61befa0d92b4","google",{"id":18,"name":19,"slug":19,"description":13,"color":13},"7e89b5cc-57db-4f37-bc6d-28919a73931c","model-release",{"id":21,"name":22,"slug":22,"description":13,"color":13},"499f4b56-819d-49a3-9609-33e775143b86","multimodal","2026-06-26T00:00:00Z","2026-06-26T00:08:41.592402Z","2026-06-26T00:08:41.592413Z",true,"agent",2]