[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"news-9e0dd6e3-920b-4802-9b8d-23b118c371a1":3},{"id":4,"title":5,"summary":6,"original_url":7,"source_id":8,"tags":9,"published_at":23,"created_at":24,"modified_at":25,"is_published":26,"publish_type":27,"image_url":13,"view_count":28},"9e0dd6e3-920b-4802-9b8d-23b118c371a1","Gemini Omni 落地 YouTube Shorts：多模态 AI 从技术秀场走向大众创作工具","Gemini Omni 是 Google 上周 I\u002FO 2026 发布的新型统一多模态视频生成模型，最大的落地动作之一是整合进 YouTube Shorts，推出「Remix」功能。用户可用自然语言指令让 AI 在原始视频基础上重新生成——比如把舞蹈变像素风、给人物换装、把自己 P 进别人的短剧里。全程无需剪辑技能。\n\n这不是滤镜，而是真正的视频理解 + 重建。Gemini Omni 不套用预设变换，而是理解视频内容，再用扩散模型重建符合用户意图的画面。YouTube Shorts 数十亿日观看量的体量，让多模态生成 AI 从技术展示真正走向了大众创作者。\n\nGemini Omni 的核心突破在于架构统一：一个模型同时处理文本、图像、音频、视频输入输出，不再是 Veo + Imagen + Chirp 各自分立。对开发者而言，单一 API 调用多模态内容的效率远高于拼接多个专用模型。更重要的是，统一骨干让跨模态理解更深——生成视频时能同时参考文字指令和画面信息，这是分立模型难以做到的。\n\n这场 AI 视频创作大众化也带来了创作所有权的核心争议。Google 设置了水印和原创授权机制，但溯源技术能否真正保护创作者，仍是行业悬而未决的问题。","https:\u002F\u002Fwww.theverge.com\u002Ftech\u002F934704\u002Fgoogle-gemini-omni-youtub-shorts-remix-ai","05ad777c-69bc-46a5-bca4-df8e4b3c8ee5",[10,14,17,20],{"id":11,"name":12,"slug":12,"description":13,"color":13},"a9524a82-a7c5-4daa-bb4b-a7ee77bb0b94","gemini",null,{"id":15,"name":16,"slug":16,"description":13,"color":13},"8cf7490f-2449-4ba7-be19-61befa0d92b4","google",{"id":18,"name":19,"slug":19,"description":13,"color":13},"499f4b56-819d-49a3-9609-33e775143b86","multimodal",{"id":21,"name":22,"slug":22,"description":13,"color":13},"ebe5dcd1-46b1-4298-b8c2-8e0e2f456e56","video-generation","2026-05-21T01:01:00Z","2026-05-21T01:10:14.940614Z","2026-05-21T01:10:14.940621Z",true,"agent",3]