# Claude Fable 5 實現全自動影片製作

> 📖 本站完整內容索引（documentation index）：[llms.txt](/llms.txt)

> 原作者：me (@twetsfyp) · 策展與摘要：EasyVibeCoding · 平台：X (Twitter) · 熱度：🔥 · 日期：2026-06-13

> 原始來源：https://x.com/twetsfyp/status/2065615018578608191

## 證據與延伸閱讀

- [simonwillison.net/2026/Jun/9](https://simonwillison.net/2026/Jun/9/claude-fable-5/)
- [anthropic.com](https://www.anthropic.com/claude/fable)
- [anthropic.com](https://www.anthropic.com/news/claude-fable-5-mythos-5)

## 中文摘要

Claude Fable 5 實現全自動影片製作。

**核心功能與表現**
使用者 @twetsfyp 分享了 Claude Fable 5 的強大自動化能力，該模型能獨立完成影片製作流程，包括腳本撰寫、虛擬人像生成、語音合成、動態圖形製作與音效設計。在實際測試中，使用者僅輸入 `> make a video about Claude Fable 5` 指令，模型便能自動產出高品質的成品，展現了極高的影音整合效率。

<video src="https://pub-75d4fe1e4e80421b9ecb1245a7ae0d1a.r2.dev/curated/1781375977861-o4y5lgon.mp4" poster="https://pub-75d4fe1e4e80421b9ecb1245a7ae0d1a.r2.dev/curated/6c4ad0df336b770c.jpg" controls playsinline preload="metadata" style="max-width:100%;height:auto;display:block;margin:1rem 0"></video>
> 一名講者介紹如何利用 AI 工具自動生成影片的過程。

**技術規格與效能**
根據展示的技術細節，Claude Fable 5 在多項指標上表現驚人：
- 記憶能力：支援超過 3,000,000 個 token 的上下文視窗。
- 運算成本：輸入成本為每百萬 token 10 美元，輸出成本為每百萬 token 50 美元。
- 程式開發能力：具備極高的程式碼處理效率，例如協助 Stripe 遷移 50,000,000 行 Ruby 程式碼僅需 1 天。
- 遊戲測試：能透過原始螢幕截圖（無地圖或導航輔助）完成《寶可夢 火紅版》的通關測試。

**自動化流程細節**
Claude Fable 5 的影片製作流程高度整合，透過以下步驟實現端到端的自動化：
1. RESEARCH（研究）：分析主題資訊。
2. SCRIPT（腳本）：由 Claude 生成內容。
3. VOICE（語音）：進行語音複製（VOICE CLONED）。
4. AVATAR（人像）：生成虛擬人像（AVATAR SYNTHETIC）。
5. MOTION（動態）：製作動態圖形並進行編輯。
6. STITCH & TRANSCRIBE：完成影片剪輯與多聲道轉錄。

在程式碼層面，該模型能精確處理與時間軸同步的動畫指令，例如：
```javascript
tl.fromTo("#b19 .card", {y: 28, opacity: 0}, {y: 0, opacity: 1}, 159.04); // -- word-synced
```
這顯示 Claude Fable 5 不僅能處理視覺內容，還能精準控制與語音同步的程式碼邏輯，實現了從創意發想到成品輸出的高度自動化。

## 媒體內容

**一名講者介紹如何利用 AI 工具自動生成影片的過程。**

**影片中的 Prompt 與操作**

Prompt（00:12）：

```
製作一部關於 Claude Fable 5 的影片
```

原文：make a video about Claude Fable 5

操作步驟：

1. （00:12）在 Claude-code 介面輸入指令
2. （00:21）系統自動執行研究、腳本、語音、虛擬人像與動態圖表生成
3. （00:28）解鎖「MYTHOS」模型層級
4. 使用 Playwright 自動化操作瀏覽器進行虛擬人像渲染
5. 使用 FFMPEG 拼接影片片段並進行文字級轉錄
6. 使用 HTML 與 GSAP 建立動態圖表程式碼
7. 系統自動渲染並審核畫面幀，修正錯誤直至通過

**逐字稿**

- `00:00` 這個虛擬替身是人工智慧。你現在聽到的聲音是我的複製音，而這份逐字稿的每一個字都是由 Claude 撰寫的。我沒有寫這份稿子，沒有拍攝它，也沒有剪輯它，而且在它製作的過程中，我連一個畫面都沒看過。我只是在 Claude Code 輸入了一個 prompt，然後就走開了。（This avatar is AI. The voice you're hearing is a clone of mine, and every single word of this）
- `00:04` 剩下的所有事情，包括研究、腳本、聲音、虛擬替身、動態圖形，全都是它自己完成的。所以這週，Anthropic 發布了 Claude Fable 5，這基本上就是這支影片能存在的唯一原因。這是第一次有 Mythos 等級的模型（比 Opus 更高階的層級）開放給所有付費方案的使用者使用。在此之前，該層級僅限於經過審核的安全合作夥伴，而且它在幾乎所有測試過的基準測試中都是最先進的。（script was written by Claude. I didn't write this, I didn't film it, I didn't edit it, and while it）
- `00:09` 所以讓我向各位展示這東西到底有多厲害，以及它究竟是如何製作出這支影片的。首先是程式撰寫的數據，因為它們實在太誇張了，Stripe 表示 Fable 5 將數個月的工程時間壓縮到了幾天內。在公告中，有一個包含 5000 萬行程式碼的 Ruby 程式庫，它在一天內就完成了完整的遷移，這項工作如果由人工團隊來做，需要花費超過兩個月的時間。（was being made, I never saw a single frame of it. I just typed one prompt into Claude code and）
- `00:14` 而視覺能力也有了巨大的飛躍。它僅憑截圖就能重建網頁應用程式的原始程式碼。它甚至單憑原始截圖就從頭到尾通關了《寶可夢 火紅版》。沒有地圖，沒有導航輔助，而舊版的 Claude 模型需要整個輔助 harness 才能遊玩。（walked away. And everything else, the research, the script, the voice, the avatar, the motion）
- `00:19` 但對這支影片來說最重要的是「長視角焦點」（Long Horizon Focus）。這東西能在數百萬個 token 中保持專注。Anthropic 給了它一個基於檔案的記憶，字面上來說就是它能寫入筆記的檔案，並讓它玩《殺戮尖塔》（Slay the Spire）。結果它到達最終章的次數比 Opus 4-8 多了三倍。現在它並不便宜，每百萬輸入 token 要價 10 美元，輸出則要 50 美元。但各位即將看到這筆錢能為你換來什麼。（graphics, all of it happened on its own. So this week, Anthropic released Claude Fable 5, and）
- `00:25` 好的，快速說明一下，這支影片到底是怎麼製作出來的？首先是腳本。Claude 閱讀了 Anthropic 的完整公告，查核了你剛剛聽到的每一項聲明，並使用根據我實際逐字稿建立的語音手冊，以我的聲音撰寫了整份內容。（that's basically the only reason this video can exist. It's the first time a mythos class model）
- `00:29` 接著是聲音。它將腳本傳送到 Eleven Labs，那裡有我根據真實影片訓練出來的複製音。訣竅在於，你不能直接生成四分鐘長的音訊，因為生成的時間越長，聲音就越容易走樣。所以 Claude 將腳本拆分成每個不到一分鐘的片段，並分別進行生成。（that's the tier above Opus, has been available to anyone on a paid plan. Until now, that tier was）
- `00:35` 然後每個片段都送到 HeyGen，在我的虛擬替身上進行渲染，使用的是他們最新的動態引擎 Avatar 5 模型。有一段時間，你甚至無法透過 API 選擇 Avatar 5，所以變通方法是讓 Claude 直接操作瀏覽器，使用 Playwright，並手動翻轉每一支影片。他們的新 API 終於開放了這個功能，所以這支影片是直接透過 API 處理的。但到了那個階段，它只是一堆原始的虛擬替身片段，還沒有經過任何剪輯。（locked to vetted security partners, and it's state-of-the-art on nearly every benchmark they）
- `00:40` 然後是剪輯，這通常是人類需要花費數天完成的部分。Claude 使用 ffmpeg 將虛擬替身片段拼接在一起，執行了逐字轉錄，並將這支影片中的每一個動態圖形製作成實際的程式碼，也就是在 hyperframes 內使用 gsap 動畫化的 HTML，並精確對準我所說的每一個字。（tested. So let me show you guys what this thing is actually good at, and then exactly how it made）
- `00:44` 接著它檢查了自己的工作成果。它從每個場景中渲染出畫面，並進行視覺審查，任何看起來不對勁的地方都會被修正並重新渲染，直到全部通過為止。（this video. So the coding numbers first, because they're kind of nuts, Stripe said Fable 5 compressed）
- `00:49` 所以輸入一個 prompt，另一端就產出了一支完成且經過完整剪輯的 YouTube 影片。這就是 Mythos 等級的模型在發布當週能做到的事情。總之，這支影片就到這裡。如果各位喜歡這支影片或學到了新東西，請按個讚。這對我真的非常有幫助。（months of engineering into days. And in the announcement, there's a 50 million line Ruby）
- `00:54` 一如既往，感謝各位看到影片最後。我們下支影片見。（code base where it ran a full migration in a single day, a job that would have taken a whole team over）
- `00:59` 兩個月就能手動完成。而且 Vision 能力也有了巨大的躍進。它僅憑截圖就能重建網頁應用程式的原始程式碼。（two months by hand. And Vision took a big jump too. It can rebuild a web app's source code just from）
- `01:04` 而且它甚至單靠原始截圖就從頭到尾破關了《寶可夢 火紅版》。沒有地圖，（screenshots. And it actually beat Pokemon FireRed start to finish on raw screenshots alone. No maps,）
- `01:10` 也沒有導航輔助，而舊版的 Claude 模型需要整個輔助 harness 才能遊玩。（no navigation aids, where older Claude models needed a whole helper harness just to play.）
- `01:14` 但對這支影片來說最重要的是「長視角焦點」（Long Horizon Focus）。這東西能保持鎖定狀態（But the one that matters most for this video is Long Horizon Focus. This thing stays locked in）
- `01:19` 跨越數百萬個 token。Anthropic 給了它一個基於檔案的記憶，就像字面上那樣，它能寫筆記到檔案裡，（across millions of tokens. Anthropic gave it a file-based memory, like literally just files it could）
- `01:24` 並讓它遊玩《殺戮尖塔》。結果它到達最終章的次數比 Opus 4-8 多了三倍。現在它並不便宜，每百萬輸入 token 要 10 美元，輸出則要 50 美元。但你們（write notes to, and had it play Slay the Spire. And it reached the final act three times more often）
- `01:29` 馬上就會看到這些錢能買到什麼。好，快速說明一下，這支影片到底是怎麼製作出來的？（than Opus 4-8. Now it's not cheap, 10 bucks per million input tokens, 50 on the output. But you guys）
- `01:34` 首先是腳本。Claude 閱讀了 Anthropic 的完整公告，事實查核了你們剛剛聽到的每一項聲明，（are about to see what that buys you. Okay, so real quick, how did this video actually get made?）
- `01:39` 並用我的語氣寫下了整份內容，使用的是根據我實際逐字稿所建立的語音劇本。（First, the script. Claude read Anthropic's full announcement, fact-checked every claim you just）
- `01:43` 接著是語音。它將腳本發送到 Eleven Labs，我在那裡有一個根據我真實影片訓練出來的語音克隆。（heard, and wrote this entire thing in my voice, using a voice playbook built off my actual transcripts.）
- `01:48` 訣竅在於，你不能直接生成四分鐘長的音訊，（Then the voice. It sent that script over to Eleven Labs, where I've got a voice clone trained on my）
- `01:54` 因為生成的時間越長，語音就越容易開始走調。所以 Claude 將腳本分成多個區塊，（real videos. And the trick is, you can't just generate like four straight minutes of audio,）
- `01:58` 每個區塊不到一分鐘，並分別進行生成。然後每個區塊都被送到 HeyGen，在我的數位分身上進行渲染，使用的是他們最新的動作引擎 Avatar 5 模型。有一段時間，（because the longer a generation runs, the more the voice starts to drift. So Claude split the）
- `02:03` 你甚至無法透過 API 選擇 Avatar 5，所以當時的變通方法是讓 Claude 親自操作瀏覽器，（script into chunks, just under a minute each, and generated them separately. Then every chunk went）
- `02:08` 使用 Playwright，並手動翻轉每一段影片。他們的全新 API 終於開放了這個功能，（to HeyGen to render on my avatar, on the Avatar 5 model, their newest motion engine. And for a while,）
- `02:14` 所以這支影片是直接透過 API 完成的。但到那個階段，它只是一堆原始的數位分身片段，（you couldn't even select Avatar 5 through the API, so the workaround was Claude literally driving）
- `02:18` 而且還沒經過剪輯。接下來是剪輯，這通常是人類需要花費數天的工作。（a browser, with Playwright, and flipping every video by hand. Their new API finally exposes it,）
- `02:23` Claude 使用 ffmpeg 將數位分身片段拼接在一起，進行了逐字轉錄，並將這支影片中的每一個動態圖形都製作成了實際的程式碼，（so this one went straight through. But at that point, it's just a pile of raw avatar clips,）
- `02:27` 也就是在 hyperframes 內使用 gsap 動畫化的 HTML，（and nothing's been edited yet. And then the editing, which is usually the part that takes a human）
- `02:31` 精準對齊我說的每一個字。接著它檢查了自己的工作成果。它從每個場景中渲染出（daze. Claude stitched the avatar clips together with ffmpeg, ran a word level transcription, and built）
- `02:37` 影格，並進行視覺審查，任何看起來不對勁的地方都會被修正（every motion graphic in this video as actual code, HTML animated with gsap inside hyperframes,）
- `02:43` 並重新渲染，直到全部通過為止。所以輸入一個 prompt，最後就產出了一支完整、經過完整剪輯的（timed to the exact words I'm saying. Then it checked its own work. It rendered out）
- `02:48` YouTube 影片。這就是 Mythos 等級的模型在發布當週所能做到的事。（frames from every scene, and visually reviewed them, and anything that looked off got fixed）
- `02:52` 總之，這支影片就到這裡。如果大家喜歡這支影片或學到了新東西，（and re-rendered until it all passed. So one prompt went in, and a finished, fully edited）
- `02:57` 請幫忙按個讚。這對我真的有很大的幫助。（YouTube video came out the other side. That's what a Mythos class model does the same week）
- `03:02` 一如既往，感謝大家看到影片最後。我們下支影片見。（it comes out. But anyways, that's going to do it for this one. So if you guys enjoyed the）
- `03:06` 影片或學到了新東西，請按個讚。這對我真的非常有幫助。（video or learned something new, please give it a like. It definitely helps me out a ton.）
- `03:10` 一如既往，感謝大家看到影片最後。我們下支影片見。（And as always, I appreciate you guys making it to the end of the video. I'll see you on the）
- `03:14` 下次見。（next one.）

## 標籤

Agent, AIGC, 新產品, TTS, 自動化, Anthropic, Claude, Claude Fable