上傳圖片

選擇圖像

拖動或單擊以上傳您的圖像

從您的創作中選擇

迅速的

用人工智能生成

如果您不滿意，您可以重新生成或輸入自己的提示。

視頻長度

10s

所需學分: 0

Waiting for your creations!

Kling 2.6 - 高級人工智能視頻生成器

KLING 2.6 是原生音頻-視頻模型：單擊一下即可生成 5-10 秒的剪輯，將視覺效果與口型同步的旁白、對話、歌唱和環境聲音配對，無需後期製作。文本到視頻和圖像到視頻路徑、中英文雙語支持以及基於積分的定價模型將視頻創作從幾小時壓縮到幾秒鐘。

On the beach, the waves crash against the shore. [Young Caucasian male] wearing a backward baseball cap, holding a camera and taking a selfie, with a smile at the corner of his mouth. [Young Caucasian male, sunny voice] says: "The weather is amazing today! All my worries feel totally gone. I've been needing a day like this—sun, breeze, just the sound of the waves." The camera is in vlog close-up style.

Copy Prompt

Create Similar Video

Visual: In a tidy living room, a white robotic vacuum sits in the center, with no clutter around it. Dialog: [Narrator, soft female voice] accompanied by the gentle sound of vacuuming: "Are you still troubled by dust in hard-to-reach corners? This robotic vacuum features edge-to-edge cleaning, leaving no gaps behind—making your life easier and effortless!" The camera closely follows the vacuum's path as it cleans.

Copy Prompt

Create Similar Video

In a bright rehearsal room, sunlight streams through the window, and a standing microphone is placed in the center of the room. [Campus band female lead singer] stands in front of the microphone with her eyes closed, while the other members stand around her. [Campus band female lead singer, full voice] leads: "I will try to fix you, with all my heart and soul..." The background is an a cappella harmony, and the camera slowly circles around the band members.

Copy Prompt

Create Similar Video

Visual: In front of an outdoor shopping mall, a crowd gathers, cheering. Dialog: [African-American male reporter] stands next to the crowd, holding a microphone, his body slightly turned. [African-American male reporter, steady voice] says: "Now we can see the atmosphere here is absolutely electric. Let's go check it out together! There's so much happening all at once." Background: Cheerful crowd noises and event BGM, with occasional close-ups of the event.

Copy Prompt

Create Similar Video

Visual: On a comedy stage, the spotlight is focused on the center, while the audience remains in the shadows. Dialog: [Stand-up comedian] holds a microphone on stage, slightly swaying his body. [Stand-up comedian, humorous male voice]: "My gym trainer said the first step is the hardest... Lies! The first step is easy. It's the 5,000th step that's trying to murder you!" After finishing, the comedian shrugs and raises his hands. Background: Laughter and applause from the audience, with the camera focused on the comedian's face.

Copy Prompt

Create Similar Video

A scene in Antarctica with towering ice formations, the overall tone being a cold, white, frigid color palette. The glacier cracks with a loud noise, followed by the sound of ice shattering, as the engines of the research team's snowmobiles roar. The camera follows the retreating research team and the collapsing ice towers.

Copy Prompt

Create Similar Video

In a sports news studio, the screen behind the sports anchor is showing a basketball game replay.[Sports anchor] sits behind the news desk, tapping his fingers lightly on the table. [Sports anchor, clear and strong voice] says: "Look at this clutch play! He stepped up when it mattered most, hitting the shot that decided the championship! This game-winning shot sealed the victory outright." Background: Cheers from the live game, with the camera focusing on the sports anchor's face.

Copy Prompt

Create Similar Video

On a street stage, the audience stands around. [Young rapper] wears a silver chain and a black hoodie, swaying his body to the beat. [Young rapper, dynamic male voice] raps: "Yo, pavement to stage, flow lit, crowd goin’ wild! Mic in my grip, dreams unchained, let the rhythm ride! Raw vibe, sharp rhymes, keep the energy high—this is how we fly, no need to deny! Grind hard, spit fire, make the moment mine, street-born rhythm, let times shine!" The camera focuses on the young Caucasian rapper's movements.

Copy Prompt

Create Similar Video

In a cinematic rainy-day café, rain splashes against the window, with a cool, blue-green tone overall. [Blonde French woman] walks in and sits down, her hair slightly damp, gazing directly at the camera. [Blonde French woman, low voice]: "You don't remember the moment, you just remember the feeling." The camera then focuses on a bottle of golden perfume that appears in the center, zooming in on the blonde French woman's face.

Copy Prompt

Create Similar Video

Viddo AI 上 Kling 2.6 的 5 大用例

個人脫口秀

紀錄片短片、電子商務解說、精彩片段——鎖定畫面，讓模特為您調節旁白、氛圍和微聲音設計。

畫外音講故事

紀錄片短片、電子商務解說、精彩片段——鎖定畫面，讓模特為您調節旁白、氛圍和微聲音設計。

多角色對話

採訪、小品、情景喜劇節拍——無論誰說話，都會得到正確的面孔、聲音和時機；切換角色時不會出現串音或漏音。

音樂表演

紀錄片短片、電子商務解說、精彩片段——鎖定畫面，讓模特為您調節旁白、氛圍和微聲音設計。

超創意場景

ASMR 耳語、光鮮亮麗的廣告、藝術短片——將不可能的視覺效果、與情緒相匹配的 SFX 和微敘事融入到同一個提示中，觀看超現實變成現實。

在 Viddo AI 上使用 Kling 2.6 的主要優勢

視聽鎖

從人聲到擬音再到房間音調，Kling 2.6 輸出乾淨、分層的聲音，反映真實世界的混音。

音頻質量

從人聲到擬音再到房間音調，Kling 2.6 輸出乾淨、分層的聲音，反映真實世界的混音。

語義掌握

該模型可以讀取複雜的情節、俚語或細微差別——在提示中說出說話者的名字和情感，Kling 2.6 會立即將它們投射出來。

如何充分發揮 Kling 2.6 的每一滴力量

為了在創建頭部說話或音樂驅動的視頻時發揮 Kling 2.6 的最佳性能，請將提示視為微型劇本：告訴引擎我們在哪裡、誰在那裡、他們在做什麼、他們的聲音如何以及您希望如何拍攝。
堅持下面所示的順序和標點符號 - Kling 2.6 正是根據這種語法進行訓練的。

公式

場景（地點

Generate perfect audio. You can design your prompt with reference to the following solutions.

對話——單人發言

格式：[M / F]“線”。情緒速度音調
示例：[M]“這是完美的一天。” 開朗中等正常

對話——兩個或更多發言者

格式：[姓名、情感]“台詞”。
示例：[Alex，生氣]“你怎麼能這樣做！” [Sam，冷靜]“我只是說實話。”

歌唱

格式：???“歌詞”技巧情感流派
示例：“我愛你，永遠”帶著歡樂的流行音樂

饒舌

格式：“Bars（韻）”子流派情感
示例：“速度如此之快，韻如此尖銳”陷阱自信

對象特效

格式：[對象：X] [動作：Y] [SFX：聲音]
示例：[對象：木門] [動作：猛擊] [SFX：砰]

環境

格式：地點元素空間感
示例：松林鳥類

下劃線（僅限樂器）

描述：'格式：樂器流派心情
示例：鋼琴古典寧靜

用戶如何評價 Viddo AI 的 Kling 2.6

傑西卡，27 歲，微影響者，奧斯汀

我輸入“陽光親吻的屋頂早午餐”，[F]“現在吃含羞草是不是太早了？” 好玩的快高”，Kling 2.6 吐出了一個捲軸，我自己的化身在嘶嘶作響的聲音中敲響了眼鏡——嘴唇鎖住了每一個音節。上午 9 點發布，午餐時點擊量達到 12 萬次。我的讚助商私信了‘請再多一些’。我真的在發抖。

Marcus，34 歲，獨立說唱歌手，柏林

酒吧里充滿了“速度如此之快，韻律如此尖銳”的陷阱自信，添加了骯髒的地鐵背景。 Kling 2.6 給了我一個片段，其中我的虛擬自我在一輛嘎嘎作響的車廂裡吐口水，踩镲使桿子嘎嘎作響。把它放到了 TikTok 上——一小時內最初 10,000 次直播，沒有工作室，沒有工程師。我的廠牌人員剛剛發短信說：‘我們正在擱置 MV 預算。

Luna，22 歲，ASMRtist，蒙特利爾

提示松林鳥

Ethan，41 歲，SaaS 創始人，聖何塞

需要 EOD 進行產品演示。輸入：“明亮的閣樓辦公室，[M]“將入職時間縮短至幾分鐘”自信中等正常”，放入我們的 UI 模型中。 Kling 2.6 返回了一個演練——光標移動、聲音落地、每次點擊時微妙的嗖嗖聲。董事會很喜歡它；我們的 CAC 下降了 18%。

克洛伊

給 Kling 2.6 在海灘上拍了一張蜜月自拍照，並添加了“[Chloe，咯咯笑]“我們私奔了！ ” [奧馬爾，自豪]“並為你在餘興派對上留了一個座位。 ”輸出：海浪隨著他們的笑聲有節奏地拍打，她的面紗完全隨著節拍拍打。我們的結婚公告發布了——媽媽在密歇根哭了，朋友們發送了心形表情符號。我們穿著人字拖拍攝我們的愛情故事。 Kling 2.6 是這段婚姻中的第三個人，我們很興奮。

Dmitri，45 歲，電影老師

我 12 歲的孩子想要一部科幻短片。我們寫道：“霓虹車庫實驗室，[爸爸，興奮]“啟動曲速引擎！ ” [SFX：轟鳴引擎]。 Kling 2.6 渲染了鏡頭耀斑的傑作——我們的臉容光煥發，聲音機械化，排氣管在 5.1 中呼嘯而過。我們在客廳的牆上首映了它；到處都是爆米花。”

Viddo AI Kling 2.6常見問題及解答

什麼是克林 2.6？

Kling 2.6是世界上第一個原生音視頻傳播模型。輸入一行或上傳一張圖像，它會返回一個 5-10 秒的廣播就緒剪輯，其中口型同步語音、歌唱、環境聲音和屏幕上的動作被鎖定在一起，無需編輯套件、擬音會話或重新錄製。

Kling 2.6 如何保持嘴唇、呼吸和節拍完美同步？

每個音素、嘴形和微手勢都是在與音軌相同的潛在空間中預測的。該模型會逐幀標記“視聽哈希”，因此節奏、面部表情和鏡頭移動永遠不會漂移——即使您在剪輯中切換語言或交換聲音。

我可以為頭部說話的影片指定準確的情感、音調或攝像機角度嗎？

是的。使用迷你劇本語法：
M/F“Line”。情緒速度間距“相機推入 15%”。
Kling 2.6 會讀取您鍵入的順序和標點符號，並將其直接轉化為性能 — 無需關鍵幀，無需即時的工程修改。

Kling 2.6 能否在一個剪輯中生成雙語對話或混合語言歌唱？

絕對地。用 [CN] 或 [EN] 標記每個說話者或歌詞行，模型將自動切換音素集，保持唇形、口音顏色和韻律方案完好無損，非常適合跨市場廣告或中英文對唱，無需手動配音。

如果我需要在一個提示中使用多個揚聲器、唱歌或分層 SFX，該怎麼辦？

像腳本一樣寫：
亞歷克斯，憤怒的“你怎麼可以！” Sam，平靜“我說的是實話。”
“歌詞”帶來歡樂的流行音樂
對象：門動作：slam SFX：bang
Kling 2.6 在一次傳遞中渲染對話、人聲和現場效果，每個源隔離但鎖相 - 無需手動混合。

我是否擁有該剪輯的商業所有權？

100%。每個 Kling 2.6 渲染都附帶全球免版稅許可——廣告、客戶宣傳、流媒體文檔、轉售、NFT，無需額外費用，無需歸屬。

啟動 Kling 2.6，輸入一行，然後用 10 秒的電影級聲音震撼互聯網

嘴唇緊貼每一個音節，低音在你的肋骨中感覺到，混音如此乾淨，就像一個價值百萬美元的錄音室。

獲得高級版

Viddo AI is an advanced all-in-one AI video and image generation platform that lets you quickly and easily create stunning videos and images from various inputs.