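KLING 2.6 is a native audio-video model: one click generates a 5-10 second clip that pairs visuals with lip-synced narration, dialogue, singing, and ambient sound, with no post-production required. Text-to-video and image-to-video paths, bilingual Chinese and English support, and a credit-based pricing model compress video creation from hours to seconds.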
On the beach, the waves crash against the shore. [Young Caucasian male] wearing a backward baseball cap, holding a camera and taking a selfie, with a smile at the corner of his mouth. [Young Caucasian male, sunny voice] says: "The weather is amazing today! All my worries feel totally gone. I've been needing a day like this—sun, breeze, just the sound of the waves." The camera is in vlog close-up style.
Visual: In a tidy living room, a white robotic vacuum sits in the center, with no clutter around it. Dialog: [Narrator, soft female voice] accompanied by the gentle sound of vacuuming: "Are you still troubled by dust in hard-to-reach corners? This robotic vacuum features edge-to-edge cleaning, leaving no gaps behind—making your life easier and effortless!" The camera closely follows the vacuum's path as it cleans.
In a bright rehearsal room, sunlight streams through the window, and a standing microphone is placed in the center of the room. [Campus band female lead singer] stands in front of the microphone with her eyes closed, while the other members stand around her. [Campus band female lead singer, full voice] leads: "I will try to fix you, with all my heart and soul..." The background is an a cappella harmony, and the camera slowly circles around the band members.
Visual: In front of an outdoor shopping mall, a crowd gathers, cheering. Dialog: [African-American male reporter] stands next to the crowd, holding a microphone, his body slightly turned. [African-American male reporter, steady voice] says: "Now we can see the atmosphere here is absolutely electric. Let's go check it out together! There's so much happening all at once." Background: Cheerful crowd noises and event BGM, with occasional close-ups of the event.
Visual: On a comedy stage, the spotlight is focused on the center, while the audience remains in the shadows. Dialog: [Stand-up comedian] holds a microphone on stage, slightly swaying his body. [Stand-up comedian, humorous male voice]: "My gym trainer said the first step is the hardest... Lies! The first step is easy. It's the 5,000th step that's trying to murder you!" After finishing, the comedian shrugs and raises his hands. Background: Laughter and applause from the audience, with the camera focused on the comedian's face.
A scene in Antarctica with towering ice formations, the overall tone being a cold, white, frigid color palette. The glacier cracks with a loud noise, followed by the sound of ice shattering, as the engines of the research team's snowmobiles roar. The camera follows the retreating research team and the collapsing ice towers.
In a sports news studio, the screen behind the sports anchor is showing a basketball game replay. [Sports anchor] sits behind the news desk, tapping his fingers lightly on the table. [Sports anchor, clear and strong voice] says: "Look at this clutch play! He stepped up when it mattered most, hitting the shot that decided the championship! This game-winning shot sealed the victory outright." Background: Cheers from the live game, with the camera focusing on the sports anchor's face.
On a street stage, the audience stands around. [Young rapper] wears a silver chain and a black hoodie, swaying his body to the beat. [Young rapper, dynamic male voice] raps: "Yo, pavement to stage, flow lit, crowd goin’ wild! Mic in my grip, dreams unchained, let the rhythm ride! Raw vibe, sharp rhymes, keep the energy high—this is how we fly, no need to deny! Grind hard, spit fire, make the moment mine, street-born rhythm, let times shine!" The camera focuses on the young Caucasian rapper's movements.
In a cinematic rainy-day café, rain splashes against the window, with a cool, blue-green tone overall. [Blonde French woman] walks in and sits down, her hair slightly damp, gazing directly at the camera. [Blonde French woman, low voice]: "You don't remember the moment, you just remember the feeling." The camera then focuses on a bottle of golden perfume that appears in the center, zooming in on the blonde French woman's face.
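Documentary shorts, e-commerce explainers, highlight reels: lock the picture and let the model tune the narration, ambience, and micro sound design for you.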

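Interviews, skits, sitcom beats: whoever is speaking gets the right face, voice, and timing, with no crosstalk or dropped audio when the speaker changes.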

ASMR whispers, glossy ads, art shorts: fold impossible visuals, mood-matched SFX, and micro-narrative into a single prompt and watch the surreal become real.




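To get the best out of Kling 2.6 when creating talking-head or music-driven videos, treat the prompt as a miniature screenplay: tell the engine where we are, who is there, what they are doing, how they sound, and how you want it shot.
Stick to the order and punctuation shown below; Kling 2.6 was trained on exactly this grammar.
Scene (location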
Generate perfect audio. You can design your prompt with reference to the following solutions.
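Jessica, 27, micro-influencer, Austin
I typed "sun-kissed rooftop brunch", [F] "Is it too early for mimosas?" playful fast high, and Kling 2.6 spat out a reel of my own avatar clinking glasses over the fizz, lips locked to every syllable. Posted at 9 a.m., 120K views by lunch. My sponsor DM'd "more, please." I'm literally shaking.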
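Marcus, 34, independent rapper, Berlin
Fed it bars full of trap confidence, "speed so fast, rhymes so sharp", and added a gritty subway backdrop. Kling 2.6 gave me a clip of my virtual self spitting in a rattling train car, hi-hats shaking the poles. Dropped it on TikTok: the first 10,000 streams in an hour, no studio, no engineer. My label guy just texted: "We're shelving the MV budget."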
Luna, 22, ASMRtist, Montreal
Prompted pine forest birds
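Ethan, 41, SaaS founder, San Jose
Needed a product demo by EOD. Typed: "bright loft office, [M] "Cut onboarding to minutes" confident medium normal", and dropped in our UI mockups. Kling 2.6 returned a walkthrough: cursor moving, voice landing, a subtle whoosh on every click. The board loved it; our CAC dropped 18%.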
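Chloe
Gave Kling 2.6 a honeymoon selfie on the beach and added: [Chloe, giggling] "We eloped!" [Omar, proud] "And we saved you a seat at the after-party." Output: waves lapping in rhythm with our laughter, her veil flapping perfectly on the beat. Our wedding announcement went out: Mom cried in Michigan, friends sent heart emojis. We shot our love story in flip-flops. Kling 2.6 is the third person in this marriage, and we're thrilled.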
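Dmitri, 45, film teacher
My 12-year-old wanted a sci-fi short. We wrote: "Neon garage lab, [Dad, excited] "Engage the warp drive!" [SFX: roaring engine]". Kling 2.6 rendered a lens-flare masterpiece: our faces glowing, voices mechanized, exhaust howling in 5.1. We premiered it on the living room wall; popcorn everywhere.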
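Kling 2.6 is the world's first native audio-video diffusion model. Type one line or upload one image and it returns a 5-10 second broadcast-ready clip in which lip-synced speech, singing, ambient sound, and on-screen action are locked together, with no editing suite, Foley session, or re-recording required.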
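Every phoneme, mouth shape, and micro-gesture is predicted in the same latent space as the audio track. The model stamps an "audio-visual hash" on every frame, so rhythm, facial expression, and camera movement never drift, even if you switch languages or swap voices mid-clip.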
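Yes. Use the mini-script syntax:
M/F "Line". Emotion speed pitch "camera push-in 15%".
Kling 2.6 reads the order and punctuation you type and translates them directly into performance: no keyframes, no prompt-engineering workarounds.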
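As an illustration only, the mini-script order above can be assembled with a small helper. This is a minimal sketch: `VoiceLine` and its field names are hypothetical and not part of any Kling tooling; it only formats a string in the order the syntax line shows (speaker, quoted line, emotion, speed, pitch, quoted camera note).

```python
from dataclasses import dataclass


@dataclass
class VoiceLine:
    """One spoken line in the mini-script order (hypothetical helper)."""
    speaker: str   # "M" or "F", as in the syntax line above
    line: str      # the words to be spoken
    emotion: str   # e.g. "playful"
    speed: str     # e.g. "fast"
    pitch: str     # e.g. "high"
    camera: str    # e.g. "camera push-in 15%"

    def render(self) -> str:
        # Reject anything other than the two speaker markers the syntax names.
        if self.speaker not in ("M", "F"):
            raise ValueError("speaker must be 'M' or 'F'")
        # Keep the order and punctuation: quoted line, period, delivery, quoted camera note.
        return (
            f'{self.speaker} "{self.line}". '
            f'{self.emotion} {self.speed} {self.pitch} "{self.camera}".'
        )


prompt = VoiceLine(
    speaker="F",
    line="Is it too early for mimosas?",
    emotion="playful",
    speed="fast",
    pitch="high",
    camera="camera push-in 15%",
).render()
print(prompt)
```

Keeping the fields separate makes it harder to drop a piece of punctuation or swap the order by accident when many lines are written by hand.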
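Absolutely. Tag each speaker or lyric line with [CN] or [EN] and the model switches phoneme sets automatically, keeping lip shape, accent color, and rhyme scheme intact. That makes it a good fit for cross-market ads or Chinese-English duets, with no manual dubbing.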
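For longer bilingual scripts, the tagging step could be automated. The sketch below is an assumption-laden illustration: `language_tag` and `tag_lines` are hypothetical helpers, the tag is assumed to go at the start of each line, and the detection is a simple heuristic that treats any line containing a CJK ideograph as Chinese.

```python
def language_tag(text: str) -> str:
    """Return "[CN]" if the text contains any CJK ideograph, else "[EN]"."""
    has_cjk = any("\u4e00" <= ch <= "\u9fff" for ch in text)
    return "[CN]" if has_cjk else "[EN]"


def tag_lines(lines: list[str]) -> str:
    """Prefix every line with its language tag, one line per row."""
    return "\n".join(f"{language_tag(line)} {line}" for line in lines)


duet = tag_lines(["I will try to fix you", "用我全部的心和靈魂"])
print(duet)
```

A line that mixes both languages is tagged [CN] under this heuristic, so mixed lines are better split by hand before tagging.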
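Write it like a script:
Alex, angry "How could you!" Sam, calm "I'm telling the truth."
"Lyrics" joyful pop
Object: door Action: slam SFX: bang
Kling 2.6 renders dialogue, vocals, and live effects in a single pass, each source isolated yet phase-locked, with no manual mixing.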
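The three script-style line types above (dialogue, lyrics, sound effect) can be produced by tiny formatting functions. This is a hedged sketch: the function names are hypothetical, and joining the pieces with spaces is an assumption, since the page does not say how the lines should be combined in one prompt.

```python
def dialogue(name: str, mood: str, text: str) -> str:
    """Format a spoken line as: Name, mood "text"."""
    return f'{name}, {mood} "{text}"'


def song(lyrics: str, style: str) -> str:
    """Format a sung line as: "lyrics" style."""
    return f'"{lyrics}" {style}'


def sfx(obj: str, action: str, sound: str) -> str:
    """Format a sound effect as: Object: ... Action: ... SFX: ..."""
    return f"Object: {obj} Action: {action} SFX: {sound}"


# Assumption: the parts are joined with single spaces into one prompt.
script = " ".join([
    dialogue("Alex", "angry", "How could you!"),
    dialogue("Sam", "calm", "I'm telling the truth."),
    sfx("door", "slam", "bang"),
])
print(script)
```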
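100%. Every Kling 2.6 render ships with a worldwide royalty-free license: ads, client pitches, streaming documentaries, resale, NFTs, with no extra fees and no attribution required.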
