Building YouTube Shorts With Runway ML and Suno, End-to-End For Indian Channels
Script to published short, with original soundtrack, all on a Rs 2,500/month budget

YouTube Shorts is currently the fastest-growing content format, and AI tools have dropped production cost 80-90%. This tutorial gives an Indian content creator an end-to-end workflow, even if you don't have your own camera setup.
Tech Stack (Monthly Budget)
| Tool | Use | Cost |
|---|---|---|
| ChatGPT Go | Script writing | Rs 0 (till Dec 2026) |
| Runway ML | Video generation | Rs 1,250/mo (Standard) |
| Suno AI | Music | Rs 830/mo (Pro) |
| CapCut | Editing | Rs 0 (free tier) |
| Thumbnails | Canva Pro | Rs 420/mo |
| Total | ~Rs 2,500/mo |
You can comfortably produce 30-60 Shorts a month on this budget.
Step 1, Script With ChatGPT
A Short is a 15-30 second hook + 30-60 second payoff. Template:
Write a 45-second YouTube Shorts script about [TOPIC].
Structure:
- Hook (first 3 seconds): a surprising fact or question
- Body (15 sec): main content, 3 key points
- CTA (last 5 sec): specific action ("Subscribe," "Follow for part 2")
Audience: Indian viewers aged 18-28, mobile-first
Tone: conversational, high energy
Include: 2-3 visual cues describing what to show
Iterate two or three times. Lock the final script before moving to production.
Step 2, Storyboard (Quick Sketch)
Shorts usually need 5-8 shots. Extract visual cues from the script:
Shot 1 (0-3s): Close-up of Indian street food vendor
Shot 2 (3-8s): Wide of bustling market
Shot 3 (8-15s): Macro shot of chai pouring
...
Step 3, Video Generation (Runway ML Gen-4)
Runway Gen-4 is currently the best option for short-form clips. 10-second clips maximum (V4 limit).
Prompt structure:
[CAMERA TYPE] [SHOT TYPE] of [SUBJECT],
[SETTING/ENVIRONMENT],
[LIGHTING],
[CAMERA MOVEMENT],
[MOOD/STYLE]
Example:
35mm macro close-up of hot chai being poured into a glass tumbler,
Indian street tea stall at dusk,
warm golden hour lighting with steam visible,
slow downward tilt,
documentary style, nostalgic mood
Key prompting principles:
- Camera specificity: "35mm," "wide," "drone shot", shot type matters
- Lighting is mandatory: "golden hour," "overhead fluorescent," "neon night", define the ambient lighting
- Explicit motion: "slow pan left," "handheld shaky," "smooth dolly"
- Avoid text in video: Runway is weak at text, add titles in CapCut
Alternatives
- Pika: faster, cheaper (~Rs 830/mo); stylised animation is better
- Sora (OpenAI): API access now available; premium quality but expensive
- Luma Dream Machine: generous free tier
Step 4, Music With Suno AI
Suno can generate original music in Hindi, English, or a mix.
Prompt For A Background Score
[Instrumental, no lyrics]
Upbeat Bollywood-influenced background music,
tabla and synth fusion, 120 BPM,
energetic for a YouTube Shorts intro,
30 seconds length
Prompt For A Vocal Track
[Style: Hindi indie pop]
Lyrics in Hindi about [topic]:
[paste 4-line hook]
Vocals: young male, soft indie tone
Instruments: acoustic guitar, light percussion, subtle strings
Mood: hopeful, modern
Length: 1 minute
Suno generates two versions per prompt. Download the best one.
License note: Suno Pro allows commercial use. The free tier does not.
Step 5, Edit In CapCut
CapCut's free tier is plenty for Shorts:
- Import all Runway clips (crop to vertical 9:16)
- Add the Suno music as background
- Voiceover via ElevenLabs (free tier) or your own voice
- Captions, CapCut auto-captions Hindi too (Indian-language update shipped)
- Transitions, keep it minimal, two or three types max
- Colour grade, apply one LUT for a consistent look
Shorts-specific settings:
- Aspect ratio: 9:16 (vertical)
- Duration: 45-60 sec is the sweet spot
- First 3 seconds: the HOOK, retention drops sharply in the first 3s
Step 6, Thumbnail (Canva Pro)
Thumbnails matter less for Shorts (auto-scroll feed), but they show up on your channel page:
- Use Canva Pro's free templates
- Bright colour + big, contrasted text
- Face close-up (yours, or generated via Midjourney)
Step 7, Upload Strategy
YouTube best practices for Shorts:
- Title: 40-60 characters, include the keyword
- Description: 3-4 lines, a mix of Hindi and English
- Tags: 5-10 relevant (
#Shortsis mandatory) - Hashtags in title: max 3
- Post consistently: daily beats sporadic
Timing (Indian audience):
- Prime slots: 8-10 AM or 7-10 PM IST
- Weekends beat weekdays
Content Ideas That Work
- Tech explainers in 60 seconds ("Claude vs ChatGPT, the real difference" style)
- Life hacks in Hindi
- Indian street-food stories
- Cricket / IPL analysis
- Movie reviews in a 3-bullet format
- AI tool reviews, your AutoKaam content source!
Scale Tactics
Once this workflow is working, add automation:
- Batch-generate 10 scripts per week with ChatGPT
- Queue 10 Runway clips for batch generation
- Sunday edit marathon, five Shorts in one sitting
Consistent posting gets algorithm lift. Hit 100 Shorts and you're eligible for YPP monetisation.
End-To-End Pipeline, Step By Step
A working day on this stack looks like one hour of focused work, max Rs 150 in tool spend per Short. Time and cost per stage:
| # | Stage | Time | Cost |
|---|---|---|---|
| 1 | Topic ideation from trending Twitter and Indian news feeds | 10 min | Free |
| 2 | Script in Claude or ChatGPT, locked after two iterations | 5 min | ~Rs 5 |
| 3 | Hindi voiceover, Bhashini API or ElevenLabs Hindi | 10 min | Free or ~Rs 50 |
| 4 | Visuals via Runway image-to-video or Krea Realtime | 15 min | ~Rs 100 |
| 5 | Music via Suno, two prompts, pick better cut | 5 min | Free (50/mo) |
| 6 | Shotcut or CapCut edit, captions, transitions, LUT | 15 min | Free |
| 7 | Thumbnail in Canva or Gamma, then upload | 5 min | Free |
Total: roughly one hour, ~Rs 150 max per Short. Five Shorts a day is realistic if topic ideation runs in batches and you queue Runway generations while editing the previous one. The bottleneck is review and rejection, not generation. Budget two extra Shorts as cuttable rejects per session.
Hindi Voiceover Options, Bhashini vs Premium
The voice layer is where AI Shorts most often sound robotic. Three real options for Indian creators in 2026:
- Bhashini API (Indian government, free): five Hindi voices, neutral pronunciation, fine for explainer content where the voice does not need charisma. No commercial-use friction. The endpoint accepts SSML so you can control pacing for hooks.
- ElevenLabs Hindi (premium, ~Rs 800/mo for ~30K characters): noticeably better expressiveness, breath sounds, paralinguistic emphasis. The free tier ships 10K characters per month. For five daily Shorts at 60 seconds each, you bill roughly 4,500 characters per month, so the free tier just covers it. The Creator plan ($22/mo, ~Rs 1,830) buys 100K characters and voice cloning if you want a recognisable channel host.
- Sarvam (Indian, free tier, paid scale): better Hindi intonation than Bhashini in our tests, including code-mixed numbers like "saath hazaar paanch sau" rendered cleanly. Use Sarvam if you ship in regional Indian languages too, since it covers Tamil, Telugu, Marathi, Bengali in the same key.
Pick Bhashini if you do not care about voice charisma and want zero ongoing cost. Pick ElevenLabs if your channel identity rests on the voice and you can afford the Creator tier. Avoid mixing voices across a series, since YouTube's algorithm sometimes treats voice changes as a different creator and resets watch-time signals.
Monetization Reality, India Shorts Fund
YouTube replaced the original Shorts Fund in February 2023 with an ad-revenue share model. Creators in the YouTube Partner Programme now earn from ads served between Shorts in the feed. The hard threshold is 1,000 subscribers plus 10 million Shorts views in the trailing 90 days, then channels can apply.
The Indian RPM is the painful part. Typical Indian Shorts RPM in 2026 published creator reports lands at Rs 0.40 to Rs 0.80 per 1,000 Shorts views, well below long-form video. So 1 million Shorts views earns Rs 400 to Rs 800. A channel that ships 30 Shorts a month with average 50K views per Short and one viral 1M-view spike clears roughly Rs 5,000 to Rs 15,000 monthly from ad share alone. Rounded numbers, real channels, three-month grind to threshold.
The bigger lever is everything outside ad revenue:
- Affiliate links in pinned comments and channel-page links: cheap to add, scales with watch volume. Amazon India Associates, Flipkart, Hostinger and BigRock for tech.
- Brand integrations: Indian D2C brands now pay Rs 5,000 to Rs 50,000 per integrated Short for channels with 50K+ subs, depending on niche.
- Course or product pull-through: a Short that funnels viewers to your AutoKaam-style site or a Razorpay-billed course outearns ad share by 5-10x.
If the goal is income, ad revenue is the floor and affiliates plus brand work are the ceiling. Plan accordingly from Short one.
Avoiding YouTube's AI Detection And Demonetization
YouTube does not blanket-ban AI-assisted content. The 2024 policy clarified that disclosure (the "altered or synthetic content" toggle) is required for realistic AI-generated faces and voices, but explainer Shorts with stock visuals and AI voiceover are fine when disclosed. The actual demonetization risk sits inside the Reused Content category, which targets channels with low-effort, repetitive, mass-produced uploads.
Patterns that get flagged:
- Identical music bed plus identical voice plus identical visual style across 100 consecutive Shorts.
- Auto-generated text-on-stock-footage with no narrative arc.
- Near-duplicate Shorts where only the topic word changes.
- AI-generated thumbnails that recycle the same composition daily.
Patterns that survive review:
- Mix in your own voice clips, even short bumpers, once per five Shorts.
- Custom thumbnails for at least the channel-page rotation; do not let Canva templates dominate.
- One human appearance per ten Shorts, even if it is a cold open or sign-off.
- Vary music genre and BPM across the catalog, not the same Suno preset.
- Build a clear human host on the channel page, with a real bio and at least one long-form video that introduces the channel's point of view.
The channel page is what reviewers see when manually checking flagged accounts. A page that reads like a person, with consistent uploads, an About section, a thumbnail set that suggests editorial intent, gets the benefit of the doubt. A page that reads like a content factory does not.