Crafting experience...
3/8/2026
Built At
HuddleHive's WIT Hackathon #5
Hosted By
Content creators—from TikTok enthusiasts to YouTube Shorts makers—face massive friction when creating short-form videos. The current workflow forces users to:
Record video (native camera app or external recorder)
Pick music (Spotify, Apple Music, or generative tools)
Edit and sync (CapCut, Adobe Premiere Rush, or similar)
Add effects/trim (additional editing software)
Publish (platform-specific apps)
This means opening 4-5 different applications, learning each interface, and context-switching constantly. A creator who could ideate and ship in 5 minutes now loses 25+ minutes to app-hopping. For casual creators, this friction kills momentum. For content creators at scale, it's a productivity drain.
CleanShort is a unified, one-stop tool that compresses the entire short-form creation pipeline into four simple steps:
Import raw video — User uploads their video file (no manual generation, just their content)
Select Vibe — Gemini analyzes the entire video file, understanding motion, composition, pacing, and mood, then recommends music vibes via Lyria integration based on beats and emotional tone
Trim/Edit — Integrated editing interface (leveraging CapCut integration) for quick adjustments
Ready to Share — Publish directly in-app or export to device/social platform
The core innovation: Gemini as the intelligent intermediary. Instead of creators manually hunting for the "right" song, Gemini understands what their video is and recommends vibes that actually fit. Lyria then generates original, royalty-free music to those specs.
User Journey:
Creator opens CleanShort with a raw video file (shot on their phone, GoPro, or any device)
Gemini processes the entire video contextually—analyzing visual rhythm, emotional beats, scene composition, motion density, color palette, and pacing
Based on that analysis, Gemini recommends 3-5 music "vibes" (e.g., "high-energy electronic for fast cuts," "moody indie for slow reveals")
Creator picks a vibe → Lyria generates original music to that spec
Creator trims/fine-tunes in the built-in editor
Result: A polished, ready-to-share 10-second short in under 5 minutes
Why Gemini Matters Here: Gemini isn't just analyzing metadata. It's reading the content—understanding narrative flow, emotional arc, visual intensity. This makes recommendations contextual and useful, not random.
Integration Architecture:
Gemini API — Video analysis, vibe recommendation engine
Lyria 3 — Music generation based on recommended vibe
CapCut Integration — Trim/edit UI (leveraging existing, trusted workflow)
Publishing Layer — Direct in-app publish or export to device/socials
What We Built:
Full four-step pipeline functional in AI Studio
Gemini video analysis working: correctly identifies mood, pacing, and recommends matching vibes
Integration with Lyria for vibe-based music generation
Trim/edit and publish/export workflows live
Still Being Refined:
Audio playback/sync within the app (in progress; core functionality shipping post-hackathon)
Cross-platform responsiveness (currently optimized for mobile)
Key Learnings:
Gemini's context understanding is the differentiator — The real value isn't speed; it's that Gemini "gets" what the creator is making and recommends accordingly. This is what separates CleanShort from just stringing APIs together.
Users want recommendations, not automation — Early testing showed creators liked curated options (3-5 vibes to pick from) rather than auto-generated final cuts. Control + simplicity wins.
Integration, not replacement — Creators already love CapCut for editing. CleanShort works with their tools, not against them. This lowers friction to adoption.
Immediate (Post-Hackathon):
Polish Lyria audio integration (sync playback, preview in-app)
Add user preference learning (Gemini learns creator's style over time)
Mobile-first UI refinement
Medium Term:
Desktop web version
Integration with TikTok, Instagram, YouTube Shorts APIs for direct publish
Advanced features: multi-clip assembly, auto-captioning via Gemini, style presets
Long Term:
Potential B2B: brands and content agencies using CleanShort for rapid asset generation
CleanShort isn't just a productivity tool—it's a creative enabler. By removing friction, it lets more people make instead of struggle. And by putting Gemini's intelligence at the heart of the recommendation engine, it shows what AI can do when it understands context: not replace creativity, but amplify it.
Proof of concept: A/B tested with the raw footage from our scavenger hunt prototype. Manual method (2 phones, 4-5 apps, 25+ minutes) vs. CleanShort (4 steps, one tool, under 5 minutes). Same quality output. Fraction of the friction.