Clean Visuals Studio: Product Image Cleanup + AI Voiceover Videos Using Cleanup.pictures & Synthesys
Category: Industry Trends
Last Updated: February 2, 2026 | Playbook: photo cleanup + AI voiceover video studio for ecommerce & local businesses
1 · Where sellers are quietly losing money
Most small sellers shoot on their phone in bad lighting. They know the photos look amateur, but getting a pro shoot feels like $500–$1,000+ they don’t have—especially for lower‑priced products or experimental offers.
Product videos sound complicated: talking on camera, editing, audio, B‑roll, text overlays. So they never try—or they put up a shaky, 2‑minute clip that actually hurts trust.
On marketplaces, buyers compare visuals before they read text. If the competitor has clean, bright images and a short video walkthrough, they’ll win—even if your product is better.
They’re running a shop, packing orders, dealing with customers. Learning pro tools is unrealistic. They try free apps, get mediocre results, give up—and keep using the same bad pictures for years.
You’re not promising them the “perfect brand.” You’re promising something much more grounded: “Your photos will stop losing you sales.”
2 · Tool roles: who does what in your little studio
• Remove people, clutter, logos, exit signs, wires, text in seconds .
• Free tier: unlimited images, export limited to 720p .
• Pro: about $5/month or $36/year for unlimited resolution and a “high quality refiner” for better detail .
• Turn text into natural‑sounding audio in 140+ languages and hundreds of voices .
• Generate avatar videos that can “speak” your script in 1080p/4K on paid tiers .
• Free plan is for testing; Creator/Business include commercial licenses so you can use outputs in ads and client work .
• Decide which photos are usable.
• Decide what the video should actually say.
• Make sure image edits don’t misrepresent the product.
• Package everything into simple deliverables sellers understand.
3 · Offers you can sell without needing an agency
• 20–40 images cleaned: remove clutter, people, background distractions.
• Cropped & resized for marketplace or Shopify theme.
• Delivered in web‑ready (JPEG) + backup originals.
Good for: Etsy / eBay sellers, real estate agents, restaurants, salons.
• Everything in Photo Cleanup Pack +
• 1× 30–60 second product/listing video:
– slideshow of cleaned images
– AI voiceover script (written by you)
– optional talking avatar intro
Good for: higher‑ticket items (courses, furniture, real estate, equipment).
• X cleaned photos/month (e.g., 30).
• Y short videos/month (e.g., 4–8).
• Simple analytics report: which creatives perform best.
This is where stable income lives: you become their “visual content person.”
• Audit of current listing visuals.
• 10 cleaned hero images.
• 2 hero videos (e.g., brand intro + best seller).
• Simple checklist for them to follow going forward.
Great as a first project before you try to sell a retainer.
4 · Cleanup.pictures SOP: from messy photos → clean assets
• Start on the free tier while you practice: unlimited images, export at up to 720p—fine for web thumbnails and many social uses .
• For paid client work (especially if they want large images/print), use Pro:
around $5/month or $36/year with unlimited resolution and access to the “High quality refiner” .
The extra few dollars are easy to justify if you’re charging even $10–20 per image for editing.
- Ask them to upload original photos (not compressed screenshots) to a shared folder.
- Ask:
- Where these images will be used (Etsy, Amazon, Shopify, Instagram, etc.).
- What must stay (e.g., certain props, labels).
- What must go (logos, people, clutter, background text).
- Tag each image with a short filename: sku123-front.jpg, sku123-closeup-1.jpg, etc.
- Go to cleanup.pictures and drag & drop the first photo.
- Select the brush size:
- Small for fine details (wrinkles, small text).
- Large for people, big objects, shadows.
- Paint over:
- Unwanted people in the background.
- Brand names you don’t have rights to display.
- Exit signs, trash cans, cables, random clutter.
- Use the “overflow” rule from their own tips: cover a slightly bigger area than the object or text you remove so the AI can reconstruct background cleanly .
- If results look weird:
- Undo and try a slightly larger or smaller brush.
- Remove problematic objects in 2 passes instead of 1.
- Download the result (HD if on Pro; otherwise 720px on free) .
- Safety labels, ingredient lists, legal disclaimers.
- Defects that materially change a second‑hand item (e.g., big scratches) if they’re selling used goods and must disclose flaws.
- Watermarks on stock images you don’t have rights to edit (their FAQ explicitly flags copyright responsibility) .
5 · Synthesys SOP: from product bullet points → voiceover video
Synthesys uses a unified credit system: 1 credit ≈ 1 second of generation, used across voice, video, and images . Personal (~$20/mo) and Creator (~$41/mo) plans on the official pricing page unlock the full platform, with Creator and Business including commercial licenses explicitly so freelancers and agencies can monetize outputs .
Rule of thumb: use at least Creator if you’re producing videos for clients or ads. Free/personal tiers may be limited or personal‑license only; always double‑check current terms.
Product videos don’t need to be long. For most listings, 30–45 seconds is enough:
[HOOK: 1 sentence] "Finally, a [product] that doesn’t [annoying problem]." [3 FEATURES → BENEFITS] "First, [feature 1] means [benefit]." "Second, [feature 2] helps you [benefit]." "Third, [feature 3] makes it easier to [benefit]." [HOW TO USE: 1–2 sentences] "Just [simple steps]." [CTA] "Tap 'Add to cart' to get yours in [timeframe]."
Use your client’s wording where possible. The goal is for customers to feel like the brand is talking, not a template.
- Log into Synthesys → choose AI Video or AI Voice module depending on style:
- Voiceover‑only video: you assemble a slideshow elsewhere, using Synthesys just for audio.
- Talking avatar: Synthesys handles avatar + background + voice in one go.
- Pick a voice from their 300+ voices / 140+ languages that fits your client’s brand (calm, friendly, energetic, etc.) .
- Paste your script into the editor.
- For avatar style videos:
- Pick avatar, upload cleaned product photo as background or use a simple gradient.
- Limit slides to 3–6 for short listings; no one wants 30 slides for a mug.
- Preview, tweak speed and emphasis, then render in 1080p (paid tiers support HD/Full HD) .
- Don’t claim things the product can’t do (“waterproof” if it’s just “splash‑resistant”).
- Don’t use AI to fake ingredients, certifications, or awards.
- Confirm any regulated claims (health, finance) with the client and, ideally, with real references.
6 · Putting it together: one full “listing makeover” from A to Z
Let’s walk one realistic example: a small Etsy jewelry seller with 15 products and very average photos. You’ll do a Photo Cleanup Pack + 1 Listing Video for their best seller.
- Video/voice call (or email) with client:
- Ask which product brings most profit.
- Ask what customers love & complain about.
- Ask where they sell (Etsy, Instagram Shop, etc.).
- They upload 30–40 photos into a shared folder: hero shots, lifestyle shots, detail close‑ups.
- You confirm:
- What you can safely remove (people, clutter, logos).
- What must stay (their logo, particular props).
- Open Cleanup.pictures, process 20–30 key photos:
- Remove random objects from backgrounds.
- Remove other products that distract from the hero item.
- Remove any unwanted branding or personal items.
- Keep a “before/after” pair for 3–5 shots (great for showing your value on your own site or in your portfolio—without exposing client data).
- Save all cleaned images in organized folders:
/client-name/products/sku123/clean/
- Write a 40–50 second script describing:
- Who the jewelry is for.
- 3 reasons it’s different.
- How to care for it.
- In Synthesys, create a voiceover‑only video:
- Export cleaned images as a simple 1920×1080 slideshow (you can use Canva/Figma/Keynote if you like).
- Render audio from Synthesys as MP3.
- Combine slideshow + audio in a simple editor (CapCut, Descript, etc.).
- Export one horizontal 16:9 version and one vertical 9:16 cut for Reels/TikTok.
- Deliver:
- “Clean photos” folder.
- HD/horizontal video file.
- Vertical version (optional add‑on).
- Short PDF with “How to use these assets” (embed in listing, pin to top of shop, use in ads).
- Show 2–3 side‑by‑side before/after images on the call so they feel the difference.
- Soft upsell:
“If you like this, we can do 3 more videos next month for your other best sellers.”
7 · What to charge (and what not to promise)
| Service | Scope | Your time | Suggested fee |
|---|---|---|---|
| Basic cleanup only | 20–30 images, no video | 2–4 hours | $100–$300 |
| Listing Video Kit (1 product) | 10–15 images cleaned + 1 short video | 4–6 hours | $200–$500 |
| Monthly Visual Retainer | 30+ images + 4–8 simple product videos | 8–15 hours/month | $400–$1,200/month |
Cleanup.pictures charges roughly $3/month billed annually (or $5 monthly) for Pro as of early 2026 . Synthesys Creator sits around $41/month with a commercial license . If you’re charging even a few hundred dollars, tool costs are a small, predictable part of your margin.
Don’t promise “I’ll triple your sales.” You can honestly say: “I’ll make your listings look and sound like you take your product seriously. That usually improves click‑through and trust, but results depend on traffic and pricing too.”










