Tools · 8 min read · Richard Byrne

GetImg.ai for Video Producers: Building Custom Image Assets Without Midjourney

Tools covered:

  • GetImg.ai
  • Kling AI
  • Runway


There's a stage in almost every AI video production that people don't talk about enough: the image layer.

Before a single frame of video renders, you need reference frames. Storyboard panels. Mood board images that communicate the visual direction to a client without burning a generation budget on a brief that might pivot three times. Product hero shots to feed into a CGI sequence. Precise compositional references for img2vid. And increasingly, inpainted corrections to fix that one element in a generated still that's almost right but not quite.

This is quiet, unglamorous work. But it's where a lot of productions actually live or die.

I was doing all of this in Midjourney. Then I started using GetImg.ai, and my workflow changed. Not because GetImg is flashier — it isn't. Because for production use, it solves real problems that Midjourney was creating.

Why Video Producers Need AI Image Generation

If you're primarily a video director, you might be wondering why you'd invest serious time in an image tool. The answer is that modern AI video production is not a single-step process.

Storyboards and client alignment come first. Before you generate video, you need approval on visual direction. Generating ten storyboard images in GetImg is faster and cheaper than generating ten video clips in Runway or Kling, and it lets you lock composition, lighting, and character look before committing to compute.

Reference frames for img2vid are the more critical use case. Both Runway Gen-4 and Kling 3.0 accept image inputs and animate from them. The quality of your image input directly determines the quality of your video output. If your reference frame has weak composition, flat lighting, or wrong aspect ratio, no amount of prompting fixes that downstream. Getting the image right first is the highest-leverage thing you can do.

Product CGI pipelines require hero imagery that brands can review and approve before the video render begins. Working with clients from Novartis to Dell has shown me that the approvals process is always easier with stills — you can annotate them, mark revisions, and establish a visual reference that the video simply animates.

Thumbnail and marketing assets are part of deliverables. A finished video still needs a cover image, a thumbnail, a still for social. GetImg produces these as a natural byproduct of the reference frame workflow.

What GetImg.ai Actually Is

GetImg is a web-based AI image platform built primarily around FLUX models — the open-weight image generation architecture from Black Forest Labs that has largely replaced Stable Diffusion as the serious practitioner's choice for controllable, high-quality image generation.

The platform is not trying to be Midjourney. It's not a community feed, it's not optimised around aesthetic discovery and ambient inspiration. It's a production tool with a browser-based studio interface, a robust API, and a model library that includes both FLUX variants and a range of fine-tuned community models.

The core generation modes are:

  • Text to Image — standard generation from prompts, using FLUX Dev, FLUX Pro, or a selection of fine-tuned models
  • Image to Image — variation and transformation from an uploaded reference
  • Inpainting — masked region editing within an existing image
  • Outpainting — extending an image beyond its original boundaries
  • AI Canvas — a combined editing environment where you work with all of the above in a single interface

The output quality at FLUX Pro tier is genuinely excellent — clean, coherent, with the detail retention and prompt fidelity that FLUX delivers better than the older Stable Diffusion pipeline.

FLUX vs the Competition

FLUX models produce images with a specific visual signature: strong photorealism in lighting and material rendering, excellent text rendering (historically a weakness in diffusion models), and detailed facial accuracy. For production reference frames that need to look like they came from an actual camera, FLUX's output language is closer to photography than Midjourney's tends to be.

Midjourney's aesthetic is distinctive — it has a visual polish and a certain compositional confidence that comes from years of fine-tuning on creative community feedback. It's excellent for mood boards where you want inspiration. It's less consistent when you want a specific result that looks like a particular type of camera, lens, and lighting setup.

For img2vid input — where you need the image to resemble a real photograph closely enough that Runway or Kling can animate it convincingly — FLUX via GetImg regularly outperforms Midjourney in my tests. The image doesn't fight the video model. It gives the animation engine something it knows how to work with.

Inpainting and Outpainting in Production

This is where the value compounds significantly.

When a generated reference frame is 90% right — correct composition, correct lighting, correct subject — but has one broken hand, one misaligned logo, or one background element you didn't ask for, Midjourney's fix workflow is frustrating. You're back in Discord, re-rolling, hoping the variation lands better.

In GetImg.ai, you open the Canvas, paint a mask over the problem area, and prompt the correction directly. The inpainting is local — it respects the surrounding image. For a production environment where you've already spent time dialling in the rest of the frame, this is not a minor convenience. It's a significant time save.

Outpainting matters for format conversion. A generated image at 1:1 needs to become 16:9 for video. GetImg's outpainting extends the frame intelligently, extrapolating the background, the environment, the lighting. Doing this repeatedly for a batch of reference frames — across a storyboard of twenty panels, for example — is where the API becomes relevant.
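When you're converting a whole batch of frames this way, the first step is working out the canvas size each frame needs. The sketch below shows that dimension calculation plus the shape a scripted outpainting call could take. The endpoint path and payload field names are assumptions for illustration — GetImg's actual API reference is the source of truth here.

```python
import math


def expand_to_16x9(width: int, height: int) -> tuple[int, int]:
    """Compute the canvas size needed to outpaint a frame to 16:9.

    The new width is rounded up to a multiple of 16, a common
    dimension requirement for diffusion models.
    """
    target_w = math.ceil(height * 16 / 9 / 16) * 16
    return (max(target_w, width), height)


def outpaint_batch(frames: list[str], api_key: str) -> None:
    """Hedged sketch of a batch outpainting loop.

    The endpoint URL and JSON fields below are illustrative
    assumptions, not confirmed GetImg API details.
    """
    import base64

    import requests  # third-party dependency: pip install requests

    for path in frames:
        with open(path, "rb") as f:
            image_b64 = base64.b64encode(f.read()).decode()
        resp = requests.post(
            "https://api.getimg.ai/v1/flux/outpaint",  # assumed path
            headers={"Authorization": f"Bearer {api_key}"},
            json={"image": image_b64, "width": 1920, "height": 1080},
            timeout=120,
        )
        resp.raise_for_status()
        out = base64.b64decode(resp.json()["image"])  # assumed field
        with open(path.replace(".png", "_16x9.png"), "wb") as f:
            f.write(out)
```

A 1024×1024 panel, for instance, needs an 1824-wide canvas to reach 16:9; a 1080-tall frame lands exactly on 1920×1080.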

The API: Where GetImg Separates From Midjourney

Midjourney has no public API. That is a production constraint that is easy to underestimate until you're trying to build any kind of automated workflow.

GetImg's API is clean, well-documented, and priced per generation. This means:

  • You can script batch generation of an entire storyboard from a structured prompt list
  • You can integrate image generation into a production pipeline that feeds downstream to video tools
  • You can generate product variants programmatically — same environment, different SKU, repeated for fifteen products — without clicking through a web interface fifteen times
  • You can build internal client tools that generate reference imagery without requiring clients to interact with third-party platforms

For anyone running a production service rather than a hobby workflow, this is material. The API access alone justifies the platform comparison.

Honest Comparison: GetImg vs Midjourney

I want to be direct about where each tool is stronger, because the answer is not one-sided.

Where GetImg is stronger:

  • API access — Midjourney doesn't have one; GetImg does. Full stop.
  • Inpainting and editing — local, mask-based editing in GetImg's Canvas is a proper production tool. Midjourney's variation workflow is not.
  • Cost at volume — API pricing per generation is lower than Midjourney Pro subscription cost at meaningful generation volumes
  • No Discord — I cannot stress this enough. Running a professional production through a chat interface is not a sustainable workflow
  • FLUX models for photorealism — specifically for reference frames that will feed into img2vid, FLUX's output is more suitable

Where Midjourney is stronger:

  • Aesthetic range and model variety — Midjourney's community models and aesthetic diversity are unmatched. For pure creative exploration, it's a wider palette
  • Community and prompt learning — the Midjourney feed is a live education in what works. There's no equivalent in GetImg
  • Stylised, non-photorealistic output — for illustrated, painterly, or heavily stylised aesthetics, Midjourney still produces stronger results with less prompting effort

The conclusion I've reached is that these tools serve different stages of the creative process. Midjourney for aesthetic discovery and mood boarding when the brief is open. GetImg for production execution when you know what you need and you need it to be controllable, editable, and scriptable.

The img2vid Workflow in Practice

Here's the sequence I now use on most commercial productions:

1. Generate reference frames in GetImg. Using FLUX Pro, I generate the key compositional frames for the video: opening shot, hero product moment, closing call to action. These are generated at 16:9, 1920x1080, with detailed lighting and camera language in the prompt.

2. Inpaint corrections. Any frame that needs a fix — a product label that rendered wrong, a background element that's distracting — gets corrected in the Canvas without regenerating the whole image.

3. Client approval on stills. I present the reference frames as the visual brief. Clients annotate, give feedback, and approve. This stage catches 80% of directorial feedback before a single second of video is generated.

4. Feed approved frames into Kling or Runway as img2vid input. Kling 3.0 and Runway Gen-4 both accept image-to-video inputs, and both produce significantly more consistent results when the input image is high-quality and appropriately formatted. The GetImg-generated references perform well here — the FLUX photorealism gives the video models something coherent to animate from.

5. Post-generation corrections. If a video frame needs a background fix before the animation pass, outpainting in GetImg handles format extension. If I need a new thumbnail from a specific video moment, I bring the frame back into GetImg for treatment.

This workflow reduced my generation waste — the ratio of unusable outputs to usable ones — by a meaningful margin. Less re-rolling, more intentional production.

Pricing

GetImg operates on a credit system. The free tier gives you 100 images monthly, which is enough to evaluate the platform but not to run production work through it. Paid plans start at $12/month for 3,000 credits, which at FLUX Dev generation costs covers a substantial monthly volume.

API credits are purchased separately and priced per call, with FLUX Pro generations at the higher end and faster, lighter models at lower cost. For production use, the API pricing is rational — you're paying for what you generate, not a flat subscription that encourages overuse.

Verdict

GetImg.ai has replaced Midjourney as my primary image generation tool for production work. Not because it's more creative or more inspiring — Midjourney still wins that comparison. Because it's more controllable, more scriptable, and more honest about being a production instrument rather than a creative community.

If you're building AI video as a professional service, the image layer matters more than most people realise. Getting that layer right — with clean reference frames, editable assets, and API access for batch workflows — makes everything downstream more reliable.

Try GetImg.ai →

Richard Byrne is a creative director with 25 years of production experience, working with clients including BBC, Novartis, Dell, and the Cannes Film Festival. For AI video production enquiries, contact via PeoplePerHour.

Tags: getimg.ai, image generation, video production, FLUX, Midjourney alternative