
Testing Midjourney V6: A Practical Look at Natural Language and Creative Workflows
Apr 30
5 min read

Midjourney V6 brought some exciting changes to how we create images with AI. A big focus was improving how well it understands straightforward, natural language prompts. This means you might not need as many technical keywords to get the look you want. Let's take a practical look at how V6 performs based on recent tests and explore how it fits into a full creative process, like building a story with images and audio.
Midjourney V6: Understanding Prompts
Natural Language Adherence
One key test for V6 was whether it could follow prompts written in everyday language. In past versions, getting specific compositional elements or details often required special phrasing. With V6, tests show a clear improvement: prompting with simple descriptions produces images that stick closely to what was asked for, especially in composition and the relative scale of elements within the scene.
For example, describing the relationship between a monster and a wizard in an image produced results where their sizes and placement were quite accurate based on the natural language input. This is a notable step forward for making Midjourney easier to use and more predictable with simple descriptions.
V6 vs. Older Versions
Comparing results from V6 with a prompt used in V5.2 highlights the difference. The same prompt produced a completely different image style and composition in the older version. This confirms that V6 has a significantly updated understanding of language and image generation, moving away from the need for "junk words" or overly long, technical prompts that were sometimes needed in V5.2.
Style Raw and Default Differences
Midjourney V6 offers different style settings, including the default and 'Style Raw'. Tests show there is a clear difference between the two. Style Raw seems to adhere more closely to the prompt's requested composition, keeping elements positioned as described. The default setting, while producing good images, may sometimes change the layout or composition compared to Style Raw. Experimenting with both is key to finding the look you need for specific requests.
Working with Text and Images in V6
Generating Text on Images
Generating readable text directly on images has been a challenge for AI. In V6, you can specify text by putting it in quotes. Early tests show that V6 can generate text, but getting it placed exactly where you want it (like centered at the top for a book cover) still requires some instruction. The 'Style Raw' setting, possibly with a lower stylize value, might also help the text generation look better and stay closer to instructions.
Unlike V5.2, which often added gibberish text automatically to 'book cover' prompts, V6 seems to require the text to be cued explicitly. Text generation still isn't perfect, but being able to guide placement at all is a step in the right direction.
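In practice, cueing text means putting the desired words in quotes inside the prompt. The title and parameter values below are illustrative, not taken from the original tests:

```
minimalist fantasy book cover, the title "The Lost Kingdom" centered at the top --style raw --stylize 50
```

Keeping the quoted string short and pairing it with Style Raw and a lower stylize value tends to keep the lettering legible and closer to the requested placement.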
Referencing Existing Images
Midjourney V6 can use a reference image to influence the style or composition of new images. When testing this feature, V6 successfully incorporated elements from the reference image. However, recreating very specific styles from a reference, like a 'colorized black and white' effect, might still require additional prompting and experimentation with settings like 'stylize'. Differences were observed when using different 'stylize' values (e.g., 80 vs 800) alongside Style Raw, showing that these parameters significantly impact the final look and adherence to the reference.
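One way to test this interaction is to run the same reference-image prompt twice, changing only the stylize value. The URL and wording here are placeholders for illustration:

```
https://example.com/reference.png colorized black and white portrait --style raw --stylize 80
https://example.com/reference.png colorized black and white portrait --style raw --stylize 800
```

The low value keeps the output closer to the prompt and reference; the high value gives Midjourney's own aesthetic more freedom, which can drift away from the reference style.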
Beyond Just Prompting: Enhancing AI Art
The Role of External Editing Tools
AI image generators like Midjourney are powerful, but getting truly finished, professional-quality results often requires external editing software. Tools like Photoshop or Illustrator let you refine details, fix minor imperfections, add specific text precisely, or combine elements from different images. Think of Midjourney as getting you 80-90% of the way there; the final touch-ups and unique flair usually come from traditional graphic design skills. Combining these tools with generation helps you create art that stands apart from what simple prompting alone can achieve.
Managing large collections of generated images, selecting the best ones, and preparing them for editing can be time-consuming. Tools like the Midjourney Automation Suite from TitanXT can streamline tasks like this, helping you manage and process your generated images faster, freeing up time for the crucial editing phase.
Building a Story with AI Tools
Starting the Creative Process
Generating a visual story, like a comic or graphic novel, involves several steps using AI. You can start by using a language model (like GPT or Claude) to help structure your story or script. Then, use this text to generate descriptions for the scenes and characters you need to visualize. These descriptions become your prompts for Midjourney or other image generators.
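As a sketch of this hand-off from script to prompts, a small helper can turn plain-language scene descriptions (written with help from GPT or Claude) into Midjourney-ready prompt strings. The default parameter values below are assumptions to adjust per project, not settings from the original tests:

```python
def build_prompts(scenes, style="raw", stylize=100, aspect="16:9"):
    """Turn plain scene descriptions into Midjourney prompt strings.

    Appends the --style, --stylize, and --ar parameters so every
    scene in the story is rendered with consistent settings.
    """
    return [
        f"{desc.strip()} --style {style} --stylize {stylize} --ar {aspect}"
        for desc in scenes
    ]

scenes = [
    "A towering monster looming over a small wizard on a cliff at dusk",
    "The wizard raising a glowing staff, seen from below",
]
for prompt in build_prompts(scenes):
    print(prompt)
```

Centralizing the parameters in one place makes it easy to re-render the entire story with a different style or aspect ratio later.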
Generating Images and Scenes
Armed with scene descriptions, you generate images in Midjourney V6. You might need to experiment with different prompts, styles (default, Style Raw), and parameters (like 'chaos' for variation) to get the desired look and feel for your story. Consistency in characters can be a challenge, but using reference images and Midjourney's adherence capabilities in V6 (especially with Style Raw for composition) can help. Even so, you may need to generate multiple versions or refine images in editing software.
Automating Image Handling
Midjourney often provides images in a grid of four. For storyboarding or using individual images, you need to split these grids into separate files. Doing this manually for dozens or hundreds of images is impractical. Using simple scripts (like Python) can automate the process of naming and splitting these image grids into individual files, dramatically speeding up the workflow.
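A minimal sketch of such a script, using the Pillow library (a common choice, though any image library works) and assuming the standard 2x2 Midjourney grid:

```python
from pathlib import Path

from PIL import Image


def split_grid(path, out_dir, rows=2, cols=2):
    """Split a Midjourney-style image grid into individual tile files.

    Saves each tile as <original name>_<n>.png in out_dir and
    returns the list of saved file paths.
    """
    img = Image.open(path)
    width, height = img.size
    tile_w, tile_h = width // cols, height // rows

    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    saved = []
    for row in range(rows):
        for col in range(cols):
            # Crop box: (left, upper, right, lower) in pixels.
            box = (col * tile_w, row * tile_h,
                   (col + 1) * tile_w, (row + 1) * tile_h)
            tile = img.crop(box)
            out_path = out_dir / f"{Path(path).stem}_{row * cols + col + 1}.png"
            tile.save(out_path)
            saved.append(out_path)
    return saved
```

Pointed at a folder of downloaded grids in a loop, this turns hours of manual cropping into a few seconds of processing.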
Managing complex creative projects with AI involves many steps. Explore how the Midjourney Automation Suite from TitanXT can simplify your workflow, from generation to organization, including handling tasks like splitting image grids.
Adding Voice and Sound
For a full audio-visual story experience, you need voiceover and sound effects. Text-to-speech tools like Eleven Labs can turn your story script into narration. They often have features to choose different voices and manage longer scripts as "projects". Finding suitable background music and specific sound effects for scenes (like weather, ambient sounds, or actions) can be done using resources like Envato Elements.
Bringing It All Together
The final step is assembling all the pieces: the story script, the generated and refined images, the narration, music, and sound effects. This typically happens in video editing software like Premiere Pro. Here, you sequence the images, time them with the audio, and add any final visual effects or transitions to create your finished story.
Midjourney V6 offers powerful capabilities for image generation with improved prompt understanding and style controls. While AI tools can automate and speed up many parts of the creative process, combining them with traditional editing skills and a structured workflow allows for truly unique and polished results. To make your creative process even smoother and more efficient, consider the Midjourney Automation Suite from TitanXT.