top of page

Talk to Midjourney: How Voice Mode Speeds Up AI Image Creation

May 3

4 min read

0

7

0

midjourney blog post image
A Midjourney generated image using Midjourney Automation Suite

Imagine turning your thoughts into amazing pictures just by speaking. Midjourney's new V7 draft mode makes this possible. It's faster and uses less power. With the new voice mode, you can speak your ideas aloud, and Midjourney brings them to life visually.

This feature helps creators like bloggers and designers prototype and make images easily. You don't need to figure out the right written prompt right away. This update is a big step for making content creation easier.

Getting Started with Draft and Voice Mode

To use these new tools, you need to be on the Midjourney website. Voice mode seems to work best there for now. Remember that as of April 2025, these features are still being tested. You might find some small issues. They worked well for me on Safari, but results can vary on Chrome or Firefox.

Draft mode creates quick versions of your ideas. These won't be as detailed as the final images Midjourney makes, but they are great for brainstorming. You can improve or make bigger versions of these drafts later.

Using Your Voice to Create Images

First, go to the "Create" area on the Midjourney website. You'll see a button for "Draft Mode." Click it to turn it on (it will turn red).

Once draft mode is on, a microphone icon appears. Click this icon to start voice mode (it also turns red when active). Make sure your microphone is set up correctly.

Now, just speak what you want to create into your microphone. The amazing part is that Midjourney understands natural talk. You can talk like you are working with a friend. It will listen and create a prompt from what you say to make your images.

Want to get more out of your Midjourney image creation process? Consider the Midjourney Automation Suite from TitanXT. It can help you manage and scale your image generation tasks more efficiently.

Examples of Using Voice Mode

Let's look at how voice mode handles different requests:

Product Photos

You can easily ask for specific details. For example, telling it, "Create a stock photo for a perfume company," and then adding, "Make the image 16x9 aspect ratio," and finally, "Can we make the perfume pink? Let's add the brand name Lust on the perfume bottle." Midjourney can often place text correctly on the bottle and change the image shape just by listening to these simple requests. Switching aspect ratios for mobile-friendly images is very fast.

Creative Scenes

Voice mode also works for more fun ideas. For example, "Make an image of a bulldog on a skateboard in the city." You can then refine it, like saying, "Can you change the aspect ratio to 16 by 9? Let's use a fisheye lens style. And I want to give the bulldog some sunglasses." The AI takes these basic instructions and builds a more detailed prompt, which is helpful if you want to use it later.

Characters and Settings

Creating specific character looks and settings is also easy. "Make a cinematic image of a woman sitting in a cafe drinking coffee." Then you can add details: "Can we give her long straight brown hair? Let's have her looking into the camera. And let's make her a little more happy. Can we zoom in into a portrait mode of her face? And let's make her in her 30s." All these changes come from voice commands.

Illustrations

Voice mode works for different styles too, like cartoons. "Create a cartoon character of a raindrop for a children's book. Can you make it 2x3 aspect ratio? And can you add some fun and creative font that says the funny drop." While writing text on images can sometimes have small errors, voice mode handles the main idea well. You can also refine the style and setting: "Let's make it more of a 3D Pixar style. And can you have the setting be a bathtub? And let's make the background tile. And let's make the color scheme purple."

Generating many variations and styles like this can become time-consuming. automate parts of your workflow with the Midjourney Automation Suite from TitanXT.

Working with Draft Images

When you create images in draft mode, you get four versions. Midjourney numbers these as 1, 2, 3, and 4. You can create variations of any of these images using your voice, by saying something like, "create a variation of image number four." These variations are not draft mode and take slightly longer to appear.

Even though draft images are quicker and lower quality, they still look good. You can improve them later.

Enhance Draft Images

[P]To improve a draft image, click on the image you like. You'll see buttons below it. Look for the "more settings" option and click "enhance." This creates a new set of four improved images. Since they started as drafts, it takes a moment for these enhanced versions to finish.[/H3]

Upscale Draft Images

[P]You can also make a draft image bigger. Click on the image, then find the "upscale" section in the settings. Choose "subtle" or "creative." Both double the image size. The subtle option keeps the image very similar to the draft. The creative option might add new details and change it a bit. Upscaling takes a few seconds and uses some GPU time.[/H3]

Finishing Up

When you are done using voice mode, click the microphone icon again to turn it off (it will turn white). Do the same for draft mode when you don't need it anymore.

Conclusion

That covers how to use Midjourney V7's Draft and Voice Mode. These features are great for quickly getting ideas into images and making quick edits by just talking. It changes how easily you can create visuals.

For those who use Midjourney often and need to handle many images, automating parts of the process can be a huge help. Check out the Midjourney Automation Suite from TitanXT to streamline your workflow and make the most of these powerful tools.

May 3

4 min read

0

7

0

Related Posts

Comments

Share Your ThoughtsBe the first to write a comment.
bottom of page
Midjourney Automation Suite - Automate your image generation workflows on Midjourney | Product Hunt