Big AI News: Midjourney V7, Llama 4, and New Tools

May 14

AI keeps moving fast. This past week brought big updates from major players, including OpenAI, Meta (with Llama 4), and Midjourney. New models, features, and tools are changing how we create, work, and think about the future of artificial intelligence. Let's look at the key announcements you might have missed.

OpenAI Updates and Future Plans

OpenAI shared several updates recently.

o3 and o4-mini are expected to arrive soon as standalone releases. GPT-5 will be a bigger system that comes later, but these two models will ship on their own first, possibly for developers.
An o3-pro version with more processing power is also planned.
OpenAI mentioned moving toward open-source releases for some models. This is seen as a good sign, potentially allowing developers more freedom, though some see it as a public relations move.
They released a new benchmark called PaperBench. This test measures how well AI models can understand and replicate results from research papers, which involves tasks such as math and programming. In this test, Claude 3.5 Sonnet scored well, showing its ability to follow complex instructions, while other models, including OpenAI's own, had lower scores.
Free ChatGPT Plus subscriptions are now available for US and Canadian college students, clearly meant to encourage use during exams. Big investments, including $10 billion upfront from a $40 billion round, are funding these initiatives and large infrastructure projects like the planned Stargate data center.

Image Generation: More Hype

OpenAI's image generator has created a lot of excitement. A video highlighting 28 ways to use it quickly gained attention. Millions more users joined ChatGPT after this feature was added, showing how popular image creation is. Some reports even suggest that search volume for ChatGPT came close to that for popular adult content, which some take as a measure of its widespread appeal.

Llama 4 Arrives

Meta introduced Llama 4 in several versions, which are also available to businesses. Key models include Llama 4 Maverick and Llama 4 Scout, both mixture-of-experts models with large parameter counts. A significant feature is the 10-million token context window for Llama 4 Scout, which lets it process vast amounts of information in one go. This could transform work with large codebases, documents, or support systems. Benchmarks look promising compared to competitors like the Gemini and Claude models. A future 'thinking' model, Llama 4 Reasoning, is also expected soon.
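To get a feel for what a 10-million token window means in practice, here is a minimal sketch that estimates whether a set of files would fit. It assumes roughly 4 characters per token, a common rule of thumb for English text and not a property of Llama 4's tokenizer. The function names are hypothetical and the result is only an estimate.

```python
# Rough check of whether a collection of texts fits in a context window.
# Assumption: ~4 characters per token (rule of thumb; real tokenizers vary).

CHARS_PER_TOKEN = 4                 # heuristic, not taken from Llama 4
SCOUT_CONTEXT_TOKENS = 10_000_000   # context size cited in this article


def estimate_tokens(text: str) -> int:
    """Estimate the token count of a string with the chars-per-token heuristic."""
    return -(-len(text) // CHARS_PER_TOKEN)  # ceiling division


def fits_in_context(texts, context_tokens=SCOUT_CONTEXT_TOKENS, reserve_for_output=0):
    """Return (fits, estimated_total_tokens) for a list of strings.

    reserve_for_output sets aside room for the model's reply.
    """
    total = sum(estimate_tokens(t) for t in texts)
    return total + reserve_for_output <= context_tokens, total


if __name__ == "__main__":
    docs = ["def add(a, b):\n    return a + b\n" * 1000, "README text " * 500]
    ok, total = fits_in_context(docs, reserve_for_output=4096)
    print(f"estimated tokens: {total}, fits: {ok}")
```

For a real project, swap the heuristic for the model's actual tokenizer before relying on the numbers.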

Comparing AI Tool Costs and Features

Google announced API prices for its Gemini 2.5 Pro model. The pricing is seen as quite competitive, especially for smaller models. For companies building applications on its models, Google appears to offer good value compared with the other options currently available.
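Per-token API pricing turns into a budget with simple arithmetic. The sketch below is a generic calculator for comparing providers. The prices in the usage example are placeholders chosen for illustration, not Google's published rates, and `request_cost` and `monthly_cost` are hypothetical helper names.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Cost of one request when prices are quoted per million tokens."""
    return ((input_tokens / 1_000_000) * input_price_per_m
            + (output_tokens / 1_000_000) * output_price_per_m)


def monthly_cost(requests_per_day: int, days: int = 30, **request_kwargs) -> float:
    """Scale the per-request cost to a daily request volume over a number of days."""
    return requests_per_day * days * request_cost(**request_kwargs)


if __name__ == "__main__":
    # Placeholder prices in USD per million tokens, NOT real vendor rates.
    cost = monthly_cost(
        requests_per_day=1_000,
        input_tokens=2_000,
        output_tokens=500,
        input_price_per_m=1.00,
        output_price_per_m=4.00,
    )
    print(f"estimated monthly cost: ${cost:.2f}")
```

Running the same workload through each provider's current price sheet gives a like-for-like comparison.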

New AI Agents Emerge

Two new AI agents have appeared, one from Convergence AI and another called Gent. Like other recent agents, they perform complex tasks by breaking them into steps and using tools such as web browsing or code writing. They are still developing, but some tests show that agents like Gent can automatically gather data and create things like price comparison graphs, though results can still be inconsistent.
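The "break the task down, then call tools" pattern can be shown with a toy loop. This is a minimal sketch and not how Convergence AI or Gent are implemented. The planner and both tools are hypothetical stand-ins that return canned data, where a real agent would call a language model, a browser, or a code sandbox.

```python
from typing import Callable


def search_prices(product: str) -> dict:
    """Stand-in for web browsing: returns canned prices instead of live data."""
    return {"StoreA": 19.99, "StoreB": 17.49, "StoreC": 21.00}


def cheapest(prices: dict) -> str:
    """Stand-in for an analysis tool: picks the lowest price."""
    store = min(prices, key=prices.get)
    return f"{store} at {prices[store]:.2f}"


# Registry mapping tool names to callables (hypothetical names).
TOOLS: dict = {"search_prices": search_prices, "cheapest": cheapest}


def plan(task: str) -> list:
    """Stand-in for the model's planning step: always returns a fixed plan."""
    return ["search_prices", "cheapest"]


def run_agent(task: str) -> str:
    """Run each planned step, feeding the previous result into the next tool."""
    result = task
    for step in plan(task):
        tool: Callable = TOOLS[step]
        result = tool(result)
    return result


if __name__ == "__main__":
    print(run_agent("wireless mouse"))
```

In a real agent, each step depends on a model's output and on live data, which helps explain why results can be inconsistent.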

Discover how AI can handle tasks for you. Many users find that automating creative processes, like generating images, saves significant time and effort. Explore how tools designed for automation can boost your workflow. The TitanXT Midjourney Automation Suite offers ways to streamline your Midjourney tasks.

Advances in Video and Audio AI

Video and audio AI tools continue to improve rapidly.

Higgsfield AI offers templates for video creation, making it easier for beginners to get specific results without deep technical knowledge or prompt engineering skills.
A new tool from China allows users to create speaking videos from a single image. The mouth and facial movements are closely synced to the audio input, even for complex languages. This shows potential for creating highly realistic video content easily.
Luma's Ray2 added more control over camera movement in generated videos.
Kling's Bloom effect helps add spring-themed visuals to videos. This is a simple but popular feature emphasizing seasonal trends.
MiniMax's Speech 02 model improved text-to-speech capabilities, including seamless switching between multiple languages in one audio track.
Audio generation tools like Mureka's Mureka O1 are getting more advanced. They can now analyze a music track's style and let users create new songs in that same style. The interface makes customization easy, similar to image generation interfaces that offer fine-tuned control.
ElevenLabs, known for realistic text-to-speech, even released a playful 'text-to-bark' tool (an April Fools' joke, but technically possible).
Adobe Premiere Pro 25.2 added a long-awaited feature: the ability to automatically generate missing video frames when stretching clips. This helps create smooth transitions and makes timing adjustments easy. It's currently free but might use Firefly tokens later.
Runway released Gen-4, which generates very high-quality videos with stable styles. It's seen as a major step forward in video generation capability, and people are already using it to create simple short animations.
Krea updated its platform with a redesigned, simpler interface and improved 3D tools. It also added image generation features, including integrations with models like Gemini.

Midjourney V7 is Here

Midjourney version 7 has arrived. Announcements and updates are often shared on the Midjourney Discord server. V7 features a rebuilt model architecture for better prompt understanding. Two notable features were added:

Personalization: By default, your likes and dislikes on images now influence future generations. This helps tailor results to your specific taste, a unique feature among image generators.
Draft Mode: Generate images much faster and cheaper for quickly testing ideas or prompt changes. It's lower quality but helps rapid iteration to find the style and composition you want before generating the final high-quality image.

Curious about how to make the most of these new features, or how to handle large volumes of generations while testing Draft Mode? Exploring automation tools can amplify your creative process. The Midjourney Automation Suite from TitanXT is designed to help streamline your image generation workflow.

Looking Ahead: AI Safety and Predictions

Two recent forecasts looked at the future of AI. An article called "AI 2027," written by well-known AI forecasters, laid out two potential paths. One path described uncontrolled growth and a race between countries that ignore safety, leading to potential problems. The other suggested humanity might create safe, highly capable AI 'agents' that work faster than humans but stay aligned with human goals. DeepMind also shared predictions that artificial general intelligence (AGI) could appear by late 2029 or 2030, and categorized potential risks as misuse, misalignment (AI goals not matching human goals), errors, and structural risks. It suggested solutions such as training AI models to monitor other models for safety issues.

These discussions are important and show that AI safety is a topic being actively considered, offering some hope amidst rapid progress.

Other Interesting AI Uses

An experimental 'swipe right' game on Tinder uses AI characters and voice flirting, giving real-time feedback on flirting skills. It's currently limited, but shows how AI is being used in unexpected ways like dating simulation.

Conclusion

From more capable language models and AI agents to advanced video creation tools and new Midjourney features, the pace of AI development is incredible. Staying informed about these changes can help you use AI more effectively in your work and creative projects.

Want to make your image generation work easier? Instead of manual steps, consider automating parts of your process. Visit TitanXT's Midjourney Automation Suite to see how automation can help manage your creative output.