Plumstead-White Analytics

Empowering Your AI Journey

AI Readiness & Implementation Assessments

Latest News and Updates

If AI Change is a River - Get Ready for White-Water Rapids

Check out the pace of new developments over the Past 12 Months

Generative AI Innovation: Past 12 Months in Review

Over the past year (mid-2024 to mid-2025), generative AI has experienced an unprecedented surge of new applications and major upgrades across text, image, audio, and video domains. The pace of change has been rapid – each quarter brought more breakthroughs than the last, from powerful new language models to multimodal creative tools. Global investment and adoption have exploded: venture funding for generative AI nearly tripled in 2024 to $56 billion (885 deals), and AI-related ad spending in early 2024 was 19× higher than the year prior. Open-source innovation also boomed as Meta’s Llama models saw download counts grow tenfold, reaching hundreds of millions. In Y Combinator’s Winter 2024 startup batch, ~70% of companies were AI-focused (up from ~32% a year earlier) – a clear sign of the startup spike driven by generative AI.

Major model launches in text generation led the charge. OpenAI’s GPT-4 Turbo upgrade (announced Nov 2023) expanded context windows to 128K tokens and introduced vision and speech capabilities, making AI interactions more detailed and multimodal. This was soon followed by OpenAI’s GPT-4o (“GPT-4 Omni”) in May 2024 – a flagship model enabling realistic voice conversations and image understanding in real time. Anthropic answered with Claude 3 in March 2024, a suite of models (Haiku, Sonnet, Opus) that pushed context length and reasoning even further and claimed to outperform prior leaders like GPT-4 and Google’s Gemini on certain benchmarks. By late 2024, Google introduced Gemini as a multimodal rival – designed from the ground up to handle text, code, images, and audio simultaneously. Gemini’s integration into Google’s products (Search, Gmail, Docs via Vertex AI) exemplified how quickly new models were being folded into mainstream platforms. These rapid-fire releases underscore an industry race to one-up model capabilities on a near monthly basis.

Generative AI for images, audio, and video also accelerated. OpenAI’s DALL·E 3 model (integrated into ChatGPT in late 2023) and Midjourney’s version 6 upgrades in 2024 delivered more coherent, photorealistic visuals, making text-to-image generation more powerful than ever. Adobe’s Firefly family of image models gained immense traction, generating over 13 billion images within a year of launch, and by October 2024 Adobe unveiled Firefly Video – the first public text-to-video model designed for commercial use. New audio synthesis tools emerged as well: OpenAI gave ChatGPT a voice (enabling spoken conversations), and startups like ElevenLabs rolled out generative audio features – in mid-2024 ElevenLabs even previewed an AI tool to create music from a single text prompt. By the end of 2024, the first wave of AI video generators arrived: OpenAI officially released its Sora text-to-video platform, Google launched a tool called Veo, and Amazon debuted its Nova AI video model suite – bringing high-quality video generation to the public. These multimodal innovations, combined with widespread integration into products from Microsoft 365 to Adobe Creative Cloud, illustrate how ubiquitous generative AI has become in a very short time.

Major generative AI releases by month (May 2024–April 2025)

The rapid uptick in late 2024 highlights the accelerating pace of innovation.

As shown above, the number of high-profile GenAI launches climbed steadily through 2024, peaking in Q4. Below is a timeline of notable launches and enhancements over the past 12 months, highlighting month-by-month breakthroughs across text, image, audio, and video generation:

Bottom Line

In the span of 12 months, generative AI has evolved from a novelty into a ubiquitous technology layer. New models capable of human-like creativity and conversation are arriving almost every month, and existing tools are upgraded with astonishing speed. Whether it’s long-form text generation, realistic image creation, human-like audio synthesis, or even full video generation, the capabilities of generative AI have expanded rapidly – and have been integrated into products used by billions of people. This past year’s month-by-month cascade of launches (from GPT-4 Turbo to Gemini to Sora and beyond) showcases an industry in overdrive. If the current trend holds, 2025 is on track to witness even more groundbreaking AI tools, as the cycle of innovation in generative AI continues at an extraordinary pace.