Microsoft has introduced three new artificial intelligence (AI) models focused on image creation, voice generation and speech-to-text transcription. The Redmond-based tech company claims they outperform models from Google, OpenAI and other companies. The models, MAI-Transcribe-1, MAI-Voice-1 and MAI-Image-2, are also said to prioritise fast content creation and affordable pricing. They are currently available through Microsoft Foundry and are also being incorporated into various consumer products.
According to the information available, the three new AI models can currently be accessed through Microsoft Foundry and MAI Playground. The most notable is MAI-Transcribe-1, which the company claims delivers state-of-the-art (SOTA) speech-to-text transcription across 25 of the most widely used languages.
These claims are based on Microsoft’s internal testing on the FLEURS benchmark. According to the data, this model outperforms Gemini 3.1 Flash and GPT-Transcribe in terms of error rate. Furthermore, the company says that Foundry users will find this model offers “the best performance at the best price of any major cloud provider.”
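For readers unfamiliar with how transcription error rates are scored, the sketch below shows word error rate (WER), a metric commonly used for speech-to-text benchmarks. It is an illustration only: the article does not say which error metric Microsoft used, and the function name `word_error_rate` and the sample sentences are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Illustrative only: this is the textbook WER definition, not
    Microsoft's evaluation code, and it applies no text normalisation.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words so far
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical sample: one word ("the") dropped by the transcriber.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

A lower score is better: zero means the transcript matches the reference word for word, and a model that "outperforms" another on error rate produces a lower value on the same test set.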
As for MAI-Voice-1, the AI model is said to produce a “natural, realistic voice that includes nuances, a full range of emotions and body language”. It is also said to keep voice and delivery consistent even when generating long passages of text. In Foundry, the model will also allow users to create their own custom voices from just a few seconds of audio.
Microsoft claims that this voice-creation process is completely safe. According to reports, the model can generate a 60-second audio clip in just one second. It will also power Copilot Audio Expressions and Copilot Podcasts.
The MAI-Image-2 model is an improvement over its predecessor and is said to deliver better quality output faster than before. Microsoft said the model was developed in collaboration with photographers, designers and visual storytellers, with a focus on natural light, accurate textures and clear text in images.
Notably, WPP is one of the first enterprise partners to adopt the AI model. This model, like the other two models, will be available through Microsoft Foundry and MAI Playground. It’s also rolling out to Copilot, Bing, and PowerPoint.