Multimodal Model - Search News

Multimodal AI Model Can Predict Intimate Partner Violence

A multimodal artificial intelligence (AI) model can identify patients at risk of intimate partner violence (IPV) years before ...

Mistral's Small 4 consolidates reasoning, vision and coding into one model — at a fraction of the inference cost

Mistral's Small 4 combines reasoning, multimodal analysis and agentic coding in a single open-source model with configurable ...

The Tech Portal

Meta introduces ‘TRIBE v2’ AI model that predicts human brain activity patterns

Meta has introduced TRIBE v2 (TRImodal Brain Encoder version 2), a next-generation multimodal AI system designed to predict ...

techtimes

Meta Is Developing New Multimodal AI Model Chameleon to Rival OpenAI's GPT-4o

Following the recent AI offerings showdown between OpenAI and Google, Meta's AI researchers seem ready to join the contest with their own multimodal model. Multimodal AI models are evolved versions of ...

TechCrunch

Mistral releases Pixtral 12B, its first multimodal model

French AI startup Mistral has released its first model that can process images as well as text. Called Pixtral 12B, the 12-billion-parameter model is about 24GB in size. Parameters roughly correspond ...

9to5Mac

New Apple model combines vision understanding and image generation with impressive results

In the study titled MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer, a team of nearly 30 Apple researchers details a novel unified approach that enables both ...

Geeky Gadgets

New Google Gemma 3 Multimodal AI Model Beats DeepSeek V3: Performance Tested

Gemma 3, Google’s latest suite of lightweight, open source AI models, is reshaping the landscape of artificial intelligence by emphasizing efficiency and accessibility. Despite its compact design, it ...

GPT Proto Expands AI Model Catalogue with Support for Google’s Gemini 3.1 Pro Preview

Hong Kong-based API platform adds Google's latest multimodal model to its growing roster, expanding developer access to ...

techtimes

Kling AI Unveils Unified Multimodal Video Model O1 and Video 2.6 to Reshape Creative Production

In the rapidly accelerating landscape of generative AI, creators continue to struggle with fragmented workflows: one model for video generation, another for post-production editing, and yet another ...

InfoWorld

Microsoft’s Phi-4-multimodal AI model handles speech, text, and video

Microsoft has introduced a new AI model that, it says, can process speech, vision, and text locally on-device using less compute capacity than previous models. Innovation in generative artificial ...
