models

Multimodal

An AI that can process multiple types of input — text, images, audio, video — not just words. A multimodal model can look at a photo and describe what's in it.

Want to learn more about AI?

Peter Saddington has trained 17,000+ people on agile and AI. Let’s talk.

Work with Peter