AgentReadyHomeAgent ListingPricing

← Agent Listing

Pixtral 12B 24.09

LLMFreeOpen SourceHorizontal

Multimodal AI for image-text tasks with variable image support and 128K context

🛡️ AgentReady threat assessment

MAESTRO 7-layer threat model + OWASP AIVSS risk score for Pixtral 12B 24.09, derived from its capabilities.

AIVSS 5.5 · Medium
View MAESTRO 7-layer threat model →

These scores are auto-generated from public information (the agent's own listing, docs, and repository) using the canonical OWASP AIVSS formula and the MAESTRO framework — an estimate for guidance, not a penetration test, audit, or certification. See the scoring methodology. Are you the vendor? Factual corrections are free.

Overview

Pixtral-12B-2409 is a 12-billion-parameter multimodal model by Mistral AI, combining a 12B-parameter text decoder with a 400M-parameter vision encoder. It processes interleaved text and images natively, supporting variable image sizes and a 128K-token context window for long-form document analysis or multi-image workflows. The model excels in tasks like chart understanding, OCR, and multilingual reasoning, outperforming similar-sized open models (e.g., Qwen2-VL 7B, LLaVA-OV 7B) and even larger models like Llama-3.2 90B in benchmarks like MMMU (52.5%) and MathVista (58.0%)

Key features

Use cases