Meta Llama 3 11B Vision

Visit Resource

Meta Llama 3 11B Vision is a large-scale multimodal language model optimized for vision and language tasks, enabling developers and enterprises to build advanced AI applications that integrate visual understanding with natural language processing, with improved accuracy and efficiency over previous models.

Provider: meta-llamaProprietaryNo API
Context: 131.1K
Multimodal

LLM Specifications

Context Length:131.1K
Max Output:16.4K

Pricing

Input Cost:$0.05 / 1M tokens
Output Cost:$0.05 / 1M tokens

Supported Formats

TextImage