Meta Llama 3 11B Vision
Meta Llama 3 11B Vision is a large multimodal language model optimized for combined vision and language tasks. It lets developers and enterprises build AI applications that integrate visual understanding with natural language processing, with improved accuracy and efficiency over previous models.
Provider: meta-llama
Proprietary
No API
Context: 131.1K
Multimodal
LLM Specifications
Context Length: 131.1K
Max Output: 16.4K
Pricing
Input Cost: $0.05 / 1M tokens
Output Cost: $0.05 / 1M tokens
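As a rough illustration of how the per-token rates above translate into a per-request cost, the following is a minimal sketch. The rates are taken from this listing; the function name `estimate_cost` and the constant names are hypothetical and not part of any official SDK, and actual billing may differ (for example, in how image inputs are counted as tokens).

```python
# Rates copied from the listing above (USD per 1M tokens).
INPUT_COST_PER_MILLION = 0.05
OUTPUT_COST_PER_MILLION = 0.05


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts.

    Hypothetical helper for illustration only; it assumes input and
    output tokens are billed linearly at the listed rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_COST_PER_MILLION
        + output_tokens * OUTPUT_COST_PER_MILLION
    )
    return total / 1_000_000
```

Under these assumptions, a request with 100,000 input tokens and 16,000 output tokens would cost about $0.0058 (116,000 tokens at $0.05 per million).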
Supported Formats
Text, Image