Kimi K2.5 via Deep Infra
Specifications
Context Window: 262,144 tokens
Release Date: 2026-01-27
Capabilities: Attachments, Reasoning, Tool calling, Structured output, Temperature, Image input, Video input
Availability: Open Weights
Model Overview
Deep Infra is a serverless AI inference platform that hosts popular open-source models. It offers competitive pricing and fast inference for models such as LLaMA, Mixtral, and Qwen.
Kimi K2.5 is a Kimi-family model served through Deep Infra, with a 262,144-token (262k) context window and up to 33k output tokens. It is priced at $0.50 per 1M input tokens and $2.80 per 1M output tokens.
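The per-token prices above translate into a per-request cost with simple arithmetic. The following is a minimal sketch, not an official calculator: the function name `estimate_cost` and the constant names are hypothetical, and the check that input plus output tokens fit inside the context window is an assumption about how the limit is counted.

```python
from decimal import Decimal

# Prices and limits taken from the listing above (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.50")
OUTPUT_PRICE_PER_M = Decimal("2.80")
CONTEXT_WINDOW = 262_144  # tokens


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for one request.

    Assumption: input and output tokens together must fit in the
    context window. The listing does not state how the limit is counted.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the 262,144-token context window")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost(100_000, 2_000)}")
```

For a 100,000-token prompt and a 2,000-token reply, the input side costs $0.05 and the output side $0.0056, so the request comes to roughly $0.0556. `Decimal` is used here to avoid floating-point rounding in currency arithmetic.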
Key capabilities are attachments, reasoning, tool calling, structured output, temperature control, image input, and video input. The reasoning support targets complex multi-step tasks, and tool calling lets the model invoke external tools and functions in agentic workflows.






