
Why is max_model_len only 8192 when running inference with vLLM for DeepSeek-V2-Chat? #72


Description

@ybdesire

Also, what is the maximum value of max_model_len for DeepSeek-V2-Chat? The sample code I'm using sets it to 8192:

from transformers import AutoTokenizer
from vllm import LLM, SamplingParams

max_model_len, tp_size = 8192, 8  # 8192-token window, 8-way tensor parallelism
model_name = "deepseek-ai/DeepSeek-V2-Chat"
tokenizer = AutoTokenizer.from_pretrained(model_name)
llm = LLM(model=model_name, tensor_parallel_size=tp_size,
          max_model_len=max_model_len, trust_remote_code=True)
sampling_params = SamplingParams(temperature=0.3, max_tokens=256)
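For context on the second question: in vLLM, max_model_len is a serving-time cap chosen by the caller, not a fixed property of the engine, so 8192 is simply the value used in the sample above. DeepSeek-V2 itself supports context lengths up to 128K tokens, so a larger value can be passed as long as the KV cache for that window fits in GPU memory. A minimal sketch, assuming an 8-GPU node with enough memory for the larger cache (the 32768 value and the gpu_memory_utilization setting are illustrative assumptions, not recommendations):

llm = LLM(
    model="deepseek-ai/DeepSeek-V2-Chat",
    tensor_parallel_size=8,
    max_model_len=32768,          # assumption: the KV cache for a 32K window fits in GPU memory
    gpu_memory_utilization=0.95,  # give the KV cache more headroom on each GPU
    trust_remote_code=True,
)

If vLLM reports at startup that the requested sequence length exceeds what the KV cache can hold, the practical maximum for that hardware has been exceeded; lowering max_model_len or spreading the model across more GPUs reduces per-GPU cache pressure.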
