Skip to content

AI4Chem/api_backend

Folders and files

NameName
Last commit message
Last commit date

Latest commit

0beb008 · Apr 2, 2024
Aug 12, 2023
Apr 2, 2024
Mar 14, 2024
Mar 14, 2024
Jan 19, 2024
Dec 1, 2023
Jan 20, 2024
Mar 16, 2024
Nov 28, 2023
Apr 2, 2024
Apr 2, 2024
Apr 2, 2024
Apr 2, 2024
Apr 2, 2024
Apr 2, 2024
Jun 12, 2023
Apr 2, 2024
Nov 24, 2023
Nov 24, 2023
Nov 24, 2023
Apr 2, 2024
Feb 17, 2024
Apr 2, 2024
Apr 2, 2024
Apr 2, 2024

Repository files navigation

API for Open LLMs

Original Document

Quick Deployment

Install Dependencies

conda env create -f ./server.yml

Modify Configuration (Optional)

Modify the local .env file, refer to

PORT=10086

# model related
MODEL_NAME=internlm2
MODEL_PATH=AI4Chem/ChemLLM-7B-Chat-1.5-DPO
EMBEDDING_NAME=jinaai/jina-embeddings-v2-base-zh
CONTEXT_LEN=32000
LOAD_IN_8BIT=false
LOAD_IN_4BIT=false
USING_PTUNING_V2=false
STREAM_INTERVERL=2
PROMPT_NAME=

# device related
DEVICE=

# "auto", "cuda:0", "cuda:1", ...
DEVICE_MAP=auto
GPUS=
NUM_GPUs=2
DTYPE=half


# api related
API_PREFIX=/v1

USE_STREAMER_V2=false

# vllm related
ENGINE=default

Inference

conda activate server
python server.py

原文档

快速部署

安装依赖

conda env create -f ./server.yml

修改配置(可选)

修改本地.env文件,参考

PORT=10086

# model related
MODEL_NAME=internlm2
MODEL_PATH=AI4Chem/ChemLLM-7B-Chat-1.5-DPO
EMBEDDING_NAME=jinaai/jina-embeddings-v2-base-zh
CONTEXT_LEN=32000
LOAD_IN_8BIT=false
LOAD_IN_4BIT=false
USING_PTUNING_V2=false
STREAM_INTERVERL=2
PROMPT_NAME=

# device related
DEVICE=

# "auto", "cuda:0", "cuda:1", ...
DEVICE_MAP=auto
GPUS=
NUM_GPUs=2
DTYPE=half


# api related
API_PREFIX=/v1

USE_STREAMER_V2=false

# vllm related
ENGINE=default

推理

conda activate server
python server.py

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages