DeepInfra is a platform that allows users to deploy machine learning models. They offer a variety of models, including text-generation, text-to-image, and automatic speech recognition. Users can pay per use for the models they deploy. DeepInfra offers both custom models and pre-trained models. Pre-trained models include openchat/openchat-3.6-8b, nvidia/Nemotron-4-340B-Instruct, and microsoft/WizardLM-2-7B.
deepinfra
const { LLMInterface } = require('llm-interface');
LLMInterface.setApiKey({'deepinfra': process.env.DEEPINFRA_API_KEY});
async function main() {
try {
const response = await LLMInterface.sendMessage('deepinfra', 'Explain the importance of low latency LLMs.');
console.log(response.results);
} catch (error) {
console.error(error);
throw error;
}
}
main();
The following model aliases are provided for this provider.
default
: openchat/openchat-3.6-8blarge
: nvidia/Nemotron-4-340B-Instructsmall
: microsoft/WizardLM-2-7Bagent
: Qwen/Qwen2-7B-Instruct
default
: BAAI/bge-base-en-v1.5large
: BAAI/bge-large-en-v1.5small
: BAAI/bge-base-en-v1.5
The following parameters can be passed through options
.
echo
: Details not available, please refer to the LLM provider documentation.frequency_penalty
: Details not available, please refer to the LLM provider documentation.max_tokens
: Details not available, please refer to the LLM provider documentation.n
: Details not available, please refer to the LLM provider documentation.presence_penalty
: Details not available, please refer to the LLM provider documentation.response_format
: Details not available, please refer to the LLM provider documentation.stop
: Details not available, please refer to the LLM provider documentation.stream
: Details not available, please refer to the LLM provider documentation.temperature
: Details not available, please refer to the LLM provider documentation.tool_choice
: Details not available, please refer to the LLM provider documentation.tools
: Details not available, please refer to the LLM provider documentation.top_p
: Details not available, please refer to the LLM provider documentation.
- Native JSON Mode
- Streaming
- Tools
- Embeddings
Commercial with Free Trial: The DeepInfra API is a commercial product, but new accounts start with a $1.80 credit.
To get an API key, first create a DeepInfra account, then visit the link below.
DeepInfra documentation is available here.
Anthropic