CLI Arguments
Cli arguments, --host, --port, --num_workers
--host
- Default:
'0.0.0.0' - The host for the server to listen on.
- Usage:
litellm --host 127.0.0.1 - Usage - set Environment Variable:
HOSTexport HOST=127.0.0.1
litellm
--port
- Default:
4000 - The port to bind the server to.
- Usage:
litellm --port 8080 - Usage - set Environment Variable:
PORTexport PORT=8080
litellm
--num_workers
- Default:
1 - The number of uvicorn workers to spin up.
- Usage:
litellm --num_workers 4 - Usage - set Environment Variable:
NUM_WORKERSexport NUM_WORKERS=4
litellm
--api_base
- Default:
None - The API base for the model litellm should call.
- Usage:
litellm --model huggingface/tinyllama --api_base https://k58ory32yinf1ly0.us-east-1.aws.endpoints.huggingface.cloud
--api_version
- Default:
None - For Azure services, specify the API version.
- Usage:
litellm --model azure/gpt-deployment --api_version 2023-08-01 --api_base https://<your api base>"
--model or -m
- Default:
None - The model name to pass to Litellm.
- Usage:
litellm --model gpt-3.5-turbo
--test
- Type:
bool(Flag) - Proxy chat completions URL to make a test request.
- Usage:
litellm --test
--health
- Type:
bool(Flag) - Runs a health check on all models in config.yaml
- Usage:
litellm --health
--alias
- Default:
None - An alias for the model, for user-friendly reference.
- Usage:
litellm --alias my-gpt-model
--debug
- Default:
False - Type:
bool(Flag) - Enable debugging mode for the input.
- Usage:
litellm --debug - Usage - set Environment Variable:
DEBUGexport DEBUG=True
litellm
--detailed_debug
- Default:
False - Type:
bool(Flag) - Enable debugging mode for the input.
- Usage:
litellm --detailed_debug - Usage - set Environment Variable:
DETAILED_DEBUGexport DETAILED_DEBUG=True
litellm
--temperature
- Default:
None - Type:
float - Set the temperature for the model.
- Usage:
litellm --temperature 0.7
--max_tokens
- Default:
None - Type:
int - Set the maximum number of tokens for the model output.
- Usage:
litellm --max_tokens 50
--request_timeout
- Default:
6000 - Type:
int - Set the timeout in seconds for completion calls.
- Usage:
litellm --request_timeout 300
--drop_params
- Type:
bool(Flag) - Drop any unmapped params.
- Usage:
litellm --drop_params
--add_function_to_prompt
- Type:
bool(Flag) - If a function passed but unsupported, pass it as a part of the prompt.
- Usage:
litellm --add_function_to_prompt
--config
- Configure Litellm by providing a configuration file path.
- Usage:
litellm --config path/to/config.yaml
--telemetry
- Default:
True - Type:
bool - Help track usage of this feature.
- Usage:
litellm --telemetry False
--log_config
- Default:
None - Type:
str - Specify a log configuration file for uvicorn.
- Usage:
litellm --log_config path/to/log_config.conf