Deployment
DeploymentStep
Bases: PipelineStep
Pipeline step for deploying a model using FastAPI and ONNX Runtime.
Source code in textforge/deployment.py
__init__()
run(model_path, quantize=False)
Run the deployment process.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
model_path
|
str
|
Path to the model directory. |
required |
quantize
|
bool
|
Flag to indicate whether to quantize the model. Defaults to False. |
False
|