Create an AlibabaCloud AI Search inference endpoint
PUT
/_inference/{task_type}/{alibabacloud_inference_id} Create an inference endpoint to perform an inference task with the alibabacloud-ai-search service.
Required authorization
- Cluster privileges:
manage_inference
Parameters
path Path Parameters
| Name | Type |
|---|---|
task_type
required
The type of the inference task that the model will perform. | type InferenceTypesAlibabaCloudTaskType = "completion" | "rerank" | "sparse_embedding" | "text_embedding" |
alibabacloud_inference_id
required
The unique identifier of the inference endpoint. | type TypesId = string |
query Query Parameters
| Name | Type |
|---|---|
timeout Specifies the amount of time to wait for the inference endpoint to be created. | type TypesDuration = string | "-1" | "0" |
Request Body
application/json
required
{
chunking_settings?:InferenceTypesInferenceChunkingSettings ;
service:InferenceTypesAlibabaCloudServiceType ;
service_settings:InferenceTypesAlibabaCloudServiceSettings ;
task_settings?:InferenceTypesAlibabaCloudTaskSettings ;
}
chunking_settings?:
service:
service_settings:
task_settings?:
}
Responses
200 application/json
type InferenceTypesInferenceEndpointInfoAlibabaCloudAI = interface InferenceTypesInferenceEndpoint {
chunking_settings?:InferenceTypesInferenceChunkingSettings ;
service: string;
service_settings:InferenceTypesServiceSettings ;
task_settings?:InferenceTypesTaskSettings ;
} & { inference_id: string;task_type:InferenceTypesTaskTypeAlibabaCloudAI ; }
chunking_settings?:
service: string;
service_settings:
task_settings?:
} & { inference_id: string;task_type: