Create a Cohere inference endpoint
PUT
/_inference/{task_type}/{cohere_inference_id} Create an inference endpoint to perform an inference task with the cohere service.
Required authorization
- Cluster privileges:
manage_inference
Parameters
path Path Parameters
| Name | Type |
|---|---|
task_type
required
The type of the inference task that the model will perform. | type InferenceTypesCohereTaskType = "completion" | "rerank" | "text_embedding" |
cohere_inference_id
required
The unique identifier of the inference endpoint. | type TypesId = string |
query Query Parameters
| Name | Type |
|---|---|
timeout Specifies the amount of time to wait for the inference endpoint to be created. | type TypesDuration = string | "-1" | "0" |
Request Body
application/json
required
{
chunking_settings?:InferenceTypesInferenceChunkingSettings ;
service:InferenceTypesCohereServiceType ;
service_settings:InferenceTypesCohereServiceSettings ;
task_settings?:InferenceTypesCohereTaskSettings ;
}
chunking_settings?:
service:
service_settings:
task_settings?:
}
Responses
200 application/json
type InferenceTypesInferenceEndpointInfoCohere = interface InferenceTypesInferenceEndpoint {
chunking_settings?:InferenceTypesInferenceChunkingSettings ;
service: string;
service_settings:InferenceTypesServiceSettings ;
task_settings?:InferenceTypesTaskSettings ;
} & { inference_id: string;task_type:InferenceTypesTaskTypeCohere ; }
chunking_settings?:
service: string;
service_settings:
task_settings?:
} & { inference_id: string;task_type: