Interface BatchChatModel
- All Known Implementing Classes:
GoogleAiGeminiBatchChatModel
A model interface for processing multiple chat requests asynchronously in a batch.
Batch processing typically offers significant cost reductions compared to real-time chat requests and is ideal for large-scale, non-urgent conversational or instruction-following tasks.
- See Also:
-
Method Summary
Modifier and TypeMethodDescriptionvoidCancels a chat batch operation that is currently pending or running.list(@Nullable BatchPagination pagination) Lists chat batch jobs with optional pagination.Retrieves the current state and results of a chat batch operation.submit(BatchRequest<ChatRequest> request) Creates a batch of chat requests and submits them for asynchronous processing.
-
Method Details
-
submit
Creates a batch of chat requests and submits them for asynchronous processing.The returned
BatchResponserepresents the status of the batch operation.- Parameters:
request- the list of chat requests to process in the batch- Returns:
- a
BatchResponserepresenting the initial state of the batch operation
-
retrieve
Retrieves the current state and results of a chat batch operation.The response indicates whether the batch is still processing, completed successfully, or failed. Clients should poll this method at intervals until the batch completes.
- Parameters:
batchId- the batch identifier obtained fromsubmit(BatchRequest)- Returns:
- a
BatchResponserepresenting the current state of the chat batch operation
-
cancel
Cancels a chat batch operation that is currently pending or running.- Parameters:
batchId- the batch identifier to cancel
-
list
Lists chat batch jobs with optional pagination.- Parameters:
pagination- the maximum number of batch jobs to return and token for retrieving a specific page; if null, uses server default- Returns:
- a
BatchPagecontaining chat batch responses and pagination information
-