Class InferenceOperator

All Implemented Interfaces:
Closeable, AutoCloseable, Operator, org.elasticsearch.core.Releasable
Direct Known Subclasses:
CompletionOperator, RerankOperator, TextEmbeddingOperator

public abstract class InferenceOperator extends AsyncOperator<InferenceOperator.OngoingInferenceResult>
An abstract asynchronous operator that performs throttled bulk inference execution.

The InferenceOperator integrates with the compute framework. It transforms each input Page into inference requests, executes them asynchronously under a throttle, and uses the responses to build a new Page.
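
The request/response flow described above can be illustrated with a minimal, standalone sketch. It is not the Elasticsearch implementation and does not use the compute framework API: the class name ThrottledInferenceSketch, the use of a List<String> as a stand-in for a Page, the Function<String, String> stand-in for the inference service, and the semaphore-based throttle are all assumptions made for illustration. It only shows the general pattern of turning rows into requests, running a bounded number of them concurrently, and assembling the responses in input order.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Semaphore;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;

// Hypothetical sketch: none of these names come from Elasticsearch.
class ThrottledInferenceSketch {
    private final int maxConcurrent;
    private final Semaphore permits;                 // the throttle (assumed mechanism)
    private final Function<String, String> model;    // stand-in for an inference service
    private final AtomicInteger inFlight = new AtomicInteger();
    private final AtomicInteger peakInFlight = new AtomicInteger();

    ThrottledInferenceSketch(int maxConcurrent, Function<String, String> model) {
        this.maxConcurrent = maxConcurrent;
        this.permits = new Semaphore(maxConcurrent);
        this.model = model;
    }

    // Takes the rows of an input "page" and returns the rows of an output "page".
    List<String> process(List<String> inputRows) {
        ExecutorService pool = Executors.newFixedThreadPool(maxConcurrent);
        List<CompletableFuture<String>> pending = new ArrayList<>();
        try {
            for (String row : inputRows) {
                // Block until a permit is free, so at most maxConcurrent requests run at once.
                permits.acquire();
                pending.add(CompletableFuture.supplyAsync(() -> {
                    int now = inFlight.incrementAndGet();
                    peakInFlight.accumulateAndGet(now, Math::max);
                    try {
                        return model.apply(row);     // one inference request per row
                    } finally {
                        inFlight.decrementAndGet();
                        permits.release();           // let the next request start
                    }
                }, pool));
            }
            // Collect responses in submission order to build the output rows.
            List<String> output = new ArrayList<>();
            for (CompletableFuture<String> response : pending) {
                output.add(response.join());
            }
            return output;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting for a permit", e);
        } finally {
            pool.shutdown();
        }
    }

    // Highest number of requests observed running at the same time.
    int peakInFlight() {
        return peakInFlight.get();
    }

    public static void main(String[] args) {
        ThrottledInferenceSketch op = new ThrottledInferenceSketch(2, s -> s.toUpperCase());
        System.out.println(op.process(List.of("a", "b", "c")));
        System.out.println("peak in-flight requests: " + op.peakInFlight());
    }
}
```

In this sketch the submitting thread blocks when the throttle is saturated. A real asynchronous operator would more likely avoid blocking and instead signal to the driver that it cannot accept more input yet; that behaviour is not modelled here.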