Class E5SmallV2QuantizedEmbeddingModel

All Implemented Interfaces:
EmbeddingModel

public class E5SmallV2QuantizedEmbeddingModel extends AbstractInProcessEmbeddingModel
Quantized Microsoft E5-small-v2 embedding model that runs within your Java application's process.

Maximum length of text (in tokens) that can be embedded at once: unlimited. However, while you can embed very long texts, the quality of the embedding degrades as the text lengthens. It is recommended to embed segments of no more than 512 tokens long.

Embedding dimensions: 384

It is recommended to use the "query:" prefix for queries and the "passage:" prefix for segments.

Uses an Executor to parallelize the embedding process. By default, uses a cached thread pool with the number of threads equal to the number of available processors. Threads are cached for 1 second.

More details here

  • Constructor Details

    • E5SmallV2QuantizedEmbeddingModel

      public E5SmallV2QuantizedEmbeddingModel()
      Creates an instance of an E5SmallV2QuantizedEmbeddingModel. Uses a cached thread pool with the number of threads equal to the number of available processors. Threads are cached for 1 second.
    • E5SmallV2QuantizedEmbeddingModel

      public E5SmallV2QuantizedEmbeddingModel(Executor executor)
      Creates an instance of an E5SmallV2QuantizedEmbeddingModel.
      Parameters:
      executor - The executor to use to parallelize the embedding process.
  • Method Details