- When embedding documents, allow batching the documents using some criteria.
- Add a `BatchingStrategy` interface with a `TokenCountBatchingStrategy` implementation that uses the OpenAI max input token size of 8191 as the default.
- Add a default method in `EmbeddingModel` to embed documents using this new batching strategy.
- Change `MilvusVectorStore` to make use of this new batching API.
- Add unit tests for `TokenCountBatchingStrategy`.
- Add an OpenAI integration test to call the embed API that uses batching.

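
For readers unfamiliar with the idea, token-count batching can be sketched roughly as below. This is a minimal illustration, not the code in this PR: the class `TokenBatchingSketch`, the methods `estimateTokens` and `batch`, and the whitespace-based token estimate are all hypothetical stand-ins. Only the 8191 default comes from the description above.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of greedy token-count batching; not the Spring AI implementation.
public class TokenBatchingSketch {

    // Default limit taken from the PR description (OpenAI max input token size).
    static final int DEFAULT_MAX_TOKENS = 8191;

    // Assumption: a crude stand-in for a real tokenizer that counts
    // whitespace-separated words instead of model tokens.
    static int estimateTokens(String text) {
        String trimmed = text.trim();
        return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
    }

    // Greedily packs documents, in order, into batches whose estimated
    // token total does not exceed maxTokensPerBatch.
    static List<List<String>> batch(List<String> documents, int maxTokensPerBatch) {
        List<List<String>> batches = new ArrayList<>();
        List<String> current = new ArrayList<>();
        int currentTokens = 0;

        for (String doc : documents) {
            int tokens = estimateTokens(doc);
            if (tokens > maxTokensPerBatch) {
                // A single document that cannot fit in any batch is rejected.
                throw new IllegalArgumentException(
                        "Document exceeds the per-batch token limit: " + tokens);
            }
            if (currentTokens + tokens > maxTokensPerBatch) {
                // Adding this document would overflow: close the current batch.
                batches.add(current);
                current = new ArrayList<>();
                currentTokens = 0;
            }
            current.add(doc);
            currentTokens += tokens;
        }
        if (!current.isEmpty()) {
            batches.add(current);
        }
        return batches;
    }

    public static void main(String[] args) {
        // A small limit of 5 is used here only to make the split visible.
        List<List<String>> batches =
                batch(List.of("one two three", "four five", "six"), 5);
        System.out.println(batches); // prints [[one two three, four five], [six]]
    }
}
```

A caller would then issue one embed request per batch instead of one request for the whole document list, which keeps each request under the model's input limit.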
Resolves https://github.com/spring-projects/spring-ai/issues/1214
Comment From: markpollack
merged in 949f1ed4e80345fe45acb0ba75de32f09989f111