- Fix OpenAI ChatModel's call() operation
  - When tool calling is used, calculate the cumulative usage from the preceding ChatResponses
- Fix OpenAI ChatModel's stream() operation
  - Make sure that the cumulative usage is calculated from the ChatResponse that has a valid usage
  - Use an overlapping buffer to check and store the usage from the response that holds it
- Add tests for both call() and stream()
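
The two fixes above can be sketched roughly as follows. This is a minimal, self-contained illustration in plain Java, not the code from this change: `Usage`, `Chunk`, `cumulative` and `attachUsage` are hypothetical stand-ins for the real response and usage types, and the sliding window over a list only imitates an overlapping buffer of size 2 with step 1 on a stream.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-ins for illustration only; these are not the Spring AI types.
class UsageSketch {

    record Usage(int promptTokens, int completionTokens) {
        int totalTokens() {
            return promptTokens + completionTokens;
        }

        Usage plus(Usage other) {
            return new Usage(promptTokens + other.promptTokens,
                    completionTokens + other.completionTokens);
        }
    }

    // A streamed chunk; usage is null when the chunk carries no usage data.
    record Chunk(String text, Usage usage) {
    }

    static boolean isEmpty(Usage usage) {
        return usage == null || usage.totalTokens() == 0;
    }

    // call(): add up the usage of every response in a tool-calling chain,
    // skipping responses that report no usage.
    static Usage cumulative(List<Usage> perResponse) {
        Usage total = new Usage(0, 0);
        for (Usage usage : perResponse) {
            if (!isEmpty(usage)) {
                total = total.plus(usage);
            }
        }
        return total;
    }

    // stream(): look at overlapping pairs (current, next). If the next chunk
    // holds a valid usage, copy it onto the current chunk; otherwise emit the
    // current chunk unchanged. The last chunk has no successor and is emitted as is.
    static List<Chunk> attachUsage(List<Chunk> chunks) {
        List<Chunk> out = new ArrayList<>();
        for (int i = 0; i < chunks.size(); i++) {
            Chunk current = chunks.get(i);
            Chunk next = (i + 1 < chunks.size()) ? chunks.get(i + 1) : null;
            if (next != null && !isEmpty(next.usage())) {
                out.add(new Chunk(current.text(), next.usage()));
            } else {
                out.add(current);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(cumulative(List.of(new Usage(10, 5), new Usage(20, 7))));
        System.out.println(attachUsage(List.of(
                new Chunk("Hel", null),
                new Chunk("lo", null),
                new Chunk("", new Usage(10, 5)))));
    }
}
```

The window size of two is the assumption that matters here: it presumes the usage arrives on the chunk immediately after the last content chunk, so looking one element ahead is enough to pair it with the content it belongs to.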
Comment From: ilayaperumalg
This was merged as 37cf20dfe4679a17e96759c139b4ba3ec765fe9c