6 Commits

Author SHA1 Message Date
Ozan Armağan
6e24c06e35
Add support for vLLM RAG (#1563)
* Add support for vLLM RAG

* Add `max_bytes` property for conversation models

* Fix truncate conversation test

* Refactoring
2024-02-26 11:18:14 +05:30
Ozan Armağan
4371cfd48e
Use regex for parsing CF RAG results and log full response when partial (#1554)
* Use regex for parsing CF RAG results and log full response when it is partial

* Refactor & add test
2024-02-16 09:07:14 +05:30
Ozan Armağan
ded3a5ec08
Fix context length for RAG models (#1544)
* Fix context length for RAG models

* Fix prompt for cloudflare model

* Fix error in multi search

* Add error handling for Cloudflare API response
2024-02-12 21:44:52 +05:30
ozanarmagan
6963aa3ece
Add support for Cloudflare Workers AI
2023-12-26 19:25:37 +03:00
ozanarmagan
822cb9c2cd
Remove system_prompt parameter and add it to conversation models & support excluding conversation history
2023-11-04 14:43:46 +03:00
ozanarmagan
405b0037da
Add support for conversation model endpoints + multi search with conversations
2023-09-14 23:09:30 +03:00