Ozan Armağan
|
6e24c06e35
|
Add support for vLLM RAG (#1563)
* Add support for vLLM RAG
* Add `max_bytes` propety for conversation models
* Fix truncate conversation test
* Refactoring
|
2024-02-26 11:18:14 +05:30 |
|
Ozan Armağan
|
4371cfd48e
|
Use regex for parsing CF RAG results and log full response when partial (#1554)
* Use regex for parsing CF RAG results and log full response when it is partial
* Refactor & add test
|
2024-02-16 09:07:14 +05:30 |
|
Ozan Armağan
|
ded3a5ec08
|
Fix context length for RAG models (#1544)
* Fix context length for RAG models
* Fix prompt for cloudflare model
* Fix error in multi search
* Add error handling for Cloudflare API response
|
2024-02-12 21:44:52 +05:30 |
|
ozanarmagan
|
6963aa3ece
|
Add support for Cloudflare Workers AI
|
2023-12-26 19:25:37 +03:00 |
|
ozanarmagan
|
822cb9c2cd
|
Remove system_prompt parameter and add it to conversation models & support excluding conversation history
|
2023-11-04 14:43:46 +03:00 |
|
ozanarmagan
|
405b0037da
|
Add support for conversation model endpoints + multi search with conversations
|
2023-09-14 23:09:30 +03:00 |
|