Harpreet Sangar
ec186c032e
Empty filter value fix. ( #1571 )
...
* Empty filter value fix.
* Add test case.
2024-02-22 15:21:49 +05:30
Kishore Nallan
0013c50ffd
Fix mac arm build.
2024-02-22 10:11:00 +05:30
Ozan Armağan
670b97dbcd
Fix snowball bazel file for MacOS ( #1566 )
2024-02-21 16:03:54 +05:30
Kishore Nallan
f1e1485d6b
Fix nohits query aggregation.
2024-02-20 20:45:49 +05:30
Ozan Armağan
5722a9dddd
Add support for openai/text-embedding-3-*
( #1558 )
...
* Add support for `openai/text-embedding-3-`
* Fix test
* Rename openai_custom_dims to has_custom_dims
* Sİmplify custom dimension check condition
2024-02-20 18:26:59 +05:30
Krunal Gandhi
285eb2fa5d
fix excluding upper range val in search ( #1564 )
...
* fix excluding upper range val in search
* check lower_range while searching for range
2024-02-20 16:51:46 +05:30
Jason Bosco
996224ee91
Don't error out if bazel cache is not found
2024-02-19 22:06:00 +05:30
Krunal Gandhi
fe2a0be564
refactor counter events with tests ( #1557 )
...
* add tests for persisting events
* fix test by adding unique event name
* fix persistance with analytics events
* early return raft_server check
* increment populairty count instead of overwrite
* extract method serialize_as_docs
* move func definition from header to source
* counter events refactor and tests
* fix test
* configure log_to_file per counter event
2024-02-19 20:47:32 +05:30
Krunal Gandhi
01bedbb342
fix the wrong boolean val ( #1559 )
2024-02-19 20:31:06 +05:30
Kishore Nallan
7b7f0c79d7
Optimize id list iteration.
2024-02-18 07:49:37 +05:30
Ozan Armağan
4371cfd48e
Use regex for parsing CF RAG results and log full response when partial ( #1554 )
...
* Use regex for parsing CF RAG results and log full response when it is partial
* Refactor & add test
2024-02-16 09:07:14 +05:30
Kishore Nallan
6e2931bd02
Add guard for long hostnames.
2024-02-15 17:19:36 +05:30
Krunal Gandhi
c7c24e6ab9
Analytics manager fixes ( #1553 )
...
* add tests for persisting events
* fix test by adding unique event name
* fix persistance with analytics events
* early return raft_server check
* increment populairty count instead of overwrite
* extract method serialize_as_docs
* move func definition from header to source
2024-02-15 16:51:08 +05:30
Kishore Nallan
bf8a2fc6e5
Don't accept empty nodes file.
2024-02-14 21:39:50 +05:30
Kishore Nallan
cc980c2ecd
Treat zero facet sample percent as not sampled.
2024-02-14 12:29:48 +05:30
Vegard Stikbakke
420c55ee6d
Fix typo in error message for non-sortable field ( #1547 )
2024-02-13 17:38:30 +05:30
Kishore Nallan
f7c1678cc3
Handle wraparound of token offset for large doc highlight.
2024-02-13 17:38:23 +05:30
Ozan Armağan
76c57ec407
Add ts
namespace for voice query models ( #1549 )
...
* Add `ts` namespace for voice query models
* Fix voice query test
* Update error message in invalid voice query test
2024-02-13 16:39:31 +05:30
Ozan Armağan
59cc66248d
Add stemming for queries ( #1548 )
2024-02-13 15:05:56 +05:30
Kishore Nallan
da8cac463c
Fix inheritance of sort field property for nested field.
2024-02-12 21:47:33 +05:30
Harpreet Sangar
54492fafdc
num_tree iterator ( #1538 )
...
* Add `num_tree_t::iterator_t`.
* Add `num_tree_t::iterator_t` tests.
* Add `bool_iterator` in `filter_result_iterator_t`.
* Fix `filter_result_iterator_t::compute_iterators`.
2024-02-12 21:45:49 +05:30
Ozan Armağan
ded3a5ec08
Fix context length for RAG models ( #1544 )
...
* Fix context length for RAG models
* Fix prompt for cloudflare model
* Fix error in multi search
* Add error handling for Cloudflare API response
2024-02-12 21:44:52 +05:30
Kishore Nallan
ff6e0e43c8
Ignore errors during analytics json serialization.
2024-02-12 15:34:06 +05:30
Krunal Gandhi
c6b25a7964
return error response when stopword is not found ( #1545 )
...
* return error response when stopword is not found
* use shared_lock
2024-02-12 14:54:38 +05:30
Kishore Nallan
fc80cc3a72
Handle special characters within non-English locale.
...
Unless present in symbols to index / separators, it will be skipped.
2024-02-08 16:52:25 +05:30
Ozan Armağan
48df1e70e8
Fix voice query model path for the test ( #1540 )
2024-02-07 19:35:18 +05:30
Kishore Nallan
ea29b3bea1
Handle empty query while logging analytics.
2024-02-07 18:32:09 +05:30
Kishore Nallan
062d9c1cee
Remove stray log.
2024-02-07 18:25:27 +05:30
Harpreet Sangar
a94062481f
Fix crash on calling compute_string_components
multiple times in a complex filter query. ( #1539 )
...
* Add failing test.
* Fix crash on calling multiple times in a complex filter query.
* Rename `compute_result` to `compute_string_components`.
2024-02-07 12:48:19 +05:30
Krunal Gandhi
7b616b19fa
fix asan warning in analytics manager tests ( #1537 )
...
* reorder tests to fix asan issues
* fix iterator invalidation
* make approach more verbose
2024-02-06 18:17:57 +05:30
Krunal Gandhi
562e152d9b
remove error response when deleting non existing doc ( #1532 )
...
* remove error response when deleting non existing doc
* handle other response when doc not found
* add test and refactor approach
* add response msg when doc not found
* add optional flag ignore_not_found
* change reponse msg
2024-02-06 15:49:05 +05:30
Ozan Armağan
559b2de337
Fix duplicate results from vector index ( #1536 )
2024-02-06 13:45:59 +05:30
Harpreet Sangar
56a69d9844
Compute filter result into an array. ( #1533 )
...
* Compute filter result into an array.
* Add tests.
* Add tests.
2024-02-06 12:55:47 +05:30
Krunal Gandhi
36fcdccddc
add typo_prefix score and num_tokens_dropped in text_match_info ( #1529 )
...
* add num_drop_tokens info in text_match_info
* add typo_prefix_score in text_match_info
* add more tests
* add test with drop_token_threshold=1
2024-02-05 20:44:39 +05:30
Kishore Nallan
fb6cf36604
Make cache num entries configurable.
2024-02-05 10:53:38 +05:30
Kishore Nallan
4a2892c886
Add flag for logging search query at the start of req cycle.
2024-02-02 16:12:56 +05:30
Kishore Nallan
0c814cb3d2
Remove unwanted build flags.
2024-02-02 15:34:44 +05:30
Harpreet Sangar
7e9cb789a7
String not equals filter logic refactoring. ( #1528 )
...
* Compute result in case of string not equals filter matching too many ids.
* Deleted id should not be considered a match for string not equals filter.
* Add tests.
* Use `id_list_t::iterator_t` to check if `seq_id` exists in index.
2024-02-02 15:32:18 +05:30
Krunal Gandhi
206cecffbd
Event analytics revised ( #1522 )
...
* remove query hits aggregation & store
* refactor analytics changes
* avoid string copy
* typo correction
* event analytics revised
* refactoring code
* add test for collection array
2024-02-02 11:50:42 +05:30
Ozan Armağan
18306e3dd2
Make HNSW params configurable ( #1521 )
2024-01-30 18:30:03 +05:30
Kishore Nallan
f9242dd4a5
Don't highlight very large docs > 64K words.
2024-01-28 17:47:43 +05:30
Harpreet Sangar
ec4311635e
Fix document update failing if reference field was not included. ( #1515 )
2024-01-27 12:26:36 +05:30
Kishore Nallan
525995c7e9
Make analytics event cache a member.
2024-01-26 22:14:02 +05:30
Kishore Nallan
e898daa0a1
Early return when analytics log not open.
2024-01-26 18:50:16 +05:30
Kishore Nallan
a290fd6205
Always populate ip address.
2024-01-26 18:49:01 +05:30
Kishore Nallan
71f905f18c
Flush analytics event log on every persist call.
2024-01-26 16:33:04 +05:30
Kishore Nallan
261664fe11
Add flag to prevent bazel cache bust on clion.
...
Clion adds this flag during debugging which causes onnxruntime to be rebuilt from scratch when switched between command line and clion build.
2024-01-26 05:36:21 +05:30
Harpreet Sangar
b0e523caa5
filter_result_iterator_t
shouldn't timeout during update and delete by filter query operations. (#1511 )
2024-01-25 15:09:02 +05:30
Kishore Nallan
94c560429d
Fix hash-based faceting for coerced field values.
...
When an integer field is coerced to string via schema update, the underlying data on disk will still be integer. We have to account for this during hash based faceting.
2024-01-25 11:19:05 +05:30
Krunal Gandhi
77ade5ff65
add resource error response in /health end-point ( #1505 )
...
* add resource error response in /health end-point
* use resource_check_t enum for error response
* return enum instead of string error
2024-01-24 12:30:56 +05:30