Harpreet Sangar
1f6fbed372
Refactor referenced_in
initialization of a collection. ( #1585 )
...
* Refactor `referenced_in` initialization of a collection.
* Review changes.
* Review changes.
---------
Co-authored-by: Kishore Nallan <kishorenc@gmail.com>
2024-02-28 20:48:31 +05:30
Krunal Gandhi
482858d05d
pagination for collections GET ( #1581 )
...
* add limit and offset support for collection
* use req->params, dont restrict limit param when records are less
* add more checks and not sort with pagination
* minor updations
2024-02-28 19:09:33 +05:30
Ozan Armağan
c30fc2791f
Add support for custom OpenAI API URL ( #1583 )
...
* Add support for custom OpenAI API URL
* Fix test
2024-02-28 15:08:37 +05:30
Krunal Gandhi
f514c42e2d
Fix pinned_hits_str ids order ( #1578 )
...
* fix pinned_hits_with_grouping
* remove repeated group_limit check
* fix pinned_hits ids order
* remove redundant sort call
2024-02-27 11:26:23 +05:30
Krunal Gandhi
7762f2d080
respect num of groups when typo_tokens_threshold is applied ( #1574 )
...
* respect num of groups when typo_tokens_threshold is applied
* update changes after merge
2024-02-26 15:56:08 +05:30
Ozan Armağan
6e24c06e35
Add support for vLLM RAG ( #1563 )
...
* Add support for vLLM RAG
* Add `max_bytes` propety for conversation models
* Fix truncate conversation test
* Refactoring
2024-02-26 11:18:14 +05:30
Kishore Nallan
1b5846fe49
Remove old search_field API and code.
2024-02-24 12:28:51 +05:30
Krunal Gandhi
d1079c633c
fix pinned hits with grouping and filter ( #1572 )
...
* fix pinned_hits_with_grouping
* remove repeated group_limit check
2024-02-22 18:47:42 +05:30
Ozan Armağan
2c21e1306b
Fix stemming bug when searching with synonyms ( #1569 )
...
* Fix stemming bug when searching with synonyms
* Remove unnecessary parameter in `parse_search`query`
2024-02-22 16:26:03 +05:30
Harpreet Sangar
ec186c032e
Empty filter value fix. ( #1571 )
...
* Empty filter value fix.
* Add test case.
2024-02-22 15:21:49 +05:30
Kishore Nallan
f1e1485d6b
Fix nohits query aggregation.
2024-02-20 20:45:49 +05:30
Ozan Armağan
5722a9dddd
Add support for openai/text-embedding-3-*
( #1558 )
...
* Add support for `openai/text-embedding-3-`
* Fix test
* Rename openai_custom_dims to has_custom_dims
* Sİmplify custom dimension check condition
2024-02-20 18:26:59 +05:30
Krunal Gandhi
285eb2fa5d
fix excluding upper range val in search ( #1564 )
...
* fix excluding upper range val in search
* check lower_range while searching for range
2024-02-20 16:51:46 +05:30
Krunal Gandhi
fe2a0be564
refactor counter events with tests ( #1557 )
...
* add tests for persisting events
* fix test by adding unique event name
* fix persistance with analytics events
* early return raft_server check
* increment populairty count instead of overwrite
* extract method serialize_as_docs
* move func definition from header to source
* counter events refactor and tests
* fix test
* configure log_to_file per counter event
2024-02-19 20:47:32 +05:30
Krunal Gandhi
01bedbb342
fix the wrong boolean val ( #1559 )
2024-02-19 20:31:06 +05:30
Kishore Nallan
7b7f0c79d7
Optimize id list iteration.
2024-02-18 07:49:37 +05:30
Ozan Armağan
4371cfd48e
Use regex for parsing CF RAG results and log full response when partial ( #1554 )
...
* Use regex for parsing CF RAG results and log full response when it is partial
* Refactor & add test
2024-02-16 09:07:14 +05:30
Kishore Nallan
6e2931bd02
Add guard for long hostnames.
2024-02-15 17:19:36 +05:30
Krunal Gandhi
c7c24e6ab9
Analytics manager fixes ( #1553 )
...
* add tests for persisting events
* fix test by adding unique event name
* fix persistance with analytics events
* early return raft_server check
* increment populairty count instead of overwrite
* extract method serialize_as_docs
* move func definition from header to source
2024-02-15 16:51:08 +05:30
Kishore Nallan
cc980c2ecd
Treat zero facet sample percent as not sampled.
2024-02-14 12:29:48 +05:30
Kishore Nallan
f7c1678cc3
Handle wraparound of token offset for large doc highlight.
2024-02-13 17:38:23 +05:30
Ozan Armağan
76c57ec407
Add ts
namespace for voice query models ( #1549 )
...
* Add `ts` namespace for voice query models
* Fix voice query test
* Update error message in invalid voice query test
2024-02-13 16:39:31 +05:30
Ozan Armağan
59cc66248d
Add stemming for queries ( #1548 )
2024-02-13 15:05:56 +05:30
Kishore Nallan
da8cac463c
Fix inheritance of sort field property for nested field.
2024-02-12 21:47:33 +05:30
Harpreet Sangar
54492fafdc
num_tree iterator ( #1538 )
...
* Add `num_tree_t::iterator_t`.
* Add `num_tree_t::iterator_t` tests.
* Add `bool_iterator` in `filter_result_iterator_t`.
* Fix `filter_result_iterator_t::compute_iterators`.
2024-02-12 21:45:49 +05:30
Krunal Gandhi
c6b25a7964
return error response when stopword is not found ( #1545 )
...
* return error response when stopword is not found
* use shared_lock
2024-02-12 14:54:38 +05:30
Kishore Nallan
fc80cc3a72
Handle special characters within non-English locale.
...
Unless present in symbols to index / separators, it will be skipped.
2024-02-08 16:52:25 +05:30
Ozan Armağan
48df1e70e8
Fix voice query model path for the test ( #1540 )
2024-02-07 19:35:18 +05:30
Harpreet Sangar
a94062481f
Fix crash on calling compute_string_components
multiple times in a complex filter query. ( #1539 )
...
* Add failing test.
* Fix crash on calling multiple times in a complex filter query.
* Rename `compute_result` to `compute_string_components`.
2024-02-07 12:48:19 +05:30
Krunal Gandhi
562e152d9b
remove error response when deleting non existing doc ( #1532 )
...
* remove error response when deleting non existing doc
* handle other response when doc not found
* add test and refactor approach
* add response msg when doc not found
* add optional flag ignore_not_found
* change reponse msg
2024-02-06 15:49:05 +05:30
Ozan Armağan
559b2de337
Fix duplicate results from vector index ( #1536 )
2024-02-06 13:45:59 +05:30
Harpreet Sangar
56a69d9844
Compute filter result into an array. ( #1533 )
...
* Compute filter result into an array.
* Add tests.
* Add tests.
2024-02-06 12:55:47 +05:30
Krunal Gandhi
36fcdccddc
add typo_prefix score and num_tokens_dropped in text_match_info ( #1529 )
...
* add num_drop_tokens info in text_match_info
* add typo_prefix_score in text_match_info
* add more tests
* add test with drop_token_threshold=1
2024-02-05 20:44:39 +05:30
Harpreet Sangar
7e9cb789a7
String not equals filter logic refactoring. ( #1528 )
...
* Compute result in case of string not equals filter matching too many ids.
* Deleted id should not be considered a match for string not equals filter.
* Add tests.
* Use `id_list_t::iterator_t` to check if `seq_id` exists in index.
2024-02-02 15:32:18 +05:30
Krunal Gandhi
206cecffbd
Event analytics revised ( #1522 )
...
* remove query hits aggregation & store
* refactor analytics changes
* avoid string copy
* typo correction
* event analytics revised
* refactoring code
* add test for collection array
2024-02-02 11:50:42 +05:30
Ozan Armağan
18306e3dd2
Make HNSW params configurable ( #1521 )
2024-01-30 18:30:03 +05:30
Kishore Nallan
f9242dd4a5
Don't highlight very large docs > 64K words.
2024-01-28 17:47:43 +05:30
Harpreet Sangar
ec4311635e
Fix document update failing if reference field was not included. ( #1515 )
2024-01-27 12:26:36 +05:30
Kishore Nallan
525995c7e9
Make analytics event cache a member.
2024-01-26 22:14:02 +05:30
Kishore Nallan
94c560429d
Fix hash-based faceting for coerced field values.
...
When an integer field is coerced to string via schema update, the underlying data on disk will still be integer. We have to account for this during hash based faceting.
2024-01-25 11:19:05 +05:30
Harpreet Sangar
3ade1f7895
Fix wildcard search with filter_by returning found: 1
in case of empty collection. ( #1504 )
2024-01-23 12:02:44 +05:30
Ozan Armağan
0f54bf7101
Add stemming support for fields ( #1498 )
...
* Add stemming support for fields
* Add tests for stemming
* Check if the stemmer is loaded correctly
* Add mutex for stemmer initialization and usage
* Fix get_stemmer_mutex() function signature
* Add StemmerManager class and refactor field.cpp and index.cpp
* Add mutex for thread safety in StemmerManager
* update delete_all_stemmers()
* Update field property name from "stemming" to "stem"
2024-01-20 21:18:54 +05:30
Krunal Gandhi
9565efefb3
remove query hits aggregation & store ( #1496 )
2024-01-19 10:57:25 +05:30
Krunal Gandhi
e51055712c
facilitate to add metadata while creating collection ( #1491 )
...
* facilitate to add metadata while creating collection
* allow empty metadata keys
* refactor style
2024-01-17 18:57:32 +05:30
Krunal Gandhi
85dd93bc10
Disable typo tolerance for digits ( #1478 )
...
* set max_cost to 0 when matching digits in art_fuzzy_search
* fix condition of trailing digits
* refactor the approach
* updating test
* change param name
2024-01-16 15:01:17 +05:30
Kishore Nallan
cb4cac8d07
Merge pull request #1476 from ozanarmagan/v0.26-facets
...
Add support for voice query
2024-01-11 13:06:50 +05:30
ozanarmagan
c938c8440e
Fix tests
2024-01-11 00:51:19 +03:00
ozanarmagan
1b22de4df8
Refactor to use const references
2024-01-10 22:46:13 +03:00
Kishore Nallan
a79356e055
Match record validation behavior during alter with other places.
...
This ensures that we don't end up making the validation stricter after a bad record has been indexed earlier.
2024-01-10 17:22:33 +05:30
ozanarmagan
47284942ab
Update voice query test
2024-01-10 14:30:59 +03:00