3662 Commits

Author SHA1 Message Date
Kishore Nallan
ff6e0e43c8 Ignore errors during analytics json serialization. 2024-02-12 15:34:06 +05:30
Krunal Gandhi
c6b25a7964
return error response when stopword is not found (#1545)
* return error response when stopword is not found

* use shared_lock
2024-02-12 14:54:38 +05:30
Kishore Nallan
fc80cc3a72 Handle special characters within non-English locale.
Unless present in symbols to index / separators, it will be skipped.
2024-02-08 16:52:25 +05:30
Ozan Armağan
48df1e70e8
Fix voice query model path for the test (#1540) 2024-02-07 19:35:18 +05:30
Kishore Nallan
ea29b3bea1 Handle empty query while logging analytics. 2024-02-07 18:32:09 +05:30
Kishore Nallan
062d9c1cee Remove stray log. 2024-02-07 18:25:27 +05:30
Harpreet Sangar
a94062481f
Fix crash on calling compute_string_components multiple times in a complex filter query. (#1539)
* Add failing test.

* Fix crash on calling  multiple times in  a complex filter query.

* Rename `compute_result` to `compute_string_components`.
2024-02-07 12:48:19 +05:30
Krunal Gandhi
7b616b19fa
fix asan warning in analytics manager tests (#1537)
* reorder tests to fix asan issues

* fix iterator invalidation

* make approach more verbose
2024-02-06 18:17:57 +05:30
Krunal Gandhi
562e152d9b
remove error response when deleting non existing doc (#1532)
* remove error response when deleting non existing doc

* handle other response when doc not found

* add test and refactor approach

* add response msg when doc not found

* add optional flag ignore_not_found

* change reponse msg
2024-02-06 15:49:05 +05:30
Ozan Armağan
559b2de337
Fix duplicate results from vector index (#1536) 2024-02-06 13:45:59 +05:30
Harpreet Sangar
56a69d9844
Compute filter result into an array. (#1533)
* Compute filter result into an array.

* Add tests.

* Add tests.
2024-02-06 12:55:47 +05:30
Krunal Gandhi
36fcdccddc
add typo_prefix score and num_tokens_dropped in text_match_info (#1529)
* add num_drop_tokens info in text_match_info

* add typo_prefix_score in text_match_info

* add more tests

* add test with drop_token_threshold=1
2024-02-05 20:44:39 +05:30
Kishore Nallan
fb6cf36604 Make cache num entries configurable. 2024-02-05 10:53:38 +05:30
Kishore Nallan
4a2892c886 Add flag for logging search query at the start of req cycle. 2024-02-02 16:12:56 +05:30
Kishore Nallan
0c814cb3d2 Remove unwanted build flags. 2024-02-02 15:34:44 +05:30
Harpreet Sangar
7e9cb789a7
String not equals filter logic refactoring. (#1528)
* Compute result in case of string not equals filter matching too many ids.

* Deleted id should not be considered a match for string not equals filter.

* Add tests.

* Use `id_list_t::iterator_t` to check if `seq_id` exists in index.
2024-02-02 15:32:18 +05:30
Krunal Gandhi
206cecffbd
Event analytics revised (#1522)
* remove query hits aggregation & store

* refactor analytics changes

* avoid string copy

* typo correction

* event analytics revised

* refactoring code

* add test for collection array
2024-02-02 11:50:42 +05:30
Ozan Armağan
18306e3dd2
Make HNSW params configurable (#1521) 2024-01-30 18:30:03 +05:30
Kishore Nallan
f9242dd4a5 Don't highlight very large docs > 64K words. 2024-01-28 17:47:43 +05:30
Harpreet Sangar
ec4311635e
Fix document update failing if reference field was not included. (#1515) 2024-01-27 12:26:36 +05:30
Kishore Nallan
525995c7e9 Make analytics event cache a member. 2024-01-26 22:14:02 +05:30
Kishore Nallan
e898daa0a1 Early return when analytics log not open. 2024-01-26 18:50:16 +05:30
Kishore Nallan
a290fd6205 Always populate ip address. 2024-01-26 18:49:01 +05:30
Kishore Nallan
71f905f18c Flush analytics event log on every persist call. 2024-01-26 16:33:04 +05:30
Kishore Nallan
261664fe11 Add flag to prevent bazel cache bust on clion.
Clion adds this flag during debugging which causes onnxruntime to be rebuilt from scratch when switched between command line and clion build.
2024-01-26 05:36:21 +05:30
Harpreet Sangar
b0e523caa5
filter_result_iterator_t shouldn't timeout during update and delete by filter query operations. (#1511) 2024-01-25 15:09:02 +05:30
Kishore Nallan
94c560429d Fix hash-based faceting for coerced field values.
When an integer field is coerced to string via schema update, the underlying data on disk will still be integer. We have to account for this during hash based faceting.
2024-01-25 11:19:05 +05:30
Krunal Gandhi
77ade5ff65
add resource error response in /health end-point (#1505)
* add resource error response in /health end-point

* use resource_check_t enum for error response

* return enum instead of string error
2024-01-24 12:30:56 +05:30
Kishore Nallan
544e22830c Use fusion flag for speeding up debug linking. 2024-01-23 13:04:38 +05:30
Harpreet Sangar
3ade1f7895
Fix wildcard search with filter_by returning found: 1 in case of empty collection. (#1504) 2024-01-23 12:02:44 +05:30
Krunal Gandhi
cde131f4b3
Refactor analytics changes (#1502)
* remove query hits aggregation & store

* refactor analytics changes

* avoid string copy

* typo correction
2024-01-22 17:37:29 +05:30
Krunal Gandhi
e463d2f0f1
export swap used in /metrics.json end-point (#1499)
* export swap used in /metrics.json end-point

* remove logs

* add macro to run on linux only

* update the swap usage fetch approach

* remove check to maintain consistent API response
2024-01-22 13:40:39 +05:30
Kishore Nallan
37f8ea0dbd Account for no api passed in header for multi_search. 2024-01-21 21:58:18 +05:30
Ozan Armağan
0f54bf7101
Add stemming support for fields (#1498)
* Add stemming support for fields

* Add tests for stemming

* Check if the stemmer is  loaded correctly

* Add mutex for stemmer initialization and usage

* Fix get_stemmer_mutex() function signature

* Add StemmerManager class and refactor field.cpp and index.cpp

* Add mutex for thread safety in StemmerManager

* update delete_all_stemmers()

* Update field property name from "stemming" to "stem"
2024-01-20 21:18:54 +05:30
Kishore Nallan
f5add7865a Fix synonym test 2024-01-20 11:37:00 +05:30
Kishore Nallan
efa73804ae Fix stray lock 2024-01-20 09:58:24 +05:30
Kishore Nallan
525b75268c Improve locking for synonym index. 2024-01-19 22:00:19 +05:30
Kishore Nallan
60e599e283 Don't limit results for phrase search. 2024-01-19 16:55:50 +05:30
Krunal Gandhi
9565efefb3
remove query hits aggregation & store (#1496) 2024-01-19 10:57:25 +05:30
Kishore Nallan
964774c1a3 Merge branch 'main' into v0.26-facets 2024-01-19 09:12:12 +05:30
Jason Bosco
303a48c473 Add whisper to gpu dependencies bundle 2024-01-18 17:14:50 -06:00
Kishore Nallan
897fd0d698 Refactor new line response in import. 2024-01-18 07:04:02 +05:30
Krunal Gandhi
e51055712c
facilitate to add metadata while creating collection (#1491)
* facilitate to add metadata while creating collection

* allow empty metadata keys

* refactor style
2024-01-17 18:57:32 +05:30
Jason Bosco
94f6af39ab Update build script 2024-01-16 20:55:14 -06:00
Krunal Gandhi
85dd93bc10
Disable typo tolerance for digits (#1478)
* set max_cost to 0 when matching digits in art_fuzzy_search

* fix condition of trailing digits

* refactor the approach

* updating test

* change param name
2024-01-16 15:01:17 +05:30
Kishore Nallan
36af68a672
Merge pull request #1479 from ozanarmagan/v0.26-facets
Whisper runtime dynamic library linking for GPU inference
2024-01-15 15:11:22 +05:30
Kishore Nallan
d73d13f5bd Use LLD for linking.
Without this, debugging on Clion on Bazel 6 breaks.
2024-01-15 11:29:14 +03:00
ozanarmagan
319c40a95f Update whisper build to make GPU inference optional with runtime dynamic liking 2024-01-12 18:36:34 +03:00
Kishore Nallan
c1a71bc83f Handle autodelete key property not being available. 2024-01-12 08:39:31 +05:30
Kishore Nallan
a15ef0e9ae Merge branch 'main' into v0.26-facets 2024-01-12 06:49:14 +05:30