54 Commits

Author SHA1 Message Date
Kishore Nallan
776730b571 Improve prefix typos in fuzzy search. 2023-07-06 11:17:49 +05:30
Kishore Nallan
5eda7668b9 Refactor fuzzy search restrictions. 2023-02-14 14:28:39 +05:30
Kishore Nallan
e8e1772858 Refactor fuzzy search state transition.
Handle extra chars in the middle of a query.
2023-01-26 18:24:43 +05:30
Kishore Nallan
652472f32a Cmake compatible bazel build. 2022-12-18 20:49:51 +05:30
Kishore Nallan
c6ea968f01 Merge branch 'v0.25' into bazel-build
# Conflicts:
#	.gitignore
2022-12-15 21:19:31 +05:30
0x2Adr1
bbebb1a567 Bazel (#736) 2022-12-15 21:09:06 +05:30
Kishore Nallan
1b46165423 Reduce false positives when considering trailing typos. 2022-07-15 11:31:56 +05:30
Kishore Nallan
1d0917dc41 Fix typo correction edge case. 2022-06-05 18:00:27 +05:30
Kishore Nallan
c72abf89a6 More fixes for prefix expansion. 2022-05-21 20:47:02 +05:30
Kishore Nallan
b51c6b2ddf Fix art delete type. 2022-03-28 13:02:29 +05:30
Kishore Nallan
8ceb571cc7 Fix test resource leaks. 2021-12-31 18:53:31 +05:30
Kishore Nallan
35a2a05709 Prefix search must return exact matching string first. 2021-12-30 21:49:05 +05:30
Kishore Nallan
14f4c3e834 Field wise indexing. 2021-11-11 19:11:06 +05:30
Kishore Nallan
25d29919ae Multiplex frequency + score based token selection. 2021-09-19 16:31:28 +05:30
Kishore Nallan
d6e8156973 Fix test again. 2021-09-02 21:11:30 +05:30
Kishore Nallan
75263d52a3 Fix test. 2021-09-02 20:38:31 +05:30
Kishore Nallan
19badcd0cb Move to precise token candidate selection.
No longer approximate.
2021-09-02 17:44:32 +05:30
Kishore Nallan
55535198a4 Prefix search to be used only for last token. 2021-08-07 13:12:06 +05:30
Kishore Nallan
929b6796cc Posting list integration. 2021-06-18 12:52:06 +05:30
Kishore Nallan
8457947d6a Fix fuzzy search perf regression. 2021-05-12 18:24:30 +05:30
Kishore Nallan
b13d093b0a Refactor fuzzy search to address some obscure bugs. 2021-05-02 21:49:47 +05:30
Kishore Nallan
51f57d3dd7 Fixed an issue with prefix searching. 2021-04-23 18:32:25 +05:30
Kishore Nallan
b3b47f5651 Refactor highlighting + tokenizer to simplify logic. 2021-04-18 20:37:58 +05:30
Jason Bosco
a08fb7738f Exhaustive token searching with filter_ids 2021-01-22 20:06:18 -08:00
kishorenc
fe5bdf617c Fixed an edge case in trie fuzzy search. 2020-12-28 19:19:59 +05:30
kishorenc
6883b4db36 Speed up numerical filter + fixed edge case with negative value. 2020-12-28 19:19:59 +05:30
kishorenc
1ab9b65098 Force use of jemalloc via static linking. 2020-07-21 17:36:27 +05:30
kishorenc
a3024a7342 Fixed an edge case in fuzzy search with SKU-like tokens. 2020-05-20 19:17:43 +05:30
kishorenc
7c4d0f9286 Fixed a bug with Unicode search. 2019-09-25 20:41:58 +05:30
Kishore Nallan
2d7e75caa5 Support wildcard query: using a * would ignore the search query and consider all records for filter+sort. 2018-05-12 09:48:29 +05:30
Kishore Nallan
d4d1a9e3bd Fuzzy prefix searching should rely on the intermediate cost rather than the final cost.
Since we are only trying to match the prefix part, comparing the prefix with the full string makes no sense.
2018-04-21 20:14:10 +05:30
Kishore Nallan
6b37d43fd7 Use a proper logger for logging! 2018-02-02 08:22:23 -05:00
Kishore Nallan
13f4c72827 Use rocksdb batch put for atomic insertion. 2018-01-28 14:29:07 -05:00
Kishore Nallan
ba49971ddd Fixed a few more -Wall warnings. 2018-01-26 13:38:02 +00:00
Kishore Nallan
60288631be Check partial node text iteratively for prefix match. 2017-12-17 10:26:37 +05:30
Kishore Nallan
e24e0fae5d Node score should be an int32_t. 2017-09-21 19:40:41 +05:30
Kishore Nallan
e384b777a1 Collection operations on float fields. 2017-08-10 18:20:58 -04:00
Kishore Nallan
a2f475d7fc Enable ART to index and search on floating point numbers. 2017-08-09 18:17:26 -04:00
Kishore Nallan
8850f8b169 Patch RocksDB build to ensure that it links against only specific libraries. 2017-04-07 08:58:29 +05:30
Kishore Nallan
c698c3be1a Fix a few compile warnings/errors. 2017-03-30 18:25:37 +05:30
Kishore Nallan
4378083e73 Searching for an int32 against an int32 array. 2017-02-12 17:35:43 +05:30
Kishore Nallan
3339fdeec0 Added tests for int64 search. 2017-01-22 07:06:35 +05:30
Kishore Nallan
da68fb17e8 Support LESS_THAN and GREATER_THAN. 2017-01-22 05:40:10 +05:30
Kishore Nallan
af8e8ba460 No special trailing character is required when an int is serialized, since all serialized strings are of equal length. 2017-01-19 23:54:43 +05:30
Kishore Nallan
0fcdb6b479 Support signed ints in art int search. 2017-01-12 21:20:52 +05:30
Kishore Nallan
0b88e669f6 Make ART fuzzy_search take min_cost and max_cost instead of only max_cost. 2016-12-28 18:16:43 +05:30
Kishore Nallan
12276b651f Base work for supporting multiple indexable fields. 2016-12-22 22:26:33 +05:30
Kishore Nallan
9b0c347334 ART - integer range search. 2016-12-11 13:47:43 +05:30
Kishore Nallan
e1526319f7 Building up support for prefix based searching and for ranking token suggestions by either frequency or max_score. 2016-11-27 14:56:15 +05:30
Kishore Nallan
ea0da73cfb Fix C++11 warnings. 2016-11-13 09:56:13 +05:30