140 Commits

Author SHA1 Message Date
Kishore Nallan
0c459609c0 Parameterize filter short circuit + enable only for search. 2021-10-17 11:34:09 +05:30
Kishore Nallan
b82b4a5bce Make remove_matched_tokens: true by default. 2021-10-13 12:42:43 +05:30
Kishore Nallan
1fa2193c50 Configurable 1-typo and 2-typo min token lengths. 2021-10-04 18:18:05 +05:30
Kishore Nallan
74f7d829bb Improve override tests + coverage. 2021-10-02 12:16:32 +05:30
Kishore Nallan
4c1dc3a495 Tweak locking for filtering. 2021-10-02 09:15:57 +05:30
Kishore Nallan
947a5019d9 Handle override with synonyms. 2021-10-02 08:07:08 +05:30
Kishore Nallan
36a26f3f40 Rename mutate_query_string -> remove_matched_tokens 2021-09-29 18:25:19 +05:30
Kishore Nallan
6210e82390 Option to curtail expensive queries. 2021-09-29 11:48:26 +05:30
Kishore Nallan
c0602da4fc Parallelize parts of indexing. 2021-09-27 17:09:35 +05:30
Kishore Nallan
043535ee6b Support array of geo points. 2021-09-24 12:43:26 +05:30
Kishore Nallan
70059dec15 Dynamic filtering using field interpolation. 2021-09-22 11:46:20 +05:30
Kishore Nallan
fc4da1983d Prevent duplicate searches. 2021-09-18 18:10:44 +05:30
Kishore Nallan
c6fe1369b9 Enable filtering via overrides. 2021-09-08 18:43:45 +05:30
Kishore Nallan
2317c6adb0 Tweak locking for filtering. 2021-09-04 20:01:38 +05:30
Kishore Nallan
266a08c568 Separate candidate limit and combination limit. 2021-09-02 20:49:00 +05:30
Kishore Nallan
19badcd0cb Move to precise token candidate selection.
No longer approximate.
2021-09-02 17:44:32 +05:30
Kishore Nallan
055f2c7695 Fix perf in scoring results. 2021-09-01 15:27:05 +05:30
Kishore Nallan
7b4450bbf9 Bake concurrency into a single index. 2021-08-31 13:11:50 +05:30
Kishore Nallan
9659d60047 Exhaustive search should ignore typo and drop token thresholds. 2021-08-28 19:33:40 +05:30
Kishore Nallan
07d838e385 Make symbols for indexing and segmentation configurable. 2021-08-26 10:27:18 +05:30
Kishore Nallan
d4bd6e67e5 Further tweak exact match logic. 2021-08-22 15:47:21 +05:30
Kishore Nallan
2df55e7991 Fix exact value matching. 2021-08-22 13:45:26 +05:30
Kishore Nallan
76915eed20 Speed up exact matching on single tokens. 2021-08-19 16:43:53 +05:30
Kishore Nallan
0e2adb4242 Copy-free intersect + score. 2021-08-17 18:37:42 +05:30
Kishore Nallan
22670b1342 Skip offset compute for non-exact single token. 2021-08-13 16:22:57 +05:30
Kishore Nallan
26351a6984 Change default value of typo/drop tokens threshold to 1. 2021-08-11 14:20:28 +05:30
Kishore Nallan
7703939430 Parameterize combination limit for more exhaustive searching. 2021-08-07 18:13:29 +05:30
Kishore Nallan
41c16fb7a7 Merge branch '0.22.0-rc' into postings-refactor-integration
# Conflicts:
#	include/index.h
#	include/posting.h
#	include/posting_list.h
#	src/art.cpp
#	src/collection.cpp
#	src/index.cpp
#	src/posting.cpp
#	src/posting_list.cpp
#	test/art_test.cpp
#	test/collection_specific_test.cpp
#	test/collection_test.cpp
#	test/posting_list_test.cpp
2021-07-24 17:10:54 +05:30
Kishore Nallan
89a509513a Ensure that weights can fully control cross-field matching. 2021-07-24 15:08:08 +05:30
Kishore Nallan
56247ce6ac Prefix match must be differentiated from single typo. 2021-07-14 11:44:01 +05:30
Kishore Nallan
994f5021e6 Ensure that geopoint is validated before indexing. 2021-07-14 11:44:01 +05:30
Kishore Nallan
56bbf8df26 Separate geo index for every field + proper deletion. 2021-07-14 11:44:01 +05:30
Kishore Nallan
e4936a9f1a Simplify wildcard query result generation. 2021-07-14 11:44:01 +05:30
Kishore Nallan
5cbf810fe5 Fix upsert behavior: should accept only whole documents. 2021-07-14 11:44:01 +05:30
Kishore Nallan
2391dad879 Field level prefix configuration. 2021-07-14 11:44:01 +05:30
Kishore Nallan
48c423b85a Basics of a block based posting list container. 2021-07-14 11:44:01 +05:30
Kishore Nallan
78ea80153f Allow num_typos to be configured at a per-field level. 2021-07-14 11:44:01 +05:30
Kishore Nallan
529bb55c5c Make exact match behavior configurable. 2021-07-14 11:44:00 +05:30
Kishore Nallan
e0dc73af3e Swap out underlying geo library. 2021-07-14 11:44:00 +05:30
Kishore Nallan
f9a037a4d5 Reduce no-op operations during updates to fix perf. 2021-07-14 11:44:00 +05:30
Kishore Nallan
25f6fe0614 Prioritize records whose fields match exactly with query. 2021-07-14 11:44:00 +05:30
Kishore Nallan
88f0d8b6a4 Merge branch 'postings-refactor' into postings-refactor-integration
# Conflicts:
#	src/index.cpp
2021-07-11 11:43:17 +05:30
Kishore Nallan
d1256a7c27 Prefix match must be differentiated from single typo. 2021-07-08 15:37:52 +05:30
Kishore Nallan
dfb164234f Ensure that geopoint is validated before indexing. 2021-07-01 21:21:07 +05:30
Kishore Nallan
2652751823 Merge branch 'postings-refactor' into postings-refactor-integration
# Conflicts:
#	include/index.h
#	src/posting_list.cpp
#	test/posting_list_test.cpp
2021-06-28 21:05:58 +05:30
Kishore Nallan
d8391ceef9 Separate geo index for every field + proper deletion. 2021-06-26 17:44:14 +05:30
Kishore Nallan
b066909205 Simplify wildcard query result generation. 2021-06-26 15:16:36 +05:30
Kishore Nallan
a5efcd6897 Fix upsert behavior: should accept only whole documents. 2021-06-25 21:02:33 +05:30
Kishore Nallan
929b6796cc Posting list integration. 2021-06-18 12:52:06 +05:30
Kishore Nallan
b141e01b1b Field level prefix configuration. 2021-06-11 19:00:11 +05:30