792 Commits

Author SHA1 Message Date
Kishore Nallan
1ad6619267 Polygon points used for filtering can be either CW or CCW.
Both clock-wise and anti-clock-wise now supported.
2021-10-04 19:38:43 +05:30
Kishore Nallan
1fa2193c50 Configurable 1-typo and 2-typo min token lengths. 2021-10-04 18:18:05 +05:30
Kishore Nallan
dbb688254e Fix edge case with ID-based filtering.
When no IDs match, no results should be found.
2021-10-04 13:54:11 +05:30
Kishore Nallan
6da39ff51c Expose write stats. 2021-10-02 18:54:30 +05:30
Kishore Nallan
74f7d829bb Improve override tests + coverage. 2021-10-02 12:16:32 +05:30
Kishore Nallan
947a5019d9 Handle override with synonyms. 2021-10-02 08:07:08 +05:30
Kishore Nallan
36a26f3f40 Rename mutate_query_string -> remove_matched_tokens 2021-09-29 18:25:19 +05:30
Kishore Nallan
6210e82390 Option to curtail expensive queries. 2021-09-29 11:48:26 +05:30
Kishore Nallan
c0602da4fc Parallelize parts of indexing. 2021-09-27 17:09:35 +05:30
Kishore Nallan
043535ee6b Support array of geo points. 2021-09-24 12:43:26 +05:30
Kishore Nallan
70059dec15 Dynamic filtering using field interpolation. 2021-09-22 11:46:20 +05:30
Kishore Nallan
71b1a6e7a3 Add tests for collection extraction for auth. 2021-09-21 13:44:15 +05:30
Kishore Nallan
071750c663 Fix collection creation action auth. 2021-09-21 13:12:32 +05:30
Kishore Nallan
39910c872a Capture search related metrics separately. 2021-09-21 11:40:41 +05:30
Kishore Nallan
d75e834ac3 Address warnings. 2021-09-19 21:39:15 +05:30
Kishore Nallan
5b0690fcd8 Allow filtering and deleting using doc IDs. 2021-09-19 18:38:43 +05:30
Kishore Nallan
25d29919ae Multiplex frequency + score based token selection. 2021-09-19 16:31:28 +05:30
Kishore Nallan
27b392cee9 Exhaustive search should not always be enabled during token drop search. 2021-09-18 15:59:35 +05:30
Kishore Nallan
703110264a Dropped tokens should not be prioritized as exact matches. 2021-09-13 16:23:56 +05:30
Kishore Nallan
902704887c Return total_values as part of facet stats, even for strings. 2021-09-10 21:18:21 +05:30
Kishore Nallan
1afa193161 Fix faceting count edge case. 2021-09-10 16:32:22 +05:30
Kishore Nallan
c6fe1369b9 Enable filtering via overrides. 2021-09-08 18:43:45 +05:30
Kishore Nallan
c0fce41c3b Ensure that an import batch cannot contain duplicate doc IDs. 2021-09-07 17:02:58 +05:30
Kishore Nallan
ba67efb7da Support zero weighting for multi-field match scoring. 2021-09-05 14:54:21 +05:30
Kishore Nallan
910256d82c Fix valgrind warnings. 2021-09-05 08:07:00 +05:30
Kishore Nallan
6fc18a0971 Fix test consistency. 2021-09-04 08:53:10 +05:30
Kishore Nallan
48ac6bb82a Weight all components of cross-field match score. 2021-09-03 11:38:46 +05:30
Kishore Nallan
d6e8156973 Fix test again. 2021-09-02 21:11:30 +05:30
Kishore Nallan
75263d52a3 Fix test. 2021-09-02 20:38:31 +05:30
Kishore Nallan
19badcd0cb Move to precise token candidate selection.
No longer approximate.
2021-09-02 17:44:32 +05:30
Kishore Nallan
055f2c7695 Fix perf in scoring results. 2021-09-01 15:27:05 +05:30
Kishore Nallan
7b4450bbf9 Bake concurrency into a single index. 2021-08-31 13:11:50 +05:30
Kishore Nallan
adc816e662 Use token separators whule parsing search query as well. 2021-08-28 20:59:05 +05:30
Kishore Nallan
9659d60047 Exhaustive search should ignore typo and drop token thresholds. 2021-08-28 19:33:40 +05:30
Kishore Nallan
b6f1885aec Stricter bounding of typo correction threshold. 2021-08-28 16:38:07 +05:30
Kishore Nallan
ce7b6e12e9 Prioritize record with a field containing all tokens in the query. 2021-08-27 20:52:51 +05:30
Kishore Nallan
07d838e385 Make symbols for indexing and segmentation configurable. 2021-08-26 10:27:18 +05:30
Kishore Nallan
a931bb4b2a Handle highlighting on a field with empty array value. 2021-08-25 17:05:06 +05:30
Kishore Nallan
d4bd6e67e5 Further tweak exact match logic. 2021-08-22 15:47:21 +05:30
Kishore Nallan
2df55e7991 Fix exact value matching. 2021-08-22 13:45:26 +05:30
Kishore Nallan
793e21a1c2 Ensure that search does not fetch existing tokens. 2021-08-18 18:51:39 +05:30
Kishore Nallan
0e2adb4242 Copy-free intersect + score. 2021-08-17 18:37:42 +05:30
Kishore Nallan
26351a6984 Change default value of typo/drop tokens threshold to 1. 2021-08-11 14:20:28 +05:30
Kishore Nallan
55535198a4 Prefix search to be used only for last token. 2021-08-07 13:12:06 +05:30
Kishore Nallan
b5e3a28ace More fixes for highlighting. 2021-08-05 21:31:04 +05:30
Kishore Nallan
261536d0f4 Merge branch '0.22.0-rc' into postings-refactor-integration
# Conflicts:
#	src/collection.cpp
#	src/index.cpp
#	test/collection_specific_test.cpp
2021-07-31 21:35:30 +05:30
Kishore Nallan
b2c12a9b2c Fix more edge cases in highlighting. 2021-07-31 08:59:49 +05:30
Kishore Nallan
331db4f27e Add precision option to geo field sorting. 2021-07-27 19:57:56 +05:30
Kishore Nallan
13cb7b9364 Revert "Highlight field value that is a prefix of the query."
This reverts commit 545027a59bc55b24c2fece112b4fa6a655a1f79e.

# Conflicts:
#	test/collection_specific_test.cpp
2021-07-27 17:57:49 +05:30
Kishore Nallan
b4c222064c Handle bad data in ingestion text gracefully. 2021-07-26 19:44:38 +05:30