273 Commits

Author SHA1 Message Date
Kishore Nallan
4f961f4919 Highlight only the prefix. 2022-01-02 18:08:05 +05:30
Kishore Nallan
ab7a3f53bb Phrase search skeleton. 2021-12-28 08:29:31 +05:30
Kishore Nallan
3a8a2fcadb Support emplace action for ingestion. 2021-12-25 09:39:23 +05:30
Kishore Nallan
97b1981fcd Buffer IDs during exhaustive search. 2021-11-22 15:18:37 +05:30
Kishore Nallan
774f3d3ce3 Move facet tokens and groups to separate maps.
This speeds up faceting and reduces memory footprint during faceting.
2021-11-18 12:38:12 +05:30
Kishore Nallan
14f4c3e834 Field wise indexing. 2021-11-11 19:11:06 +05:30
Kishore Nallan
ebbab76a43 Gracefully quit during loading on signal. 2021-10-20 21:24:12 +05:30
Kishore Nallan
25d29919ae Multiplex frequency + score based token selection. 2021-09-19 16:31:28 +05:30
Kishore Nallan
27b392cee9 Exhaustive search should not always be enabled during token drop search. 2021-09-18 15:59:35 +05:30
Kishore Nallan
48ac6bb82a Weight all components of cross-field match score. 2021-09-03 11:38:46 +05:30
Kishore Nallan
75263d52a3 Fix test. 2021-09-02 20:38:31 +05:30
Kishore Nallan
19badcd0cb Move to precise token candidate selection.
No longer approximate.
2021-09-02 17:44:32 +05:30
Kishore Nallan
7b4450bbf9 Bake concurrency into a single index. 2021-08-31 13:11:50 +05:30
Kishore Nallan
b6f1885aec Stricter bounding of typo correction threshold. 2021-08-28 16:38:07 +05:30
Kishore Nallan
ce7b6e12e9 Prioritize record with a field containing all tokens in the query. 2021-08-27 20:52:51 +05:30
Kishore Nallan
26351a6984 Change default value of typo/drop tokens threshold to 1. 2021-08-11 14:20:28 +05:30
Kishore Nallan
b5e3a28ace More fixes for highlighting. 2021-08-05 21:31:04 +05:30
Kishore Nallan
261536d0f4 Merge branch '0.22.0-rc' into postings-refactor-integration
# Conflicts:
#	src/collection.cpp
#	src/index.cpp
#	test/collection_specific_test.cpp
2021-07-31 21:35:30 +05:30
Kishore Nallan
b2c12a9b2c Fix more edge cases in highlighting. 2021-07-31 08:59:49 +05:30
Kishore Nallan
13cb7b9364 Revert "Highlight field value that is a prefix of the query."
This reverts commit 545027a59bc55b24c2fece112b4fa6a655a1f79e.

# Conflicts:
#	test/collection_specific_test.cpp
2021-07-27 17:57:49 +05:30
Kishore Nallan
38d44a7c8a Highlight field value that is a prefix of the query. 2021-07-26 15:33:03 +05:30
Kishore Nallan
41c16fb7a7 Merge branch '0.22.0-rc' into postings-refactor-integration
# Conflicts:
#	include/index.h
#	include/posting.h
#	include/posting_list.h
#	src/art.cpp
#	src/collection.cpp
#	src/index.cpp
#	src/posting.cpp
#	src/posting_list.cpp
#	test/art_test.cpp
#	test/collection_specific_test.cpp
#	test/collection_test.cpp
#	test/posting_list_test.cpp
2021-07-24 17:10:54 +05:30
Kishore Nallan
89a509513a Ensure that weights can fully control cross-field matching. 2021-07-24 15:08:08 +05:30
Kishore Nallan
672c895805 Typo and drop tokens thresholds must be applied independently. 2021-07-16 13:39:52 +05:30
Kishore Nallan
56247ce6ac Prefix match must be differentiated from single typo. 2021-07-14 11:44:01 +05:30
Kishore Nallan
e4936a9f1a Simplify wildcard query result generation. 2021-07-14 11:44:01 +05:30
Kishore Nallan
5cbf810fe5 Fix upsert behavior: should accept only whole documents. 2021-07-14 11:44:01 +05:30
Kishore Nallan
6b41c3bf29 Support not equals on boolean filds. 2021-07-14 11:44:01 +05:30
Kishore Nallan
2391dad879 Field level prefix configuration. 2021-07-14 11:44:01 +05:30
Kishore Nallan
1189b7d124 Test for verifying that unindexed field is upserted.
♥ SSK.
2021-07-14 11:44:01 +05:30
Kishore Nallan
16d91aec6f Fix highlight edge case: random first token highlight. 2021-07-14 11:44:01 +05:30
Kishore Nallan
78ea80153f Allow num_typos to be configured at a per-field level. 2021-07-14 11:44:01 +05:30
Kishore Nallan
8aa50af121 Skip empty strings during indexing. 2021-07-14 11:44:00 +05:30
Kishore Nallan
88ec671d65 Option fields can have a null value. 2021-07-14 11:44:00 +05:30
Kishore Nallan
529bb55c5c Make exact match behavior configurable. 2021-07-14 11:44:00 +05:30
Kishore Nallan
09e2e62312 Fixed an edge case of exact match across multiple fields. 2021-07-14 11:44:00 +05:30
Kishore Nallan
25f6fe0614 Prioritize records whose fields match exactly with query. 2021-07-14 11:44:00 +05:30
Kishore Nallan
74bb5f1383 Refactor fuzzy search to address some obscure bugs. 2021-07-14 11:44:00 +05:30
Kishore Nallan
88f0d8b6a4 Merge branch 'postings-refactor' into postings-refactor-integration
# Conflicts:
#	src/index.cpp
2021-07-11 11:43:17 +05:30
Kishore Nallan
d1256a7c27 Prefix match must be differentiated from single typo. 2021-07-08 15:37:52 +05:30
Kishore Nallan
2652751823 Merge branch 'postings-refactor' into postings-refactor-integration
# Conflicts:
#	include/index.h
#	src/posting_list.cpp
#	test/posting_list_test.cpp
2021-06-28 21:05:58 +05:30
Kishore Nallan
b066909205 Simplify wildcard query result generation. 2021-06-26 15:16:36 +05:30
Kishore Nallan
a5efcd6897 Fix upsert behavior: should accept only whole documents. 2021-06-25 21:02:33 +05:30
Kishore Nallan
e42acc4307 Merge branch 'postings-refactor' into postings-refactor-integration
# Conflicts:
#	src/index.cpp
2021-06-18 20:34:56 +05:30
Kishore Nallan
876f0f64dc Support not equals on boolean filds. 2021-06-18 20:14:36 +05:30
Kishore Nallan
929b6796cc Posting list integration. 2021-06-18 12:52:06 +05:30
Kishore Nallan
b141e01b1b Field level prefix configuration. 2021-06-11 19:00:11 +05:30
Kishore Nallan
d5655809ac Test for verifying that unindexed field is upserted.
♥ SSK.
2021-06-06 15:42:01 +05:30
Kishore Nallan
e3a5c3d9aa Fix highlight edge case: random first token highlight. 2021-05-20 11:12:27 +05:30
Kishore Nallan
32eda90e16 Allow num_typos to be configured at a per-field level. 2021-05-20 07:22:01 +05:30