113 Commits

Author SHA1 Message Date
Kishore Nallan
41c16fb7a7 Merge branch '0.22.0-rc' into postings-refactor-integration
# Conflicts:
#	include/index.h
#	include/posting.h
#	include/posting_list.h
#	src/art.cpp
#	src/collection.cpp
#	src/index.cpp
#	src/posting.cpp
#	src/posting_list.cpp
#	test/art_test.cpp
#	test/collection_specific_test.cpp
#	test/collection_test.cpp
#	test/posting_list_test.cpp
2021-07-24 17:10:54 +05:30
Kishore Nallan
89a509513a Ensure that weights can fully control cross-field matching. 2021-07-24 15:08:08 +05:30
Kishore Nallan
56247ce6ac Prefix match must be differentiated from single typo. 2021-07-14 11:44:01 +05:30
Kishore Nallan
994f5021e6 Ensure that geopoint is validated before indexing. 2021-07-14 11:44:01 +05:30
Kishore Nallan
56bbf8df26 Separate geo index for every field + proper deletion. 2021-07-14 11:44:01 +05:30
Kishore Nallan
e4936a9f1a Simplify wildcard query result generation. 2021-07-14 11:44:01 +05:30
Kishore Nallan
5cbf810fe5 Fix upsert behavior: should accept only whole documents. 2021-07-14 11:44:01 +05:30
Kishore Nallan
2391dad879 Field level prefix configuration. 2021-07-14 11:44:01 +05:30
Kishore Nallan
48c423b85a Basics of a block based posting list container. 2021-07-14 11:44:01 +05:30
Kishore Nallan
78ea80153f Allow num_typos to be configured at a per-field level. 2021-07-14 11:44:01 +05:30
Kishore Nallan
529bb55c5c Make exact match behavior configurable. 2021-07-14 11:44:00 +05:30
Kishore Nallan
e0dc73af3e Swap out underlying geo library. 2021-07-14 11:44:00 +05:30
Kishore Nallan
f9a037a4d5 Reduce no-op operations during updates to fix perf. 2021-07-14 11:44:00 +05:30
Kishore Nallan
25f6fe0614 Prioritize records whose fields match exactly with query. 2021-07-14 11:44:00 +05:30
Kishore Nallan
88f0d8b6a4 Merge branch 'postings-refactor' into postings-refactor-integration
# Conflicts:
#	src/index.cpp
2021-07-11 11:43:17 +05:30
Kishore Nallan
d1256a7c27 Prefix match must be differentiated from single typo. 2021-07-08 15:37:52 +05:30
Kishore Nallan
dfb164234f Ensure that geopoint is validated before indexing. 2021-07-01 21:21:07 +05:30
Kishore Nallan
2652751823 Merge branch 'postings-refactor' into postings-refactor-integration
# Conflicts:
#	include/index.h
#	src/posting_list.cpp
#	test/posting_list_test.cpp
2021-06-28 21:05:58 +05:30
Kishore Nallan
d8391ceef9 Separate geo index for every field + proper deletion. 2021-06-26 17:44:14 +05:30
Kishore Nallan
b066909205 Simplify wildcard query result generation. 2021-06-26 15:16:36 +05:30
Kishore Nallan
a5efcd6897 Fix upsert behavior: should accept only whole documents. 2021-06-25 21:02:33 +05:30
Kishore Nallan
929b6796cc Posting list integration. 2021-06-18 12:52:06 +05:30
Kishore Nallan
b141e01b1b Field level prefix configuration. 2021-06-11 19:00:11 +05:30
Kishore Nallan
8b24fe82b8 Basics of a block based posting list container. 2021-05-22 15:05:46 +05:30
Kishore Nallan
32eda90e16 Allow num_typos to be configured at a per-field level. 2021-05-20 07:22:01 +05:30
Kishore Nallan
987535c1be Make exact match behavior configurable. 2021-05-13 15:13:29 +05:30
Kishore Nallan
d827ddbc7a Swap out underlying geo library. 2021-05-11 12:33:38 +05:30
Kishore Nallan
40b5eca7d0 Reduce no-op operations during updates to fix perf. 2021-05-08 17:16:43 +05:30
Kishore Nallan
348d672d0c Prioritize records whose fields match exactly with query. 2021-05-05 16:24:55 +05:30
Kishore Nallan
1d1712f391 Refactor tokenizer to use index, skip and separate logic. 2021-04-16 17:55:52 +05:30
kishorenc
dd72e2a78c Introduce field level locale. 2021-04-02 21:28:49 +05:30
kishorenc
c2eec85277 Fix highlighting of strings with special characters. 2021-03-20 12:58:30 +05:30
kishorenc
42732c454d Handle bad bool field coercion. 2021-03-08 14:41:07 +05:30
kishorenc
46856701d6 Use sparse map for facet values. 2021-03-06 11:48:22 +05:30
kishorenc
33f705cace Geo polygon filtering. 2021-03-03 07:23:25 +05:30
kishorenc
4e3307a891 Use string* to specify string/string array coercion. 2021-02-25 15:07:37 +05:30
kishorenc
3a4d21992c Fix edge cases in schema detection. 2021-02-24 21:38:55 +05:30
kishorenc
0a9cf4aee0 Add more tests for testing schema detection. 2021-02-23 20:04:37 +05:30
kishorenc
f1b70384cc Allow fields to be stringified automatically. 2021-02-23 12:58:14 +05:30
kishorenc
c24fc02d4d Persist per-doc coerce setting + allow dropping of bad values. 2021-02-23 09:35:36 +05:30
kishorenc
d2a825799b Make default sorting field optional. 2021-02-21 19:55:31 +05:30
kishorenc
11c41804e5 Handle bad data gracefully. 2021-02-20 12:49:41 +05:30
kishorenc
e9df6e58e2 Allow indexing of fields without pre-defined schema.
# Conflicts:
#	include/collection.h
#	include/index.h
#	src/collection.cpp
#	src/collection_manager.cpp
2021-02-18 19:08:42 +05:30
kishorenc
17fbbd0838 Refactor concurrency model. 2021-02-06 20:17:18 +05:30
kishorenc
b2fba69a73 Address some warnings related to update doc scrubbing. 2020-12-28 19:20:00 +05:30
kishorenc
302cdf137b Fix field-wise num results used for threshold matching. 2020-12-28 19:20:00 +05:30
kishorenc
bc1d88f1eb Consider tokens matching across fields during ranking. 2020-12-28 19:20:00 +05:30
kishorenc
66a44a5afc Expose field weights used for scoring. 2020-12-28 19:20:00 +05:30
kishorenc
435476df5d Rank prefix match below exact match. 2020-12-28 19:19:59 +05:30
kishorenc
8f818f7fcb More exhaustive multi-field ranking. 2020-12-28 19:19:59 +05:30