286 Commits

Author SHA1 Message Date
Kishore Nallan
bef79000e1 Skeleton for nested field indexing and highlighting. 2022-07-25 17:25:49 +05:30
Kishore Nallan
aa3eddf9f4 Add total matching fields to match score. 2022-07-21 16:02:22 +05:30
RedSnail
ccb05375b7 Option to return document ID and body in import response. (#615)
* have a strange error, I'll try to compile without changes

* added return_id keyword which returns only id

* added test entry and also added test option to docker-build.sh

* sorry, I can't run tests myself, so I ran them with your CI

* It didn't work 'cause I've been writing in Python for too long :)
2022-06-13 10:21:44 +05:30
Kishore Nallan
43c26b1c7e Retain verbatim match on array. 2022-05-30 07:13:22 +05:30
Kishore Nallan
a409df8dad Fix verbatim match on array. 2022-05-29 19:24:55 +05:30
Kishore Nallan
d935cb4041 Fix proper prefix expansion. 2022-05-19 16:12:33 +05:30
Kishore Nallan
bba8c5feb2 Fix highlighting around punctuation. 2022-04-14 15:21:05 +05:30
Kishore Nallan
4889fba179 OR based token intersection. 2022-04-05 17:50:30 +05:30
Kishore Nallan
66cb71039f Do drop tokens at a global level. 2022-03-23 18:53:48 +05:30
Kishore Nallan
51a5f268e2 Take care of token order in phrase search. 2022-03-11 13:22:14 +05:30
Kishore Nallan
12c443e222 Tweak relevancy scoring. 2022-03-06 18:06:12 +05:30
Kishore Nallan
31503f699f Ensure that emplace allows non-optional field update. 2022-02-03 16:16:56 +05:30
Kishore Nallan
302f8a34e8 Handle typo/drop token threshold better globally. 2022-02-02 20:37:25 +05:30
Kishore Nallan
4f961f4919 Highlight only the prefix. 2022-01-02 18:08:05 +05:30
Kishore Nallan
ab7a3f53bb Phrase search skeleton. 2021-12-28 08:29:31 +05:30
Kishore Nallan
3a8a2fcadb Support emplace action for ingestion. 2021-12-25 09:39:23 +05:30
Kishore Nallan
97b1981fcd Buffer IDs during exhaustive search. 2021-11-22 15:18:37 +05:30
Kishore Nallan
774f3d3ce3 Move facet tokens and groups to separate maps.
This speeds up faceting and reduces memory footprint during faceting.
2021-11-18 12:38:12 +05:30
Kishore Nallan
14f4c3e834 Field wise indexing. 2021-11-11 19:11:06 +05:30
Kishore Nallan
ebbab76a43 Gracefully quit during loading on signal. 2021-10-20 21:24:12 +05:30
Kishore Nallan
25d29919ae Multiplex frequency + score based token selection. 2021-09-19 16:31:28 +05:30
Kishore Nallan
27b392cee9 Exhaustive search should not always be enabled during token drop search. 2021-09-18 15:59:35 +05:30
Kishore Nallan
48ac6bb82a Weight all components of cross-field match score. 2021-09-03 11:38:46 +05:30
Kishore Nallan
75263d52a3 Fix test. 2021-09-02 20:38:31 +05:30
Kishore Nallan
19badcd0cb Move to precise token candidate selection.
No longer approximate.
2021-09-02 17:44:32 +05:30
Kishore Nallan
7b4450bbf9 Bake concurrency into a single index. 2021-08-31 13:11:50 +05:30
Kishore Nallan
b6f1885aec Stricter bounding of typo correction threshold. 2021-08-28 16:38:07 +05:30
Kishore Nallan
ce7b6e12e9 Prioritize record with a field containing all tokens in the query. 2021-08-27 20:52:51 +05:30
Kishore Nallan
26351a6984 Change default value of typo/drop tokens threshold to 1. 2021-08-11 14:20:28 +05:30
Kishore Nallan
b5e3a28ace More fixes for highlighting. 2021-08-05 21:31:04 +05:30
Kishore Nallan
261536d0f4 Merge branch '0.22.0-rc' into postings-refactor-integration
# Conflicts:
#	src/collection.cpp
#	src/index.cpp
#	test/collection_specific_test.cpp
2021-07-31 21:35:30 +05:30
Kishore Nallan
b2c12a9b2c Fix more edge cases in highlighting. 2021-07-31 08:59:49 +05:30
Kishore Nallan
13cb7b9364 Revert "Highlight field value that is a prefix of the query."
This reverts commit 545027a59bc55b24c2fece112b4fa6a655a1f79e.

# Conflicts:
#	test/collection_specific_test.cpp
2021-07-27 17:57:49 +05:30
Kishore Nallan
38d44a7c8a Highlight field value that is a prefix of the query. 2021-07-26 15:33:03 +05:30
Kishore Nallan
41c16fb7a7 Merge branch '0.22.0-rc' into postings-refactor-integration
# Conflicts:
#	include/index.h
#	include/posting.h
#	include/posting_list.h
#	src/art.cpp
#	src/collection.cpp
#	src/index.cpp
#	src/posting.cpp
#	src/posting_list.cpp
#	test/art_test.cpp
#	test/collection_specific_test.cpp
#	test/collection_test.cpp
#	test/posting_list_test.cpp
2021-07-24 17:10:54 +05:30
Kishore Nallan
89a509513a Ensure that weights can fully control cross-field matching. 2021-07-24 15:08:08 +05:30
Kishore Nallan
672c895805 Typo and drop tokens thresholds must be applied independently. 2021-07-16 13:39:52 +05:30
Kishore Nallan
56247ce6ac Prefix match must be differentiated from single typo. 2021-07-14 11:44:01 +05:30
Kishore Nallan
e4936a9f1a Simplify wildcard query result generation. 2021-07-14 11:44:01 +05:30
Kishore Nallan
5cbf810fe5 Fix upsert behavior: should accept only whole documents. 2021-07-14 11:44:01 +05:30
Kishore Nallan
6b41c3bf29 Support not equals on boolean fields. 2021-07-14 11:44:01 +05:30
Kishore Nallan
2391dad879 Field level prefix configuration. 2021-07-14 11:44:01 +05:30
Kishore Nallan
1189b7d124 Test for verifying that unindexed field is upserted.
♥ SSK.
2021-07-14 11:44:01 +05:30
Kishore Nallan
16d91aec6f Fix highlight edge case: random first token highlight. 2021-07-14 11:44:01 +05:30
Kishore Nallan
78ea80153f Allow num_typos to be configured at a per-field level. 2021-07-14 11:44:01 +05:30
Kishore Nallan
8aa50af121 Skip empty strings during indexing. 2021-07-14 11:44:00 +05:30
Kishore Nallan
88ec671d65 Optional fields can have a null value. 2021-07-14 11:44:00 +05:30
Kishore Nallan
529bb55c5c Make exact match behavior configurable. 2021-07-14 11:44:00 +05:30
Kishore Nallan
09e2e62312 Fixed an edge case of exact match across multiple fields. 2021-07-14 11:44:00 +05:30
Kishore Nallan
25f6fe0614 Prioritize records whose fields match exactly with query. 2021-07-14 11:44:00 +05:30