297 Commits

Author SHA1 Message Date
ozanarmagan
e1d452b713 Review changes II 2023-02-27 13:24:42 +03:00
Ozan Armağan
807a95b383
Merge branch 'v0.25-join' into v0.25 2023-02-16 16:15:20 +03:00
ozanarmagan
4f921a3040 Auto vector generation & Hybrid Search 2023-02-16 14:47:17 +03:00
Kishore Nallan
5eda7668b9 Refactor fuzzy search restrictions. 2023-02-14 14:28:39 +05:30
Kishore Nallan
e8e1772858 Refactor fuzzy search state transition.
Handle extra chars in the middle of a query.
2023-01-26 18:24:43 +05:30
Harpreet Sangar
d7ff4bdc0e
Support wildcard. (#852)
* Support wildcard.

* Add test case for `query_by=user.*`

* Remove duplicate.

* Add WildcardHighlightFields test.

* Add WildcardHighlightFullFields test.

* Review changes.
2023-01-11 20:30:07 +05:30
Harpreet Sangar
5f034f8b3b
Update documents using filter_by. (#845)
* Update documents.

* Return number of documents updated in the response.

* Update test.

* Use `add_many` to update documents.

* Test batching of updation.

* Support updation of all documents by `filter_by=*`.

* Update test to work on schema with nesting enabled.

* Fix memory leak.

* Add nested field updation test.

* Remove iterator key comparison.
2023-01-06 12:36:56 +05:30
Kishore Nallan
652472f32a Cmake compatible bazel build. 2022-12-18 20:49:51 +05:30
Kishore Nallan
c6ea968f01 Merge branch 'v0.25' into bazel-build
# Conflicts:
#	.gitignore
2022-12-15 21:19:31 +05:30
0x2Adr1
bbebb1a567
Bazel (#736) 2022-12-15 21:09:06 +05:30
Kishore Nallan
85d5ce06b2 Refactor prefix search expansion. 2022-10-05 16:14:32 +05:30
Kishore Nallan
bef79000e1 Skeleton for nested field indexing and highlighting. 2022-07-25 17:25:49 +05:30
Kishore Nallan
aa3eddf9f4 Add total matching fields to match score. 2022-07-21 16:02:22 +05:30
RedSnail
ccb05375b7
Option to return document ID and body in import response. (#615)
* have a strange error, I'll try to compile without changes

* added return_id keyword which returns only id

* added test entry and also added test option to docker-build.sh

* sorry, I can't run tests my self, so i run it with your CI

* It didn't work 'cause i've been writing in python for too long :)
2022-06-13 10:21:44 +05:30
Kishore Nallan
43c26b1c7e Retain verbatim match on array. 2022-05-30 07:13:22 +05:30
Kishore Nallan
a409df8dad Fix verbatim match on array. 2022-05-29 19:24:55 +05:30
Kishore Nallan
d935cb4041 Fix proper prefix expansion. 2022-05-19 16:12:33 +05:30
Kishore Nallan
bba8c5feb2 Fix highlighting around puncutation. 2022-04-14 15:21:05 +05:30
Kishore Nallan
4889fba179 OR based token intersection. 2022-04-05 17:50:30 +05:30
Kishore Nallan
66cb71039f Do drop tokens at a global level. 2022-03-23 18:53:48 +05:30
Kishore Nallan
51a5f268e2 Take care of token order in phrase search. 2022-03-11 13:22:14 +05:30
Kishore Nallan
12c443e222 Tweak relevancy scoring. 2022-03-06 18:06:12 +05:30
Kishore Nallan
31503f699f Ensure that emplace allows non-optional field update. 2022-02-03 16:16:56 +05:30
Kishore Nallan
302f8a34e8 Handle typo/drop token threshold better globally. 2022-02-02 20:37:25 +05:30
Kishore Nallan
4f961f4919 Highlight only the prefix. 2022-01-02 18:08:05 +05:30
Kishore Nallan
ab7a3f53bb Phrase search skeleton. 2021-12-28 08:29:31 +05:30
Kishore Nallan
3a8a2fcadb Support emplace action for ingestion. 2021-12-25 09:39:23 +05:30
Kishore Nallan
97b1981fcd Buffer IDs during exhaustive search. 2021-11-22 15:18:37 +05:30
Kishore Nallan
774f3d3ce3 Move facet tokens and groups to separate maps.
This speeds up faceting and reduces memory footprint during faceting.
2021-11-18 12:38:12 +05:30
Kishore Nallan
14f4c3e834 Field wise indexing. 2021-11-11 19:11:06 +05:30
Kishore Nallan
ebbab76a43 Gracefully quit during loading on signal. 2021-10-20 21:24:12 +05:30
Kishore Nallan
25d29919ae Multiplex frequency + score based token selection. 2021-09-19 16:31:28 +05:30
Kishore Nallan
27b392cee9 Exhaustive search should not always be enabled during token drop search. 2021-09-18 15:59:35 +05:30
Kishore Nallan
48ac6bb82a Weight all components of cross-field match score. 2021-09-03 11:38:46 +05:30
Kishore Nallan
75263d52a3 Fix test. 2021-09-02 20:38:31 +05:30
Kishore Nallan
19badcd0cb Move to precise token candidate selection.
No longer approximate.
2021-09-02 17:44:32 +05:30
Kishore Nallan
7b4450bbf9 Bake concurrency into a single index. 2021-08-31 13:11:50 +05:30
Kishore Nallan
b6f1885aec Stricter bounding of typo correction threshold. 2021-08-28 16:38:07 +05:30
Kishore Nallan
ce7b6e12e9 Prioritize record with a field containing all tokens in the query. 2021-08-27 20:52:51 +05:30
Kishore Nallan
26351a6984 Change default value of typo/drop tokens threshold to 1. 2021-08-11 14:20:28 +05:30
Kishore Nallan
b5e3a28ace More fixes for highlighting. 2021-08-05 21:31:04 +05:30
Kishore Nallan
261536d0f4 Merge branch '0.22.0-rc' into postings-refactor-integration
# Conflicts:
#	src/collection.cpp
#	src/index.cpp
#	test/collection_specific_test.cpp
2021-07-31 21:35:30 +05:30
Kishore Nallan
b2c12a9b2c Fix more edge cases in highlighting. 2021-07-31 08:59:49 +05:30
Kishore Nallan
13cb7b9364 Revert "Highlight field value that is a prefix of the query."
This reverts commit 545027a59bc55b24c2fece112b4fa6a655a1f79e.

# Conflicts:
#	test/collection_specific_test.cpp
2021-07-27 17:57:49 +05:30
Kishore Nallan
38d44a7c8a Highlight field value that is a prefix of the query. 2021-07-26 15:33:03 +05:30
Kishore Nallan
41c16fb7a7 Merge branch '0.22.0-rc' into postings-refactor-integration
# Conflicts:
#	include/index.h
#	include/posting.h
#	include/posting_list.h
#	src/art.cpp
#	src/collection.cpp
#	src/index.cpp
#	src/posting.cpp
#	src/posting_list.cpp
#	test/art_test.cpp
#	test/collection_specific_test.cpp
#	test/collection_test.cpp
#	test/posting_list_test.cpp
2021-07-24 17:10:54 +05:30
Kishore Nallan
89a509513a Ensure that weights can fully control cross-field matching. 2021-07-24 15:08:08 +05:30
Kishore Nallan
672c895805 Typo and drop tokens thresholds must be applied independently. 2021-07-16 13:39:52 +05:30
Kishore Nallan
56247ce6ac Prefix match must be differentiated from single typo. 2021-07-14 11:44:01 +05:30
Kishore Nallan
e4936a9f1a Simplify wildcard query result generation. 2021-07-14 11:44:01 +05:30