227 Commits

Author SHA1 Message Date
Kishore Nallan
1a66a25e2f Enable slow searches logging by default (30s cutoff) 2023-01-02 18:53:36 +05:30
Kishore Nallan
0e1d70ebf6 Add flag to disable old highlight structure. 2022-12-25 21:04:22 +05:30
Kishore Nallan
856ad0efe5 Fix new snippetting on flat fields. 2022-11-25 18:50:05 +05:30
Kishore Nallan
cbf5985307 Fix duplication of nested fields during flattening. 2022-11-23 15:08:04 +05:30
Kishore Nallan
4e7c5560aa Handle missing optional object fields. 2022-11-19 08:49:11 +05:30
Kishore Nallan
23bd0af779 Simplify nested highlighting format. 2022-11-11 19:01:41 +05:30
Kishore Nallan
5292956718 Fix highlight meta structure. 2022-11-09 12:41:40 +05:30
Kishore Nallan
c87e0ddf9d Highlight meta should follow nested doc structure. 2022-10-31 16:19:51 +05:30
Kishore Nallan
643ba73ec7 Fix edge case with field names containing dots.
Should not be treated as nested fields.
2022-10-15 17:34:05 +05:30
Harpreet Sangar
f75cbca920
Filter query (#737)
* Implement advance filter query parser.

* Remove `exclude_filters`.

* Review changes

* Implement do_filtering2

* Use updated implementation

* Pass failing tests.

* Handle special characters in value.

* Add space around  in embedded filters

* Update tokenize

* Rename to `recursive_filtering`

* Deconstruct filter tree.

* Remove separating space requirement for operators

* Fix rebase errors.

* Undo filter override change

* Format changes

* Pass StaticFilteringMultipleRuleMatch

* Undo formatting changes

* Pass DynamicFilteringMultipleRuleMatch

* Review changes.

* Add filter_query guard.
2022-10-11 11:21:20 +05:30
Kishore Nallan
d2daad2c3d Basic alter support for nested fields.
No longer storing .flat inside the document.
2022-09-13 14:47:17 +05:30
Kishore Nallan
dff7f8b746 Don't persist flattened nested field values on disk.
Instead, we will reconstruct them when we fetch them from disk, trading cpu for storage.
2022-09-12 15:04:12 +05:30
Kishore Nallan
0e39736327 Refactor / simplify schema update logic. 2022-09-09 14:14:21 +05:30
Kishore Nallan
44edeb8a5e Buffer IDs during filtering. 2022-09-03 18:35:14 +05:30
Kishore Nallan
b20c32046b Support simple filter_by rule in overrides. 2022-09-02 13:27:17 +05:30
Kishore Nallan
70e6a89ea8 Do word level tokenization for th locale.
This ensures that we handle change in text length due to NFKC normalization.
2022-08-31 07:41:33 +05:30
Kishore Nallan
a98e5bacdd Vector search basics. 2022-08-27 13:47:09 +05:30
Kishore Nallan
b7e33584fb Ensure that only highlighted fields exist in snippet. 2022-08-25 12:47:45 +05:30
Kishore Nallan
d748c90b4b Merge branch 'v0.24-changes' into v0.24-nested
# Conflicts:
#	include/collection.h
#	src/collection.cpp
#	test/collection_specific_more_test.cpp
2022-08-13 19:03:03 +05:30
Kishore Nallan
6bb2e6e6f8 Fix ordered weighted field index properties. 2022-08-10 20:45:27 +05:30
Kishore Nallan
134af13281 By default, don't enable nested fields. 2022-08-07 17:36:28 +05:30
Kishore Nallan
869d515688 Handle optional restriction for nested fields. 2022-08-05 19:24:06 +05:30
Kishore Nallan
a90bfba930 Multiple explicit nested fields should be allowed. 2022-08-03 18:38:18 +05:30
Kishore Nallan
903c9f6330 Merge branch 'v0.24-changes' into v0.24-nested 2022-08-02 18:42:53 +05:30
Kishore Nallan
4c6eac9840 Allow cloning of collection & assets from a reference collection. 2022-08-02 18:02:23 +05:30
Kishore Nallan
f86f8b5487 Support nested fields with explicit schema. 2022-08-01 19:44:25 +05:30
Kishore Nallan
11d157463c Fix unicode character boundary highlighting. 2022-07-28 17:59:16 +05:30
Kishore Nallan
5a220e7398 Split highlight into meta/snippet/full. 2022-07-27 17:47:21 +05:30
Kishore Nallan
bef79000e1 Skeleton for nested field indexing and highlighting. 2022-07-25 17:25:49 +05:30
Kishore Nallan
308b34eb9c Provide breakdown of text match score in response. 2022-07-22 17:12:51 +05:30
Kishore Nallan
aa3eddf9f4 Add total matching fields to match score. 2022-07-21 16:02:22 +05:30
Kishore Nallan
2e3bbca1b8 Fix symbols not being supported in synonyms. 2022-07-14 16:14:35 +05:30
Kishore Nallan
5483ae4631 Support both include/exclude + filter by override. 2022-07-12 13:44:05 +05:30
Kishore Nallan
f5808532fa Allow split join tokens to be configured in always mode.
Default remains as fallback.
2022-06-26 19:55:00 +05:30
Kishore Nallan
3c5f843ce8 Merge branch 'main' into v0.24 2022-06-13 15:23:11 +05:30
RedSnail
ccb05375b7
Option to return document ID and body in import response. (#615)
* have a strange error, I'll try to compile without changes

* added return_id keyword which returns only id

* added test entry and also added test option to docker-build.sh

* sorry, I can't run tests my self, so i run it with your CI

* It didn't work 'cause i've been writing in python for too long :)
2022-06-13 10:21:44 +05:30
Kishore Nallan
223b104823 Introduce option to rank text by token position. 2022-06-07 11:00:24 +05:30
Kishore Nallan
0da51cb874 Support sort_by in overrides. 2022-06-05 18:00:41 +05:30
Kishore Nallan
284c3d0dc7 Fix alter on dynamic fields. 2022-05-07 20:52:28 +05:30
Kishore Nallan
0a308b70c0 Alter for dynamic fields. 2022-04-20 07:45:12 +05:30
Kishore Nallan
bba8c5feb2 Fix highlighting around puncutation. 2022-04-14 15:21:05 +05:30
Kishore Nallan
5b462f5233 Drop/readd within same schema change set. 2022-04-14 12:23:38 +05:30
Kishore Nallan
4889fba179 OR based token intersection. 2022-04-05 17:50:30 +05:30
Kishore Nallan
a5b06a0e0d Schema change basics. 2022-03-26 08:22:40 +05:30
Kishore Nallan
66cb71039f Do drop tokens at a global level. 2022-03-23 18:53:48 +05:30
Kishore Nallan
f4d3ecc84e Default fields to highlight must respect include_fields. 2022-03-20 09:53:36 +05:30
Kishore Nallan
3fe3a6ca7a Fix rogue group limit config regression. 2022-03-15 07:47:42 +05:30
Kishore Nallan
39f12edd3d Add override level option to filter curated hits. 2022-03-13 20:53:38 +05:30
Kishore Nallan
9197627e81 Add option to filter curated hits. 2022-03-11 20:52:22 +05:30
Kishore Nallan
d0a0597fcb Parameterize facet query num typos. 2022-03-10 19:05:30 +05:30