200 Commits

Author SHA1 Message Date
Kishore Nallan
ad42c40e6d Add health check end-point. 2018-08-15 09:50:43 -04:00
Kishore Nallan
3275ea877d Refactor - move more implementation code away from headers. 2018-08-03 10:35:07 +05:30
Kishore Nallan
13964b6a57 Allow collections to be symlinked by an alternative name. 2018-07-30 19:48:17 +05:30
Kishore Nallan
6bb139399d Fixed an edge case that causes overflow when querying large string fields. 2018-06-08 20:59:14 +05:30
Kishore Nallan
c5b7f3c7e2 Return snippets of best matched array elements instead of just the best matched element.
However, we limit the number of snippets returned to 5 for now.
2018-05-22 14:48:25 +05:30
Kishore Nallan
0014e19ea6 Highlighting should include all search fields, and not just the best matched field. 2018-05-14 14:11:14 +05:30
Kishore Nallan
2d7e75caa5 Support wildcard query: using a * would ignore the search query and consider all records for filter+sort. 2018-05-12 09:48:29 +05:30
Kishore Nallan
95112a8086 Support inclusion and exclusion of document fields that are returned in search response. 2018-05-08 07:53:13 +05:30
Kishore Nallan
46983ba6b4 Simplify the structure of the "highlight" field in response. 2018-05-05 17:48:04 +05:30
Kishore Nallan
56ed39e3ff Refactor topster to ensure that it handles insertion of duplicate keys.
Instead of ignoring a duplicate blindly, ignore when match score is less than existing key.
2018-04-30 20:28:56 +05:30
Kishore Nallan
71f1fbb4aa Refactor query index logic. 2018-04-25 19:32:46 +05:30
Kishore Nallan
1d1cd2459b When multiple fields are searched, the same document should not be returned twice. 2018-04-24 17:49:04 +05:30
Kishore Nallan
33552ca627 Change structure of hightlight in response to allow sending the matched array index. 2018-04-23 21:54:08 +05:30
Kishore Nallan
d4d1a9e3bd Fuzzy prefix searching should rely on the intermediate cost rather than the final cost.
Since we are only trying to match the prefix part, comparing the prefix with the full string makes no sense.
2018-04-21 20:14:10 +05:30
Kishore Nallan
b669a47c29 Highlight best matched string in an array.
An ARRAY_SEPARATOR delimiter is used to demarcate offsets of tokens from different indices of an array. Plain string fields are treated like a single-element array field, but without needing to use a delimiter.
2018-04-20 16:44:17 +05:30
Kishore Nallan
b186816ca5 Respect drop_tokens_threshold even when that token does not exist in the index. 2018-04-13 16:52:51 +05:30
Kishore Nallan
874b5beb89 Fix initialization of priority queue. 2018-04-12 20:56:44 +05:30
Kishore Nallan
6ad5e36179 Allow configuring a threshold for dropping tokens.
Fixes https://github.com/typesense/typesense/issues/20
2018-04-12 19:25:59 +05:30
Kishore Nallan
c0d7e57c62 Only lowercase, don't substitute latin chars to ASCII.
There was a noticeable slow-down. To be explored in future, potentially behind an optional flag.
2018-04-04 19:05:48 +05:30
Kishore Nallan
55ac4feb08 Normalize unicode characters using ICU. 2018-04-02 11:26:32 +05:30
Kishore Nallan
e43a101c50 If no sort_by fields are provided, sort descending on default_sorting_field. 2018-02-24 12:53:52 +05:30
Kishore Nallan
ca2c084d86 Rename token_ranking_field to default_sorting_field and make it mandatory. 2018-02-24 12:18:48 +05:30
Kishore Nallan
dc9ff3033b Fix failing test. 2018-02-08 16:57:32 -08:00
Kishore Nallan
b451a559e0 Improve error message logged when a replica is pointed to an old master.
In such a case, the WAL entries would have expired - logging that explicitly now.
2018-02-03 10:05:24 -05:00
Kishore Nallan
0f7530ed13 Serialize and deserialize next sequence id properly instead of string encoding. 2018-02-02 19:02:19 -05:00
Kishore Nallan
6b37d43fd7 Use a proper logger for logging! 2018-02-02 08:22:23 -05:00
Kishore Nallan
13f4c72827 Use rocksdb batch put for atomic insertion. 2018-01-28 14:29:07 -05:00
Kishore Nallan
c16513005b Return 409 status code when document with given ID already exists. 2018-01-27 08:47:28 -05:00
Kishore Nallan
85f9887a2f Fixed a bug where a document with a duplicate ID was being inserted without an error. 2018-01-26 21:55:59 +00:00
Kishore Nallan
ba49971ddd Fixed a few more -Wall warnings. 2018-01-26 13:38:02 +00:00
Kishore Nallan
f8be8f4f6a Handle normalization of unicode elegantly. 2018-01-26 12:54:00 +00:00
Kishore Nallan
c3298ba6d8 Address -Wall and -Wextra warnings. 2018-01-25 20:08:13 +05:30
Kishore Nallan
491de5a325 Remove ascii special characters during string normalization.
Unicode special chars are retained verbatim - will be addressed in future.
2018-01-16 21:16:24 +05:30
Kishore Nallan
192b00e71f Address API review comments.
1. Move document specific actions under /documents
2. Document creation echoes the full document
3. Collections summary returns full detail on each each collection
4. Collections summary endpoint has no nested root attribute
5. When collection or document is deleted, the whole entity is returned in response
2018-01-02 21:35:24 +05:30
Kishore Nallan
f6612cb34e Refactor scoring loop. 2017-12-30 21:14:31 +05:30
Kishore Nallan
21d3de6145 Improve handling of replication errors / edge cases. 2017-12-24 21:03:30 +05:30
Kishore Nallan
01275c38f2 Use unorderd_map for low-volume meta datastructures.
Order of spp:sparse_hash_map during iteration is different in clang and gcc.
2017-12-20 09:16:15 +05:30
Kishore Nallan
8d5f7c18a3 Fix a couple of test compile warnings. 2017-12-17 21:42:45 +05:30
Kishore Nallan
60288631be Check partial node text iteratively for prefix match. 2017-12-17 10:26:37 +05:30
Kishore Nallan
b1ac1d7d7c Allow API key to be passed as a GET parameter as a backup to a header.
Required for authentication of JSONP requests from JS client.
2017-12-17 07:35:59 +05:30
Kishore Nallan
3a9743aa90 Parameterize API key used by replication. 2017-12-16 22:12:03 +05:30
Kishore Nallan
fe0db59877 Support indexing of bool fields. 2017-12-13 20:24:44 +05:30
Kishore Nallan
78b9ee52ec Make match score computation predictable and consistent across multiple indexes. 2017-11-12 22:31:29 +05:30
Kishore Nallan
c1c7d493e5 Allow an integer to be passed to a field defined as a float.
In the real world, this is common and it's better to be practical here.
2017-11-05 10:45:45 +05:30
Kishore Nallan
3907c2d3f9 Fixed an out-of-bounds bug with highlighting. 2017-11-03 21:07:56 +05:30
Kishore Nallan
a7479171b1 Limit number of prefix candidates to a constant.
Using number of results for comparison results in change of total count during pagination.
2017-11-02 18:04:55 +05:30
Kishore Nallan
93430444d0 Prefix sort field can also be a float 2017-10-25 18:18:07 +05:30
Kishore Nallan
a2ce56fd67 Allow string fields to be filtered on.
A rather convenient feature to have - but it would be a match of all tokens without typo tolerance.
2017-10-22 19:52:09 +05:30
Kishore Nallan
da295e90e8 Tests for array utils. 2017-10-22 17:05:25 +05:30
Kishore Nallan
0e4517d901 Combine search, facet and sort fields into a single "fields".
1. All numerical fields are added to sort index automatically since that makes logical sense.
2. Search fields to be used as a facet are to have a `facet: true` property - removes duplication.
3. If someone wants to use a faceted field also to search against (rare scenario), then they can duplicate that field without the `facet: true` property.
2017-10-17 21:51:07 +05:30