579 Commits

Author SHA1 Message Date
Kishore Nallan
e1526319f7 Building up support for prefix based searching and for ranking token suggestions by either frequency or max_score. 2016-11-27 14:56:15 +05:30
Kishore Nallan
db22d01b84 Added an ART search token cache.
Caches previous searches so that ART search is not called repeatedly while iterating through corrections.
2016-11-26 17:57:05 +05:30
Kishore Nallan
4e10fadeb7 Settle for partial matches when the whole query produces no results. 2016-11-26 17:13:16 +05:30
Kishore Nallan
396e10be5d Refactor collection's search method to be more judicious in using higher costs.
Earlier, if even one token produced no results, ALL tokens were searched again with a higher cost. This change ensures that we first retry only the token that produced no results with a larger cost, before doing the same for the other tokens.
2016-11-24 21:39:20 +05:30
Kishore Nallan
44d55cb13d Fixed a search issue: tokens that are not found in the index should be skipped. 2016-11-19 16:56:59 +05:30
Kishore Nallan
5736888935 Tests for collection. 2016-11-13 21:59:32 +05:30
Kishore Nallan
ea0da73cfb Fix C++11 warnings. 2016-11-13 09:56:13 +05:30
Kishore Nallan
18a4528540 Forarray tests. 2016-11-13 09:53:30 +05:30
Kishore Nallan
9bb24331cc Fuzzy search test - multiple results. 2016-11-12 21:30:22 +05:30
Kishore Nallan
aab5912110 Fuzzy search tests. 2016-11-07 19:36:28 +05:30
Kishore Nallan
c7e58efafd Add some regression tests for checking out of bounds. 2016-11-06 08:30:00 +05:30
Kishore Nallan
7a0187e6b3 Import and port art tests. 2016-11-01 18:19:21 +05:30
Kishore Nallan
d2b903a931 Set up Google Test. 2016-10-16 22:15:11 +05:30
Kishore Nallan
e777afc97f API for removing a document from index. 2016-09-24 18:08:57 +05:30
Kishore Nallan
e7c6c6d3cb Fixed multi word queries. 2016-09-12 14:25:07 +05:30
Kishore Nallan
1a53d5692e Fuzzy match rewrite - still need to work on matching perf. 2016-09-04 12:22:16 +05:30
Kishore Nallan
1d3af330dd JSON document as input to collection.add method. 2016-08-28 09:23:30 +05:30
Kishore Nallan
c079b22cbd Fix typo in test document harness.
Added better print debugging in the process.
2016-08-23 22:37:54 +05:30
Kishore Nallan
e1f4b3d513 Constantize arguments, some clean-up code. 2016-08-05 18:26:31 -07:00
Kishore Nallan
bb0e7aefb9 Rename score to max_score for internal node and leaf structs. 2016-06-08 11:26:52 +05:30
Kishore Nallan
80d9f57b7b Code clean-up. 2016-05-30 20:13:55 +05:30
Kishore Nallan
b88241d9e9 Bug fix: word suggestions were not showing up sorted on their document scores.
Somehow, std::max() on uint16_t does not seem to work. Using a MAX macro.
2016-02-21 19:21:20 +05:30
Kishore Nallan
ee77fb4d22 Add 2 more external dependencies via git submodule. 2016-01-24 14:35:40 +05:30
Kishore Nallan
2dfc31a519 Sorting on popularity metric - WIP. Still has bugs. 2015-12-29 20:55:50 +05:30
Kishore Nallan
0d1eca8229 Move duplicated code into a macro. 2015-11-29 09:08:39 +05:30
Kishore Nallan
e4a2be3ac3 Rewriting fuzzy look-up using incremental levenshtein matrix. WIP. 2015-11-28 22:41:26 +05:30
Kishore Nallan
b7dbec8535 More bug fixes for fuzzy match. 2015-11-26 08:01:08 +05:30
Kishore Nallan
025d3b6bce Fix bugs in fuzzy match. 2015-11-26 07:21:01 +05:30
Kishore Nallan
64f53b6420 Initial commit. Fuzzy prefix match works. 2015-11-10 19:44:44 +05:30