Improve algorithm for Dice Coefficient #24

Open


@rlue rlue commented Jan 17, 2017

Pad words at start and end to compensate for underweighting of characters
at word edges. In FuzzyMatch::Similarity, remove fallback to Levenshtein
for words < 3 characters long.

(Fix for issue #22.)
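
To illustrate the idea described above, here is a minimal Ruby sketch of a bigram Dice coefficient with and without edge padding. It is not the code from this pull request: the helper names `bigrams` and `dice`, the `pad:` keyword, and the choice of a single space as the padding character are all assumptions made for illustration.

```ruby
# Hypothetical helpers for illustration; not the gem's actual API.

# Split a word into overlapping character pairs.
# With pad: true, one space is added at each end (assumed padding character),
# so the first and last characters each appear in two bigrams like the rest.
def bigrams(word, pad: true)
  s = word.downcase
  s = " #{s} " if pad
  s.each_char.each_cons(2).map(&:join)
end

# Dice coefficient: 2 * shared bigrams / total bigrams in both words.
# Bigrams are matched as a multiset, so repeated pairs count once each.
def dice(a, b, pad: true)
  x = bigrams(a, pad: pad)
  y = bigrams(b, pad: pad)
  return 0.0 if x.empty? || y.empty?

  remaining = Hash.new(0)
  y.each { |bg| remaining[bg] += 1 }

  shared = 0
  x.each do |bg|
    next unless remaining[bg] > 0
    remaining[bg] -= 1
    shared += 1
  end

  2.0 * shared / (x.size + y.size)
end

puts dice("night", "nacht", pad: false)
puts dice("night", "nacht", pad: true)
puts dice("a", "a", pad: false)
puts dice("a", "a", pad: true)
```

Without padding, a one-character word has no bigrams at all, so it cannot be scored; with padding it produces two. That is one plausible reason a separate fallback for very short words becomes unnecessary once words are padded.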

rlue (Author) commented Jul 11, 2017

Paging @seamusabshere, any objections to merging this?

karpet commented Oct 18, 2017

It appears this gem has gone fallow/neglected, which is too bad because it's a great place to start with for its purpose. I am using this patch from @rlue in an app and it definitely improves my matching for strings with short tokens.

seamusabshere (Owner) commented Oct 18, 2017

hey @rlue @karpet, sorry for the abandonware.

ASAP, I'll update this to use https://github.com/seamusabshere/pg_trgm internally, replacing both Dice's (aka bigram) and Levenshtein with something proven to work like Postgres.
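
For readers unfamiliar with the trigram approach mentioned here, the following is a rough Ruby sketch of trigram similarity as the PostgreSQL pg_trgm documentation describes it: each word is treated as having two spaces prefixed and one suffixed, and similarity is the share of distinct trigrams the two strings have in common. It does not use or reproduce the linked gem, whose API is not shown in this thread; the names `trigrams` and `trigram_similarity` are hypothetical.

```ruby
require 'set'

# Hypothetical helpers for illustration; not the API of the linked gem.

# Collect the set of trigrams for every alphanumeric word in the text,
# padding each word with two leading spaces and one trailing space.
def trigrams(text)
  words = text.downcase.scan(/[[:alnum:]]+/)
  words.flat_map { |word|
    "  #{word} ".each_char.each_cons(3).map(&:join)
  }.to_set
end

# Shared distinct trigrams divided by distinct trigrams in either string.
def trigram_similarity(a, b)
  x = trigrams(a)
  y = trigrams(b)
  union = (x | y).size
  return 0.0 if union.zero?

  (x & y).size.to_f / union
end

puts trigram_similarity("word", "two words")
```

Because every word is padded, short tokens still produce trigrams, which addresses the same edge-weighting concern that motivated this pull request.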

gyfis added a commit to gyfis/fuzzy_match that referenced this pull request Dec 13, 2018