Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Updated Jun 1, 2022 - Java
A .NET port of java-string-similarity
Gentle introduction to basic Elasticsearch constructs that boost search: ngrams, shingles, stemmers, suggesters and fuzzy queries.
Free WordPress Plugin: Use a free roofing square footage calculator to determine your roof size and get the right amount of roofing shingles for your job. Estimate cost, time, and labor for DIY or contractors. www.calculator.io/roofing-calculator/
Lucene token filter that removes trailing stopwords from shingles.
Rust min-shingle hashing implementation
📚 Word shingling for near duplicate document detection
Plagiarism Detection System, designed to identify similarities between a given text and existing online content.
First story detection using shingling, LSH and graphical methods
Using shingles/most used phrases in Elasticsearch (v7) and Kibana graph
Implementation of the LSH algorithm for job announcements on the Kijiji website
Testing Jaccard similarity and Cosine similarity techniques to calculate the similarity between two questions.
Search engine for plagiarism over the internet. Based on Google API.
Code for Shingling
Golang shingles algorithm implementation for English, French, Norwegian, Russian, Spanish and Swedish
Data Mining Projects 2017
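
Several of the projects listed above combine word shingling with Jaccard similarity for near-duplicate detection. As a rough illustration of that idea, here is a minimal sketch in Python. It is not taken from any of the listed repositories; the function names `word_shingles` and `jaccard`, the whitespace tokenization, and the default shingle size `k=3` are all assumptions made for this example.

```python
def word_shingles(text, k=3):
    """Return the set of k-word shingles (as tuples) in text.

    Assumption: tokens are lowercase whitespace-separated words.
    A text shorter than k words yields a single shingle of all its words.
    """
    tokens = text.lower().split()
    if not tokens:
        return set()
    if len(tokens) < k:
        return {tuple(tokens)}
    return {tuple(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}


def jaccard(a, b):
    """Jaccard index |a & b| / |a | b| of two sets (1.0 if both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


if __name__ == "__main__":
    doc1 = "the quick brown fox jumps"
    doc2 = "the quick brown fox sleeps"
    print(jaccard(word_shingles(doc1), word_shingles(doc2)))
```

The two sample sentences share two of their four distinct 3-word shingles, so the script prints 0.5. Real systems usually hash the shingles and estimate this ratio with MinHash or LSH rather than comparing full sets.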