Document similarity using cosine distance, tf-idf, and latent semantic analysis.
-
Updated
Feb 15, 2017 - R
Document similarity using cosine distance, tf-idf, and latent semantic analysis.
Big data homework solutions
NLP Projects
Index documents in Apache Solr and see similarities in the document's contents.
Simple document similarity module implemented in NodeJS
Using Jaccard-Similarity and Minhashing to determine similarity between two text documents
Document searching from queries using Inverted index
Natural Lang processing scripts
Data mining on stack overflow Q/A data to understand the landscape of languages and developers in computer science
Classifying news articles with deep learning to build an automatic newsletter
This is a program used to check document similarity using Natural Language Tool Kit,using Cosine Similarity.
A system for automatic tagging of metadata of theses and dissertations from Bicol University
The Bitnation Jurisdiction Public Notary DApp
Code to train a LSI model using Pubmed OA medical documents and to use pre-trained Pubmed models on your own corpus for document similarity.
A Clojure library for querying large data-sets on similarity
a search engine for Pubmed artitcal
Document similarity algorithms experiment - Jaccard, TF-IDF, Doc2vec, USE, and BERT.
Add a description, image, and links to the document-similarity topic page so that developers can more easily learn about it.
To associate your repository with the document-similarity topic, visit your repo's landing page and select "manage topics."