
kapilggg10/Plagiarism-checker-nltk


File structure:

"files": directory containing the training data, i.e. the text files used to train the model.

"plag_training.py": builds the model and returns 'tf idf', 'tfs', and 'files'. Here 'tf idf' is the vectorized and tokenized model of the training data, 'tfs' is the matrix model of the data, and 'files' is the list of all training files.



"plagiarism.py": checks for plagiarism between an input file and the trained data.
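
The two steps described above (build a TF-IDF model from the training files, then compare an input text against it with cosine similarity) can be sketched roughly as follows. This is a minimal illustration, not the code in this repository: it uses only the Python standard library in place of nltk, the function names `tokenize`, `train`, and `check` are hypothetical, the training data is passed as an in-memory dict rather than read from the "files" directory, and the smoothed IDF formula is an assumption.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and keep alphabetic runs; a simple stand-in for an nltk tokenizer.
    return re.findall(r"[a-z]+", text.lower())


def train(docs):
    """docs maps file name -> text. Returns (idf, tfs, files)."""
    files = sorted(docs)
    counts = [Counter(tokenize(docs[name])) for name in files]
    # Document frequency: in how many files each term appears.
    df = Counter(term for c in counts for term in c)
    n_docs = len(files)
    # Smoothed IDF (an assumption), so terms found in every file keep some weight.
    idf = {t: math.log((1 + n_docs) / (1 + d)) + 1 for t, d in df.items()}
    # One sparse row per file: term -> term count * idf.
    tfs = [{t: n * idf[t] for t, n in c.items()} for c in counts]
    return idf, tfs, files


def cosine(a, b):
    # Cosine similarity between two sparse vectors stored as dicts.
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def check(text, idf, tfs, files):
    # Terms never seen in training have no IDF weight and are dropped.
    query = {t: n * idf[t] for t, n in Counter(tokenize(text)).items() if t in idf}
    scores = [(name, cosine(query, row)) for name, row in zip(files, tfs)]
    # Most similar training file first.
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


docs = {"a.txt": "the cat sat on the mat", "b.txt": "the dog sat on the log"}
idf, tfs, files = train(docs)
ranking = check("a cat sat on a mat", idf, tfs, files)
for name, score in ranking:
    print(f"{name}: {score:.2f}")
```

A score near 1 means the input shares most of its weighted vocabulary with that training file, while a score of 0 means no shared terms. Where to set the threshold for flagging plagiarism is a judgment call that this sketch leaves open.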

About

Plagiarism checker using nltk and cosine similarity
