Skip to content

A KNN model for classifying text reviews into binary 1 and -1 scores to indicate positive and negative sentiments with NLP text preprocessing, feature extraction, dimensionality reduction and k-fold cross validation.

Notifications You must be signed in to change notification settings

amaddha/KNN-Text-review-classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

KNN-Text-review-classification

A KNN model for classifying text reviews into binary 1 and -1 scores to indicate positive and negative sentiments with NLP text preprocessing, feature extraction, dimensionality reduction and k-fold cross validation.

  1. Preprocessing:
  • Removing
    • HTML tags
    • URLs
    • Email-ids
    • Numbers
    • Punctuation
    • Accented text
  • Tokenizing
  • Removing stop words
  • Lemmatizing
  1. Feature extraction - Countvectorizer
  2. Similarity/Distance metric used for knn - Cosine similarity
  3. Cross-validation - k-fold

About

A KNN model for classifying text reviews into binary 1 and -1 scores to indicate positive and negative sentiments with NLP text preprocessing, feature extraction, dimensionality reduction and k-fold cross validation.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published