Skip to content

ticogarcia/final-project-code-and-report-machine_learners

 
 

Repository files navigation

Final Project

Training Data

The training data set is the same for every team in this course.

You can download the data in our Canvas page:

About The Project

  1. train.ipynb - Jupyter notebook with code including all preprocessing, GridSearch CV, and then training on final model and saving the model as "Final_CNN".

For preprocessing we tried many cv2 functions such as MORPH_OPEN, CLOSE, GRADIENT, Blurs but finally we used only medianblur and Morph_Open.

  1. Final_CNN - This is our saved CNN model after training on all data_train.npy images with best parameters achieved through GridSearch CV.

After saving the model we can use it in easy_test and hard_test files to test on new data.

  1. easy_test.ipynb - Jupyter notebook used to classify the images and test the accuracy of Easy Test Dataset.
  2. Final_Report.docx - Report of our project
  3. new_labels_train.npy - Corrected labels file of traing data
  4. data.npy - temporary data of 400 images to test our data, just to check if the code is working fine, any data with proper dimensions mentioned below can be used.
  5. labels.npy - labels for above data.npy

Getting Started

In this section you should provide instructions on how to use this repository to recreate your project locally.

  1. You can use 'git clone' to clone this repository (Project link is given below)
  2. Just change the np.load() and load your own data with dimension(90000,N)
  3. For training use can use data from the links given above.

Dependencies

  1. Tensorflow 2.6 (pip install tensorflow==2.6)
  2. OpenCV 4.5.2 (pip install opencv-python)
  3. Seaborn (pip install seaborn)
  4. Sklearn (pip install sklearn)
  5. sckit-learn (pip install -U scikit-learn)
  6. keras (pip install keras)
  7. numpy (pip install numpy)
  8. matplotlib (pip install matplotlib)
  9. pandas (pip install pandas)

Usage

  1. All packages mentioned above should be installed before runnning any code file.
  2. You just have to change np.load() in the first cell and load your own data
  3. Dimension of data.npy file should be (90000,N), where N is number of images or samples and Labels.npy dimensions should be (N)
  4. Now, run all the cells in Easy_Test and Hard_Test and you will see that Confusion Matrix and Accuracy in the last 3 cells. For more examples, please refer to the Documentation

Authors

Rajat Bishnoi - [email protected] James Overmeyer - [email protected] Cody Hutcheson - [email protected]

Acknowledgements

  1. Catia Silva
  2. GitHub
  3. Stackoverflow
  4. Medium.com
  5. towardsdatascience.com
  6. tensorflow.org

Thank you

About

final-project-code-and-report-machine_learners created by GitHub Classroom

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.5%
  • PureBasic 0.5%