Skip to content

Latest commit

 

History

History
95 lines (73 loc) · 6.04 KB

README.md

File metadata and controls

95 lines (73 loc) · 6.04 KB

RentML

Table of Contents


Idea behind the project

Using own prior experience within the field, I came up with a tool, that would make my life easier back in the day.
With every car moving in and out, there had to be a protocol manually filled with current date and mileage.
This program does this job automatically by simply reading from a picture and adding it to a database.
Additionally program visualizes mileage growth over time with a covenient chart.

Project is based on Streamlit, which makes it especially convenient for its purpose, enabling user to take pictures directly into the program with a phone.


Getting started

File downloads.ipynb contains functions to download proper libraries, datasets and image processing model required for the project.
Library versions are contained in requirements.txt file.

  1. Open project folder in your IDE
  2. Before first use, use downloads.ipynb to download image dataset and image recognition model
  3. Run main.py
  4. In terminal launch command: streamlit run main.py. Script will provide you with command upon launching
  5. Launched streamlit will provide local IP address to open in your browser while also opening it automatically.

If Streamlit does not launch properly, make sure terminal is operating within the ReadML folder.


Usage

Upon launching Streamlit app using main.py, file upload widget can be seen.
Below there is Altair chart showing car mileages. You can adjust view of each car using checkboxes above.
Hover the chart to see exact time and mileage in each dot.

Upon importing file using the widget, imported image will be shown and processing will start. Upon completion, mileage will be shown along with entire record containing information about the image. Data from processed images lands in mileage.json file.


Libraries used

  • Streamlit - Provides GUI and ability to use the app in mobile phone - that will come in handy
  • Altair - Data visualization tool for interactive chart
  • EasyOCR - Optical Image Recognition model
  • Torch andTorchVision - Build Neural Network model for car recognition
  • OpenCV - Image preprocessing - results available in drafts

Technical challenges

Initially the idea was to use multiple layers of preprocessing to squeeze out best readability of every picture.
OCR turned out to be satisfying enough by itself. Dataset was collected without the intent to ever be used this way, so multiple preprocessing settings were successful for one image only, while taking way too long to make sense.
Eventually, a part of initial dataset had to be removed because of that.
Historic data in this project only serve to provide rough estimations New pictures used in this app will be taken with intent to be OCRed, guaranteeing better reliability.
That's why whole preprocessing is commented out from the program. drafts folder contains some attempts.


Project structure

├───data
│   ├───main-dataset         -dataset of 60 pictures for end-user
│   ├───recognition-model    -contains a script to generate recognition model and the model itself
│   └───result               -scanned files are stored in json file inside this folder
├───drafts                   -experiments, pre-processing tests and visualization methods
├───modules                  -stores all functions used in project

Lessons learned

As this was my first project ever, I approached this project with only basic knowledge about python in general.
Idea for this project was not suited for my knowledge, but it solves problem I have been facing.
Therefore I decided to not limit myself with currently posessed knowledge and limit-test Github Copilot for generating code snippets.

Having the goal in mind, Kanban board within Github Projects allowed me to divide the idea into tiny pieces, thinking everything through.
https://github.com/users/Ne0bliviscaris/projects/2 (currently in polish, will translate it later)

  • First issue with this project was navigation and code readability. Exporting functions to separate files, along with typehints and docstrings enabled me to quickly navigate through it.
    Initially everything beside dataset was located in main folder. Organizing file structure further improved navigation.

  • Second issue was OCR reading precision. Dataset was not taken with this in mind, so I had to test multiple methods of preprocessing. Big part of the dataset was extremely difficult to read, so I ended up removing them from dataset and commenting out preprocessing functions from main program. Important part here was to decide if it will impact the functionality of the program. Historic data only serve as a background, while reading newly taken pictures is the main program feature. Simply keeping that in mind while taking picture for the program will already solve most readability issues. However, precision of reads from phone camera is still a subject to improve.

  • Another issue was categorizing cars by their mileages. 2 of 3 cars had extremely simillar mileages, so it was impossible to precisely differentiate them without using machine learning. Copilot provided the code to train my own neural network model with a part of existing dataset. Using that model solved the issue of intertwined mileages, as the model categorized them based on dashboard.

  • Commenting every major step in functions provided further understanding and easier navigation within each function.


Author

Mateusz Ratajczak