Media Classifier

Visual media recognition using a vocabulary tree and homographic projections.

What does it do?

Given a set of training images, the system can be used to label unseen images, and thus be used for looking up data from databases and such. A simple program that uses the system has been integrated (simple_matcher.py).

The project can be used with other types of images, but all training-testing was performed using Stanford's media dataset. The training data (Reference), as well as the last model trained, have been included on the repo for the sake of showing some samples. https://exhibits.stanford.edu/data/catalog/rr389hv5603

For the first part, the following project attempts to implement the following paper: http://www.cs.ubc.ca/~lowe/525/papers/nisterCVPR06.pdf

It also adds an extra layer of processing on top of the original implementation. It takes advantage of homographic projections to further analyse the list of prospective candidates. Some context: https://en.wikipedia.org/wiki/Homography_(computer_vision)

DEPENDENCIES

The project is built using Python 3.6, and assumes you have scikit-learn, numpy and opencv available as dependencies.

HIGH-LEVEL DETAILS

Most configurable items (hyperparameters and preferences) can be changed from the config.json file.
The trainer can be used to create a model from the data you desire.
The retriever can be used to load a model and score an image against the trained database. It uses the L1 norm.
The homographic projection layer can be used to filter a list of matches and provide stronger predictions.
You can play around with the validation class to test different configurations!

DISCLAIMER

The system may not be the most efficient speed wise (and I would recommend checking the scoring logic in case it is not implemented as specified).
The modularization of the project allows its components to be treated/improved/adjusted independently, but it can always be improved!

On simple_matcher.py

This little program uses a saved model to score an image against the database to get the top N matches, and then extracts the best match using homographic projections. The results are then placed on an HTML page and shown to the user.

The Result folder belongs only to simple_matcher. In theory they could be decoupled from the rest of the dataset.

To run: python simple_matcher.py (-t optional to train model using current configs) file_path_to_match

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
Model		Model
Result		Result
Trainer/Data/Reference		Trainer/Data/Reference
README.md		README.md
config.json		config.json
homography.py		homography.py
image_retriever.py		image_retriever.py
sift_extractor.py		sift_extractor.py
simple_matcher.py		simple_matcher.py
utils.py		utils.py
validation.py		validation.py
vocabulary_tree.py		vocabulary_tree.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Media Classifier

What does it do?

DEPENDENCIES

HIGH-LEVEL DETAILS

DISCLAIMER

On simple_matcher.py

About

Releases

Packages

Languages

danoc93/media_recognition

Folders and files

Latest commit

History

Repository files navigation

Media Classifier

What does it do?

DEPENDENCIES

HIGH-LEVEL DETAILS

DISCLAIMER

On simple_matcher.py

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages