This repository contains a Artificial Backpropagation Multilayer Neural Network implementation developed on the "Artificial Intelligence" course at FEUP. The goal was to build an efficient Neural Network, using the Backpropagation algorithm in the learning process, to diagnose patients from a dataset based on their voice's physical characteristics.
This project was developed by JPMMaia and msandim.
To run the application, go to the releases page. For further information on the topics of this project, check the report (portuguese only).
- data - contains the train and test data;
- resources - resources for this README;
- src - source code of the application;
- test - unit tests used in the development phase;
- test_data - data files used in test;
- uml - class diagram of the application.
Note: The main function is included in "GUIMain.java". The main function in "Program.java" was only used for development debugging.
The dataset involved in this project (including training and testing sets) is described and available for download here and includes several features from patient voice recordings (vowels, numbers, short sentences and words). The "UPDRS" feature was ignored in this analysis.
In order to find the connection weights that minimize the model's cost function, the Backpropagation algorithm was used. The minimization step is based on the Gradient Descent algorithm, with an added "momentum" term that avoids imprisonments in local minimums of the cost function (which may not be convex).
More information on this algorithm is available here:
Fausett, Laurene. Fundamentals of neural networks: architectures, algorithms, and applications. Prentice-Hall, Inc., 1994.
The interface allows the user to modify the:
- Number of neurons on each hidden layer;
- Learning rate and momentum values;
- Train and test data paths;
- Medium Squared Error value to achieve convergence in the algorithm;
- Maximum number of iterations;
- Types of recordings present on the train and test phases.
On the right panel, the MSE for the training set on each iteration is showed, along with the number of well-classified cases. After the training process is concluded, the same information is reported for the test set.