diff --git a/docs/layer_activation.html b/docs/layer_activation.html
index e69de29..62d7617 100644
--- a/docs/layer_activation.html
+++ b/docs/layer_activation.html
@@ -0,0 +1,56 @@
+
+
+
+
+
+
Layers > Activation
+
+
What is it
+
+ The Activation layer takes all the inputs and applies the activation function on each one.
+
+
+ It allows the network to learn non-linear functions, so without an Activation layer the network would only be able to learn linear functions.
+ So it's an important part of the network.
+
+
+
How to create an Activation layer
+
+ To create a layer we first need a neural network to create the layer in, as seen in the XOR example.
+
+
+ We can then call the add_layer method on the neural network and create the layer that we want as the parameter for the function.
+ In our case we are going to create an Activation layer.
+
+
+ We only need one parameter, it is the activation function we chose for this layer.
+ A commonly used one is the leaky relu function, as the derivative is easy to compute and the derivative does not fades out as for a sigmoid for example.
+ Other activation functions can be found in here.
+
+
+model.add_layer(nn.Activation.new(.leaky_relu))
+
+
+
+
+
+
\ No newline at end of file
diff --git a/docs/layer_dense.html b/docs/layer_dense.html
index e69de29..7b584a1 100644
--- a/docs/layer_dense.html
+++ b/docs/layer_dense.html
@@ -0,0 +1,57 @@
+
+
+
+
+
+
Layers > Dense
+
+
What is it
+
+ A neural network is made out of layers. The dense layer is also called the fully connected layer or just known as a 'normal' layer.
+ It is composed of weights connecting each input with every output, and then it adds the biases to each output.
+
+
+ The Dense layer is complementary with the Activation layer as the dense layer is the part that will train and improve and the activation layer
+ produces non-linearity or said in another way: it allows the network to learn many more things.
+
+
+
+
How to create a Dense layer
+
+ To create a layer we first need a neural network to create the layer in, as seen in the XOR example.
+
+
+ We can then call the add_layer method on the neural network and create the layer that we want as the parameter for the function.
+ In our case we are going to create a Dense layer.
+
+
+ The first two arguments are for the number of inputs and the number of outputs of the layer, because as said earlier each input is connected with every output and there is a bias for each output.
+ The third ad fourth arguments are the range for initialisation of weights and biases respectively. In this example for weights, they can be initialized between -0.7 to 0.7.
+
+
+model.add_layer(nn.Dense.new(2, 3, 0.7, 0.65))
+
+
+
+
+
+
\ No newline at end of file
diff --git a/docs/network.html b/docs/network.html
index 459c985..136c5da 100644
--- a/docs/network.html
+++ b/docs/network.html
@@ -11,6 +11,11 @@
@@ -48,9 +53,11 @@
Structure of the code
Then we add the layers that we want our network to have.
We need our network to have 2 inputs and 1 output to match the XOR gate.
- So we will first add a Dense layer with 2 inputs and 3 outputs, 3 is arbitrary but works well.
+ So we will first add a Dense layer with 2 inputs and 3 outputs, 3 is arbitrary but works well.
The two numbers after the number of inputs/outputs is the range for the initialisation of random weights and biases.
- Then an Activation layer, the Dense and Activation layers are complementary so we will add one per Dense layer.
+
+
+ Then an Activation layer, the Dense and Activation layers are complementary so we will add one Activation per Dense layer.
The activation function that we will use for this layer is leaky relu, as it is convenient.
We add a second Dense layer with 3 input and 1 output and the Activation layer that goes with it.
@@ -64,9 +71,13 @@
Structure of the code
Then we need to create the parametters for the training.
The learning rate, momentum, number of epochs are found by trial and error and these work well.
The cost function that we will use is the Mean Squared Error (MSE).
+
+
We then add the dataset that the network will use for it's training.
And same for the testing, in a real example the test data is unseen during the training to be able to see how well the networks does in an unseen situation
but as we have only 4 different possible inputs we can not show unseen data to the network so we will use the same data.
+
+
The neural newtork will print it's performance every print_interval epochs.
For the test parameters, every training_interval epochs it will run the test dataset and print the results from the print_startth element of the test dataset to the print_endth one.
diff --git a/docs/style.css b/docs/style.css
index 07982d7..f00859c 100644
--- a/docs/style.css
+++ b/docs/style.css
@@ -98,6 +98,13 @@ a {
display: block;
}
+.sidenavbar p{
+ margin-bottom: 0em;
+ text-decoration: none;
+ font-size: 18px;
+ color: #a6adc8;
+}
+
.sidenavbar a:hover {
color: #94e2d5;
}
@@ -105,7 +112,7 @@ a {
i {
font-style: normal;
font-family: "JetBrainsMonoXLight";
- background-color:#313244;
+ background-color:#45475a;
padding: 0em 0.5em 0em 0.5em;
border-radius: 0.99em;
}