With all the hype about machine learning and artificial intelligence I thought it would be a good time to refresh my knowledge on the subject. Machine learning meets a lot of scepticism because it is mostly in the media for questionable tasks. Like all tools it can be used for evil things but it also has a lot of beneficial applications. I’m mostly interested in using it for computer graphics, games and tools.
I have implemented a multi layer perceptron in C++, using the Eigen template library for linear algebra. The code is quite simple, it allows to create a neural network with a variable number of layers and nodes per layer, to train the network, to query the network and to reverse query the network.
I started with a standard test: Recognition of handwritten numbers using the MNIST handwritten digit database. The dataset contains images of handwritten digits in a resolultion of 28×28 pixels. After having some problems with the network weights (their inital values were too big), I got it running and started some tests. The network I used had three layers, the input layer with one node per pixel, a hidden layer with just hundred nodes and the output layer with one node for each digit. I first trained the neural network with all training images multiple times and queried it afterwards.
From 10000 test images 97.05% were classified correctly. Given how simple the neural network was, this is pretty good in my opinion. However, for a real world application the error is still too big.
After that I made a reverse query, to see what image the neural network expects for each digit for a perfect match. Here are some images (enlarged):
data:image/s3,"s3://crabby-images/b5775/b577592b61cc72ee0c632a218dabb57e6f30aaa3" alt="0"
data:image/s3,"s3://crabby-images/cfd68/cfd68aa86ded427eefb4ba7f830f362dd73ddf55" alt="1"
data:image/s3,"s3://crabby-images/2ac9f/2ac9ffec1140815175876b5e54e4f19f92765c16" alt="2"
data:image/s3,"s3://crabby-images/5fd8c/5fd8c8892d02fd20630f8489bd444e91ca7efb52" alt="3"
data:image/s3,"s3://crabby-images/8eb4d/8eb4d0211c71e4556057ca66cf6ebd959f7634c5" alt="4"
data:image/s3,"s3://crabby-images/abc5d/abc5d16b8bfa0743098685b34be0edceeda1e4bf" alt="5"
data:image/s3,"s3://crabby-images/cf30c/cf30cc325cf730aa566cadc49f59be8b6aefa75f" alt="6"
data:image/s3,"s3://crabby-images/d67b5/d67b5b224e49fbd5bee75bc0d3bd9d4df443bfc9" alt="7"
data:image/s3,"s3://crabby-images/dd6cc/dd6cce98af15bb116eab691d12eab190cb7fdd08" alt="8"
data:image/s3,"s3://crabby-images/04dcf/04dcf675a7a3ec6f89916bdab3d16af9c7647f2d" alt="9"
As a next step I plan to implement a convolutional neural network, but I’m also still looking for cool applications for it.