Project goal

This project aims to recognize handwritten digits in real time. The main goal is to let a user write a mathematical expression (such as 2 + 3) in any setting, and have the system extract the operands, identify the operation to be performed, and display the correct result.
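As a rough illustration of the final step, the sketch below takes a list of symbols that have already been recognized and computes the result. The `evaluate` function, the `OPERATIONS` table, and the strict left-to-right evaluation order are assumptions made for this example, not code taken from the repository.

```python
import operator

# Hypothetical mapping from recognized symbols to operations; only the
# addition and subtraction symbols mentioned in this README are covered.
OPERATIONS = {"+": operator.add, "-": operator.sub}


def evaluate(symbols):
    """Evaluate recognized symbols left to right, e.g. ['1', '2', '+', '3']."""
    operands, operators, digits = [], [], ""
    for s in symbols:
        if s.isdigit():
            digits += s  # adjacent digits form one multi-digit operand
        elif s in OPERATIONS:
            if not digits:
                raise ValueError("operator without a preceding operand")
            operands.append(int(digits))
            operators.append(s)
            digits = ""
        else:
            raise ValueError(f"unsupported symbol: {s!r}")
    if not digits:
        raise ValueError("expression must end with an operand")
    operands.append(int(digits))

    # Apply the operators in reading order (no precedence is needed for + and -).
    result = operands[0]
    for op, value in zip(operators, operands[1:]):
        result = OPERATIONS[op](result, value)
    return result


print(evaluate(["2", "+", "3"]))  # 5
```

Keeping evaluation separate from recognition means the classifier only has to output one label per character, and the arithmetic can be tested without any images.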

Methodology

Image Acquisition:

a. Collect digits/symbols written on paper and scan/capture them. I used the addition and subtraction symbols from this Kaggle dataset: https://www.kaggle.com/datasets/sagyamthapa/handwritten-math-symbols

b. Use the MNIST dataset for standard digits.

Image Preprocessing:

a. Convert to grayscale.

b. Apply thresholding and morphological operations.

c. Detect contours and extract characters via bounding boxes.

d. Resize and normalize each extracted character to 28x28 pixels.

e. Invert if necessary to match MNIST style (white digits on black).

The code for this step is in preProcess_final.py.

Model Training:

a. Train a CNN on the original MNIST dataset first. I initially tried KNN and SVM models, but they could not label the digits accurately. I then experimented with several CNN architectures and chose one with 4 convolutional layers, 3 max-pooling layers, and 2 dense layers. The chosen model was compiled with the Adam optimizer and sparse categorical cross-entropy loss.

This is the model summary:

I trained for approximately 14 epochs with a validation split of 20%.
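A model matching the description above (4 convolutional layers, 3 max-pooling layers, 2 dense layers, Adam, sparse categorical cross-entropy) might be defined in Keras along these lines. The filter counts, kernel sizes, layer ordering, and the `build_model` name are assumptions; the README does not state them, so this is one plausible layout rather than the trained model itself.

```python
import tensorflow as tf
from tensorflow.keras import layers


def build_model(num_classes=10):
    """CNN with 4 conv, 3 max-pooling and 2 dense layers (sizes are assumed)."""
    model = tf.keras.Sequential([
        tf.keras.Input(shape=(28, 28, 1)),
        layers.Conv2D(32, 3, padding="same", activation="relu"),
        layers.MaxPooling2D(),  # 28x28 -> 14x14
        layers.Conv2D(64, 3, padding="same", activation="relu"),
        layers.MaxPooling2D(),  # 14x14 -> 7x7
        layers.Conv2D(128, 3, padding="same", activation="relu"),
        layers.Conv2D(128, 3, padding="same", activation="relu"),
        layers.MaxPooling2D(),  # 7x7 -> 3x3
        layers.Flatten(),
        layers.Dense(128, activation="relu"),
        layers.Dense(num_classes, activation="softmax"),
    ])
    # Sparse categorical cross-entropy expects integer labels, not one-hot vectors.
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model
```

With images shaped `(N, 28, 28, 1)` and integer labels, training with the settings mentioned above would then be `build_model().fit(x_train, y_train, epochs=14, validation_split=0.2)`, where `x_train` and `y_train` are placeholder names.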

With this model, I was able to accurately detect digits as shown below:

After finalizing this model, I combined the MNIST dataset with the add and subtract images of the kaggle dataset, and this was the output:

I'm still working on improving the accuracy of the new model.
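Combining MNIST with the addition and subtraction images, as described above, amounts to extending the label set from 10 to 12 classes. The sketch below assumes the symbol images have already been preprocessed to the same 28x28 format as the digits; the label scheme (`+` as 10, `-` as 11) and the `combine_datasets` helper are hypothetical.

```python
import numpy as np

# Assumed label scheme: digits keep 0-9, '+' becomes 10 and '-' becomes 11.
SYMBOL_LABELS = {"+": 10, "-": 11}
CLASS_NAMES = [str(d) for d in range(10)] + ["+", "-"]


def combine_datasets(digit_images, digit_labels, symbol_images, symbol_names, seed=0):
    """Merge digit and symbol samples into one shuffled 12-class dataset."""
    symbol_labels = np.array([SYMBOL_LABELS[name] for name in symbol_names])
    images = np.concatenate([digit_images, symbol_images], axis=0)
    labels = np.concatenate([digit_labels, symbol_labels], axis=0)

    # Shuffle so digits and symbols are mixed before any validation split.
    order = np.random.default_rng(seed).permutation(len(images))
    return images[order], labels[order]
```

The shuffle matters because Keras takes the `validation_split` fraction from the end of the arrays before shuffling, so appending the symbols without mixing them in would leave the validation set dominated by symbols. A large imbalance between the number of digit and symbol samples may also affect accuracy on the combined model.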

About the files

classifier.py:
mnist.py:
mnistTest.py:
morphologicalExperiment.py:
preProcess_final.py:
preProcessing.py:
tensorModel.py:
digitRecognition.ipynb:
index.html and script.js:

kammeows/digit-recognition-project
