This repository contains a dataset collected from students in 2020 and 2025, currently totalling 2840 images. The images are divided into 15 classes, one for each of the letters shown in the image below.
In this repository, you can find the letters in two resolutions: 28 x 28 (MNIST style) stored in dataset-28 and a slightly sharper 64 x 64 stored in dataset-64. In raw-data you can find the raw scans of the cards kindly provided by the students, as well as the cropped and partially processed letters themselves. In scripts you will find two short Python scripts that demonstrate the image processing that was performed and count the final number of files in the datasets.
Note
To use the letters from this dataset, copy the entire dataset-28 or dataset-64 folder into your downloads folder and specify its full path when you train your neural network. To read the dataset into memory, you can use MATLAB's imageDatastore function.
Below you will find an explanation of how the letters were digitized and processed.
The first thing we need is an initial picture. In our case, I scanned the 5 x 3 grids of letters that some of you may remember filling out. After cropping, the image looks like this:
Now we can divide the image into 15 squares, each containing exactly one letter. I will showcase the rest of the processing pipeline using the letter A as an example:
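The grid split itself is plain array slicing. Here is a minimal NumPy sketch; the function name and the equal-cell assumption are illustrative, not taken from the repository's actual script:

```python
import numpy as np

def split_grid(img, rows=5, cols=3):
    """Cut a scanned card into rows * cols equally sized cells."""
    h, w = img.shape[:2]
    ch, cw = h // rows, w // cols  # cell height / width (remainder is dropped)
    return [img[r * ch:(r + 1) * ch, c * cw:(c + 1) * cw]
            for r in range(rows) for c in range(cols)]
```

A 500 x 300 scan, for instance, yields 15 cells of 100 x 100 pixels each.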
After cropping, we can begin the processing proper. First, we convert the image to grayscale and then apply variable (adaptive) thresholding to binarize it. At this point the outline of the letter is black on a white background. However, most morphological operations traditionally treat white objects on a black background as BLOBs, rather than the other way around. So we invert the pixel values of the image: white pixels (255) become black (0) and black pixels become white. Now we have a white BLOB roughly in the center of the image on a black background. You can observe the results of these operations below:
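These three steps can be sketched in NumPy alone. The block size and offset below are illustrative choices (a real pipeline would more likely call a library routine such as OpenCV's adaptiveThreshold), but the logic is the same: compare each pixel to the mean of its neighbourhood, then flip black and white:

```python
import numpy as np

def to_grayscale(rgb):
    # ITU-R BT.601 luminance weights
    w = np.array([0.299, 0.587, 0.114])
    return np.round(rgb @ w).astype(np.uint8)

def adaptive_threshold(gray, block=15, c=10):
    """Variable thresholding: compare each pixel to its local mean."""
    pad = block // 2
    padded = np.pad(gray.astype(np.float64), pad, mode="edge")
    # summed-area table with a leading row/column of zeros
    integral = np.pad(padded, ((1, 0), (1, 0))).cumsum(0).cumsum(1)
    h, w = gray.shape
    window_sum = (integral[block:block + h, block:block + w]
                  - integral[:h, block:block + w]
                  - integral[block:block + h, :w]
                  + integral[:h, :w])
    local_mean = window_sum / (block * block)
    # ink is darker than its neighbourhood -> black (0), paper -> white (255)
    return np.where(gray < local_mean - c, 0, 255).astype(np.uint8)

def invert(binary):
    # make the letter a white BLOB on a black background
    return 255 - binary
```

Chaining `invert(adaptive_threshold(to_grayscale(img)))` turns a dark pen stroke on bright paper into a white BLOB on black.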
But we are not done yet! Because of imperfect cropping, artifacts sometimes appear in the image during processing. These artifacts may later cause the wrong groups of neurons to activate during CNN training and subsequently cause misclassifications, so naturally we would like to get rid of them. Small artifacts can be removed by morphologically opening the image. Just look at the before and after:
Now that we have a clean outline of the letter, we crop it further to the bounding box of the remaining BLOB to remove the inconsistent padding caused by letters being written in different parts of the initial square. We then pad the image to make it square again:
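The crop-and-repad step can be sketched like this (the margin value is an illustrative choice, not the repository's actual parameter):

```python
import numpy as np

def crop_to_bbox(blob):
    """Tight crop to the bounding box of all white pixels."""
    ys, xs = np.nonzero(blob)
    return blob[ys.min():ys.max() + 1, xs.min():xs.max() + 1]

def pad_to_square(img, margin=2):
    """Centre the crop on a square black canvas with a small margin."""
    h, w = img.shape
    size = max(h, w) + 2 * margin
    out = np.zeros((size, size), dtype=img.dtype)
    y0, x0 = (size - h) // 2, (size - w) // 2
    out[y0:y0 + h, x0:x0 + w] = img
    return out
```

Because the BLOB is only translated, the amount of white ink is identical before and after repadding.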
As the final step, we resize the image to the target dimensions (28 x 28 in this case) by averaging groups of neighbouring pixels (the same principle as a mean filter). To remove the resulting blurriness, we apply thresholding once more and obtain the final sharp image of the letter:
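The resize-and-rethreshold step can be sketched as block averaging. This simplified version assumes the square image's side is a multiple of the target size; the repository's actual script may interpolate instead:

```python
import numpy as np

def downsample_mean(img, size=28):
    """Shrink by averaging equal blocks of pixels (mean-filter style).
    Assumes img is square with a side divisible by `size`."""
    f = img.shape[0] // size
    blocks = img[:size * f, :size * f].reshape(size, f, size, f)
    return blocks.mean(axis=(1, 3))

def binarize(img, thresh=128):
    # re-threshold to remove the blur introduced by averaging
    return np.where(img >= thresh, 255, 0).astype(np.uint8)
```

For a 56 x 56 input, every 2 x 2 block collapses to one pixel, and the final thresholding snaps the averaged values back to pure black and white.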
Note
You can find the entire code used for processing in scripts/readme-demo.py.