Hello sir , i just saw the mnist dataset , i want to create dataset for character classification (ol chiki - a tribal language ) how would i create a dataset like this . I have written down some of the lettar in paper and scanned it , what is the next step . Thank you .
In nist , it is english language i think ,but i want to create about other langugae one of them is ol chiki . How did you create such dataaset . What is the algorithmic procedure
I didn't created it but I have done ML for years by now, drop me a line: eric.rubiel(at)u.northwestern.edu
In general you will need a lot of people to write the characters Then you scan those characters (you need to control the quality up to some degree, like make sure to separate those characters) Then you fix a dimension, and create images of that fixed dimension, ideally the characters should be centered a big part of this can be automatized but you need to plan ahead the process, you also need to keep track of the label: for every image, what is the character associated to the image.
Hey all, I'm trying to preprocess images of clothing from another fashion store into mnist format and then trying to get embeddings from them. Shifting them into grayscale and resizing to 28x28 seems fine, it's just they all have white background. Changing the background makes black clothing blend in with the background. Any ideas what would you do?
you dont want the background to be noise that distracts algorithms, does a white background gives you a problem with white clothes?
can anyone give me a link for that
i want to download fashion-mnist dataset by images
@Rubiel1 In the end I made image mask black and added little bit of white to the rest of the picture, so there would be some difference between background and black clothing. I havent tried using white background, but this approach seemed to work OK