Missing some words or random strokes in output images #10

AkashDataScience · 2023-10-31T09:45:42Z

I generated images for 50 different strings. 20% of them have half sentences or random strokes. I set bias to 1 and thickness to 10 (default). Is there any limit on word count or other variable effecting output image?

X-rayLaser · 2023-10-31T15:16:54Z

This is a known problem. Sometimes, the model goes crazy after sampling certain number of points. Also, the model seems to struggle with producing rare letters and words. It is possible that it is somewhat undertrained or the dataset it was trained on contains some corrupted examples. Perhaps, there is also a bug in data preparation code. In either case, there is no way to flexibly control the output and prevent those failure cases.

You can try different checkpoints or experiment with the bias parameter. Surprisingly, I observed quite a bit more failures with bias=1 than with smaller ones.

As for the word count, there is a hardcoded attribute num_steps set to 1500 on this line:

pytorch-handwriting-synthesis-toolkit/handwriting_synthesis/sampling.py

Line 39 in 899e943

return cls(model, mu, sd, charset, num_steps=1500)

This attribute sets the maximum number of points to generate for a given handwriting. You can try bigger values if this value is not big enough.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Missing some words or random strokes in output images #10

Missing some words or random strokes in output images #10

AkashDataScience commented Oct 31, 2023

X-rayLaser commented Oct 31, 2023 •

edited

Loading

Missing some words or random strokes in output images #10

Missing some words or random strokes in output images #10

Comments

AkashDataScience commented Oct 31, 2023

X-rayLaser commented Oct 31, 2023 • edited Loading

X-rayLaser commented Oct 31, 2023 •

edited

Loading