PSA-HWT: handwritten font generation based on pyramid squeeze attention.

PeerJ Comput Sci

School of Computer and Communication, Lanzhou University of Technology, Lanzhou, Gansu, China.

Published: August 2024

A generator that combines a convolutional neural network (CNN) and a Transformer as its core modules serves as the primary model in handwritten font generation networks and performs effectively. However, such generators still extract insufficient features for the overall structure of the font, the thickness of strokes, and the curvature of strokes, which results in subpar detail in the generated fonts. To address these problems, we propose PSA-HWT, a handwritten font generation model based on Pyramid Squeeze Attention. The PSA-HWT model consists of two parts: an encoder and a decoder. The encoder uses a multi-branch structure to extract spatial information at different scales from the input feature map, achieving multi-scale feature extraction. This helps the model capture the semantic information and global structure of the font and understand fine-grained features such as its shape, thickness, and curvature. The decoder uses a self-attention mechanism to capture dependencies across positions in the input sequence. This helps the model relate the generated strokes or characters to the handwritten font being generated, ensuring the overall coherence of the generated handwritten text. Experimental results on the IAM dataset show that PSA-HWT achieves a 16.35% decrease in Fréchet inception distance (FID) and a 13.09% decrease in Geometry Score (GS) compared with current advanced methods. This indicates that PSA-HWT generates handwritten fonts of higher quality, making it more practically valuable.
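The two mechanisms named in the abstract can be illustrated with a minimal NumPy sketch. It is not the PSA-HWT implementation, and it rests on several assumptions: the function names (`box_filter`, `psa_sketch`, `self_attention`) and the kernel sizes are hypothetical, fixed mean filters stand in for the learned multi-scale convolutions, the squeeze step omits any learned excitation layers, and the self-attention uses identity query/key/value projections.

```python
import numpy as np


def softmax(z, axis):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def box_filter(x, k):
    """Mean filter with odd window k over the last two axes of x (C, H, W).

    Stand-in (assumption) for a learned k x k convolution; edges are padded
    by replication so the output keeps the input's spatial size.
    """
    pad = k // 2
    xp = np.pad(x, ((0, 0), (pad, pad), (pad, pad)), mode="edge")
    h, w = x.shape[1:]
    out = np.zeros_like(x, dtype=float)
    for dy in range(k):
        for dx in range(k):
            out += xp[:, dy:dy + h, dx:dx + w]
    return out / (k * k)


def psa_sketch(x, kernel_sizes=(3, 5, 7, 9)):
    """Simplified pyramid-squeeze-style attention over a (C, H, W) feature map.

    C must be divisible by the number of branches.
    """
    # Split channels into one group per branch.
    groups = np.split(x, len(kernel_sizes), axis=0)
    # Multi-scale branches: each group sees a different receptive field.
    branches = np.stack([box_filter(g, k) for g, k in zip(groups, kernel_sizes)])
    # Squeeze: global average pool gives one descriptor per branch channel.
    squeezed = branches.mean(axis=(2, 3))          # shape (S, C/S)
    # Softmax across branches, so the scales compete for each channel slot.
    weights = softmax(squeezed, axis=0)            # shape (S, C/S)
    # Reweight every branch and concatenate back to (C, H, W).
    out = branches * weights[:, :, None, None]
    return out.reshape(x.shape), weights


def self_attention(tokens):
    """Scaled dot-product self-attention over a (T, D) sequence.

    Identity Q/K/V projections are an assumption; real models learn them.
    """
    d = tokens.shape[-1]
    attn = softmax(tokens @ tokens.T / np.sqrt(d), axis=-1)  # shape (T, T)
    return attn @ tokens, attn


rng = np.random.default_rng(0)
features = rng.standard_normal((8, 6, 6))      # toy feature map: 8 channels, 6x6
reweighted, branch_weights = psa_sketch(features)
sequence = rng.standard_normal((5, 4))         # toy sequence: 5 positions, dim 4
attended, attn_map = self_attention(sequence)
```

In this sketch the branch weights sum to one across scales for each channel slot, and each row of `attn_map` sums to one across sequence positions, which is the sense in which every position attends to all others.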

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639151
DOI: http://dx.doi.org/10.7717/peerj-cs.2261
