Vui lòng dùng định danh này để trích dẫn hoặc liên kết đến tài liệu này: https://elib.vku.udn.vn/handle/123456789/2733
Nhan đề: Extending OCR Model for Font and Style Classification
Tác giả: Vu, Dinh Nguyen
Nguyen, Tien Dong
Dang, Minh Tuan
Ninh, Thi Anh Ngoc
Nguyen, Vu Son Lam
Nguyen, Viet Anh
Nguyen, Hoang Dang
Từ khoá: Font Classification
Style Classification
OCR
Năm xuất bản: thá-2023
Nhà xuất bản: Springer Nature
Tóm tắt: Font and style classification aims to recognize which font and which style the characters in the input image belong to. The con- junction of font and style classification with traditional OCR systems is important in the reconstruction of visually-rich documents. However, the current text recognition systems have yet to take into account these tasks and focus solely on the recognition of characters from input images. The separation of these tasks makes the document reconstruction systems computationally expensive. In this paper, we propose a new approach that extends the current text recognition model to include font and style classification. We also present a dataset comprising input images and corresponding characters, fonts, and styles in Vietnamese. We evaluate the effectiveness of this extension on multiple recent OCR models, including VST [10], CRNN [8], ViSTR [1], TROCR [7], SVTR [4] . Our results demonstrate that our extension achieves decent accuracy rates of 98.1% and 90% for font and style classification, respectively. Moreover, our extension can even boost the performance of the original OCR models.
Mô tả: Lecture Notes in Networks and Systems (LNNS, volume 734); CITA: Conference on Information Technology and its Applications; pp: 193-204.
Định danh: https://link.springer.com/chapter/10.1007/978-3-031-36886-8_16
http://elib.vku.udn.vn/handle/123456789/2733
ISBN: 978-3-031-36886-8
Bộ sưu tập: CITA 2023 (International)

Các tập tin trong tài liệu này:

 Đăng nhập để xem toàn văn



Khi sử dụng các tài liệu trong Thư viện số phải tuân thủ Luật bản quyền.