UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
SauvolaNet: Learning Adaptive Sauvola Network for Degraded Document Binarization
Li, Deng1; Wu, Yue2; Zhou, Yicong1
2021-09-02
Conference Name16th International Conference on Document Analysis and Recognition, ICDAR 2021
Source PublicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12824 LNCS
Pages538-553
Conference DateSeptember 5-10, 2021
Conference PlaceLausanne, Switzerland
CountrySwitzerland
Author of SourceLladós J., Lopresti D., Uchida S.
Publication PlaceBERLIN, GERMANY
PublisherSpringer Science and Business Media Deutschland GmbH
Abstract

Inspired by the classic Sauvola local image thresholding approach, we systematically study it from the deep neural network (DNN) perspective and propose a new solution called SauvolaNet for degraded document binarization (DDB). It is composed of three explainable modules, namely, Multi-Window Sauvola (MWS), Pixelwise Window Attention (PWA), and Adaptive Sauolva Threshold (AST). The MWS module honestly reflects the classic Sauvola but with trainable parameters and multi-window settings. The PWA module estimates the preferred window sizes for each pixel location. The AST module further consolidates the outputs from MWS and PWA and predicts the final adaptive threshold for each pixel location. As a result, SauvolaNet becomes end-to-end trainable and significantly reduces the number of required network parameters to 40K – it is only 1% of MobileNetV2. In the meantime, it achieves the State-of-The-Art (SoTA) performance for the DDB task – SauvolaNet is at least comparable to, if not better than, SoTA binarization solutions in our extensive studies on the 13 public document binarization datasets. Our source code is available at https://github.com/Leedeng/SauvolaNet.

KeywordBinarization Sauvola Document Processing
DOI10.1007/978-3-030-86337-1_36
URLView the original
Indexed ByCPCI-S
Language英語English
WOS Research AreaComputer Science ; Imaging Science & Photographic Technology
WOS SubjectComputer Science, Artificial Intelligence ; Computer Science, Software Engineering ; Imaging Science & Photographic Technology
WOS IDWOS:000711880100036
Scopus ID2-s2.0-85115317409
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionFaculty of Science and Technology
Corresponding AuthorZhou, Yicong
Affiliation1.Department of Computer and Information Science, University of Macau, Macau, China
2.Amazon Alexa Natural Understanding, Manhattan Beach, United States
First Author AffilicationUniversity of Macau
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Li, Deng,Wu, Yue,Zhou, Yicong. SauvolaNet: Learning Adaptive Sauvola Network for Degraded Document Binarization[C]. Lladós J., Lopresti D., Uchida S., BERLIN, GERMANY:Springer Science and Business Media Deutschland GmbH, 2021, 538-553.
APA Li, Deng., Wu, Yue., & Zhou, Yicong (2021). SauvolaNet: Learning Adaptive Sauvola Network for Degraded Document Binarization. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 12824 LNCS, 538-553.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Li, Deng]'s Articles
[Wu, Yue]'s Articles
[Zhou, Yicong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Li, Deng]'s Articles
[Wu, Yue]'s Articles
[Zhou, Yicong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Li, Deng]'s Articles
[Wu, Yue]'s Articles
[Zhou, Yicong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.