Residential College | false |
Status | 已發表Published |
SauvolaNet: Learning Adaptive Sauvola Network for Degraded Document Binarization | |
Li, Deng1; Wu, Yue2; Zhou, Yicong1 | |
2021-09-02 | |
Conference Name | 16th International Conference on Document Analysis and Recognition, ICDAR 2021 |
Source Publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Volume | 12824 LNCS |
Pages | 538-553 |
Conference Date | September 5-10, 2021 |
Conference Place | Lausanne, Switzerland |
Country | Switzerland |
Author of Source | Lladós J., Lopresti D., Uchida S. |
Publication Place | BERLIN, GERMANY |
Publisher | Springer Science and Business Media Deutschland GmbH |
Abstract | Inspired by the classic Sauvola local image thresholding approach, we systematically study it from the deep neural network (DNN) perspective and propose a new solution called SauvolaNet for degraded document binarization (DDB). It is composed of three explainable modules, namely, Multi-Window Sauvola (MWS), Pixelwise Window Attention (PWA), and Adaptive Sauolva Threshold (AST). The MWS module honestly reflects the classic Sauvola but with trainable parameters and multi-window settings. The PWA module estimates the preferred window sizes for each pixel location. The AST module further consolidates the outputs from MWS and PWA and predicts the final adaptive threshold for each pixel location. As a result, SauvolaNet becomes end-to-end trainable and significantly reduces the number of required network parameters to 40K – it is only 1% of MobileNetV2. In the meantime, it achieves the State-of-The-Art (SoTA) performance for the DDB task – SauvolaNet is at least comparable to, if not better than, SoTA binarization solutions in our extensive studies on the 13 public document binarization datasets. Our source code is available at https://github.com/Leedeng/SauvolaNet. |
Keyword | Binarization Sauvola Document Processing |
DOI | 10.1007/978-3-030-86337-1_36 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science ; Imaging Science & Photographic Technology |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Software Engineering ; Imaging Science & Photographic Technology |
WOS ID | WOS:000711880100036 |
Scopus ID | 2-s2.0-85115317409 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology |
Corresponding Author | Zhou, Yicong |
Affiliation | 1.Department of Computer and Information Science, University of Macau, Macau, China 2.Amazon Alexa Natural Understanding, Manhattan Beach, United States |
First Author Affilication | University of Macau |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Li, Deng,Wu, Yue,Zhou, Yicong. SauvolaNet: Learning Adaptive Sauvola Network for Degraded Document Binarization[C]. Lladós J., Lopresti D., Uchida S., BERLIN, GERMANY:Springer Science and Business Media Deutschland GmbH, 2021, 538-553. |
APA | Li, Deng., Wu, Yue., & Zhou, Yicong (2021). SauvolaNet: Learning Adaptive Sauvola Network for Degraded Document Binarization. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 12824 LNCS, 538-553. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment