Docscanner: document location and enhancement based on image segmentation
Author
Abstract

Document scanning aims to convert photographs of documents into scanned document files. However, current methods based on traditional techniques or keypoint detection suffer from low detection accuracy. In this paper, we are the first to propose a document processing system based on semantic segmentation. Our system uses OCRNet to segment the document region in the image. Perspective transformation and other post-processing algorithms are then applied to the segmentation result to obtain well-scanned documents. We also optimized OCRNet's loss function and reached 97.25 mIoU on the test dataset.
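For readers unfamiliar with the reported metric, the following is a minimal sketch of how mean intersection over union (mIoU) can be computed for a document/background segmentation mask. It is not the authors' evaluation code: the function name `mean_iou`, the two-class setup, and the toy masks are assumptions made for illustration, and the abstract does not state the exact evaluation protocol.

```python
def mean_iou(pred, target, num_classes=2):
    """Mean IoU over classes for two equally sized 2-D label masks.

    Hypothetical helper for illustration; assumes integer class labels.
    """
    ious = []
    for c in range(num_classes):
        inter = 0
        union = 0
        for pred_row, target_row in zip(pred, target):
            for p, t in zip(pred_row, target_row):
                in_pred = p == c
                in_target = t == c
                inter += in_pred and in_target
                union += in_pred or in_target
        if union:  # skip classes that appear in neither mask
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy 4x4 masks (made-up data): 1 = document, 0 = background.
target = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
pred = [
    [0, 0, 0, 0],
    [0, 1, 1, 1],  # one background pixel wrongly labeled as document
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]

score = mean_iou(pred, target)
print(f"mIoU = {100 * score:.2f}")  # prints "mIoU = 85.83"
```

In this toy case the background class has IoU 11/12 and the document class has IoU 4/5, so a single mislabeled pixel already lowers the mean to about 85.83 on a 0 to 100 scale.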

Year of Publication
2022
Conference Name
2022 18th International Conference on Computational Intelligence and Security (CIS)