Docscanner: document location and enhancement based on image segmentation

Docscanner: document location and enhancement based on image segmentation
Author	Ziqi Shan Yuying Wang Shunzhong Wei Xiangmin Li Haowen Pang Xinmei Zhou
Abstract	Document scanning aims to transfer the captured photographs documents into scanned document files. However, current methods based on traditional or key point detection have the problem of low detection accuracy. In this paper, we were the first to propose a document processing system based on semantic segmentation. Our system uses OCRNet to segment documents. Then, perspective transformation and other post-processing algorithms are used to obtain well-scanned documents based on the segmentation result. Meanwhile, we optimized OCRNet's loss function and reached 97.25 MIoU on the test dataset.
Year of Publication	2022
Conference Name	2022 18th International Conference on Computational Intelligence and Security (CIS)
Google Scholar \| BibTeX