A Learning-Based Approach for Word Segmentation in Text Document Images

Authors

  • Jean-Pierre Lomaliza Department of Electronic Engineering, Pukyong National University, Busan, South Korea
  • Hanhoon Park Department of Electronic Engineering, Pukyong National University, Busan, South Korea
  • Kwang-Seok Moon Department of Electronic Engineering, Pukyong National University, Busan, South Korea

Keywords:

Word Segmentation, Deep Learning, Space Classification, Locally Likely Arrangement Hashing, Document Image Retrieval,

Abstract

In conventional document retrieval (DIR) systems based on locally likely arrangement hashing (LLAH), the word detection approach is sensitive to the distance between the camera and the text document, e.g. a single word may be detected as several words when the camera is too close. Thus, the systems work well only when the distance in which the text document was registered is similar to the one of the retrieval. Moreover, they were implemented in a desktop setup where it might not suffer from the distance problem since the camera is rigidly attached to the computer. In this paper, a new word segmentation approach is proposed to increase the robustness of LLAH-based DIR systems so that they may be implemented on a mobile platform where the distance between the camera and text document may be easily changeable. The proposed method uses a deep neural network to classify spaces between connected components as between-words space or intra-word space. From experiments results, the proposed method successfully could detect the same words in different camera distances and orientation as the neural networks offered classification accuracy as high as 92.5%. Moreover, it showed higher robustness than the state-of-the-art methods when implemented on a mobile platform.

Downloads

Published

2018-08-28

How to Cite

Lomaliza, J.-P., Park, H., & Moon, K.-S. (2018). A Learning-Based Approach for Word Segmentation in Text Document Images. Journal of Telecommunication, Electronic and Computer Engineering (JTEC), 10(3), 1–7. Retrieved from https://jtec.utem.edu.my/jtec/article/view/3289