Component-based Segmentation of words from handwritten Arabic text

AlKhateeb, J. H. and Jiang, J. and Ren, Jinchang and Ipson, S. (2009) Component-based Segmentation of words from handwritten Arabic text. International Journal of Computer Systems Science and Engineering, 5 (1).

    Abstract

    Efficient preprocessing is essential for the automatic recognition of handwritten documents. This paper presents techniques for segmenting words in handwritten Arabic text. First, connected components (CCs) are extracted and the distances between components are analyzed. The statistical distribution of these distances is then used to determine an optimal threshold for word segmentation. An improved projection-based method is also employed for baseline detection. The proposed method was tested on the IFN/ENIT database, which consists of 26459 Arabic words handwritten by 411 different writers, and the results were promising, with more accurate baseline detection and word segmentation for subsequent recognition.
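
    The pipeline outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the use of 8-connectivity, the measurement of distance as the count of blank columns between components, and the plain (unimproved) horizontal projection for the baseline are all assumptions made for illustration. The threshold is passed in as a parameter, because the abstract does not say how the optimal value is derived from the distance distribution.

```python
from collections import deque


def component_spans(img):
    """Return the horizontal extent (x_min, x_max) of every connected component.

    img is a non-empty list of equal-length rows holding 0 (background) or
    1 (ink). 8-connectivity is an assumption of this sketch.
    """
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    spans = []
    for y in range(h):
        for x in range(w):
            if img[y][x] != 1 or seen[y][x]:
                continue
            # Breadth-first flood fill over one component.
            seen[y][x] = True
            queue = deque([(y, x)])
            x_min = x_max = x
            while queue:
                cy, cx = queue.popleft()
                x_min, x_max = min(x_min, cx), max(x_max, cx)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and img[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            spans.append((x_min, x_max))
    return sorted(spans)


def group_into_words(spans, threshold):
    """Merge sorted spans into words.

    A new word starts when the number of blank columns between a span and
    the word built so far exceeds threshold (hypothetical parameter).
    """
    words = []
    for x_min, x_max in spans:
        if not words or x_min - words[-1][1] - 1 > threshold:
            words.append([x_min, x_max])
        else:
            words[-1][1] = max(words[-1][1], x_max)
    return [tuple(word) for word in words]


def baseline_row(img):
    """Plain horizontal projection: index of the row with the most ink."""
    counts = [sum(row) for row in img]
    return counts.index(max(counts))
```

    The spans are sorted by column index only. Arabic is written right to left, so a caller that needs words in reading order would reverse the result of `group_into_words`.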

    Item type: Article
    ID code: 29271
    Keywords: ocr, offline recognition, baseline estimation, word segmentation, Electronic computers. Computer science, Electrical engineering. Electronics Nuclear engineering
    Subjects: Science > Mathematics > Electronic computers. Computer science; Technology > Electrical engineering. Electronics Nuclear engineering
    Department: Faculty of Engineering > Electronic and Electrical Engineering
    Depositing user: Pure Administrator
    Date Deposited: 15 Mar 2011 14:55
    Last modified: 05 Oct 2012 17:21
    URI: http://strathprints.strath.ac.uk/id/eprint/29271
