Text localization using standard deviation analysis of structure elements and support vector machines

Zagoris, Konstantinos; Chatzichristofis, Savvas A.; Papamarkos, Nikos

Files

Text-localization-fulltext.pdf (3.69 MB)

Date

2011

Authors

Zagoris, Konstantinos

Chatzichristofis, Savvas A.

Papamarkos, Nikos

Publisher

Springer

Abstract

A text localization technique is required to successfully exploit document images such as technical articles and letters. The proposed method detects and extracts text areas from document images. Initially, a connected components analysis technique detects blocks of foreground objects. Then, a descriptor consisting of a set of suitable document structure elements is extracted from each block. This is achieved with an algorithm called Standard Deviation Analysis of Structure Elements (SDASE), which maximizes the separability between the blocks. Another feature of SDASE is that its length adapts to the requirements of the application. Finally, the descriptor of each block is used as input to a trained support vector machine that classifies the block as text or non-text. The proposed technique is also capable of adjusting to the text structure of the documents. Experimental results on benchmarking databases demonstrate the effectiveness of the proposed method.
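
The first stage named in the abstract, connected components analysis, can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `connected_component_boxes`, the list-of-lists binary image, and the choice of 8-connectivity are all assumptions made for illustration. The SDASE descriptor and the SVM stage are not sketched, because the abstract does not give their details.

```python
from collections import deque


def connected_component_boxes(image):
    """Return bounding boxes (top, left, bottom, right) of foreground blobs.

    `image` is a list of rows of 0/1 values, where 1 marks a foreground pixel.
    8-connectivity is an assumption of this sketch, not taken from the paper.
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                # Grow the bounding box to cover every pixel in the component.
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


# Hypothetical toy "page" with two separate foreground blobs.
page = [
    [1, 1, 0, 0, 0],
    [1, 0, 0, 0, 1],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 0, 0],
]
boxes = connected_component_boxes(page)
```

In a pipeline like the one the abstract describes, each resulting box would be the block from which a descriptor is extracted and then passed to the classifier.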

Keywords

Text localization, structure elements, support vector machines, Standard Deviation Analysis of Structure Elements (SDASE)

URI

http://hdl.handle.net/11728/10170

Collections

Articles
