Header menu link for other important links
Newspaper article extraction using hierarchical fixed point model
A. Bansal, , S.D. Roy, J.B. Srivastava
Published in IEEE Computer Society
Pages: 257 - 261
This paper presents a novel learning based framework to extract articles from newspaper images using a Fixed-Point Model. The input to the system comprises blocks of text and graphics, obtained using standard image processing techniques. The fixed point model uses contextual information and features of each block to learn the layout of newspaper images and attains a contraction mapping to assign a unique label to every block. We use a hierarchical model which works in two stages. In the first stage, a semantic label (heading, sub-heading, text-blocks, image and caption) is assigned to each segmented block. The labels are then used as input to the next stage to group the related blocks into news articles. Experimental results show the applicability of our algorithm in newspaper labeling and article extraction. © 2014 IEEE.
About the journal
JournalData powered by TypesetProceedings - 11th IAPR International Workshop on Document Analysis Systems, DAS 2014
PublisherData powered by TypesetIEEE Computer Society