SciX Sandbox : Digital Library : Works

Paper 0346:
Practical Approach to Automatic Text Summarization

id 0346
authors Hynek, Jiri and Jezek, Karel
year 2003
title Practical Approach to Automatic Text Summarization
source ELPUB2003. From information to knowledge: Proceedings of the 7th ICCC/IFIP International Conference on Electronic Publishing held at the Universidade do Minho, Portugal 25-28 June 2003/Edited by: Sely Maria de Souza Costa, Jo„o Ńlovaro Carvalho, Ana Alice Baptista, Ana Cristina Santos Moreira. Universidade do Minho, 2003.
summary The significance of automatic document summarization increases with the threat of information overload we are facing. Short summaries can be presented to users, for example, in place of fulllength documents found by a search engine in response to a userís query. We have analyzed variousapproaches to document summarization, using some existing algorithms and combining these with a novel use of itemsets. The resulting summarizer is evaluated by comparing classification of original documents and that of abstracts generated automatically. Despite highly promising results achieved by this evaluation, readability of abstracts must be further improved by integrating additional heuristic approaches.
keywords document summarization, summarizer, condensation, abstract, abstracting, extraction, text, machine learning, classification, categorization, sentence selection, highlight, classifier, heuristics, itemsets, term frequency, evaluation
series ELPUB:2003
type full paper
content file.pdf (215,710 bytes)
discussion No discussions. Post discussion ...
ratings Ratings: 5
last changed 2003/06/07 08:13
HOMELOGIN (you are user _anon_722700 from group guest) Works Powered by SciX Open Publishing Services 1.002