|
Accession number: |
|
083511487090 |
|
|
Title: |
|
Terms derived from frequent sequences for extractive text summarization |
|
|
Authors: |
|
Ledeneva,
Yulia; Gelbukh,
Alexander; Garcia-Hernandez,
Rene Arnulfo |
|
|
Author affiliation: |
|
Natural Language and Text Processing Laboratory, Center
for Computing Research, National
Polytechnic Institute, DF 07738, Mexico |
|
|
Serial title: |
|
Lecture
Notes in Computer Science (including subseries Lecture Notes
in Artificial Intelligence and Lecture Notes in
Bioinformatics) |
|
|
Abbreviated serial title: |
|
Lect. Notes
Comput. Sci. |
|
|
Volume: |
|
v 4919
LNCS |
|
|
Monograph title: |
|
Computational Linguistics and Intelligent
Text Processing - 9th
International Conference, CICLing 2008,
Proceedings |
|
|
Publication year: |
|
2008 |
|
|
Pages: |
|
p
593-604 |
|
|
Language: |
|
English |
|
|
ISSN: |
|
0302-9743 |
|
|
Document type: |
|
Conference
article (CA) |
|
|
Conference name: |
|
9th
International Conference on Computational Linguistics and
Intelligent Text Processing,
CICLing 2008 |
|
|
Conference date: |
|
Feb 17-23
2008 |
|
|
Conference location: |
|
Haifa,
Israel |
|
|
Conference code: |
|
73285 |
|
|
Publisher: |
|
Springer
Verlag, Heidelberg, D-69121, Germany |
|
|
Abstract: |
|
Automatic
text summarization helps the user to
quickly understand large volumes of information. We present a
language- and domain-independent statistical-based method
for single-document extractive summarization, i.e., to produce a
text summary by extracting some sentences from the given text. We show experimentally that
words that are parts of bigrams that repeat more than once in
the text are good terms to describe the text's contents, and so are also
so-called maximal frequent
sentences. We also show that the frequency of the term as term weight gives good results (while
we only count the occurrences of a term in repeating bigrams). © 2008
Springer-Verlag Berlin Heidelberg. |
|
|
Number of references: |
|
21 |
|
|
Ei main heading: |
|
Linguistics |
|
|
Ei controlled terms: |
|
Computational
linguistics - Text processing - Word
processing |
|
|
Uncontrolled terms: |
|
International
conferences - Bigrams - Intelligent
text processing - Automatic
text summarization - Term weighting - Text summarization - Frequent sequences |
|
|
Ei classification codes: |
|
721.1 Computer Theory, Includes Formal Logic,
Automata Theory, Switching Theory, Programming Theory - 723.2 Data Processing and Image Processing - 903.1 Information Sources and Analysis - 903.2 Information Dissemination |
|
|
DOI: |
|
10.1007/978-3-540-78135-6_51 |
|
|
Database: |
|
Compendex |
|
|
|
|
Compilation
and indexing terms, © 2008 Elsevier Inc. |
|