EXT Ummarization: Kareem El-Sayed Hashem Mohamed Mohsen Brary
EXT Ummarization: Kareem El-Sayed Hashem Mohamed Mohsen Brary
TEXT SUMMARIZATION
1
TEXT SUMMARIZATION
Goal: reducing a text with a computer program in order to create a summary that retains the most important points of the original text. Summarization Applications
summaries of email threads action items from a meeting simplifying text by compressing sentences
Generic Summarization
Query-focused Summarization
Summarize a document with respect to an information need expressed in a user query A kind of complex question answering
Answer a question by summarizing a document that has the information to construct the answer
Snippets
Multiple Documents
Extractive Summarization:
Abstractive Summarization
Given
Align
Extract Features
Position Length of sentence Word informativeness Cohesion
10
Train
Problems
Hard to get labeled training data Alignment is difficult Performance not better that unsupervised algorithm
11
12
13
EXAMPLE
Human 1: water spinach is a green leafy vegetable grown in the tropics. Human 2: water spinach is a semi-aquatic tropical plant grown as a vegetable. Human 3: water spinach is a commonly eaten leaf vegetable of Asia.
System: water spinach is a leaf vegetable commonly eaten in tropical areas of Asia. ROUGE -2= = 12/28 = 0.43
14
Find a set of relevant documents Extract informative sentences form the documents Order and modify the sentences into an answer
15
QUERY-FOCUSED MULTI-DOCUMENT
SUMMARIZATION
16
17
Score each sentence based on LLR (including query words) Include the sentence with highest score in the summary Iteratively add into the summary high-scoring sentences that are not redundant with the summary so far.
18
INFORMATION ORDERING
Chronological ordering:
Coherence:
Choose ordering that make neighboring sentences similar(by cosine similarity) Choose ordering in which neighboring sentences discuss the same entity
Topical ordering
19
DOMAIN-SPECIFIC ANSWERING:
THE INFORMATION EXTRACTION METHOD
20
21
22
REFERENCES:
23
THANK YOU
24