Incremental cue phrase learning and bootstrapping method for causality extraction using cue phrase and word pair probabilities |
| |
Authors: | Du-Seong Chang Key-Sun Choi |
| |
Institution: | 1. Spoken Language Research Team, KT, 17 Woomyeon-dong, Seocho-gu, Seoul 137-792, Republic of Korea;2. Division of Computer Science, Korea Advanced Institute of Science and Technology, BOLA, KORTERM, 373-1 Guseong-dong, Yuseong-gu, Daejeon 305-701, Republic of Korea |
| |
Abstract: | This work aims to extract possible causal relations that exist between noun phrases. Some causal relations are manifested by lexical patterns like causal verbs and their sub-categorization. We use lexical patterns as a filter to find causality candidates and we transfer the causality extraction problem to the binary classification. To solve the problem, we introduce probabilities for word pair and concept pair that could be part of causal noun phrase pairs. We also use the cue phrase probability that could be a causality pattern. These probabilities are learned from the raw corpus in an unsupervised manner. With this probabilistic model, we increase both precision and recall. Our causality extraction shows an F-score of 77.37%, which is an improvement of 21.14 percentage points over the baseline model. The long distance causal relation is extracted with the binary tree-styled cue phrase. We propose an incremental cue phrase learning method based on the cue phrase confidence score that was measured after each causal classifier learning step. A better recall of 15.37 percentage points is acquired after the cue phrase learning. |
| |
Keywords: | Causality Pattern learning Word pair probability Cue phrase probability Unsupervised learning |
本文献已被 ScienceDirect 等数据库收录! |
|