Explicit discourse relations in text are signalled by discourse connectives like since, because, however, etc. Identifying discourse connectives is a part of the bigger task called discourse parsing in which discourse coherence relations are extracted from text. In this paper we report improvements to the state-of-the-art for identifying explicit discourse connectives in the Penn Discourse Treebank and the Biomedical Discourse Relation Bank. These improvements have been achieved with maximum entropy (logistic regression) classifiers by combining machine learning features from previous approaches with new surface level features that capture information about a connective's surrounding phrases and new syntactic features that add more information from the path in the syntax tree connecting the root to the connective and from the clause following the connective by means of its syntactic head.
展开▼