Cogprints

An Analysis of English Punctuation: The Special Case of Comma

Bayraktar, Murat and Say, Bilge and Akman, Varol (1998) An Analysis of English Punctuation: The Special Case of Comma. [Journal (Paginated)]

Full text available as: PostScript (239 KB)

Abstract

Punctuation has usually been ignored by researchers in computational linguistics. Recently, it has been realized that a true understanding of written language will be impossible if punctuation marks are not taken into account. This paper presents the details of a computer-aided exercise to investigate English punctuation practice for the special case of the comma (the most significant punctuation mark) in a parsed corpus. The study classifies the various "structural" uses of the comma according to the syntactic patterns in which a comma occurs. The corpus (Penn Treebank) consists of syntactically annotated sentences with no part-of-speech tag information about individual words.

Item Type: Journal (Paginated)
Keywords: punctuation, structural punctuation marks, comma, the Penn Treebank, the Wall Street Journal, corpus linguistics
Subjects: Linguistics > Computational Linguistics; Linguistics > Syntax
ID Code: 214
Deposited By: Akman, Varol
Deposited On: 10 Nov 1998
Last Modified: 11 Mar 2011 08:53
