Bayraktar, Murat and Say, Bilge and Akman, Varol (1998) An Analysis of English Punctuation: The Special Case of Comma. [Journal (Paginated)]
| Postscript 239Kb |
Abstract
Punctuation has usually been ignored by researchers in computational linguistics over the years. Recently, it has been realized that a true understanding of written language will be impossible if punctuation marks are not taken into account. This paper contains the details of a computer-aided exercise to investigate English punctuation practice for the special case of comma (the most significant punctuation mark) in a parsed corpus. The study classifies the various ``structural'' uses of the comma according to the syntax-patterns in which a comma occurs. The corpus (Penn Treebank) consists of syntactically annotated sentences with no part-of-speech tag information about individual words.
| Item Type: | Journal (Paginated) |
|---|---|
| Keywords: | punctuation, structural punctuation marks, comma, the Penn Treebank, the Wall Street Journal, corpus linguistics. |
| Subjects: | Linguistics > Computational Linguistics Linguistics > Syntax |
| ID Code: | 214 |
| Deposited By: | Akman, Varol |
| Deposited On: | 10 Nov 1998 |
| Last Modified: | 19 Dec 2009 19:15 |
Metadata
- ID Plus Text Citation
- RDF+XML
- BibTeX
- Pageflow Montage
- JSON
- Dublin Core
- OAI-ORE Resource Map (Atom Format)
- Simple Metadata
- Refer
- METS
- OAI-ORE Resource Map (RDF Format)
- Search Data Dump
- Pageflow
- HTML Citation
- ASCII Citation
- YAML
- EPrints Application Profile (experimental)
- OpenURL ContextObject
- EndNote
- OpenURL ContextObject in Span
- MODS
- DIDL
- EP3 XML
- Reference Manager
- RDF+N3
- Eprints Application Profile
Repository Staff Only: item control page

