Glass, K. R. and Bangay, S.D. (2005) Evaluating parts-of-speech taggers for use in a text-to-scene conversion system. In: Proceedings of the 2005 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing countries, September, 2005, White River, South Africa.
Text: GLASS_C28.pdf (255Kb)
Official URL: http://portal.acm.org/citation.cfm?id=1145678&coll...
Abstract
This paper presents parts-of-speech tagging as a first step towards an autonomous text-to-scene conversion system. It categorizes some freely available taggers according to the techniques each uses to identify word classes automatically. In addition, the performance of each identified tagger is verified experimentally. The SUSANNE corpus is used for testing and reveals the complexity of working with different tagsets, resulting in substantially lower accuracies in our tests than those reported by the developers of each tagger. The taggers are then grouped to form a voting system in an attempt to raise accuracies, but in no case do the combined results improve upon the individual accuracies. Additionally, a new metric, agreement, is tentatively proposed as an indication of confidence in the output of a group of taggers where such output cannot be validated.
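The voting and agreement ideas mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define the voting rule or the agreement metric, so the sketch assumes a simple per-token majority vote (ties go to the earliest tagger) and takes agreement to be the fraction of tokens on which all taggers assign the same tag. The function names, the tag labels, and the sample tagger outputs are hypothetical, and the tags are assumed to have already been mapped to one common tagset.

```python
from collections import Counter


def majority_vote(tags):
    """Return the most frequent tag; ties go to the earliest tagger's tag."""
    counts = Counter(tags)
    best = max(counts.values())
    for tag in tags:
        if counts[tag] == best:
            return tag


def combine(outputs):
    """Combine per-tagger tag sequences (all the same length) by voting."""
    return [majority_vote(column) for column in zip(*outputs)]


def agreement(outputs):
    """Fraction of tokens on which every tagger assigns the same tag.

    This definition is an assumption made for illustration only.
    """
    columns = list(zip(*outputs))
    if not columns:
        return 0.0
    unanimous = sum(1 for column in columns if len(set(column)) == 1)
    return unanimous / len(columns)


# Hypothetical outputs of three taggers for "The old man the boats",
# already mapped to a shared coarse tagset.
tagger_a = ["DET", "ADJ", "NOUN", "DET", "NOUN"]
tagger_b = ["DET", "ADJ", "VERB", "DET", "NOUN"]
tagger_c = ["DET", "NOUN", "VERB", "DET", "NOUN"]

outputs = [tagger_a, tagger_b, tagger_c]
voted = combine(outputs)
score = agreement(outputs)
```

Note that agreement, as sketched here, needs no gold-standard tags, which is why it could serve as a rough confidence signal when output cannot be validated. High agreement does not guarantee accuracy, since all taggers can be wrong in the same way.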
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Uncontrolled Keywords: | corpora; parts-of-speech tagging; evaluation; ensembles; computing methodologies; artificial intelligence; natural language processing; language parsing and understanding; languages; measurement; verification |
| Subjects: | Y Unknown > Subjects to be assigned |
| Divisions: | Faculty > Faculty of Science > Computer Science |
| ID Code: | 826 |
| Deposited By: | Mr Kevin Glass |
| Deposited On: | 17 Oct 2007 |
| Last Modified: | 01 Aug 2012 08:15 |