DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, GG | - |
dc.contributor.author | Cha, JW | - |
dc.contributor.author | Lee, JH | - |
dc.date.accessioned | 2016-03-31T13:06:22Z | - |
dc.date.available | 2016-03-31T13:06:22Z | - |
dc.date.created | 2010-01-11 | - |
dc.date.issued | 2002-03 | - |
dc.identifier.issn | 0891-2017 | - |
dc.identifier.other | 2002-OAK-0000002640 | - |
dc.identifier.uri | https://oasis.postech.ac.kr/handle/2014.oak/19082 | - |
dc.description.abstract | Most errors in Korean morphological analysis and part-of-speech (POS) tagging are caused by unknown morphemes. This paper presents a syllable-pattern-based generalized unknown-morpheme-estimation method with POSTAG (POStech TAGger),(1) which is a statistical and rule-based hybrid POS tagging system. This method of guessing unknown morphemes is based on a combination of a morpheme pattern dictionary that encodes general lexical patterns of Korean morphemes with a posteriori syllable trigram estimation. The syllable trigrams help to calculate lexical probabilities of the unknown morphemes and are utilized to search for the best tagging result. This method can guess the POS tags of unknown morphemes regardless of their numbers and/or positions in an eojeol (a Korean spacing unit similar to an English word), which is not possible with other systems for tagging Korean. In a series of experiments using three different domain corpora, the system achieved a 97% tagging accuracy even though 10% of the morphemes in the test corpora were unknown. It also achieved very high coverage and accuracy of estimation for all classes of unknown morphemes. | - |
dc.description.statementofresponsibility | X | - |
dc.language | English | - |
dc.publisher | M I T PRESS | - |
dc.relation.isPartOf | COMPUTATIONAL LINGUISTICS | - |
dc.title | Syllable-pattern-based unknown-morpheme segmentation and estimation for hybrid part-of-speech tagging of Korean | - |
dc.type | Article | - |
dc.contributor.college | 컴퓨터공학과 | - |
dc.identifier.doi | 10.1162/089120102317341774 | - |
dc.author.google | Lee, GG | - |
dc.author.google | Cha, JW | - |
dc.author.google | Lee, JH | - |
dc.relation.volume | 28 | - |
dc.relation.issue | 1 | - |
dc.relation.startpage | 53 | - |
dc.relation.lastpage | 70 | - |
dc.contributor.id | 10103841 | - |
dc.relation.journal | COMPUTATIONAL LINGUISTICS | - |
dc.relation.index | SCI급, SCOPUS 등재논문 | - |
dc.relation.sci | SCI | - |
dc.collections.name | Journal Papers | - |
dc.type.rims | ART | - |
dc.identifier.bibliographicCitation | COMPUTATIONAL LINGUISTICS, v.28, no.1, pp.53 - 70 | - |
dc.identifier.wosid | 000175684600004 | - |
dc.date.tcdate | 2019-01-01 | - |
dc.citation.endPage | 70 | - |
dc.citation.number | 1 | - |
dc.citation.startPage | 53 | - |
dc.citation.title | COMPUTATIONAL LINGUISTICS | - |
dc.citation.volume | 28 | - |
dc.contributor.affiliatedAuthor | Lee, GG | - |
dc.contributor.affiliatedAuthor | Lee, JH | - |
dc.identifier.scopusid | 2-s2.0-0037582087 | - |
dc.description.journalClass | 1 | - |
dc.description.journalClass | 1 | - |
dc.description.wostc | 16 | - |
dc.type.docType | Article; Proceedings Paper | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | - |
dc.relation.journalWebOfScienceCategory | Linguistics | - |
dc.relation.journalWebOfScienceCategory | Language & Linguistics | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Linguistics | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
library@postech.ac.kr Tel: 054-279-2548
Copyrights © by 2017 Pohang University of Science ad Technology All right reserved.