A corpus-based learning method of compound noun indexing rules for Korean
SCIE
SCOPUS
- Title
- A corpus-based learning method of compound noun indexing rules for Korean
- Authors
- Kim, JH; Kwak, BK; Lee, S; Lee, G; Lee, JH
- Date Issued
- 2001-07
- Publisher
- KLUWER ACADEMIC PUBL
- Abstract
- In Korean information retrieval, compound nouns play an important role in improving precision in search experiments. There are two major approaches to compound noun indexing in Korean: statistical and linguistic. Each method, however, has its own shortcomings, such as limitations when indexing diverse types of compound nouns, over-generation of compound nouns, and data sparseness in training. In this paper, we propose a corpus-based learning method, which can index diverse types of compound nouns using rules automatically extracted from a large corpus. The automatic learning method is more portable and requires less human effort, although it exhibits a performance level similar to the manual-linguistic approach. We also present a new filtering method to solve the problems of compound noun over-generation and data sparseness.
- Keywords
- corpus-based learning; compound noun indexing; filtering; information retrieval; search performance evaluation
- URI
- https://oasis.postech.ac.kr/handle/2014.oak/19501
- DOI
- 10.1023/A:1011466928139
- ISSN
- 1386-4564
- Article Type
- Article
- Citation
- INFORMATION RETRIEVAL, vol. 4, no. 2, page. 115 - 132, 2001-07
- Files in This Item:
- There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.