Open Access System for Information Sharing

Department of Computer Science & Engineering (컴퓨터공학과) 1. Journal Papers

Article

Cited 45 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Full metadata record

Files in This Item:: There are no files associated with this item.

DC Field	Value	Language
dc.contributor.author	Yu, HJ	-
dc.contributor.author	Yang, J	-
dc.contributor.author	Han, JW	-
dc.contributor.author	Li, XL	-
dc.date.accessioned	2016-04-01T08:46:40Z	-
dc.date.available	2016-04-01T08:46:40Z	-
dc.date.created	2009-08-05	-
dc.date.issued	2005-11	-
dc.identifier.issn	1384-5810	-
dc.identifier.other	2005-OAK-0000017235	-
dc.identifier.uri	https://oasis.postech.ac.kr/handle/2014.oak/28743	-
dc.description.abstract	Support vector machines (SVMs) have been promising methods for classification and regression analysis due to their solid mathematical foundations, which include two desirable properties: margin maximization and nonlinear classification using kernels. However, despite these prominent properties, SVMs are usually not chosen for large-scale data mining problems because their training complexity is highly dependent on the data set size. Unlike traditional pattern recognition and machine learning, real-world data mining applications often involve huge numbers of data records. Thus it is too expensive to perform multiple scans on the entire data set, and it is also infeasible to put the data set in memory. This paper presents a method, Clustering-Based SVM (CB-SVM), that maximizes the SVM performance for very large data sets given a limited amount of resource, e.g., memory. CB-SVM applies a hierarchical micro-clustering algorithm that scans the entire data set only once to provide an SVM with high quality samples. These samples carry statistical summaries of the data and maximize the benefit of learning. Our analyses show that the training complexity of CB-SVM is quadratically dependent on the number of support vectors, which is usually much less than that of the entire data set. Our experiments on synthetic and real-world data sets show that CB-SVM is highly scalable for very large data sets and very accurate in terms of classification.	-
dc.description.statementofresponsibility	X	-
dc.language	English	-
dc.publisher	SPRINGER	-
dc.relation.isPartOf	DATA MINING AND KNOWLEDGE DISCOVERY	-
dc.subject	SUPPORT VECTOR MACHINES	-
dc.title	MAKING SVMS SCALABLE TO LARGE DATA SETS USING HIERARCHICAL CLUSTER INDEXING	-
dc.type	Article	-
dc.contributor.college	컴퓨터공학과	-
dc.identifier.doi	10.1007/S10618-005-0	-
dc.author.google	Yu, HJ	-
dc.author.google	Yang, J	-
dc.author.google	Han, JW	-
dc.author.google	Li, XL	-
dc.relation.volume	11	-
dc.relation.issue	3	-
dc.relation.startpage	295	-
dc.relation.lastpage	321	-
dc.contributor.id	10162777	-
dc.relation.journal	DATA MINING AND KNOWLEDGE DISCOVERY	-
dc.relation.index	SCI급, SCOPUS 등재논문	-
dc.relation.sci	SCI	-
dc.collections.name	Journal Papers	-
dc.type.rims	ART	-
dc.identifier.bibliographicCitation	DATA MINING AND KNOWLEDGE DISCOVERY, v.11, no.3, pp.295 - 321	-
dc.identifier.wosid	000233732500005	-
dc.date.tcdate	2019-01-01	-
dc.citation.endPage	321	-
dc.citation.number	3	-
dc.citation.startPage	295	-
dc.citation.title	DATA MINING AND KNOWLEDGE DISCOVERY	-
dc.citation.volume	11	-
dc.contributor.affiliatedAuthor	Yu, HJ	-
dc.description.journalClass	1	-
dc.description.journalClass	1	-
dc.description.wostc	27	-
dc.type.docType	Article	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-

Show simple item record

qr_code

트윗하기

Communities & Collection

Department of Computer Science & Engineering (컴퓨터공학과)

Related Researcher

Researcher

유환조YU, HWANJO: Dept of Computer Science & Enginrg

Read more

Open Access System for Information Sharing

Communities & Collection

Related Researcher

Views & Downloads

Browse