Open Access System for Information Sharing

Login Library

 

Article
Cited 5 time in webofscience Cited 5 time in scopus
Metadata Downloads
Full metadata record
Files in This Item:
There are no files associated with this item.
DC FieldValueLanguage
dc.contributor.authorMin-Soo Kim-
dc.contributor.authorSangyeon Lee-
dc.contributor.authorHan, WS-
dc.contributor.authorHimchan Park-
dc.contributor.authorJeong-Hoon Lee-
dc.date.accessioned2017-07-19T13:36:40Z-
dc.date.available2017-07-19T13:36:40Z-
dc.date.created2017-02-13-
dc.date.issued2015-10-
dc.identifier.issn1041-4347-
dc.identifier.urihttps://oasis.postech.ac.kr/handle/2014.oak/37320-
dc.description.abstractComputing connected components is a core operation on graph data. Since billion-scale graphs cannot be resident in memory of a single server, several approaches based on distributed machines have recently been proposed. The representative methods are Hash-To-Min and PowerGraph. Hash-To-Min is the state-of-the art disk-based distributed method which minimizes the number of MapReduce rounds. PowerGraph is the-state-of-the-art in-memory distributed system, which is typically faster than the disk-based distributed one, however, requires a lot of machines for handling billion-scale graphs. In this paper, we propose an I/O efficient parallel algorithm for billion-scale graphs in a single PC. We first propose the Disk-based Sequential access-oriented Parallel processing (DSP) model that exploits sequential disk access in terms of disk I/Os and parallel processing in terms of computation. We then propose an ultra-fast disk-based parallel algorithm for computing connected components, DSP-CC, which largely improves the performance through sequential disk scan and page-level cache-conscious parallel processing. Extensive experimental results show that DSP-CC 1) computes connected components in billion-scale graphs using the limited memory size whereas in-memory algorithms can only support medium-sized graphs with the same memory size, and 2) significantly outperforms all distributed competitors as well as a representative disk-based parallel method.-
dc.languageEnglish-
dc.publisherIEEE-
dc.relation.isPartOfIEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING-
dc.titleDSP-CC: I/O Efficient Parallel Computation of Connected Components in Billion-scale Networks-
dc.typeArticle-
dc.identifier.doi10.1109/TKDE.2015.2419665-
dc.type.rimsART-
dc.identifier.bibliographicCitationIEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, v.27, no.10, pp.2658 - 2671-
dc.identifier.wosid000361245300006-
dc.date.tcdate2019-02-01-
dc.citation.endPage2671-
dc.citation.number10-
dc.citation.startPage2658-
dc.citation.titleIEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING-
dc.citation.volume27-
dc.contributor.affiliatedAuthorHan, WS-
dc.identifier.scopusid2-s2.0-84941554873-
dc.description.journalClass1-
dc.description.journalClass1-
dc.description.wostc1-
dc.description.isOpenAccessN-
dc.type.docTypeArticle-
dc.subject.keywordAuthorGraphs-
dc.subject.keywordAuthordisk-based-
dc.subject.keywordAuthorparallel-
dc.subject.keywordAuthorconnected components-
dc.subject.keywordAuthorSSD-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher

한욱신HAN, WOOK SHIN
Grad. School of AI
Read more

Views & Downloads

Browse