Skip to main navigation Skip to search Skip to main content

A semi-clustering scheme for large-scale graph analysis on hadoop

  • Seungtae Hong
  • , Youngsung Shin
  • , Dong Hoon Choi
  • , Heeseung Jo
  • , Jae Woo Chang*
  • *Corresponding author for this work

    Research output: Contribution to conferenceConference paperpeer-review

    Abstract

    With the evolution of IT technologies, large-scale graph data have lately become a growing interest. As a result, there are a lot of research results in large-scale graph analysis on Hadoop. The graph analysis based on Hadoop provides parallel programming models with data partitioning and contains iterative phases of MapReduce jobs. Therefore, the effectiveness of data partitioning depends on how the data partitioning maintains data locality in each node of cluster. In this paper, we propose a semi-clustering scheme for large-scale graph analysis such as PageRank algorithm on Hadoop and show that the proposed scheme is effective. With experiment results, PageRank computation with the semi-clustering improves the performance.

    Original languageEnglish
    Title of host publicationMobile, Ubiquitous, and Intelligent Computing, MUSIC 2013
    PublisherSpringer Verlag
    Pages301-306
    Number of pages6
    ISBN (Print)9783642406744
    DOIs
    StatePublished - 2014
    Event4th International Conference on Mobile, Ubiquitous, and Intelligent Computing, MUSIC 2013 - Gwangju, Korea, Republic of
    Duration: 2013.09.42013.09.6

    Publication series

    NameLecture Notes in Electrical Engineering
    Volume274 LNEE
    ISSN (Print)1876-1100
    ISSN (Electronic)1876-1119

    Conference

    Conference4th International Conference on Mobile, Ubiquitous, and Intelligent Computing, MUSIC 2013
    Country/TerritoryKorea, Republic of
    CityGwangju
    Period13.09.413.09.6

    Keywords

    • Hadoop
    • large-scale graph analysis
    • PageRank
    • semi-clustering

    Quacquarelli Symonds(QS) Subject Topics

    • Engineering - Mechanical

    Fingerprint

    Dive into the research topics of 'A semi-clustering scheme for large-scale graph analysis on hadoop'. Together they form a unique fingerprint.

    Cite this