Skip to main navigation Skip to search Skip to main content

Korean-Japanese story link detection based on distributional and contrastive properties of event terms

    • NII (National Institute of Informatics)

    Research output: Contribution to journalJournal articlepeer-review

    Abstract

    In this paper, we propose a novel approach for multilingual story link detection. Our approach utilized the distributional features of terms in timelines and multilingual spaces, together with selected types of named entities in order to get distinctive weights for terms that constitute linguistic representation of events. On timelines term significance is calculated by comparing term distribution of the documents on a day with that of the total document collection. Since two languages can provide more information than one language, term significance is measured on each language space, which is then used as a bridge between two languages on multilingual spaces. Evaluating the method on Korean and Japanese news articles, our method achieved 14.3% improvement for monolingual story pairs, and 16.7% improvement for multilingual story pairs. By measuring the space density, the proposed weighting components are verified with a high density of the intra-event stories and a low density of the inter-events stories. This result indicates that the proposed method is helpful for multilingual story link detection.

    Original languageEnglish
    Pages (from-to)538-550
    Number of pages13
    JournalInformation Processing and Management
    Volume42
    Issue number2
    DOIs
    StatePublished - 2006.03

    Keywords

    • Distributional property
    • Event term
    • Multilingual space
    • Space density
    • Story link detection

    Quacquarelli Symonds(QS) Subject Topics

    • Computer Science & Information Systems
    • Engineering - Electrical & Electronic
    • Statistics & Operational Research
    • Data Science
    • Library & Information Management

    Fingerprint

    Dive into the research topics of 'Korean-Japanese story link detection based on distributional and contrastive properties of event terms'. Together they form a unique fingerprint.

    Cite this