Multilingual story link detection based on event term weighting on times and multilingual spaces

    Research output: Contribution to conference › Conference paper › peer-review

    Abstract

    In this paper, we propose a novel approach to multilingual story link detection. Our approach uses features such as timelines and multilingual spaces to assign distinctive weights to the terms that make up the linguistic representation of events. On timelines, term significance is calculated by comparing the term distribution of the documents on a given day with that of the whole document collection. Since two languages can provide more information than one, term significance is measured in each language space and then used as a bridge between the two languages in multilingual (here bilingual) spaces. Evaluated on Korean and Japanese news articles, our method achieved a 14.3% improvement for monolingual story pairs and a 16.7% improvement for multilingual story pairs. Measuring the space density verifies the proposed weighting components: intra-event stories show a high density, and inter-event stories show a low density. These results indicate that the proposed method is helpful for multilingual story link detection.
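
    The timeline weighting described in the abstract can be illustrated with a minimal sketch. The abstract does not state which statistic the authors use to compare the two distributions, so the smoothed log-ratio below is an assumption, and the function name `day_term_significance`, its parameters, and the toy data are hypothetical.

```python
import math
from collections import Counter


def day_term_significance(docs, day, smoothing=1.0):
    """Score terms seen on `day` against the whole collection.

    docs: list of (day, tokens) pairs.
    Returns {term: score}; a positive score means the term is relatively
    more frequent on `day` than in the collection overall.
    NOTE: the smoothed log-ratio is an assumed statistic, not the paper's.
    """
    collection = Counter()  # term counts over all documents
    on_day = Counter()      # term counts over documents dated `day`
    for doc_day, tokens in docs:
        collection.update(tokens)
        if doc_day == day:
            on_day.update(tokens)

    vocab_size = len(collection)
    day_total = sum(on_day.values())
    coll_total = sum(collection.values())

    scores = {}
    for term, count in on_day.items():
        # Additive smoothing keeps both probabilities strictly positive.
        p_day = (count + smoothing) / (day_total + smoothing * vocab_size)
        p_coll = (collection[term] + smoothing) / (coll_total + smoothing * vocab_size)
        scores[term] = math.log(p_day / p_coll)
    return scores


# Toy collection: "quake" is concentrated on day d1, "news" appears every day.
docs = [
    ("d1", ["quake", "quake", "news"]),
    ("d2", ["news", "sports"]),
    ("d3", ["news", "sports"]),
]
scores = day_term_significance(docs, "d1")
```

    In this toy collection, a term concentrated on one day ("quake") scores above zero, while a term spread evenly across days ("news") scores below zero, which is the kind of contrast a day-versus-collection comparison is meant to expose.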

    Original language: English
    Title of host publication: Digital Libraries
    Subtitle of host publication: International Collaboration and Cross-Fertilization - 7th International Conference on Asian Digital Libraries, ICADL 2004
    Editors: Qihao Miao, Ee-peng Lim, Zhaoneng Chen, Yuxi Fu, Hsinchun Chen, Edward Fox
    Publisher: Springer Verlag
    Pages: 398-407
    Number of pages: 10
    ISBN (Print): 9783540240303
    State: Published - 2005
    Event: 7th International Conference on Asian Digital Libraries, ICADL 2004 - Shanghai, China
    Duration: 2004.12.13 - 2004.12.17

    Publication series

    Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume: 3334 LNCS
    ISSN (Print): 0302-9743
    ISSN (Electronic): 1611-3349

    Conference

    Conference: 7th International Conference on Asian Digital Libraries, ICADL 2004
    Country/Territory: China
    City: Shanghai
    Period: 04.12.13 - 04.12.17

    Quacquarelli Symonds (QS) Subject Topics

    • Computer Science & Information Systems
