Advertisement

Measuring Peculiarity of Text Using Relation between Words on the Web

  • Takeru Nakabayashi
  • Takayuki Yumoto
  • Manabu Nii
  • Yutaka Takahashi
  • Kazutoshi Sumiya
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6102)

Abstract

We define the peculiarity of text as a metric of information credibility. Higher peculiarity means lower credibility. We extract the theme word and the characteristic words from text and check whether there is a subject-description relation between them. The peculiarity is defined using the ratio of the subject-description relation between a theme word and characteristic words. We evaluate the extent to which peculiarity can be used to judge by classifying text from Wikipedia and Uncyclopedia in terms of the peculiarity.

Keywords

Lower Credibility Device Product Acteristic Word Characteristic Word Information Credibility 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Gyöngyi, Z., Garcia-Molina, H., Pedersen, J.: Combating web spam with trustrank. In: Proceedings of the Thirtieth international conference on Very large data bases (VLDB 2004), VLDB Endowment, pp. 576–587 (2004)Google Scholar
  2. 2.
    Yamamoto, Y., Tanaka, K.: Finding comparative facts and aspects for judging the credibility of uncertain facts. In: Vossen, G., Long, D.D.E., Yu, J.X. (eds.) WISE 2009. LNCS, vol. 5802, pp. 291–305. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  3. 3.
    Nakagawa, H., Yumoto, H., Mori, T.: Term extraction based on occurrence and concatenation frequency (in Japanese). Journal of natural language processing 10(1), 27–45 (2003)Google Scholar
  4. 4.
    Oyama, S., Tanaka, K.: Query modification by discovering topics from web page structures. In: Yu, J.X., Lin, X., Lu, H., Zhang, Y. (eds.) APWeb 2004. LNCS, vol. 3007, pp. 553–564. Springer, Heidelberg (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Takeru Nakabayashi
    • 1
  • Takayuki Yumoto
    • 2
  • Manabu Nii
    • 2
  • Yutaka Takahashi
    • 2
  • Kazutoshi Sumiya
    • 3
  1. 1.School of EngineeringUniversity of HyogoHyogoJapan
  2. 2.Graduate School of EngineeringUniversity of HyogoHyogoJapan
  3. 3.School of Human Science and EnvironmentUniversity of HyogoHyogoJapan

Personalised recommendations