Skip to main content

Simple Document-by-Document Search Tool “Fuwatto Search” Using Web API

  • Conference paper
Book cover The Emergence of Digital Libraries – Research and Practices (ICADL 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8839))

Included in the following conference series:

Abstract

In this paper, we propose a new search method Fuwatto Search that allows users to retrieve documents in a document-by-document manner via a Web API. We present an implementation of the proposed method (i.e., Fuwatto CiNii Search), which targets the CiNii Article database, one of the largest academic article databases in Japan. The experimental evaluation of Fuwatto CiNii Search with newspaper articles demonstrates the retrieval effectiveness of 0.25 for precision at 10 and 0.17 for mean average precision.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Nakatani, S.: Body text extraction of web pages (in Japanese), http://labs.cybozu.co.jp/blog/nakatani/2007/09/web_1.html (updated September 12, 2007, accessed June 15, 2014)

  2. National Institute of Informatics: CiNii Articles, http://ci.nii.ac.jp/en (accessed June 15, 2014)

  3. National Institute of Informatics: Metadata and API: CiNii Articles OpenSearch for Articles, http://ci.nii.ac.jp/info/en/api/a_opensearch.html (accessed June 15, 2014)

  4. Kudo, T.: MeCab: Yet another part-of-speech and morphological analyzer, https://code.google.com/p/mecab/ (accessed June 15, 2014)

  5. Library of Congress: InQuery stopword list for THOMAS, http://thomas.loc.gov/home/stopwords.html (accessed February 10, 2010)

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Takaku, M., Egusa, Y. (2014). Simple Document-by-Document Search Tool “Fuwatto Search” Using Web API. In: Tuamsuk, K., Jatowt, A., Rasmussen, E. (eds) The Emergence of Digital Libraries – Research and Practices. ICADL 2014. Lecture Notes in Computer Science, vol 8839. Springer, Cham. https://doi.org/10.1007/978-3-319-12823-8_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12823-8_32

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12822-1

  • Online ISBN: 978-3-319-12823-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics