Skip to main content

Energy Efficient Calculations of Text Similarity Measure on FPGA-Accelerated Computing Platforms

  • Conference paper
  • First Online:
Parallel Processing and Applied Mathematics (PPAM 2015)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9573))

Abstract

This paper presents an impact of the customized hardware accelerator on the overall performance of the text similarity computing system. The hardware processing module that is presented in the paper is a building block of the processing engine in the search system of related documents. The engine is used in the phase of preliminary retrieval of similar documents. The TF-IDF weighting scheme and cosine similarity metric are used by the module. Evaluation boards equipped with Xilinx’s Field Programmable Gate Array (FPGA) were utilized as a hardware platforms for implementation of the selected time-consuming operations. The series of tests was conducted, and the results of the hardware-accelerated solutions were compared against the standard software implementation. The two different FPGA-enabled platforms were employed in the experiments. The low-power and the high-performance platform were used to compare the metrics of different hardware solutions. We provide the adequate results and conclusions that present that the energy and speed metrics of the text similarity calculations can be improved thanks to the hardware accelerator. Consequently, the cluster of FPGA-enabled nodes is proposed for the large scale processing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Wielgosz, M., Janiszewski, M., Russek, P., Pietroń, M., Jamro, E., Wiatr, K.: Implementation of a system for fast text search and document comparison. In: Bembenik, R., et al. (eds.) Intelligent Tools for Building a Scientific Information Platform: From Research to Implementation, pp. 173–186. Springer, Switzerland (2014)

    Chapter  Google Scholar 

  2. Perera, D.G., Li, K.F.: Parallel computation of similarity measures using an FPGA-based processor array. In: 2008 22nd International Conference on Advanced Information Networking and Applications, AINA 2008, pp. 955–962, March 2008

    Google Scholar 

  3. Russek, P., Wiatr, K.: The regular expression matching algorithm for the energy efficient reconfigurable SoC. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Waśniewski, J. (eds.) PPAM 2013, Part I. LNCS, vol. 8384, pp. 545–556. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  4. Wang, W., Guo, K., Gu, M., Ma, Y., Wang, Y.: A universal FPGA-based floating-point matrix processor for mobile systems. In: 2014 International Conference on Field-Programmable Technology (FPT), pp. 139–146, December 2014

    Google Scholar 

  5. Dorrance, R., Ren, F., Marković, D.: A scalable sparse matrix-vector multiplication kernel for energy-efficient sparse-BLAS on FPGAs. In: ACM/SIGDA International Symposium on FPGA, pp. 161–170, February 2014

    Google Scholar 

  6. Jamro, E., Pabiś, T., Russek, P., Wiatr, K.: The algorithms for FPGA implementation of sparse matrices multiplication. Comput. Inform. 33(3), 667–684 (2015)

    MATH  Google Scholar 

  7. Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)

    Article  Google Scholar 

  8. Kiela, D., Clark, S.: A systematic study of semantic vector space model parameters. In: Proceedings of the 2nd Workshop on Continuous Vector Space Models and Their Compositionality (CVSC) at EACL, pp. 21–30, April 2014

    Google Scholar 

  9. Karwatowski, M., Koryciak, S., Wiatr, K.: Cosine similarity metric calculation on low power heterogeneous computing platform. In: Proceedings of the KU KDM 2015: Eighth ACC Cyfronet AGH users’ Conference, Zakopane, 11–13 March 2015, pp. 111–112 (2015)

    Google Scholar 

  10. Russek, P., Karwatowski, M., Wielgosz, M., Frączek, R., Wiatr, K.: Documents similarity calculation in the low-power cluster. In: Proceedings of the KU KDM 2015: Eighth ACC Cyfronet AGH Users’ Conference, Zakopane, 11–13 March 2015, pp. 37–38 (2015)

    Google Scholar 

  11. Karwatowski, M., Wielgosz, M., Russek, P., Wiatr, K.: FPGA-based low-energy cluster for acceleration of the document similarity analysis. In: Proceedings of the Cracow Grid Workshop, CGW 2014, 27–29 October 2014, Krakow, Poland, pp. 57–58 (2014)

    Google Scholar 

  12. Xillybus project site. http://xillybus.com

  13. Zedboard community site. http://zedboard.org

  14. Xilinx Virtex-7 FPGA VC707 Evaluation Kit. http://www.xilinx.com

Download references

Acknowledgments

This research was supported by the European Regional Development Program no. POIG.02.03.00-12-137/13 PL-Grid Core.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michał Karwatowski .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Karwatowski, M., Russek, P., Wielgosz, M., Koryciak, S., Wiatr, K. (2016). Energy Efficient Calculations of Text Similarity Measure on FPGA-Accelerated Computing Platforms. In: Wyrzykowski, R., Deelman, E., Dongarra, J., Karczewski, K., Kitowski, J., Wiatr, K. (eds) Parallel Processing and Applied Mathematics. PPAM 2015. Lecture Notes in Computer Science(), vol 9573. Springer, Cham. https://doi.org/10.1007/978-3-319-32149-3_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-32149-3_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-32148-6

  • Online ISBN: 978-3-319-32149-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics