Skip to main content

Small Business-Oriented Index Construction of Cloud Data

  • Conference paper
Algorithms and Architectures for Parallel Processing (ICA3PP 2012)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7440))

Abstract

With the development of cloud computing, data owners (businesses and individuals) are motivated to outsource their local complex database systems to public cloud for flexibility and economic savings. But for the consideration of user’s privacy, personal data has to be special treatment locally before outsourcing to the cloud server. Considering the large number of data users and documents in cloud, it is crucial for data owner to construct an index for their data collection, which increases the cost of the data owner. Related works focus on the searches on encrypted database but rarely consider the overhead of the index construction for data owner and the extensions of the index. Although traditional index construction methods of information retrieval have been widely studied, direct application of these methods would not be necessarily suitable for our scenario. Thus, enabling an efficient index construction service is of paramount. In this paper, we define and solve the problem of index construction on small business (SBIC). Among various index methods, we choose inverted index method. An inverted index is an index data structure storing a mapping from content to its locations in a set of documents. The purpose of it is to allow fast full text searches.We firstly propose a basic SBIC scheme using Lucene (an open source project for web search engine), and then significantly improve it to meet efficient keyword extraction requirement and multi-type files demand. Thorough analysis design goals(see section 2.3) of proposed schemes is given, extensive experimental results on the dataset further show proposed scheme indeed introduce low overhead on time and space.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhang, Q., Cheng, L., Boutaba, R.: Cloud computing: state-of-the-art and research challenges. Proc. Journal of Internet Services and Applications 1(1), 7–18 (2010)

    Article  Google Scholar 

  2. Kamara, S., Lauter, K.: Cryptographic Cloud Storage. In: Sion, R., Curtmola, R., Dietrich, S., Kiayias, A., Miret, J.M., Sako, K., Sebé, F. (eds.) FC 2010 Workshops. LNCS, vol. 6054, pp. 136–149. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  3. Wang, C., Cao, N., Li, J., Ren, K., Lou, W.: Secure ranked keyword search over encrypted cloud data. In: Proc. ICDCS 2010, pp. 253–262 (2010)

    Google Scholar 

  4. Li, J., Wang, Q., Wang, C., Cao, N., Ren, K., Lou, W.: Fuzzy keyword search over encrypted data in cloud computing. In: Proc. INFOCOM 2010 Mini-Conference, San Diego, pp. 1–5 (March 2010)

    Google Scholar 

  5. Cao, N., Wang, C., Li, M., Ren, K., Lou, W.: Privacy-Preserving Multi-Keyword Ranked Search over Encrypted Cloud Data. In: Proc. INFOCOM 2011, Shanghai, pp. 829–837 (April 2011)

    Google Scholar 

  6. Chuah, M., Hu, W.: Privacy-Aware Bedtree Based Solution for Fuzzy Multi-Keyword Search over Encrypted Data. In: Proc. ICDCSW 2011, pp. 273–281 (2011)

    Google Scholar 

  7. Wang, C., Ren, K., Yu, S., Urs, K.M.R.: Achieving Usable and Privacy-Assured Similarity Search over Outsourced Cloud Data. In: Proc. INFOCOM 2012, Orlando (March 2012)

    Google Scholar 

  8. Cao, N., Yang, Z., Wang, C., Ren, K., Lou, W.: Privacy-Preserving Query over Encrypted Graph-Structured Data in Cloud Computing. In: Proc. ICDCS 2011, pp. 393–402 (2011)

    Google Scholar 

  9. Yang, Y., Lu, H., Weng, J.: Multi-User Private Keyword Search for Cloud Computing. In: Proc. CLOUDCOM 2011, Athens, pp. 264–271 (2011) ISBN:146730090X

    Google Scholar 

  10. Goyal, V., Pandey, O., Sahai, A., Waters, B.: Attribute-Based Encryption for Fine-Gained Access Control of Encrypted Data. In: Proc. ACM CCS 2006, pp. 89–98 (2006)

    Google Scholar 

  11. Di Vimercati, S.D.C., Foresti, S., Jajodia, S., Paraboschi, S., Samarati, P.: Over-Encryption: Management of Access Control Evolution on Outsourced Data. In: Proc. VLDB Endowment 2007, pp. 123–134 (2007)

    Google Scholar 

  12. Yu, S., Wang, C., Ren, K., Lou, W.: Achieving Secure, Scalable, and Fine-Grained Data Access Control in Cloud Computing. In: Proc. INFOCOM 2010, pp. 1–9 (2010)

    Google Scholar 

  13. Chai, Q., Gong, G.: Verifiable Symmetric Searchable Encryption for Semi-Honest-but-Curious Cloud Servers

    Google Scholar 

  14. Witten, I.H., Moffat, A., Bell, T.C.: Managing gigabytes: Compressing and indexing documents and images. Morgan Kaufmann Publishing, San Francisco (1999)

    Google Scholar 

  15. Inverted Index, http://en.wikipedia.org/wiki/Inverted_index

  16. Cutting, D.: Lucene, http://lucene.apache.org/

  17. TF*IDF, http://en.wikipedia.org/wiki/Tfidf

  18. Zhang, H.: NLPIR dataset, http://www.datatang.com/datares/go.aspx?dataid=605109

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Peng, K., Zou, H., Lin, R., Yang, F. (2012). Small Business-Oriented Index Construction of Cloud Data. In: Xiang, Y., Stojmenovic, I., Apduhan, B.O., Wang, G., Nakano, K., Zomaya, A. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2012. Lecture Notes in Computer Science, vol 7440. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33065-0_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33065-0_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33064-3

  • Online ISBN: 978-3-642-33065-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics