Extracting Information from Google Fusion Tables

  • Marco Brambilla
  • Stefano Ceri
  • Nicola Cinefra
  • Anish Das Sarma
  • Fabio Forghieri
  • Silvia Quarteroni
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7538)

Abstract

With Fusion Tables, Google has made available a huge repository that allows users to share, visualize and manage structured data. Since 2009, thousands of tables have been shared online, encompassing data from virtually any domain and entered by all kinds of users, from professional to non-experts. While Fusion Tables are a potentially precious source of freely available structured information for all sorts of applications, complex querying and composing them is not supported natively, as it requires understanding both the structure and content of tables’ data, which are heterogeneous and produced "bottom-up". In this paper, we discuss ongoing and future work concerning the integration of Fusion Tables in the aim of efficiently integrating, visualizing, and querying them.

Keywords

semantic annotation service description search services 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Baeza-Yates, R., Raghavan, P.: Chapter 2: Next Generation Web Search. In: Ceri, S., Brambilla, M. (eds.) Search Computing. LNCS, vol. 5950, pp. 11–23. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  2. 2.
    Bozzon, A., Brambilla, M., Catarci, T., Ceri, S., Fraternali, P., Matera, M.: Visualization of Multi-domain Ranked Data. In: Ceri, S., Brambilla, M. (eds.) Search Computing II. LNCS, vol. 6585, pp. 53–69. Springer, Heidelberg (2011)CrossRefGoogle Scholar
  3. 3.
    Bozzon, A., Brambilla, M., Ceri, S.: Crowdsearcher. In: Proc. WWW 2012, Lyon (to appear, 2012)Google Scholar
  4. 4.
    Bozzon, A., Brambilla, M., Ceri, S., Fraternali, P.: Liquid Query: Multi-Domain Exploratory Search on the Web. In: Proc. WWW 2010, pp. 161–170 (2011)Google Scholar
  5. 5.
    Bozzon, A., Brambilla, M., Ceri, S., Quarteroni, S.: A Framework for Integrating, Exploring, and Searching Location-Based Web Data. IEEE Internet Computing 15(6), 24–31 (2011)CrossRefGoogle Scholar
  6. 6.
    Braga, D., Ceri, S., Daniel, F., Martinenghi, D.: Optimization of multi-domain queries on the Web. Proc. VLDB 1(1), 562–573 (2008)Google Scholar
  7. 7.
    Brambilla, M., Campi, A., Ceri, S., Quarteroni, S.: Semantic Resource Framework. In: Ceri, S., Brambilla, M. (eds.) Search Computing II. LNCS, vol. 6585, pp. 73–84. Springer, Heidelberg (2011)CrossRefGoogle Scholar
  8. 8.
    Cai, D., Yu, S., Wen, J., Ma, W.: Block-based Web Search. In: Proceedings of SIGIR (2004)Google Scholar
  9. 9.
    Fensel, D., Musen, M.: Special Issue on Semantic Web Technology. IEEE Intelligent Systems (IEEE IS) 16(2)Google Scholar
  10. 10.
    Google Fusion Tables, http://tables.googlelabs.com/
  11. 11.
    Gonzalez, H., Halevy, A., Jensen, C., Langen, A., Madhavan, J., Shapley, R., Shen, W., Goldberg-Kidon, J.: Google fusion tables: web-centered data management and collaboration. In: Proceedings of the 2010 International Conference on Management of Data, SIGMOD 2010, Indianapolis, USA, June 06 - 10, pp. 175–180 (2010)Google Scholar
  12. 12.
    Gonzalez, H., Halevy, A., Jensen, C., Langen, A., Madhavan, J., Shapley, R., Shen, W.: Google Fusion Tables: Data Management, Integration, and Collaboration in the Cloud. In: Proceedings of the ACM Symposium on Cloud Computing, SOCC (2010)Google Scholar
  13. 13.
    Das Sarma, A., Fang, L., Gupta, N., Halevy, A., Lee, H., Wu, F., Xin, R.: Finding Related Tables. In: Proc. ACM-SIGMOD (to appear, 2012)Google Scholar
  14. 14.
    Macqueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. University of California Press, Berkeley (1967)Google Scholar
  15. 15.
  16. 16.
    Von Ahn, L.: Games With A Purpose. Computer 39(6), 92–94 (2006)CrossRefGoogle Scholar
  17. 17.
    Chen, L.J., Wang, B.-C., Zhu, W.-Y.: The design of puzzle selection strategies for ESP-like GWAP systems. IEEE Transactions on Computational Intelligence and Games 2(2) (2010)Google Scholar
  18. 18.
    Chan, K.T., King, I., Yuen, M.-C.: Mathematical Modeling of Social Games. In: Proceedings of ICCSE 2009. IEEE (2009)Google Scholar
  19. 19.
    Franklin, M.J., et al.: CrowdDB: answering queries with crowdsourcing. In: Proceedings of the 2011 International Conference on Management of Data (SIGMOD 2011), pp. 61–72. ACM, New York (2011)Google Scholar
  20. 20.
    Marcus, A., et al.: Crowdsourced Databases: Query Processing with People. In: Conference on Innovative Data Systems Research, Asilomar, CA, pp. 211–214 (2011)Google Scholar
  21. 21.
    Parameswaran, A., Polyzotis, N.: Answering Queries using Databases, Humans and Algorithms. In: Conference on Innovative Data Systems Research 2011, Asilomar, CA, pp. 160–166 (2011)Google Scholar
  22. 22.
    Hoffart, J., Suchanek, F.M., Berberich, K., Lewis Kelham, E., de Melo, G., Weikum, G.: YAGO2: Exploring and Querying World Knowledge in Time, Space, Context, and Many Languages. In: Proc. WWW 2011, pp. 229–232 (2011)Google Scholar
  23. 23.
    Jiang, J., Conrath, D.: Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy. In: Proc. International Conference on Research in Computational Linguistics, Taiwan, (1997)Google Scholar
  24. 24.
    Wu, Z., Palmer, M.: Verb Semantics and Lexical Selection. In: Proc. 32nd Annual Meeting of the Association for Computational Linguistics, pp. 132–138 (1994)Google Scholar
  25. 25.
    Levenshtein, V.: Binary codes capable of correcting deletions, insertions and reversals. Soviet Physics Doklady 10, 707–710 (1966)MathSciNetGoogle Scholar
  26. 26.
    Ullman, J.: Information Integration using Logical Views. In: Afrati, F.N., Kolaitis, P.G. (eds.) ICDT 1997. LNCS, vol. 1186, pp. 19–40. Springer, Heidelberg (1996)CrossRefGoogle Scholar
  27. 27.
    Lenzerini, M.: Data Integration: A Theoretical Perspective. In: Proc. ACM-PODS, pp. 233–246 (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Marco Brambilla
    • 1
  • Stefano Ceri
    • 1
  • Nicola Cinefra
    • 1
  • Anish Das Sarma
    • 2
  • Fabio Forghieri
    • 1
  • Silvia Quarteroni
    • 1
  1. 1.Dipartimento di Elettronica e InformazionePolitecnico di MilanoMilanoItaly
  2. 2.Google Inc.U.S

Personalised recommendations