Skip to main content

Analytic Challenges in Social Sensing

  • Chapter
  • First Online:
The Art of Wireless Sensor Networks

Part of the book series: Signals and Communication Technology ((SCT))

Abstract

Social sensing applications refer to those where individuals play an important role in data collection. They can act as sensor carriers (e.g., carrying GPS devices that share location data), sensor operators (e.g., taking pictures with smart phones), or as sensors themselves (e.g., sharing their observations on Twitter). The proliferation of sensors in the possession of the average individual, together with the popularity of social networks that allow massive information dissemination, heralds an era of social sensing that brings about new research challenges reviewed in this chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Regression is a technique from estimation theory, applied to continuous or inherently ordered parameters to predict continuous or ordered values. In contrast, prediction uses machine learning to predict unordered class labels.

  2. 2.

    To make error values meaningful, we have normalized fuel consumption values to be zero mean and between \(-1\) and \(1\).

References

  1. T. Abdelzaher et al., Mobiscopes for human spaces. IEEE Pervasive Comput. 6(2), 20–29 (2007)

    Article  Google Scholar 

  2. T.F. Abdelzaher, Y. Anokwa, P. Boda, J. Burke, D. Estrin, L.J. Guibas, A. Kansal, S. Madden, J. Reich, Mobiscopes for human spaces. IEEE Pervasive Comput. 6(2), 20–29 (2007)

    Article  Google Scholar 

  3. K. Aberer, Z. Despotovic, Managing trust in a peer-2-peer information system. in CIKM ’01: Proceedings of the Tenth International Conference on Information and Knowledge Management, pp. 310–317. ACM, New York, NY, USA, 2001

    Google Scholar 

  4. G. Adomavicius, A. Tuzhilin, Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans. Knowl. Data Eng. 17(6), 734–749 (2005)

    Article  Google Scholar 

  5. C. Aggarwal, T. Abdelzaher, Social sensing. in Managing and Mining Sensor Data (Kluwer Academic Publishers, Boston, 2013)

    Google Scholar 

  6. C. Aggarwal, T. Abdelzaher, Integrating sensors and social networks. Social Network Data Analytics (Springer, expected in 2011)

    Google Scholar 

  7. D. Agrawal, C.C. Aggarwal, On the design and quantification of privacy preserving data mining algorithms. in Proceedings of the 20th ACM SIGMOD Symposium on Principles of Database Systems, pp. 247–255, 2001

    Google Scholar 

  8. R. Agrawal, R. Srikant, Privacy preserving data mining. in Proceedings of ACM Conference on Management of Data, pp. 439–450, May 2000

    Google Scholar 

  9. H. Ahmadi, T. Abdelzaher, J. Han, N. Pham, R.K. Ganti. The sparse regression cube: a reliable modeling technique for open cyber-physical systems. in Proceedings of the 2011 IEEE/ACM Second International Conference on Cyber-Physical Systems, ICCPS ’11, pp. 87–96. IEEE Computer Society, Washington, DC, USA, 2011

    Google Scholar 

  10. H. Ahmadi, N. Pham, R. Ganti, T. Abdelzaher, S. Nath, J. Han, Privacy-aware regression modeling of participatory sensing data. in Proceedings of the 8th ACM Conference on Embedded Networked Sensor Systems, SenSys ’10, pp. 99–112. ACM, New York, NY, USA, 2010

    Google Scholar 

  11. R. Balakrishnan, Source rank: relevance and trust assessment for deep web sources based on inter-source agreement. in 20th World Wide Web Conference (WWW’11) 2011

    Google Scholar 

  12. L. Berti-Equille, A.D. Sarma, X. Dong, A. Marian, D. Srivastava, Sailing the information ocean with awareness of currents: discovery and application of source dependence. in CIDR’09 2009

    Google Scholar 

  13. J. Bilmes, A gentle tutorial on the em algorithm and its application to parameter estimation for gaussian mixture and hidden markov models. Technical report, University of Berkeley, ICSI-TR-97-021, 1997

    Google Scholar 

  14. A.T. Campbell, S.B. Eisenman, N.D. Lane, E. Miluzzo, R.A. Peterson, H. Lu, X. Zheng, M. Musolesi, K. Fodor, G.-S. Ahn, The rise of people-centric sensing. IEEE Internet Comput. 12(4), 12–21 (2008)

    Article  Google Scholar 

  15. G. Casella, R. Berger, Statistical Inference (Duxbury Press, Pacific Grove, 2002)

    Google Scholar 

  16. S. Chaudhuri, U. Dayal, An overview of data warehousing and OLAP technology. SIGMOD Rec. 26, 65–74 (1997)

    Article  Google Scholar 

  17. B.-C. Chen, L. Chen, Y. Lin, R. Ramakrishnan, Prediction cubes. in Proceedings 2005 International Conference Very Large Data Bases (VLDB’05), pp. 982–993, Trondheim, Norway, Aug 2005

    Google Scholar 

  18. Y. Chen, G. Dong, J. Han, J. Pei, B.W. Wah, J. Wang, Regression cubes with lossless compression and aggregation. IEEE Trans. Knowl. Data Eng. 18, 1585–1599 (2006)

    Article  Google Scholar 

  19. Y. Chen, G. Dong, J. Han, B. W. Wah, J. Wang. Multi-dimensional regression analysis of time-series data streams. in Proceedings 2002 International Conference Very Large Data Bases (VLDB’02), pp. 323–334, Hong Kong, China, Aug 2002

    Google Scholar 

  20. D.J. Cook, L.B. Holder, Sensor selection to support practical use of health-monitoring smart environments. Wiley Interd. Rev. Data Min. Knowl. Discovery 1(4), 339–351 (2011)

    Article  Google Scholar 

  21. H. Cramer. Mathematical Methods of Statistics (Princeton University Press, Princeton, 1946)

    Google Scholar 

  22. O. Dekel, O. Shamir, Vox populi: collecting high-quality labels from a crowd. in In Proceedings of the 22nd Annual Conference on Learning Theory 2009

    Google Scholar 

  23. S.A. Delre, W. Jager, M.A. Janssen, Diffusion dynamics in small-world networks with heterogeneous consumers. Comput. Math. Organ. Theory 13, 185–202 (2007)

    Article  MATH  Google Scholar 

  24. A.P. Dempster, N.M. Laird, D.B. Rubin, Maximum likelihood from incomplete data via the em algorithm. J. Roy. Stat. Soc. B 39(1), 1–38 (1977)

    MATH  MathSciNet  Google Scholar 

  25. X. Dong, L. Berti-Equille, Y. Hu, D. Srivastava, Global detection of complex copying relationships between sources. PVLDB 3(1), 1358–1369 (2010)

    Google Scholar 

  26. X. Dong, L. Berti-Equille, D. Srivastava, Truth discovery and copying detection in a dynamic world. VLDB 2(1), 562–573 (2009)

    Google Scholar 

  27. A. Doucet, N. De Freitas, N. Gordon, (eds.), Sequential Monte Carlo Methods, in Practice (Springer, New York, 2001)

    Google Scholar 

  28. R.O. Duda, P.E. Hart, D.G. Stork. Pattern Classification, 2nd edn. (Wiley-Interscience, New York, 2001)

    Google Scholar 

  29. S.B. Eisenman et al., The bikenet mobile sensing system for cyclist experience mapping. in Proceedings of SenSys, Nov 2007

    Google Scholar 

  30. A. Evfimievski, J. Gehrke, R. Srikant, Limiting privacy breaches in privacy preserving data mining. in Proceedings of the SIGMOD/PODS Conference, pp. 211–222, 2003

    Google Scholar 

  31. A. Galland, S. Abiteboul, A. Marian, P. Senellart, Corroborating information from disagreeing views. in WSDM, pp. 131–140, 2010

    Google Scholar 

  32. R. Ganti, N. Pham, Y.-E. Tsai, T. Abdelzaher, Poolview: stream privacy for grassroots participatory sensing. in ACM Sensys, Raleigh, NC, Nov 2008

    Google Scholar 

  33. R.K. Ganti, S. Srinivasan, A. Gacic, Multisensor fusion in smartphones for lifestyle monitoring. in Proceedings of the 2010 International Conference on Body Sensor Networks, BSN ’10, pp. 36–43. IEEE Computer Society, Washington, DC, USA, 2010

    Google Scholar 

  34. P. Gilbert, L.P. Cox, J. Jung, D. Wetherall, Toward trustworthy mobile sensing. in Proceedings of the Eleventh Workshop on Mobile Computing Systems and Applications, HotMobile ’10, pp. 31–36. ACM, New York, NY, USA, 2010

    Google Scholar 

  35. P. Gilbert, J. Jung, K. Lee, H. Qin, D. Sharkey, A. Sheth, L.P. Cox. Youprove: authenticity and fidelity in mobile sensing. in Proceedings of the 9th ACM Conference on Embedded Networked Sensor Systems, SenSys ’11, pp. 176–189. ACM, New York, NY, USA, 2011

    Google Scholar 

  36. J. Gray, S. Chaudhuri, A. Bosworth, A. Layman, D. Reichart, M. Venkatrao, F. Pellow, H. Pirahesh, Data cube: a relational aggregation operator generalizing group-by, cross-tab and sub-totals. Data Min. and Knowl. Discovery 1, 29–54 (1997)

    Article  Google Scholar 

  37. V. Harinarayan, A. Rajaraman, J.D. Ullman. Implementing data cubes efficiently. in Proceedings 1996 ACM-SIGMOD International Conference Management of Data (SIGMOD’96), pp. 205–216, Montreal, Canada, June 1996

    Google Scholar 

  38. A. Helal, D.J. Cook, M. Schmalz, Smart home-based health platform for behavioral monitoring and alteration of diabetes patients. J. Diab. Sci. Technol. 3(1), 141–148 (2009)

    Google Scholar 

  39. K. Hoffman, D. Zage, C.N. Rotaru, A survey of attack and defense techniques for reputation systems. ACM Comput. Surv. 42(1), 1–31 (2009)

    Article  Google Scholar 

  40. R.V. Hogg, A.T. Craig, Introduction to Mathematical Statistics (Prentice Hall, Upper Saddle River, 1995)

    Google Scholar 

  41. D. Houser, J. Wooders, Reputation in auctions: theory, and evidence from ebay. J. Econ. Manage. Strategy 15(2), 353–369 (2006)

    Article  Google Scholar 

  42. J. Huang. Color-spatial image indexing and applications. Ph.D. thesis, Cornell University, 1998

    Google Scholar 

  43. J.-H. Huang, S. Amjad, S. Mishra, Cenwits: a sensor-based loosely coupled search and rescue system using witnesses. in Proceedings of SenSys, pp. 180–191, 2005

    Google Scholar 

  44. Z. Huang, W. Du, B. Chen, Deriving private information from randomized data. in Proceedings of the 2005 ACM SIGMOD Conference, pp. 37–48, Baltimore, MD, June 2005

    Google Scholar 

  45. C. Hui, M.K. Goldberg, M. Magdon-Ismail, W.A. Wallace, Simulating the diffusion of information: an agent-based modeling approach. in IJATS, pp. 31–46, 2010

    Google Scholar 

  46. B. Hull et al. Cartel: a distributed mobile sensor computing system. in Proceedings of SenSys, pp. 125–138, 2006

    Google Scholar 

  47. U.T. Inc, U.T.I. Staff, Solving Data Mining Problems Using Pattern Recognition Software with CDROM, 1st edn. (Prentice Hall PTR, Upper Saddle River, 1997)

    Google Scholar 

  48. J. Han, M. Kamber, J. Pei, Data Mining: Concepts and Techniques, 3rd edn. (Morgan Kaufman, San Francisco, 2011)

    Google Scholar 

  49. R.A. Johnson, D.W. Wichern, Applied Multivariate Statistical Analysis (Prentice-Hall, Inc., Upper Saddle River, 2002)

    Google Scholar 

  50. A. Jøsang, R. Ismail, C. Boyd, A survey of trust and reputation systems for online service provision. Decis. Support Syst. 43(2), 618–644 (2007)

    Article  Google Scholar 

  51. R.E. Kalman, A new approach to linear filtering and prediction problems. Trans. ASME J. Basic Eng. 82(Series D), 35–45 (1960)

    Google Scholar 

  52. H. Kargutpa, S. Datta, Q. Wang, K. Sivakumar, On the privacy preserving properties of random data perturbation techniques. in Proceedings of the IEEE International Conference on Data Mining, pp. 99–106, 2003

    Google Scholar 

  53. J.M. Kleinberg, Authoritative sources in a hyperlinked environment. J. ACM 46(5), 604–632 (1999)

    Article  MATH  MathSciNet  Google Scholar 

  54. X. Li, J. Han, Z. Yin, J.-G. Lee, Y. Sun, Sampling cube: a framework for statistical OLAP over sampling data. in Proceedings 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD’08), Vancouver, BC, Canada, June 2008

    Google Scholar 

  55. Q. Lian, Z. Zhang, M. Yang, B. Y. Zhao, Y. Dai, X. Li, An empirical study of collusion behavior in the maze p2p file-sharing system. in Proceedings of the 27th International Conference on Distributed Computing Systems, ICDCS ’07, p. 56. IEEE Computer Society, Washington, DC, USA, 2007

    Google Scholar 

  56. B. Longstaff, S. Reddy, D. Estrin, Improving activity classification for health applications on mobile devices using active and semi-supervised, learning, p. 6, 2010

    Google Scholar 

  57. A. Madan, M. Cebrian, D. Lazer, A. Pentland, Social sensing for epidemiological behavior change. in Proceedings of the 12th ACM International Conference on Ubiquitous Computing, Ubicomp ’10, pp. 291–300. ACM, New York, NY, USA, 2010

    Google Scholar 

  58. A. Madan, S.T. Moturu, D. Lazer, A. Pentland, Social sensing: obesity, unhealthy eating and exercise in face-to-face networks. in Wireless, Health, pp. 104–110, 2010

    Google Scholar 

  59. G.J. McLachlan, T. Krishnan, The Em Algorithm and Extensions (Wiley, New York, 1997)

    Google Scholar 

  60. E. Miluzzo, N.D. Lane, K. Fodor, R. Peterson, H. Lu, M. Musolesi, S.B. Eisenman, X. Zheng, A.T. Campbell, Sensing meets mobile social networks: the design, implementation and evaluation of the cenceme application. in Proceedings of the 6th ACM conference on Embedded network sensor systems, SenSys ’08, pp. 337–350. ACM, New York, NY, USA, 2008

    Google Scholar 

  61. M. Mun, S. Reddy, K. Shilton, N. Yau, J. Burke, D. Estrin, M. Hansen, E. Howard, R. West, P. Boda, Peir, the personal environmental impact report, as a platform for participatory sensing systems research. in Proceedings of the 7th International Conference on Mobile Systems, Applications, and Services, MobiSys ’09, pp. 55–68. ACM, New York, NY, USA, 2009

    Google Scholar 

  62. N. Mustapha, M. Jalali, M. Jalali, Expectation maximization clustering algorithm for user modeling in web usage mining systems. Eur. J. Sci. Res. 32(4), 467–476 (2009)

    Google Scholar 

  63. S. Nath, Ace: exploiting correlation for energy-efficient and continuous context sensing. in Proceedings of the Tenth International Conference on Mobile systems, Applications, and Services (MobiSys’12) 2012

    Google Scholar 

  64. T. Park, J. Lee, I. Hwang, C. Yoo, L. Nachman, J. Song, E-gesture: a collaborative architecture for energy-efficient gesture recognition with hand-worn sensor and mobile devices. in Proceedings of the 9th ACM Conference on Embedded Networked Sensor Systems, SenSys ’11, pp. 260–273. ACM, New York, NY, USA, 2011

    Google Scholar 

  65. J. Pasternack, D. Roth, Knowing what to believe (when you already know something). in International Conference on Computational Linguistics (COLING) 2010

    Google Scholar 

  66. N. Pham, R. Ganti, M.Y. Uddin, S. Nath, T. Abdelzaher, Privacy-preserving reconstruction of multidimensional data maps in vehicular participatory sensing. in EWSN, Coimbra, Portugal, Feb 2010

    Google Scholar 

  67. N. Pham, R.K. Ganti, Y.S. Uddin, S. Nath, T. Abdelzaher, Privacy-preserving reconstruction of multidimensional data maps in vehicular participatory sensing. in Proceedings of the 7th European Conference on Wireless Sensor Networks, EWSN’10, pp. 114–130. Springer-Verlag, Berlin, Heidelberg, 2010

    Google Scholar 

  68. D. Pomerantz, G. Dudek, Context dependent movie recommendations using a hierarchical bayesian model. in Proceedings of the 22nd Canadian Conference on Artificial Intelligence: Advances in Artificial Intelligence, Canadian AI ’09, pp. 98–109. Springer-Verlag, Berlin, Heidelberg, 2009

    Google Scholar 

  69. R. Ramakrishnan, B.-C. Chen, Exploratory mining in cube space. Data Min. Knowl. Discovery 15, 29–54 (2007)

    Article  MathSciNet  Google Scholar 

  70. S. Reddy, D. Estrin, M. Srivastava, Recruitment framework for participatory sensing data collections. in Proceedings of the 8th International Conference on Pervasive Computing, pp. 138–155. Springer, Berlin, Heidelberg, May 2010

    Google Scholar 

  71. S. Reddy, K. Shilton, G. Denisov, C. Cenizal, D. Estrin, M. Srivastava, Biketastic: sensing and mapping for better biking. in Proceedings of the 28th International Conference on Human Factors in Computing Systems, CHI ’10, pp. 1817–1820. ACM, New York, NY, USA, 2010

    Google Scholar 

  72. Sense Networks. Cab Sense. http://www.cabsense.com

  73. V.S. Sheng, F. Provost, P.G. Ipeirotis, Get another label? Improving data quality and data mining using multiple, noisy labelers. in Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’08, pp. 614–622. ACM, New York, NY, USA, 2008

    Google Scholar 

  74. Y. Sun, Y. Yu, J. Han, Ranking-based clustering of heterogeneous information networks with star network schema. in 15th SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’09), pp. 797–806, 2009

    Google Scholar 

  75. P.-N. Tan, M. Steinbach, V. Kumar, Introduction to Data Mining (Addison Wesley, Boston, 2005)

    Google Scholar 

  76. A. Thiagarajan, J. Biagioni, T. Gerlich, J. Eriksson, Cooperative transit tracking using smart-phones. in SenSys’10, pp. 85–98, 2010

    Google Scholar 

  77. A.N. Tikhonov, V.Y. Arsenin, Solution of Ill Posed Problems (V. H. Winstons and Sons, Washington, 1977)

    Google Scholar 

  78. D. Wang, T. Abdelzaher, H. Ahmadi, J. Pasternack, D. Roth, M. Gupta, J. Han, O. Fatemieh, H. Le, On bayesian interpretation of fact-finding in information networks. in 14th International Conference on Information Fusion (Fusion 2011), 2011

    Google Scholar 

  79. D. Wang, H. Ahmadi, T. Abdelzaher, H. Chenji, R. Stoleru, C. Aggarwal, Optimizing quality-of-information in cost-sensitive sensor data fusion. in IEEE 7th International Conference on Distributed Computing in Sensor Systems (DCoSS 11), June 2011

    Google Scholar 

  80. D. Wang, L. Kaplan, T. Abdelzaher, C. Aggarwal, On scalability and robustness limitations of real and asymptotic confidence bounds in social sensing. in 9th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks (SECON), Seoul, Korea, June 2012

    Google Scholar 

  81. D. Wang, H. Le, L. Kaplan, T. Abdelzaher. On truth discovery in social sensing: a maximum likelihood estimation approach. in 11th ACM/IEEE Conference on Information Processing in Sensor Networks (IPSN), April 2012

    Google Scholar 

  82. M.E. Whitman, H.J. Mattord, Principles of Information Security (Course Technology Press, Boston, 2004)

    Google Scholar 

  83. C.F.J. Wu, On the convergence properties of the EM algorithm. Ann. Stat. 11(1), 95–103 (1983)

    Google Scholar 

  84. J. Xie, S. Sreenivasan, G. Korniss, W. Zhang, C. Lim, B.K. Szymanski, Social consensus through the influence of committed minorities. CoRR, abs/1102.3931, 2011

    Google Scholar 

  85. Z. Yang, S. Zhong, R.N. Wright, Privacy-preserving classification of customer data without loss of accuracy. in Proceedings of SIAM International Conference on Data Mining, pp. 92–102, 2005

    Google Scholar 

  86. X. Yin, J. Han, P.S. Yu, Truth discovery with multiple conflicting information providers on the web. IEEE Trans. Knowl. Data Eng. 20, 796–808 (2008)

    Google Scholar 

  87. X. Yin, W. Tan, Semi-supervised truth discovery. in WWW. ACM, New York, NY, USA, 2011

    Google Scholar 

  88. H. Yu, M. Kaminsky, P.B. Gibbons, A. Flaxman, Sybilguard: defending against sybil attacks via social networks. SIGCOMM Comput. Commun. Rev. 36, 267–278 (2006)

    Article  Google Scholar 

  89. C. Zhai, A note on the expectation maximization (em) algorithm. University of Illinois at Urbana Champaign, Department of Computer Scinece, 2007

    Google Scholar 

  90. B. Zhao, B.I.P. Rubinstein, J. Gemmell, J. Han, A Bayesian approach to discovering truth from conflicting sources for data integration. Proc. VLDB Endow. 5(6), 550–561 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tarek Abdelzaher .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Abdelzaher, T., Wang, D. (2014). Analytic Challenges in Social Sensing. In: Ammari, H. (eds) The Art of Wireless Sensor Networks. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40066-7_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40066-7_16

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40065-0

  • Online ISBN: 978-3-642-40066-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics