Skip to main content
Log in

A novel truth prediction algorithm for ascertaining the truthfulness of the data and reliability of the users in crowdsourcing application

  • Focus
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

In recent years, the concept of crowdsourcing becomes more popular and on demand due to the capability of solving many real-world problems in less time. The crowd-sourcing has several ben- efits such as increased speed, flexibility and efficiency in terms of collecting and processing the data. One of the major demands in this crowdsourcing is to find the truthfulness of the obtained data and to ascertain trustworthiness of the sources. Several existing approaches are used for truth discovery among crowd-sourced-based applications. But they work efficiently only for structured data and the accuracy levels are dependent on the availability of ground truth values. To over- come the above-mentioned issues a novel truth discovery model is proposed. In this work it uses knowledge extraction and truth discovery phases to process the data. Reliability scores are assigned to trustworthy users based on the frequency with which they provide true values over a period of time. It works on both categorical and continues data. So, finally the proposed model is tested on three real-valued data sets. Experimental results prove to provide better performance than the conventional approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Data availability

Enquiries about data availability should be directed to the authors.

References

  • Berti-Equille L (2015) Data veracity estimation with ensembling truth discovery methods, in: and others (Ed.) In: 2015 IEEE international conference on big data (Big Data), pp 2628–2636

  • Chu CH, Chang WC, Lin YI (2020) An exploratory study on computer-aided affective product design based on crowdsourcing. J Ambient Intell Human Comput 11:1–13

    Article  Google Scholar 

  • Galland A, Abiteboul S, Marian A, Senellart P (2010) Corroborating information from disagreeing views, WSDM. pp 131–140

  • Jiang L, Niu X, Xu J, Yang D, Xu L (2019) Incentivizing the workers for truth discovery in crowdsourcing with copiers. In: 2019 IEEE 39th international conference on distributed computing systems (ICDCS) 2019 pp 1286–1295

  • Kaliyar RK, Goswami A, Narang P (2021) FakeBERT: Fake news detection in social media with a BERT-based deep learning approach. Multimed Tools Appl 80:11765–11788. https://doi.org/10.1007/s11042-020-10183-2

    Article  Google Scholar 

  • Kleinberg JM (1999) Authoritative sources in a hyperlinked environment. J ACM 46(5):604–632

    Article  MATH  Google Scholar 

  • Lee S, Park S, Park S. (2014) A quality enhancement of crowdsourcing based on quality evaluation and user-level task assignment framework. In: 2014 International conference on big data and smart computing (BIGCOMP) pp 60–65)

  • Li X, Dong XL, Lyons K, Meng W, Srivastava D (2012) Truth finding on the deep web: is the problem solved? Proc VLDB Endowment 6(2):97–108. https://doi.org/10.14778/2535568.2448943

    Article  Google Scholar 

  • Li T, Gu Y, Zhou X, Ma Q, Yu G (2017) An effective and efficient truth discovery framework over data streams, EDBT pp 180–191

  • Lofi C, Balke WT (2015) Large Scale Cooperation Scenarios-Crowdsourcing and its Societal Implication, The IPSI BgD Transactions on Internet Research, 3–3

  • Ouyang RW, Kaplan LM, Toniolo A, Srivastava M, Norman TJ (2016) Parallel and streaming truth discovery in large-scale quantitative crowdsourcing. IEEE Trans Parallel Distrib Syst 27(10):2984–2997. https://doi.org/10.1109/tpds.2016.2515092

    Article  Google Scholar 

  • Pasternack J, Roth D, (2010) Knowing what to believe (when you already know something), In: International Conference on Computational Linguistics (COLING)

  • Pasternack J, Roth D, (2013) Latent credibility analysis, in: Proceedings of the 22nd international conference on World Wide Web, pp 1009–1020

  • Poongodi M, Bose S (2015) Detection and prevention system towards the truth of convergence on decision using aumann agreement theorem. Procedia Computer Science 50:244–251. https://doi.org/10.1016/j.procs.2015.04.053

    Article  Google Scholar 

  • Rajpurkar P, Zhang J, Lopyrev K, Liang P (2016) Squad: 100,000+ questions for machine comprehension of text, arXiv preprint arXiv:1606.05250

  • Sharma S (2022) Fact-finding knowledge-aware search engine. In: Sharma N, Chakrabarti A, Balas VE, Bruckstein AM (eds) Data management, analytics and innovation: proceedings of ICDMAI 2021. Springer Singapore, Singapore. https://doi.org/10.1007/978-981-16-2937-2_17

    Chapter  Google Scholar 

  • Thiyagaraj MPB, Aloysius A (2017) A survey on truth discovery methods for big data. Int J Comput Intell Res 13:1799–1810

    Google Scholar 

  • Van Der Putten P, Kok J, Gupta A, (2002) Data fusion through statistical matching

  • Wan C, Phoha VV, Huang D (2018) TR-MCN: light weight task recommendation for mobile crowdsourcing networks. J Ambient Intell Humaniz Comput 9(4):1027–1038. https://doi.org/10.1007/s12652-017-0505-5

    Article  Google Scholar 

  • Yin X, Jiawei Han P (2008) Truth discovery with multiple conflicting information providers on the web. IEEE Trans Knowl Data Eng 20(6):796–808. https://doi.org/10.1109/TKDE.2007.190745

    Article  Google Scholar 

  • Zhao B, Rubinstein BIP, Gemmell J, Han J (2012) A bayesian approach to discovering truth from conflicting sources for data integration. Proc VLDB Endow 5(6):550–561. https://doi.org/10.14778/2168651.2168656

    Article  Google Scholar 

  • Zhi S, Yang F, Zhu Z, Li Q, Wang Z, Han J (2018) Dynamic Truth Discovery on Numerical Data

Download references

Funding

The authors have not disclosed any funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Adilakshmi Vadavalli.

Ethics declarations

Conflict of interest

The authors have not disclosed any competing interests.

Additional information

Communicated by Meng Joo.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vadavalli, A., Subhashini, R. A novel truth prediction algorithm for ascertaining the truthfulness of the data and reliability of the users in crowdsourcing application. Soft Comput 27, 1685–1698 (2023). https://doi.org/10.1007/s00500-022-07095-8

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-022-07095-8

Keywords

Navigation