Contextual Probability Estimation from Data Samples – A Generalisation

Wang, Hui; Wang, Bowen

doi:10.1007/978-3-319-99368-3_26

Hui Wang¹⁷ &
Bowen Wang¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11103))

Included in the following conference series:

International Joint Conference on Rough Sets

965 Accesses

Abstract

Contextual probability (G) provides an alternative, efficient way of estimating (primary) probability (P) in a principled way. G is defined in terms of P in a combinatorial way, and they have a simple linear relationship. Consequently, if one is known, the other can be calculated. It turns out G can be estimated based on a set of data samples through a simple process called neighbourhood counting. Many results about contextual probability are obtained based on the assumption that the event space is the power set of the sample space. However, the real world is usually not the case. For example, in a multidimensional sample space, the event space is typically the set of hyper tuples which is much smaller than the power set. In this paper, we generalise contextual probability to multidimensional sample space where the attributes may be categorical or numerical. We present results about the normalisation constant, the relationship between G and P and the neighbourhood counting process.

Hui Wang gratefully acknowledges support by EU Horizon 2020 Programme (700381, ASGARD).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The concept of neighbourhood is used in different contexts with possibly different definitions. The use of this concept in this paper is defined as such.
2.
This is common in statistics. See, e.g., [3].
3.
https://en.wikipedia.org/wiki/Borel_set.

References

Ash, R.B., Doléans-Dade, C.: Probability and Measure Theory. Academic Press, San Diego (2000)
MATH Google Scholar
Chen, S., Ma, B., Zhang, K.: On the similarity and the distance metric. Theoret. Comput. Sci. 410(24–25), 2365–2376 (2009)
Article MathSciNet Google Scholar
Duda, R.O., Hart, P.E.: Pattern Classification and Scene Analysis. Wiley, New York (1973)
MATH Google Scholar
Feller, W.: An Introduction to Probability Theory and Its Applications. Wiley, New York (1968)
MATH Google Scholar
Hajek, A.: Probability, logic and probability logic. In: Goble, L. (ed.) Blackwell Companion to Logic, pp. 362–384. Blackwell, Oxford (2000)
Google Scholar
Lin, Z., Lyu, M., King, I.: Matchsim: a novel similarity measure based on maximum neighborhood matching. Knowl. Inf. Syst. 32, 141–166 (2012)
Article Google Scholar
Mani, A.: Comparing dependencies in probability theory and general rough sets: Part-a. arXiv:1804.02322v1
Mani, A.: Probabilities, dependence and rough membership functions. Int. J. Comput. Appl. 39, 17–35 (2017)
Google Scholar
TolgaKahraman, H.: A novel and powerful hybrid classifier method: development and testing of heuristic k-nn algorithm with fuzzy distance metric. Data Knowl. Eng. 103, 44–59 (2016)
Article Google Scholar
Wang, H.: Nearest neighbors by neighborhood counting. IEEE Trans. Pattern Anal. Mach. Intell. 28(6), 942–953 (2006)
Article Google Scholar
Wang, H., Düentsch, I., Trindade, L.: Lattice machine classification based on contextual probability. Fundamenta Informaticae 127(1–4), 241–256 (2013). https://doi.org/10.3233/FI-2013-907
Article MathSciNet MATH Google Scholar
Wang, H., Düntsch, I., Gediga, G., Skowron, A.: Hyperrelations in version space. Int. J. Approximate Reasoning 36(3), 223–241 (2004)
Article MathSciNet Google Scholar
Wang, X., Ouyang, J., Chen, G.: Simplifying calculation of graph similarity through matrices. In: Li, D., Li, Z. (eds.) CCTA 2015. IAICT, vol. 479, pp. 417–428. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-48354-2_41
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Ulster University, Jordanstown, UK
Hui Wang
Mavern Securities, London, UK
Bowen Wang

Authors

Hui Wang
View author publications
You can also search for this author in PubMed Google Scholar
Bowen Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hui Wang .

Editor information

Editors and Affiliations

University of Warsaw, Warsaw, Poland
Hung Son Nguyen
Faculty of Information Technology, Vietnam National University, Hanoi, Vietnam
Quang-Thuy Ha
School of Information Science, Southwest Jiaotong University, Chengdu, China
Tianrui Li
Institute of Computer Science, University of Silesia, Sosnowiec, Poland
Małgorzata Przybyła-Kasperek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, H., Wang, B. (2018). Contextual Probability Estimation from Data Samples – A Generalisation. In: Nguyen, H., Ha, QT., Li, T., Przybyła-Kasperek, M. (eds) Rough Sets. IJCRS 2018. Lecture Notes in Computer Science(), vol 11103. Springer, Cham. https://doi.org/10.1007/978-3-319-99368-3_26

Download citation

DOI: https://doi.org/10.1007/978-3-319-99368-3_26
Published: 15 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99367-6
Online ISBN: 978-3-319-99368-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Contextual Probability Estimation from Data Samples – A Generalisation