Skip to main content

Position Weighted Decision Trees for Ranking Data

  • Conference paper
  • First Online:
Models for Data Analysis (SIS 2018)

Part of the book series: Springer Proceedings in Mathematics & Statistics ((PROMS,volume 402))

Included in the following conference series:

  • 283 Accesses

Abstract

Preference data represent a particular type of ranking data where a group of people gives their preferences over a set of alternatives. Within this framework, distance-based decision trees represent a non-parametric tool for identifying the profiles of subjects giving a similar ranking. This paper aims at detecting, in the framework of (complete and incomplete) ranking data, the impact of the differently structured weighted distances for building decision trees. By means of simulations, we will compute the impact of higher/lower homogeneity in groups and different weighting structures both on splitting and on consensus ranking. The distances that will be used satisfy Kemeny’s axioms and, accordingly, a modified version of the rank correlation coefficient \(\tau _x\), proposed by Emond and Mason, will be proposed and used for rank aggregation and class label in the tree leaves.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. D’Ambrosio, A.: Tree based methods for data editing and preference rankings. Ph.D. thesis, Universitá degli Studi di Napoli “Federico II” (2007)

    Google Scholar 

  2. D’Ambrosio, A., Heiser, W.J.: A recursive partitioning method for the prediction of preference rankings based upon Kemeny distances. Psychometrika 81(3), 774–794 (2016)

    Article  MathSciNet  MATH  Google Scholar 

  3. Cook, W.D.: Distance based and ad hoc consensus models in ordinal preference ranking. Eur. J. Oper. Res. 172, 369–385 (2006)

    Article  MathSciNet  MATH  Google Scholar 

  4. Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and regression trees. Chapman & Hall (1984)

    Google Scholar 

  5. Cheng, W., and Hühn, J., and Hüllermeier, E.: Decision tree and instance-based learning for label ranking. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 161–168 (2009)

    Google Scholar 

  6. Chen, J., Li, Y., Feng, L.: On the equivalence of weighted metrics for distance measures between two rankings. J. Inf. Comput. Sci. 11(13), 4477–4485 (2014)

    Article  Google Scholar 

  7. D’Ambrosio, A., Amodio, S., Mazzeo, G.: ConsRank: compute the median ranking(s) according to the Kemeny’s axiomatic approach. R package version 2.0.1 (2017). https://CRAN.R-project.org/package=ConsRank

  8. D’ambrosio, A., Amodio, S., Iorio, C.: Two algorithms for finding optimal solutions of the Kemeny rank aggregation problem for full rankings. Electron. J. Appl. Stat. Anal. 8(2), 198–213 (2015)

    MathSciNet  Google Scholar 

  9. D’Ambrosio, A., Heiser, W.J.: A recursive partitioning method for the prediction of preference rankings based upon Kemeny distances. Psychometrika 81(3), 774–794 (2016)

    Article  MathSciNet  MATH  Google Scholar 

  10. Dittrich, R., Hatzinger, R., Katzenbeisser, W.: Modelling the effect of subject-specific covariates in paired comparison studies with an application to university rankings. J. R. Stat. Soc. C (Appl. Stat.) 47(4), 511–525 (1998)

    Article  MATH  Google Scholar 

  11. Emond, E.J., Mason, D.W.: A new rank correlation coefficient with application to the concensus ranking problem. J. Multi-Criteria Decis. Anal. 11, 17–28 (2002)

    Article  MATH  Google Scholar 

  12. García-Lapresta, J.L., Pérez-Román, D.: Consensus measures generated by weighted Kemeny distances on weak orders. In: Proceedings of the 10th International Conference on Intelligent Systems Design and Applications, Cairo (2010)

    Google Scholar 

  13. Henzgen, S., Hüllermeier, E.: Weighted rank correlation: a flexible approach based on fuzzy order relations. Jt. Eur. Conf. Mach. Learn. Knowl. Discov. Databases 422–437 (2015)

    Google Scholar 

  14. Irurozki, E., Calvo, B., Lozano, J.A.: PerMallows: an R package for mallows and generalized mallows models. J. Stat. Softw. 71(12), 1–30 (2016). https://doi.org/10.18637/jss.v071.i12

    Article  MATH  Google Scholar 

  15. Kemeny, J.G., Snell, J.L.: Preference rankings an axiomatic approach. MIT Press (1962)

    Google Scholar 

  16. Kumar, R., Vassilvitskii, S.: Generalized Distances Between Rankings. In: Proceedings of the 19th International Conference on World Wide Web, WWW ’10, pp. 571–580. ACM, New York, NY, USA (2010)

    Google Scholar 

  17. Lee, P.H., Yu, P.L.H.: Distance-based tree models for ranking data. Comput. Stat. Data Anal. 54(6), 1672–1682 (2010)

    Google Scholar 

  18. Mallows, C.L.: Non-null ranking models. Biometrika 44(1–2), 114–130 (1957)

    Article  MathSciNet  MATH  Google Scholar 

  19. Piccarreta, R.: Binary trees for dissimilarity data. Comput. Stat. Data Anal. 54(6), 1516–1524 (2010)

    Article  MathSciNet  MATH  Google Scholar 

  20. Plaia, A., Sciandra, M.: Weighted distance-based trees for ranking data. In: Advances in Data Analysis and Classification, pp. 1–18. Springer, Berlin (2017) https://doi.org/10.1007/s11634-017-0306-x

  21. Plaia, A., Buscemi, S., Sciandra, M.: A new position weight correlation coefficient for consensus ranking process without ties. Stat 8, e236 (2019). https://doi.org/10.1002/sta4.236

  22. R Core Team, R.: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2018). https://www.R-project.org/

  23. Strobl, C., Wickelmaier, F., Zeileis, A.: Accounting for individual differences in Bradley-Terry models by means of recursive partitioning. J. Educ. Behav. Stat. 36(2), 135–153 (2011)

    Article  Google Scholar 

  24. Therneau, T., Clinic, M.: User written splitting functions for RPART (2015)

    Google Scholar 

  25. Therneau, T., Atkinson, B., Ripley, B.: rpart: recursive partitioning and regression trees. R package version 4.1–10 (2015)

    Google Scholar 

  26. Yilmaz, E., Aslam, J.A., Robertson, S.: A new rank correlation coefficient for information retrieval. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 587–594 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Antonella Plaia .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Plaia, A., Buscemi, S., Sciandra, M. (2023). Position Weighted Decision Trees for Ranking Data. In: Brentari, E., Chiodi, M., Wit, EJ.C. (eds) Models for Data Analysis. SIS 2018. Springer Proceedings in Mathematics & Statistics, vol 402. Springer, Cham. https://doi.org/10.1007/978-3-031-15885-8_15

Download citation

Publish with us

Policies and ethics