Skip to main content

Comparing Machine Learning and Information Retrieval-Based Approaches for Filtering Documents in a Parliamentary Setting

  • Conference paper
  • First Online:
Scalable Uncertainty Management (SUM 2017)

Abstract

We consider the problem of building a content-based recommender/filtering system in a parliamentary context which, given a new document to be recommended, can decide those Members of Parliament who should receive it. We propose and compare two different approaches to tackle this task, namely a machine learning-based method using automatic document classification and an information retrieval-based approach that matches documents and legislators’ representations. The information necessary to build the system is automatically extracted from the transcriptions of the speeches of the members of parliament within the parliament debates. Our proposals are experimentally tested for the case of the regional Andalusian Parliament at Spain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.parlamentodeandalucia.es.

  2. 2.

    https://cran.r-project.org.

  3. 3.

    https://lucene.apache.org.

References

  1. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley, Boston (2011)

    Google Scholar 

  2. Belkin, N.J., Croft, W.B.: Information filtering and information retrieval: two sides of the same coin? Commun. ACM 35, 29–38 (1992)

    Article  Google Scholar 

  3. Bobadilla, J., Hernando, A., Fernando, O., Gutiérrez, A.: Recommender systems survey. Knowl. Based Syst. 46, 109–132 (2013)

    Article  Google Scholar 

  4. Billsus, D., Pazzani, M., Chen, J.: A learning agent for wireless news access. In: Proceedings of the International Conference on Intelligent User Interfaces, pp. 33–36 (2002)

    Google Scholar 

  5. Cohen, W.: Learning rules that classify e-mail. In: Papers from the AAAI Spring Symposium on Machine Learning in Information Access, pp. 18–25 (1996)

    Google Scholar 

  6. Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, New York (2000)

    Google Scholar 

  7. de Campos, L.M., Fernández-Luna, J.M., Huete, J.F.: A lazy approach for filtering parliamentary documents. In: Kő, A., Francesconi, E. (eds.) EGOVIS 2015. LNCS, vol. 9265, pp. 364–378. Springer, Cham (2015). doi:10.1007/978-3-319-22389-6_26

    Chapter  Google Scholar 

  8. de Campos, L.M., Fernández-Luna, J.M., Huete, J.F., Martin-Dancausa, C.J., Tur-Vigil, C., Tagua, A.: An integrated system for managing the andalusian parliament’s digital library. Program Electron. Libr. Inf. Syst. 43, 121–139 (2009)

    Article  Google Scholar 

  9. Foltz, P., Dumais, S.: Personalized information delivery: an analysis of information filtering methods. Commun. ACM 35, 51–60 (1992)

    Article  Google Scholar 

  10. Hanani, U., Shapira, B., Shoval, P.: Information filtering: overview of issues, research and systems. User Model. User Adapt. Interact. 11, 203–259 (2001)

    Article  MATH  Google Scholar 

  11. Jarvelin, K., Kekalainen, J.: Cumulative gain-based evaluation of IR techniques. ACM Trans. Inf. Syst. 20, 422–446 (2002)

    Article  Google Scholar 

  12. Kim, J., Lee, B., Shaw, M., Chang, H., Nelson, W.: Application of decision-tree induction techniques to personalized advertisements on internet storefronts. Int. J. Electron. Commerce 5, 45–62 (2001)

    Google Scholar 

  13. Jennings, A., Higuchi, H.: A user model neural network for a personal news service. User Model. User Adapt. Interact. 3, 1–25 (1993)

    Article  Google Scholar 

  14. Lantz, B.: Machine Learning with R. Packt Publishing Ltd., Birmingham (2013)

    Google Scholar 

  15. Loeb, S.: Architecting personal delivery of multimedia information. Commun. ACM 35, 39–48 (1992)

    Article  Google Scholar 

  16. Lops, P., de Gemmis, M., Semeraro, G.: Content-based recommender systems: state of the art and trends. In: Ricci, F., Rokach, L., Shapira, B., Kantor, P. (eds.) Recommender Systems Handbook. Springer, Boston (2011)

    Google Scholar 

  17. Lu, J., Wu, D., Mao, M., Wang, W., Zhang, G.: Recommender system application developments: a survey. Decis. Support Syst. 74, 12–32 (2015)

    Article  Google Scholar 

  18. Pazzani, M.J., Billsus, D.: Content-based recommendation systems. In: Brusilovsky, P., Kobsa, A., Nejdl, W. (eds.) The Adaptive Web. LNCS, vol. 4321, pp. 325–341. Springer, Heidelberg (2007). doi:10.1007/978-3-540-72079-9_10

    Chapter  Google Scholar 

  19. Pazzani, M., Billsus, D.: Learning and revising user profiles: the identification of interesting web sites. Mach. Learn. 27, 313–331 (1997)

    Article  Google Scholar 

  20. Ribadas, F.J., de Campos, L.M., Fernández-Luna, J.M., Huete, J.F.: Concept profiles for filtering parliamentary documents. In: Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, vol. 1, pp. 409–416 (2015)

    Google Scholar 

  21. Shamin, J., Neuhold, C.: ‘Connecting Europe’: the use of ‘new’ information and communication technologies within European parliament standing committees. J. Legislative Stud. 13, 388–402 (2007)

    Article  Google Scholar 

  22. Tjoa, A.M., Hofferer, M., Ehrentraut, G., Untersmeyer, P.: Applying evolutionary algorithms to the problem of information filtering. In: Proceedings of the 8th International Workshop on Database and Expert Systems Applications, pp. 450–458 (1997)

    Google Scholar 

  23. Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining multi-label data. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook. Springer, Boston (2009)

    Google Scholar 

  24. Zhang, B., Zuo, W.: Learning from positive and unlabeled examples: a survey. In: International Symposiums on Information Processing, pp. 650–654 (2008)

    Google Scholar 

Download references

Acknowledgements

This work has been funded by the Spanish “Ministerio de Economía y Competitividad” under projects TIN2013-42741-P and TIN2016-77902-C3-2-P, and the European Regional Development Fund (ERDF-FEDER).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luis M. de Campos .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

de Campos, L.M., Fernández-Luna, J.M., Huete, J.F., Redondo-Expósito, L. (2017). Comparing Machine Learning and Information Retrieval-Based Approaches for Filtering Documents in a Parliamentary Setting. In: Moral, S., Pivert, O., Sánchez, D., Marín, N. (eds) Scalable Uncertainty Management. SUM 2017. Lecture Notes in Computer Science(), vol 10564. Springer, Cham. https://doi.org/10.1007/978-3-319-67582-4_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67582-4_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67581-7

  • Online ISBN: 978-3-319-67582-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics