Data Mining - A Tool for Migration Stock Prediction
The migration phenomenon is an important issue for most European Union countries, and it has a major socio-economic impact on all parties involved. After 1989, a massive migration process began from Romania towards Western European countries. Besides qualified personnel in search of new and different opportunities, Roma people became more visible as they emigrated to countries with high living standards, where their arrival generated significant integration problems along with costs. In order to identify the problems faced by the Roma community in Rennes, a group of sociologists developed a questionnaire that contains, among other questions, one about the intention of returning home. This paper presents research that aims to build various models, using data mining techniques, to predict whether Roma people will return to their home country after a five-year interval. The second goal is to assess these models and to identify the aspects that most influence the decision-making process. The results are based on the questionnaires completed by more than 100 persons from Rennes.
Keywords: Data mining, Classification, CRISP-DM model, Migration phenomenon, Return prediction
The infrastructure used for this work was partially supported by the project "Integrated Center for research, development and innovation in Advanced Materials, Nanotechnologies, and Distributed Systems for fabrication and control", Contract No. 671/09.04.2015, Sectorial Operational Program for Increase of the Economic Competitiveness, co-funded from the European Regional Development Fund. We are grateful to Mrs. Ionela Galbau, PhD, for giving us access to the raw questionnaire data.