Research Status and Prospect of Data Extraction and Cleaning Technology in Large Environment

  • Mingzhe Wang
  • Zhaochan Li
Conference paper
Part of the Springer Proceedings in Business and Economics book series (SPBE)


In the era of big data, obtaining useful data from massive datasets, and then analyzing and processing that data, is particularly important. This paper first introduces the importance of data cleaning and data extraction technology, then describes the two technologies from different angles, summarizes current domestic and international research on them, and finally discusses their development prospects. It is intended to offer some guidance for future research on data extraction and cleaning technology.


Data cleaning; Data fetch; Research status; Developing prospect



Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. Department of Economic Management, China Institute of Industrial Relations, Beijing, China
  2. Beijing Wuzi University, Beijing, China
