Chapter

Database Systems for Advanced Applications

Volume 7239 of the series Lecture Notes in Computer Science pp 324-325

Detecting Clones, Copying and Reuse on the Web (DASFAA 2012 Tutorial)

  • Xin Luna DongAffiliated withAT&T Labs-Research
  • , Divesh SrivastavaAffiliated withAT&T Labs-Research

* Final gross prices may vary according to local VAT.

Get Access

Abstract

The Web has enabled the availability of a vast amount of useful information in recent years. However, the Web technologies that have enabled sources to share their information have also made it easy for sources to copy from each other and often publish without proper attribution. Understanding the copying relationships between sources has many benefits, including helping data providers protect their own rights, improving various aspects of data integration, and facilitating in-depth analysis of information flow.