The Empirical Impact of the Nature of Novelty Detection
Novelty detection systems aim to remove redundant documents or sentences from a chronologically ordered list of documents. In this task, sentences that appear later in the list and carry no new meaning are eliminated. An accompanying paper revealed the nature of novelty detection: novelty is a combination of the PO (partial overlap) and CO (complete overlap) relations, which can be treated as two classification tasks; that paper also gave the theoretical impacts. This paper shows what the nature of the task means empirically. One new method, selected pool, which implements the nature of the task, gained improvements on the TREC Novelty datasets. New evaluation criteria are also given, which follow naturally from this view of novelty detection.
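To make the task setup concrete, the following is a minimal sketch of a generic word-overlap baseline for filtering a chronologically ordered list of sentences. It is not the selected-pool method, and it does not model the PO and CO relations as separate classifiers; the function names, the whitespace tokenization, and the `threshold` value are all assumptions chosen for illustration.

```python
def word_set(sentence):
    """Lowercased whitespace tokens of a sentence (a deliberately crude tokenizer)."""
    return set(sentence.lower().split())


def filter_novel(sentences, threshold=0.8):
    """Return the sentences judged novel, preserving chronological order.

    A sentence is treated as redundant when at least `threshold` of its
    words already occur in some single earlier sentence. The threshold
    is a hypothetical parameter, not a value taken from the paper.
    """
    seen = []    # word sets of all earlier sentences, novel or not
    novel = []
    for sentence in sentences:
        words = word_set(sentence)
        if not words:
            continue  # skip empty sentences
        # Highest fraction of this sentence's words covered by one earlier sentence.
        coverage = max((len(words & earlier) / len(words) for earlier in seen),
                       default=0.0)
        if coverage < threshold:
            novel.append(sentence)
        seen.append(words)
    return novel
```

The coverage measure is asymmetric on purpose: it asks how much of the later sentence is already contained in an earlier one, which matches the chronological framing of the task. A real system would likely replace the raw word overlap with a stronger similarity measure.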