Detecting Link and Landing Page Misalignment in Marketing Emails

  • Nedim LipkaEmail author
  • Tak Yeon Lee
  • Eunyee Koh
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 903)


Links and their landing pages in the World Wide Web are oftentimes flawed or irrelevant. We created a data set of 4266 links within 160 marketing emails whose relevance with landing pages have been evaluated by crowd workers. We present a study of common misalignments and propose methods for detecting these misalignments. An F-score of 0.63 can be achieved by a neural network for cases where the misaligned label requires the majority out of 5 crowd worker votes.


  1. 1.
    Jesdanun, A.: Internet Littered with Dead Web Sites. The Associated Press (2003)Google Scholar
  2. 2.
    Lee, T.Y., Koh, E.: Identifying types of misalignments between promotion emails and landing pages. In: CHI 2018 Extended Abstracts on Human Factors in Computing Systems (2018)Google Scholar
  3. 3.
    Fetterly, D., Manasse, M., Najork, M., Wiener, J.: A large-scale study of the evolution of web pages. In: Proceedings of the 12th International Conference on World Wide Web (2003)Google Scholar
  4. 4.
    Hennessey, J., Ge, S.X.: A cross disciplinary study of link decay and the effectiveness of mitigation techniques. BMC Bioinform. 14, S5 (2013)CrossRefGoogle Scholar
  5. 5.
    Zittrain, J., Albert, K., Lessig, L.: Perma: scoping and addressing the problem of link and reference rot in legal citations. Legal Inf. Manag. (2014)Google Scholar
  6. 6.
  7. 7.
  8. 8.
  9. 9.
  10. 10.
    Bar-yossef, Z., Kumar, R., Broder, A.Z., Tomkins, A.: Sic Transit Gloria Telae: towards an understanding of the web’s decay. In: Proceedings of the 13th Conference on World Wide Web (2004)Google Scholar
  11. 11.
    Chakrabarti, D., Agarwal, D., Josifovski, V.: Contextual advertising by combining relevance with click feedback. In: Proceedings of the 17th International Conference on World Wide Web (2008)Google Scholar
  12. 12.
    Regelson, M., Fain, D.C.: Predicting click-through rate using keyword clusters. In: Proceedings of EC 2006 (2006)Google Scholar
  13. 13.
    Becker, H., Broder, A., Gabrilovich, E., Josifovski, V., Pang, B.: What happens after an ad click?: quantifying the impact of landing pages in web advertising. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management (2009)Google Scholar
  14. 14.
    Rosales, R., Cheng, H., Manavoglu, E.: Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the 5th ACM International Conference on Web Search and Data Mining (2012)Google Scholar
  15. 15.
    Huang, A.: Similarity measures for text document clustering. In: Proceedings of the Sixth New Zealand CS Research Student Conference (2008)Google Scholar
  16. 16.
    Lukashenko, R., Graudina, V., Grundspenkis, J.: Computer-based plagiarism detection methods and tools: an overview. In: Proceedings of the International Conference on Computer Systems and Technologies (2007)Google Scholar
  17. 17.
    Reberšek, P., Verlic, M.: Application of localized similarity for web documents. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (2013)Google Scholar
  18. 18.
    Gong, D., Wang, D.Z., Peng, Y.: Multimodal learning for web information extraction. In: Proeedings of the ACM on Multimedia Conference (2017)Google Scholar
  19. 19.
    Chopra, S., Hadsell, R., LeCun, Y.: Learning a similarity metric discriminatively, with application to face verification. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2005)Google Scholar
  20. 20.
    Koch, G., Zemel, R., Salakhutdinov, R.: Siamese neural networks for one-shot image recognition. In: Proceedings of the 32nd International Conference on Machine Learning (2015)Google Scholar
  21. 21.
    Mueller, J., Thyagarajan, A.: Siamese recurrent architectures for learning sentence similarity. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (2016)Google Scholar
  22. 22.
    Qiao, R., Liu, L., Shen, C., van den Hengel, A.: Less is more: zero-shot learning from online textual documents with noise suppression. CoRR (2016)Google Scholar
  23. 23.
    Cer, D., et al.: Universal sentence encoder. arXiv e-prints (2018)Google Scholar
  24. 24.
    Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Empirical Methods in Natural Language Processing (2014)Google Scholar
  25. 25.
    Kusner, M.J., Sun, Y., Kolkin, N.I., Weinberger, K.Q.: From word embeddings to document distances. In: Proceedings of the 32nd International Conference on Machine Learning (2015)Google Scholar
  26. 26.
    Smith, R.: An overview of the Tesseract OCR engine. In: Proceedings of Ninth International Conference on Document Analysis and Recognition (2007)Google Scholar
  27. 27.
    Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. CoRR (2015)Google Scholar
  28. 28.
    Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. CoRR (2014)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  1. 1.Adobe ResearchSan JoseUSA

Personalised recommendations