Abstract
The increasing popularity of the Internet and e-commerce makes online merchants to constantly seek tools that would permit them to attract new and retain old customers. Traffic tracking and analysis tools can help businesses know more about their customers. These tools track visitors’ behaviors on Web sites. The information obtained from Web traffic tracking and analysis can help online merchants target specific audiences with customized products and services. Most commonly used approaches include Web log file, packet monitors, and single-pixel image approach. Each of these approaches has some drawbacks, which limits the types of data it can track or the user environment. In this paper, we propose a tracking and analysis approach, which has fewer limitations and more advantages than the existing approaches. We discuss three different approaches (i.e., improved single-pixel image, JavaScript tracking and HTTP (Hypertext Transfer Protocol) proxy server), which work together to track a user’s activities. In addition to basic analysis, we implement advanced analysis such as path analysis tree and user clustering. Path analysis is pivotal for Web site management and marketing in e-commerce. In modeling the tracking and analysis approach, we used a formal technique to guide quality assurance imperatives.
Article PDF
Avoid common mistakes on your manuscript.
References
S.A. Ehikioya, S. Lu, A path analysis model for effective e-commerce transactions, Afr. J. Comput. ICT 12 (2019), 55–71.
“Web Analytics”, Tutorials Point (I) Pvt. Ltd., 2015, Available from: https://www.tutorialspoint.com/web_analytics_turorial.pdf (accessed December 12, 2018).
G. Zheng, S. Peltsverger, Web analytics overview, in: M. Khosrow-Pour (Ed.), Encyclopedia of Information Science and Technology, third ed., IGI, Global, Hershey, PA, USA, 2015.
M.T. Nguyen, T.D. Diep, T. Hoang Vinh, T. Nakajima, N. Thoai, Analyzing and visualizing web server access log file, in: T. Dang, J. Küng, R. Wagner, N. Thoai, M. Takizawa (Eds.), International Conference on Future Data and Security Engineering (FDSE 2018) — Lecture Notes in Computer Science, vol 11251, Springer, Cham, 2018, pp. 349–367.
B. Dykes, Web analytics kick start guide: a primer on the fundamentals of digital analytics, Adobe Press Books, Peachpit Press, Pearson Education, San Francisco, CA, USA, 2014.
A. Kaushik, Web analytics 2.0: the art of online accountability and science of customer centricity, John Wiley & Sons, Hoboken, NJ, USA, 2010.
E.T. Peterson, Web analytics demystified: a marketer’s guide to understanding how your web site affects your business, Celilo Group Media and CafePress, Portland, OR, USA, 2004.
B. Clifton, Advanced web metrics with Google analytics, third ed., Wiley Publishing, Hoboken, NJ, USA, 2012.
A. Kaushik, Web Analytics: An Hour a Day, Wiley Publishing, Hoboken, NJ, USA, 2007.
A. Croll, S. Power, Complete web monitoring: watching your visitors, performance, communities, and competitors, O’Reilly Media Inc., Sebastopol, CA, USA, 2009.
S. Jackson, Cult of analytics: driving online marketing strategies using web analytics, Butterworth-Heinemann, Oxford, United Kingdom, 2009.
I. Bekavac, D.G. Praničević, Web analytics tools and web metrics tools: an overview and comparative analysis, Croat. Oper. Res. Rev. 6 (2015), 373–386.
B.J. Jansen, Understanding user-web interactions via web analytics, Morgan & Claypool, Williston, VT, USA, 2009.
D. Booth, B.J. Jansen, A review of methodologies for analyzing websites, in: B.J. Jansen, A. Spink, I. Taksa (Eds.), Handbook of Research on Web Log Analysis, IGI Global, Hershey, PA, USA, 2010, pp. 141–162.
R.E. Bucklin, C. Sismeiro, Click here for internet insight: advances in clickstream data analysis in marketing, J. Interact. Market. 23 (2009), 35–48.
A. Ezzedin, Tracking product journey from carting to purchasing: 15 secrets to perfecting your online store, E-Nor Inc., 2014, Available from: https://www.e-nor.com/wp-content/uploads/pubs/ebooks/tracking-product-journey-from-carting-to-purchasing.pdf.
Eric Fettman, Google analytics universal guide: best practices for implementation and reporting, E-Nor Inc., 2014, Available from: https://www.e-nor.com/blog/ebooks/google-analytics-universal-guide-best-practices-for-implementation-and-reporting.
J. Andersen, A. Giversen, A.H. Jensen, R.S. Larsen, T.B. Pedersen, J. Skyt, Analyzing clickstreams using subsessions, Proceedings of the ACM Third International Workshop on Data Warehousing and OLAP (DOLAP), ACM, Washington DC, USA, 2000, pp. 25–32.
L. Clark, I.H. Ting, C. Kimble, P.C. Wright, D. Kudenko, Combining ethnographic and clickstream data to identify user web browsing strategies, Inform. Res. 11 (2006), Available from: https://informationr.net/ir/11-2/paper249.
H.K. Ellonen, P. Wikstrom, A. Johansson, The role of the website in a magazine business: revisiting old truths, J. Media Bus. Stud. 12 (2015), 238–249.
J.P. de Almeida Ribeiro, The use of web analytics on a small data set in an online media company: shifter’s case study, Master’s Degree in Information Management, NOVA Information Management School, Instituto Superior de Estatística e Gestão de Informação, Universidade Nova de Lisboa, 2016.
M. Lindén, Path analysis of online users using clickstream data: case online magazine website, Master’s Degree in Strategy, Innovation and Sustainability, LUT School of Business and Management, Lappeenranta University of Technology, 2016.
R. Kumar Jain, R.S. Kasana, S. Jain, Efficient web log mining using doubly linked tree, Int. J. Comput. Sci. Inform. Secur. 3 (2009), 5.
S.K. Pani, L. Panigrahy, V.H. Sankar, B.K. Ratha, A.K. Mandal, S.K. Padhi, Web usage mining: a survey on pattern extraction from web logs, Int. J. Instrum. Control Autom. 1 (2011), 15–23.
A.T. Siddiqui, S. Aljahdali, Web mining techniques in e-commerce applications, Int. J. Comput. Appl. 69 (2013), 39–43.
N. Jokar, A.R. Honarvar, S. Aghamirzadeh, K. Esfandiari, Web mining and web usage mining techniques, Bulletin Royale Soc. Sci. Liège 85 (2016), 321–328.
M. Wang, G. Liu, C. Yan, C. Jiang, Modeling and vulnerable points analysis for e-commerce transaction system with a known attack, Ninth International Conference on Security, Privacy and Anonymity in Computation, Communication and Storage (SpaCCS), Lecture Notes in Computer Science, vol. 10066, Springer, Cham, 2016, pp. 422–436.
Z. Tian, Z. Zhan, X. Guan, A new structural analysis model for e-commerce ecosystem network, Int. J. Hybrid Inform. Technol. 7 (2014), 43–56.
M.W. Lewis, B.J. White, SOLO: a linear ordering approach to path analysis of web site traffic, INFOR Inf. Syst. Oper. Res. 50 (2012), 186–194.
K.N. Asha, R. Rajkumar, Survey on web mining techniques and challenges of e-commerce in online social networks, Indian J. Sci. Technol 9 (2016), 1–5.
A.L. Montgomery, S. Li, K. Srinivasan, J.C. Liechty, Modeling online browsing and path analysis using clickstream data, INFORMS 23 (2004), 469–631.
A. Noreika, S. Drąsutis, Website activity analysis model, Inform. Technol. Control 36 (2007), 268–272.
M. Ohta, Y. Higuchi, Study on the design of supermarket store layouts: the principle of “sales magnet”, Int. J. Social Behav. Educ. Econ. Bus. Ind. Eng. 7 (2013), 209–212.
L. Zheng, G. Liu, C. Yan, C. Jiang, Transaction fraud detection based on total order relation and behavior diversity, IEEE Trans. Comput. Soc. Syst. 5 (2018), 796–806.
N. Privault, Understanding Markov chains: examples and applications, second ed., Springer, Singapore, 2018.
S.A. Ehikioya, J. Zheng, Web content usage data logging for discovering user interests, Computing, Information Systems, Dev. Inform. Allied Res. J. 9 (2018), 43–50.
R.Q.A. Fernandes, W.A. Pinheiro, G.B. Xexéo, J.M. de Souza, Path clustering: grouping in an efficient way complex data distributions, J. Today’s Ideas Tomorrow’s Technol. 5 (2017), 141–155.
B. Lavanya, A. Auxilia Princy, A survey on contribution of data mining techniques and graph reading algorithms in concept map generation, J. Today’s Ideas Tomorrow’s Technol. 6 (2018), 99–105.
P. Berkhin, J.D. Becher, D.J. Randall, Interactive path analysis of web site traffic, Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, San Francisco, CA, 2001, pp. 414–419.
M.S. Chen, J.S. Park, P.S. Yu, Efficient data mining for path traversal patterns, IEEE Trans. Knowl. Data Eng. 10 (1998), 209–221.
H. Mannila, H. Toivonen, A.I. Verkamo, Discovering frequent episodes in event sequences, Proceedings of the First International Conference on Knowledge Discovery and Data Mining (KDD), AAAI Press, Montreal, Quebec, 1995, pp. 210–215.
R. Srikant, R. Agrawal, Mining sequential patterns: generalizations and performance improvements, Proceedings of the 5th International Conference on Extending Database Technology (EDBT): Advances in Database Technology, ACM, Avignon, France, 1996, pp. 3–17.
J.B. Schafer, J.A. Konstan, J. Riedl, E-commerce recommendation applications, Data Mining Knowl. Discov. 5 (2001), 115–153.
E.J. Glover, S. Lawrence, M.D. Gordon, W.P. Birmingham, C.L. Giles, Recommending web documents based on user preferences, Proceedings of the ACM SIGIR ‘99 Workshop on Recommender Systems, University of California, Berkeley, 1999.
R. Cooley, B. Mobasher, J. Srivastava, Web mining: information and pattern discovery on the world wide web, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence (ICTAI), IEEE, Newport Beach, CA, USA, 1997, pp. 558–567.
B. Mobasher, H. Dai, T. Luo, Y. Sun, J. Zhu, Integrating web usage and content mining for more effective personalization, in: K. Bauknecht, S.K. Madria, G. Pernul (Eds.), Proceedings of the International Conference on E-commerce and Web Technologies (ECWeb), Lecture Notes in Computer Science, Vol. 1875, Springer, Heidelberg, Greenwich, UK, 2000, pp. 165–176.
B. Sarwar, G. Karypis, J. Konstan, J. Reidl, Analysis of recommendation algorithms for e-commerce, Proceedings of the Second ACM Conference on E-commerce (EC), Association for Computing Machinery, Minneapolis, MN, USA, 2000, pp. 158–167.
S. Schechter, M. Krishnan, M.D. Smith, Using path profiles to predict HTTP requests, Proceedings of the Seventh International Conference on World Wide Web, Association for Computing Machinery, Brisbane, Australia, 1998, pp. 457–467.
E. Kahya-Özyirmidokuz, Analyzing unstructured facebook social network data through web text mining: a study of online shopping firms in Turkey, Inform. Dev. 32 (2014), 70–80.
C.P. Chen, J.Y. Weng, C.S. Yang, F.M. Tseng, Employing a data mining approach for identification of mobile opinion leaders and their content usage patterns in large telecommunications data-sets, Technol. Forecast. Soc. Change 130 (2018), 88–98.
Sunil, M.N. Doja, Web data mining in e-services – concepts and applications, Indian J. Comput. Sci. Eng. 8 (2017), 313–318.
R.N. Landers, R.C. Brusso, K.J. Cavanaugh, A.B. Collmus, A primer on theory-driven web scraping: automatic extraction of big data from the internet for use in psychological research, Psychol. Methods 21 (2016), 475–492.
L. Flory, K.M. Osei-Bryson, M. Thomas, A new web personalization decision-support artifact for utility-sensitive customer review analysis, J. Decision Support Syst. 94 (2017), 85–96.
A.M. Alkalbani, L. Gadhvi, B. Patel, F.K. Hussain, A.M. Ghamry, O.K. Hussain, Analysing cloud services reviews using opining mining, 2017 IEEE 31st International Conference on Advanced Information Networking and Applications (AINA), IEEE, Taipei, Taiwan, 2017.
L. Hongyan, L. Zhenyu, E-commerce consumer behavior information big data mining, Int. J. Database Theory Appl. 9 (2016), 135–146.
Arti, S. Choudhary, G.N. Purohit, Role of web mining in e-commerce, Int. J. Adv. Res. Comput. Commun. Eng. 4 (2015), 251–253.
V. Kumar, G.A. Ogunmola, Web analytics for knowledge creation: a systematic review of tools, techniques, and practices, Int. J. Cyber Behav. Psychol. Learn. 10 (2020), 1–14.
Web analytics software, Capterra Inc., Arlington, VA, USA, 2019, Available from: https://www.capterra.com/web-analytics-software/ (accessed January 10, 2020).
P. V. Pande, N.M. Tarbani, P. V. Ingalkar, A study of web traffic analysis, Int. J. Comput. Sci. Mobile Comp. 3 (2014), 900–907.
A. Malacinski, S. Dominick, T. Hartrick, Measuring web traffic, part 1 and part 2, DeveloperWorks, IBM Corporation, Armonk, NY, USA, 2001 (See https://www-106.ibm.com/developerworks/Web/library/wa-mwt1 and https://www-106.ibm.com/developer-works/Web/library/wa-mwt2.).
Driving business decisions in web time, White Paper, Accrue Software, Inc., Fremont, CA, USA, 2000.
Analyzing web site traffic, White Paper, Sane Solutions, LLC, North Kingstown, Rhodes Island, USA, 2002.
M. McClure, Web traffic analysis software, White Paper, Accrue Software, Inc., Fremont, CA, USA, 1999.
M. Hall, Core Servlets and JavaServer Pages, Sun Microsystems Press, Menlo Park, CA, USA, 2000, p. 181.
Assessing web site usability from server log files, White Paper, Tec-Ed, Inc., Ann Arbor, MI, USA, 1999, Available from: https://teced.com/wp-content/uploads/2011/06/whitepapassess-ing-web-site-usability.pdf.
T. Wilson, Web traffic analysis turns management data to business data, TechWeb Network, San Francisco, CA, USA, 1999.
J. Seymour, Mining for gold in your web traffic logs, Sharper Edge International Pty Ltd., Beecroft NSW, Australia, 2014, a web article on internet business, Available from: https://sharperedge.net/mining-gold-web-traffic-logs.
M.C. Drott, Using web server logs to improve site design, Proceedings on the 16th Annual International Conference on Computer Documentation, Association for Computing Machinery (ACM), Quebec City, Canada, 1998, pp. 43–50.
B. Krishnamurthy, J. Rexford, Software issues in characterizing web server logs, World Wide Web Consortium Workshop on Web Characterization, Cambridge, MA, Cambridge, 1998.
B.D. Davison, Web traffic logs: an imperfect resource for evaluation, Proceedings of Ninth Annual Conference of the Internet Society, INET, San Jose, CA, 1999, Available from: https://web.archive.org/web/20100311063424/https://www.isoc.org/isoc/con-ferences/inet/99/proceedings/4n/4n_1.htm.
Building Confidence Electronic Commerce and Development, United Nations Conference on Trade and Development (UNCTAD), UNCTAD/SDTE/MISC.11, UNCTAD, Geneva, 2000, p. 18.
Guidelines for UK Government Websites: Illustrated Handbook for Web Management Teams, 2nd Edition, The Office of e-Envoy Cabinet Office, Stationery Office Books (TSO), UK, 2003.
S.M. Ackerman, L.F. Cranor, J. Reagle, Beyond concern: understanding net users’ attitudes about online privacy, AT&T Labs-Research Technical Report TR 99.4.3, AT&T Labs-Research, 1999, Available from: https://arxiv.org/html/cs/9904010/report.htm.
Privacy notice research, final results, Harris Interactive, Inc., Privacy Leadership Initiative (PLI); Study No. 15338, 2001.
R. Morris, D. Lin, Variance of aggregated web traffic, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications, Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies, IEEE, Tel-Aviv, Israel, 2000.
B. Mah, An empirical model of HTTP network traffic, Proceedings of INFOCOM, IEEE, Kobe, Japan, 1997.
A. Feldmann, Continuous online extraction of HTTP traces from packet traces, World Wide Web Consortium Workshop on Web Characterization, Cambridge, MA, USA, 1998, Available from: https://www.net.t-labs.tu-berlin.de/papers/F-COEHTPT-98.ps.
S.E. Lamm, D.A. Reed, W.H. Scullin, Real-time geographic visualization of world wide web traffic, Proceedings of the Fifth International World Wide Web Conference, Computer Networks and ISDN Systems, Paris, France, 1996, pp. 1457–1468.
M.E. Crovella, A. Bestavros, Explaining world wide web traffic self-similarity, Technical Report TR-95-015, Computer Science Dept., Boston University, 1995.
C. Cunha, A. Bestavros, M.E. Crovella, Characteristics of WWW client-based traces, Technical Report TR-95-010, Computer Science Dept., Boston University, Boston, MA, 1995.
F. Masseglia, P. Poncelet, M. Teisseire, Using data mining techniques on web access logs to dynamically improve hypertext structure, ACM SigWeb Lett. 8 (1999), 13–19.
O.R. Zaiane, M. Xin, J. Han, Discovering web access patterns and trends by applying OLAP and data mining technology on web logs, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries (ADL), IEEE, Santa Barbara, CA, USA, 1998, pp. 19–29.
C. Theusinger, K.P. Huber, Analyzing the footsteps of your customers – a case study by ASKnet and SAS Institute GmbH, Proceedings of WEB Knowledge Discovery in Databases (WEBKDD), Association for Computing Machinery (ACM), Boston, MA, USA, 2000.
S.G. Eick, eBusiness performance analysis, International Conference on Advances in Infrastructure for Electronic Business, Science, and Education on the Internet, Superiore G. Reiss Romoli (SSGRR), L’Aquila, Italy, 2000.
L. Shen, L. Cheng, J. Ford, F. Makedon, V. Megalooikonomou, T. Steinberg, Mining the most interesting web access associations, Proceedings of WebNet World Conference on the WWW and Internet, Association for the Advancement of Computing in Education (AACE), San Antonio, Texas, 2000, pp. 489–494.
J. Pei, J. Han, B. Mortazavi-asl, H. Zhu, Mining access patterns efficiently from web logs, Proceedings of the Fourth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2000, pp. 396–407.
J. Myllymaki, J. Jackson, Web-based data mining, automatically extract information with HTML, XML, and Java, Developer Works, IBM Corporation, Armonk, New York, USA, 2001.
J. Hammer, H. Garcia-Molina, J. Cho, A. Crespo, R. Aranha, Extracting semistructured information from the web, Proceedings of the Workshop on Management of Semi-structured Data at PODS/SIGMOD’97, Tucson, Arizona, USA, 1997, pp. 18–25.
A. Bergholz, J.C. Freytag, A three-layer approach to semistruc-tured data, Proceedings of the Sixth International Workshop on Theory and Application of Graph Transformations (TAGT), Paderborn, Germany, 1998.
S. Soderland, Learning to extract text-based information from the world wide web, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, AAAI Press, Newport Beach, California, 1997, pp. 251–254.
IP address frequently asked questions, BCR online reference database, 2003, Available from: https://www.bcr.org/reference/IP/ipaddFAQ.html.
R. Cooley, J. Srivastava, B. Mobasher, Data preparation for mining world wide web browsing patterns, J. Knowl. Inform. Syst. 1 (1999), 5–32.
L. Catledge, J. Pitkow, Characterizing browsing behaviors on the world wide web, Comput. Netw. ISDN Syst. 27 (1995), 1065–1073.
B. Mobasher, R. Cooley, J. Srivastava, Automatic personalization based on web usage mining, Commun. ACM 43 (2000), 142–151.
J.A. Hartiga, Clustering algorithms, John Wiley & Sons, Inc., Hoboken, New Jersey, USA, 1975.
S.A. Ehikioya, Specification of transaction systems protocol, Ph.D. Thesis, Department of Computer Science, University of Manitoba, Winnipeg, Manitoba, Canada, 1997.
S.A. Ehikioya, K.E. Barker, Towards a formal specification methodology for transaction systems protocol, Third Annual IASTED International Conference on Software Engineering and Applications (SEA), ACTA Press, Scottsdale, Arizona, USA, 1999.
V.S. Alagar, K. Periyasamy, Specification of software system, Spring-Verlag, New York, 1998.
Usage share of web browsers, Wikipedia, 2019, Available from: https://en.wikipedia.org/wiki/Usage_share_of_web_brows-ers#Crossover_to_smartphones_having_majority_share (accessed January 10, 2020).
List of top web browsers present online, Slimjet, 2020, Available from: https://www.slimjet.com/en/lp/top-10-browsers.php (accessed January 10, 2020).
S. Robinson, Consider privacy issues in tracking web server activity, World Wide Web (W3C) in the Press, 2003, Available from: https://www.w3.org/Press/Articles-2003.html.
H. Singh, Using analytics for better decision-making, towards data science, 2018, Available from: https://towardsdatascience.com/using-analytics-for-better-decision-making-ce4f92c4a025 (accessed January 10, 2020).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).
About this article
Cite this article
Ehikioya, S.A., Lu, S. A Traffic Tracking Analysis Model for the Effective Management of E-commerce Transactions. Int J Netw Distrib Comput 8, 171–193 (2020). https://doi.org/10.2991/ijndc.k.200515.006
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.2991/ijndc.k.200515.006