Volume 109, Issue 3, pp 1579–1591

Development of a software for metric studies of transportation engineering journals

  • Ercilia de Stefano
  • Marcio Peixoto de Sequeira Santos
  • Ronaldo Balassiano


This study describes the development and results of a software tool designed to analyze millions of articles in the area of Transportation Engineering. The tool is intended to support Transportation Planning activities by providing additional information about trends, references and technologies. To develop it, techniques from scientometrics, bibliometrics and informetrics were employed with the support of tools from Computer Science, such as Artificial Intelligence, Data Mining and Natural Language Processing. The result is a structured database that allows browsing how interest in different topics has changed over the years in areas related to Transportation Engineering. For a given area, the database can identify which authors have published in it, which helps locate specialists and related papers. In addition, the software that creates this database can perform the same analysis on academic corpora from other areas of study.
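As a rough illustration of the two lookups the abstract describes (topic interest per year, and authors per topic), the following is a minimal sketch. The record layout, the `build_index` helper, and the sample entries are hypothetical placeholders, not the authors' schema or data; the actual software relies on Data Mining and Natural Language Processing steps that are not shown here.

```python
from collections import Counter, defaultdict

# Made-up placeholder records; the real database schema is not given in the abstract.
ARTICLES = [
    {"year": 2010, "authors": ["Author A"], "topics": ["traffic simulation"]},
    {"year": 2011, "authors": ["Author B", "Author A"],
     "topics": ["traffic simulation", "public transit"]},
    {"year": 2011, "authors": ["Author C"], "topics": ["public transit"]},
]


def build_index(articles):
    """Return (topic -> year -> article count, topic -> set of authors)."""
    by_year = defaultdict(Counter)
    authors = defaultdict(set)
    for art in articles:
        for topic in art["topics"]:
            by_year[topic][art["year"]] += 1   # one more article on this topic that year
            authors[topic].update(art["authors"])
    return by_year, authors


topic_years, topic_authors = build_index(ARTICLES)

# Yearly counts for one topic, in chronological order.
for year, count in sorted(topic_years["public transit"].items()):
    print(year, count)

# Authors who have published on a topic.
print(sorted(topic_authors["traffic simulation"]))
```

In this sketch the topics are assumed to be already assigned to each article; in practice that assignment is the hard part and would come from the text-analysis stage.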


Scientometrics; Informetrics; Bibliometrics; Artificial intelligence; Natural language processing; Transportation engineering



Copyright information

© Akadémiai Kiadó, Budapest, Hungary 2016

Authors and Affiliations

  1. Transportation Engineering Program - PETCOPPE - UFRJ, Rio de Janeiro, Brazil
