Abstract
Network analysis is a valuable method for investigating and mapping the social structure driving phenomena and sharing the findings with others. The interactive visual analytics approach transforms data into views that allow the visual exploration of the structures and processes of networks represented by data, therefore increasing the transparency of editorial processes on social media as well as networked structures in innovation ecosystems and other phenomena. Although existing tools have opened many new exploratory opportunities, new tools in development promise investigators even greater freedom to interact with the data, refine and analyze the data, and explore alternative explanations for networked processes. This chapter presents the Ostinato Model—an iterative, user-centric, process-automated model for data-driven visual network analytics. The Ostinato Model simultaneously supports the automation of the process and enables interactive and transparent exploration. The model has two phases, Data Collection and Refinement and Network Creation and Analysis. The Data Collection and Refinement phase is further divided into Entity Index Creation, Web/API Crawling, Scraping, and Data Aggregation. The Network Construction and Analysis phase is composed of Filtering in Entities, Node and Edge Creation, Metrics Calculation, Node and Edge Filtering, Entity Index Refinement, Layout Processing and Visual Properties Configuration. A cycle of exploration and automation characterizes the model and is embedded in each phase.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Ideally, a data scientist is a hacker, scientist, quantitative analyst, trusted adviser and business (domain) expert, all in one person (cf. Davenport, 2014).
- 2.
Funding for young innovative companies, http://www.tekes.fi/en/funding/companies/funding-for-young-innovative-growth-companies/
- 3.
Help:Infobox, http://en.wikipedia.org/wiki/Help:Infobox
- 4.
The Terms of Service for a Web page must also be considered. When using Wikipedia as a data source, for example, one has to take into account the Terms of Service that specifically deny crawling Wikipedia for large amount of files. Instead of crawling the live website, users of the data are advised to download a copy of Wikipedia’s contents and set up a proxy for serving further processing.
- 5.
When using names as identifiers, one can apply fuzzy string matching and semi-automated tools such as OpenRefine (http://openrefine.org/) or DataWrangler (http://vis.stanford.edu/wrangler/) to assist in the aggregation process.
- 6.
Gephi Toolkit, http://gephi.github.io/toolkit/
- 7.
Using a full stack programming language such as Python gives the developers more opportunities to turn the scripts developed for analysis into processes that run in the cloud, intermittently collecting and preprocessing the data and feeding results into dashboards implemented in Web technologies.
References
Barabási, A.-L., & Bonabeau, E. (2003). Scale-free networks. Scientific American, 288(5), 50–59.
Basole, R. C., Russel, M. G., Huhtamäki, J., & Rubens, N. (2012). Understanding mobile ecosystem dynamics: A data-driven approach. In Proceedings of the 2012 International Conference on Mobile Business (ICMB 2012) (pp. 17–28). Delft, The Netherlands. Retrieved from http://aisel.aisnet.org/icmb2012/15/
Bizer, C., Heath, T., & Berners-Lee, T. (2009). Linked data: The story so far. International Journal on Semantic Web and Information Systems, 5(3), 1–22.
Card, S. K., Mackinlay, J. D., & Shneiderman, B. (1999). Readings in information visualization: Using vision to think. San Francisco: Morgan Kaufmann. Retrieved from http://www.amazon.com/Readings-Information-Visualization-Interactive-Technologies/dp/1558605339
Davenport, T. H. (2014). Big data at work: Dispelling the myths, uncovering the opportunities. Boston: Harvard Business Press Books.
Freeman, L. C. (2000). Visualizing social networks. Journal of Social Structure, 1(1). Retrieved from http://www.cmu.edu/joss/content/articles/volume1/Freeman.html
Freeman, L. C. (2009). Methods of social network visualization. Berlin, Germany: Springer.
Ghosh, S. (2013). Python tools for reproducible research in brain imaging. Retrieved from https://speakerdeck.com/satra/pydata-2013-python-tools-for-reproducible-research-in-brain-imaging
Giuliani, E., & Bell, M. (2008). Industrial clusters and the evolution of their knowledge networks: Revisiting a Chilean case. Brighton, England. Retrieved from http://www.sussex.ac.uk/spru/documents/sewp171
Granovetter, M. (1973). The strength of weak ties. American Journal of Sociology, 78(6), 1360–1380. Retrieved from http://www.jstor.org/discover/10.2307/2776392?uid=3737976&uid=2&uid=4&sid=21104852601921
Hansen, D. L., Rotman, D., Bonsignore, E., Milic-Frayling, N., Rodrigues, E. M., Smith, M., & Shneiderman, B. (2009). Do you know the way to SNA?: A process model for analyzing and visualizing social media data. University of Maryland Technical Report: HCIL-2009-17. Retrieved from http://www.smrfoundation.org/wp-content/uploads/2010/05/2009-UMD-TechReport-Do-you-know-the-way-to-SNA.pdf
Hansen, D., Shneiderman, B., & Smith, M. A. (2011). Analyzing social media networks with NodeXL: Insights from a connected world. Burlington, MA: Morgan Kaufmann. Retrieved from http://www.amazon.com/dp/0123822297
Heer, J., & Shneiderman, B. (2012). Interactive dynamics for visual analysis. Communications of the ACM, 55(4), 45–54. Retrieved January 31, 2013, from http://dl.acm.org/citation.cfm?id=2133821
Hevner, A. R., et al. (2004). Design science in information systems research. MIS Quarterly, 28(1), 75–105.
Huhtamäki, J., Salonen, J., Marttila, J., & Nykänen, O. (2010, October 3–6). Context-driven social network visualisation: Case wiki co-creation. In D. Karabeg & J. Park (Eds.), Proceedings of the Second International Workshop on Knowledge Federation: Self-Organizing Collective Mind, Dubrovnik, Croatia. Dubrovnik, Croatia: CEUR-WS.org. Retrieved from http://urn.fi/URN:NBN:fi:tty-201201161008
Huhtamäki, J., Still, K., Isomursu, M., Russell, M. G., & Rubens, N. (2012, September 20–21). Networks of growth: Case young innovative companies in Finland. In Proceedings of the 7th European Conference on Innovation and Entrepreneurship (ECIE), Santarém, Portugal
Huhtamäki, J., Luotonen, V., Kairamo, V., Still, K., & Russell, M. G. (2013, October 1–3). Process for measuring and visualizing an open innovation platform: Case Demola. In 17th International Academic MindTrek Conference 2013: “Making Sense of Converging Media”. Tampere, Finland: ACM. Retrieved from http://urn.fi/URN:NBN:fi:tty-201312201533.
Hwang, V. W., & Horowitt, G. (2012). The rainforest: The secret to building the next silicon valley 1.02 Edition. Los Altos Hills, CA: Regenwald. Retrieved from http://www.amazon.com/The-Rainforest-Secret-Building-Silicon/dp/0615586724
Indarto, E. (2013). Data mining. Retrieved December 13, 2014, from http://recommender-systems.readthedocs.org/en/latest/datamining.html
Intel. (2013). Extract, transform, and load big data with Apache Hadoop. Retrieved from https://software.intel.com/en-us/articles/extract-transform-and-load-big-data-with-apache-hadoop
Jesus, R., Schwartz, M., & Lehmann, S. (2009). Bipartite networks of Wikipedia’s articles and authors: A meso-level approach (p. Article 5, 10 pages). Orlando, FL: ACM. Retrieved September 2, 2010, from http://portal.acm.org/citation.cfm?id=1641309.1641318
Jussila, J., Huhtamäki, J., Henttonen, K., Kärkkäinen, H., & Still, K. (2014, January 6–9). Visual network analysis of Twitter data for co-organizing conferences: Case CMAD 2013. In Proceedings of the 47th Annual Hawaii International Conference on System Sciences (pp. 1474–1483). Computer Society Press. Retrieved from http://urn.fi/URN:NBN:fi:tty-201401221053.
Kandel, S., Heer, J., Plaisant, C., Kennedy, J., van Ham, F., Riche, N. H., … Buono, P. (2011). Research directions in data wrangling: Visualizations and transformations for usable and credible data. Information Visualization, 10(4), 271–288. doi:10.1177/1473871611415994
Keim, D., Kohlhammer, J., & Ellis, G. (Eds.). (2010). Mastering the information age—Solving problems with visual analytics. Geneva, Switzerland: Eurographics Association. Retrieved from http://www.vismaster.eu/book/
Konno, N., Nonaka, I., & Ogilvy, J. (2014). Scenario planning: The basics. World Futures, 70(1), 28–43. Retrieved December 14, 2014, from http://www.tandfonline.com/doi/abs/10.1080/02604027.2014.875720
Leskovec, J., Backstrom, L., & Kleinberg, J. (2009). Meme-tracking and the dynamics of the news cycle. In KDD’09 (pp. 497–506). New York: ACM. Retrieved September 14, 2012, from http://doi.acm.org/10.1145/1557019.1557077
Liu, Y.-Y., Slotine, J.-J., & Barabási, A.-L. (2011). Controllability of complex networks. Nature, 473(7346), 167–173. Retrieved June 15, 2011, from http://dx.doi.org/10.1038/nature10011
Mawer, D. (2000). Ballet and the apotheosis of the dance. In D. Mawer (Ed.), The Cambridge companion to ravel (p. 157). Cambridge, England: Cambridge University Press.
Moreno, J. L. (1953). Who shall survive?: Foundations of sociometry, group psychotherapy and sociodrama. Beacon, NY: Beacon House Inc. Retrieved August 23, 2010, from http://www.asgpp.org/docs/WSS/WSS.html
North, C. (2006). Toward measuring visualization insight. IEEE Computer Graphics and Applications, 26(3), 6–9. Retrieved from http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=1626178&url=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3D1626178
Nykänen, O., Salonen, J., Haapaniemi, M., Huhtamäki, J., Huhtamäki, J., & Huhtamäki, J. (2008). A visualisation system for a peer-to-peer information space (pp. 76–85). Tampere, Finland: Tampere University of Technology. Retrieved from http://matriisi.ee.tut.fi/hypermedia/events/opaals2008/articlelist.html#opaals2008-article14.
Paquin, R. L., & Howard-Grenville, J. (2013). Blind dates and arranged marriages: Longitudinal processes of network orchestration. Organization Studies, 34(11), 1623–1653. Retrieved March 11, 2014, from http://oss.sagepub.com/content/34/11/1623
Peffers, K., Tuunanen, T., Rothenberger, M., & Chatterjee, S. (2007). A design science research methodology for information systems research. Journal of Management Information Systems, 24(3), 45–77. doi:10.2753/MIS0742-1222240302.
Peng, R. D. (2009). Reproducible research and biostatistics. Biostatistics, 10(3), 405–408. Retrieved December 14, 2014, from http://biostatistics.oxfordjournals.org/content/10/3/405
Petschulat, S. (2010). Other people’s data. Communications of the ACM, 53(1), 53. Retrieved December 14, 2014, from http://cacm.acm.org/magazines/2010/1/55742-other-peoples-data/fulltext
Pirolli, P., & Card, S. (2005). The sensemaking process and leverage points for analyst technology as identified through cognitive task analysis. In Proceedings of International Conference on Intelligence Analysis.
Ritala, P., Armila, L., & Blomqvist, K. (2009). Innovation orchestration capability—Defining the organizational and individual level determinants. International Journal of Innovation Management, 13(04), 569–591. Retrieved March 6, 2014, from http://www.worldscientific.com/doi/abs/10.1142/S136391960900242X
Ritala, P., & Hallikas, J. (2011). Network position of a firm and the tendency to collaborate with competitors—A structural embeddedness perspective. International Journal of Strategic Business Alliances, 2(4), 307–328. Retrieved December 19, 2014, from http://dx.doi.org/10.1504/IJSBA.2011.044859
Ritala, P., & Huizingh, E. (2014). Business and network models for innovation: Strategic logic and the role of network position. International Journal of Technology Management, 66(2), 109–119. Retrieved December 19, 2014, from http://dx.doi.org/10.1504/IJTM.2014.064608
Rubens, N., Russell, M., Perez, R., Huhtamaki, J., Still, K., Kaplan, D., & Okamoto, T. (2011). Alumni network analysis. In Proceedings of 2011 IEEE Global Engineering Education Conference, EDUCON 2011 (pp. 606–611).
Russell, M. G., Still, K., Huhtamäki, J., Yu, C., & Rubens, N. (2011, July). Transforming innovation ecosystems through shared vision and network orchestration. In Proceedings of Triple Helix IX International Conference: “Silicon Valley: Global Model or Unique Anomaly?”, Stanford, CA.
Russell, M. G., Huhtamäki, J., Still, K., Rubens, N., & Basole, R. C. (2015). Relational capital for shared vision in innovation ecosystems. Triple Helix: A Journal of University-Industry-Government Innovation and Entrepreneurship.
Schwaber, K., & Beedle, M. (2001). Agile software development with Scrum. Upper Saddle River, NJ: Prentice Hall.
Shakarian, P., Eyre, S., & Paulo, D. (2013). A scalable heuristic for viral marketing under the tipping model. Retrieved September 23, 2013, from http://arxiv.org/abs/1309.2963
Simon, H. A. (1969). The sciences of the artificial. Cambridge, MA: MIT Press.
Still, K., Huhtamäki, J., Russell, M. G., Basole, R. C., Salonen, J., & Rubens, N. (2013, June 16–19). Networks of innovation relationships: Multiscopic views on Finland. In Proceedings of the XXIV ISPIM Conference—Innovating in Global Markets: Challenges for Sustainable Growth, Helsinki, Finland (p. 15).
Still, K., et al. (2014). Insights for orchestrating innovation ecosystems: The case of EIT ICT Labs and data-driven network visualisations. International Journal of Technology Management, 66(2/3), 243–265.
Telea, A. C. (2008). Data visualization: Principles and practice. Wellesley, MA: A K Peters. Retrieved from http://www.amazon.com/Data-Visualization-Principles-Alexandru-Telea/dp/1568813066
Thomas, J. J., & Cook, K. A. (2006). A visual analytics agenda. IEEE Computer Graphics and Applications, 26(1), 10–13.
Vaishnavi, V. K., & Kuechler, W., Jr. (2007). Design science research methods and patterns: Innovating information and communication technology. Boca Raton, FL: Auerbach.
Vassiliadis, P. (2009). A survey of extract–transform–load technology. International Journal of Data Warehousing and Mining, 5(3), 1–27.
Vinoski, S. (2008). Serendipitous reuse. Internet Computing, 12(1), 84–87. Retrieved February 27, 2009, from http://steve.vinoski.net/pdf/IEEE-Serendipitous_Reuse.pdf
Wasserman, S., & Faust, K. (1994). Social network analysis: Methods and applications (1st ed.). New York: Cambridge University Press. Retrieved from http://www.amazon.com/dp/0521387078
Weng, L., Menczer, F., & Ahn, Y.-Y. (2013). Virality prediction and community structure in social networks. Scientific Reports, 3, 2522. Retrieved March 20, 2014, from http://www.nature.com/srep/2013/130828/srep02522/full/srep02522.html
Wikipedia. (2014). Web crawler. Wikipedia. Retrieved December 14, 2014, from http://en.wikipedia.org/w/index.php?title=Web_crawler&oldid=635502147
Acknowledgement
The research reported in this chapter was funded through resources provided by Tekes—the Finnish Funding Agency for Innovation and mediaX at Stanford University.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Huhtamäki, J., Russell, M.G., Rubens, N., Still, K. (2015). Ostinato: The Exploration-Automation Cycle of User-Centric, Process-Automated Data-Driven Visual Network Analytics. In: Matei, S., Russell, M., Bertino, E. (eds) Transparency in Social Media. Computational Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-18552-1_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-18552-1_11
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18551-4
Online ISBN: 978-3-319-18552-1
eBook Packages: Computer ScienceComputer Science (R0)