Skip to main content

Ostinato: The Exploration-Automation Cycle of User-Centric, Process-Automated Data-Driven Visual Network Analytics

  • Chapter
Transparency in Social Media

Part of the book series: Computational Social Sciences ((CSS))

Abstract

Network analysis is a valuable method for investigating and mapping the social structure driving phenomena and sharing the findings with others. The interactive visual analytics approach transforms data into views that allow the visual exploration of the structures and processes of networks represented by data, therefore increasing the transparency of editorial processes on social media as well as networked structures in innovation ecosystems and other phenomena. Although existing tools have opened many new exploratory opportunities, new tools in development promise investigators even greater freedom to interact with the data, refine and analyze the data, and explore alternative explanations for networked processes. This chapter presents the Ostinato Model—an iterative, user-centric, process-automated model for data-driven visual network analytics. The Ostinato Model simultaneously supports the automation of the process and enables interactive and transparent exploration. The model has two phases, Data Collection and Refinement and Network Creation and Analysis. The Data Collection and Refinement phase is further divided into Entity Index Creation, Web/API Crawling, Scraping, and Data Aggregation. The Network Construction and Analysis phase is composed of Filtering in Entities, Node and Edge Creation, Metrics Calculation, Node and Edge Filtering, Entity Index Refinement, Layout Processing and Visual Properties Configuration. A cycle of exploration and automation characterizes the model and is embedded in each phase.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Ideally, a data scientist is a hacker, scientist, quantitative analyst, trusted adviser and business (domain) expert, all in one person (cf. Davenport, 2014).

  2. 2.

    Funding for young innovative companies, http://www.tekes.fi/en/funding/companies/funding-for-young-innovative-growth-companies/

  3. 3.

    Help:Infobox, http://en.wikipedia.org/wiki/Help:Infobox

  4. 4.

    The Terms of Service for a Web page must also be considered. When using Wikipedia as a data source, for example, one has to take into account the Terms of Service that specifically deny crawling Wikipedia for large amount of files. Instead of crawling the live website, users of the data are advised to download a copy of Wikipedia’s contents and set up a proxy for serving further processing.

  5. 5.

    When using names as identifiers, one can apply fuzzy string matching and semi-automated tools such as OpenRefine (http://openrefine.org/) or DataWrangler (http://vis.stanford.edu/wrangler/) to assist in the aggregation process.

  6. 6.

    Gephi Toolkit, http://gephi.github.io/toolkit/

  7. 7.

    Using a full stack programming language such as Python gives the developers more opportunities to turn the scripts developed for analysis into processes that run in the cloud, intermittently collecting and preprocessing the data and feeding results into dashboards implemented in Web technologies.

References

  • Barabási, A.-L., & Bonabeau, E. (2003). Scale-free networks. Scientific American, 288(5), 50–59.

    Article  Google Scholar 

  • Basole, R. C., Russel, M. G., Huhtamäki, J., & Rubens, N. (2012). Understanding mobile ecosystem dynamics: A data-driven approach. In Proceedings of the 2012 International Conference on Mobile Business (ICMB 2012) (pp. 17–28). Delft, The Netherlands. Retrieved from http://aisel.aisnet.org/icmb2012/15/

  • Bizer, C., Heath, T., & Berners-Lee, T. (2009). Linked data: The story so far. International Journal on Semantic Web and Information Systems, 5(3), 1–22.

    Article  Google Scholar 

  • Card, S. K., Mackinlay, J. D., & Shneiderman, B. (1999). Readings in information visualization: Using vision to think. San Francisco: Morgan Kaufmann. Retrieved from http://www.amazon.com/Readings-Information-Visualization-Interactive-Technologies/dp/1558605339

  • Davenport, T. H. (2014). Big data at work: Dispelling the myths, uncovering the opportunities. Boston: Harvard Business Press Books.

    Book  Google Scholar 

  • Freeman, L. C. (2000). Visualizing social networks. Journal of Social Structure, 1(1). Retrieved from http://www.cmu.edu/joss/content/articles/volume1/Freeman.html

  • Freeman, L. C. (2009). Methods of social network visualization. Berlin, Germany: Springer.

    Book  Google Scholar 

  • Ghosh, S. (2013). Python tools for reproducible research in brain imaging. Retrieved from https://speakerdeck.com/satra/pydata-2013-python-tools-for-reproducible-research-in-brain-imaging

  • Giuliani, E., & Bell, M. (2008). Industrial clusters and the evolution of their knowledge networks: Revisiting a Chilean case. Brighton, England. Retrieved from http://www.sussex.ac.uk/spru/documents/sewp171

  • Granovetter, M. (1973). The strength of weak ties. American Journal of Sociology, 78(6), 1360–1380. Retrieved from http://www.jstor.org/discover/10.2307/2776392?uid=3737976&uid=2&uid=4&sid=21104852601921

  • Hansen, D. L., Rotman, D., Bonsignore, E., Milic-Frayling, N., Rodrigues, E. M., Smith, M., & Shneiderman, B. (2009). Do you know the way to SNA?: A process model for analyzing and visualizing social media data. University of Maryland Technical Report: HCIL-2009-17. Retrieved from http://www.smrfoundation.org/wp-content/uploads/2010/05/2009-UMD-TechReport-Do-you-know-the-way-to-SNA.pdf

  • Hansen, D., Shneiderman, B., & Smith, M. A. (2011). Analyzing social media networks with NodeXL: Insights from a connected world. Burlington, MA: Morgan Kaufmann. Retrieved from http://www.amazon.com/dp/0123822297

  • Heer, J., & Shneiderman, B. (2012). Interactive dynamics for visual analysis. Communications of the ACM, 55(4), 45–54. Retrieved January 31, 2013, from http://dl.acm.org/citation.cfm?id=2133821

  • Hevner, A. R., et al. (2004). Design science in information systems research. MIS Quarterly, 28(1), 75–105.

    Google Scholar 

  • Huhtamäki, J., Salonen, J., Marttila, J., & Nykänen, O. (2010, October 3–6). Context-driven social network visualisation: Case wiki co-creation. In D. Karabeg & J. Park (Eds.), Proceedings of the Second International Workshop on Knowledge Federation: Self-Organizing Collective Mind, Dubrovnik, Croatia. Dubrovnik, Croatia: CEUR-WS.org. Retrieved from http://urn.fi/URN:NBN:fi:tty-201201161008

  • Huhtamäki, J., Still, K., Isomursu, M., Russell, M. G., & Rubens, N. (2012, September 20–21). Networks of growth: Case young innovative companies in Finland. In Proceedings of the 7th European Conference on Innovation and Entrepreneurship (ECIE), SantarĂ©m, Portugal

    Google Scholar 

  • Huhtamäki, J., Luotonen, V., Kairamo, V., Still, K., & Russell, M. G. (2013, October 1–3). Process for measuring and visualizing an open innovation platform: Case Demola. In 17th International Academic MindTrek Conference 2013: “Making Sense of Converging Media”. Tampere, Finland: ACM. Retrieved from http://urn.fi/URN:NBN:fi:tty-201312201533.

  • Hwang, V. W., & Horowitt, G. (2012). The rainforest: The secret to building the next silicon valley 1.02 Edition. Los Altos Hills, CA: Regenwald. Retrieved from http://www.amazon.com/The-Rainforest-Secret-Building-Silicon/dp/0615586724

  • Indarto, E. (2013). Data mining. Retrieved December 13, 2014, from http://recommender-systems.readthedocs.org/en/latest/datamining.html

  • Intel. (2013). Extract, transform, and load big data with Apache Hadoop. Retrieved from https://software.intel.com/en-us/articles/extract-transform-and-load-big-data-with-apache-hadoop

  • Jesus, R., Schwartz, M., & Lehmann, S. (2009). Bipartite networks of Wikipedia’s articles and authors: A meso-level approach (p. Article 5, 10 pages). Orlando, FL: ACM. Retrieved September 2, 2010, from http://portal.acm.org/citation.cfm?id=1641309.1641318

  • Jussila, J., Huhtamäki, J., Henttonen, K., Kärkkäinen, H., & Still, K. (2014, January 6–9). Visual network analysis of Twitter data for co-organizing conferences: Case CMAD 2013. In Proceedings of the 47th Annual Hawaii International Conference on System Sciences (pp. 1474–1483). Computer Society Press. Retrieved from http://urn.fi/URN:NBN:fi:tty-201401221053.

  • Kandel, S., Heer, J., Plaisant, C., Kennedy, J., van Ham, F., Riche, N. H., … Buono, P. (2011). Research directions in data wrangling: Visualizations and transformations for usable and credible data. Information Visualization, 10(4), 271–288. doi:10.1177/1473871611415994

  • Keim, D., Kohlhammer, J., & Ellis, G. (Eds.). (2010). Mastering the information age—Solving problems with visual analytics. Geneva, Switzerland: Eurographics Association. Retrieved from http://www.vismaster.eu/book/

  • Konno, N., Nonaka, I., & Ogilvy, J. (2014). Scenario planning: The basics. World Futures, 70(1), 28–43. Retrieved December 14, 2014, from http://www.tandfonline.com/doi/abs/10.1080/02604027.2014.875720

  • Leskovec, J., Backstrom, L., & Kleinberg, J. (2009). Meme-tracking and the dynamics of the news cycle. In KDD’09 (pp. 497–506). New York: ACM. Retrieved September 14, 2012, from http://doi.acm.org/10.1145/1557019.1557077

  • Liu, Y.-Y., Slotine, J.-J., & Barabási, A.-L. (2011). Controllability of complex networks. Nature, 473(7346), 167–173. Retrieved June 15, 2011, from http://dx.doi.org/10.1038/nature10011

  • Mawer, D. (2000). Ballet and the apotheosis of the dance. In D. Mawer (Ed.), The Cambridge companion to ravel (p. 157). Cambridge, England: Cambridge University Press.

    Chapter  Google Scholar 

  • Moreno, J. L. (1953). Who shall survive?: Foundations of sociometry, group psychotherapy and sociodrama. Beacon, NY: Beacon House Inc. Retrieved August 23, 2010, from http://www.asgpp.org/docs/WSS/WSS.html

  • North, C. (2006). Toward measuring visualization insight. IEEE Computer Graphics and Applications, 26(3), 6–9. Retrieved from http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=1626178&url=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3D1626178

  • Nykänen, O., Salonen, J., Haapaniemi, M., Huhtamäki, J., Huhtamäki, J., & Huhtamäki, J. (2008). A visualisation system for a peer-to-peer information space (pp. 76–85). Tampere, Finland: Tampere University of Technology. Retrieved from http://matriisi.ee.tut.fi/hypermedia/events/opaals2008/articlelist.html#opaals2008-article14.

    Google Scholar 

  • Paquin, R. L., & Howard-Grenville, J. (2013). Blind dates and arranged marriages: Longitudinal processes of network orchestration. Organization Studies, 34(11), 1623–1653. Retrieved March 11, 2014, from http://oss.sagepub.com/content/34/11/1623

  • Peffers, K., Tuunanen, T., Rothenberger, M., & Chatterjee, S. (2007). A design science research methodology for information systems research. Journal of Management Information Systems, 24(3), 45–77. doi:10.2753/MIS0742-1222240302.

    Article  Google Scholar 

  • Peng, R. D. (2009). Reproducible research and biostatistics. Biostatistics, 10(3), 405–408. Retrieved December 14, 2014, from http://biostatistics.oxfordjournals.org/content/10/3/405

  • Petschulat, S. (2010). Other people’s data. Communications of the ACM, 53(1), 53. Retrieved December 14, 2014, from http://cacm.acm.org/magazines/2010/1/55742-other-peoples-data/fulltext

  • Pirolli, P., & Card, S. (2005). The sensemaking process and leverage points for analyst technology as identified through cognitive task analysis. In Proceedings of International Conference on Intelligence Analysis.

    Google Scholar 

  • Ritala, P., Armila, L., & Blomqvist, K. (2009). Innovation orchestration capability—Defining the organizational and individual level determinants. International Journal of Innovation Management, 13(04), 569–591. Retrieved March 6, 2014, from http://www.worldscientific.com/doi/abs/10.1142/S136391960900242X

  • Ritala, P., & Hallikas, J. (2011). Network position of a firm and the tendency to collaborate with competitors—A structural embeddedness perspective. International Journal of Strategic Business Alliances, 2(4), 307–328. Retrieved December 19, 2014, from http://dx.doi.org/10.1504/IJSBA.2011.044859

  • Ritala, P., & Huizingh, E. (2014). Business and network models for innovation: Strategic logic and the role of network position. International Journal of Technology Management, 66(2), 109–119. Retrieved December 19, 2014, from http://dx.doi.org/10.1504/IJTM.2014.064608

  • Rubens, N., Russell, M., Perez, R., Huhtamaki, J., Still, K., Kaplan, D., & Okamoto, T. (2011). Alumni network analysis. In Proceedings of 2011 IEEE Global Engineering Education Conference, EDUCON 2011 (pp. 606–611).

    Google Scholar 

  • Russell, M. G., Still, K., Huhtamäki, J., Yu, C., & Rubens, N. (2011, July). Transforming innovation ecosystems through shared vision and network orchestration. In Proceedings of Triple Helix IX International Conference: “Silicon Valley: Global Model or Unique Anomaly?”, Stanford, CA.

    Google Scholar 

  • Russell, M. G., Huhtamäki, J., Still, K., Rubens, N., & Basole, R. C. (2015). Relational capital for shared vision in innovation ecosystems. Triple Helix: A Journal of University-Industry-Government Innovation and Entrepreneurship.

    Google Scholar 

  • Schwaber, K., & Beedle, M. (2001). Agile software development with Scrum. Upper Saddle River, NJ: Prentice Hall.

    Google Scholar 

  • Shakarian, P., Eyre, S., & Paulo, D. (2013). A scalable heuristic for viral marketing under the tipping model. Retrieved September 23, 2013, from http://arxiv.org/abs/1309.2963

  • Simon, H. A. (1969). The sciences of the artificial. Cambridge, MA: MIT Press.

    Google Scholar 

  • Still, K., Huhtamäki, J., Russell, M. G., Basole, R. C., Salonen, J., & Rubens, N. (2013, June 16–19). Networks of innovation relationships: Multiscopic views on Finland. In Proceedings of the XXIV ISPIM Conference—Innovating in Global Markets: Challenges for Sustainable Growth, Helsinki, Finland (p. 15).

    Google Scholar 

  • Still, K., et al. (2014). Insights for orchestrating innovation ecosystems: The case of EIT ICT Labs and data-driven network visualisations. International Journal of Technology Management, 66(2/3), 243–265.

    Article  Google Scholar 

  • Telea, A. C. (2008). Data visualization: Principles and practice. Wellesley, MA: A K Peters. Retrieved from http://www.amazon.com/Data-Visualization-Principles-Alexandru-Telea/dp/1568813066

  • Thomas, J. J., & Cook, K. A. (2006). A visual analytics agenda. IEEE Computer Graphics and Applications, 26(1), 10–13.

    Article  Google Scholar 

  • Vaishnavi, V. K., & Kuechler, W., Jr. (2007). Design science research methods and patterns: Innovating information and communication technology. Boca Raton, FL: Auerbach.

    Book  Google Scholar 

  • Vassiliadis, P. (2009). A survey of extract–transform–load technology. International Journal of Data Warehousing and Mining, 5(3), 1–27.

    Article  Google Scholar 

  • Vinoski, S. (2008). Serendipitous reuse. Internet Computing, 12(1), 84–87. Retrieved February 27, 2009, from http://steve.vinoski.net/pdf/IEEE-Serendipitous_Reuse.pdf

  • Wasserman, S., & Faust, K. (1994). Social network analysis: Methods and applications (1st ed.). New York: Cambridge University Press. Retrieved from http://www.amazon.com/dp/0521387078

  • Weng, L., Menczer, F., & Ahn, Y.-Y. (2013). Virality prediction and community structure in social networks. Scientific Reports, 3, 2522. Retrieved March 20, 2014, from http://www.nature.com/srep/2013/130828/srep02522/full/srep02522.html

  • Wikipedia. (2014). Web crawler. Wikipedia. Retrieved December 14, 2014, from http://en.wikipedia.org/w/index.php?title=Web_crawler&oldid=635502147

Download references

Acknowledgement

The research reported in this chapter was funded through resources provided by Tekes—the Finnish Funding Agency for Innovation and mediaX at Stanford University.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jukka Huhtamäki .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Huhtamäki, J., Russell, M.G., Rubens, N., Still, K. (2015). Ostinato: The Exploration-Automation Cycle of User-Centric, Process-Automated Data-Driven Visual Network Analytics. In: Matei, S., Russell, M., Bertino, E. (eds) Transparency in Social Media. Computational Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-18552-1_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-18552-1_11

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-18551-4

  • Online ISBN: 978-3-319-18552-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics