Software survey: VOSviewer, a computer program for bibliometric mapping
- 17k Downloads
We present VOSviewer, a freely available computer program that we have developed for constructing and viewing bibliometric maps. Unlike most computer programs that are used for bibliometric mapping, VOSviewer pays special attention to the graphical representation of bibliometric maps. The functionality of VOSviewer is especially useful for displaying large bibliometric maps in an easy-to-interpret way. The paper consists of three parts. In the first part, an overview of VOSviewer’s functionality for displaying bibliometric maps is provided. In the second part, the technical implementation of specific parts of the program is discussed. Finally, in the third part, VOSviewer’s ability to handle large maps is demonstrated by using the program to construct and display a co-citation map of 5,000 major scientific journals.
KeywordsBibliometric mapping Science mapping Visualization VOSviewer VOS Journal co-citation analysis
Bibliometric mapping is an important research topic in the field of bibliometrics (for an overview, see Börner et al. 2003). Two aspects of bibliometric mapping that can be distinguished are the construction of bibliometric maps and the graphical representation of such maps. In the bibliometric literature, most attention is paid to the construction of bibliometric maps. Researchers for example study the effect of different similarity measures (e.g., Ahlgren et al. 2003; Klavans and Boyack 2006a; Van Eck and Waltman 2009), and they experiment with different mapping techniques (e.g., Boyack et al. 2005; Van Eck and Waltman 2007b; White 2003). The graphical representation of bibliometric maps receives considerably less attention. Although some researchers seriously study issues concerning graphical representation (e.g., Chen 2003, 2006; Skupin 2004), most papers published in the bibliometric literature rely on simple graphical representations provided by computer programs such as SPSS and Pajek. For small maps containing no more than, say, 100 items, simple graphical representations typically yield satisfactory results. However, there seems to be a trend towards larger maps (e.g., Boyack et al. 2005; Klavans and Boyack 2006b; Leydesdorff 2004; Van Eck et al. 2006, in press), and for such maps simple graphical representations are inadequate. The graphical representation of large bibliometric maps can be much enhanced by means of, for example, zoom functionality, special labeling algorithms, and density metaphors. This kind of functionality is not incorporated into the computer programs that are commonly used by bibliometric researchers. In this paper, we therefore introduce a new computer program for bibliometric mapping. This program pays special attention to the graphical representation of bibliometric maps.
The computer program that we introduce is called VOSviewer. VOSviewer is a program that we have developed for constructing and viewing bibliometric maps. The program is freely available to the bibliometric research community (see www.vosviewer.com). VOSviewer can for example be used to construct maps of authors or journals based on co-citation data or to construct maps of keywords based on co-occurrence data. The program offers a viewer that allows bibliometric maps to be examined in full detail. VOSviewer can display a map in various different ways, each emphasizing a different aspect of the map. It has functionality for zooming, scrolling, and searching, which facilitates the detailed examination of a map. The viewing capabilities of VOSviewer are especially useful for maps containing at least a moderately large number of items (e.g., at least 100 items). Most computer programs that are used for bibliometric mapping do not display such maps in a satisfactory way.
To construct a map, VOSviewer uses the VOS mapping technique (Van Eck and Waltman 2007a), where VOS stands for visualization of similarities. For earlier studies in which the VOS mapping technique was used, we refer to Van Eck and Waltman (2007b) and Van Eck et al. (in press). VOSviewer can display maps constructed using any suitable mapping technique. Hence, the program can be employed not only for displaying maps constructed using the VOS mapping technique but also for displaying maps constructed using techniques such as multidimensional scaling. VOSviewer runs on a large number of hardware and operating system platforms and can be started directly from the internet.
In the remainder of this paper, we first discuss for what type of bibliometric maps VOSviewer is intended to be used. We then provide an overview of VOSviewer’s functionality for displaying bibliometric maps. We also elaborate on the technical implementation of specific parts of the program. Finally, to demonstrate VOSviewer’s ability to handle large maps, we use the program to construct and display a co-citation map of 5,000 major scientific journals.
Types of bibliometric maps
Two types of maps can be distinguished that are commonly used in bibliometric research.1 We refer to these types of maps as distance-based maps and graph-based maps. Distance-based maps are maps in which the distance between two items reflects the strength of the relation between the items. A smaller distance generally indicates a stronger relation. In many cases, items are distributed quite unevenly in distance-based maps. On the one hand this makes it easy to identify clusters of related items, but on the other hand this sometimes makes it difficult to label all the items in a map without having labels that overlap each other. Graph-based maps are maps in which the distance between two items need not reflect the strength of the relation between the items. Instead, lines are drawn between items to indicate relations. Items are often distributed in a fairly uniform way in graph-based maps. This may have the advantage that there are less problems with overlapping labels. In our opinion, a disadvantage of graph-based maps compared with distance-based maps is that it typically is more difficult to see the strength of the relation between two items. Clusters of related items may also be more difficult to detect.
Some mapping techniques for constructing distance-based and graph-based maps
For constructing graph-based maps, researchers in the field of bibliometrics (e.g., de Moya-Anegón et al. 2007; Leydesdorff and Rafols 2009; Vargas-Quesada and de Moya-Anegón 2007; White 2003) usually use a mapping technique developed by Kamada and Kawai (1989). Sometimes an alternative technique proposed by Fruchterman and Reingold (1991) is used (e.g., Bollen et al. 2009; Leydesdorff 2004). A popular computer program in which both techniques are implemented is Pajek (De Nooy et al. 2005). Some researchers (e.g., de Moya-Anegón et al. 2007; Vargas-Quesada and de Moya-Anegón 2007; White 2003) combine the Kamada–Kawai technique with the technique of pathfinder networks (Schvaneveldt 1990; Schvaneveldt et al. 1988). Two other computer programs that can be used to construct graph-based maps are CiteSpace (Chen 2006) and the Network Workbench Tool. Even more programs are available in the field of social network analysis (for an overview, see Huisman and Van Duijn 2005).
Distance-based and graph-based maps both have advantages and disadvantages. In VOSviewer, we have chosen to support only distance-based maps. VOSviewer can be employed to view any two-dimensional distance-based map, regardless of the mapping technique that has been used to construct the map. One can employ VOSviewer to view multidimensional scaling maps constructed using statistical packages such as SAS, SPSS, and R, but one can also employ VOSviewer to view maps constructed using other, less common techniques. Because the VOS mapping technique shows a very good performance (Van Eck et al. 2008), this technique has been fully integrated into VOSviewer. This means that VOSviewer can be used not only to view VOS maps but also to construct them. Hence, no separate computer program is needed for constructing VOS maps.
Functionality of VOSviewer
In this section, we provide an overview of VOSviewer’s functionality for displaying bibliometric maps.3 We use a data set that consists of co-citation frequencies of journals belonging to at least one of the following five closely related subject categories of Thomson Reuters: Business, Business-Finance, Economics, Management, and Operations Research & Management Science. The co-citation frequencies of journals were determined based on citations in articles published between 2005 and 2007 to articles published in 2005. A journal was included in the data set only if it had at least 25 co-citations. There were 232 journals that satisfied this condition. Based on a clustering technique, the journals in the data set were divided into five clusters. The data set is available at www.vosviewer.com.
Label view. In this view, items are indicated by a label and, by default, also by a circle. The more important an item, the larger its label and its circle. If colors have been assigned to items, each item’s circle is displayed in the color of the item. By default, to avoid overlapping labels, only a subset of all labels is displayed. The label view is particularly useful for a detailed examination of a map.An example of the label view is shown in Fig. 4. The map shown in this figure was constructed based on the journal co-citation data set discussed at the beginning of this section. Colors indicate the cluster to which a journal was assigned by the clustering technique that we used. As can be seen, there is a strong agreement between the structure of the map and the clustering obtained using our clustering technique. The clustering also has a straightforward interpretation. The five clusters correspond with the following research fields: accounting/finance, economics, management, marketing, and operations research.4 It is clear that the map shown in Fig. 4 is much easier to interpret than the maps shown in Figs. 1 and 2. This demonstrates one of the main advantages of VOSviewer over commonly used computer programs such as SPSS and Pajek.
Density view. In this view, items are indicated by a label in a similar way as in the label view. Each point in a map has a color that depends on the density of items at that point. That is, the color of a point in a map depends on the number of items in the neighborhood of the point and on the importance of the neighboring items. The density view is particularly useful to get an overview of the general structure of a map and to draw attention to the most important areas in a map. We will discuss the technical implementation of the density view later on in this paper.An example of the density view is shown in Fig. 5. The map shown in this figure is the same as the one shown in Fig. 4. The density view immediately reveals the general structure of the map. Especially the economics and management areas turn out to be important. These areas are very dense, which indicates that overall the journals in these areas receive a lot of citations. It can also be seen that there is a clear separation between the fields of accounting, finance, and economics on the one hand and the fields of management, marketing, and operations research on the other hand. Like Fig. 4, Fig. 5 demonstrates VOSviewer’s ability to provide easy-to-interpret graphical representations of bibliometric maps.
Cluster density view. This view is available only if items have been assigned to clusters. The cluster density view is similar to the ordinary density view except that the density of items is displayed separately for each cluster of items. The cluster density view is particularly useful to get an overview of the assignment of items to clusters and of the way in which clusters of items are related to each other. We will discuss the technical implementation of the cluster density view later on in this paper.
Unfortunately, the cluster density view cannot be shown satisfactorily in black and white. We therefore do not show an example of the cluster density view.
Scatter view. This is a simple view in which items are indicated by a small circle and in which no labels are displayed. If colors have been assigned to items, each item’s circle is displayed in the color of the item. The scatter view focuses solely on the general structure of a map and does not provide any detailed information.
In addition to the four views discussed above, another important feature of VOSviewer is its ability to handle large maps. VOSviewer can easily construct maps that contain several thousands of items, and it can display maps that contain more than 10,000 items. VOSviewer has functionality for zooming, scrolling, and searching, which facilitates the detailed examination of large maps. When displaying a map, VOSviewer uses a special algorithm to determine which labels can be displayed and which labels cannot be displayed without having labels that overlap each other. The further one zooms in on a specific area of a map, the more labels become visible. Later on in this paper, we will demonstrate VOSviewer’s ability to handle large maps by using the program to construct and display a co-citation map of 5,000 major scientific journals. In the next two sections, however, we will first elaborate on the technical implementation of specific parts of VOSviewer.
Construction of a map
VOSviewer constructs a map based on a co-occurrence matrix. The construction of a map is a process that consists of three steps. In the first step, a similarity matrix is calculated based on the co-occurrence matrix. In the second step, a map is constructed by applying the VOS mapping technique to the similarity matrix. And finally, in the third step, the map is translated, rotated, and reflected. We now discuss each of these steps in more detail.
Step 1: similarity matrix
Step 2: VOS mapping technique
We now discuss how the VOS mapping technique constructs a map based on the similarity matrix obtained in Step 1. A more elaborate discussion of the VOS mapping technique, including an analysis of the relation between the VOS mapping technique and multidimensional scaling, is provided by Van Eck and Waltman (2007a). Some results of an empirical comparison between the VOS mapping technique and multidimensional scaling are reported by Van Eck et al. (2008). A simple open source computer program that implements the VOS mapping technique is available at www.neesjanvaneck.nl/vos/.
Step 3: translation, rotation, and reflection
Translation. The solution is translated in such a way that it becomes centered at the origin.
Rotation. The solution is rotated in such a way that the variance on the horizontal dimension is maximized. This transformation is known as principal component analysis.
Reflection. If the median of x11,…,xn1 is larger than 0, the solution is reflected in the vertical axis. If the median of x12,…,xn2 is larger than 0, the solution is reflected in the horizontal axis.
These three transformations are sufficient to ensure that VOSviewer produces consistent results.
Density view and cluster density view
In this section, we discuss the technical implementation of the density view and the cluster density view. Recall that in VOSviewer the cluster density view is available only if items have been assigned to clusters.
Item densities calculated using (5) are translated into colors using a color scheme. By default, VOSviewer uses a red–green–blue color scheme (see Fig. 5). In this color scheme, red corresponds with the highest item density and blue corresponds with the lowest item density.
Finally, we note that the above-described calculation of the color of a point in a map is performed only for a limited number of points. These points are located on a grid. The colors of points that do not lie on this grid are obtained through interpolation.
Cluster density view
After calculating item densities, the color of a point in a map is determined in two steps. Each cluster is associated with a color. In the first step, the colors of the clusters are mixed together. This is done by calculating a weighted average of the colors, where the weight of a color equals the item density for the corresponding cluster, as given by (7). In the second step, the color obtained in the first step is mixed with the (black or white) background color of the cluster density view. The proportion in which the two colors are mixed depends on the total item density of a point, which is given by (5). The lower the total item density of a point, the closer the color of the point is to the background color.
Large-scale application of VOSviewer
To demonstrate VOSviewer’s ability to handle large maps, we use the program to construct and display a co-citation map of 5,000 major scientific journals. For earlier studies in which journal maps of similar size were presented, we refer to Bollen et al. (2009), Boyack et al. (2005), and Leydesdorff (2004).
Our journal co-citation map provides an overview of the structure of the scientific world. Clusters of related journals can be identified in the map, and these clusters can be linked to scientific fields. Clusters that are located close to each other in the map indicate closely related fields. As can be seen in Fig. 6, the map has a more or less circular structure. The center of the map is relatively empty. At a global level, the interpretation of the map is fairly straightforward. The right part of the map covers the medical sciences. Moving counterclockwise from the medical sciences, the following major fields can be identified: life sciences, chemistry, physics, engineering, mathematics, computer science, social sciences, and psychology. Psychology is again closely related to the medical sciences, which completes the circular structure of the map.
There seems to be only one earlier study in which distance-based journal maps of similar size as our map were presented. This study was done by Boyack et al. (2005). Boyack et al. presented two kinds of journal maps, namely maps based on journal-to-journal citation data and maps based on journal co-citation data. Comparing the global structure of the maps of Boyack et al. with the global structure of our map, there turn out to be both some similarities and some differences. On the one hand, the way in which major scientific fields are located relative to each other is fairly similar in the maps of Boyack et al. and in our map. On the other hand, the general shape of the maps of Boyack et al. is quite different from the general shape of our map. In the maps of Boyack et al., clusters of journals are located more or less equally distributed within an almost perfect circle. This seems to be a structure that has been imposed by the VxOrd mapping technique used by Boyack et al. A disadvantage of this structure is that in the center of the maps of Boyack et al. different fields can be identified that do not really seem to have much in common. In our map constructed using VOSviewer, we cannot find any indications of a structure that has been imposed by the mapping technique. The general shape of our map seems to have been determined by the data rather than by the mapping technique that we used. A noticeable difference between our map and the maps of Boyack et al. is the relatively empty center of our map. Due to the relatively empty center, fields between which there are no strong relations are clearly separated from each other.
In this paper, we have presented VOSviewer, a freely available computer program for constructing and viewing bibliometric maps. Unlike programs such as SPSS and Pajek, which are commonly used for bibliometric mapping, VOSviewer pays special attention to the graphical representation of bibliometric maps. The functionality of VOSviewer is especially useful for displaying large bibliometric maps in an easy-to-interpret way.
VOSviewer has been used successfully in various projects carried out by the Centre for Science and Technology Studies. In future research on bibliometric mapping, we expect to rely heavily on VOSviewer. By making VOSviewer freely available to the bibliometric research community, we hope that others will benefit from it as well.
We do not consider maps that are primarily intended for showing developments over time. Such maps are for example provided by the HistCite software of Eugene Garfield (e.g., Garfield, 2009).
A computer implementation of VxOrd is available at www.cs.sandia.gov/~smartin/software.html as part of the DrL toolbox.
For a more extensive discussion of the functionality of VOSviewer, we refer to the VOSviewer manual, which is available at www.vosviewer.com.
Although this is not directly visible in Fig. 4, we note that there is a large overlap in the map between the Business and Management subject categories of Thomson Reuters. This indicates an important difference between the clustering that we found and the clustering provided by the subject categories of Thomson Reuters.
The VOS mapping technique can also be used to construct maps in more than two dimensions. However, VOSviewer does not support this. The VOS software available at www.neesjanvaneck.nl/vos/ does support the construction of maps in more than two dimensions.
By default, VOSviewer uses h = 0.125. This generally seems to work fine. However, if necessary, the value of h can be changed.
Notice that Scientometrics is also visible in Fig. 7 (in the right part of the figure).
We thank Ed Noyons, Cathelijn Waaijer, and two anonymous referees for valuable comments on earlier drafts of this paper.
This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
- Chen, C. (2003). Mapping scientific frontiers. Berlin: Springer.Google Scholar
- Davidson, G. S., Wylie, B. N., & Boyack, K. W. (2001). Cluster stability and the use of noise in interpretation of clustering. In Proceedings of the IEEE symposium on information visualization 2001, pp. 23–30.Google Scholar
- De Nooy, W., Mrvar, A., & Batagelj, V. (2005). Exploratory social network analysis with Pajek. Cambridge: Cambridge University Press.Google Scholar
- Huisman, M., & Van Duijn, M. A. J. (2005). Software for social network analysis. In J. Scott, S. Wasserman, & P. J. Carrington (Eds.), Models and methods in social network analysis (pp. 270–316). Cambridge: Cambridge University Press.Google Scholar
- Schvaneveldt, R. W. (Ed.). (1990). Pathfinder associative networks. Westport: Ablex.Google Scholar
- Van Eck, N. J., & Waltman, L. (2007a). VOS: A new method for visualizing similarities between objects. In H.-J. Lenz & R. Decker (Eds.), Advances in data analysis: Proceedings of the 30th annual conference of the German Classification Society (pp. 299–306). Heidelberg: Springer.Google Scholar
- Van Eck, N. J., Waltman, L., Dekker, R., & Van den Berg, J. (2008). An experimental comparison of bibliometric mapping techniques. Paper presented at the 10th International Conference on Science and Technology Indicators, Vienna.Google Scholar
- Van Eck, N. J., Waltman, L., Noyons, E. C. M., & Buter, R. K. (in press). Automatic term identification for bibliometric mapping. Scientometrics.Google Scholar
- Van Eck, N. J., Waltman, L., Van den Berg, J., & Kaymak, U. (2006). Visualizing the computational intelligence field. IEEE Computational Intelligence Magazine, 1(4), 6–10.Google Scholar
- Vargas-Quesada, B., & de Moya-Anegón, F. (2007). Visualizing the structure of science. New York: Springer.Google Scholar