
1 Introduction

Consider the problem of evaluating and improving the performance of teams in organizations based on employee responses to questionnaires. The teams differ in size, and the roles of employees may differ per organization. A key question for an organization's top management is how to support the autonomy of these teams while still keeping an eye on the overall process and the coherence of the teams' performance. Assuming a span of control of 10–15 direct reports for an average manager, a medium-sized organization may easily comprise hundreds of teams. Pattern recognition in organizational development may therefore supply fundamentally important information about how similar – or dissimilar – teams are [1, 15, 20]. A possible solution is to focus on the diversity within a team: is there a large group of people who are all doing a similar job, or are there some isolated groups of people who are doing something very different from the rest? Identifying such groups – clusters of employees – would help to compare organizational structures at a higher level.

More formally, in this paper we focus on comparing sets (teams) of different samples (employees) residing in different feature spaces (evaluation questions). Comparing team structures is then equivalent to comparing similarity matrices, with each similarity matrix originating from a single team. Comparing similarities alleviates the problem of different feature spaces, yet is still not trivial because the sets can be of different sizes and there are no natural correspondences between the samples.

Comparing distance matrices has links with comparing graph structures: a distance matrix between N samples can be seen as a fully connected graph with N nodes, where the nodes are unlabeled and the edges are associated with weights. In graph-based pattern recognition, approaches such as graph edit distance [3, 21] or graph kernels [10, 12] have been used to define distance or similarity measures between graphs. Graph matching approaches search for the best correspondence between the nodes and define the graph distance as a measure of discrepancy between the matched nodes and edges. Graph kernels define similarity by considering all possible correspondences. However, the search space for correspondences becomes very large if the nodes are unlabeled and the graph is fully connected. In [16] we used a threshold on the distances to reduce the number of edges, but this threshold had a large influence on the results, suggesting that the larger distances, too, can be informative.

To avoid removing informative edges and to obtain a computationally efficient solution, in this paper we focus on feature representations of distance matrices. By representing each distance matrix in the same feature space, the matrices can be compared with each other, for example using the Euclidean distance. We investigate several representations, based on spectra [6], histograms of all distances [18], histograms of nearest-neighbor distances, and hubness properties [23]. A detailed description of the representations is given in Sect. 2.

In Sect. 3 we investigate how well these feature representations can encode the class information for some artificial examples. In Sect. 4 we investigate how well these representations perform on multiple instance learning (MIL) datasets, where the goal is to classify sets of feature vectors. In Sect. 4.2 we apply the representations to real-world organisational data and discuss some of the insights that arise from comparing teams of people.

2 Dissimilarity Matrix Representation

We assume we have a collection of N square dissimilarity matrices \(\{D_n\in \mathbb {R}^{m_n\times m_n}; n=1,\ldots ,N\}\). One element of matrix \(D_n\) is indicated by \(D_n(i,j)\). We assume that the matrices have the following characteristics:

  • The dissimilarities of objects to themselves are zero (i.e. the \(D_n\) have zeros on the diagonal), and the dissimilarities are symmetric (\(D_n(i,j)=D_n(j,i)\)). In situations where the matrices are not symmetric, they are made symmetric by averaging \(D_n\) and its transpose: \(\tilde{D}_n = (D_n+D_n^\top )/2\) (see the sketch after this list).

  • The size \(m_n\times m_n\) of the matrices can be different for each \(D_n\). It is assumed that the matrices have a minimum size of \(3\times 3\).

  • The order of the rows and columns is arbitrary and may be permuted without altering the information that is stored in the dissimilarity matrix.
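As a minimal illustration of these assumptions, the following sketch enforces a zero diagonal and symmetry on a given matrix (the function name symmetrize and the use of NumPy are our assumptions, not part of the original method description):

```python
import numpy as np

def symmetrize(D):
    """Enforce the assumed properties: zero diagonal and symmetry."""
    D = np.array(D, dtype=float)   # copy, so the input is not modified
    np.fill_diagonal(D, 0.0)       # self-dissimilarities are zero
    return (D + D.T) / 2           # average with the transpose
```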

For this data type we investigate a few simple vector representations. Other ways of comparing the dissimilarity matrices are also possible (such as embedding each matrix into a low-dimensional space and using the earth mover's distance, or matching the rows and columns of the dissimilarity matrices and computing the Frobenius norm [14]), but these tend to be computationally expensive. Here we focus on vector representations of the dissimilarity matrices.

We consider the following representations (a code sketch of several of them follows the list):

  1. spectrum features spect: use the k largest eigenvalues \(\sigma \) of the centered matrix \(D_n\):

    $$\begin{aligned} \mathbf {x}_n = \sigma _{1:k}(C^\top D_n C),\quad \text {where}\,\, C = \mathbb {I}_{m_n}-\frac{1}{m_n}\mathbf {1}\mathbf {1}^\top \end{aligned}$$
    (1)
  2. histogram of distances hist: collect all dissimilarities from all matrices, split the range of dissimilarity values from 0 to the maximum into k equally sized bins, and count for each \(D_n\) the number of occurrences in each bin. Optionally, the counts can be converted into frequencies by dividing by \(m_n(m_n+1)/2\).

  3. equalized histogram of distances histeq: split the range of dissimilarity values into k bins with an equal number of counts (instead of using equally wide bins). The bins become wider where dissimilarity values occur rarely, and narrower where values appear frequently.

  4. histogram of the k-nearest neighbor distances distnn: instead of collecting all dissimilarities, only the dissimilarities up to the k-th nearest neighbor are used. Per row of \(D_n\), only \(k< m_n\) dissimilarities are used; the total number of dissimilarities is therefore reduced from \(m_n(m_n+1)/2\) to \(m_n k\). This captures variations in local densities better.

  5. histogram of how often samples are the k-th nearest neighbor of other samples disthub: a measure used in hub analysis [24]. First the dataset is represented by a k-occurrence histogram, which stores how often each sample is the k-th nearest neighbor of others. To make this representation comparable across datasets of different sizes, it is summarized by q quantiles of the histogram. For the final representation, we concatenate the quantile histograms for \(|K|\) different values of \(k \in \{1,3,\ldots \}\), resulting in a \(|K|\times q\)-dimensional feature vector.
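As a concrete illustration, the sketch below implements the spect, hist and distnn representations in Python (the function names and the pooled bin edges passed in as edges are our assumptions, not the original implementation):

```python
import numpy as np

def spect(D, k=3):
    """Spectrum features: the k largest eigenvalues of the centered matrix, Eq. (1)."""
    m = D.shape[0]
    C = np.eye(m) - np.ones((m, m)) / m
    eigvals = np.linalg.eigvalsh(C.T @ D @ C)  # C^T D_n C is symmetric
    return np.sort(eigvals)[::-1][:k]

def hist_feats(D, edges):
    """Histogram of all (upper-triangular) dissimilarities over fixed bin edges."""
    d = D[np.triu_indices_from(D, k=1)]
    counts, _ = np.histogram(d, bins=edges)
    return counts / len(d)  # optionally convert counts to frequencies

def distnn_feats(D, k, edges):
    """Concatenated histograms of the 1st up to the k-th nearest-neighbor distances."""
    m = D.shape[0]
    Ds = np.sort(D + np.diag(np.full(m, np.inf)), axis=1)  # push self-distances last
    return np.concatenate(
        [np.histogram(Ds[:, j], bins=edges)[0] / m for j in range(k)])
```

For hist, the bin edges would be computed once from the pooled dissimilarities of all matrices; histeq would instead place the edges at quantiles of the pooled values.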

In some situations we might want to be invariant to a (non-linear) scaling of the dissimilarity values. For example, an expert may only have provided a relative ranking, rather than an exact dissimilarity between two elements of a set. In this case, the extracted features should be invariant to the scaling of the dissimilarities. Of the representations above, only disthub is invariant.

3 Illustrative Examples

To show the characteristics of the different representations, we construct several multi-class artificial datasets (a generation sketch for one of them follows the list). Depending on the experiment, the number of dissimilarity matrices and their sizes are varied.

  • The cluster dataset is constructed to investigate how well clustering structure can be characterized. In cluster, the dissimilarity matrices are computed from 2-dimensional datasets containing samples belonging to a varying number of clusters (up to four). The class label of a dissimilarity matrix equals the number of clusters, so this defines a 4-class classification problem.

  • The subspace dataset is constructed to investigate how well subspace structure can be characterized. In subspace, the dissimilarity matrices are derived from p-dimensional Gaussian distributions, where the dimensionality is \(p=1\) for class 1, \(p=2\) for class 2, up to \(p=4\) for class 4.

  • The outlier dataset is used to investigate the sensitivity to outliers. In outlier, the matrices are derived from 2-dimensional Gaussian distributions (zero mean, identity covariance matrix). Class 1 does not contain outliers, class 2 contains one outlier drawn from a Gaussian distribution with a 10 times larger covariance matrix, and class 3 contains two such outliers.
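A minimal sketch of how one dissimilarity matrix of the cluster dataset could be generated (the name make_cluster_bag, the cluster spread and the placement of the centers are our assumptions; the exact generation procedure may differ):

```python
import numpy as np

def make_cluster_bag(n_clusters, m=30, spread=5.0, rng=None):
    """Sample a 2-D dataset with the given number of clusters and return
    its Euclidean dissimilarity matrix; the label is the number of clusters."""
    rng = rng if rng is not None else np.random.default_rng()
    centers = rng.uniform(-spread, spread, size=(n_clusters, 2))
    member_of = rng.integers(n_clusters, size=m)        # cluster of each sample
    X = centers[member_of] + rng.standard_normal((m, 2))
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))            # m x m distance matrix
```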

Fig. 1. The different feature representations for dissimilarity matrices derived from the cluster dataset with one, two, three or four clusters.

Figure 1 shows the five different representations for a sample of 100 dissimilarity matrices drawn from the cluster dataset. For the spectrum representation three features are computed, for the (equalized) histograms \(k=10\), and for the disthub representation in total 75 features are computed. This cluster dataset has a very clear structure, and all representations are able to distinguish well between the four classes. In particular, the distinction between datasets with one cluster and those with more than one cluster is easy to make. For the disthub representation the distinction between the classes is less visible in the figure, due to the large difference in scales between the different features.

For each dataset, we compute the dissimilarity matrices using the Euclidean distances between the samples. We then compute the different representations and use a linear classifier (LDA) to distinguish the different classes per dataset.
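A sketch of this protocol for the cluster data, reusing hist_feats and make_cluster_bag from the earlier sketches (the scikit-learn classes are our choice of tooling, not necessarily the original one):

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import cross_val_score

# 25 dissimilarity matrices per class, labeled by their number of clusters
matrices = [make_cluster_bag(c) for c in (1, 2, 3, 4) for _ in range(25)]
y = np.repeat([1, 2, 3, 4], 25)

edges = np.linspace(0, max(D.max() for D in matrices), 11)  # k = 10 bins
X = np.vstack([hist_feats(D, edges) for D in matrices])
print(cross_val_score(LinearDiscriminantAnalysis(), X, y, cv=10).mean())
```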

Fig. 2. Learning curves showing the error as a function of the number of training dissimilarity matrices. From left to right: results on the cluster, subspace and outlier data.

Figure 2 shows the classification performance on the three artificial sets as a function of the number of training matrices. The size of the individual dissimilarity matrices is fixed to \(m_n=30\). In many situations the (equalized) histogram is able to capture the information needed for good generalization. The histogram estimates start to suffer from noise for very small datasets and for situations where there is no clustering structure and only the subspace dimensionality is informative. In these situations a spectrum representation is to be preferred. When very large training sets are available, it is advantageous to use the nearest-neighbor distance histograms. Because distnn combines the histograms of the first, second, and all higher nearest neighbors, this representation becomes very high-dimensional, but also very rich.

Fig. 3. The classification performance as a function of the size \(m_n\) of the dissimilarity matrices, for the cluster data (left), the subspace data (middle) and the outlier data (right).

Figure 3 shows curves similar to those in Fig. 2, but here the sizes of the individual dissimilarity matrices are varied while the number of training matrices is fixed to \(N=100\) per class. Here as well, the (equalized) histograms perform well when the dissimilarity matrices are large: there is then a sufficient number of values available to estimate the histograms well. For very small matrices, and for characterizing subspace structure or outliers, the spectrum performs well. Somewhat surprisingly, to characterize clustering structure with small dissimilarity matrices, the nearest-neighbor distances are most effective, although this representation tends to overfit with larger matrices.

4 Experiments

We distinguish between two sets of experiments: a supervised and an unsupervised set. For the supervised set, we have a collection of labeled dissimilarity matrices. Here we use the bags from MIL data, where the distances between the instances in one bag give one dissimilarity matrix, and each matrix is labeled according to the original bag label (positive or negative). For the unsupervised set we only have a collection of dissimilarities within teams of people, for which we want to investigate how much variability is present in the teams, and what constitutes this variability.

4.1 Supervised Experiments: Multiple Instance Learning

We look at a wide variety of multiple instance learning (MIL) problems. In MIL, the n-th sample is a bag \(B_n = \{\mathbf {x}_{n1}, \mathbf {x}_{n2}, \ldots , \mathbf {x}_{nm_n}\}\) of \(m_n\) instances. The goal is to classify bags, based on the presence of concept feature vectors or on the overall distribution of the bag's instances. Consider image classification, where a bag is an image and an instance is an image patch. When classifying images of tigers, a patch containing a tiger is an example of a concept instance. When classifying images of scenes, it might be more reasonable to examine several patches before deciding what type of environment the image depicts.

Characteristics of the datasets are listed in Table 1. From our previous experience with these datasets [5, 25], we expect them to contain a mix of concept-like and distribution-like problems. Note that in our previous work [5, 25] we represented each bag by its dissimilarities to a set of prototype bags, whereas here we use an absolute representation where each bag is represented by the dissimilarities between its own instances.

Table 1. Characteristics of MIL datasets. Most of the datasets are available for download from http://www.miproblems.org

We removed bags that contained only 1 or 2 instances. We then represented each bag by an \(m_n \times m_n\) dissimilarity matrix between its instances, where the dissimilarity is simply the Euclidean distance between the feature vectors, and computed for each matrix the representations described in Sect. 2. We used two classifiers: a linear discriminant classifier and a 1-nearest neighbor classifier. The experiments were performed using 10-fold cross-validation, where the best hyperparameter for each representation type (the optimal value for k) was determined on the training set using a second, internal 10-fold cross-validation. We chose \(k \in \{5, 10, 25, 50, 100\}\).
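A minimal sketch of this selection step (bag_to_matrix and pick_k are names we introduce; hist_feats is reused from the sketch in Sect. 2):

```python
import numpy as np
from scipy.spatial.distance import pdist, squareform
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import cross_val_score

def bag_to_matrix(B):
    """Euclidean dissimilarity matrix between the instances of one bag."""
    return squareform(pdist(B))

def pick_k(matrices, y, ks=(5, 10, 25, 50, 100)):
    """Inner 10-fold CV on the training bags to select the number of bins k."""
    d_max = max(D.max() for D in matrices)
    scores = {}
    for k in ks:
        edges = np.linspace(0, d_max, k + 1)
        X = np.vstack([hist_feats(D, edges) for D in matrices])
        scores[k] = cross_val_score(
            LinearDiscriminantAnalysis(), X, y, cv=10).mean()
    return max(scores, key=scores.get)
```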

We report the AUC performances of both classifiers, using the best parameters for each representation type. For reference, we also list the best performance of traditional MIL classifiers. The classifiers that often perform well are MILES [4], MI-SVM [2], EM-DD [27], a logistic classifier trained on a bag summary representation (based on the mean instance, or the min/max values per feature) [11], and the p-posterior classifier [26] (Table 2).

Table 2. Mean AUC (standard deviation), \(\times 100\), of two classifiers on five representations of MIL bags. Bold = best or not significantly worse than the best representation per classifier.

The results are similar to those on the artificial data: when the dissimilarity matrices are small, a spectrum representation is preferred. When larger training sets are available, it is often good to choose an equalized histogram. These histograms tend to become relatively high-dimensional, so the classifier cannot be too complex; a linear classifier is therefore a good choice.

It is also surprising that, although these representations discard the absolute locations of the instances in feature space, it is still possible to achieve very reasonable classification performance. For some datasets the classification performance exceeds the best performance achieved so far (comp.graphics, Biocreative) or is comparable to it (Corel African, alt.atheism, Harddrive). For datasets that contain a specific concept (Musk1, Musk2, AjaxOrange, Web recomm. 1, Brown Creeper), a classifier that has access to the individual feature vectors is better off.

4.2 Unsupervised Experiments: Analysis of Teams of People

Given the required speed in a strategic decision-making process, we used an online survey for the unsupervised gathering of a strategic status update from 20,191 employees in 1,378 teams in 277 different client projects on, for example, Human Resource Management, Information Technology, and Marketing and Sales. We did not use a Likert scale, given the subsequent need for statistical corrections for the structure of the survey [9], for various response styles [7], for a variety of sampling errors [19], and for a wide variety of biases [22]. Instead, we opted for a Guttman scale with objectively verifiable answers [8, 13]. The assessment questions differed between teams. Four types of assessment can be distinguished: (1) human resources (HR): team effectiveness, competency assessments, cultural aspects; (2) strategy: how the strategy is ultimately incorporated, innovation assessment; (3) marketing and sales: analysis of client processes, commercial guidance of shops; and (4) IT: project assessment, IT processes, IT governance. From the answers a dissimilarity matrix is derived by computing the pairwise Euclidean distances between the answer scores of all members of a team.

Fig. 4. t-SNE visualisations of 1378 teams of people. Left: spectrum representation of all the teams; the size of the circles indicates the size of the corresponding dissimilarity matrix. Right: equalized histogram representation for teams that received a human resource assessment.

Figure 4 shows the resulting embeddings using the spectrum and equalized histogram representations. Both representations are 5-dimensional, and the 2-dimensional embedding of the 5D data is obtained using t-SNE [17] (a sketch of this embedding step is given below). In the left subplot the marker size indicates the size \(m_n\) of the corresponding matrix \(D_n\). It appears that the first important component is the team size. The plot also shows that there is more variation among the smaller teams, suggesting that in smaller teams there is more room for specialisation. Larger teams tend to become more similar to each other.
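A minimal sketch of this embedding step (the perplexity value is an assumption, not necessarily the setting used for the figure):

```python
from sklearn.manifold import TSNE

# X: one 5-dimensional feature vector (spectrum or equalized histogram) per team
emb = TSNE(n_components=2, perplexity=30.0).fit_transform(X)
# emb[:, 0] and emb[:, 1] are the 2-D coordinates shown in Fig. 4
```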

When we normalise for team size and focus on one type of questionnaire (Human Resources), we obtain the scatterplot on the right. There is one prominent outlier team; this turns out to be a team that received a questionnaire with 160 questions, while normally fewer than 20 questions are used. Furthermore, there is a large cluster on the left, which contains teams with fairly homogeneous members, and a long tail up to the right where teams exhibit stronger and stronger clusters of subteams. The teams farthest into the tail show a clear clustering, while teams closer to the homogeneous cluster contain only a few outlying members.

5 Conclusions

We compared several feature vector representations for characterizing (square) dissimilarity matrices that can vary in size and whose rows and columns can be arbitrarily permuted. The spectrum representation is very effective, in particular when the sample sizes are small: it can not only characterize the intrinsic dimensionality, it is also able to characterize cluster structure. When a large sample size is available, it is often advantageous to use the more descriptive histograms of distances. These results are observed both on artificial data and on some real-world MIL problems. For MIL, the representations combined with a linear or nearest-neighbor classifier are sometimes competitive with state-of-the-art classifiers.

We then used the representations in an unsupervised manner to characterize real-world organizations. Our analysis revealed some clusters of organizations that could be interpreted by an expert. Given the current dissimilarity scores, we suggest further research into the extent to which organizations are similar with respect to issues that affect a multitude of teams (a top management issue), a single team (a middle management issue) or a single employee (a lower management issue), and whether that similarity is particularly present in specific management topics (for example, Human Resource Management) and/or in specific industries (e.g. Professional Services).