Developing Non-Empirical Metrics and Tools for Ontology Visualizations Evaluation and Comparing

1. Introduction

2. Related works

3. Expert evaluation

4. Information measuring

5. Realization

6. Conclusion

References

Scientific Visualization

3.1. Completeness

3.2. Semanticity

3.3. Conservativeness

3.4. Example of evaluation

4.1. Hypergraphs as the formal framework

4.2 Comparing visualizations with the information measure

National Research Nuclear University "MEPhI"

Open Access Electronic Journal

Baimuratov, I.; Nguyen, Th.; Golchin, R.; Mouromtsev, D.

doi:10.26583/sv.12.4.07

Scientific Visualization, 2020, volume 12, number 4, pages 71 - 84, DOI: 10.26583/sv.12.4.07

Developing Non-Empirical Metrics and Tools for Ontology Visualizations Evaluation and Comparing

Authors: I. Baimuratov¹, Th. Nguyen², R. Golchin³, D. Mouromtsev⁴

ITMO University, Kronverksky Pr. 49, bldg. A, St. Petersburg, 197101, Russia

¹ ORCID: 0000−0002−6573−131X, baimuratov.i@gmail.com

² ORCID: 0000−0002−6679−7839

³ ORCID: 0000−0003−0485−4014

⁴ ORCID: 0000−0002−0644−9242

Abstract

There are numerous ontology visualization systems, however, the choice of a visualization system is non-trivial, as there is no method for evaluation and comparing them, except for empirical experiments, that are subjective and costly. In this research, we aim to develop non- empirical metrics for ontology visualizations evaluation and comparing. First, we propose several half-formal metrics that require expert evaluation. These metrics are completeness, semanticity, and conservativeness. We apply the proposed metrics to evaluate and compare VOWL, Graphol and Logic Graphs visualization systems. And second, we develop a completely computable measure for the complexity of ontology visualizations, based on graph theory and information theory. In particular, ontology visualizations are considered as hypergraphs and the information measure is derived from the Hartley function. The usage of the proposed information measure is exemplified by the evaluation of visualizations of the sample of axioms from the DoCO ontology in Logic Graphs and Graphol. These results can be practically applied for choosing ontology visualization systems in general and regarding a particular ontology. The application for ontology visualization evaluation and comparing based on the formal metrics is provided.

Keywords: Ontology Visualization, Expert Evaluation, Hypergraphs, Information Measure.

Visualization of an ontology improves comprehension of knowledge it contains. There are numerous ontology visualization systems, the reviews are presented in [1–3]. However, the choice of a visualization system is non-trivial, as there is no method for evaluation and comparing of ontology visualization systems present in the literature, except for empirical experiments, that are subjective and costly.

Therefore, we aim to develop formal metrics for ontology visualizations estimation. ’Formal’ means that they must be objective and computable. Some metrics we propose require external knowledge of the language being visualized, its semantics, and knowledge of other visualization systems, therefore, they are half-formal and require expert evaluation.

Another criterion we propose is based on graph theory and information theory and is fully computable. The intended information measure should estimate not the content of the ontology, as it is the same for each visualization, but the complexity of its form. It implies the following requirements for the intended information measure:

· as it should estimate the complexity of an ontology visualization, it should depend on the complexity of its structure, in other words, on the number of nodes, edges, and types of edges;

· it should be normalized, as visualizations of the same ontology in different visualization systems can have a different number of nodes and edges;

· as it should measure the visualization complexity, a visualization with a greater number of nodes, edges, or edges types should have a higher value of the measure.

The outline of the paper is as follows: in the next section we consider related works, in Section 3 we propose several metrics for expert evaluation, in Section 4 we derive the information measure for ontology visualizations complexity, and finally Section 5 contains the description of the developed application for ontology visualization evaluation and comparing.

A rather small number of works are devoted to the assessment of ontology visualization systems in general.However, as ontology visualization results in a graph, works on graph visualization are also relevant. In [4] the authors propose some recommendations considering graph diagrams, like minimization of crossings between edges. The authors of [5] present empirical research on applying these criteria to automatic graph layout algorithms. In [6] several new shape-based metrics are proposed for large graphs. All these metrics are based on empirical experiments, i.e. on human assessments.

In [7] a metric for graphs with known clustering is described. The authors propose to cluster the nodes in the graph image and to use an existing measure of similarity to compare this clustering and the ground truth one. Unfortunately, ground truth is often not known in the case of ontologies.

(1)

where is a graph, represents a graph invariant and α is an equivalence relation which partitions into k subsets .

The authors of [9] perform information-theoretic analysis of edge bundling visualizations in terms of adjacency matrices and mutual information. The main idea is that the mutual information between the visual description and the raw data should be maximized

(2)

where is the entropy of and is the conditional entropy of given .

But none of the measures presented there satisfy our requirement. Therefore, we develop a new graph information measure.

First, we propose to consider several features of visualization systems that, though related to the formal properties, like completeness, still require expert evaluation, as they involve external knowledge.

The most important property of a visualization system is its completeness with respect to the language being visualized, because if a visualization system can not represent some axioms of an ontology, the system can not be applied to the ontology. In addition, a common reference language serves as a common denominator for comparing different visualization systems.

Ontologies are denoted on the OWL language [10]. The OWL 1 standard provided three increasingly expressive sub-languages: OWL Lite, OWL DL, and OWL Full. In this paper, we consider OWL DL language, as it provides the maximum expressiveness, retaining decidability.

The formal foundation of OWL is description logics (DLs) [11]. DLs are a family of logic languages, that can be used to represent the terminological knowledge of an application domain. We consider axioms, formulated with the SHOIN description logic syntax, as it corresponds to OWL DL language. We evaluate completeness of a visualization system by counting the number of SHOIN syntax entities that the system can represent.

We suppose, the advantage of a visualization with respect to a reference language is that it improves comprehension of a formula with representing its semantic. Therefore, we propose to evaluate the ability of a visualization system to represent semantics of expressions.

We consider a diagram of a visualization system for a logical relation as semantical, if it represents the semantic of the relation. For example, compare the visualization of conjunction from Graphol [12], Fig. 1, with the corresponding Venn diagram [13], Fig. 2. Venn diagram represents that these two sets have common elements, while in Graphol conjunction is just labeled with a hexagon.


Fig.1. Conjunction in Graphol	Fig.2. Conjunction in Venn diagrams

Finally, we suppose that it is important to use existing graphic primitives from mathematical theories, as in the other case, i.e. introducing new graphic primitives, instead of helping a user to understand an ontology it forces him or her to learn just one more language. Considering again the example above, in Graphol a user has to learn that hexagon denotes conjunction, while if the Venn diagram was used, the user familiar with Venn diagrams would have understood the diagram without additional instructions. Therefore, we consider a graphic primitive to be conservative if an expert can name an already existing formal system, where it is adopted from.

We provide an evaluation of the VOWL [14], Graphol, and Logic Graphs (LGs) [15] visualization system as examples. We examined its completeness with respect to OWL DL language, its semanticity, the ability to represent the semantics of relations, and whether its graphic primitives are new or adopted from some common visualization systems. See Table 1, Table 2 and Table 3 respectively.

We see that VOWL can represent only 12 of 15 entities of the SHOIN description logic, therefore, its completeness rate is 0.8. The diagrams for concepts, conjunction, disjunction, and equivalence are semantical and conservative, as they are based on the Venn diagrams. The diagram for roles can be considered as graph-theory based, therefore, it is also semantical and conservative. The diagram for negation is conservative, as it uses the sign of negation from logic, but it is not semantical, since it doesn’t represent the semantic of negation, i.e. a user wouldn’t have distinguished negation from another concept if there wasn’t the label of negation on it.

Considering Graphol, it is complete, but there are only arbitrary graphic primitives that would have been indistinguishable if there were no labels. Speaking of labels, Graphol mainly uses natural language names for operations that are understandable for new users and, therefore, conservative.

LGs, the semantically oriented ontology visualization method, developed by us, is complete, mostly semantical and conservative, since it is based on Venn diagrams, graph theory, and Ch. S. Peirce's existential graphs [16]. We use non-semantical primitives for number restrictions and functional roles as their semantical representations are inconvenient, and non-semantical and non-conservative primitive for concept inclusion due to the strong tradition in ontology visualization.

The resulting scores of the considered visualization systems are presented in Table 4.

In the previous section, we proposed several metrics for expert-based evaluation. Properties like completeness and conservativeness are important for visualization systems evaluation, but it ishard to imagine that they would be fully computable. Thus, we propose one more approach to ontology visualization systems evaluation intended to be completely formal. This approach is based on information measuring.

Before defining the information measure, we have to define the formal framework. We propose to consider ontology visualization as a hypergraph. Simple graphs are not suitable for our goals as many ontology visualization systems use edges connecting more than two nodes. A hypergraph can be represented as an incidence matrix, therefore, an ontology visualization can be

Table 1. Expert-based evaluation of VOWL

N	Entity	Complete	Semantical	Conservative
1	Concept		1	1
2	Role		1	1
3	Negation		0	1
4	Conjunction		1	1
5	Disjunction		1	1
6	Existential restriction		0	0
7	Universal restriction		0	0
8	Transitive role		0	0
9	Inverse role		0	0
10	Role hierarchy		0	0
11	Number restriction		0	0
12	Nominal		0	0
13	Functional role		0	0
14	Concept inclusion		0	0
15	Concept equivalence		1	1
		0.8	0.33	0.4

Table 2.Expert-based evaluation of Graphol

N	Entity	Complete	Semantical	Conservative
1	Concept		1	1
2	Role		1	1
3	Negation		0	1
4	Conjunction		0	1
5	Disjunction		0	1
6	Existential restriction		0	1
7	Universal restriction		0	1
8	Transitive role		0	0
9	Inverse role		0	0
10	Role hierarchy		0	0
11	Number restriction		0	1
12	Nominal		0	1
13	Functional role		0	0
14	Concept inclusion		0	0
15	Concept equivalence		0	0
		1	0. 07	0. 6

Table 3. Expert-based evaluation of LGs

N	Entity	Complete	Semantical	Conservative
1	Concept		1	1
2	Role		1	1
3	Negation		1	1
4	Conjunction		1	1
5	Disjunction		1	1
6	Existential restriction		1	1
7	Universal restriction		1	1
8	Transitive role		1	1
9	Inverse role		1	1
10	Role hierarchy		1	1
11	Number restriction		0	1
12	Nominal		1	1
13	Functional role		0	1
14	Concept inclusion		0	0
15	Concept equivalence		1	1
		1	0. 8	0. 93

Table 4. Expert-based comparing of VOWL and LG

	Complete	Semantical	Conservative
VOWL	0.8	0.33	0.4
Graphol	1	0.07	0.6
LGs	1	0.8	0.93

represented as an incidence matrix as well.

Let there is a hypergraph , where – is a set of nodes and E is a set of edges. It is represented with incidence matrix , where

(3)

for undirected graph and

(4)

and

= 2

(5)

Consider the following axiom from the Document Components Ontology (DoCO) [17] as the example

(6)

and its visualization in the Graphol system, see Fig. 3. Here edges denoting

Fig. 3. Visualization of the axiom 1 in Graphol

disjunction connect three nodes: ’graphol.paragraph’, ’graphol.section’ and ’or’, therefore, it is the hypergraph. The incidence matrix for this hypergraph is Table 5. Each node of the diagram corresponds to a row of the matrix and each relation – to a column. As nodes ’graphol.paragraph’, ’graphol.section’ and ’or’ are connected with the edge ’or’, the corresponding cells have value 1. In this research, we ignore the direction of edges for simplicity.

Table 5. The incidence matrix for the axiom 1 in Graphol

	subClassOf	graphol.contains	or
exists	1	1	0
graphol.chapter	1	0	0
graphol.contains	0	1	0
graphol.paragraph	0	0	1
graphol.sections	0	0	1
or	0	1	1

Definition 1. For a given incidence matrix A the set of all values is {a}.

For undirected graph {a} = {0,1}, for directed {a} = {−1,0,1}, for directed graph with loops {a} = {−1, 0, 1, 2} and so on.

Now we define the set of all possible edges :

Definition 2. For a hypergraph H with a given set of nodes X the set of all possible edges .

We are ready to define the information measure for hypergraph complexity estimation by deriving it from the Hartley function [18]

(7)

where is an arbitrary set and b – an arbitrary number. We substitute the number of edges as and the number of all possible edges as b.

Definition 3. For a hypergraph H with a given set of nodes , a given set of edges and a set of corresponding incidence matrix values , the information is following:

(8)

Consider several simple graphs for illustration, see Fig. 4, its information estimation is at the Table 6. As we see, has a more complex structure comparing to H1 and, therefore, its information value is higher. is directed and each directed edge contains less information, therefore, with the same number of edges its information value is lower compared to . Summing up, the information measure satisfies the desired properties.

Table 6. Information estimation of the graph examples


1	3	2	2	0.33
2	3	3	2	0.53
3	3	3	3	0.21

Fig. 4. The graph examples

We provide an example of comparing ontology visualizations with the developed information measure. Unlike expert-based evaluation, where we compared visualization systems itself, for information measure we have to compare visualizations of a particular ontology. We use the DoCO ontology as it is a real ontology, used in different applications, and it contains nontrivial axioms. We visualized some axioms of this ontology in Graphol and Logic Graphs. The list of axioms and their visualizations are in Table 7.

The example of an incidence matrix for Graphol was provided in Table 5. Now consider the example of the incidence matrix for Logic Graphs. The incidence matrix for the axiom 1 in Logic graphs is Table 8.

We compare LGs with Graphol by measuring information of the correspond- ing visualizations for the sample of axioms, presented in Table 7. We selected the axioms according to the following criteria:

· the axioms must have non-trivial form, i.e. contain more than one operation;

· each axiom must have a unique structure, i.e. none two axioms can have the same combination of operations;

· the sample must contain all types of operations: negation, conjunction, disjunction, inclusion, roles with existential and universal restrictions.

The result is in Table 9. As we see, the average information of LGs on this sample is higher, than of Graphol.

Table 7. Visualizations of DoCO in LGs and Graphol

1

2

3

4

5

6

7

8

9

10

Table 8. The incidence matrix for axiom 1 in LGs

Table 9. Comparing information of ontology visualizations

N	LG				Graphol

1	5	6	2	0.52	6	3	2	0.26
2	9	11	2	0.38	10	5	2	0.23
3	5	3	2	0.32	6	3	2	0.26
4	7	4	2	0.29	8	4	2	0.25
5	4	2	2	0.25	4	2	2	0.25
6	2	2	2	0.5	3	2	2	0.33
7	3	2	2	0.33	4	2	2	0.25
8	4	5	2	0.58	4	2	2	0.25
9	7	7	2	0.4	8	4	2	0.25
10	10	11	2	0.35	12	6	2	0.22
				0.39				0.26

As a part of this research, an application has been implemented. It includes a basic ontology visualization tool, a library of formal ontology visualization metrics, and an algorithm for constructing the best visualization according to the selected metrics.

The application is implemented with JavaFX, for graph representation Graph- stream library [19] is used and ontologies are imported with the help of OWL API [20]. Several decisions have been made in the course of development to improve the readability of ontologies. Firstly, in order to reduce visual clutter, labels of edges are shown only when they hover. Secondly, there is an option on the conversion step to remove nodes with a degree less than the specified threshold. A similar option exists in WebVOWL [21], the rationale behind it is that the most important concepts in an ontology often correspond to nodes with the largest degrees. Screenshots of the application are shown in Fig. 5.

Fig. 5. Application menu

The workflow of the application consists of two steps: 1) conversion of an ontology to a graph and 2) constructing the layout for the graph. For both steps, users could either set an algorithm explicitly or let the program choose it automatically based on a selected quality metric. The resulting graph also could be exported to DOT. Data flow in the application is summarized in Fig. 6.

Fig. 6. Data flow in the application

The application includes the library of visualization metrics. For the conversion step, the library includes a special case of the proposed formal metric (8) with = 2. Along with it, there is the graph entropy metric (1) with X being the set of nodes. Two nodes are considered equivalent if they have the same degree. For the layout step, the aesthetic metrics from the field of graph drawing are used, including the number of edge crossings and similarity with shape graph, namely, k-nearest neighbors graph [6].

If a user sets a visualization metric instead of explicit visualization parameters, the application searches for the best visualization algorithm automatically, based on the selected metric. First, the metric is evaluated on the result of each algorithm. Then the algorithm with the best value (the highest or the lowest, depending on the meaning of the metric) of the metric is chosen. Since force - directed algorithms use randomization, they are run multiple times, and the mean value of the metric for all runs is used.

The application provides two ways to evaluate and compare the visualizations produced by other ontology visualization tools. The first way is to simulate the results with the proper algorithms in the application. At the moment, conversion algorithms from visualization tools Ontograf [22] and OWLViz [23] are supported. The second way is to import the result. Some visualization tools, e.g. WebVOWL, support export to DOT, so the application supports import from this format. As per layout, several force-directed algorithms are supported.

As an example, the visualization of the FOAF ontology [24] in the application is provided at Fig. 7.

Fig. 7. Visualization of the FOAF ontology in the application

In this research, we proposed several non-empirical metrics for ontology visualization evaluation and comparing. These metrics are divided into two groups. The first group includes three metrics: completeness, semanticity, and conservativeness. These metrics require expert evaluation and, therefore, they are half-formal. As an example, we compared ontology visualization systems: VOWL Graphol and Logic Graphs.

The second group consists of the completely computable information measure, derived from the Hartley formula, that allows normalized measuring complexity of ontology visualizations, represented as hypergraphs with incidence matrices. As an example, we compared Logic Graphs with Graphol by measuring average information of visualizations of the sample of axioms from the DoCO ontology.

These results can be practically applied for choosing ontology visualization systems in general and regarding a particular ontology. Considering the presented examples, it is recommended to use LGs rather than VOWL in general, as it has higher scores of completeness, semanticity, and conservativeness, and for visualizing the mentioned fragment of the DoCO, as LGs has higher informativeness.

As the implementation of the results, the application for ontology visualization evaluation and comparing has been developed. It includes the basic ontology visualization tool, the library of formal ontology visualization metrics, and the algorithm for constructing the optimal visualization according to the selected metrics. The application supports simulation of several ontology visualization tools and import and export to DOT format.

[1] Antoniazzi F., Viola F.: RDF graph visualization tools: A survey //2018 23rd Conference of Open Innovations Association (FRUCT).IEEE, 25-36 (2018).

[2] Duda ́s, M.,Lohmann, S., Sva ́tek, V., Pavlov, D.: Ontology visualization methods and tools: A survey of the state of the art. The Knowledge Engineering Review, vol 33, (2018).

[3] Steffen, L., Stefan, N. , Florian, H. Thomas, E: Visualizing ontologies with VOWL. Semantic Web. 7. 399-419. (2016).

[4] R. Tamassia, C. Batini, and M. Talamo. An algorithm for automatic layout of entity-relationship diagrams. In C. G. Davis, S. Jajodia, P. A. Ng, and R. T. Yeh, editors, 3rd Int. Conf. on Entity-Relationship Approach (ER’83), pages 421-439. North-Holland, 1983.

[5] Purchase, Helen & Carrington, David & Allder, Jo-Anne. (2002). Empirical Evaluation of Aesthetics-based Graph Layout. Empirical Software Engineering. 7.

[6] Eades, Peter & Hong, Seok-Hee & Klein, Karsten & Nguyen, An. (2015). Shape- Based Quality Metrics for Large Graph Visualization. 502-514.

[7] Meidiana, Amyra, etal.”AQualityMetricforVisualizationofClustersinGraphs.” International Symposium on Graph Drawing and Network Visualization. Springer, Cham, 2019.

[8] Dehmer, Matthias & Mowshowitz, Abbe. (2011). A History of Graph Entropy Measures. Information Sciences. 1. 57-78.

[9] Wu, Jieting & Zhu, Feiyu & Liu, Xin & Yu, Hongfeng. (2018). An Information- Theoretic Framework for Evaluating Edge Bundling Visualization. Entropy. 20. 625.

[10] Deborah, L., Frank, V.,, et al. Owl web ontology language overview. W3C recom- mendation, 10(10):2004,(2004)

[11] Baader, F., Calvanese, D. , McGuinness, D. ,Nardi, D. and Patel-Schneider, P.: The Description Logic Handbook: Theory, Implementation, and Applications. Cam- bridge University Press, second edition, (2007).

[12] Graphol official website, Http://www.obdasystems.com/graphol. Last accessed 4 jul 2020.

[13] Ruskey, F.,Weston, M.:A Survey of Venn Diagrams. The Electronic Journal of Combinatorics, (2005)

[14] VOWL official website, Http://vowl.visualdataweb.org/. Last accessed 4 jul 2020.

[15] Mouromtsev, D., Baimuratov, I.: Logic Graphs: A complete visualization method for logical languages based on Ch. S. Peirce’s existential graphs. CEUR Workshop Proceedings, Vol. 2344, 2019, pp. 1-10.

[16] Sowa, J.: Peirce’s tutorial on existential graphs. Semiotica, 186, 2011, pp. 347-394.

[17] Ontology DoCO, https://sparontologies.github.io/doco/current/doco.html. Last accessed 4 jul 2020.

[18] R. V. L. Hartley, ”Transmission of information,” in The Bell System Technical Journal, vol. 7, no. 3, pp. 535-563, July 1928.

[19] Dutot A. et al. Graphstream: A tool for bridging the gap between complex systems and dynamic graphs. – 2007.

[20] Horridge, M. & Bechhofer, S. (2011). The owl api: A java api for owl ontologies. Semantic web, 2(1), 11-21.

[21] Lohmann, Steffen, et al. ”WebVOWL: Web-based visualization of ontologies.” International Conference on Knowledge Engineering and Knowledge Management. Springer, Cham, 2014.

[22] Falconer S. Ontograf protege plugin //Place: Available at: http://protegewiki. stanford. edu/wiki/OntoGraf [Accessed: 21/09/2020]. – 2010.

[23] Horridge M. OWLViz //Available on: http://protegewiki. stanford.edu/wiki/OWLViz. – 2010.

[24] Ontology FOAF, http://xmlns.com/foaf/spec/. Last accessed 4 jul 2020 .

subClassOf

negation 1

negation 2

conjunction

negation 3

conjunction