An experimental study of information content measurement of gene ontology terms
- 152 Downloads
The gene ontology (GO) is commonly used to store and organize information about functions of biological molecules through a controlled vocabulary of terms (GO Terms). GO Terms refer to biological concepts through the annotation process. There exist many different annotation processes used by researchers. Each term has a different specificity that is formally measured by the information content (IC). Both the structure of GO and the corpora of annotations are continuously changing following novel experimental findings. This work focuses on how changes of annotations affect the IC of terms. The study confirms that statistically significant differences among annotation corpus of different years on each species occur. These results convey that annotation corpora changes have a high impact on IC.
KeywordsInformation content Gene ontology Semantic similarity
This work has been partially founded by project PON Smartcities DICET-INMOTO-ORCHESTRA PON04a2 D funded by MIUR.
- 7.Hartung M, Kirsten T, Rahm E (2008) Analyzing the evolution of life science ontologies and mappings. In: Data Integration in the Life Sciences. Springer, Heidelberg, pp 11–27Google Scholar
- 12.Resnik P (1995) Using information content to evaluate semantic similarity in a taxonomy. In: IJCAI, 1995, pp 448–453. [Online]. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.55.5277