Information Content (IC) models

For a comprehensive review of the literature on IC models, we refer the reader to the surveys by Lastra-Díaz and García-Serrano [1,2] and the recent experimental survey [3].

[1] J.J. Lastra-Díaz, A. García-Serrano, A new family of information content models with an experimental survey on WordNet, Knowledge-Based Systems. 89 (2015) 509–526.

[2] J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement

[3] J.J. Lastra-Díaz, J. Goikoetxea, M. Hadj Taieb, A. García-Serrano, M. Ben Aouicha, E. Agirre, A reproducible survey on word embeddings and ontology-based methods for word similarity: linear combinations outperform the state of the art, Engineering Applications of Artificial Intelligence. 85 (2019) 645–665.

Corpus-based IC models

The table below lists all corpus-based IC models implemented by HESML. Any of these corpus-based IC models can be obtained by calling the ICModelsFactory.getCorpusICmodel() method.

CorpusBasedICModelType enum | Reference
Resnik | P. Resnik, Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language, Journal of Artificial Intelligence Research. 11 (1999) 95–130.
CondProbCorpus | J.J. Lastra-Díaz, A. García-Serrano, A new family of information content models with an experimental survey on WordNet, Knowledge-Based Systems. 89 (2015) 509–526.
CondProbRefCorpus | J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
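
As a rough illustration of what a corpus-based IC model computes, the following Java sketch applies the classic Resnik formulation, IC(c) = -log p(c), to a toy tree taxonomy. It is not HESML code and does not call ICModelsFactory; the class ResnikIcSketch, its methods, the concept names, and the occurrence counts are all hypothetical and chosen only for illustration. It also assumes each concept has a single parent, which does not hold for WordNet in general.

```java
import java.util.HashMap;
import java.util.Map;

// Toy sketch of a Resnik-style corpus-based IC; hypothetical names, not HESML code.
final class ResnikIcSketch {

    // Adds each concept's own corpus count to itself and to all of its ancestors.
    // Assumption: the taxonomy is a tree, encoded as a child -> parent map.
    static Map<String, Double> propagate(Map<String, String> parent,
                                         Map<String, Double> ownCounts) {
        Map<String, Double> total = new HashMap<>();
        for (Map.Entry<String, Double> e : ownCounts.entrySet()) {
            String node = e.getKey();
            while (node != null) {
                total.merge(node, e.getValue(), Double::sum);
                node = parent.get(node); // null once the root has been processed
            }
        }
        return total;
    }

    // IC(c) = -log p(c) with p(c) = conceptCount / rootCount,
    // written as log(rootCount / conceptCount) so the root yields exactly 0.
    static double ic(double conceptCount, double rootCount) {
        return Math.log(rootCount / conceptCount);
    }

    public static void main(String[] args) {
        Map<String, String> parent = new HashMap<>();
        parent.put("animal", "entity");
        parent.put("plant", "entity");
        parent.put("dog", "animal");
        parent.put("cat", "animal");

        // Made-up occurrence counts for the leaf concepts.
        Map<String, Double> own = new HashMap<>();
        own.put("dog", 6.0);
        own.put("cat", 2.0);
        own.put("plant", 8.0);

        Map<String, Double> total = propagate(parent, own);
        double root = total.get("entity");
        for (String c : new String[] {"entity", "animal", "dog"}) {
            System.out.printf("%s: count=%.0f IC=%.3f%n", c, total.get(c), ic(total.get(c), root));
        }
    }
}
```

In this toy setting the root concept receives every count, so its probability is 1 and its IC is 0, while rarer and more specific concepts receive higher IC values.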

Intrinsic IC models

The table below lists all intrinsic IC models implemented by HESML. Any of them can be obtained by calling the ICModelsFactory.getIntrinsicICmodel() method.

IntrinsicICModelType enum | Reference
Seco | N. Seco, T. Veale, J. Hayes, An intrinsic information content metric for semantic similarity in WordNet, in: R. López de Mántaras, L. Saitta (Eds.), Proceedings of the 16th European Conference on Artificial Intelligence (ECAI), IOS Press, Valencia, Spain, 2004: pp. 1089–1094.
Cai | Y. Cai, Q. Zhang, W. Lu, X. Che, A hybrid approach for measuring semantic similarity based on IC-weighted path distance in WordNet, J. Intell. Inf. Syst. (2017) 1–25.
Blanchard | E. Blanchard, M. Harzallah, P. Kuntz, A generic framework for comparing semantic similarities on a subsumption hierarchy, in: M. Ghallab, C.D. Spyropoulos, N. Fakotakis, N. Avouris (Eds.), Proceedings of the ECAI, IOS Press, 2008: pp. 20–24.
Zhou | Z. Zhou, Y. Wang, J. Gu, A new model of information content for semantic similarity in WordNet, in: Proc. of the Second International Conference on Future Generation Communication and Networking Symposia (FGCNS’08), IEEE, 2008: pp. 85–89.
Sanchez2011 | D. Sánchez, M. Batet, D. Isern, Ontology-based information content computation, Knowledge-Based Systems. 24 (2011) 297–303.
Sanchez2012 | D. Sánchez, M. Batet, A new model to compute the information content of concepts from taxonomic knowledge, International Journal on Semantic Web and Information Systems (IJSWIS). 8 (2012) 34–50.
Harispe | S. Harispe, S. Ranwez, S. Janaqi, J. Montmain, The Semantic Measures Library: Assessing Semantic Similarity from Knowledge Representation Analysis, in: E. Métais, M. Roche, M. Teisseire (Eds.), Proc. of the 19th International Conference on Applications of Natural Language to Information Systems (NLDB 2014), Springer, Montpellier, France, 2014: pp. 254–257.
Meng | L. Meng, J. Gu, Z. Zhou, A new model of information content based on concept’s topology for measuring semantic similarity in WordNet, International Journal of Grid and Distributed Computing. 5 (2012) 81–93.
Yuan | Q. Yuan, Z. Yu, K. Wang, A New Model of Information Content for Measuring the Semantic Similarity between Concepts, in: Proc. of the International Conference on Cloud Computing and Big Data (CloudCom-Asia 2013), IEEE Computer Society, 2013: pp. 141–146.
HadjTaieb | M.A. Hadj Taieb, M. Ben Aouicha, A. Ben Hamadou, Computing semantic relatedness using Wikipedia features, Knowledge-Based Systems. 50 (2013) 260–278.
HadjTaiebHypoValue | M.A. Hadj Taieb, M. Ben Aouicha, A. Ben Hamadou, Ontology-based approach for measuring semantic similarity, Eng. Appl. Artif. Intell. 36 (2014) 238–261.
CondProbHyponyms | J.J. Lastra-Díaz, A. García-Serrano, A new family of information content models with an experimental survey on WordNet, Knowledge-Based Systems. 89 (2015) 509–526.
CondProbUniform | J.J. Lastra-Díaz, A. García-Serrano, A new family of information content models with an experimental survey on WordNet, Knowledge-Based Systems. 89 (2015) 509–526.
CondProbLeaves | J.J. Lastra-Díaz, A. García-Serrano, A new family of information content models with an experimental survey on WordNet, Knowledge-Based Systems. 89 (2015) 509–526.
CondProbCosine | J.J. Lastra-Díaz, A. García-Serrano, A new family of information content models with an experimental survey on WordNet, Knowledge-Based Systems. 89 (2015) 509–526.
CondProbLogistic | J.J. Lastra-Díaz, A. García-Serrano, A new family of information content models with an experimental survey on WordNet, Knowledge-Based Systems. 89 (2015) 509–526.
CondProbLogisticK10 | J.J. Lastra-Díaz, A. García-Serrano, A new family of information content models with an experimental survey on WordNet, Knowledge-Based Systems. 89 (2015) 509–526.
CondProbLogisticK12 | J.J. Lastra-Díaz, A. García-Serrano, A new family of information content models with an experimental survey on WordNet, Knowledge-Based Systems. 89 (2015) 509–526.
CondProbRefHyponyms | J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
CondProbRefUniform | J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
CondProbRefLeaves | J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
CondProbRefCosine | J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
CondProbRefLogistic | J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
CondProbRefCosineLeaves | J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
CondProbRefLogisticLeaves | J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
CondProbRefLeavesSubsumers | J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
CondProbRefLeavesSubsumersRatio | J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
CondProbRefSubsumedLeavesRatio | J.J. Lastra-Díaz, A. García-Serrano, A refinement of the well-founded Information Content models with a very detailed experimental survey on WordNet, NLP and IR Research Group. ETSI Informática. Universidad Nacional de Educación a Distancia (UNED), 2016. http://e-spacio.uned.es/fez/view/bibliuned:DptoLSI-ETSI-Informes-Jlastra-refinement
Adhikari | A. Adhikari, S. Singh, A. Dutta, B. Dutta, A novel information theoretic approach for finding semantic similarity in WordNet, in: Proc. of IEEE International Technical Conference (TENCON-2015), IEEE, Macau, China, 2015: pp. 1–6.
AouichaTaiebAsGIC | M.B. Aouicha, M.A.H. Taieb, Computing semantic similarity between biomedical concepts using new information content approach, J. Biomed. Inform. 59 (2016) 258–275.
AICAouichaTaiebHamadu2016 | M. Ben Aouicha, M.A.H. Taieb, A. Ben Hamadou, Taxonomy-based information content and wordnet-wiktionary-wikipedia glosses for semantic relatedness, Appl Intell. (2016) 1–37.
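
To give a feel for how an intrinsic IC model derives values from the taxonomy alone, the Java sketch below computes the Seco et al. (2004) formula, IC(c) = 1 - log(hypo(c) + 1) / log(N), on a toy taxonomy, where hypo(c) is the number of hyponyms of c and N is the total number of concepts. It is not HESML code and does not call ICModelsFactory; the class SecoIcSketch, its methods, and the concept names are hypothetical and serve only as an illustration.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Toy sketch of the Seco et al. (2004) intrinsic IC; hypothetical names, not HESML code.
final class SecoIcSketch {

    // Counts the distinct proper descendants (hyponyms) of a node.
    // The taxonomy is encoded as a parent -> children map; a set avoids
    // double counting when a concept is reachable through several paths.
    static int countHyponyms(Map<String, List<String>> children, String node) {
        Set<String> seen = new HashSet<>();
        collect(children, node, seen);
        return seen.size();
    }

    private static void collect(Map<String, List<String>> children,
                                String node, Set<String> seen) {
        for (String child : children.getOrDefault(node, Collections.<String>emptyList())) {
            if (seen.add(child)) {
                collect(children, child, seen);
            }
        }
    }

    // IC(c) = 1 - log(hypo(c) + 1) / log(totalConcepts)
    static double secoIc(int hyponyms, int totalConcepts) {
        return 1.0 - Math.log(hyponyms + 1) / Math.log(totalConcepts);
    }

    public static void main(String[] args) {
        Map<String, List<String>> children = new HashMap<>();
        children.put("entity", Arrays.asList("animal", "plant"));
        children.put("animal", Arrays.asList("dog", "cat"));
        int totalConcepts = 5; // entity, animal, plant, dog, cat

        for (String c : Arrays.asList("entity", "animal", "dog")) {
            int hypo = countHyponyms(children, c);
            System.out.printf("%s: hypo=%d IC=%.3f%n", c, hypo, secoIc(hypo, totalConcepts));
        }
    }
}
```

With this formula the root, which subsumes every other concept, gets an IC of 0, and every leaf gets an IC of 1; no corpus statistics are needed.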
