Methods Inf Med 2005; 44(04): 498-507
DOI: 10.1055/s-0038-1634000
Original Article
Schattauer GmbH

A Terminological and Ontological Analysis of the NCI Thesaurus

W. Ceusters
1   European Centre for Ontological Research, Saarland University, Saarbrücken, Germany
,
B. Smith
2   Department of Philosophy, University at Buffalo, New York, USA
3   Institute for Formal Ontology and Medical Information Science, Saarland University, Saarbrücken, Germany
,
L. Goldberg
3   Institute for Formal Ontology and Medical Information Science, Saarland University, Saarbrücken, Germany
4   School of Dental Medicine, University at Buffalo, New York, USA
› Author Affiliations
Further Information

Publication History

Publication Date:
06 February 2018 (online)

Summary

Objective: The National Cancer Institute Thesaurus is described by its authors as “a biomedical vocabulary that provides consistent, unambiguous codes and definitions for concepts used in cancer research” and which “exhibits ontology-like properties in its construction and use”. We performed a qualitative analysis of the Thesaurus in order to assess its conformity with principles of good practice in terminology and ontology design.

Materials and Methods: We used both the on-line browsable version of the Thesaurus and its OWL-representation (version 04.08b, released on August 2, 2004), measuring each in light of the requirements put forward in relevant ISO terminology standards and in light of ontological principles advanced in the recent literature.

Results: We found many mistakes and inconsistencies with respect to the term-formation principles used, the underlying knowledge representation system, and missing or inappropriately assigned verbal and formal definitions.

Conclusion: Version 04.08b of the NCI Thesaurus suffers from the same broad range of problems that have been observed in other biomedical terminologies. For its further development, we recommend the use of a more principled approach that allows the Thesaurus to be tested not just for internal consistency but also for its degree of correspondence to that part of reality which it is designed to represent.

 
  • References

  • 1 Cantor MN, Lussier YA. Putting data integration into practice: using biomedical terminologies to add structure to existing data sources. In Musen MA. editor AMIA 2003 Proceedings of AMIA 2003 Annual Symposium; Nov 8-12, 2003. Washington D.C., USA: AMIA; 2003: 125-9.
  • 2 Kumar A, Smith B. The Unified Medical Language System and the Gene Ontology. KI 2003: Advances in Artificial Intelligence (Lecture Notes in Artificial Intelligence 2821) 2003: 135-48.
  • 3 Smith B, Williams J, Schulze-Kremer S. The ontology of the gene ontology. In Musen MA. editor AMIA 2003. Proceedings of AMIA 2003 Annual Symposium; Nov 8-12, 2003. Washington D.C., USA: AMIA; 2003: 609-13.
  • 4 Grenon P, Smith B, Goldberg L. Biodynamic ontology: Applying BFO in the Biomedical Domain. In Pisanelli DM. (ed). Ontologies in Medicine Proceedings of the Workshop on Medical Ontologies, Rome, October 2003 IOS Press, Studies in Health Technology and Informatics 2004; 102: 20-38.
  • 5 Ceusters W, Smith B, Kumar A, Dhaen C. Ontology- Based Error Detection in SNOMED-CT®. In Fieschi M, Coiera E, Li Y-CJ. editors MEDINFO 2004. Proceedings of the 11th World Congress on Medical Informatics; Sep 7-11; 2004. San Francisco, CA, USA. Amsterdam: IOS Press; 2004: 482-6.
  • 6 Smith B, Rosse C. The role of foundational relations in the alignment of biomedical ontologies. In Fieschi M, Coiera E, Li Y-CJ. editors MEDINFO 2004. Proceedings of the 11th World Congress on Medical Informatics; Sep 7-11, 2004. San Francisco, CA, USA. Amsterdam: IOS Press; 2004: 444-8.
  • 7 Ceusters W, Smith B, Kumar A, Dhaen C. Mistakes in Medical Ontologies: Where Do They Come From and How Can They Be Detected?. In Pisanelli DM. (ed). Ontologies in Medicine Proceedings of the Workshop on Medical Ontologies, Rome, October 2003 IOS Press, Studies in Health Technology and Informatics 2004; 102: 145-64.
  • 8 Kumar A, Schulze-Kremer S, Smith B. Revising the UMLS Semantic Network. In Fieschi M, Coiera E, Li Y-CJ. editors MEDINFO 2004. Proceedings of the 11th World Congress on Medical Informatics; Sep 7-11, 2004. San Francisco, CA, USA. Amsterdam: IOS Press; 2004: 1700-4.
  • 9 de Coronado S, Haber MW, Sioutos N, Tuttle MS, Wright LW. NCI Thesaurus: Using Science-based Terminology to Integrate Cancer Research Results. In Fieschi M, Coiera E, Li Y-CJ. editors MEDINFO 2004. Proceedings of the 11th World Congress on Medical Informatics; Sep 7-11, 2004. San Francisco, CA, USA. Amsterdam: IOS Press; 2004: 33-7.
  • 10 Open Biological Ontologies http://obo.sourceforge.net/ Last visited 2005, Jan 24
  • 11 National Cancer Institute, Office of Communications, Center for Bioinformatics. NCI Terminology browser ftp://ftp1.nci.nih.gov/pub/cacore/EVS/ Last visited 2005, Jan 18
  • 12 National Cancer Institute, Office of Communications, Center for Bioinformatics. NCI Terminology browser http://nciterms.nci.nih.gov/NCIBrowser/Startup.do Last visited 2005, Jan 18
  • 13 Laboratory for Applied Ontology. DOLCE: a Descriptive Ontology for Linguistic and Cognitive Engineering http://www.loa-cnr.it/DOLCE.html Last visited Jan 18 2005
  • 14 Ceusters W, Smith B. Ontology and Medical Terminology: why Descriptions Logics are not enough. Proceedings of the conference Towards an Electronic Patient Record (TEPR 2003). San Antonio; May 10-14. 2003 (electronic publication)
  • 15 W3C. OWL Web Ontology Language Reference. Recommendation February 10, 2004 http://www.w3.org/TR/owl-ref/ Last visited Jan 18, 2005
  • 16 Gamper J, Nejdl W, Wolpers M. Combining Ontologies and Terminologies in Information Systems. In: Proc. 5th International Congress on Terminology and Knowledge Engineering, Innsbruck, Austria
  • 17 Wielemaker J. Native Preemptive Threads in SWIProlog, in Catuscia Palamidessi (ed.) Practical Aspects of Declarative Languages. Springer Verlag, Berlin, Germany: 2003: 331-45.
  • 18 Wielemaker J. Triple20: an RDF triple viewer and editor http://www.swi-prolog.org/packages/Triple20/Triple20.html Last visited Jan 18, 2005
  • 19 Golbeck J, Fragoso G, Hartel F, Hendler J, Oberthaler J, Parsia B. The National Cancer Institute’s Thesaurus and Ontology. Journal of Web Semantics 2003; 1 (01) 75-80. http://www.mindswap.org/papers/WebSemantics-NCI.pdf
  • 20 W3C. Resource Description Framework (RDF): Concepts and Abstract Syntax; Recommendation February 10, 2004 http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/ Last visited Jan 18, 2005
  • 21 College of American Pathologists SNOMED Clinical Terms Consultation Document; Requirements Analysis. Version 10, Oct 12 2000
  • 22 Swartz N. Use and Mention http://www.sfu.ca/philosophy/swartz/use&mention.htm Last visited Jan 24, 2005
  • 23 de Coronado S, Fragoso G. Enterprise Vocabulary Development in Protege/OWL: Workflow and Concept History Requirements. Expanded Abstract for Protégé Workshop, Jul 6-9, 2004 http://protege.stanford.edu/conference/2004/abstracts/DeCoronado.pdf
  • 24 Smith B, Williams J, Schulze-Kremer S. The ontology of the gene ontology. In Musen MA. editor AMIA 2003. Proceedings of AMIA 2003 Annual Symposium; Nov 8-12, 2003. Washington D.C., USA: AMIA; 2003: 609-13.
  • 25 National Cancer Institute, Office of Communications, Center for Bioinformatics NCI Thesaurus Semantics ftp://ftp1.nci.nih.gov/pub/cacoreEVS/ ThesaurusSemantics/ Last visited Jan 18, 2005
  • 26 Hartel F, Warzel DB, Covitz P. OWL/RDF/LSID Utilization in NCI Cancer Research Infrastructure. W3C Workshop on Semantic Web for Life Sciences, October 27-28, 2004. Cambridge, Massachusetts, USA:
  • 27 Hahn U, Schulz S. Towards a broad-coverage biomedical ontology based on description logics. Pac Symp Biocomput 2003: 577-88.
  • 28 Cimino J. Auditing the Unified Medical Language System with Semantic Methods. J Am Med Inform Assoc 1998; 5 (01) 41-51.
  • 29 Rosse C, Mejino Jr JL. A reference ontology for biomedical informatics: the Foundational Model of Anatomy. J Biomed Inform 2003; 36 (06) 478-500.
  • 30 Spyns P, De Bo J. Ontologies, a revamped crossdisciplinary buzz-word or a truly promising interdisciplinary research topic? STAR Lab Technical Report STAR. 2004-20.
  • 31 Ogden CK, Richards IA. The Meaning of Meaning. London: 1923
  • 32 Smith B. Beyond Concepts: Ontology as Reality Representation. In Varzi AC, Vieu L. editors FOIS 2004. Proceedings of. The International Conference on Formal Ontology and Information Systems; Nov 4–6, 2004, Turin, Italy. Amsterdam: IOS Press; 2004: 73-84.
  • 33 Bodenreider O, Smith B, Burgun A. The Ontology- Epistemology Divide: A Case Study in Medical Terminology. In Varzi AC, Vieu L. editors FOIS 2004. Proceedings of The International Conference on Formal Ontology and Information Systems; Nov 4-6, 2004. Turin, Italy. Amsterdam: IOS Press; 2004: 185-95.
  • 34 Fischer DH. Converting a Thesaurus to OWL: Notes on the Paper “The National Cancer Institute’s Thesaurus and Ontology” http://www.ipsi.fraunhofer.de/orion/pubFulltexts/NCIReview18Feb04.pdf
  • 35 Schneider L, Cunningham J. Ontological Foundations of Natural Language Communication in Multiagent Systems. IFOMIS Report ISSN 1611-4019.