UCDs vs ontologies?
Rob Seaman
seaman at noao.edu
Wed Jun 1 08:32:36 PDT 2005
On Jun 1, 2005, at 6:54 AM, Pierre Didelon wrote:
> But do/must UCD be the solution to (all?) natural langage parsing?
It would be premature to begin the work of the board by willy-nilly
reassigning identifiers such as "em". It seems likely that we will
need to be able to express the concepts of both "electromagnetism"
and "emission". If there is emission, there is absorption. And
sometimes we may want to express the combination of "electromagnetic
emission" at the same time.
On the other hand, we're told that the role of UCDs is distinct from
that of ontologies. An ontology is an (attempt) at expressing the
complete range of some knowledge domain. Astronomy is a big subject
- its ontology will be big. Perhaps by analogy we can view an
ontology as the unabridged dictionary for some subject, whereas UCDs
are simply one way to build a glossary for a specific purpose.
Glossaries are often small enough to be appended to a brief document.
Personally, I think the VO community will need to develop several
separate ontologies over time as well as several separate glossaries
of UCDs or UCD-like constructs. It is not obvious that a glossary of
UCDs for tabular convenience is equivalent to a glossary of UCDs for
VOEvent convenience. An ontology can afford to be large and unwieldy
to reach its goal of being complete and accurate. A UCD style
glossary, on the other hand, will eventually reach an optimum size.
Its utility will pass a point of diminishing returns. Too much
precision engenders confusion. The availability of too many options
results in overlapping shades of meaning.
I gather the current list of UCDs was generated by looking at actual
tables in the literature. This is just how the unabridged OED was
created from words sieved from millions of quotations. Just like a
dictionary, the work of maintaining the list of UCDs will continue
indefinitely as new tabular usage is coined.
I would suggest that the creation of this new list of UCD-like
entities to describe astronomical "concepts" is fundamentally a
different exercise. We may not be trying to generate a complete
ontology with all interrelationships clearly drawn between all
concepts, but we are trying to be complete in the sense of not
leaving any gaps in the web of concepts. "Star" and "galaxy" will
clearly make the final cut. "Star.white_dwarf" and "galaxy.spiral"
most likely, too. But it won't take many levels to exhaust the
utility of compiling such a list. I expect the final list to have
hundreds of entries, not tens of thousands.
One final point. The nature of this board is to participate in the
process of certifying an official list of terms. I think the true
utility of both glossaries and dictionaries will be achieved when
facilities are available for creating and maintaining *unofficial*
lists. For VOEvent, for instance, it seems likely that each project
publishing events will adopt its own glossary pertinent to its own
instrumentation and observations. We should support these activities
and provide a framework for project specific glossaries. They will
spring into existence whether or not we do so. At least if we
support the creation of project specific glossaries, we can have some
say in controlling a common semantic structure and a standardized
distribution mechanism. This might also naturally lead to the next
step of layering UCD glossaries on top of our emerging ontology
(ies). A glossary, after all, is nothing but a well chosen list of
words out of the dictionary. It is the dictionary that provides
etymology, synonyms and antonyms, classification by part of speech,
tenses and gender, pronunciation, ...
Sorry for the cross-posting. If we can't restrain ourselves from
generating all these mailing lists, I'm not sure what hope we have
for a coherent set of UCD lists :-)
Rob Seaman
NOAO
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.ivoa.net/pipermail/ucd-tech/attachments/20050601/ccb525ef/attachment-0001.html>
More information about the ucd-tech
mailing list