[SimDB] theory plans

From: Gerard <gerard.lemson-at-mpe.mpg.de>
Date: Thu, 15 May 2008 10:31:27 +0200

 

Dear colleagues

We have split up the former SNAP effort of the theory interest group in two separate ones:
SimDB(=Simulation Database) and SimDAP(=Simulation Data Access Protocol).

This mail deals with SimDB only:
what it is supposed to be, what its current state is, and what further work is needed and its organisation.

  1. SimDB is a specification for a Simulation (meta-)Database (could be called Simulation Registry, -Portal)
  2. SimDB is an online service offering query capabilities to a database containing meta data describing results of simulations and their post-processing as well as about the codes used in these algorithms. Currently the simulations are still supposed to be those that produce a representation of 3+1D space, (possibly reduced spatial dimentsions through assumptions of symmetry). This is open for discussion.
  3. A SimDB also contains information about web services giving access to the simulation results themselves. The more detailed specification of such services is the goal of the SimDAP-specification.
  4. SimDB is based on a (logical) data model, fully specified in UML.
  5. From the UML data model we derive physical models for use in their respective contexts:
    • The "public tables" of a relational data model, for implementing the database in a RDBM system so that SQL (c.q. ADQL) queries can be easily implemented.
    • XML schema, defining valid XML documents containing SimDB meta data descriptions
    • UTYPEs for the elements of the model.
  6. We present XSLT scripts that derive these physical models directly from the UML model according to predefined mapping rules.
  7. We also derive Java classes with JPA and JAXB annotations to make it easy to implement a SimDB from the specification.
  8. We suggest an implementation path to transform an existing relational database to SimDB.

We think this effort is evolved far enought that it can be moved onto the recommendation track.
There some issues have to be resolved that require input from a number of working groups, among which are:

We suggest forming a focus group to tackle these and other issues. This group should contain members from the mentioned WGs, together with the current developers of SimDB ("members" from the TIG).

It is unclear what the formal organisation should be, as there is not a single WG that could most obviously be given responsibility for the further development of this standard and the theory INTEREST group can not move a working draft through the recommendation track. Might it be possible for cross-WG focus groups like the one proposed here to get such responsibility? Or should intereste groups get this responsibility after all? Note that similar questions may come up based on the corss-WG focus group on UTYPE proposed by Mireille Louys.

We expect to discuss all these issues in Trieste, but would like to get input on this from now on. Could this discussion be held on the theory group's mailing list with the subject as above?

Thanks and best regards

Gerard Lemson
for Herve Wozniak, Mirelle Louys and members of the SNAP "tiger team".

PS
We have seen that our approach using UML as source for scripts producing other representations of the model is working, viable and completely independent of theory. We think it can be of use to other data modelling efforts and suggest that the DM working group could start efforts to come up with a set of META-specifications on:

Received on 2008-05-15Z10:31:07