Contents |
Hallmarks of the scientific method are reproducibility and testability. It is crucial that experiments which test hypotheses can be performed (especially in other laboratories than they came from) and can be extended to test their ranges of validity. Computational neuroscience models (CNS models) are implicitly a collection of hypotheses of the function of (usually a subset of) the nervous system across spatial scales ranging from molecules to animal behavior. Most publications of CNS models describe how a computer program which faithfully imbued the collection of hypotheses behaves. The complexity of computational neuroscience models make it difficult to reproduce a model de novo from the publication that introduces it. Missing information and typographical errors are among the most common reasons why an investigator may fail to successfully reproduce a model.
The software implementation of a model can also become the building block(s) for modified versions of that model by other researchers, such as simplifying it for use in a larger spatial scale, including the effect of a variable (e.g. temperature on ion channel kinetics) that was absent in the original model, or extending the range of applicability of the model.
The media which stores computer code also obeys the laws of entropy; hard drives crash and old computers are recycled. Thus it is an important step to get the computer code into a database or onto some other maintained system to provide long-term preservation.
An analogy exists between the sharing of sequence information in sequence databases in traditional bioinformatics (the field comprised of sequence oriented databases, tools to search for sequences in one or more databases, compare with statistical software, and model the molecules related to the sequences) and the sharing of models in the CNS subset of neuroinformatics (the field comprised of storing, searching, comparing, and modeling diverse neuroscience data). Both types of sharing are essential to support future research in their respective fields.
Efforts to build databases to share CNS models have been underway since circa 1996 (Peterson et al. 1996, Mirsky et al. 1998, Usui 2003, Migliore et al. 2003, Sivakumaran et al. 2003, Hines et al. 2004, Le Novere et al. 2006) and also an alternative approach for sharing models in a napster-like method has appeared (Cannon et al. 2002).
Sites that support the sharing of CNS models:
ModelDB and Visiome started out with particular domains of interest, olfaction and vision, respectively, and have subsequently expanded to broader categories of neuroscience and biological mathematical models. CellML has always been intended as a generic way to represent mathematical models, although, as the name implies, it was initially developed by groups with an interest in representing biological models, and its use in practice has been for biological mathematical models. The COR converts CellML models into executable models on the windows platform. BioModels contains diverse models: models of cell cycle, bacterial growth, plant photosynthesis, basic metabolism etc. The neuroscience models are a minority of their entries. The BioModels database contains models stored as SBML files. SBML is currently the most widely supported model definition standard, with more than 100 compatible applications listed on the SBML home page. The DOQCS is focused on the signaling pathways of the glutamate synapse.
It is expected that the above resources will continue to grow and have increased functionality and interoperability with journals and the National Library of Medicine (PubMed). One important area that is the focus of much current work is the construction of "simulator independent" descriptions of models. The objective is to separate the model from a particular software implementation which then allows modelers to use models from other simulators within their preferred simulator. This work is based in XML and can be further examined at the following:
SBML and CellML (see their descriptions above) have many citations (36 hits for sbml and 12 hits for CellML in pubmed on April 3rd, 2007, see references for links) and a conversion program between them. Collaborations between the SBML and CellML groups have created prototype standards for sharing computational models (see Box 3 in Le Novere et al. 2005). NeuroML and it's morphology specific subset MorphML are tackling the complexity of the distribution of channels and receptors within neuronal morphology respectively (see Crook et al. 2007 and below links in references). The NeuroML validator allows automatic conversion of NeuroML files to NEURON and GENESIS formats. [neuroConstruct], a graphical application for the creation, visualization and analysis of network models, has just appeared (Gleeson et al. 2007). neuroConstruct's models are based on the NeuroML standards. [GENESIS] and [NEURON] developers have plans to develop and adopt XML descriptions of channel, cell, and network models. ChannelDB is a prototype of a Java-based NeuroML to GENESIS translator and database interface for ionic conductance models. More recent developments in this representation for conductance models have been carried out as ChannelML (a subset of NeuroML), and within the Neuroconstruct project.
See references for papers on XML related to CNS models.
Another approach to interoperability simply requires that the model be able to communicate with other models during the simulation (run-time interoperability, see Cannon et al. 2007 for a review of interoperability of neuroscience models, and Diesmann and Gewaltig 2002 for a run-time interoperability example).
The author would like to thank (but not hold responsible for errors) the kind help of Dave Beeman, Hugo Cornelis, Andrew Miller, Tommy Yu, and Nicolas Le Novere.
Related to XML and model sharing
Internal references
GENESIS, NEST (Neural Simulation Tool), NEURON Simulation Environment, and a List of more simulation environments