Differences between revisions 13 and 14
Revision 13 as of 2006-09-20 00:16:35
Size: 4815
Editor: KeiichiroOno
Comment:
Revision 14 as of 2006-09-21 00:18:48
Size: 5027
Editor: KeiichiroOno
Comment:
Deletions are marked like this. Additions are marked like this.
Line 73: Line 73:
==== How to use ==== ==== User Interface ====
The following is a temporary GUI for the new classes. This may be changed.
Line 75: Line 77:
   * http://chianti.ucsd.edu/kono/images/onto_server_status.png
Line 76: Line 79:
   * http://chianti.ucsd.edu/kono/images/annotation_import.png

RFC Name : ...

Editor(s): ...

TableOfContents([2])

About this document

This is an official Request for Comment (RFC) for Add your text here.

For details on RFCs in general, check out the [http://www.answers.com/main/ntquery?method=4&dsid=2222&dekey=Request+for+Comments&gwp=8&curtab=2222_1&linktext=Request%20for%20Comments Wikipedia Entry: Request for Comments (RFCs)]

Status

7/20/2006: Open for public comment.

How to Comment

To view/add comments, click on any of 'Comment' links below. By adding your ideas to the Wiki directly, we can more easily organize everyone's ideas, and keep clear records. Be sure to include today's date and your name for each comment. Here is an example to get things started: ["/Comment"].

Try to keep your comments as concrete and constructive as possible. For example, if you find a part of the RFC makes no sense, please say so, but don't stop there. Take the extra step and propose alternatives.

Proposal

BioDataServer class was used to import Ontologies, annotations, and synonyms. Basically, the constructor takes manifest file location and load data from individual data sources (annotation files, ontology file, and synonym file) specified in the manifest file. Current problems are the following:

  1. File format used in the manifest file is out-of-date.
  2. New file formats (OBO and Gene Association) are converted into old format before loading.
  3. Many entries in the new file formats are lost in the file format conversion process.
  4. The imported ontologies are stored in a huge map, not in a DAG which is the original data structure of GO.
  5. Because GO terms are imported as a huge map, it makes no sense for many biologists.
  6. Name mapping service is not sophisticated.
  7. GO Annotations are mapped based on levels, which does not make sense for biologists.

To solve problems above, new BioDataServer should supports the following:

  • Import everything in the OBO and Gene Association files.
  • Use CyNetwork class to store GO's DAG (like BinGO plugin)

  • Import more general attribute files, not only GO.
  • Supports more general attributes. (Probably we should change the name to AttributeServer)

  • Directly convert attibutes into CyAttributes data structure, and avoid redundant data.

  • Support for MySQL (popular DB in lifescience projects) connection.
  • Support for XML attribute import based on XQuery or other light weight library
  • Remote and local data source support.

Biological Questions / Use Cases

  • Ontology can be anything which has tree/dag datastructure since there are many ontologies in life science field.

General Notes

There are several open-source applications/libraries which can be used in this project:

Requirements

Deferred Items

Open Issues

  • Visualization and interaction with large ontology (like full-size GO)

Backward Compatibility

Expected growth and plan for growth

References

Implementation Plan

  • Use incremental approach.
  • This project can be devided into two parts.
    1. New Ontology Server design (data representation)
    2. UI for browsing/serching ontologies
  • For 2.4, we will forcus on the first part.
  • The new ontology model will be based on [http://www.biojava.org/docs/api14/org/biojava/ontology/package-summary.html BioJava's Ontology interface].

    • Cytoscape version of Ontology class is an implementation of Interface Ontology.
    • This class uses CyNetwork as the data strage to represent the Ontology DAG.

    • This special CyNetwork will be stored in a different RootGraph.

    • Information belongs to an Ontology Term (such as reference, description, etc.) will be stored in CyAttributes.

    • Since this new Ontology class uses CyNetwork and CyAttributes as its backend, we can use all functionality of Cytoscape to this Ontology DAG. The picture below is an example of GO DAG visualization:

    • http://chianti.ucsd.edu/kono/images/godag.png

  • Write reader load all entries in GO
  • Integrate ID Mapper

Current Implementation

User Interface

The following is a temporary GUI for the new classes. This may be changed.

Comments

PageComment2

BioDataServerRFC (last edited 2009-02-12 01:03:40 by localhost)

Funding for Cytoscape is provided by a federal grant from the U.S. National Institute of General Medical Sciences (NIGMS) of the Na tional Institutes of Health (NIH) under award number GM070743-01. Corporate funding is provided through a contract from Unilever PLC.

MoinMoin Appliance - Powered by TurnKey Linux