Diff for "BioDataServerRFC"

Differences between revisions 1 and 2

RFC Name : ...

Editor(s): ...

About this document

This is an official Request for Comment (RFC) for the BioDataServer.

For details on RFCs in general, check out the [http://www.answers.com/main/ntquery?method=4&dsid=2222&dekey=Request+for+Comments&gwp=8&curtab=2222_1&linktext=Request%20for%20Comments Wikipedia Entry: Request for Comments (RFCs)]

Status

(6/28/2006) This page is still under construction.

How to Comment

To view or add comments, click on any of the 'Comment' links below. Adding your ideas directly to the Wiki lets us organize everyone's ideas more easily and keep clear records. Be sure to include today's date and your name with each comment. Here is an example to get things started: ["/Comment"].

Try to keep your comments as concrete and constructive as possible. For example, if you find a part of the RFC makes no sense, please say so, but don't stop there. Take the extra step and propose alternatives.

Proposal

The BioDataServer class has been used to import ontologies, annotations, and synonyms. Its constructor takes the location of a manifest file and loads data from the individual data sources (annotation files, an ontology file, and a synonym file) specified in that manifest. The current problems are the following:

 1. The file format used in the manifest file is out of date.
 2. New file formats (OBO and Gene Association) are converted into the old format before loading.
 3. Many entries in the new file formats are lost during this conversion.
 4. The imported ontologies are stored as one huge map, not as a DAG, which is the original data structure of GO.
 5. Because GO terms are imported as a huge map, the result makes no sense to many biologists.
 6. The name mapping service is not sophisticated.
 7. GO annotations are mapped based on ''levels'', which does not make sense to biologists.
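
To illustrate why a flat map with levels is a poor fit for a DAG, here is a minimal sketch in Python. It is not the BioDataServer code: the term identifiers and both helper functions are hypothetical, made up only for this example. A term with two parents can sit at more than one depth, so a single "level" per term is ambiguous, while ancestor lookup over the DAG stays well defined.

```python
# Hypothetical miniature ontology: each term maps to its list of parent terms.
# The identifiers are invented for illustration; they are not real GO IDs.
PARENTS = {
    "T:root": [],
    "T:a": ["T:root"],
    "T:b": ["T:a"],
    "T:c": ["T:root", "T:b"],  # two parents, reached along paths of different length
}


def ancestors(term, parents):
    """Return every term reachable from `term` by following parent links."""
    seen = set()
    stack = list(parents[term])
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents[current])
    return seen


def depths(term, parents):
    """Return the set of path lengths from `term` up to a root term."""
    if not parents[term]:
        return {0}
    result = set()
    for parent in parents[term]:
        result.update(d + 1 for d in depths(parent, parents))
    return result


print(sorted(ancestors("T:c", PARENTS)))  # ['T:a', 'T:b', 'T:root']
print(sorted(depths("T:c", PARENTS)))     # [1, 3]
```

Here "T:c" is one step from the root along one path and three steps along another, so assigning it a single level would discard part of the structure.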

To solve the problems above, the new BioDataServer should support the following:

 * Import everything in the OBO and Gene Association files.
 * Use the CyNetwork class to store GO's DAG (as the BinGO plugin does).
 * Import more general attribute files, not only GO.
 * Support more general attributes. (We should probably change the name to AttributeServer.)
 * Convert attributes directly into the CyAttributes data structure, and avoid redundant data.
 * Support MySQL connections (MySQL is a popular database in life science projects).
 * Support XML attribute import based on XQuery or another lightweight library.
 * Support both remote and local data sources.
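
As a rough sketch of what "import everything" could mean for an OBO file, the Python below reads OBO-style text stanza by stanza and keeps every tag, instead of converting to an older format that drops entries. The function names `parse_obo` and `parent_map`, the sample text, and the `EX:` identifiers are assumptions made for illustration; this is not the Cytoscape API, and a real importer would also need to handle quoting, escapes, and trailing modifiers.

```python
def parse_obo(text):
    """Parse OBO-style text into a list of stanzas, keeping every tag.

    Lines before the first bracketed stanza are collected in a stanza
    of type "header". Repeated tags accumulate in a list.
    """
    current = {"type": "header", "tags": {}}
    stanzas = [current]
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue
        if line.startswith("[") and line.endswith("]"):
            current = {"type": line[1:-1], "tags": {}}
            stanzas.append(current)
            continue
        tag, sep, value = line.partition(":")
        if not sep:
            continue  # not a tag-value line; ignored in this sketch
        # Drop a trailing " ! comment" (simplified; ignores quoted "!").
        value = value.split(" ! ")[0].strip()
        current["tags"].setdefault(tag.strip(), []).append(value)
    return stanzas


def parent_map(stanzas):
    """Build a term -> parents mapping from the is_a tags of Term stanzas."""
    return {
        s["tags"]["id"][0]: s["tags"].get("is_a", [])
        for s in stanzas
        if s["type"] == "Term"
    }


# Made-up sample input; the identifiers are not real ontology terms.
SAMPLE = """\
format-version: 1.2

[Term]
id: EX:0000001
name: example root

[Term]
id: EX:0000002
name: example child
synonym: "sample child" EXACT []
is_a: EX:0000001 ! example root
"""

stanzas = parse_obo(SAMPLE)
print(parent_map(stanzas))
```

Because every tag is kept in the stanza dictionary, fields such as synonyms remain available for later conversion into attributes, and the `is_a` links give the edges needed to rebuild the DAG.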

Revision 1 as of 2006-06-22 20:45:28 (Size: 1819, Editor: KeiichiroOno)
Revision 2 as of 2006-06-29 00:04:38 (Size: 3372, Editor: KeiichiroOno)

About this document

Status

How to Comment

Proposal

Biological Questions / Use Cases

General Notes

Requirements

Deferred Items

Open Issues

Backward Compatibility

Expected growth and plan for growth

References

Implementation Plan

Comments