[UNDER CONSTRUCTION]
Introduction
CyThesaurus is a Cytoscape plugin that provides identifier (ID) mapping services based on various resources. The plugin currently supports ID mapping resources from delimited text files, PGDB files and the BioMart web service. It is built on the BridgeDb API.
Use Cases
Five related use cases have been identified on the Bader Lab ID Mapping page. Two of them are closely related to this project:
Unification during dataset merging: During a merge operation, e.g. of two protein-protein interaction datasets from independently created databases, it is vital to recognize that two protein objects, one from each data source, represent the same protein molecule, even if the protein objects don’t share any database accession numbers. Unification requires knowledge of the record type; e.g. you cannot reliably use a gene ID to unify proteins (mostly because splice variants exist).
Identifier translation: Some analysis methods require specific translations from one set of identifiers to another. For instance, our 'activity centers' analysis requires translation from protein or gene identifiers in a pathway database to Affymetrix probe set identifiers or other gene expression array platform identifiers.
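The two use cases above can be illustrated with a small sketch. It is not the plugin's implementation: the mapping rows, ID type names, record fields and function names below are all hypothetical, and the unification rule (same record type plus a shared mapping row) is a deliberate simplification.

```python
from collections import defaultdict

# Hypothetical mapping rows: each row lists IDs, keyed by ID type,
# that are considered to map to each other.
MAPPING_ROWS = [
    {"ProteinA": ["PA1"], "ProteinB": ["PB1"], "ProbeSet": ["100_at", "101_at"]},
    {"ProteinA": ["PA2"], "ProteinB": ["PB2"], "ProbeSet": ["102_at"]},
]


def build_index(rows):
    """Map each (id_type, id) pair to the set of row numbers that mention it."""
    index = defaultdict(set)
    for n, row in enumerate(rows):
        for id_type, ids in row.items():
            for identifier in ids:
                index[(id_type, identifier)].add(n)
    return index


def translate(rows, index, source_type, identifier, target_type):
    """Identifier translation: target-type IDs sharing a row with the given ID."""
    targets = set()
    for n in index.get((source_type, identifier), ()):
        targets.update(rows[n].get(target_type, []))
    return sorted(targets)


def same_entity(index, record_a, record_b):
    """Unification: same record type and at least one shared mapping row."""
    if record_a["record_type"] != record_b["record_type"]:
        return False  # e.g. never unify a gene record with a protein record
    rows_a = index.get((record_a["id_type"], record_a["id"]), set())
    rows_b = index.get((record_b["id_type"], record_b["id"]), set())
    return bool(rows_a & rows_b)


index = build_index(MAPPING_ROWS)
print(translate(MAPPING_ROWS, index, "ProteinA", "PA1", "ProbeSet"))
print(same_entity(
    index,
    {"record_type": "protein", "id_type": "ProteinA", "id": "PA1"},
    {"record_type": "protein", "id_type": "ProteinB", "id": "PB1"},
))
```

The two records in the last call share no accession number, yet they are unified because both IDs appear in the same mapping row. The record type check reflects the point that a gene ID should not be used to unify proteins.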
Supported ID Mapping Resources
File-based
Delimited text file
File format (e.g. http://tinyurl.com/mergesvn/testData/yeast_id_mapping.txt):
- Each column holds one ID type
- The first row contains the ID type names
- Each subsequent row lists IDs of different types that map to each other
- A cell may contain multiple IDs (for one-to-many mappings, or for IDs of the same type that map to each other). Separate the IDs with a special character (e.g. ';' or '/', or a user-defined one).
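As an illustration of the format described above, the following minimal sketch parses such a table. It assumes tab-separated columns and ';' as the in-cell ID separator; the function name, the column names and the sample IDs are hypothetical and are not taken from the plugin or from the linked test file.

```python
import csv
import io


def parse_mapping_table(text, column_sep="\t", id_sep=";"):
    """Parse a delimited ID mapping table.

    Returns (id_types, rows), where each row is a dict that maps an
    ID type to the list of IDs found in that row's cell.
    """
    reader = csv.reader(io.StringIO(text), delimiter=column_sep)
    id_types = [name.strip() for name in next(reader)]  # first row: ID types
    rows = []
    for cells in reader:
        if not any(cell.strip() for cell in cells):
            continue  # skip blank lines
        row = {id_type: [] for id_type in id_types}
        for id_type, cell in zip(id_types, cells):
            # A cell may hold several IDs separated by id_sep.
            row[id_type] = [part.strip() for part in cell.split(id_sep) if part.strip()]
        rows.append(row)
    return id_types, rows


# Hypothetical sample data; real files use real database identifiers.
SAMPLE = (
    "GeneID\tProteinID\tSymbol\n"
    "G1\tP1;P2\tSYM1\n"
    "G2\tP3\tSYM2;SYM2_ALT\n"
)

id_types, rows = parse_mapping_table(SAMPLE)
print(id_types)
print(rows[0]["ProteinID"])
```

Making the in-cell separator a parameter mirrors the user-defined separator mentioned in the format description.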
RDB-based
PGDB file
Gene database schema: http://www.bridgedb.org/wiki/GeneDatabaseLayout
Gene databases are available at http://bridgedb.org/data/gene_database/
Web service-based
BioMart web service
The plugin uses the BioMart web service as an ID mapping resource.
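For orientation, BioMart services accept queries written in a small XML dialect, which is typically sent as the `query` parameter of a `martservice` endpoint. The sketch below only builds such a query document and does not contact any server. It is not the plugin's code: the function name is hypothetical, and the dataset, filter and attribute names in the usage example are illustrative placeholders that vary between BioMart installations and releases.

```python
import xml.etree.ElementTree as ET


def build_biomart_query(dataset, filter_name, ids, attributes):
    """Build a BioMart XML query mapping the given IDs to other attributes."""
    query = ET.Element("Query", {
        "virtualSchemaName": "default",
        "formatter": "TSV",
        "header": "0",
        "uniqueRows": "1",
        "datasetConfigVersion": "0.6",
    })
    ds = ET.SubElement(query, "Dataset", {"name": dataset, "interface": "default"})
    # Several source IDs are passed as one comma-separated filter value.
    ET.SubElement(ds, "Filter", {"name": filter_name, "value": ",".join(ids)})
    # Each requested attribute becomes one column of the result table.
    for attribute in attributes:
        ET.SubElement(ds, "Attribute", {"name": attribute})
    body = ET.tostring(query, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n<!DOCTYPE Query>\n' + body


# Illustrative names only; check the target BioMart for the actual ones.
print(build_biomart_query(
    dataset="hsapiens_gene_ensembl",
    filter_name="ensembl_gene_id",
    ids=["ID_1", "ID_2"],
    attributes=["ensembl_gene_id", "target_id_attribute"],
))
```

With the TSV formatter, each result line would pair a source ID with a target ID, which has the same shape as a row of a delimited text mapping file.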
BridgeDb web service
Being developed.
PICR web service
Being developed.
Code Base
The plugin is currently based on Cytoscape 2.6. A port to Cytoscape 3.0 is planned.
ID mapping service for other plugins
An inter-plugin communication module is being developed to allow the CyThesaurus plugin to provide ID mapping services to other plugins.