Differences between revisions 1 and 2
Revision 1 as of 2007-01-10 22:45:12
Size: 10522
Editor: nebbiolo
Comment:
Revision 2 as of 2007-01-10 22:47:39
Size: 10586
Editor: nebbiolo
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
## page was renamed from Cytoscape User Manual/Loaing Networks

Import Fixed-Format Network Files

Networks in specified format maybe imported into Cytoscape through the Import Network dialog, which can be activated through Menu File → Import → Network. Note that (1) The network file name must have Cytoscape recognized extension sif, gml, xgmml to represent its content type. (2) the network file can either be located in the local computer, or it can also be located in a remote computer.

Load a Network File from Local File System

By default, Cytoscape load network from the local computer. When the import network dialog is initialized, the data source type “Local” will be selected. To import a network, first click on the “Select” button, which will pop-up a file chooser and let user browse the local disk to choose a network file. User is allowed only to choose Cytoscape recognized network types. After the file is selected, a click on the Import button will import the network. For example, The following steps will load a sample molecular interaction network in SIF format. Use the menu File → Import → Network. select the file “galFiltered.sif” in “sampleData” directory. After a few seconds, a small network of 331 nodes should appear in the main window. The procedure to load network in other format is very similar. To load the same interaction network as a GML, use the menu: File → Import → Network again. In the resulting file dialog box, select the file “sampleData/galFiltered.gml”. Node and edge attribute files as well as expression data and extra annotation can be loaded as well.

Network files may also be loaded directly from the command line using the –N option (works for SIF, GML, and XGMML).

attachment:network_import_dialog1.png

Load a network in a remote computer (URL import)

Cytoscape also support network import through URL. In this case, user should choose “Remote” as the data Source Type on the Import Network dialog. After user provide a URL to the network file, a click on the import button will load the network from the remote computer.

attachment:network_import_dialog2.png

Note that Cytoscape supports URL bookmarks. A click on the right-most arrow of the comboBox will show a list of available network bookmarks. Cytoscape contains a pre-defined a list of bookmarks, which point to sample network files located on the Cytoscapes web server. Users may add new bookmarks through the Bookmark manager described in the [:Cytoscape_User_Manual/Preferences#properties: Preferences] chapter.

Another issue for network import is the presence of firewalls, which can affect what files are accessible to a computer. To work around this problem, Cytoscape supports the use of proxy servers. To configure the proxy server, click Edit → Preference → Proxy Server…. This is further described in the [:Cytoscape_User_Manual/Preferences#properties: Preferences] chapter.

Anchor(import-free-format-tables)

Import Free-Format Table Files

attachment:network_table_import.png

From version 2.4, Cytoscape supports free-format tables. Users can select columns for source, target, interaction, and edge attributes from interactive GUI.

Supported Files

Network Table Import function supports delimited text files and Microsoft Excel Workbooks (1). The following is a sample table file:

source  target  interaction     boolean attribute       string attribute        floating point attribute
YJR022W YNR053C pp      TRUE    abcd12371       1.2344543
YER116C YDL013W pp      TRUE    abcd12372       1.2344543
YNL307C YAL038W pp      FALSE   abcd12373       1.2344543
YNL216W YCR012W pd      TRUE    abcd12374       1.2344543
YNL216W YGR254W pd      TRUE    abcd12375       1.2344543

The network table files should contain at least two columns: source nodes and target nodes. Interaction is optional in this format. Therefore, minimal network table looks like the following:

YJR022W YNR053C
YER116C YDL013W
YNL307C YAL038W
YNL216W YCR012W
YNL216W YGR254W

One row in a network table file represents an edge and its attributes. A network file is combination of network data and edge attributes. However, network table may contain node attributes. In that case, you can turn off those columns from GUI. This function is useful when you import data table like the following (2):

Unique ID A     Unique ID B     Alternative ID A        Alternative ID B        Aliases A       Aliases B       Interaction detection methods   First author surnames   Pubmed IDs      species A       species B       Interactor types        Source database Interaction ID  Interaction labels      Cross-references        Associated Files        Experiment files        Experiment labels       Different techniques    Different Pubmed articles       Different sources       Weight

7205    5747    TRIP6   PTK2    Q15654  Q05397-1        vv|HPRD Currently not available 14688263|15892868(Marcotte)     Mammalia        Homo sapiens    protein|protein HPRD|Marcotte   0       Thyroid hormone receptor interactor 6-FAK-|PTK2-TRIP6   NA(HPRD)|NA(Marcotte)   HPRD/02859_psimi.xml|other/ORIGINAL_DATA_MARCOTTE.txt   vv(HPRD/02859_psimi.xml)|HPRD(other/ORIGINAL_DATA_MARCOTTE.txt) 17651(ExptRef)|Marcotte 2       2       2       2

4174    7311    MCM5    UBA52   P33992  P62987  neighbouring_reaction   Currently not available 15608231(Reactome)      Homo sapiens    Homo sapiens    protein|protein Reactome        1       P33992-P62988   Reaction:68944<->Reaction:68946(Reactome)|Reaction:68946<->Reaction:68944(Reactome)     other/ORIGINAL_DATA_MARCOTTE.txt        neighbouring_reaction(other/REACTOMEhomo_sapiens.interactions.txt)      Reactome        1       1       1       1

7040    7040    TGFB1   TGFB1   P01137  P01137  nmr: nuclear magnetic resonance Currently not available 8679613 Homo sapiens    Homo sapiens    protein|protein BIND    2       TGFB1-TGFB1-    72085(BIND)     BIND/bind_taxid9606.1.psi.xml   nmr: nuclear magnetic resonance(BIND/bind_taxid9606.1.psi.xml)  NotAvailable    1       1       1       1

This data file is a tab-delimited text and contains network data (interactions), edge attributes, and node attributes. To import network and edge attributes from this table, you need to choose Unique ID A as source, Unique ID B as target, and Interactor types as interaction type. Then you need to turn off columns used for node attributes (Alternative ID A, species B, etc.). Other columns can be imported as edge attributes. To import node attributes from this table after loading network data, please read Attributes section.

Note (1): in version 2.4, Cytoscape supports Excel Workbooks with single sheet (table) only. Multiple sheet Workbooks are not supported.

Note (2): from A merged human interactome datasets by Andrew Garrow, Yeyejide Adeleye and Guy Warner (Unilever, Safety and Environmental Assurance Center, 12 October 2006). Actual data files are available at:

http://www.cytoscape.org/cgi-bin/moin.cgi/Data_Sets/

Basic Operations

To import network text/Excel tables, please follow these steps:

  1. Select File → Import → Network from Table (Text/MS Excel)...

  2. Select a table file by clicking on Select File button.

  3. Set Interaction Definition. You need to specify Source Interaction, Target Interaction, and Interaction Type. For Interaction Type, if you select Default Interaction the default interaction value will be used for all edges. The default value is pp. This value can be modified in Advanced Options.

  4. (Optional) Select edge attribute columns. Network table files can have edge attribute columns in addition to network data.
    • Enable/Disable Attribute Column - By left-clicking on a column header in the preview table, you can enable/disable edge attributes. If the header is checked and entries are blue, the column will be imported as an edge attribute. For example, the table below shows that column 1 through 3 will be used as network data, and column 5 and 6 will be imported as edge attributes. attachment:network_table_sample.png

    • Change Attribute Name and Data Types:
    • If you right-click on a column header in the preview table, the dialog above will be displayed. You can modify attribute name and data type from this dialog. For more detail, see Modify Attribute Name/Type.

  5. Click Import button.

Advanced Options

attachment:network_import_advanced.png

You can select several options by checking Show Text File Import Options.

  • Delimiter: You can select multiple delimiters for text tables. By default, Tab and Space are selected as delimiters.

  • Preview Options: When you select a network table file, first 100 entries will be displayed in the Preview panel. To display more entries, change the value for this option. If you want to show all entries in the file, select Show all entries in the file. You need to click Reload button to update the Preview panel.

  • Attribute Names
    • Transfer first line as attribute names: By choosing this option, first entry in the file will be used as edge attribute names.

    • Start Import Row: Change the start row for import. For example, if you want to skip first 3 rows in the file, set 4 for this option.

    • Comment Line: Rows start with this character will be ignored. This option will be used to skip comment lines in text files.

  • Network Import Options: If Interaction Type is set to Default Interaction, the value here will be used as interaction for all edges.

Modify Attribute Name/Type

attachment:network_table_attr_dialog1.png

Attribute names and data types can be modified in this dialog.

  • Modify Attribute Name - just type new attribute name in the text field and click OK.
  • Modify Attribute Data Type - Table Import supports the following attribute data types:

    • String
    • Boolean (True/False)
    • Integer
    • Floating Point
    • List of String/Boolean/Integer/Floating Point
    Cytoscape has basic data type detection function for string, boolean, integer, and floating point numbers. This means attribute data type will be set to integer if all entries in a column are integers, etc. If you need to change the suggested data type, click on one of the radio buttons in the dialog. For list data type, you need to select delimiter for list cells. The list delimiter is global, i.e. all cells will be split by this character. You cannot specify column-specific list delimiter.

Cytoscape_User_Manual/Creating_Networks (last edited 2009-02-12 01:04:09 by localhost)

Funding for Cytoscape is provided by a federal grant from the U.S. National Institute of General Medical Sciences (NIGMS) of the Na tional Institutes of Health (NIH) under award number GM070743-01. Corporate funding is provided through a contract from Unilever PLC.

MoinMoin Appliance - Powered by TurnKey Linux