Index of /norm

 Name                                  Last modified      Size  Description
 Parent Directory                                           -   
 E_coli_v3_Build_1_norm.tar.gz         28-Feb-2007 15:15   50M  
 E_coli_v3_Build_2_norm.tar.gz         28-Feb-2007 15:15   61M  
 E_coli_v3_Build_3_norm.tar.gz         07-Sep-2007 19:05   65M  
 E_coli_v4_Build_2_norm.probe_data.txt 05-Sep-2007 14:02  8.0M  
 E_coli_v4_Build_2_norm.tar.gz         28-Aug-2007 17:33   61M  
 E_coli_v4_Build_3_norm.tar.gz         29-Oct-2007 17:51   64M  
 E_coli_v4_Build_4_norm.tar.gz         20-Dec-2007 12:32   74M  
 E_coli_v4_Build_5.tar.gz              30-Oct-2008 01:16   90M  
 E_coli_v4_Build_5_affy_cdf.tar.gz     30-Oct-2008 01:16   90M  
 E_coli_v4_Build_6.tar.gz              03-Sep-2009 11:12  112M  
 S_cerevisiae_v3_Build_1_norm.tar.gz   28-Feb-2007 15:15   68M  
 S_oneidensis_v3_Build1_norm.tar.gz    28-Feb-2007 15:15  2.9M  
 S_oneidensis_v4_Build1_norm.tar.gz    24-Aug-2007 14:52  3.0M  
 S_oneidensis_v4_Build_2.tar.gz        23-Jun-2008 13:01   42M  
 helper_scripts/                       17-Oct-2007 18:59    -   
 yg_s98_v3_Build_2_norm.tar.gz         30-Sep-2008 19:37   26M

This file describes the normalized compendium dumps from M3D.

--- EXPRESSION DATA ---
You should find six files with expression data in them. The
naming convention for these files is Compendium_chipsMprobesN.tab
where M = the number of chips in the file and N is the number
of probe sets in each file.  You find three different numbers of
probe sets, which from smallest to largest correspond to:
genes only, genes + intergenic regions, genes + intergenic regions +
control probes.  The final three files contain "avg" preceding
the compendium name and they have "exps" rather than "chips".  These
three files contain the average of the replicates for experiments that
have replicates.

--- PROBE INFORMATION ---
In each dump, you will find a file of the form 

Compendium.probe_set_descriptions 

This file contains additional names
for each probe_set. It contains the probe set name, the locus (the standard
gene name used for the species, for example b0123 for E. coli and SO0123 for 
Shewanella), the common name, and a friendly name. The friendly name is 
the common name if a gene has a common name, otherwise it is the locus.

For the E_coli_v4_Build_2_norm data set we have included a separate file 
containing probe_set --> probe mappings and sequences:

  E_coli_v4_Build_2_norm.probe_data.txt


--- CHIP INFORMATION ---
In each dump, you will find a file of the form 

Compendium.experiment_descriptions 

This file contains basic condition information for each experiment and 
chip in the compendium.


--- STRUCTURED EXPERIMENTAL METADATA ---
Compendia that are version 4 or later have an additional file

Compendium.experiment_feature_descriptions 

This file contains curated detailed condition information for each experiment.
For each experiment, you will find many rows.  Each row corresponds to one
feature of the experiment.  Each feature has a defined unit and type
that are enforced across all experiments M3D.  For example, all experiments
in the database utilizing glucose, contain the glucose value as a real number
in mM.  In general, we have tried to use mM with all chemicals in the database,
so that it is easier to combine and merge the different chemicals to calculate 
the number of certain important atoms like Sulfur and Nitrogen.  Chemically 
undefined constituents like yeast extract are provided in their commonly used
units.

--- HELPER SCRIPTS ---
directory contains a script for parsing the normalized data into matlab