Chado Expression Module

From GMOD
Revision as of 18:45, 19 February 2007 by Bosborne (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Introduction

This module is for how curated expression data is stored in chado. This module is totally dependent on the sequence module. Objects in the genetic module cannot connect to expression data except by going via the sequence module. We assume that we'll always have a controlled vocabulary for expression data.

Here is an example of a simple case of the sort of data that FlyBase curates. The dpp transcript is expressed in embryonic stage 13-15 in the cephalic segment as reported in a paper by Blackman et al. in 1991. This would be implemented in the expression module by linking the dpp transcript feature to expression via feature_expression (we would add a pub_id column to feature_expression to link to the publication in the pub table). We would then link the following cvterms to the expression using expression_cvterm:

  • embryonic stage 13 where the cvterm_type would be stage and the rank=0
  • embryonic stage 14 where the cvterm_type would be stage and the rank=1
  • embryonic stage 15 where the cvterm_type would be stage and the rank=1
  • cephalic segment where the cvterm_type would be anatomy and the rank=0
  • in situ hybridization where the cvterm_type would be assat and the rank=0

Note that we would change the cvterm_type column to cvterm_type_id and use a cvterm_id for a particular expression slot (i.e. stage, anatomy, assay, 'subcellular location' and that cvterms from different OBO ontologies can share the same cvterm_type.


Tables