Table of Contents

CLENCH

CLENCH: A program for calculating cluster enrichment using the Gene Ontology

Analysis of microarray data most often produces lists of genes with similar expression patterns, which are then subdivided into functional categories for biological interpretation. Such functional categorization is most commonly accomplished using Gene Ontology (GO) categories. Although there are several programs that identify and analyze functional categories for human, mouse and yeast genes none, of them accept Arabidopsis thaliana data. In order to address this need for A. thaliana community, we have developed a program that retrieves GO annotations for A. thaliana genes and performs functional category analysis for lists of genes selected by the user.

Clench2.0

Cluster enrichment analysis and visualization of expression, annotation and transcription factor binding site data.

Download Clench2 code

See a power point presentation on Clench2.0

Download a report describing Clench2

Motivation

The end result of analyzing microarray datasets is a list of differentially expressed genes. Such gene lists are grouped by the functions of the gene products and common transcription factor binding sites contained in their promoters. Functional categorization is most commonly accomplished using Gene Ontology (GO) catego-ries and promoters are analyzed for the presence and enrichment of binding sites for transcription factors known to be involved in the process under study. Although there are several programs that iden-tify and analyze functional categories, few of them analyze both promoter sequences and functional categories. Moreover, the inte-grated visualization of the three data types, expression, annotation and transcription factor binding sites in the promoters, is important for drawing meaningful inferences.

Results

In order to address this need for A. thaliana community, we have radically modified and extended our CLENCH tool, which per-formed functional category analysis. Clench2.0 performs functional analysis, searches promoter sequence for known TF binding sites, and visualizes the expression, annotation and transcription factor binding site data for lists of genes provided by the user. Although developed for A. thaliana, Clench2.0 can be easily adapted to work for other model organisms.

New features in Clench2

     holding/hovering the mouse on a node shows its label in text
     clicking on a node shows its row in the result
     holding/hovering the mouse on the promoter image shows 
     the p-values for the enriched promoters 
     The user can download the expression data for genes 
     assigned to each term by clicking the 'Get Data' link above 
     the expression image in the result row 
     Using a custom list of terms
     Using slimterms provided in the TAIR output