Introduction
Transcription factors have long been recognized as major regulators of haematopoietic cell type specification. To understand the mechanisms underlying cell type specification by transcription factors, it will be essential to identify their transcriptional targets. As sequencing protocols mature, ChIP-sequencing is becoming a common technique to identify genome wide binding patterns of a given TF in a given cell type. There are over 100 individual studies now deposited in public databases for the murine haematopoietic system alone. This wealth of new data represents unprecedented opportunities to unravel the transcriptional control mechanisms that mediate expression of specific sets of genes within the various haematopoietic cell lineages.
Gene Ontology overrepresentation analysis provides information on various types of functional categories enriched within a given gene set of interest and GSEA determines whether a gene set of interest shows statistically significant expression differences between two or more cell types. Complementary to these approaches, we developed a new computational framework for linking gene sets with transcriptional control, called Gene Set Control Analysis (GSCA). By exploiting multiple transcription factor binding patterns from genome-wide ChIP-Seq studies, GSCA can provide previously unattainable insights into possible transcriptional control mechanisms operating in both normal and malignant cells. Through integrated analysis of 142 blood specific ChIP-Seq binding datasets, C-GSCA identifies likely combinatorial transcriptional control mechanisms by revealing TF co-occupancy patterns specifically associated with gene regulatory elements from a given gene set.
Main features of this webtool
Citing this research