| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Vascular Biology |
From the Department of Vascular Biology and Thrombosis Research (F.G., B.R.B., R.d.M.), the Clinical Institute of Medical and Chemical Laboratory Diagnostics, and Ludwig Boltzmann Institute of Clinical Experimental Oncology (M.B., O.W.), University of Vienna, Austria; and the Competence Center Biomolecular Therapeutics GmbH (H.M., V.K.), Vienna, Austria.
Correspondence to Rainer de Martin, PhD, Department of Vascular Biology and Thrombosis Research, University of Vienna, Brunnerstr 59, A-1235 Vienna, Austria. E-mail rainer.de.martin{at}univie.ac.at
| Abstract |
|---|
|
|
|---|
Methods and Results Human umbilical vein endothelial cells (HUVECs) were stimulated with IL-1 for 0, 0.5, 1, 2.5, and 6 hours and analyzed using Affymetrix U133 microarrays. A total of 137 genes were found to be regulated >4-fold, including 18 transcription factors. The expression of selected genes was confirmed by real-time polymerase chain reaction. Cluster analysis was performed in order to group genes according to their expression profiles. To identify novel transcription factorbinding sites, the corresponding promoters were extracted from databases and analyzed for regulatory elements that were over-represented in specific clusters. Several potentially novel DNA binding sites were identified, and one was shown to specifically bind an IL-1inducible protein from HUVEC.
Conclusions These results demonstrate that in the early phase after stimulation, IL-1 evokes a complex gene expression program that includes positive but also negative (feedback) regulators of diverse endothelial cell functions. Furthermore, the identification of a new promoter regulatory element demonstrates the feasibility of the bioinformatics-driven approach to discover novel regulatory mechanisms.
We analyzed the early gene expression program of IL-1stimulated ECs using microarray technology, particularly focusing on TF profiles. Furthermore, large-scale bioinformatic comparisons of promoter sequence sets led to the identification of a novel DNA binding site that was shown to specifically bind an IL-1inducible protein.
Key Words: endothelial cells inflammation interleukin-1 microarrays transcription factors
| Introduction |
|---|
|
|
|---|
(TNF-
), or lipopolysaccharide.
This gene expression program requires the action of tightly controlled regulatory mechanisms. Transcriptional regulation appears to be of major importance, and several of the transcription factors (TFs) that are operative in the inflammatory context have been the subject of detailed investigation (eg, nuclear factor
B [NF-
B], early growth response 1 [EGR1], and NAK-1).2,5,6 Their tight regulation is of outstanding importance because an exaggerated or prolonged activation may lead to chronic inflammation and cause detrimental effect, as this occurs in a variety of diseases. Appropriate mechanisms to restrict the inflammatory response in time and magnitude can be expected to exist, and some have been described.79
With the aim of acquiring a comprehensive survey over the kinetics of the early gene expression program and of regulatory mechanisms of the endothelium during inflammation, we performed gene expression profiling of human umbilical vein endothelial cells (HUVECs) stimulated with IL-1. With the main focus on the identification of regulatory molecules, we present here several TFs that are novel in the context of endothelial cell (EC) biology. Furthermore, we used a new bioinformatics strategy that is based on the promoter analysis of clustered genes sharing a common regulatory behavior to delineate novel regulatory elements.
| Methods |
|---|
|
|
|---|
RNA Preparation
HUVECs were stimulated with 100 U/mL of human IL-1 (Biosource) for various periods of time and total RNA isolated using the RNeasy kit (Qiagen) according to the instructions of the manufacturer.
Microarray Hybridization and Primary Data Analysis
Preparation of cRNA, hybridization, and scanning of the arrays were performed according to protocols of the manufacturer. A detailed description is available as an online supplement (online Methods, available at http://www.ahajournals.org). Hybridization to one set of human U133A GeneChips (Affymetrix) and scanning of the arrays were performed according to protocols of the manufacturer.11 The arrays were scanned using the GeneArray scanner (Affymetrix). Image analysis was performed with GeneChip software (MAS 5.0; Affymetrix). Normalization was performed by global scaling, with the arrays scaled to an average intensity of 500. To be microarray experiment compliant,12 the microarray data were submitted to Gene Expression Omnibus (GEO),13 which is the public microarray repository of the National Center for Biotechnology Information. A series record (GSE973) was created, describing the experiment in detail and providing a focal point for the 5 individual sample records (GSM15389 to GSM15393), representing the 5 time points of the experiment.
Clustering Analysis
After filtering of genes (ProbeSets) showing absolute calls of "absence" and difference calls of "no change" for all data points, cluster analysis was performed on the remaining set, with a limit of 4-fold regulation (137 genes) in at least one of the time points using the Euclidean distance and average linkage (EPCLUST) program. The K-means clustering algorithm was applied, and the number of clusters was set to 10.14 Correlation measure-based distance was rated superior to Euclidean distance, thereby emphasizing the general shape of the profile and not the absolute magnitude of the expression response. In an alternative approach, we also generated a functional classification system for the set of genes using the Gene Ontology mining tool, which is part of the Affymetrix NetAffx portal,15 as a basis; however, manual literature search had to be added to achieve a reliable annotation of the full data set. The individual functional clusters were then subjected to hierarchical clustering16 using Euclidean distance and average linkage. The clustering procedure and the visualization process were performed using EPCLUST.
Real-Time Polymerase Chain Reaction Analysis
The reliability of the microarray data were confirmed by quantitative real-timepolymerase chain reaction (PCR) using a LightCycler (Roche) together with the 2-step real-timePCR kit and SybrGreen detection (Roche), according to the instructions of the manufacturer. Briefly, 1 µg of total RNA was reverse transcribed in a total volume of 20 µL of RNA PCR Master Mix. The thermocycler was programmed to 1 cycle of 15 minutes reverse transcription at 42°C, 5 minutes denaturation at 96°C, and a final cooling step. The first-strand cDNA was then diluted 1:3 with water, and 1.5 µL (representing 25 ng of total RNA) was used for real-time PCR. Primers were designed using the program Primer3.17 Primer sequences are listed in the online supplement (Table I). To choose sense and antisense primers that belong to different exons, the genomic organization of the genes was extracted from the University of California, Santa Cruz Human Genome Browser, data freeze July 2003.18 In all cases, the primer efficiency was determined using cDNA dilutions.
Comparative Multiple Promoter Analysis
The Java program TOUCAN was the primary software used for comparative promoter analyses of the coclustering genes.19 First, proximal promoter sequences (1 kb upstream and 0.2 kb downstream of the transcriptional start sites) were extracted from genomic databases. As a quality control, all sequences were checked against the UCSC Human Genome Browser,18 demonstrating an accuracy of
95%. The remaining sequences were added manually. The TOUCANtool MotifScanner, which searches the TRANSFAC database,20 was used to detect TF binding sites (TFBSs) in the sets of sequences. The prior (stringency level) was set to a value of 0.1, and the human promoter set of the Eukaryotic Promoter Database (EPD) was chosen as third-order background model. The statistics tool of TOUCAN was applied to the data produced by MotifScanner in combination with the appropriate expected frequencies file (human EPD), thereby detecting over-represented features (showing positive significance values) in the sets of coclustering genes. The tool ModuleSearcher was used to scan the sequences for high-scoring combinations of TFBSs. Default parameters were applied, except that the allowed distance was raised to 1 kb. Additionally, the clusters were scanned by the tool MotifSampler to detect over-represented patterns in the sequence sets using a prior level of 0.2. Finally, the extracted patterns were compared with the TRANSFAC site table to determine whether they correspond to known TFBSs. Selected motifs were subjected to a sequence conservation analysis using the tool WebLogo.21 For this purpose, all sequences defined by a motif were analyzed and a graphic representation created, which displays the frequencies of bases at each position as the relative heights of letters.
Nuclear Extracts and Electrophoretic Mobility Shift Assay
Nuclear proteins were extracted from IL-1treated and control HUVECs as described.22 Double-stranded oligonucleotide probes for the motif over-represented in cluster 1 (C1: 5'-ggatccCGG/CG/CGAG/ACGA/CCCgaattc-3') and FREAC7 (5'-cttaaACATAAACAgcatg-3';23) were labeled with 32P-
-ATP using T4 polynucleotide kinase, and 100 000 cpm per binding reaction were used in electrophoretic mobility shift assay (EMSA) as described.24 A 50-fold excess of unlabeled wild-type or mutant oligonucleotides was used for competition experiments (mutC1: 5'-ggatccCGG/CG/CACG/ACGA/CCCgaattc-3'; mutFREAC7: 5'-cttaaACATCCACAgcatg-3'). Gels were dried and exposed on a PhosphorImager (Molecular Dynamics) and signals quantified using ImageQuant 5.0 software.
| Results |
|---|
|
|
|---|
On the basis of these data, we generated a data set of high reliability (see Methods), both in terms of absolute signal intensities and of significant changes of expression levels, using the criterion of more than 4-fold induction or repression in at least 1 of the time points. This data set consisted of 118 upregulated and 16 downregulated genes, with an additional 3 genes showing both effects. An extended description of these 137 genes, containing all values of fold changes sorted separately for each time point, is available in an online supplement (Table II, available online at http://atvb/ahajournals.org). To subgroup the data set according to the different profiles, the K-means clustering algorithm was applied,14 and the number of clusters, K, was set to 10 (Figure 1; Table III, available online at http://atvb/ahajournals.org). In an alternative approach, the set of 137 genes was functionally classified (Figure 2; Table IV, available online at http://atvb/ahajournals.org). For this purpose, genes are categorized according to "biological processes" and "molecular functions" and subjected to hierarchical clustering.16 The largest group is built of 43 genes (31%) that are known to be involved in processes of inflammation or immune modulation. A total of 26 genes (19%) have functions in proliferation or differentiation, and 12 genes (9%) are part of metabolic pathways. A total of 10 genes (7%) are described to fulfill different functions ("miscellaneous"). The categories apoptosis and survival are composed of 9 genes (7%) each. Only 5 genes (4%) are known to play a role in processes of cell and tissue dynamics. Finally, a considerable list of 23 genes (17%) is unknown in the context of a biological process, most of which are also in the context of molecular function. Interestingly, the last category shows the highest internal fraction of downregulated genes (30%), followed by metabolism (25%) and proliferation (15%).
|
|
Expression Profiles of Selected TFs Are Confirmed by Real-Time PCR
To substantiate the data obtained by microarray analysis, we performed real-time PCR on some selected genes. We chose TFs and other potentially regulatory molecules that were of special interest in the context of this study. Primers were chosen to avoid amplification of genomic DNA, and primer efficiencies determined using a standard curve. Although the magnitude of gene expression differed in some cases between the microarray and real-time PCR experiments, the kinetics of expression were usually very similar (Figure 3).
|
Comparative Regulatory Sequence Analysis of Coclustering Genes
We hypothesized that genes showing a common regulatory behavior may also share common regulatory mechanisms such as TFBSs in their respective promoter regions. To identify these possible common regulatory patterns that should be over-represented in certain clusters, we took advantage of the newly emerging TOUCAN bioinformatics tool.19 In a first step, promoters were extracted from genomic databases (see Methods). Second, a procedure of regulatory sequence analysis was performed in 3 stages, namely the extraction of cognate TFBSs, of combinations ("modules") of TFBSs, and of sequence patterns. From each of these classes of sequence elements, several examples were found over-represented in certain clusters. The results are listed in parts A, B, and C, respectively, of the Table. Strikingly, the DNA binding site for NF-
B, a well-characterized TF involved in the inflammatory response, was identified, thus serving as a positive control for this novel procedure. Please note that the total number of genes in clusters 1, 2, 3, and 8 differs between the Table and Figure 1 because of the unavailability of the promoter sequences for the genes INSIG1, FLJ20220, PMCH, KIAA1053, and DKFZp434K1210.
|
Identification of a Novel TFBS
To further confirm these in-silico predictions experimentally, we performed electrophoretic mobility shift analysis with the most prominent unknown DNA sequence, the CsGAGCGC element ("C1") over-represented in clusters 1 and 5, and with a FREAC7 binding site. An NF-
B binding site served as positive control. Using HUVEC nuclear extracts, a band was detected that bound to the double-stranded C1 oligonucleotide, showing a moderate but significant (1.8-fold) induction on IL-1 stimulation (Figure 4A); no binding was detected with the FREAC7 site. To test for specificity, we generated a mutant C1 oligonucleotide by substituting those residues that were highly conserved in the multiple sequence alignment of the C1 element derived from 29 different promoters (Figure 4B). Binding of the protein to the C1 element was inhibited by an excess of wild-type but not of mutant oligonucleotide, demonstrating the specificity of binding (Figure 4A).
|
| Discussion |
|---|
|
|
|---|
, IL8, and LIF), phosphatases, and other signaling proteins (eg, DUSP1, DUSP5, SOCS1, and TNFAIP3), along with two well-known adhesion proteins (E-selectin and ICAM-1). These molecules also serve as additional positive controls for the microarray analysis. In this study, we chose a narrow time course of IL-1 stimulation for up to 6 hours. In contrast to previous experiments,25,26 in which either later time points of stimulation or smaller numbers of genes have been investigated, this approach has allowed us to study immediate-early to early regulatory mechanisms of EC activation. Indeed, within this time period, 36 TFs (26% of all regulated genes) that are either cognate or potentially novel in the context of inflammation were identified. Well-characterized TFs that mediate the immediate-early response are found in cluster 4 (JUN, FOS, EGR1). Similarly, cluster 5 shows a strong predominance of TFs (10 of the 16 genes), suggesting that these sets of genes peaking at 1 hour after stimulation govern the early response. The expression profiles of 8 TFs were confirmed by real-timePCR (Figure 3). Of these, BHLHB2, EGR4, TIEG, MAFF, and NFIB (a downregulated TF) have not been described in ECs. These TFs are expected to trigger a second wave of gene expression to regulate specific aspects of the inflammatory response at later time points; the identification of their target genes will be the subject of further studies. Notably, 5 downregulated TFs (CBFA2T1, ELK3, MAF, MEF2C, NFIB) are known to be involved in proliferation or differentiation, suggesting a concerted repression of these processes by IL-1. Consistent with these data are also the temporary downregulation of ID1 (inhibitor of DNA binding 1). ID1 is a potent repressor of thrombospondin-1 transcription,27 a major inhibitor of neovascularization.
From a therapeutic point of view, induced genes with inhibitory function are of special interest because they may represent endogenous feedback mechanisms to shut down the inflammatory response at later times. In our experiments, induction of I
B
, an inhibitor of the TF NF-
B, is a well-described example.28 Another is DSCR1, a calcineurin inhibitor peaking at 1 hour, that has been described recently to shut down NF-AT activity.9 It will be interesting to investigate whether the latter mechanism is also operative in ECs. Last but not least, the genes encoding ZFP36 (Figure 3) and C8FW have been associated with negative regulation on several levels.29,30 In particular, ZFP36 has been shown to bind to and accelerate mRNA degradation,29 as already demonstrated for TNF-
and granulocyte-macrophage colony-stimulating factor mRNAs. C8FW is highly homologous to a protein (SINK) that was characterized recently as a negative regulator of NF-
Bdependent transcription via specific interaction with the p65/RelA transactivation domain.31 Our findings on the regulated expression of ZFP36 and C8FW in ECs will serve as a starting point for the investigation of multiple negative regulatory mechanisms during inflammation.
To investigate transcriptional regulation, researchers have traditionally dissected individual promoters to identify novel TFBSs. We reasoned that the large number of regulated genes derived from microarray experiments would allow a different strategy, namely a comparative promoter analysis. This is based on the hypothesis that coregulated genes, because they are grouped in individual clusters, might share common regulatory mechanisms that govern their expression. Promoters of genes grouped in certain clusters would thereby contain a different setup of regulatory elements compared with those in other clusters.
Recently, bioinformatics solutions for this purpose have emerged, including the Genomatix and TOUCAN program packages.19 We used the TOUCAN program, which is freely available, first to extract the promoter regions of all genes, and second, to analyze them with regard to TFBSs, combinations thereof, and sequence motifs. As listed in the Table, a number of over-represented TFBSs (compared with the average in the EPD database) and sequence motifs were identified. In cluster 1 (and also 5, which both peak at 1 hour), the binding site for NF-
B was prominent, serving as an excellent positive control for the procedure. In contrast, NF-
B sites play only a minor role in promoters of cluster 4 (immediate-early response genes peaking at 0.5 hours), and a moderate role in "late responders" (clusters 2 and 3). This is consistent with the expression kinetics of many known NF-
Bdependent genes. In contrast, the binding sites for so-called "general" TFs such as SP-1 are more evenly distributed over the set of clusters, as expected. Another striking observation is the prominent occurrence of binding sites for CREB/ATF clusters of immediate-early (cluster 4) and early (cluster 5) induced genes compared with late responders (clusters 2 and 3) and downregulated genes.
The second part was the search for significant combinations of 2 or more TFBSs using the TOUCAN tool Module- Searcher (see the Table, B). This approach is based on the suggestion that rather the combinations of TFs are relevant for gene expression. Although parts of this hypothesis, especially the contribution of the distance between binding sites, remain a matter of debate, this approach has yielded several combinations of known TFBSs, involving AP2, FOXD3, FREAC7, HFH3, and PAX4. However, we were not able to confirm a role for FREAC7,23 in EMSA (see Results). Therefore, the significance of this second approach remains to be established.
The third stage of the analysis concentrated on the search for over-represented motifs (patterns) in promoters of coclustering genes using the TOUCAN tool MotifSampler. A motif length of 8 bases was chosen, and in the first run, hits allowing a certain degree of mismatches to the "perfect motif" were also produced. Finally, only "perfect matches" are listed in the Table (C), allowing for comparison between the different clusters. We decided to investigate further the motif C1 (CsGAGCGC, where s is C or G), which occurs in clusters 1 and 5. Searching this motif in the TRANSFAC site table did not yield any known TFBSs. Using EMSA, we could demonstrate that a protein, possibly representing a novel TF, binds to this sequence. This protein also showed IL-1 inducibility, further supporting its involvement in regulated inflammatory gene expression. Its cloning and additional characterization will be the subject of future studies. Moreover, our results demonstrate the validity of the comparative promoter analysis approach that can be applied as a general follow-up study for gene expression profiling.
| Acknowledgments |
|---|
This work was supported by grants from the Austrian Science Foundation (SFB 005-11, SFB 005-12; to B.R.B., R.dM., respectively).
Received February 13, 2004; accepted March 10, 2004.
| References |
|---|
|
|
|---|
signaling events leading to tissue factor up-regulation via EGR-1 in endothelial cells. FASEB J. 2001; 15: 230242.
-induced PAI-1 expression. Blood. 2003; 101: 30423048.
promoter. J Biol Chem. 1994; 269: 1355113557.
on gene expression in human endothelial cells. Am J Physiol Cell Physiol. 2003; 284: C1577C1583.
-like gene is regulated by NF kappa B. EMBO J. 1993; 12: 27732779.[Medline]
[Order article via Infotrieve]
This article has been cited by other articles:
![]() |
C Hamid, K Norgate, D P D'Cruz, M A Khamashta, M Arno, J D Pearson, G Frampton, and J J Murphy Anti-{beta}2GPI-antibody-induced endothelial cell gene expression profiling reveals induction of novel pro-inflammatory genes potentially involved in primary antiphospholipid syndrome Ann Rheum Dis, August 1, 2007; 66(8): 1000 - 1007. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. J. Florey, M. Johns, O. O. Esho, J. C. Mason, and D. O. Haskard Antiendothelial cell antibodies mediate enhanced leukocyte adhesion to cytokine-activated endothelial cells through a novel mechanism requiring cooperation between Fc{gamma}RIIa and CXCR1/2 Blood, May 1, 2007; 109(9): 3881 - 3889. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Nagao, M. Matsumura, A. Mabuchi, A. Ishida-Okawara, O. Koshio, T. Nakayama, H. Minamitani, and K. Suzuki Up-regulation of adhesion molecule expression in glomerular endothelial cells by anti-myeloperoxidase antibody Nephrol. Dial. Transplant., January 1, 2007; 22(1): 77 - 87. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Panepucci, R. T. Calado, V. Rocha, R. Proto-Siqueira, W. A. Silva Jr., and M. A. Zago Higher Expression of Transcription Targets and Components of the Nuclear Factor-{kappa}B Pathway Is a Distinctive Feature of Umbilical Cord Blood CD34+ Precursors Stem Cells, January 1, 2007; 25(1): 189 - 196. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Vasarhelyi, A. Cseh, I. Kocsis, A. Treszl, B. Gyorffy, and J. Rigo Jr Three mechanisms in the pathogenesis of pre-eclampsia suggested by over-represented transcription factor-binding sites detected with comparative promoter analysis Mol. Hum. Reprod., January 1, 2006; 12(1): 31 - 34. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Aerts, P. Van Loo, G. Thijs, H. Mayer, R. de Martin, Y. Moreau, and B. De Moor TOUCAN 2: the all-inclusive open source workbench for regulatory sequence analysis Nucleic Acids Res., July 1, 2005; 33(suppl_2): W393 - W396. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
ATVB Home | Subscriptions | Archives | Feedback | Authors | Help | AHA Journals Home | Search Copyright © 2004 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. |