AJP - Lung AJP: Endocrinology and Metabolism
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Am J Physiol Lung Cell Mol Physiol 292: L367-L377, 2007. First published October 13, 2006; doi:10.1152/ajplung.00242.2006
1040-0605/07 $8.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
292/2/L367    most recent
00242.2006v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Vos, J. B.
Right arrow Articles by Hiemstra, P. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Vos, J. B.
Right arrow Articles by Hiemstra, P. S.

INVITED REVIEW

Exploring host-pathogen interactions at the epithelial surface: application of transcriptomics in lung biology

Joost B. Vos,1 Nicole A. Datson,2 Klaus F. Rabe,1 and Pieter S. Hiemstra1

Departments of 1Pulmonology and 2Medical Pharmacology, Leiden Amsterdam Center for Drug Research, Leiden University Medical Center, Leiden, The Netherlands


    ABSTRACT
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
The epithelial surface of the airways is the largest barrier-forming interface between the human body and the outside world. It is now well recognized that, at this strategic position, airway epithelial cells play an eminent role in host defense by recognizing and responding to microbial exposure. Conversely, inhaled microorganisms also respond to contact with epithelial cells. Our understanding of this cross talk is limited, requiring sophisticated experimental approaches to analyze these complex interactions. High-throughput technologies, such as DNA microarray analysis and serial analysis of gene expression (SAGE), have been developed to screen for gene expression levels at large scale within single experiments. Since their introduction, these hypothesis-generating technologies have been widely used in diverse areas such as oncology and brain research. Successful application of these genomics-based technologies has also revealed novel insights in host-pathogen interactions in both the host and pathogen. This review aims to provide an overview of the SAGE and microarray technology illustrated by their application in the analysis of host-pathogen interactions. In particular, the interactions between epithelial cells in the human lungs and clinically relevant microorganisms are the central focus of this review.

large-scale gene expression profiling; serial analysis of gene expression; microarray; epithelial host defense


EFFICIENT HOST DEFENSE IS essential for survival. The human body is equipped with an elaborate defense system comprising the innate and adaptive immune system. The vast majority of threats are effectively eliminated by the immune system before they can cause injury (5, 63). By forming the primary physical barrier between the body's internal and external world, epithelial tissues are indispensable for host defense. In addition, epithelial cells contribute to the chemical barrier by producing a wide range of host defense effector molecules (5, 19). The effectiveness of epithelial host defense mechanisms is demonstrated by the rare incidence of severe infections at epithelial surfaces in the lungs of healthy individuals. The mechanisms underlying host defense and epithelial repair critically depend on the presence of functional proteins at appropriate quantities within a crucial time window. These proteins are encoded by genes whose transcription is tightly coordinated by complex programs of gene expression. Altered or defective gene regulation may not only increase the susceptibility of the host to infections but may also be involved in the development of atopic disease (15).

Evaluating gene expression has greatly extended our understanding of behavior and function of cells and tissues under varying conditions. A diversity of technologies has been developed to assess levels of gene expression, ranging from analysis of single genes (i.e., Northern blot and PCR) to thousands of genes simultaneously [i.e., serial analysis of gene expression (SAGE) and microarray; Table 1]. On the basis of experimental approach, a distinction can be made between "closed" and "open" profiling methods. Closed approaches rely on hybridization of genes of interest to complementary nucleic acids, and, therefore, genome knowledge is a prerequisite. Examples of closed approaches are Northern blot (4) and DNA microarrays (56). In contrast, open approaches do not depend on genome knowledge since they are based on sequencing of poly(A)+ mRNA molecules expressed in the cell or tissue of interest. Examples of open approaches are expressed sequence tag (EST) sequencing (2), SAGE (65), and massively parallel signature sequencing (MPSS) (11). The identity of transcripts is determined by matching the experimental sequence to available genomic data.


View this table:
[in this window]
[in a new window]

 
Table 1. Overview of gene expression analysis technologies

 
The most widely applied high-throughput strategies are SAGE (65) and DNA microarray technology (56). These hypothesis-generating technologies are highly suitable to study host-pathogen interactions and are being increasingly applied in hypothesis-driven pulmonary research (7, 20, 31, 34). So far, interactions between airway epithelial cells and respiratory pathogens such as Staphylococcus aureus, Pseudomonas aeruginosa, and Bordetella pertussis have been studied with high-throughput expression profiling techniques. The introduction of large-scale gene expression profiling in host-pathogen research has not only provided novel insights into how epithelial cells cope with potential hazards of infection but has also demonstrated how microbes respond to the host. This review focuses on the application of the SAGE and microarray technologies in pulmonary research, in particular to investigate the molecular mechanisms underlying host-pathogen interactions in the lung.


    SERIAL ANALYSIS OF GENE EXPRESSION
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
SAGE is an open high-throughput expression profiling technique that allows unbiased assessment of virtually all polyadenylated transcripts in a single sample (65, 66). The outline of the technology is presented in Fig. 1. Shortly, double-stranded cDNA is synthesized from mRNA molecules that are biochemically purified by the poly(A)+ tail. In a series of two endonuclease reactions, representative short nucleotide sequences of 10–14 bp are isolated. In the first endonuclease step, the restriction enzyme NlaIII is used to digest double-stranded cDNAs at every four-base CATG sequence. Since each cDNA is immobilized at the poly(A)+ tail, only the most 3' fragments are captured for further processing. Before the second endonuclease step, a linker sequence containing the recognition site for the second restriction enzyme, BsmFI, is attached to the generated four-base CATG overhang. The type IIs endonuclease BsmFI specifically cleaves at 10–14 bp distance from its recognition site, thereby releasing the 10- to 14-bp fragments from the 3'-end immobilized cDNA fragments. Although relatively short (10–14 bp), these so-called SAGE tags contain sufficient genetic information to uniquely identify individual transcripts since they are derived from a defined position within each individual transcript: directly adjacent to the most 3' recognition site of NlaIII. Serial ligation of multiple SAGE tags into long multimers not only allows qualitative and quantitative determination of SAGE tags but also drastically increases sequencing efficiency by limiting the number of required sequencing reactions. The identity of SAGE tags representing known genes is then established by comparing its nucleotide sequence to available gene sequences deposited in genetic databases. Since SAGE libraries are generated by random sampling of transcripts, expression profiles will become more accurate when analyzing larger numbers of tags. When sufficient SAGE tags are analyzed, the sensitivity of SAGE appears to be comparable to DNA microarrays in estimating expression levels of, in particular, medium to highly abundant genes (22, 30, 33).


Figure 1
View larger version (42K):
[in this window]
[in a new window]

 
Fig. 1. Outline of the serial analysis of gene expression (SAGE) and microarray technology. A flow chart of the SAGE and microarray technologies is presented from top to bottom. In the first step, cultured cells are exposed to a pathogen. Isolated RNA is processed as is required for SAGE or microarray technology. Collected data reveals the expression profiles in the cells of interest under the studied conditions as illustrated (bottom).

 
An attractive feature of SAGE is that the technology allows gene discovery. Despite the completion of sequencing the human genome, annotation of the genes within the genome is still very much an ongoing process. Our knowledge of the repertoire of human genes is still incomplete, and novel genes are discovered on a frequent basis. The SAGE platform is excellent for conducting comparative expression studies because SAGE yields digital sequence-based data. Libraries can be compared with one another without complex mathematical normalization methods, even if these libraries were generated in different laboratories (68). A drawback of SAGE is the laborious nature of the technology. On average, it takes a few weeks to generate and sequence a single library. Consequently, the number of samples that can be realistically studied is limited. Because of the laborious nature of SAGE, researchers often prefer the microarray technology.

Interestingly, recently the application of SAGE has expanded from expression analysis to include whole genome analysis. Combining the specificity of chromatin immunoprecipitation (ChIP) with the sensitivity of SAGE allows the identification of genome signature tags defining functional genomic elements and transcribed regions (28) such as transcription factor binding sites (32) and regions with hyperacetylated histone proteins (53). The combined approach of ChIP and SAGE was first introduced as serial analysis of chromatin occupancy (SACO, Ref. 32) and is also known as genome-wide mapping technique (GMAT, Ref. 53), sequence tag analysis of genomic enrichment (STAGE, ref. 36), and serial analysis of binding elements (SABE, Ref. 13). Essential to application of novel technologies is the availability of analysis software. Recently, the first web-based software tool has been introduced that facilitates the analysis of this data (40). Although the combination of ChIP and microarray, called the "ChIP-on-chip," is available, this method has major drawbacks. Most importantly, a substantial number of promoter and other regulatory regions within in the human genome have not been well characterized and are therefore not represented on the microarray. In addition, binding sites for transcription factors may well be located outside the transcription initiation sites of genes (32). Perhaps not surprisingly, the genome-wide and open, unbiased nature of SAGE offers clear advantages over the microarray, both in terms of efficiency and precision of identifying (unknown) regulatory regions within the genome. Attractive applications of ChIP and SAGE can be foreseen in studying the regulation of the response of airway epithelial cells to pathogens. Transcription factors associated with the regulation of expression of proinflammatory genes involved in the response of airway epithelial cells to microorganisms include NF-{kappa}B and AP-1. The use of antibodies directed against these transcription factors for ChIP would allow the identification of regulatory sequences employed by these transcription factors in the response of airway epithelial cells to microorganisms. This information could provide more integrated insights on how the transcriptional changes in airway epithelial cells exposed to microbial stimuli are regulated.


    DNA MICROARRAYS
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
The microarray technology is a closed high-throughput method that enables the measurement of a large, predetermined set of known genes or sequences (56). In microarray technology, DNA molecules representing specific transcripts are fixed onto a solid support, ranging from oligonucleotides (25–70 mer) to complete cDNAs. Inherent to closed approaches like microarray, a finite collection of arrayed sequences can be analyzed. However, microarrays are available that contain ~45,000 probe sets covering all known human genes as well as thousands of undefined ESTs.

To visualize gene expression on microarrays, samples are labeled with a fluorescent dye before hybridization, and fluorescence intensity is quantified as a measure for gene expression in the original sample. The outline of the labeling and hybridization steps for the single-color microarray technology is depicted in Fig. 1. For the single-color microarray, each sample is hybridized to a separate microarray. Before hybridization, samples are biotin labeled and stained with a streptavidin-bound fluorophore (i.e., phycoerythrin) and visualized by confocal laser microscopy. The single-color approach is common to commercial oligonucleotide arrays. Advantages of oligonucleotide microarrays are that 1) repetitive sequences within the genome can be circumvented because of the use of short and uniquely designed probes; 2) probes have more uniform hybridization efficiencies; and 3) standardized protocols and equipment are used providing consistent and reproducible data generation (57). Because of unique probe design and uniform hybridization efficiencies, oligonucleotide arrays have a larger dynamic range of detecting gene expression (6). A disadvantage of oligonucleotide arrays is that the production procedure is costly and relatively inflexible.

An alternative approach for single-color microarray is the two-color microarray. The two-color approach is mostly used for cDNA microarrays but can also be applied to oligonucleotide arrays. For cDNA arrays, long and double-stranded cDNA probes are fixed onto this type of microarray. For hybridizing cDNA microarrays, samples are each labeled with a distinct fluorescent dye (i.e., Cy3 and Cy5) and cohybridized to the same microarray. Binding of transcripts from both samples is detected using confocal laser microscopy by scanning the chip for the two fluorescent channels separately. Expression levels of genes within the two cohybridized samples can be directly compared. An advantage of this approach is that two samples can be applied at once to a single microarray. However, a disadvantage of the two-color approach is the need to perform dye-swap experiments and mathematical signal normalization strategies to control for varying emission intensities of the Cy3 and Cy5 dyes (reviewed in Ref. 57).

Since a substantial number of academic organizations invested in their own arraying facilities, the cDNA microarray is frequently used. When equipment and probe collections are available, "inhouse-made" cDNA are more flexible and cost effective compared with the commercially available oligonucleotide microarrays. However, cDNA microarrays have a smaller dynamic range because of a less-efficient signal-to-noise ratio (6). This less-efficient signal-to-noise ratio is partly due to the fact that the arrayed cDNA probes are lengthy and double stranded, increasing the likelihood of nonspecific and cross hybridization to related sequences (48). In addition, the density of arrayed cDNA probes is generally lower compared with oligonucleotide microarrays. Despite these differences, cDNA and oligonucleotide microarrays perform equally with respect to detection of abundantly expressed genes (48).

A major advantage of DNA microarrays is the commercialization of the most labor-intensive parts of the methodology: collecting sequences (synthesized oligonucleotides or cDNA clones) and array fabrication. Commercial manufacturers of microarrays provide extensive and well-documented annotation of probe sets, which eases data mining and interpretation, whereas annotation of SAGE tags is not straightforward (see ANNOTATION). Today, ready-to-use microarrays are available for many different organisms, which makes the use of this technology possible without the need of having microarray fabrication equipment and owning cDNA clone collections. Collecting data by using prefabricated microarrays typically takes less than 1 wk. Therefore, this high-throughput profiling technology is often the preferred choice of many scientists. However, in contrast to SAGE, comparing DNA microarray results between experiments and between laboratories is hampered by differences in the type of array used (single vs. dual color; oligo vs. cDNA; homemade vs. commercial), the spotted probe sequences, and the lack of standardized experimental procedures. Not all fluorescent labels perform equally, and different probe sequences used by the various manufacturers representing the same gene may give rise to varying hybridization efficiencies. To facilitate comparative analysis on microarray data, the Microarray Gene Expression Data Group (MGED; http://www.mged.org) proposed a uniform annotation format for microarray experiments (MIAME) (10). Although complying with the same standardized annotation format, complex computational normalization methods are still required to conduct comparative research based on microarray experiments.


    DATA MINING AND INTERPRETATION
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
The most challenging part of the analysis of SAGE and microarray experiments is to assign biological significance to the observed findings and to finally formulate new hypotheses. Although high-throughput expression profiling technologies are very efficient in producing sizeable quantities of data, simple and standardized computational methods for data analysis and mining are not easily accessible. The need for appropriate tools is becoming increasingly important in successful expression profiling experiments since data analysis is often hampered by the lack of knowledge on available methods and data mining resources. Recent efforts have greatly improved and facilitated the application of statistical and functional data analysis methods. However, investigators should be aware of both the experimental difficulties as well as issues related to statistical and functional data analysis since these factors can influence the final results.


    ANNOTATION
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
Before data mining and interpretation, accurate annotation of experimental data is essential. For most commercially available microarrays, appropriate annotation of probe sets, including links to reference databases such as gene ontology, is generally supplied by the manufacturer. In contrast, accurate annotation of SAGE tags is a tedious task that is not easily accomplished. A short guide on how to annotate SAGE data is provided here since this is a specific SAGE issue. For tag identification, SAGE libraries can be matched to the National Center for Biotechnology Information's (NCBI) reliable Unigene cluster to SAGE tag map (ftp://ftp.ncbi.nlm.nih.gov/pub/sage) (37) or to the Cancer Genome Anatomy Project's (CGAP) SAGE genie (http://cgap.nci.nih.gov/SAGE) (9). For well-known genes, both mapping strategies yield the same results. However, when tags match to poorly described transcripts, the NCBI and SAGE genie maps yield different outcomes. Therefore, the best tag identification is achieved by combining the two tag-mapping strategies (69). Besides poorly described transcripts, SAGE libraries are contaminated with tags isolated from nonpreferred restriction sites within transcripts. Typically, ~30% of the unique sequenced tags are derived from nonpreferred 3'-end positions (unpublished observations). Discarding these nonrepresentative tags not only facilitates experimental validation of SAGE data (68) by decreasing the number of false positives but also eases subsequent data mining and interpretation. The SAGE genie allows prescreening of SAGE libraries for the presence of known nonrepresentative tags. The presence of single nucleotide polymorphisms may also give rise to alternative anchoring sites for the restriction enzyme to isolate SAGE tags (58), a feature addressed in the recent versions of SAGE genie. Once SAGE libraries have been annotated and nonrepresentative tags have been discarded, association to additional data sources [Genbank, Gene (formerly known as Locuslink), Online Mendelian Inheritance Man (OMIM), Gene Ontology, or any other useful biological data source] can be established to facilitate the biological interpretation of the data.


    MATHEMATICAL CLUSTERING
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
Mathematical clustering is a useful and powerful next step to structure expression data in an unbiased fashion. Clustering methods are based on the assumption that genes showing similar transcriptional behavior across samples might correspond to or be involved in the same biological process. Since clustered data is more easily accessible, it is a recommended analysis method for SAGE and microarray experiments. Clustering methods can be divided into supervised and unsupervised methods. Supervised or knowledge-assisted clustering algorithms are guided by existing biological knowledge about specific subsets of genes (i.e., signal transduction cascades, cell cycle, metabolic pathways) (46). In contrast, unsupervised clustering methods allow complete unbiased structuring of gene expression data. To date, unsupervised clustering methods have been most widely applied to large-scale gene expression data. Commonly used unsupervised mathematical clustering methods are hierarchical clustering (21), K-means clustering (62), self-organizing maps (61), and principal component analysis (50). A detailed description of these clustering algorithms is beyond the scope of this review. An accessible description of each of the commonly used algorithms is excellently reviewed elsewhere (49). Software tools to perform mathematical clustering analysis include the commercially available Spotfire Decision Site Software (Spotfire, Göteborg, Sweden) and the freely available Cluster (Ref. 21; http://rana.lbl.gov/EisenSoftware.htm), Genesis (Ref. 60; http://genome.tugraz.at), and web-based "Classification of Expression Arrays" (Ref. 50; http://classify.stanford.edu).

Application of clustering methods has become a main step in the analysis of microarray data. In contrast, clustering methods are not frequently applied to SAGE data. Indeed, without normalization of SAGE tag counts into relative frequencies, these methods are not directly applicable to SAGE data. This might explain the limited application of these methods in SAGE studies. When using mathematical clustering, it is recommended to try different methods (i.e., K-means, hierarchical) since the different methods should yield similar results.


    FUNCTIONAL MATHEMATICAL CLUSTERING
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
In silico subtraction, methods can be used to identify preferential tag expression in the SAGE libraries of interest using the Tissue Preferential Expression (TPE) algorithm. With the use of this algorithm, expression specificity is established for each tag by comparing its occurrence and frequency to a panel of reference SAGE libraries derived from a number of whole tissues. This method has been successfully used to identify preferential expression of "no match" SAGE tags that possibly correspond to unknown genes that may serve as specific markers for disease (42, 47).


    TRANSLATING EXPRESSION DATA INTO FUNCTIONAL DATA
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
Large-scale expression profiling technologies are often utilized to characterize genetic hallmarks of disease (i.e., cancer) or to uncover potential novel groups of genes that participate in a certain biological process. Typically, a few tens to hundreds of genes are differentially expressed between experimental conditions. But what is the biological significance of these transcriptional changes? The interpretation of large-scale gene expression data is one of the most challenging fields in genomics. An important issue determining correct interpretation of large-scale gene expression data is the reliability of the generated data. Although microarray and SAGE are equally sensitive, both technologies are limited in the detection of unstable transcripts and low abundant gene expression. Therefore, it is essential to determine an appropriate threshold that defines the reliable minimal detection limit of the large-scale gene expression profiling technology. To obtain more depth of analysis and improve the detection of low abundant transcripts using SAGE, one could increase the number of sequenced tags per library. Estimates of the number of tags required to cover the complete human transcriptome within a single cell population exceed 1.2 million tags (64). Single-molecule sequencing has enormously expanded the number of SAGE tags that can be sequenced, both in practical and cost terms (52).

Because of technological limitations of large-scale expression profiling methods, the most representative results are yielded from uniform, single-cell populations. A frequently encountered problem in expression profiling studies is the heterogeneity of the samples used. Gene expression variations in the cells of interest are often leveled due to the presence of other cell types. Particularly for complex heterogeneous samples, such as tissue biopsies of the airway epithelium, laser microdissection of the cells of interest is a valuable method to select only the cells of interest for subsequent profiling analysis (17).

To address the multiple testing problem and the associated detection of false positives that occur with large-scale expression profiling technologies, it is essential to verify microarray and SAGE data with other techniques such as quantitative real-time polymerase chain reaction (qPCR) or in situ hybridization. SAGE, microarray, and qPCR perform equally in estimating the expression levels of high abundant genes. For genes expressed at low levels, large-scale expression profiling methods and qPCR correctly estimate the directional change in expression. However, discrepancies in the magnitude of differential gene expression may exist between large-scale expression profiling methods and qPCR (18, 69).

The main purpose of data interpretation is to predict and establish coherence within the observed findings, ultimately leading to new hypotheses. To accomplish these goals, the Gene Ontology Consortium developed a unified annotation for genes and their functions consisting of three independent hierarchical treelike structures (biological process, molecular function, and cellular localization) called Gene Ontology (GO). Aside from information on well-known genes, the GO database also includes predicted information on novel and inferred genes. Matching genomics data to GO not only allows screening for known biological processes but also for previously unexplored biological processes in the model system of interest. Because graphical representations are more illustrative than lists of differentially expressed genes, initiatives have been undertaken also to visualize genomics data. The "Kyoto Encyclopedia of Genes and Genomes" (KEGG) and BioCarta are excellent examples of these initiatives. To facilitate the usage of the GO database with KEGG maps, Dahlquist et al. (16) have developed the Gene Microarray Pathway Profiler (GenMAPP) allowing automated mapping and visualization of genomics data. The recent advancements in the field of functional data analysis have been very successful in accelerating genomics data analysis.

Analyzed data should be interpreted with caution since the gene discovery rate still exceeds the speed by which researchers can experimentally assess gene function of newly identified genes. Consequently, the biological function(s) of a substantial proportion of genes in genetic databases have not yet been experimentally established. In addition, gene expression profiling reveals differences at the level of gene expression that are not necessarily reflected at the protein level. For instance, if proteins are constructed of different subunits that are encoded by multiple genes, follow-up research is warranted to establish the functional consequences of the transcriptional changes in one or more genes encoding the subunits. Nonetheless, genomics technology provides very efficient ways to screen for putative functional consequences in multimeric protein complexes encoded by multiple genes.


    APPLICATION OF SAGE AND MICROARRAY TO STUDY PATHOGEN-EPITHELIAL CELL INTERACTIONS
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
So far, we have discussed the potentials and limitations of commonly applied large-scale expression profiling methods, including their data analysis strategies. Here, the application of the SAGE and microarray analysis in host-pathogen research will be highlighted by discussing recent publications that applied these technologies. This part is divided into three sections: 1) application of SAGE and 2) microarray analysis to study the epithelial host defense response in the human airways, and 3) application of microarray analysis to investigate the processes that occur within the microorganism upon contact with airway epithelial cells. Large-scale gene expression profiling technology can be applied to prokaryotic model systems, but there are limitations. For instance, SAGE cannot be used since mRNAs in prokaryotes are not polyadenylated, a requirement for the SAGE technology. Microarray technology is the only platform that can be used to investigate transcriptional changes in prokaryotes at large scale. The first studies published have demonstrated that microarray analysis of the invading pathogen can be very instrumental in understanding the cross talk between the host and the pathogen.


    APPLICATION OF THE SAGE TECHNOLOGY TO STUDY CHANGES IN EPITHELIAL GENE EXPRESSION AFTER MICROBIAL EXPOSURE
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
SAGE has been applied to investigate diverse areas of immunology and host defense including the processes of differentiation and development of T cells (45), B cells (23), and natural killer cells (35). Furthermore, SAGE has been used to monitor the transcriptional changes upon human immunodeficiency virus infection in T cell lines (54) as well as LPS activation in monocyte-derived dendritic cells (29). The use of SAGE to specifically investigate host-pathogen interactions in the human airways is so far limited to our own study (68). In our search for novel epithelial host defense molecules, bronchial epithelial cells were exposed for 6 h to P. aeruginosa and a mixture of the proinflammatory cytokines IL-1beta and TNF-{alpha} (69). SAGE revealed that the expression of keratins, proteinase inhibitors, S100 calcium-binding proteins, and IL-1 family members was affected upon exposure to P. aeruginosa and the proinflammatory cytokines IL-1beta and TNF-{alpha}. The first three families of affected genes all contribute to cytoskeletal architecture. It was therefore suggested that bronchial epithelial cells specifically strengthen the primary physical barrier upon microbial exposure to cope with infection. However, the exact mode of action is unclear. The S100 calcium-binding proteins and the proteinase inhibitors secretory leukocyte proteinase inhibitor (SLPI) and elafin (SKALP/PI3) were among the differentially expressed genes with highest expression in both bronchial epithelial cells and keratinocytes. The protein complex formed by the S100 members A8 and A9 serves as antimicrobial agent, acts as chemoattractant for leukocytes, and enhances the transendothelial migration of these cells (44). Association of this protein complex with inflammatory disorders has been demonstrated in 1975 by Wilson et al. (71) who observed elevated serum levels of the S100A8/A9 complex in patients with cystic fibrosis (CF). The proteinase inhibitors SLPI and elafin not only inhibit protease activity but also exert antimicrobial activity (70). The fourth group of molecules that was identified comprised cytokines. In particular, several members of the IL-1 family were identified. Despite its unknown function, the identification of the novel IL-1 family member IL-1F9 as cytokine contributing to the microbial response might be of interest. Our comparative genomic analysis revealed that this cytokine might fulfill specific immune signaling functions in epithelial cells since its expression is highly restricted to these cells (68).

To further delineate the early host defense response in bronchial epithelial cells, we performed a comparative genomic analysis of our model and of a culture model of epithelial inflammation in keratinocytes (68). A genetic signature of host defense was identified, the expression of which was restricted to these cell types. In keratinocytes, the majority of these genes encode for proteins that are required for the assembly of the cornified envelope. This protein structure forms the physical barrier in skin. In analogy, this comparative genomic analysis revealed that bronchial epithelial cells may strengthen their physical barrier by the formation of an impermeable protein envelope. A number of the components of the protein envelope, including members of the S100 calcium-binding protein family and proteinase inhibitors, exert additional host defense functions. Therefore, the incorporation of these molecules in the protein envelope appears to serve multiple purposes.

In summary, the use of SAGE showed that bronchial epithelial cells respond to microbial exposure by strengthening the physical barrier and by releasing cytokines to alert the immune system.


    APPLICATION OF DNA MICROARRAYS TO STUDY EPITHELIAL GENE EXPRESSION AFTER MICROBIAL EXPOSURE
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
The use of microarray technology has been more widespread than SAGE in studying host-pathogen interactions as judged by the number of publications. Studies on P. aeruginosa are of special interest because both the transcriptional changes in the host and the pathogen have been assessed using the microarray technology. Ichikawa et al. (31, 39) were the first to demonstrate differential gene expression in the lung epithelial cell line A549 upon 1–3 h of exposure to P. aeruginosa. Although a relatively small number of 1,506 probes was assessed using a cDNA microarray, this study illustrated the usefulness of microarrays in this field of research. Differential gene expression of the transcription factor interferon regulatory factor 1 (IRF-1) was reported, which was independent of LPS or its signaling pathways. It was demonstrated that microbial adherence is crucial for the increased expression of IRF-1 since a nonadherent P. aeruginosa strain was unable to enhance IRF transcription. This strain carries a mutation in the pilA gene that encodes for the major subunit of the type IV pili that normally serves as adhesin to epithelial cells.

The interaction between cells of the bronchial epithelial cell line BEAS-2B and B. pertussis has been studied using oligonucleotide arrays. This microorganism is the causative pathogen for whooping cough in humans (7). Differential gene expression was assessed after 3 h of exposure to B. pertussis. Transcriptional changes were observed in proinflammatory genes as characterized by the increased expression of cytokines and chemokines such as IL-8 (CXCL8), CCL2/MCP-1, CXCL1/Gro{alpha}, and CXCL2/Grobeta. Enhanced expression of the chemokine IL-8 is considered one of the hallmarks of the epithelial response to P. aeruginosa. Also, other studies showed regulation of the epithelial expression of these chemokines after microbial exposure, including our SAGE analysis.

Notably, epithelial cells require at least 3–6 h of microbial exposure before the transcriptional response can be measured robustly (31, 41, 69). This observation was also demonstrated in the CF bronchial epithelial cell line IB3–1 upon 3 h of exposure to P. aeruginosa (67). Compared with their mutation-corrected counterparts, lower expression of proteinase inhibitors and increased expression of cytokines such as IL-6 and IL-8 was observed in the CF cell line.

With the use of both oligonucleotide and cDNA microarrays, the transcriptional response of a submucosal tracheal gland epithelial cell line to S. aureus was assessed (41). S. aureus is one of the first pathogens to colonize the airways of CF patients. Particularly, the supernatants of this microorganism caused a robust transcriptional response in MM-39 cells, with increased gene expression of members of the JAK/STAT and NK-{kappa}B and AP-1 pathways after 3 h of exposure to S. aureus supernatants. These pathways lead to downstream transcription of the proinflammatory cytokines IL-1{alpha}, IL-1beta, and IL-6 and the chemokine IL-8.

Collectively, these microarray data suggest that soluble virulence factors released by microorganisms contribute to the induction of an epithelial host defense response. All discussed reports observed transcriptional regulation of members of these cytokine and chemokine families, including their associated signaling pathways. The microarray studies revealed that, in particular, the cytokine expression is enhanced in the early phase of microbial infection of airway epithelial cells.

Sensing microbial exposure by bronchial epithelial cells is in part mediated through pattern recognition receptors such as Toll-like receptor (TLR)2, TLR3, TLR4, TLR5, and TLR9. In the studies discussed, the focus was on TLR2 and TLR5. The ligands for TLR2 appear to be, among others, lipoproteins, whereas flagellin is a major ligand for TLR5. Recent studies suggest that lipoprotein I of P. aeruginosa is a TLR2/4 ligand (51). Antibodies against TLR2 can attenuate the action of lipoproteins. Enhanced expression of TLR2 and TLR5 or increased activity of the downstream signaling cascades in CF epithelial cells may possibly account for the exaggerated inflammatory response in these cells. Flagellin is a structural component of the bacterial flagellum that provides motility to the microorganism and appears to be essential in infection. The activating potential of flagellin, which is secreted by bacteria, on bronchial epithelial cells has been demonstrated in several investigations (1, 14) and leads to increased expression of various mediators, including the antimicrobial peptide human beta-defensin-2 (hBD-2; Ref. 14). Despite the eminent function of specialized antimicrobial peptides in the epithelial innate immune system, they do not appear to be highly expressed in the early epithelial host defense response. The reason for this is unknown but could possibly be explained in part by the location of the sensing TLR5 at the basolateral membrane of epithelial cells (26). In time-course experiments using live P. aeruginosa, presence of the microorganism at the basolateral membrane was not demonstrated earlier than 12 h postinfection. This implies that TLR5 activation occurs later in the infection process, providing an explanation for the delayed response of epithelial cells to produce human beta-defensins. However, it needs to be noted that a recent study demonstrated also the apical localization of TLR5 on tracheal epithelial cells (73), and, therefore, localization of TLR5 may not be the sole explanation for the limited expression of hBD-2 early after microbial exposure.

Hypothetically, the epithelial host defense response may therefore consist of two phases (Fig. 2). During the first phase when microbes attach to the apical surface of epithelial cells, the physical barrier is strengthened and expression of signaling molecules such as cytokines and chemokines is increased. In the proceeding phase when microorganisms migrate to the basolateral epithelial membrane, production of specialized antimicrobial agents may occur.


Figure 2
View larger version (17K):
[in this window]
[in a new window]

 
Fig. 2. Phases in gene expression in bronchial epithelial cells upon microbial contact. The early transcriptional response in epithelial cells upon microbial contact is characterized by expression of cytokines, chemokines, and proteins that are involved in strengthening the physical barrier. Expression of specialized antimicrobial peptides may occur at later stages during infection, possibly in part because of the localization of pattern recognition receptors that lead to the transcription of these peptides. See text for details.

 

    APPLICATION OF DNA MICROARRAYS TO STUDY MICROBIAL GENE EXPRESSION AFTER EXPOSURE TO EPITHELIAL CELLS
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
To survive in the lung, pathogens have to deal with the defense mechanisms of the host such as the release of antimicrobial peptides and the oxidative burst utilized by phagocytes. How pathogens accomplish their survival under these circumstances is largely unknown. As discussed, the microarray technology is the only high-throughput method available to assess microbial gene expression at large scale. For P. aeruginosa, the first oligonucleotide array was launched shortly after the completion of sequencing the genome of the P. aeruginosa strain O1 (PAO1) laboratory strain (59). The PAO1 microarray has been used to uncover regulatory networks in P. aeruginosa upon varying environmental conditions (reviewed in Ref. 27). A limited number of investigations have specifically focused on the interactions of this microorganism with host epithelial cells. It is known that virulence factors are crucial for the ability of P. aeruginosa to infect the host. The conversion of the nonmucoid to the alginate-overproducing mucoid form seems to be essential for P. aeruginosa to cause chronic colonization of the airways in patients with CF (8). Although alginate is immunologically inert, other virulence factors associated with the conversion to mucoidy may cause host tissue destruction. Bacterial virulence factors often act as double-edged swords: on the one hand they damage host tissue to promote bacterial adherence and survival, whereas on the other hand they activate the defense system of the host (14). Frisk et al. (25) and Firoved et al. (24) were the first to profile gene expression in P. aeruginosa upon contact with human airway epithelial cells. Frisk et al. evaluated 4- and 12-h interactions of primary normal human airway epithelial cells with the nonmucoid P. aeruginosa laboratory strain PAO1. During the course of infection, P. aeruginosa migrated from the apical membrane to the basolateral membrane. Global expression profiling revealed the activation of phosphate and repression of iron acquisition genes, indicating that P. aeruginosa may be able to acquire sufficient quantities of iron from host cells but not phosphate for growth. Alternatively, the repression of iron acquisition genes has also been associated with the antioxidant response of P. aeruginosa upon oxidative stress (12).

Using in vitro models for CF, Firoved et al. (24) found that specifically the expression of bacterial lipoproteins is strongly induced in mucoid P. aeruginosa upon contact with human epithelial cells. Because of their toxicity, these molecules are referred to as lipotoxins. Synthetic lipopeptides resembling the NH2-terminal parts of these mature lipoproteins caused the activation of NF-{kappa}B through TLR2, a pattern recognition receptor of the innate host defense system. Mature lipoproteins were able to induce IL-8 production in normal bronchial epithelial cells. This induction could be partially suppressed by antibodies against TLR2 (24). In addition to lipoproteins, P. aeruginosa flagellin has been identified as the virulence factor that specifically induced the epithelial host defense response through TLR5 (14) and in combination with TLR2 (1). Flagellin is a structural component of bacterial flagella necessary for normal growth and infection. By modulating the expression of flagellin, P. aeruginosa may be able to enhance colonization in, for instance, patients with CF (72). Whereas these studies focused on the role of TLR2 and TLR5, other studies have shown that a range of TLRs are involved in recognizing and responding to bacterial exposure. These include TLR4, which recognizes LPS, and TLR9, which is involved in the recognition of bacterial DNA (3).

In summary, during the course of infection P. aeruginosa increases its expression of phosphate acquisition genes, lipotoxins, and DNA repair genes and decreases the expression of iron acquisition genes. Increased expression of phosphate acquisition genes might play a role in safeguarding the nutritional requirements. Changes in expression of genes related to iron metabolism and DNA repair are associated with the antioxidant response of P. aeruginosa to oxidative stress. The excessive induction of lipotoxins upon contact seems to be due to yet unknown properties of CF cells. These molecules are potent agonists of TLR2 and may account for the excessive induction of cytokine gene expression and the enhanced inflammatory response.


    CONCLUSION
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
In this review, we have provided an overview of commonly used large-scale gene expression profiling methods and discussed how these have been used in host-pathogen research focused on the interaction between respiratory pathogens and lung epithelial cells. Since both the SAGE and microarray technology enable the profiling of thousands of genes at once, a frequently asked question is which technique is the best to choose. The answer is simple: neither of the techniques is the best. The selection of a high-throughput gene expression profiling technique highly depends on the research question and practical considerations such as sample number. The elegance of the SAGE technology is its ability of gene discovery and ease of comparative research. On the other hand, the benefits of the microarray analysis include the ability to profile prokaryotic gene expression and the less labor-intensive nature of the technology compared with SAGE.

One of the advantages of the application of large-scale gene expression profiling methods is that it allows the discovery of previously unknown associations at the level of gene expression. Application of these methods has increased our understanding of the dynamic interplay between the host and the pathogen during the course of infection (Fig. 3). Particularly, large-scale gene expression profiling revealed insights into the sequential events that underlie the inflammatory epithelial host defense response. In addition, the combined investigations provided novel mechanistic insights into the interplay during the course of infection in both the host and the pathogen. At the initial stage of infection, bronchial epithelial cells predominantly respond by increasing their cytokine production to alert the immune system and by strengthening the physical barrier. The transcriptional changes in the pathogen are dominated by differential expression of genes involved in the response to oxidative stress. These include the repression of iron acquisition genes and increased expression of lipoproteins and phosphate acquisition genes. Lipoprotein expression by P. aeruginosa was particularly altered upon contact with CF epithelial cells.


Figure 3
View larger version (19K):
[in this window]
[in a new window]

 
Fig. 3. Transcriptional changes in Pseudomonas aeruginosa and epithelial cells upon interaction: novel insights from gene expression profiling studies. The transcriptional changes in P. aeruginosa are characterized by increased expression of phosphate acquisition genes and lipoproteins and decreased expression of iron acquisition genes. Both lipotoxins and flagellin activate pattern recognition receptors present on epithelial cells. Activation of these receptors leads to increased expression of cytokines, chemokines, proteinase inhibitors, and components necessary to strengthen the physical barrier.

 
Although large-scale gene expression profiling methods are highly efficient in data generation, the analysis of expression profiles remains challenging and time consuming. Interpretation of large-scale gene expression data is the main challenge in the postgenomics era. Due to a lack of knowledge on data analysis methods and resources, researchers often tend to focus on those genes known to be involved in the biological process they are investigating. Continuing developments in the field of bioinformatics are expected to provide easy-to-use analysis methods that are helpful for nonbioinformaticians to analyze large-scale gene expression data. Notwithstanding the limitations of functional data interpretation, large-scale gene expression profiling has great promise to increase our understanding of and to uncover novel insights in the cross talk between the host and the pathogen in health and disease.


    GRANTS
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 
This work was financially supported by Netherlands Organization for Scientific Research (NWO) Program Grant 902-11-092.


    FOOTNOTES
 

Address for reprint requests and other correspondence: P. S. Hiemstra, Dept. of Pulmonology, Leiden Univ. Medical Center, PO Box 9600, 2300 RC Leiden, TheNetherlands (e-mail: p.s.hiemstra{at}lumc.nl)


    REFERENCES
 TOP
 ABSTRACT
 SERIAL ANALYSIS OF GENE...
 DNA MICROARRAYS
 DATA MINING AND INTERPRETATION
 ANNOTATION
 MATHEMATICAL CLUSTERING
 FUNCTIONAL MATHEMATICAL...
 TRANSLATING EXPRESSION DATA INTO...
 APPLICATION OF SAGE AND...
 APPLICATION OF THE SAGE...
 APPLICATION OF DNA MICROARRAYS...
 APPLICATION OF DNA MICROARRAYS...
 CONCLUSION
 GRANTS
 REFERENCES
 

  1. Adamo R, Sokol S, Soong G, Gomez MI, Prince A. Pseudomonas aeruginosa flagella activate airway epithelial cells through asialoGM1 and toll-like receptor 2 as well as toll-like receptor 5. Am J Respir Cell Mol Biol 30: 627–634, 2004.[Abstract/Free Full Text]
  2. Adams MD, Soares MB, Kerlavage AR, Fields C, Venter JC. Rapid cDNA sequencing (expressed sequence tags) from a directionally cloned human infant brain cDNA library. Nat Genet 4: 373–380, 1993.[CrossRef][ISI][Medline]
  3. Akira S, Uematsu S, Takeuchi O. Pathogen recognition and innate immunity. Cell 124: 783–801, 2006.[CrossRef][ISI][Medline]
  4. Alwine JC, Kemp DJ, Stark GR. Method for detection of specific RNAs in agarose gels by transfer to diazobenzyloxymethyl-paper and hybridization with DNA probes. Proc Natl Acad Sci USA 74: 5350–5354, 1977.[Abstract/Free Full Text]
  5. Bals R, Hiemstra PS. Innate immunity in the lung: how epithelial cells fight against respiratory pathogens. Eur Respir J 23: 327–333, 2004.[Abstract/Free Full Text]
  6. Barrett JC, Kawasaki ES. Microarrays: the use of oligonucleotides and cDNA for the analysis of gene expression. Drug Discov Today 8: 134–141, 2003.[CrossRef][ISI][Medline]
  7. Belcher CE, Drenkow J, Kehoe B, Gingeras TR, McNamara N, Lemjabbar H, Basbaum C, Relman DA. The transcriptional responses of respiratory epithelial cells to Bordetella pertussis reveal host defensive and pathogen counter-defensive strategies. Proc Natl Acad Sci USA 97: 13847–13852, 2000.[Abstract/Free Full Text]
  8. Bonfield TL, Panuska JR, Konstan MW, Hilliard KA, Hilliard JB, Ghnaim H, Berger M. Inflammatory cytokines in cystic fibrosis lungs. Am J Respir Crit Care Med 152: 2111–2118, 1995.[Abstract]
  9. Boon K, Osorio EC, Greenhut SF, Schaefer CF, Shoemaker J, Polyak K, Morin PJ, Buetow KH, Strausberg RL, De Souza SJ, Riggins GJ. An anatomy of normal and malignant gene expression. Proc Natl Acad Sci USA 99: 11287–11292, 2002.[Abstract/Free Full Text]
  10. Brazma A, Hingamp P, Quackenbush J, Sherlock G, Spellman P, Stoeckert C, Aach J, Ansorge W, Ball CA, Causton HC, Gaasterland T, Glenisson P, Holstege FC, Kim IF, Markowitz V, Matese JC, Parkinson H, Robinson A, Sarkans U, Schulze-Kremer S, Stewart J, Taylor R, Vilo J, Vingron M. Minimum information about a microarray experiment (MIAME)-toward standards for microarray data. Nat Genet 29: 365–371, 2001.[CrossRef][ISI][Medline]
  11. Brenner S, Johnson M, Bridgham J, Golda G, Lloyd DH, Johnson D, Luo S, McCurdy S, Foy M, Ewan M, Roth R, George D, Eletr S, Albrecht G, Vermaas E, Williams SR, Moon K, Burcham T, Pallas M, DuBridge RB, Kirchner J, Fearon K, Mao J, Corcoran K. Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays. Nat Biotechnol 18: 630–634, 2000.[CrossRef][ISI][Medline]
  12. Chang W, Small DA, Toghrol F, Bentley WE. Microarray analysis of Pseudomonas aeruginosa reveals induction of pyocin genes in response to hydrogen peroxide. BMC Genomics 6: 115, 2005.[CrossRef][Medline]
  13. Chen J, Sadowski I. Identification of the mismatch repair genes PMS2 and MLH1 as p53 target genes by using serial analysis of binding elements. Proc Natl Acad Sci USA 102: 4813–4818, 2005.[Abstract/Free Full Text]
  14. Cobb LM, Mychaleckyj JC, Wozniak DJ, Lopez-Boado YS. Pseudomonas aeruginosa flagellin and alginate elicit very distinct gene expression patterns in airway epithelial cells: implications for cystic fibrosis disease. J Immunol 173: 5659–5670, 2004.[Abstract/Free Full Text]
  15. Cookson W. The immunogenetics of asthma and eczema: a new focus on the epithelium. Nat Rev Immunol 4: 978–988, 2004.[CrossRef][ISI][Medline]
  16. Dahlquist KD, Salomonis N, Vranizan K, Lawlor SC, Conklin BR. GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways. Nat Genet 31: 19–20, 2002.[CrossRef][ISI][Medline]
  17. Datson NA, Meijer L, Steenbergen PJ, Morsink MC, van der LS, M