198:, can also be applied. Given the number of distance measures available and their influence in the clustering algorithm results, several studies have compared and evaluated different distance measures for the clustering of microarray data, considering their intrinsic properties and robustness to noise. After calculation of the initial distance matrix, the hierarchical clustering algorithm either (A) joins iteratively the two closest clusters starting from single data points (agglomerative, bottom-up approach, which is fairly more commonly used), or (B) partitions clusters iteratively starting from the complete set (divisive, top-down approach). After each step, a new distance matrix between the newly formed clusters and the other clusters is recalculated. Hierarchical cluster analysis methods include:
65:
52:
17:
144:
observation, since the point of performing experiments has to do with predicting general behavior. The MAQC group recommends using a fold change assessment plus a non-stringent p-value cutoff, further pointing out that changes in the background correction and scaling process have only a minimal impact on the rank order of fold change differences, but a substantial impact on p-values.
307:. Protein complex enrichment analysis tool (COMPLEAT) provides similar enrichment analysis at the level of protein complexes. The tool can identify the dynamic protein complex regulation under different condition or time points. Related system, PAINT and SCOPE performs a statistical analysis on gene promoter regions, identifying over and under representation of previously identified
279:
329:
119:
971:(t) are specified to guarantee genes called significant change at least a pre-specified amount. This means that the absolute value of the average expression levels of a gene under each of two conditions must be greater than the fold change (t) to be called positive and less than the inverse of the fold change (t) to be called negative.
964:
39: – in a single experiment. Such experiments can generate very large amounts of data, allowing researchers to assess the overall state of a cell or organism. Data in such large quantities is difficult – if not impossible – to analyze without the help of computer programs.
103:
Raw Affy data contains about twenty probes for the same RNA target. Half of these are "mismatch spots", which do not precisely match the target sequence. These can theoretically measure the amount of nonspecific binding for a given target. Robust Multi-array
Average (RMA) is a normalization approach
311:
response elements. Another statistical analysis tool is Rank Sum
Statistics for Gene Set Collections (RssGsc), which uses rank sum probability distribution functions to find gene sets that explain experimental data. A further approach is contextual meta-analysis, i.e. finding out how a gene cluster
129:
for Robust
Microarray Summarization (FARMS) is a model-based technique for summarizing array data at perfect match probe level. It is based on a factor analysis model for which a Bayesian maximum a posteriori method optimizes the model parameters under the assumption of Gaussian measurement noise.
47:
Microarray data analysis is the final step in reading and processing data produced by a microarray chip. Samples undergo various processes including purification and scanning using the microchip, which then produces a large amount of data that requires processing via computer software. It involves
1086:
Visual identification of local artifacts, such as printing or washing defects, may likewise suggest the removal of individual spots. This can take a substantial amount of time depending on the quality of array manufacture. In addition, some procedures call for the elimination of all spots with an
1069:
Depending on the type of array, signal related to nonspecific binding of the fluorophore can be subtracted to achieve better results. One approach involves subtracting the average signal intensity of the area between spots. A variety of tools for background correction and further analysis are
143:
or other mechanisms that take both effect size and variability into account. Curiously, the p-values associated with particular genes do not reproduce well between replicate experiments, and lists generated by straight fold change perform much better. This represents an extremely important
138:
Many strategies exist to identify array probes that show an unusual level of over-expression or under-expression. The simplest one is to call "significant" any probe that differs by an average of at least twofold between treatment groups. More sophisticated approaches are often related to
48:
several distinct steps, as outlined in the image below. Changing any one of the steps will change the outcome of the analysis, so the MAQC Project was created to identify a set of standard strategies. Companies exist that use the MAQC protocols to perform a complete analysis.
91:
Comparing two different arrays or two different samples hybridized to the same array generally involves making adjustments for systematic errors introduced by differences in procedures and dye intensity effects. Dye normalization for two color arrays is often achieved by
286:
Specialized software tools for statistical analysis to determine the extent of over- or under-expression of a gene in a microarray experiment relative to a reference state have also been developed to aid in identifying genes or gene sets associated with particular
590:
464:
Blocks are batches of microarrays; for example for eight samples split into two groups (control and affected) there are 4!=24 permutations for each block and the total number of permutations is (24)(24)= 576. A minimum of 1000 permutations are
451:
SAM is run as an Excel Add-In, and the SAM Plot
Controller allows Customization of the False Discovery Rate and Delta, while the SAM Plot and SAM Output functionality generate a List of Significant Genes, Delta Table, and Assessment of Sample
358:, it is now possible to measure the expression of thousands of genes in a single hybridization experiment. The data generated is considerable, and a method for sorting out what is significant and what isn't is essential. SAM is distributed by
1060:
Entire arrays may have obvious flaws detectable by visual inspection, pairwise comparisons to arrays in the same experimental group, or by analysis of RNA degradation. Results may improve by removing these arrays from the analysis entirely.
299:-style statistic to identify groups of genes that are regulated together. This third-party statistics package offers the user information on the genes or gene sets of interest, including links to entries in databases such as NCBI's
545:
SAM calculates a test statistic for relative difference in gene expression based on permutation analysis of expression data and calculates a false discovery rate. The principal calculations of the program are illustrated below.
258:
Commercial systems for gene network analysis such as
Ingenuity and Pathway studio create visual representations of differentially expressed genes based on current scientific literature. Non-commercial tools such as FunRich,
430:
Adjust the Delta tuning parameter to get a significant # of genes along with an acceptable false discovery rate (FDR)) and Assess Sample Size by calculating the mean difference in expression in the SAM Plot
96:. LIMMA provides a set of tools for background correction and scaling, as well as an option to average on-slide duplicate spots. A common method for evaluating how well normalized an array is, is to plot an
216:
Different studies have already shown empirically that the Single linkage clustering algorithm produces poor results when employed to gene expression microarray data and thus should be avoided.
1705:
Jaskowiak, Pablo A.; Campello, Ricardo J.G.B.; Costa, Ivan G. (2013). "Proximity
Measures for Clustering Gene Expression Microarray Data: A Validation Methodology and a Comparative Analysis".
396:
of the data are used to determine if the expression of any gene is significant related to the response. The use of permutation-based analysis accounts for correlations in genes and avoids
83:, provide commercial data analysis software alongside their microarray products. There are also open source options that utilize a variety of methods for analyzing microarray data.
959:{\displaystyle \mathrm {False\ discovery\ rate\ (FDR)={\frac {Median\ (or\ 90^{th}\ percentile)\ of\ \#\ of\ falsely\ called\ genes}{Number\ of\ genes\ called\ significant}}} }
267:
also aid in organizing and visualizing gene network data procured from one or several microarray experiments. A wide variety of microarray analysis tools are available through
551:
556:
316:
is a public tool to perform contextual meta-analysis across contexts such as anatomical parts, stages of development, and response to diseases, chemicals, stresses, and
987:
Call each gene significant if the absolute value of the test statistic for that gene minus the mean test statistic for that gene is greater than a stated threshold
2177:"Integration of statistical inference methods and a novel control measure to improve sensitivity and specificity of data analysis in expression profiling studies"
536:— no explicit response parameter is specified; the user specifies eigengene (principal component) of the expression data and treats it as a quantitative response
1075:
2232:<Zhang, S. (2007). "A comprehensive evaluation of SAM, the SAM R-package and a simple modification to improve its performance." BMC Bioinformatics 8: 230.
238:. Thus the purpose of K-means clustering is to classify data based on similar expression. K-means clustering algorithm and some of its variants (including
115:
The current
Affymetrix MAS5 algorithm, which uses both perfect match and mismatch probes, continues to enjoy popularity and do well in head to head tests.
2543:
275:. The frequently cited SAM module and other microarray tools are available through Stanford University. Another set is available from Harvard and MIT.
242:) have been shown to produce good results for gene expression data (at least better than hierarchical clustering methods). Empirical comparisons of
183:
35:, which allow researchers to investigate the expression state of a large number of genes – in many cases, an organism's entire
1071:
2609:
1637:
Guo L, Lobenhofer EK, Wang C, et al. (2006). "Rat toxicogenomic study reveals analytical consistency across microarray platforms".
2146:
Chu, G., Narasimhan, B, Tibshirani, R, Tusher, V. "SAM "Significance
Analysis of Microarrays" Users Guide and technical document."
187:
68:
130:
According to the
Affycomp benchmark FARMS outperformed all other summarizations methods with respect to sensitivity and specificity.
1896:
1227:
2036:
1590:"The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements"
1689:
112:, also part of RMA, is one sensible approach to normalize a batch of arrays in order to make further comparisons meaningful.
1101:
504:— measurement units are different in the two groups; e.g. control and treatment groups with samples from different patients
234:
groups. Grouping is done by minimizing the sum of the squares of distances between the data and the corresponding cluster
530:— each experimental units is measured at more than one time point; experimental units fall into a one or two class design
175:
2309:
Dinu, I. P.; JD; Mueller, T; Liu, Q; Adewale, AJ; Jhangri, GS; Einecke, G; Famulski, KS; Halloran, P; Yasui, Y. (2007).
510:— same experimental units are measured in the two groups; e.g. samples before and after treatment from the same patients
2640:
384:, which measures the strength of the relationship between gene expression and a response variable. This analysis uses
2620:
1807:
de Souto, Marcilio C. P.; Costa, Ivan G.; de Araujo, Daniel S. A.; Ludermir, Teresa B.; Schliep, Alexander (2008).
1003:
Positive gene set — higher expression of most genes in the gene set correlates with higher values of the phenotype
518:— more than two groups with each containing different experimental units; generalization of two class unpaired type
1009:
Negative gene set — lower expression of most genes in the gene set correlates with higher values of the phenotype
392:. The response variable describes and groups the data based on experimental conditions. In this method, repeated
108:. The median polish algorithm, although robust, behaves differently depending on the number of samples analyzed.
1329:
264:
2061:
2362:"Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data"
1399:"A comparison of normalization methods for high density oligonucleotide array data based on variance and bias"
397:
2614:
2547:
1491:"Comparative analysis of microarray normalization procedures: effects on reverse engineering gene networks"
1136:"Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles"
400:
assumptions about the distribution of individual genes. This is an advantage over other techniques (e.g.,
104:
that does not take advantage of these mismatch spots but still must summarize the perfect matches through
1353:
2571:
471:
the number of permutations is set by the user when imputing correct values for the data set to run SAM
64:
2604:
2594:
385:
20:
Example of an approximately 40,000 probe spotted oligo microarray with enlarged inset to show detail.
2040:
2572:
ArrayExplorer - Compare microarray side by side to find the one that best suits your research needs
1186:
363:
1358:"Exploration, normalization, and summaries of high density oligonucleotide array probe level data"
169:
153:
1043:
Can work with blocked design for when treatments are applied within different batches of arrays
272:
109:
152:
Clustering is a data mining technique used to group genes having similar expression patterns.
2635:
2263:
1096:
308:
2523:
1900:
421:
experiments — DNA microarray with oligo and cDNA primers, SNP arrays, protein arrays, etc.
8:
1231:
389:
370:
359:
292:
100:
of the data. MA plots can be produced using programs and languages such as R and MATLAB.
2267:
1356:; Hobbs, B; Collin, F; Beazer-Barclay, YD; Antonellis, KJ; Scherf, U; Speed, TP (2003).
250:, hierarchical methods and, different distance measures can be found in the literature.
2439:
2412:
2388:
2361:
2337:
2310:
2013:
1989:"Protein Complex-Based Analysis Framework for High-Throughput Data Sets. 6, rs5 (2013)"
1835:
1808:
1781:
1754:
1730:
1662:
1614:
1589:
1466:
1439:
1306:
1273:
1162:
1135:
296:
230:
K-means clustering is an algorithm for grouping genes or samples based on pattern into
225:
195:
157:
1415:
1398:
1184:
1024:
Data from Oligo or cDNA arrays, SNP array, protein arrays, etc. can be utilized in SAM
282:
Example of FunRich tool output. Image shows the result of comparing 4 different genes.
2600:
2576:
2485:
2464:"Simpleaffy: a BioConductor package for Affymetrix Quality Control and data analysis"
2444:
2393:
2342:
2291:
2286:
2248:
2198:
2018:
1840:
1786:
1722:
1685:
1654:
1619:
1553:
1512:
1471:
1420:
1379:
1311:
1293:
1207:
1167:
343:
2480:
2463:
2413:"Considerations when using the significance analysis of microarrays (SAM) algorithm"
1548:
1531:
1507:
1490:
1374:
1357:
2475:
2434:
2424:
2383:
2373:
2332:
2322:
2281:
2271:
2188:
2008:
2000:
1830:
1820:
1776:
1766:
1714:
1666:
1646:
1609:
1601:
1543:
1502:
1461:
1451:
1410:
1369:
1301:
1285:
1157:
1147:
191:
93:
1734:
2249:"Significance analysis of microarrays applied to the ionizing radiation response"
2147:
1330:"Create intensity versus ratio scatter plot of microarray data - MATLAB mairplot"
1106:
351:
179:
126:
2503:
178:
clusters. Hierarchical clustering consists of two separate phases. Initially, a
1947:
1771:
1755:"On the selection of appropriate distances for gene expression data clustering"
434:
List
Differentially Expressed Genes (Positively and Negatively Expressed Genes)
355:
313:
2193:
2176:
2004:
1440:"Algorithm-driven Artifacts in median polish summarization of Microarray data"
2629:
1456:
1297:
369:
SAM identifies statistically significant genes by carrying out gene specific
304:
105:
2429:
2378:
2327:
1825:
1152:
1033:
Uses data permutation to estimates False Discovery Rate for multiple testing
448:
for academic and non-academic users after completion of a registration step.
51:
2489:
2448:
2397:
2346:
2295:
2276:
2202:
2022:
1844:
1790:
1726:
1682:
Bioinformatics and computational biology solutions using R and Bioconductor
1658:
1623:
1557:
1516:
1475:
1424:
1383:
1352:
1315:
1289:
1171:
990:
Estimate the false discovery rate based on expected versus observed values
455:
393:
268:
1272:
Gatto, Laurent; Breckels, Lisa M.; Naake, Thomas; Gibb, Sebastian (2015).
445:
2577:
FARMS - Factor Analysis for Robust Microarray Summarization, an R package
347:
2589:
ArrayMining.net - web-application for online analysis of microarray data
1718:
1036:
Reports local false discovery rate (the FDR for genes having a similar d
133:
1572:"Affycomp III: A Benchmark for Affymetrix GeneChip Expression Measures"
1111:
418:
405:
339:
182:
containing all the pairwise distances between the genes is calculated.
174:
Hierarchical clustering is a statistical method for finding relatively
76:
32:
2106:
317:
288:
247:
239:
27:
are used in interpreting the data generated from experiments on DNA (
2065:
1650:
1605:
1988:
1252:
235:
190:
are often used as dissimilarity estimates, but other methods, like
1046:
Can adjust threshold determining number of gene called significant
981:
For each permutation compute the ordered null (unaffected) scores
2086:
1753:
Jaskowiak, Pablo A; Campello, Ricardo JGB; Costa, Ivan G (2014).
1707:
IEEE/ACM Transactions on Computational Biology and Bioinformatics
984:
Plot the ordered test statistic against the expected null scores
300:
260:
243:
97:
80:
1858:
568:
constant is chosen to minimize the coefficient of variation of
140:
36:
1806:
524:— data of a time until an event (for example death or relapse)
16:
2588:
1987:
Vinayagam A, Hu Y, Kulkarni M, Roesel C, et al. (2013).
1809:"Clustering cancer gene expression data: a comparative study"
1571:
408:), which assume equal variance and/or independence of genes.
401:
323:
206:
1876:
1532:"A new summarization method for affymetrix probe level data"
1185:
Dr. Leming Shi, National Center for Toxicological Research.
1967:
1396:
1274:"Visualization of proteomics data using R and Bioconductor"
2311:"Improving gene set analysis of microarray data by SAM-GS"
1921:
1529:
550:
492:— tests whether the mean gene expression differs from zero
278:
122:
Flowchart showing how the MAS5 algorithm by Agilent works.
2582:
1986:
2615:
GeneChip® Expression Analysis-Data Analysis Fundamentals
1134:
Subramanian A, Tamayo P, Mootha VK, et al. (2005).
555:
424:
Input Expression Analysis in Microsoft Excel — see below
1488:
1437:
1133:
1050:
118:
1752:
1704:
1271:
1087:
expression value below a certain intensity threshold.
328:
1397:
Bolstad BM, Irizarry RA, Astrand M, Speed TP (2003).
593:
134:
Identification of significant differential expression
212:
Complete linkage (maximum method, furthest neighbor)
2247:Tusher, V. G.; Tibshirani, R.; et al. (2001).
160:are widely used techniques in microarray analysis.
1253:"LIMMA Library: Linear Models for Microarray Data"
958:
354:are statistically significant. With the advent of
1636:
1027:Correlates expression data to clinical parameters
202:Single linkage (minimum method, nearest neighbor)
2627:
2583:StatsArray - Online Microarray Analysis Services
2359:
2308:
2246:
1748:
1746:
1744:
1438:Giorgi FM, Bolger AM, Lohse M, Usadel B (2010).
312:responds to a variety of experimental contexts.
86:
2256:Proceedings of the National Academy of Sciences
1489:Lim WK, Wang K, Lefebvre C, Califano A (2007).
581:is equal to the expression levels (x) for gene
2595:FunRich - Perform gene set enrichment analysis
2410:
1588:Shi L, Reid LH, Jones WD, et al. (2006).
1530:Hochreiter S, Clevert DA, Obermayer K (2006).
1741:
1698:
1684:. New York: Springer Science+Business Media.
1208:"GenUs BioSystems - Services - Data Analysis"
978:Order test statistics according to magnitude
458:are calculated based on the number of samples
55:The steps required in a microarray experiment
2461:
1802:
1800:
1587:
2107:"SAM: Significance Analysis of Microarrays"
1187:"MicroArray Quality Control (MAQC) Project"
303:and curated databases such as Biocarta and
342:, established in 2001 by Virginia Tusher,
336:Significance analysis of microarrays (SAM)
324:Significance analysis of microarrays (SAM)
163:
69:National Center for Toxicological Research
2479:
2438:
2428:
2387:
2377:
2360:Jeffery, I. H.; DG; Culhane, AC. (2006).
2336:
2326:
2285:
2275:
2192:
2012:
1922:"FunRich: Functional Enrichment Analysis"
1834:
1824:
1797:
1780:
1770:
1679:
1673:
1613:
1547:
1506:
1465:
1455:
1414:
1373:
1305:
1161:
1151:
1064:
444:SAM is available for download online at
327:
291:. One such method of analysis, known as
277:
117:
63:
50:
15:
2504:"J. Craig Venter Institute -- Software"
2411:Larsson, O. W. C; Timmons, JA. (2005).
2175:Zang, S.; Guo, R.; et al. (2007).
2174:
1680:Gentleman, Robert; et al. (2005).
446:http://www-stat.stanford.edu/~tibs/SAM/
75:Most microarray manufacturers, such as
2628:
2621:Duke data_analysis_fundamentals_manual
1968:"BioCarta - Charting Pathways of Life"
253:
2242:
2240:
2238:
2228:
2226:
2224:
2222:
2220:
2218:
2216:
2214:
2212:
2170:
2168:
2166:
2164:
2162:
2160:
2158:
2156:
2154:
2142:
2140:
2138:
2136:
2134:
2132:
2130:
2128:
2126:
1583:
1581:
350:, for determining whether changes in
219:
2601:Comparative Transcriptomics Analysis
2544:"Ocimum Biosolutions | Genowiz"
1129:
1127:
1102:Significance analysis of microarrays
1051:Error correction and quality control
1030:Correlates expression data with time
974:The SAM algorithm can be stated as:
427:Run SAM as a Microsoft Excel Add-Ins
1189:. U.S. Food and Drug Administration
474:
13:
2404:
2353:
2302:
2235:
2209:
2151:
2123:
1897:"Ariadne Genomics: Pathway Studio"
1578:
1055:
949:
946:
943:
940:
937:
934:
931:
928:
925:
922:
919:
913:
910:
907:
904:
901:
898:
892:
889:
886:
883:
880:
874:
871:
865:
862:
859:
856:
853:
850:
845:
842:
839:
836:
833:
827:
824:
821:
818:
815:
812:
806:
803:
800:
797:
794:
791:
788:
782:
779:
773:
767:
764:
755:
752:
749:
746:
743:
740:
737:
734:
731:
728:
720:
717:
706:
703:
694:
691:
688:
685:
682:
679:
667:
664:
661:
652:
649:
646:
643:
637:
634:
631:
628:
625:
622:
619:
616:
613:
607:
604:
601:
598:
595:
486:— real-valued (such as heart rate)
388:, since the data may not follow a
14:
2652:
2605:Reference Module in Life Sciences
2565:
2181:Journal of Biomedical Informatics
1124:
1081:
585:under y experimental conditions.
411:
71:scientist reviews microarray data
1228:"Agilent | DNA Microarrays"
554:
549:
2536:
2516:
2496:
2455:
2099:
2079:
2054:
2029:
1980:
1960:
1940:
1934:
1914:
1889:
1869:
1851:
1630:
1564:
1523:
1482:
1431:
1416:10.1093/bioinformatics/19.2.185
1018:
42:
2524:"Agilent | GeneSpring GX"
1390:
1346:
1322:
1265:
1245:
1220:
1200:
1178:
1070:available from TIGR, Agilent (
758:
700:
670:
658:
438:
25:Microarray analysis techniques
1:
2481:10.1093/bioinformatics/bti605
2462:Wilson CL, Miller CJ (2005).
1549:10.1093/bioinformatics/btl033
1508:10.1093/bioinformatics/btm201
1375:10.1093/biostatistics/4.2.249
1117:
1040:as that gene) and miss rates
147:
87:Aggregation and normalization
59:
1140:Proc. Natl. Acad. Sci. U.S.A
540:
7:
1090:
498:— two sets of measurements
10:
2657:
1772:10.1186/1471-2105-15-S2-S2
223:
167:
2641:Bioinformatics algorithms
2610:SAM download instructions
2194:10.1016/j.jbi.2007.01.002
2005:10.1126/scisignal.2003629
994:
386:non-parametric statistics
373:and computes a statistic
1457:10.1186/1471-2105-11-553
295:Analysis (GSEA), uses a
2430:10.1186/1471-2105-6-129
2379:10.1186/1471-2105-7-359
2328:10.1186/1471-2105-8-242
1826:10.1186/1471-2105-9-497
1153:10.1073/pnas.0506580102
170:Hierarchical clustering
164:Hierarchical clustering
154:Hierarchical clustering
2277:10.1073/pnas.091062498
1290:10.1002/pmic.201400392
1000:Significant gene sets
960:
332:
283:
273:R programming language
188:Spearman's correlation
123:
110:Quantile normalization
72:
56:
21:
2111:tibshirani.su.domains
1065:Background correction
961:
340:statistical technique
331:
281:
184:Pearson's correlation
121:
67:
54:
19:
1234:on December 22, 2007
1097:Microarray databases
1076:Ocimum Bio Solutions
591:
309:transcription factor
31:), RNA, and protein
2268:2001PNAS...98.5116G
1877:"Ingenuity Systems"
1719:10.1109/TCBB.2013.9
461:Block Permutations
390:normal distribution
360:Stanford University
293:Gene Set Enrichment
254:Pattern recognition
2417:BMC Bioinformatics
2366:BMC Bioinformatics
2315:BMC Bioinformatics
1948:"Software - Broad"
1813:BMC Bioinformatics
1759:BMC Bioinformatics
1444:BMC Bioinformatics
956:
333:
297:Kolmogorov-Smirnov
284:
226:k-means clustering
220:K-means clustering
196:Euclidean distance
192:Manhattan distance
158:k-means clustering
124:
73:
57:
29:Gene chip analysis
22:
1691:978-0-387-29362-2
953:
918:
897:
879:
870:
832:
811:
787:
778:
772:
763:
727:
711:
699:
657:
642:
612:
534:Pattern discovery
344:Robert Tibshirani
205:Average linkage (
2648:
2559:
2558:
2556:
2555:
2546:. Archived from
2540:
2534:
2533:
2531:
2530:
2520:
2514:
2513:
2511:
2510:
2500:
2494:
2493:
2483:
2459:
2453:
2452:
2442:
2432:
2408:
2402:
2401:
2391:
2381:
2357:
2351:
2350:
2340:
2330:
2306:
2300:
2299:
2289:
2279:
2262:(9): 5116–5121.
2253:
2244:
2233:
2230:
2207:
2206:
2196:
2172:
2149:
2144:
2121:
2120:
2118:
2117:
2103:
2097:
2096:
2094:
2093:
2083:
2077:
2076:
2074:
2073:
2064:. Archived from
2058:
2052:
2051:
2049:
2048:
2039:. Archived from
2033:
2027:
2026:
2016:
1984:
1978:
1977:
1975:
1974:
1964:
1958:
1957:
1955:
1954:
1944:
1938:
1932:
1931:
1929:
1928:
1918:
1912:
1911:
1909:
1908:
1899:. Archived from
1893:
1887:
1886:
1884:
1883:
1873:
1867:
1866:
1863:biostat.ucsf.edu
1855:
1849:
1848:
1838:
1828:
1804:
1795:
1794:
1784:
1774:
1750:
1739:
1738:
1702:
1696:
1695:
1677:
1671:
1670:
1634:
1628:
1627:
1617:
1585:
1576:
1575:
1568:
1562:
1561:
1551:
1527:
1521:
1520:
1510:
1486:
1480:
1479:
1469:
1459:
1435:
1429:
1428:
1418:
1394:
1388:
1387:
1377:
1350:
1344:
1343:
1341:
1340:
1326:
1320:
1319:
1309:
1284:(8): 1375–1389.
1269:
1263:
1262:
1260:
1259:
1249:
1243:
1242:
1240:
1239:
1230:. Archived from
1224:
1218:
1217:
1215:
1214:
1204:
1198:
1197:
1195:
1194:
1182:
1176:
1175:
1165:
1155:
1146:(43): 15545–50.
1131:
965:
963:
962:
957:
955:
954:
952:
916:
895:
877:
868:
848:
830:
809:
785:
776:
770:
761:
725:
724:
723:
709:
697:
677:
655:
640:
610:
558:
553:
475:Response formats
94:local regression
2656:
2655:
2651:
2650:
2649:
2647:
2646:
2645:
2626:
2625:
2617:(by Affymetrix)
2568:
2563:
2562:
2553:
2551:
2542:
2541:
2537:
2528:
2526:
2522:
2521:
2517:
2508:
2506:
2502:
2501:
2497:
2460:
2456:
2409:
2405:
2358:
2354:
2307:
2303:
2251:
2245:
2236:
2231:
2210:
2173:
2152:
2145:
2124:
2115:
2113:
2105:
2104:
2100:
2091:
2089:
2085:
2084:
2080:
2071:
2069:
2060:
2059:
2055:
2046:
2044:
2035:
2034:
2030:
1985:
1981:
1972:
1970:
1966:
1965:
1961:
1952:
1950:
1946:
1945:
1941:
1935:
1926:
1924:
1920:
1919:
1915:
1906:
1904:
1895:
1894:
1890:
1881:
1879:
1875:
1874:
1870:
1857:
1856:
1852:
1805:
1798:
1765:(Suppl 2): S2.
1751:
1742:
1703:
1699:
1692:
1678:
1674:
1651:10.1038/nbt1238
1639:Nat. Biotechnol
1635:
1631:
1606:10.1038/nbt1239
1594:Nat. Biotechnol
1586:
1579:
1570:
1569:
1565:
1528:
1524:
1487:
1483:
1436:
1432:
1395:
1391:
1351:
1347:
1338:
1336:
1328:
1327:
1323:
1270:
1266:
1257:
1255:
1251:
1250:
1246:
1237:
1235:
1226:
1225:
1221:
1212:
1210:
1206:
1205:
1201:
1192:
1190:
1183:
1179:
1132:
1125:
1120:
1107:Transcriptomics
1093:
1084:
1067:
1058:
1056:Quality control
1053:
1039:
1021:
1012:
1006:
997:
849:
716:
712:
678:
676:
594:
592:
589:
588:
580:
573:
567:
543:
477:
441:
414:
378:
356:DNA microarrays
352:gene expression
326:
271:written in the
256:
228:
222:
180:distance matrix
172:
166:
150:
136:
127:Factor analysis
89:
62:
45:
12:
11:
5:
2654:
2644:
2643:
2638:
2624:
2623:
2618:
2612:
2607:
2598:
2592:
2586:
2580:
2574:
2567:
2566:External links
2564:
2561:
2560:
2535:
2515:
2495:
2474:(18): 3683–5.
2468:Bioinformatics
2454:
2403:
2352:
2301:
2234:
2208:
2187:(5): 552–560.
2150:
2122:
2098:
2078:
2053:
2028:
1979:
1959:
1939:
1933:
1913:
1888:
1868:
1850:
1796:
1740:
1713:(4): 845–857.
1697:
1690:
1672:
1629:
1600:(9): 1151–61.
1577:
1563:
1542:(8): 943–949.
1536:Bioinformatics
1522:
1501:(13): i282–8.
1495:Bioinformatics
1481:
1430:
1403:Bioinformatics
1389:
1345:
1321:
1264:
1244:
1219:
1199:
1177:
1122:
1121:
1119:
1116:
1115:
1114:
1109:
1104:
1099:
1092:
1089:
1083:
1082:Spot filtering
1080:
1066:
1063:
1057:
1054:
1052:
1049:
1048:
1047:
1044:
1041:
1037:
1034:
1031:
1028:
1025:
1020:
1017:
1016:
1015:
1014:
1013:
1010:
1007:
1004:
996:
993:
992:
991:
988:
985:
982:
979:
951:
948:
945:
942:
939:
936:
933:
930:
927:
924:
921:
915:
912:
909:
906:
903:
900:
894:
891:
888:
885:
882:
876:
873:
867:
864:
861:
858:
855:
852:
847:
844:
841:
838:
835:
829:
826:
823:
820:
817:
814:
808:
805:
802:
799:
796:
793:
790:
784:
781:
775:
769:
766:
760:
757:
754:
751:
748:
745:
742:
739:
736:
733:
730:
722:
719:
715:
708:
705:
702:
696:
693:
690:
687:
684:
681:
675:
672:
669:
666:
663:
660:
654:
651:
648:
645:
639:
636:
633:
630:
627:
624:
621:
618:
615:
609:
606:
603:
600:
597:
576:
571:
565:
542:
539:
538:
537:
531:
525:
519:
513:
512:
511:
505:
493:
487:
476:
473:
469:
468:
467:
466:
459:
453:
449:
440:
437:
436:
435:
432:
428:
425:
422:
413:
412:Basic protocol
410:
380:for each gene
376:
325:
322:
314:Genevestigator
255:
252:
224:Main article:
221:
218:
214:
213:
210:
203:
168:Main article:
165:
162:
149:
146:
135:
132:
88:
85:
61:
58:
44:
41:
9:
6:
4:
3:
2:
2653:
2642:
2639:
2637:
2634:
2633:
2631:
2622:
2619:
2616:
2613:
2611:
2608:
2606:
2602:
2599:
2596:
2593:
2590:
2587:
2584:
2581:
2578:
2575:
2573:
2570:
2569:
2550:on 2009-11-24
2549:
2545:
2539:
2525:
2519:
2505:
2499:
2491:
2487:
2482:
2477:
2473:
2469:
2465:
2458:
2450:
2446:
2441:
2436:
2431:
2426:
2422:
2418:
2414:
2407:
2399:
2395:
2390:
2385:
2380:
2375:
2371:
2367:
2363:
2356:
2348:
2344:
2339:
2334:
2329:
2324:
2320:
2316:
2312:
2305:
2297:
2293:
2288:
2283:
2278:
2273:
2269:
2265:
2261:
2257:
2250:
2243:
2241:
2239:
2229:
2227:
2225:
2223:
2221:
2219:
2217:
2215:
2213:
2204:
2200:
2195:
2190:
2186:
2182:
2178:
2171:
2169:
2167:
2165:
2163:
2161:
2159:
2157:
2155:
2148:
2143:
2141:
2139:
2137:
2135:
2133:
2131:
2129:
2127:
2112:
2108:
2102:
2088:
2082:
2068:on 2011-08-17
2067:
2063:
2057:
2043:on 2007-07-05
2042:
2038:
2032:
2024:
2020:
2015:
2010:
2006:
2002:
1998:
1994:
1990:
1983:
1969:
1963:
1949:
1943:
1937:
1923:
1917:
1903:on 2007-12-30
1902:
1898:
1892:
1878:
1872:
1864:
1860:
1854:
1846:
1842:
1837:
1832:
1827:
1822:
1818:
1814:
1810:
1803:
1801:
1792:
1788:
1783:
1778:
1773:
1768:
1764:
1760:
1756:
1749:
1747:
1745:
1736:
1732:
1728:
1724:
1720:
1716:
1712:
1708:
1701:
1693:
1687:
1683:
1676:
1668:
1664:
1660:
1656:
1652:
1648:
1645:(9): 1162–9.
1644:
1640:
1633:
1625:
1621:
1616:
1611:
1607:
1603:
1599:
1595:
1591:
1584:
1582:
1573:
1567:
1559:
1555:
1550:
1545:
1541:
1537:
1533:
1526:
1518:
1514:
1509:
1504:
1500:
1496:
1492:
1485:
1477:
1473:
1468:
1463:
1458:
1453:
1449:
1445:
1441:
1434:
1426:
1422:
1417:
1412:
1409:(2): 185–93.
1408:
1404:
1400:
1393:
1385:
1381:
1376:
1371:
1368:(2): 249–64.
1367:
1363:
1362:Biostatistics
1359:
1355:
1349:
1335:
1331:
1325:
1317:
1313:
1308:
1303:
1299:
1295:
1291:
1287:
1283:
1279:
1275:
1268:
1254:
1248:
1233:
1229:
1223:
1209:
1203:
1188:
1181:
1173:
1169:
1164:
1159:
1154:
1149:
1145:
1141:
1137:
1130:
1128:
1123:
1113:
1110:
1108:
1105:
1103:
1100:
1098:
1095:
1094:
1088:
1079:
1077:
1073:
1062:
1045:
1042:
1035:
1032:
1029:
1026:
1023:
1022:
1008:
1002:
1001:
999:
998:
989:
986:
983:
980:
977:
976:
975:
972:
970:
966:
713:
673:
586:
584:
579:
574:
564:
559:
557:
552:
547:
535:
532:
529:
526:
523:
520:
517:
514:
509:
506:
503:
500:
499:
497:
494:
491:
488:
485:
482:
481:
480:
472:
463:
462:
460:
457:
454:
450:
447:
443:
442:
433:
429:
426:
423:
420:
416:
415:
409:
407:
403:
399:
395:
391:
387:
383:
379:
372:
367:
365:
361:
357:
353:
349:
345:
341:
337:
330:
321:
319:
315:
310:
306:
305:Gene Ontology
302:
298:
294:
290:
280:
276:
274:
270:
266:
262:
251:
249:
245:
241:
237:
233:
227:
217:
211:
208:
204:
201:
200:
199:
197:
193:
189:
185:
181:
177:
171:
161:
159:
155:
145:
142:
131:
128:
120:
116:
113:
111:
107:
106:median polish
101:
99:
95:
84:
82:
78:
70:
66:
53:
49:
40:
38:
34:
30:
26:
18:
2552:. Retrieved
2548:the original
2538:
2527:. Retrieved
2518:
2507:. Retrieved
2498:
2471:
2467:
2457:
2420:
2416:
2406:
2369:
2365:
2355:
2318:
2314:
2304:
2259:
2255:
2184:
2180:
2114:. Retrieved
2110:
2101:
2090:. Retrieved
2081:
2070:. Retrieved
2066:the original
2056:
2045:. Retrieved
2041:the original
2031:
1996:
1992:
1982:
1971:. Retrieved
1962:
1951:. Retrieved
1942:
1936:
1925:. Retrieved
1916:
1905:. Retrieved
1901:the original
1891:
1880:. Retrieved
1871:
1862:
1853:
1816:
1812:
1762:
1758:
1710:
1706:
1700:
1681:
1675:
1642:
1638:
1632:
1597:
1593:
1566:
1539:
1535:
1525:
1498:
1494:
1484:
1447:
1443:
1433:
1406:
1402:
1392:
1365:
1361:
1354:Irizarry, RA
1348:
1337:. Retrieved
1333:
1324:
1281:
1277:
1267:
1256:. Retrieved
1247:
1236:. Retrieved
1232:the original
1222:
1211:. Retrieved
1202:
1191:. Retrieved
1180:
1143:
1139:
1085:
1068:
1059:
1019:SAM features
973:
969:Fold changes
968:
967:
587:
582:
577:
569:
562:
560:
548:
544:
533:
527:
521:
515:
507:
501:
495:
489:
484:Quantitative
483:
478:
470:
465:recommended;
456:Permutations
394:permutations
381:
374:
368:
335:
334:
285:
269:Bioconductor
257:
231:
229:
215:
173:
151:
137:
125:
114:
102:
90:
74:
46:
43:Introduction
28:
24:
23:
2636:Microarrays
1999:(r5): rs5.
1993:Sci. Signal
1078:(Genowiz).
528:Time course
439:Running SAM
348:Gilbert Chu
176:homogeneous
33:microarrays
2630:Categories
2554:2009-04-02
2529:2008-01-02
2509:2008-01-01
2116:2023-11-24
2092:2008-10-15
2072:2007-12-31
2047:2007-12-31
1973:2007-12-31
1953:2007-12-31
1927:2014-09-09
1907:2007-12-31
1882:2007-12-31
1819:(1): 497.
1339:2023-11-24
1278:Proteomics
1258:2008-01-01
1238:2008-01-02
1213:2008-01-02
1193:2007-12-26
1118:References
1112:Proteomics
1072:GeneSpring
516:Multiclass
431:Controller
419:microarray
406:Bonferroni
398:parametric
289:phenotypes
265:Moksiskaan
148:Clustering
77:Affymetrix
60:Techniques
2597:—software
2591:—software
2585:—software
2579:—software
2037:"DBI Web"
1334:MathWorks
1298:1615-9853
774:#
541:Algorithm
496:Two class
490:One class
364:R-package
318:neoplasms
248:k-medoids
240:k-medoids
2490:16076888
2449:15921534
2398:16872483
2347:17612399
2296:11309499
2203:17317331
2087:"RssGsc"
2023:23443684
1845:19038021
1791:24564555
1727:24334380
1659:17061323
1624:16964229
1558:16473874
1517:17646307
1476:21070630
1425:12538238
1384:12925520
1316:25690415
1172:16199517
1091:See also
522:Survival
502:Unpaired
417:Perform
236:centroid
2440:1173086
2423:: 129.
2389:1544358
2372:: 359.
2338:1931607
2321:: 242.
2264:Bibcode
2062:"SCOPE"
2014:3756668
1836:2632677
1782:4072854
1667:8192240
1615:3272078
1467:2998528
1450:: 553.
1307:4510819
1163:1239896
1074:), and
479:Types:
371:t-tests
301:GenBank
261:GenMAPP
244:k-means
141:t-tests
98:MA plot
81:Agilent
2488:
2447:
2437:
2396:
2386:
2345:
2335:
2294:
2284:
2201:
2021:
2011:
1859:"Home"
1843:
1833:
1789:
1779:
1735:760277
1733:
1725:
1688:
1665:
1657:
1622:
1612:
1556:
1515:
1474:
1464:
1423:
1382:
1314:
1304:
1296:
1170:
1160:
995:Output
917:
896:
878:
869:
831:
810:
786:
777:
771:
762:
726:
710:
698:
656:
641:
611:
508:Paired
362:in an
156:, and
37:genome
2287:33173
2252:(PDF)
1731:S2CID
1663:S2CID
452:Sizes
402:ANOVA
338:is a
207:UPGMA
2486:PMID
2445:PMID
2394:PMID
2343:PMID
2292:PMID
2199:PMID
2019:PMID
1841:PMID
1787:PMID
1723:PMID
1686:ISBN
1655:PMID
1620:PMID
1554:PMID
1513:PMID
1472:PMID
1421:PMID
1380:PMID
1312:PMID
1294:ISSN
1168:PMID
575:. r
561:The
404:and
346:and
263:and
186:and
79:and
2603:in
2476:doi
2435:PMC
2425:doi
2384:PMC
2374:doi
2333:PMC
2323:doi
2282:PMC
2272:doi
2189:doi
2009:PMC
2001:doi
1831:PMC
1821:doi
1777:PMC
1767:doi
1715:doi
1647:doi
1610:PMC
1602:doi
1544:doi
1503:doi
1462:PMC
1452:doi
1411:doi
1370:doi
1302:PMC
1286:doi
1158:PMC
1148:doi
1144:102
194:or
2632::
2484:.
2472:21
2470:.
2466:.
2443:.
2433:.
2419:.
2415:.
2392:.
2382:.
2368:.
2364:.
2341:.
2331:.
2317:.
2313:.
2290:.
2280:.
2270:.
2260:98
2258:.
2254:.
2237:^
2211:^
2197:.
2185:40
2183:.
2179:.
2153:^
2125:^
2109:.
2017:.
2007:.
1995:.
1991:.
1861:.
1839:.
1829:.
1815:.
1811:.
1799:^
1785:.
1775:.
1763:15
1761:.
1757:.
1743:^
1729:.
1721:.
1711:10
1709:.
1661:.
1653:.
1643:24
1641:.
1618:.
1608:.
1598:24
1596:.
1592:.
1580:^
1552:.
1540:22
1538:.
1534:.
1511:.
1499:23
1497:.
1493:.
1470:.
1460:.
1448:11
1446:.
1442:.
1419:.
1407:19
1405:.
1401:.
1378:.
1364:.
1360:.
1332:.
1310:.
1300:.
1292:.
1282:15
1280:.
1276:.
1166:.
1156:.
1142:.
1138:.
1126:^
714:90
366:.
320:.
246:,
2557:.
2532:.
2512:.
2492:.
2478::
2451:.
2427::
2421:6
2400:.
2376::
2370:7
2349:.
2325::
2319:8
2298:.
2274::
2266::
2205:.
2191::
2119:.
2095:.
2075:.
2050:.
2025:.
2003::
1997:6
1976:.
1956:.
1930:.
1910:.
1885:.
1865:.
1847:.
1823::
1817:9
1793:.
1769::
1737:.
1717::
1694:.
1669:.
1649::
1626:.
1604::
1574:.
1560:.
1546::
1519:.
1505::
1478:.
1454::
1427:.
1413::
1386:.
1372::
1366:4
1342:.
1318:.
1288::
1261:.
1241:.
1216:.
1196:.
1174:.
1150::
1038:i
1011:y
1005:y
950:t
947:n
944:a
941:c
938:i
935:f
932:i
929:n
926:g
923:i
920:s
914:d
911:e
908:l
905:l
902:a
899:c
893:s
890:e
887:n
884:e
881:g
875:f
872:o
866:r
863:e
860:b
857:m
854:u
851:N
846:s
843:e
840:n
837:e
834:g
828:d
825:e
822:l
819:l
816:a
813:c
807:y
804:l
801:e
798:s
795:l
792:a
789:f
783:f
780:o
768:f
765:o
759:)
756:e
753:l
750:i
747:t
744:n
741:e
738:c
735:r
732:e
729:p
721:h
718:t
707:r
704:o
701:(
695:n
692:a
689:i
686:d
683:e
680:M
674:=
671:)
668:R
665:D
662:F
659:(
653:e
650:t
647:a
644:r
638:y
635:r
632:e
629:v
626:o
623:c
620:s
617:i
614:d
608:e
605:s
602:l
599:a
596:F
583:i
578:i
572:i
570:d
566:o
563:s
382:j
377:j
375:d
232:K
209:)
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.