Statistical comparison of two or more SAGE libraries: one tag at a time

Gerben J. Schaaf, Fred van Ruissen, Antoine van Kampen, Marcel Kool, Jan M. Ruijter

Research output: Contribution to journalArticleAcademicpeer-review

6 Citations (Scopus)

Abstract

Several statistical tests have been introduced for the comparison of serial analysis of gene expression (SAGE) libraries to quantitatively analyze the differential expression of genes. As each SAGE library is only one measurement, the necessary information on biological variation or experimental precision is lacking. Therefore, each test includes its own approach to derive such a variance measure from the data set or a theoretical distribution. Because the confidence in tag counts depends on the library size, a test between two or more libraries should be based on original tag counts. When groups of libraries are compared, the test should determine that the proportion of a specific tag in all libraries is the same (null hypothesis), but also offer the possibility to detect specific differences between individual libraries and groups of libraries. The Z-test and the G-test encompass these characteristics and are described for the comparison of two libraries and (two or more) groups of libraries, respectively
Original languageEnglish
Pages (from-to)151-168
JournalMethods in molecular biology (Clifton, N.J.)
Volume387
Publication statusPublished - 2008

Cite this