For every term in the categorical columns it is tested whether the corresponding expression values have a preference to be systematically larger or smaller than the global distribution of expression values. For details see Cox and Mann (2012) BMC Bioinformatics 13 Suppl 16:S12.
Output: The output matrix contains a list of terms from all categorical columns that are significantly biased towards low or high values in the expression columns.
Selected expression/numerical columns that should be processed (default: no expression/numerical columns are selected).
The truncation can be based on p-values or the Benjamini-Hochberg correction for multiple hypothesis testing (default: Benjamini-Hochberg FDRFalse Discovery Rate). Rows with a test result below a specified value (“Threshold value”) are reported as significant.
To apply a two-sided test, where the null hypothesis can be rejected regardless of the direction of the effect “both” has to be selected (default). “high” and “low” are the respective one-sided tests.
Based on a specified threshold (default: 0.02) a row is reported as significant, if its test result is below the defined value. Depending on the chosen truncation score this threshold value is applied to the p-value or to the Benjamini-Hochberg FDR
Selected text column, where all rows having the same identifier will be counted as one entity in the 1D annotation enrichment analysis (default: <None>). The main application is for posttranslational modification sites. Then one should select here protein or gene identifiers. This will make sure that multiple sites from the same protein (or gene) are counted only once for the enrichment analysis.