Tissue-specificity metrics
tspex provides twelve distinct tissue-specificity metrics, which differ in their assumptions, scale and properties. Broadly, these metrics can be divided into two groups1:
- General scoring metrics: Summarize in a single value how tissue-specific or ubiquitous is a gene across all tissues.
- Individualized scoring metrics: Quantify how specific is the expression of each gene to each tissue.
For the following equations x_i represents the gene expression in tissue i and n is the number of tissues. Some metrics give results that are not in the [0,1] interval, such as the Simpson index, that lies in the [\frac{1}{n},1] range. For these metrics, we provide transformed versions, denoted by the ' symbol, that range from 0 (perfectly ubiquitous) to 1 (perfectly tissue-specific).
General scoring metrics
The general scoring metrics included in tspex are: Counts2, Tau3, Gini coefficient4, Simpson index5, Shannon entropy specificity6, ROKU specificity7, Specificity measure dispersion (SPM DPM)8, and Jensen-Shannon specificity dispersion (JSS DPM).
Counts
Tau
Gini coefficient
Simpson index
Shannon entropy specificity (HS)
ROKU specificity
Specificity measure dispersion (SPM DPM)
Jensen-Shannon specificity dispersion (JSS DPM)
Individualized scoring metrics
The general scoring metrics included in tspex are: Tissue-specificity index (TSI)9, Z-score10, Specificity measure (SPM)11, and Jensen-Shannon specificity (JSS)12.
Tissue-specificity index (TSI)
Z-score
Specificity measure (SPM)
Jensen-Shannon specificity (JSS)
References
-
Kryuchkova-Mostacci, Nadezda, and Marc Robinson-Rechavi. "A benchmark of gene expression tissue-specificity metrics." Briefings in bioinformatics 18.2 (2017): 205-214. ↩
-
Duret, Laurent, and Dominique Mouchiroud. "Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate." Molecular biology and evolution 17.1 (2000): 68-070. ↩
-
Yanai, Itai, et al. "Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification." Bioinformatics 21.5 (2004): 650-659. ↩
-
Ceriani, Lidia, and Paolo Verme. "The origins of the Gini index: extracts from Variabilità e Mutabilità (1912) by Corrado Gini." The Journal of Economic Inequality 10.3 (2012): 421-443. ↩
-
Simpson, Edward H. "Measurement of diversity." Nature 163.4148 (1949): 688. ↩
-
Schug, Jonathan, et al. "Promoter features related to tissue specificity as measured by Shannon entropy." Genome biology 6.4 (2005): R33. ↩
-
Kadota, Koji, et al. "ROKU: a novel method for identification of tissue-specific genes." BMC bioinformatics 7.1 (2006): 294. ↩
-
Pan, Jian-Bo, et al. "PaGeFinder: quantitative identification of spatiotemporal pattern genes." Bioinformatics 28.11 (2012): 1544-1545. ↩
-
Julien, Philippe, et al. "Mechanisms and evolutionary patterns of mammalian and avian dosage compensation." PLoS biology 10.5 (2012): e1001328. ↩
-
Vandenbon, Alexis, and Kenta Nakai. "Modeling tissue-specific structural patterns in human and mouse promoters." Nucleic acids research 38.1 (2009): 17-25. ↩
-
Xiao, Sheng-Jian, et al. "TiSGeD: a database for tissue-specific genes." Bioinformatics 26.9 (2010): 1273-1275. ↩
-
Cabili, Moran N., et al. "Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses." Genes & development 25.18 (2011): 1915-1927. ↩