The identification of differentially expressed genes (DEGs) from transcriptomic datasets is a major avenue of research across diverse disciplines. Current bioinformatic tools support a number of experimental designs including covariates, random effects, and blocking. However, covariance matrices are not yet among the features available. Here, we introduce kimma for kinship in mixed model analysis, an open-source R package that provides linear and linear mixed effects modeling of RNA-seq data including all previous designs plus covariance random effects. kimma equals or outcompetes other DEG pipelines in terms of sensitivity, computational time, and model complexity.
In particular, kimma provides:
- A single function
kmFit
for flexible linear modeling of fixed, random, and complex random effects - Multi-processor architecture for reduced computational time
- Multiple model fit measures such as AIC and BIC
- Easy incorporation with data visualization and downstream analyses in the
BIGverse
(see below)
Main kimma
package vignette
https://bigslu.github.io/kimma_vignette/kimma_vignette.html
Additional tutorials related to RNA-seq data analysis
- fastq to counts: Sequence quality filtering, alignment, and counts with SEAsnake: Software to our run pipeline
- Counts to voom: Count quality filtering and normalization with
edgeR
andlimma
- Voom to DEG: Additional information on how to pick a best fit model
- RNAseq modules: Gene co-expression modules with
WGCNA
- Gene set analysis: Including hypergeometric enrichment,
enrichr
, gene set enrichment analysis (GSEA), and STRING networks