Dean's blog

R-ggplot2拼图-patchwork

Posted on 2022-01-24
Symbols count in article: 4.4k Reading time ≈ 4 mins.

据patchwork的作者 Thomas Pedersen所介绍，他开发的初衷就是让ggplots的组合可以ridiculously simple！

The goal of patchwork is to make it ridiculously simple to combine separate ggplots into the same graphic. As such it tries to solve the same problem as gridExtra::grid.arrange() and cowplot::plot_grid but using an API that incites exploration and iteration.

我使用后的感觉也是如此， patchwork可读性高、操作简单、可操作性也高，真的太强了。下面简单介绍一下这个包吧！

笔记-单细胞转录组测序 10X Genomics

Posted on 2022-01-24
Symbols count in article: 1.8k Reading time ≈ 2 mins.

10X Genomics 是一个综合的单细胞测序技术平台，可以应用于多种单细胞测序上，包括Single cell gene expression, Single cell Immune Profiling, Single cell CNV, Single cell ATAC。这篇文章将主要简述10X Genomics在单细胞基因表达检测方面，也就是scRNA-seq上的应用。

笔记-单细胞转录组测序 Microwell-seq

Posted on 2022-01-24
Symbols count in article: 1.4k Reading time ≈ 1 mins.

最近在看Construction of a human cell landscape at single-cell level 这篇文章。其中，作者构建出了多组织的人类单细胞图谱。注意到作者使用了Microwell-seq这一种scRNA-seq，因此特意了解该技术，来跟大家分享一下我对这项技术的理解。

笔记-单细胞转录组测序 Smart-seq2

Posted on 2022-01-24
Symbols count in article: 1.5k Reading time ≈ 1 mins.

Smart-seq是由路德维格癌症研究所的 Rickard Sandberg实验室所开发的一套在全转录组范围进行单细胞RNA测序 (scRNA-seq) 的方法。Smart-seq因为以全长mRNA建库，所以对转录本的测序覆盖度也有所上升。Smart-seq2是由Picelli等人从Smart-seq中改良而来(Picelli et al., 2013) (Picelli et al., 2014)。本文将从Smart-seq2的原理、优点和缺点来为大家介绍这项scRNA-seq技术。

理论-Neighbor Joining建树浅析

Posted on 2022-01-24
Symbols count in article: 1.9k Reading time ≈ 2 mins.

Neighbor Joining是一种bottom-up的聚类方法，常被用于**系统发育树(phylogenetic tree)*的构建当中。 Naruya Saitou 和 Masatoshi Nei在1987年将NJ法发表在Molecular Biology and Evolution*中，至今已有超5万的引入量，实在是生物信息学中超重量级的文章。

理论-DESeq2-normalization

Posted on 2022-01-24 Edited on 2022-01-23
Symbols count in article: 3.2k Reading time ≈ 3 mins.

在做基因差异表达分析时，经常会用DESeq2这个包，但一直没有深究其分析的统计流程。因此，在这里记录一下DESeq2校正基因表达的方法 – 比率中值法 (Median of ratios)。

理论-edgeR-TMM-normalization

Posted on 2022-01-24 Edited on 2022-01-23
Symbols count in article: 5.5k Reading time ≈ 5 mins.

最近在看差异分析当中原始read counts是如何被校正的，自然就不会放过差异分析的经典之一 —— edgeR.

理论-简述RPKM-FPKM-AND-TPM

Posted on 2022-01-24 Edited on 2022-01-23
Symbols count in article: 4.7k Reading time ≈ 4 mins.

在高通量测序数据的分析中，仅仅靠raw read counts描述基因的表达量是远远不够的。受限于测序过程中的技术因素影响，read counts对于基因表达量的反映存在一定偏好（bias）。因此，Mortazavi等人提出了RPKM/FPKM的方法对read counts进行normalization以使基因表达量的比较可以在不同文库间进行。随后，Mortazavi等人更是提出了考虑转录本长度分布情况的TPM方法。本文将会简要说明为什么我们要对read counts进行normalization，以及RPKM，FPKM，TPM是什么，并通过一个简单地例子阐述为什么TPM才是被更多人认同的方法。

edgeR-简单的转录组差异表达分析

Posted on 2022-01-24 Edited on 2022-01-23
Symbols count in article: 5.7k Reading time ≈ 5 mins.

经典的转录组差异分析通常会使用到三个工具limma/voom, edgeR和DESeq2，今天我们同样使用一个小规模的转录组测序数据来演示edgeR的简单流程。

DESeq2-简单的转录组差异表达分析

Posted on 2022-01-24 Edited on 2022-01-23
Symbols count in article: 7.2k Reading time ≈ 7 mins.

经典的转录组差异分析通常会使用到三个工具limma/voom, edgeR和DESeq2。今天我们就通过一个小规模的转录组测序数据来演示DESeq2的简单流程。