收起 展开

1 项目概述

项目编号GDR21060245_std_1
项目内容Mus_musculus --- 3 scRNA-seq
参考基因组GRCm39
样品信息NP¦NPM¦PBS

2 项目流程

2.1 实验流程
Fig 2-1-1 10x单细胞实验流程图

  1. 细胞质检
    取少量单细胞悬液,加入等体积0.4%台盼蓝染液,用 Countess® II Automated Cell Counter 对细胞计数,将活细胞浓度调整到理想浓度(1000~2000个/μL)。
  2. 10X 标记 cDNA 片段
    含有 barcode 信息的凝胶珠与细胞和酶的混合物结合,进入储液器中被油分隔开,形成GEMs(Gel Beads-In-Emulsions)。之后,凝胶珠溶解释放含有Barcode序列的捕获序列,逆转录 cDNA 片段,并对样本进行标记。将凝胶珠破碎并打碎油滴,以 cDNA 为模板进行 PCR 扩增。将所有GEMs的产物混合,构建标准测序文库。
  3. 标准测序文库建库
    首先将 cDNA 酶切打断成 200~300bp 左右的片段,然后经过末端修复、加A尾、测序接头 P5 、P7和sample index等常规二代测序文库构建步骤,最后进行 PCR 扩增得到 DNA 文库。
  4. 文库测序
    利用Illumina测序平台的PE150测序模式对建好的文库进行高通量测序。

2.2 分析流程
Fig 2-2-1 10x单细胞分析流程图

广州基迪奥生物科技有限公司




3 测序数据质控和表达量定量

3.1 测序数据基本质控

使用cellranger[1],我们可以对测序质量进行质控,去除测序质量低的reads,并对每个样本测的reads数和测序质量进行初步统计。

Tab 3-1-1 各样本测序数据统计表
SampleNumber of ReadsValid BarcodesSequencing SaturationQ30 Bases in BarcodeQ30 Bases in RNA ReadQ30 Bases in UMI
NP270,139,31396.8%61.6%94.9%89.6%94.3%
NPM320,541,93097.3%53.3%95.2%90.4%94.5%
PBS318,068,46196.7%69.1%95.0%90.1%94.3%

3.2 数据定量

使用cellranger,我们将reads与参考基因组进行比对,将reads注释为特定基因;再对UMI进行修正和统计后,获得未过滤的feature-barcode矩阵;根据未过滤的feature-barcode矩阵,cellranger对数据中的细胞和非细胞进行识别和区分,并绘制为rank-plot图,直观体现有效细胞鉴定结果。

Tab 3-2-1 各样本比对结果统计表
SampleEstimated Number of CellsFraction Reads in CellsMean Reads per CellMedian Genes per CellTotal Genes DetectedMedian UMI Counts per CellReads Mapped Confidently to GenomeReads Mapped Confidently to Intergenic RegionsReads Mapped Confidently to Intronic RegionsReads Mapped Confidently to Exonic RegionsReads Mapped Confidently to Transcriptome
NP10,86590.4%24,8631,23116,4522,80888.8%12.9%22.2%53.7%50.5%
NPM16,35988.7%19,5941,12416,5412,40291.7%8.1%19.0%64.5%61.0%
PBS12,48991.2%25,46793816,1352,34989.2%12.9%23.8%52.5%49.0%

  • NP有效细胞
  • NPM有效细胞
  • PBS有效细胞

Fig 3-2-1 有效细胞鉴定图


各样本质控、定量结果报告:

3.3 最终鉴定细胞表达量矩阵

基于UMI修正和有效细胞鉴定后的结果,我们可以使用UMI条数对基因进行定量,获得如下的细胞-基因表达量定量结果。

Tab 3-3-1 样本NP所有细胞各个基因UMI丰度信息表(示例,前10个细胞,前20个基因)
GeneIDNameAAACCCAAGATACATG-1AAACCCAAGATGAAGG-1AAACCCAAGCACTCAT-1AAACCCAAGCATTTCG-1AAACCCAAGCCGGAAT-1AAACCCAAGCTTAAGA-1AAACCCAAGCTTACGT-1AAACCCAAGTCGCTAT-1AAACCCACATACTGAC-1
ENSMUSG00000114212Gm47985000000000
ENSMUSG00000114943Gm8947000000000
ENSMUSG00000114694Gm47995000000000
ENSMUSG000001145823110040M04Rik000000000
ENSMUSG00000114591Gm47996000000000
ENSMUSG00000051285Pcmtd1000110020
ENSMUSG00000026312Cdh7000000000
ENSMUSG00000039748Exo1000000000
ENSMUSG00000104158Gm38100000000000
ENSMUSG00000057363Uxs1000000000
ENSMUSG00000047216Cdh19000000000
ENSMUSG00000038702Dsel002000000
ENSMUSG00000033021Gmppa000000000
ENSMUSG00000055214Pld5000000000
ENSMUSG00000078184Rbm8a2000000000
ENSMUSG00000033007Asic4000000000
ENSMUSG00000070695Cntnap5a000000000
ENSMUSG00000025909Sntg1000000000
ENSMUSG00000094429Gm19965000000000
ENSMUSG00000100265Gm28363000000000

备注:由于单个细胞在某个瞬间,只有小部分基因表达,因此表中大量基因UMI丰度为0。

UMI定量总表文件:

备注:文件较大,Excel可能难以打开,建议用软件notepad++ 打开,或联系基迪奥技术人员提取您所关心的基因的UMI丰度。

广州基迪奥生物科技有限公司




4 单细胞亚群分类

在cellranger完成基因表达量鉴定后,我们将表达量矩阵转入Seurat[2]进行后续的分析。

4.1 非正常细胞的进一步过滤

cellranger的细胞过滤是根据基因表达量进行自动识别,会有部分非正常细胞残留,所以在进行亚群分类之前,我们需要对非正常细胞进行进一步过滤。

首先检测各样本中含有多个细胞的GEM。利用DoubletFinder[3]计算GEM为多胞的概率(pANN值),然后基于10X官方给出的有效细胞数(cellranger过滤后)与多胞率之间的关系计算各样本多胞率,确定各样本多胞过滤阈值,依次进行多胞过滤。

  • NP
  • NPM
  • PBS
CellspANNclassifications
NP_AAACCCAAGATACATG0.111111111111111Singlet
NP_AAACCCAAGATGAAGG0.347222222222222Doublet
NP_AAACCCAAGCACTCAT0.111111111111111Singlet
NP_AAACCCAAGCATTTCG0.0833333333333333Singlet
NP_AAACCCAAGCCGGAAT0.0555555555555556Singlet
NP_AAACCCAAGCTTAAGA0.0138888888888889Singlet
NP_AAACCCAAGCTTACGT0.0277777777777778Singlet
NP_AAACCCAAGTCGCTAT0.541666666666667Doublet
NP_AAACCCACATACTGAC0.0138888888888889Singlet
NP_AAACCCACATCAACCA0.0833333333333333Singlet

CellspANNclassifications
NPM_AAACCCAAGATTGCGG0.152679830747532Singlet
NPM_AAACCCAAGCACACAG0.277856135401975Singlet
NPM_AAACCCAAGCCATTCA0.190409026798307Singlet
NPM_AAACCCAAGTCATGCT0.415373765867419Doublet
NPM_AAACCCAAGTCCCAGC0.0624118476727786Singlet
NPM_AAACCCACAGCATGCC0.138222849083216Singlet
NPM_AAACCCACAGGACATG0.175599435825106Singlet
NPM_AAACCCAGTAGGCTGA0.321932299012694Singlet
NPM_AAACCCAGTAGTTCCA0.21015514809591Singlet
NPM_AAACCCAGTATGCAAA0.0719322990126939Singlet

CellspANNclassifications
PBS_AAACCCAAGAAGCTGC0.0240963855421687Singlet
PBS_AAACCCACAATAGGAT0.192771084337349Singlet
PBS_AAACCCACACAGCATT0Singlet
PBS_AAACCCACACCAGTAT0.0602409638554217Singlet
PBS_AAACCCACACCTTCCA0.0120481927710843Singlet
PBS_AAACCCACACGGCCAT0.0481927710843374Singlet
PBS_AAACCCAGTGCATACT0.108433734939759Singlet
PBS_AAACCCATCCTCGCAT0.0481927710843374Singlet
PBS_AAACCCATCCTGTTAT0.0843373493975904Singlet
PBS_AAACGAAAGCATACTC0Singlet

Tab 4-1-1 多胞检测结果统计表


  • NP
  • NPM
  • PBS

Fig 4-1-1 多胞概率分布tSNE图


  • NP
  • NPM
  • PBS

Fig 4-1-2 多胞分布tSNE图


除此以外,我们对以下指标进行过滤:

  1. 单细胞中鉴定到的基因数量(200.0-4500.0)。对于同一种细胞来说,表达基因的数量一般维持在一定范围内,如果该值过高,可能是一个GEM中包裹了多种细胞类型,这样的barcode应该剔除
  2. 单细胞中UMI的总数(小于22000.0)。单个细胞中可以存在的mRNA总量是有限的,如果UMI总数过高,则可能是两个或两个以上的细胞进入同一个GEM中,这样的细胞应该剔除
  3. 单细胞中线粒体基因表达量比例(小于32.0%)。细胞凋亡通常伴随着线粒体基因的高表达,所以线粒体基因的高表达意味着细胞状态不佳,这些细胞在实验过程中受到了不良刺激,不利于后续分析反应真实的细胞情况,这样的细胞应该剔除
Tab 4-1-2 过滤前后各个样本中细胞数据量统计表
Samplesbefore_filter_numafter_filter_numpctbefore_filter_median_UMI_per_cellafter_filter_median_UMI_per_cellbefore_filter_median_genes_per_cellafter_filter_median_genes_per_cellbefore_filter_median_MT_per_cellafter_filter_median_MT_per_cell
NP10865929485.54%28082765.5123112193.08087291399232.90780504371868
NPM163591303379.67%24022188112410492.970041322314052.76990871891722
PBS124891069485.63%234922679388972.370057491493612.21095657188905

Fig 4-1-3 过滤前后各个样本细胞基本信息的分布图

  • nCount与nFeature的关系
  • nCount与pMito的关系

Fig 4-1-4 过滤前后各个样本细胞基本信息的分布散点图


4.2 单细胞亚群分类

在去除低质量细胞后,我们利用Harmony[4]进行数据合并以及批次效应矫正。首先对合并后的数据进行PCA降维,Harmony采用soft k-means clustering算法对降维后的数据进行聚类,将细胞概率性的分配给cluster,使每个cluster内数据集的多样性最大化;然后计算每个cluster内所有数据集的全局中心,以及每个特定数据集的中心;最后在每个cluster内,基于中心为每个数据集计算校正因子,对细胞进行校正使其向中心聚集;不断重复以上步骤,直到聚类效果趋于稳定。

Tab 4-2-1 细胞亚群分类结果统计表
ClusterCells numberMedian Features per CellMedian Counts per Cell
076516661732
165862055.55226.5
2353710572371
32826930.52056
4202510662107
5198610562379
6153711182327
7124713232533
888511702448
986816974899
1077518015471
1174912122997
1270111972600
1331515464094
1431512262610
15287315513100
162081948.55467.5
1713611732535.5
181319591547
191022281.56718
205811702734.5
21541038.54533.5
2242596.51134

Tab 4-2-2 各样本在各个亚群中细胞数量统计表
ClusterNPNPMPBS
Total9294 (100%)13033 (100%)10694 (100%)
01876 (20.19%)2015 (15.46%)3760 (35.16%)
12051 (22.07%)2983 (22.89%)1552 (14.51%)
2912 (9.81%)1496 (11.48%)1129 (10.56%)
3923 (9.93%)1127 (8.65%)776 (7.26%)
4396 (4.26%)897 (6.88%)732 (6.84%)
5386 (4.15%)1194 (9.16%)406 (3.8%)
6538 (5.79%)701 (5.38%)298 (2.79%)
7433 (4.66%)294 (2.26%)520 (4.86%)
8332 (3.57%)379 (2.91%)174 (1.63%)
9324 (3.49%)287 (2.2%)257 (2.4%)
10240 (2.58%)266 (2.04%)269 (2.52%)
11253 (2.72%)282 (2.16%)214 (2%)
12294 (3.16%)209 (1.6%)198 (1.85%)
1365 (0.7%)172 (1.32%)78 (0.73%)
1491 (0.98%)94 (0.72%)130 (1.22%)
1510 (0.11%)262 (2.01%)15 (0.14%)
1615 (0.16%)173 (1.33%)20 (0.19%)
176 (0.06%)39 (0.3%)91 (0.85%)
1838 (0.41%)82 (0.63%)11 (0.1%)
1969 (0.74%)18 (0.14%)15 (0.14%)
2014 (0.15%)31 (0.24%)13 (0.12%)
2111 (0.12%)13 (0.1%)30 (0.28%)
2217 (0.18%)19 (0.15%)6 (0.06%)

Fig 4-2-1 各样本中各亚群细胞数量堆叠图 Fig 4-2-2 各样本中各亚群细胞数量百分比堆叠图

Fig 4-2-3 各亚群中各个样本细胞数量堆叠图 Fig 4-2-4 各亚群中各个样本细胞数量百分比堆叠图

进一步,我们计算两个细胞亚群之间相关性并绘制成热图。图中具有高度相关性的两个细胞亚群具有比较相似的基因表达模式,可能是同一种细胞类型。这张相关性热图为人工细胞亚群鉴定提供了一定的指导作用。

Fig 4-2-5 各亚群相关性热图

基因在各个亚群中表达量的均值表:2.Cluster/2.cluster/AllGene.avg_exp.annot.xls

Tab 4-2-3 基因在各个亚群中表达量的均值表(前20行)
Gene_IDGene_nameCluster 0Cluster 1Cluster 2Cluster 3Cluster 4Cluster 5Cluster 6Cluster 7Cluster 8Cluster 9Cluster 10Cluster 11Cluster 12Cluster 13Cluster 14Cluster 15Cluster 16Cluster 17Cluster 18Cluster 19Cluster 20Cluster 21Cluster 22DescriptionKEGG_A_classKEGG_B_classPathwayK_IDGO ComponentGO FunctionGO Process
ENSMUSG00000114212Gm4798500000000000000000000000predicted gene, 47985 [Source:MGI Symbol;Acc:MGI:6097275]-------
ENSMUSG00000114943Gm894700000000000000000000000predicted gene 8947 [Source:MGI Symbol;Acc:MGI:3648958]----GO:0005575//cellular_componentGO:0003674//molecular_functionGO:0008150//biological_process
ENSMUSG00000114694Gm4799500000000000000000000000predicted gene, 47995 [Source:MGI Symbol;Acc:MGI:6097291]-------
ENSMUSG000001145823110040M04Rik00000000000000000000000RIKEN cDNA 3110040M04 gene [Source:MGI Symbol;Acc:MGI:1920426]----GO:0005575//cellular_componentGO:0003674//molecular_functionGO:0008150//biological_process
ENSMUSG00000114591Gm4799600000000000000000000000predicted gene, 47996 [Source:MGI Symbol;Acc:MGI:6097293]-------
ENSMUSG00000051285Pcmtd10.6628699218973920.7345568191219621.276552155605970.9386557864989661.068319249264720.7063231528231091.122468296532491.334454056426040.7809674059745050.572360297108192.041482026896690.8757577513264530.477737974913390.6063320310062271.192098504829780.6513532794068020.7230671064515620.8345686242024131.947493136205170.8552289500726040.5716733882053110.06725870136043730.030977782734223protein-L-isoaspartate (D-aspartate) O-methyltransferase domain containing 1 [Source:MGI Symbol;Acc:MGI:2441773]----GO:0005737//cytoplasm;GO:0016020//membraneGO:0004719//protein-L-isoaspartate (D-aspartate) O-methyltransferase activity;GO:0008168//methyltransferase activity;GO:0016740//transferase activityGO:0006464//cellular protein modification process;GO:0006479//protein methylation;GO:0032259//methylation
ENSMUSG00000026312Cdh700000000000000000000000cadherin 7, type 2 [Source:MGI Symbol;Acc:MGI:2442792]----GO:0005886//plasma membrane;GO:0005912//adherens junction;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0016342//catenin complexGO:0005509//calcium ion binding;GO:0045296//cadherin binding;GO:0046872//metal ion bindingGO:0000902//cell morphogenesis;GO:0007043//cell-cell junction assembly;GO:0007155//cell adhesion;GO:0007156//homophilic cell adhesion via plasma membrane adhesion molecules;GO:0007275//multicellular organism development;GO:0016339//calcium-dependent cell-cell adhesion via plasma membrane cell adhesion molecules;GO:0034332//adherens junction organization;GO:0098609//cell-cell adhesion;GO:0098742//cell-cell adhesion via plasma-membrane adhesion molecules
ENSMUSG00000039748Exo18.84315758029278e-040.004620934462528560.01394735854520640.004570054237897750.00473458686125060.006360508054659180.003515206833198480.01424131334369480.02634259367010340.008752807469717830.006610532203653850.0032683316637605700.09830676129736480.00627144048716550.004258779633919560.00435992772983795000000exonuclease 1 [Source:MGI Symbol;Acc:MGI:1349427]Genetic Information ProcessingReplication and repairko03430//Mismatch repairK10746GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005886//plasma membrane;GO:0016604//nuclear bodyGO:0003677//DNA binding;GO:0003682//chromatin binding;GO:0003824//catalytic activity;GO:0004518//nuclease activity;GO:0004519//endonuclease activity;GO:0004527//exonuclease activity;GO:0008409//5'-3' exonuclease activity;GO:0016787//hydrolase activity;GO:0016788//hydrolase activity, acting on ester bonds;GO:0017108//5'-flap endonuclease activity;GO:0035312//5'-3' exodeoxyribonuclease activity;GO:0045145//single-stranded DNA 5'-3' exodeoxyribonuclease activity;GO:0046872//metal ion binding;GO:0048256//flap endonuclease activity;GO:0051908//double-stranded DNA 5'-3' exodeoxyribonuclease activityGO:0002376//immune system process;GO:0002455//humoral immune response mediated by circulating immunoglobulin;GO:0006139//nucleobase-containing compound metabolic process;GO:0006281//DNA repair;GO:0006298//mismatch repair;GO:0006310//DNA recombination;GO:0006974//cellular response to DNA damage stimulus;GO:0016446//somatic hypermutation of immunoglobulin genes;GO:0045190//isotype switching;GO:0051321//meiotic cell cycle;GO:0090305//nucleic acid phosphodiester bond hydrolysis
ENSMUSG00000104158Gm3810000000000000000000000000predicted gene, 38100 [Source:MGI Symbol;Acc:MGI:5611328]Human Diseases;Human Diseases;Human Diseases;Human Diseases;Cellular Processes;Environmental Information Processing;Human Diseases;Cellular Processes;Cellular Processes;Cellular ProcessesNeurodegenerative diseases;Neurodegenerative diseases;Infectious diseases;Infectious diseases;Transport and catabolism;Signal transduction;Neurodegenerative disease;Transport and catabolism;Cell growth and death;Transport and catabolismko05010//Alzheimer disease;ko05016//Huntington disease;ko05131//Shigellosis;ko05167//Kaposi sarcoma-associated herpesvirus infection;ko04140//Autophagy - animal;ko04371//Apelin signaling pathway;ko05017//Spinocerebellar ataxia;ko04137//Mitophagy - animal;ko04215//Apoptosis - multiple species;ko04136//Autophagy - other eukaryotesK08334;K08334;K08334;K08334;K08334;K08334;K08334;K08334;K08334;K08334GO:0000407//phagophore assembly site;GO:0005737//cytoplasm;GO:0034271//phosphatidylinositol 3-kinase complex, class III, type I;GO:0034272//phosphatidylinositol 3-kinase complex, class III, type IIGO:0005515//protein binding;GO:0044877//protein-containing complex bindingGO:0000045//autophagosome assembly;GO:0006914//autophagy;GO:0006995//cellular response to nitrogen starvation;GO:0008333//endosome to lysosome transport;GO:0042593//glucose homeostasis;GO:0045324//late endosome to vacuole transport;GO:1990172//G protein-coupled receptor catabolic process
ENSMUSG00000057363Uxs10.03814203925165560.4515126554111870.09564034913815090.1328129056086570.05087379122562110.2543438376149010.1516358518537230.09031864812705630.1951483305276730.3334126410742480.08858764813416680.1042187806875960.4069625065314110.1709705912834520.2403719295560190.2285250136961440.3181252394076240.0569774597169360.2211102494179580.1835572434193180.10274673494320.0899015419351440.453115859707657UDP-glucuronate decarboxylase 1 [Source:MGI Symbol;Acc:MGI:1915133]Metabolism;MetabolismGlobal and overview maps;Carbohydrate metabolismko01100//Metabolic pathways;ko00520//Amino sugar and nucleotide sugar metabolismK08678;K08678GO:0005737//cytoplasm;GO:0005739//mitochondrion;GO:0005794//Golgi apparatus;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0032580//Golgi cisterna membrane;GO:1902494//catalytic complexGO:0016829//lyase activity;GO:0016831//carboxy-lyase activity;GO:0042802//identical protein binding;GO:0042803//protein homodimerization activity;GO:0048040//UDP-glucuronate decarboxylase activity;GO:0070403//NAD+ bindingGO:0033320//UDP-D-xylose biosynthetic process;GO:0042732//D-xylose metabolic process
ENSMUSG00000047216Cdh1900.00352711724978765000000000000000000000cadherin 19, type 2 [Source:MGI Symbol;Acc:MGI:3588198]----GO:0005886//plasma membrane;GO:0005912//adherens junction;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0016342//catenin complexGO:0005509//calcium ion binding;GO:0045296//cadherin binding;GO:0046872//metal ion bindingGO:0000902//cell morphogenesis;GO:0007043//cell-cell junction assembly;GO:0007155//cell adhesion;GO:0007156//homophilic cell adhesion via plasma membrane adhesion molecules;GO:0007275//multicellular organism development;GO:0016339//calcium-dependent cell-cell adhesion via plasma membrane cell adhesion molecules;GO:0034332//adherens junction organization;GO:0098609//cell-cell adhesion;GO:0098742//cell-cell adhesion via plasma-membrane adhesion molecules
ENSMUSG00000038702Dsel0.004674722884878140.09816810055982110.02395956474912990.05682191844958280.01833934779925020.1368875044878120.07184464918699530.01997115882642780.2047423698371880.1470318135315440.008507485656665880.03856239256091040.03475481025397830.07701743781735980.05848142060537390.07611854504659030.056541721611966000.04707114120854540.15588950551848800dermatan sulfate epimerase-like [Source:MGI Symbol;Acc:MGI:2442948]----GO:0005575//cellular_component;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0008146//sulfotransferase activity;GO:0016853//isomerase activity;GO:0047757//chondroitin-glucuronate 5-epimerase activityGO:0030204//chondroitin sulfate metabolic process;GO:0030205//dermatan sulfate metabolic process
ENSMUSG00000033021Gmppa0.01316382584652080.2441602448491470.1141101802252170.1409018540845390.1555842286173280.2270532117702980.2135593215746480.1562214648146790.3023715566181290.1945229856268340.1603881426794990.1413629887819680.2834547531812880.3436830670592930.1494204100335860.1983554201349440.2245817341985660.1763472586455550.2521203189253010.2029599773840860.15121702113507500.480948750607376GDP-mannose pyrophosphorylase A [Source:MGI Symbol;Acc:MGI:1916330]Metabolism;Metabolism;MetabolismGlobal and overview maps;Carbohydrate metabolism;Carbohydrate metabolismko01100//Metabolic pathways;ko00520//Amino sugar and nucleotide sugar metabolism;ko00051//Fructose and mannose metabolismK00966;K00966;K00966GO:0005575//cellular_component;GO:0005737//cytoplasmGO:0003674//molecular_function;GO:0016740//transferase activity;GO:0016779//nucleotidyltransferase activityGO:0009058//biosynthetic process
ENSMUSG00000055214Pld500000000000000000000000phospholipase D family, member 5 [Source:MGI Symbol;Acc:MGI:2442056]----GO:0005575//cellular_component;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0003674//molecular_function;GO:0003824//catalytic activityGO:0008150//biological_process
ENSMUSG00000078184Rbm8a20.002118254843873240.004657077069717350.001386239212166980.0031391772310891900.003186865017145330.006138480843637520.003169662526030850.008016007076005260.001363885086680350.00150229663598226000000.00468585994901784000000RNA binding motif protein 8A2 [Source:MGI Symbol;Acc:MGI:3612447]Genetic Information Processing;Genetic Information Processing;Genetic Information ProcessingTranslation;Transcription;Translationko03013//RNA transport;ko03040//Spliceosome;ko03015//mRNA surveillance pathwayK12876;K12876;K12876GO:0005634//nucleus;GO:0005681//spliceosomal complex;GO:0005737//cytoplasm;GO:0016607//nuclear speck;GO:0035145//exon-exon junction complexGO:0003676//nucleic acid binding;GO:0003723//RNA binding;GO:0003729//mRNA bindingGO:0000184//nuclear-transcribed mRNA catabolic process, nonsense-mediated decay;GO:0006396//RNA processing;GO:0006397//mRNA processing;GO:0008380//RNA splicing;GO:0051028//mRNA transport
ENSMUSG00000033007Asic400000000000000000000000acid-sensing (proton-gated) ion channel family member 4 [Source:MGI Symbol;Acc:MGI:2652846]Organismal SystemsSensory systemko04750//Inflammatory mediator regulation of TRP channelsK04831GO:0005887//integral component of plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0005272//sodium channel activity;GO:0015280//ligand-gated sodium channel activityGO:0001662//behavioral fear response;GO:0006811//ion transport;GO:0006814//sodium ion transport;GO:0035725//sodium ion transmembrane transport
ENSMUSG00000070695Cntnap5a00000000000000000000000contactin associated protein-like 5A [Source:MGI Symbol;Acc:MGI:3643623]----GO:0016020//membrane;GO:0016021//integral component of membraneGO:0004190//aspartic-type endopeptidase activityGO:0006508//proteolysis;GO:0007155//cell adhesion
ENSMUSG00000025909Sntg107.05288651417747e-04000000000000000000000syntrophin, gamma 1 [Source:MGI Symbol;Acc:MGI:1918346]----GO:0005856//cytoskeletonGO:0005198//structural molecule activity;GO:0005515//protein binding-
ENSMUSG00000094429Gm1996500000000000000000000000predicted gene, 19965 [Source:MGI Symbol;Acc:MGI:5012150]Human DiseasesInfectious diseasesko05168//Herpes simplex infectionK09228-GO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0046872//metal ion bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II
ENSMUSG00000100265Gm2836300000000000000000000000predicted gene 28363 [Source:MGI Symbol;Acc:MGI:5579069]Human DiseasesInfectious diseasesko05168//Herpes simplex infectionK09228-GO:0000977//RNA polymerase II regulatory region sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001227//DNA-binding transcription repressor activity, RNA polymerase II-specific;GO:0046872//metal ion bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II

细胞与亚群对照表:2.Cluster/2.cluster/Cells.cluster.list.xls

Tab 4-2-4 细胞与亚群对照表(前20行)
CellsClusterSamples
NP_AAACCCAAGATACATG8NP
NP_AAACCCAAGCACTCAT8NP
NP_AAACCCAAGCATTTCG1NP
NP_AAACCCAAGCCGGAAT10NP
NP_AAACCCAAGCTTAAGA0NP
NP_AAACCCAAGCTTACGT0NP
NP_AAACCCACATCAACCA14NP
NP_AAACCCATCAGATTGC19NP
NP_AAACCCATCATGCCAA0NP
NP_AAACCCATCCCGTGAG4NP
NP_AAACCCATCCGGACGT3NP
NP_AAACGAAAGCAGAAAG0NP
NP_AAACGAAAGTGTTCAC11NP
NP_AAACGAACATCATCTT2NP
NP_AAACGAACATCCTGTC0NP
NP_AAACGAAGTACAGAAT7NP
NP_AAACGAAGTATAGGAT3NP
NP_AAACGAAGTTAATGAG0NP
NP_AAACGAATCATTGGTG0NP
NP_AAACGAATCGCGCCAA1NP

4.3 分类结果可视化

基于细胞亚群分类的结果,进一步利用tSNE(tSNE,t-Distributed Stochastic Neighbor Embedding)非线性聚类的方法对单细胞亚群分类结果进行可视化[5]。tSNE 的方法通常对不同亚群细胞的分类结果有更佳的呈现效果(亚群间的隔离更加清晰)。

对所有样本的亚群分类可视化,结果如下:

Fig 4-3-1 单细胞亚群分类tSNE图

分别对各个样本的亚群分类可视化,结果如下:

  • 样本NP单细胞亚群分类tSNE图
  • 样本NPM单细胞亚群分类tSNE图
  • 样本PBS单细胞亚群分类tSNE图

Fig 4-3-2 各样本单细胞亚群分类tSNE图


4.4 单细胞亚群鉴定

以上单细胞亚群分类是基于细胞表达特征的相似性进行聚类的,每个亚群不具有生物学意义。所以,细胞鉴定一直是很重要但又比较繁琐的步骤。这里,我们使用singleR[6]对所有细胞进行自动化注释,为后续的人工细胞鉴定工作提供参考。

singleR是通过细胞与参考数据库中细胞类型的相似度来自动化鉴定细胞类型,对于相似度较高的细胞类型的注释准确性会降低。所以,singleR的注释结果只能作为辅助手段,最终的细胞亚群鉴定结果依然需要人工鉴定的确认。

Tab 4-4-1 各样本在各个细胞类型中细胞数量统计表
single_clustersNPNPMPBS
Total9294 (100%)13033 (100%)10694 (100%)
Adipocytes2 (0.02%)4 (0.03%)1 (0.01%)
B cells1632 (17.56%)2684 (20.59%)2178 (20.37%)
Cardiomyocytes12 (0.13%)15 (0.12%)6 (0.06%)
Dendritic cells136 (1.46%)495 (3.8%)223 (2.09%)
Endothelial cells33 (0.36%)76 (0.58%)9 (0.08%)
Epithelial cells2 (0.02%)5 (0.04%)1 (0.01%)
Erythrocytes2 (0.02%)0 (0%)3 (0.03%)
Fibroblasts14 (0.15%)3 (0.02%)0 (0%)
Granulocytes1967 (21.16%)2163 (16.6%)3948 (36.92%)
Macrophages2037 (21.92%)4041 (31.01%)1371 (12.82%)
Microglia9 (0.1%)1 (0.01%)5 (0.05%)
Monocytes1203 (12.94%)1050 (8.06%)980 (9.16%)
NK cells457 (4.92%)316 (2.42%)562 (5.26%)
T cells1788 (19.24%)2180 (16.73%)1407 (13.16%)

Fig 4-4-1 各样本中各细胞类型细胞数量堆叠图 Fig 4-4-2 各样本中各细胞类型细胞数量百分比堆叠图

Fig 4-4-3 各细胞类型中各个样本细胞数量堆叠图 Fig 4-4-4 各细胞类型中各个样本细胞数量百分比堆叠图

Fig 4-4-5 各细胞类型在tSNE图的分布

各细胞亚群中各个细胞类型数量统计表:4.CellAnnotation/Cell.annotation.stat.xls

Tab 4-4-2 各细胞亚群中各个细胞类型数量统计表
ClusterCell.annotation (Maximum proportion)AdipocytesB cellsCardiomyocytesDendritic cellsEndothelial cellsEpithelial cellsErythrocytesFibroblastsGranulocytesMacrophagesMicrogliaMonocytesNK cellsT cells
0Granulocytes(99.14%)0(0.00%)1(0.01%)0(0.00%)31(0.41%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)7585(99.14%)28(0.37%)0(0.00%)5(0.07%)0(0.00%)1(0.01%)
1Macrophages(72.70%)0(0.00%)10(0.15%)0(0.00%)495(7.52%)1(0.02%)0(0.00%)0(0.00%)3(0.05%)27(0.41%)4788(72.70%)0(0.00%)1261(19.15%)0(0.00%)1(0.02%)
2B cells(98.39%)0(0.00%)3480(98.39%)0(0.00%)2(0.06%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)25(0.71%)2(0.06%)0(0.00%)3(0.08%)0(0.00%)25(0.71%)
3T cells(98.20%)0(0.00%)6(0.21%)0(0.00%)9(0.32%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)19(0.67%)4(0.14%)0(0.00%)5(0.18%)8(0.28%)2775(98.20%)
4B cells(98.47%)0(0.00%)1994(98.47%)0(0.00%)1(0.05%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)30(1.48%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
5Macrophages(61.08%)0(0.00%)1(0.05%)0(0.00%)215(10.83%)0(0.00%)1(0.05%)0(0.00%)1(0.05%)227(11.43%)1213(61.08%)1(0.05%)324(16.31%)0(0.00%)3(0.15%)
6T cells(95.58%)0(0.00%)9(0.59%)0(0.00%)4(0.26%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)3(0.20%)1(0.07%)0(0.00%)1(0.07%)50(3.25%)1469(95.58%)
7NK cells(98.48%)0(0.00%)4(0.32%)0(0.00%)7(0.56%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1228(98.48%)8(0.64%)
8Macrophages(87.91%)0(0.00%)4(0.45%)0(0.00%)5(0.56%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)4(0.45%)778(87.91%)14(1.58%)75(8.47%)0(0.00%)5(0.56%)
9Monocytes(85.83%)0(0.00%)34(3.92%)0(0.00%)9(1.04%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1(0.12%)63(7.26%)0(0.00%)745(85.83%)1(0.12%)15(1.73%)
10B cells(99.35%)0(0.00%)770(99.35%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1(0.13%)0(0.00%)3(0.39%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1(0.13%)
11T cells(99.73%)0(0.00%)0(0.00%)0(0.00%)1(0.13%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1(0.13%)747(99.73%)
12Monocytes(95.58%)0(0.00%)0(0.00%)0(0.00%)5(0.71%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)17(2.43%)7(1.00%)0(0.00%)670(95.58%)0(0.00%)2(0.29%)
13B cells(37.46%)0(0.00%)118(37.46%)0(0.00%)14(4.44%)2(0.63%)0(0.00%)4(1.27%)0(0.00%)5(1.59%)91(28.89%)0(0.00%)37(11.75%)4(1.27%)40(12.70%)
14T cells(86.35%)0(0.00%)10(3.17%)0(0.00%)2(0.63%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)2(0.63%)1(0.32%)0(0.00%)1(0.32%)27(8.57%)272(86.35%)
15Macrophages(90.24%)0(0.00%)3(1.05%)0(0.00%)3(1.05%)0(0.00%)0(0.00%)0(0.00%)13(4.53%)5(1.74%)259(90.24%)0(0.00%)4(1.39%)0(0.00%)0(0.00%)
16Macrophages(91.35%)0(0.00%)0(0.00%)0(0.00%)3(1.44%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1(0.48%)190(91.35%)0(0.00%)13(6.25%)0(0.00%)1(0.48%)
17Granulocytes(48.53%)0(0.00%)0(0.00%)0(0.00%)31(22.79%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)66(48.53%)2(1.47%)0(0.00%)20(14.71%)15(11.03%)2(1.47%)
18Endothelial cells(87.02%)0(0.00%)7(5.34%)0(0.00%)0(0.00%)114(87.02%)0(0.00%)0(0.00%)0(0.00%)3(2.29%)4(3.05%)0(0.00%)1(0.76%)0(0.00%)2(1.53%)
19Monocytes(40.20%)7(6.86%)36(35.29%)0(0.00%)2(1.96%)1(0.98%)7(6.86%)0(0.00%)0(0.00%)1(0.98%)4(3.92%)0(0.00%)41(40.20%)0(0.00%)3(2.94%)
20Monocytes(44.83%)0(0.00%)6(10.34%)0(0.00%)15(25.86%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)9(15.52%)0(0.00%)26(44.83%)1(1.72%)1(1.72%)
21Granulocytes(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)54(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
22Cardiomyocytes(78.57%)0(0.00%)1(2.38%)33(78.57%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)5(11.90%)0(0.00%)1(2.38%)0(0.00%)2(4.76%)

Fig 4-4-6 Seurat分群与singleR细胞鉴定对应circos图

通过singleR,我们可以鉴定细胞的细胞类型;通过Seurat,我们可以得到细胞的聚类信息。通过这两个软件,我们可以将一组细胞按照细胞类型和细胞分群两种方式进行聚类。然后,我们计算各个细胞亚群的细胞与各个细胞类型的细胞之间的相关性,并绘制成热图,作为singleR细胞鉴定结果准确性的一个佐证。

Fig 4-4-7 Seurat分群与singleR鉴定细胞类型相关性热图

广州基迪奥生物科技有限公司




5 亚群上调表达基因分析

5.1 上调表达基因分析

为了了解各个细胞亚群的分子表达特征,我们可以筛选各个细胞亚群上调表达的基因。

采用Seurat的秩和检验分别对不同类细胞群进行基因差异表达分析,筛选亚群上调表达的基因。

上调基因的筛选条件为

  1. 目标亚群或对照亚群中,基因在25%以上的细胞中有表达。
  2. P值 ≤0.01;
  3. 基因表达倍数log2FC≥0.36,即基因上调的倍数≥1.28。

Tab 5-1-1 各亚群上调基因数量统计表
Cluster012345678910111213141516171819202122
Number of DE genes478142353046658463068694440868281165449666167716609295393451256529422257

Fig 5-1-1 各亚群上调基因数量统计柱状图

各亚群差异基因注释表: 3.MarkerGene/DeGene.list.xls

Tab 5-1-2 各亚群差异基因注释表(前20行)
Target_ClusterGene IDGene NameTarget_Cluster_meanOther_Cluster_meanLog2FCPvalueQvalueDescriptionKEGG_A_classKEGG_B_classPathwayK_IDGO ComponentGO FunctionGO Process
0ENSMUSG00000059657Stfa2l18.889425800262760.1057562706359626.3932751201309700stefin A2 like 1 [Source:MGI Symbol;Acc:MGI:3524944]----GO:0005829//cytosolGO:0004866//endopeptidase inhibitor activity;GO:0004869//cysteine-type endopeptidase inhibitor activityGO:0010951//negative regulation of endopeptidase activity
0ENSMUSG00000079597Cstdc48.898901905817690.115565235379516.2668479502896600cystatin domain containing 4 [Source:MGI Symbol;Acc:MGI:3645124]----GO:0001533//cornified envelope;GO:0005615//extracellular space;GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0005829//cytosolGO:0002020//protease binding;GO:0004866//endopeptidase inhibitor activity;GO:0004869//cysteine-type endopeptidase inhibitor activityGO:0010951//negative regulation of endopeptidase activity
0ENSMUSG00000044103Il1f93.506961436309980.04886880447401896.1651639451700400interleukin 1 family, member 9 [Source:MGI Symbol;Acc:MGI:2449929]Environmental Information ProcessingSignaling molecules and interactionko04060//Cytokine-cytokine receptor interactionK05487GO:0005576//extracellular region;GO:0005615//extracellular spaceGO:0005125//cytokine activityGO:0006954//inflammatory response;GO:0006955//immune response;GO:0007165//signal transduction
0ENSMUSG00000026073Il1r216.77120313192360.257097515851886.0275267078748300interleukin 1 receptor, type II [Source:MGI Symbol;Acc:MGI:96546]Human Diseases;Environmental Information Processing;Human Diseases;Human Diseases;Organismal Systems;Human Diseases;Human DiseasesCancers;Signaling molecules and interaction;Infectious diseases;Infectious diseases;Immune system;Cardiovascular diseases;Cancersko05202//Transcriptional misregulation in cancers;ko04060//Cytokine-cytokine receptor interaction;ko05166//HTLV-I infection;ko05146//Amoebiasis;ko04640//Hematopoietic cell lineage;ko05418//Fluid shear stress and atherosclerosis;ko05215//Prostate cancerK04387;K04387;K04387;K04387;K04387;K04387;K04387GO:0005576//extracellular region;GO:0005737//cytoplasm;GO:0005886//plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0004908//interleukin-1 receptor activity;GO:0004910//interleukin-1, type II, blocking receptor activity;GO:0019966//interleukin-1 bindingGO:0010955//negative regulation of protein processing;GO:0019221//cytokine-mediated signaling pathway;GO:0032690//negative regulation of interleukin-1 alpha production;GO:1900016//negative regulation of cytokine production involved in inflammatory response;GO:2000660//negative regulation of interleukin-1-mediated signaling pathway
0ENSMUSG00000096719Mrgpra2b3.23712401319990.04999088369039926.0169037957607100MAS-related GPR, member A2B [Source:MGI Symbol;Acc:MGI:3033098]----GO:0016020//membrane;GO:0016021//integral component of membraneGO:0004930//G protein-coupled receptor activityGO:0007186//G protein-coupled receptor signaling pathway
0ENSMUSG00000009633G0s2213.2433973682493.320291745568026.0050472489268900G0/G1 switch gene 2 [Source:MGI Symbol;Acc:MGI:1316737]----GO:0005739//mitochondrion;GO:0016020//membrane;GO:0016021//integral component of membrane-GO:0006915//apoptotic process;GO:0097191//extrinsic apoptotic signaling pathway;GO:0120162//positive regulation of cold-induced thermogenesis;GO:2001238//positive regulation of extrinsic apoptotic signaling pathway
0ENSMUSG00000033508Asprv17.950560737678360.12761606275715.9611747799364500aspartic peptidase, retroviral-like 1 [Source:MGI Symbol;Acc:MGI:1915105]----GO:0016020//membrane;GO:0016021//integral component of membraneGO:0004190//aspartic-type endopeptidase activity;GO:0008233//peptidase activity;GO:0016787//hydrolase activityGO:0006508//proteolysis;GO:0016485//protein processing;GO:0043588//skin development
0ENSMUSG00000037095Lrg17.751147190168320.1248164383837355.9565299934072800leucine-rich alpha-2-glycoprotein 1 [Source:MGI Symbol;Acc:MGI:1924155]----GO:0005615//extracellular space;GO:0043231//intracellular membrane-bounded organelleGO:0005160//transforming growth factor beta receptor binding;GO:0005515//protein bindingGO:0001938//positive regulation of endothelial cell proliferation;GO:0009617//response to bacterium;GO:0030511//positive regulation of transforming growth factor beta receptor signaling pathway;GO:0045766//positive regulation of angiogenesis;GO:0050873//brown fat cell differentiation
0ENSMUSG00000022126Acod119.03528782035340.319234612470815.897915492103400aconitate decarboxylase 1 [Source:MGI Symbol;Acc:MGI:103206]Metabolism;MetabolismGlobal and overview maps;Carbohydrate metabolismko01100//Metabolic pathways;ko00660//C5-Branched dibasic acid metabolismK17724;K17724-GO:0016829//lyase activity;GO:0047613//aconitate decarboxylase activityGO:0002760//positive regulation of antimicrobial humoral response;GO:0006952//defense response;GO:0071222//cellular response to lipopolysaccharide;GO:0072573//tolerance induction to lipopolysaccharide
0ENSMUSG00000035692Isg158.388764692220250.1419551497391235.8849512866684700ISG15 ubiquitin-like modifier [Source:MGI Symbol;Acc:MGI:1855694]Human Diseases;Human Diseases;Organismal SystemsInfectious diseases;Infectious diseases;Immune systemko05169//Epstein-Barr virus infection;ko05165//Human papillomavirus infection;ko04622//RIG-I-like receptor signaling pathwayK12159;K12159;K12159GO:0005576//extracellular region;GO:0005634//nucleus;GO:0005737//cytoplasm;GO:0022627//cytosolic small ribosomal subunitGO:0005178//integrin binding;GO:0005515//protein binding;GO:0031386//protein tag;GO:0031625//ubiquitin protein ligase bindingGO:0007229//integrin-mediated signaling pathway;GO:0009615//response to virus;GO:0009617//response to bacterium;GO:0019941//modification-dependent protein catabolic process;GO:0030501//positive regulation of bone mineralization;GO:0031397//negative regulation of protein ubiquitination;GO:0032020//ISG15-protein conjugation;GO:0032649//regulation of interferon-gamma production;GO:0032729//positive regulation of interferon-gamma production;GO:0032733//positive regulation of interleukin-10 production;GO:0034340//response to type I interferon;GO:0042742//defense response to bacterium;GO:0045071//negative regulation of viral genome replication;GO:0045648//positive regulation of erythrocyte differentiation;GO:0051607//defense response to virus;GO:0060339//negative regulation of type I interferon-mediated signaling pathway
0ENSMUSG00000056071S100a9711.73300004003313.21982434787435.7505612162059200S100 calcium binding protein A9 (calgranulin B) [Source:MGI Symbol;Acc:MGI:1338947]Organismal SystemsImmune systemko04657//IL-17 signaling pathwayK21128GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0005829//cytosol;GO:0005856//cytoskeleton;GO:0005886//plasma membrane;GO:0016020//membrane;GO:0030054//cell junctionGO:0005509//calcium ion binding;GO:0016209//antioxidant activity;GO:0046872//metal ion binding;GO:0048306//calcium-dependent protein bindingGO:0002376//immune system process;GO:0002523//leukocyte migration involved in inflammatory response;GO:0002544//chronic inflammatory response;GO:0002793//positive regulation of peptide secretion;GO:0006417//regulation of translation;GO:0006914//autophagy;GO:0006915//apoptotic process;GO:0006919//activation of cysteine-type endopeptidase activity involved in apoptotic process;GO:0006935//chemotaxis;GO:0006954//inflammatory response;GO:0010976//positive regulation of neuron projection development;GO:0014002//astrocyte development;GO:0018119//peptidyl-cysteine S-nitrosylation;GO:0030194//positive regulation of blood coagulation;GO:0030593//neutrophil chemotaxis;GO:0030595//leukocyte chemotaxis;GO:0031532//actin cytoskeleton reorganization;GO:0032496//response to lipopolysaccharide;GO:0035425//autocrine signaling;GO:0035606//peptidyl-cysteine S-trans-nitrosylation;GO:0035821//modification of morphology or physiology of other organism;GO:0045087//innate immune response;GO:0045113//regulation of integrin biosynthetic process;GO:0050729//positive regulation of inflammatory response;GO:0061844//antimicrobial humoral immune response mediated by antimicrobial peptide;GO:0070488//neutrophil aggregation;GO:0098869//cellular oxidant detoxification;GO:2001244//positive regulation of intrinsic apoptotic signaling pathway
0ENSMUSG00000042265Trem118.83671537310580.3567755442295315.7223868819231400triggering receptor expressed on myeloid cells 1 [Source:MGI Symbol;Acc:MGI:1930005]----GO:0005886//plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0038023//signaling receptor activity;GO:0097110//scaffold protein bindingGO:0002250//adaptive immune response;GO:0002376//immune system process;GO:0002526//acute inflammatory response;GO:0016477//cell migration;GO:0030593//neutrophil chemotaxis;GO:0045087//innate immune response;GO:0070945//neutrophil mediated killing of gram-negative bacterium;GO:0072672//neutrophil extravasation
0ENSMUSG00000027398Il1b365.6585427861496.956246551953865.7160441901609900interleukin 1 beta [Source:MGI Symbol;Acc:MGI:96543]Human Diseases;Human Diseases;Human Diseases;Environmental Information Processing;Human Diseases;Environmental Information Processing;Environmental Information Processing;Human Diseases;Human Diseases;Human Diseases;Human Diseases;Human Diseases;Organismal Systems;Organismal Systems;Human Diseases;Human Diseases;Human Diseases;Cellular Processes;Human Diseases;Human Diseases;Human Diseases;Human Diseases;Human Diseases;Organismal Systems;Organismal Systems;Organismal Systems;Environmental Information Processing;Organismal Systems;Human Diseases;Human Diseases;Human Diseases;Organismal Systems;Organismal Systems;Human Diseases;Human Diseases;Human Diseases;Organismal Systems;Human Diseases;Human Diseases;Human DiseasesInfectious diseases;Infectious diseases;Neurodegenerative diseases;Signaling molecules and interaction;Infectious diseases;Signal transduction;Signal transduction;Immune diseases;Infectious diseases;Infectious diseases;Infectious diseases;Infectious diseases;Immune system;Immune system;Infectious diseases;Infectious diseases;Infectious diseases;Cell growth and death;Infectious disease: bacterial;Infectious diseases;Cardiovascular diseases;Endocrine and metabolic diseases;Infectious diseases;Immune system;Sensory system;Development;Signal transduction;Immune system;Endocrine and metabolic diseases;Immune diseases;Endocrine and metabolic diseases;Immune system;Immune system;Infectious diseases;Immune diseases;Infectious diseases;Immune system;Infectious diseases;Neurodegenerative diseases;Drug resistanceko05168//Herpes simplex infection;ko05130//Pathogenic Escherichia coli infection;ko05010//Alzheimer disease;ko04060//Cytokine-cytokine receptor interaction;ko05152//Tuberculosis;ko04010//MAPK signaling pathway;ko04064//NF-kappa B signaling pathway;ko05323//Rheumatoid arthritis;ko05163//Human cytomegalovirus infection;ko05131//Shigellosis;ko05146//Amoebiasis;ko05132//Salmonella infection;ko04621//NOD-like receptor signaling pathway;ko04640//Hematopoietic cell lineage;ko05162//Measles;ko05140//Leishmaniasis;ko05164//Influenza A;ko04217//Necroptosis;ko05135//Yersinia infection;ko05143//African trypanosomiasis;ko05418//Fluid shear stress and atherosclerosis;ko04932//Non-alcoholic fatty liver disease (NAFLD);ko05142//Chagas disease (American trypanosomiasis);ko04659//Th17 cell differentiation;ko04750//Inflammatory mediator regulation of TRP channels;ko04380//Osteoclast differentiation;ko04668//TNF signaling pathway;ko04625//C-type lectin receptor signaling pathway;ko04940//Type I diabetes mellitus;ko05321//Inflammatiory bowel disease (IBD);ko04933//AGE-RAGE signaling pathway in diabetic complications;ko04620//Toll-like receptor signaling pathway;ko04657//IL-17 signaling pathway;ko05144//Malaria;ko05332//Graft-versus-host disease;ko05133//Pertussis;ko04623//Cytosolic DNA-sensing pathway;ko05134//Legionellosis;ko05020//Prion diseases;ko01523//Antifolate resistanceK04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519;K04519GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005737//cytoplasm;GO:0005764//lysosome;GO:0005776//autophagosome;GO:0005829//cytosol;GO:0030141//secretory granule;GO:0031410//cytoplasmic vesicle;GO:0031982//vesicleGO:0005125//cytokine activity;GO:0005149//interleukin-1 receptor binding;GO:0005178//integrin binding;GO:0019904//protein domain specific bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0000165//MAPK cascade;GO:0000187//activation of MAPK activity;GO:0001660//fever generation;GO:0001934//positive regulation of protein phosphorylation;GO:0006954//inflammatory response;GO:0006955//immune response;GO:0007204//positive regulation of cytosolic calcium ion concentration;GO:0007613//memory;GO:0008284//positive regulation of cell proliferation;GO:0008285//negative regulation of cell proliferation;GO:0009743//response to carbohydrate;GO:0010573//vascular endothelial growth factor production;GO:0010575//positive regulation of vascular endothelial growth factor production;GO:0010628//positive regulation of gene expression;GO:0010629//negative regulation of gene expression;GO:0010718//positive regulation of epithelial to mesenchymal transition;GO:0010829//negative regulation of glucose transmembrane transport;GO:0010942//positive regulation of cell death;GO:0014050//negative regulation of glutamate secretion;GO:0019221//cytokine-mediated signaling pathway;GO:0030213//hyaluronan biosynthetic process;GO:0030335//positive regulation of cell migration;GO:0030593//neutrophil chemotaxis;GO:0030730//sequestering of triglyceride;GO:0031394//positive regulation of prostaglandin biosynthetic process;GO:0031622//positive regulation of fever generation;GO:0031663//lipopolysaccharide-mediated signaling pathway;GO:0032308//positive regulation of prostaglandin secretion;GO:0032496//response to lipopolysaccharide;GO:0032722//positive regulation of chemokine production;GO:0032725//positive regulation of granulocyte macrophage colony-stimulating factor production;GO:0032729//positive regulation of interferon-gamma production;GO:0032743//positive regulation of interleukin-2 production;GO:0032755//positive regulation of interleukin-6 production;GO:0032757//positive regulation of interleukin-8 production;GO:0032874//positive regulation of stress-activated MAPK cascade;GO:0033092//positive regulation of immature T cell proliferation in thymus;GO:0033198//response to ATP;GO:0034116//positive regulation of heterotypic cell-cell adhesion;GO:0035176//social behavior;GO:0035234//ectopic germ cell programmed cell death;GO:0035505//positive regulation of myosin light chain kinase activity;GO:0035690//cellular response to drug;GO:0042102//positive regulation of T cell proliferation;GO:0043065//positive regulation of apoptotic process;GO:0043122//regulation of I-kappaB kinase/NF-kappaB signaling;GO:0043123//positive regulation of I-kappaB kinase/NF-kappaB signaling;GO:0043407//negative regulation of MAP kinase activity;GO:0043491//protein kinase B signaling;GO:0043507//positive regulation of JUN kinase activity;GO:0043525//positive regulation of neuron apoptotic process;GO:0045429//positive regulation of nitric oxide biosynthetic process;GO:0045665//negative regulation of neuron differentiation;GO:0045687//positive regulation of glial cell differentiation;GO:0045766//positive regulation of angiogenesis;GO:0045833//negative regulation of lipid metabolic process;GO:0045840//positive regulation of mitotic nuclear division;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045917//positive regulation of complement activation;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0046330//positive regulation of JNK cascade;GO:0046627//negative regulation of insulin receptor signaling pathway;GO:0048143//astrocyte activation;GO:0048711//positive regulation of astrocyte differentiation;GO:0050691//regulation of defense response to virus by host;GO:0050729//positive regulation of inflammatory response;GO:0050766//positive regulation of phagocytosis;GO:0050767//regulation of neurogenesis;GO:0050768//negative regulation of neurogenesis;GO:0050796//regulation of insulin secretion;GO:0050805//negative regulation of synaptic transmission;GO:0050900//leukocyte migration;GO:0050995//negative regulation of lipid catabolic process;GO:0050996//positive regulation of lipid catabolic process;GO:0050999//regulation of nitric-oxide synthase activity;GO:0051044//positive regulation of membrane protein ectodomain proteolysis;GO:0051091//positive regulation of DNA-binding transcription factor activity;GO:0051092//positive regulation of NF-kappaB transcription factor activity;GO:0051781//positive regulation of cell division;GO:0060252//positive regulation of glial cell proliferation;GO:0060559//positive regulation of calcidiol 1-monooxygenase activity;GO:0070164//negative regulation of adiponectin secretion;GO:0070372//regulation of ERK1 and ERK2 cascade;GO:0070374//positive regulation of ERK1 and ERK2 cascade;GO:0070487//monocyte aggregation;GO:0070498//interleukin-1-mediated signaling pathway;GO:0070555//response to interleukin-1;GO:0071222//cellular response to lipopolysaccharide;GO:0071310//cellular response to organic substance;GO:0071407//cellular response to organic cyclic compound;GO:0071639//positive regulation of monocyte chemotactic protein-1 production;GO:0090023//positive regulation of neutrophil chemotaxis;GO:0097192//extrinsic apoptotic signaling pathway in absence of ligand;GO:1900745//positive regulation of p38MAPK cascade;GO:1901224//positive regulation of NIK/NF-kappaB signaling;GO:1902680//positive regulation of RNA biosynthetic process;GO:1903140//regulation of establishment of endothelial barrier;GO:1903597//negative regulation of gap junction assembly;GO:2000173//negative regulation of branching morphogenesis of a nerve;GO:2000178//negative regulation of neural precursor cell proliferation;GO:2000556//positive regulation of T-helper 1 cell cytokine production;GO:2001240//negative regulation of extrinsic apoptotic signaling pathway in absence of ligand
0ENSMUSG00000032487Ptgs224.660641234010.484968854748095.6681744046717100prostaglandin-endoperoxide synthase 2 [Source:MGI Symbol;Acc:MGI:97798]Metabolism;Human Diseases;Human Diseases;Human Diseases;Environmental Information Processing;Human Diseases;Human Diseases;Human Diseases;Human Diseases;Organismal Systems;Organismal Systems;Organismal Systems;Environmental Information Processing;Organismal Systems;Human Diseases;Organismal Systems;Human Diseases;Metabolism;Organismal Systems;Environmental Information Processing;Organismal SystemsGlobal and overview maps;Cancers;Infectious diseases;Neurodegenerative diseases;Signal transduction;Infectious diseases;Infectious diseases;Infectious diseases;Cancers;Endocrine system;Nervous system;Nervous system;Signal transduction;Immune system;Cancers;Immune system;Cancers;Lipid metabolism;Endocrine system;Signal transduction;Endocrine systemko01100//Metabolic pathways;ko05200//Pathways in cancer;ko05165//Human papillomavirus infection;ko05010//Alzheimer disease;ko04064//NF-kappa B signaling pathway;ko05163//Human cytomegalovirus infection;ko05167//Kaposi sarcoma-associated herpesvirus infection;ko05140//Leishmaniasis;ko05206//MicroRNAs in cancer;ko04921//Oxytocin signaling pathway;ko04723//Retrograde endocannabinoid signaling;ko04726//Serotonergic synapse;ko04668//TNF signaling pathway;ko04625//C-type lectin receptor signaling pathway;ko05204//Chemical carcinogenesis;ko04657//IL-17 signaling pathway;ko05222//Small cell lung cancer;ko00590//Arachidonic acid metabolism;ko04913//Ovarian Steroidogenesis;ko04370//VEGF signaling pathway;ko04923//Regulation of lipolysis in adipocyteK11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987;K11987GO:0005634//nucleus;GO:0005637//nuclear inner membrane;GO:0005640//nuclear outer membrane;GO:0005737//cytoplasm;GO:0005783//endoplasmic reticulum;GO:0005789//endoplasmic reticulum membrane;GO:0005901//caveola;GO:0016020//membrane;GO:0032991//protein-containing complex;GO:0043005//neuron projection;GO:0043231//intracellular membrane-bounded organelleGO:0004601//peroxidase activity;GO:0004666//prostaglandin-endoperoxide synthase activity;GO:0005515//protein binding;GO:0016491//oxidoreductase activity;GO:0016702//oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen;GO:0019899//enzyme binding;GO:0020037//heme binding;GO:0042803//protein homodimerization activity;GO:0046872//metal ion binding;GO:0051213//dioxygenase activityGO:0001516//prostaglandin biosynthetic process;GO:0001525//angiogenesis;GO:0006629//lipid metabolic process;GO:0006631//fatty acid metabolic process;GO:0006633//fatty acid biosynthetic process;GO:0006693//prostaglandin metabolic process;GO:0006954//inflammatory response;GO:0006979//response to oxidative stress;GO:0007566//embryo implantation;GO:0007568//aging;GO:0007612//learning;GO:0007613//memory;GO:0008217//regulation of blood pressure;GO:0008284//positive regulation of cell proliferation;GO:0008285//negative regulation of cell proliferation;GO:0009314//response to radiation;GO:0009750//response to fructose;GO:0010033//response to organic substance;GO:0010042//response to manganese ion;GO:0010226//response to lithium ion;GO:0010243//response to organonitrogen compound;GO:0010575//positive regulation of vascular endothelial growth factor production;GO:0010942//positive regulation of cell death;GO:0014070//response to organic cyclic compound;GO:0019233//sensory perception of pain;GO:0019371//cyclooxygenase pathway;GO:0030216//keratinocyte differentiation;GO:0030282//bone mineralization;GO:0030728//ovulation;GO:0031394//positive regulation of prostaglandin biosynthetic process;GO:0031622//positive regulation of fever generation;GO:0031915//positive regulation of synaptic plasticity;GO:0032227//negative regulation of synaptic transmission, dopaminergic;GO:0032355//response to estradiol;GO:0032496//response to lipopolysaccharide;GO:0033138//positive regulation of peptidyl-serine phosphorylation;GO:0033280//response to vitamin D;GO:0034097//response to cytokine;GO:0034605//cellular response to heat;GO:0034612//response to tumor necrosis factor;GO:0034644//cellular response to UV;GO:0035633//maintenance of permeability of blood-brain barrier;GO:0042127//regulation of cell proliferation;GO:0042307//positive regulation of protein import into nucleus;GO:0042493//response to drug;GO:0042633//hair cycle;GO:0043065//positive regulation of apoptotic process;GO:0043066//negative regulation of apoptotic process;GO:0043154//negative regulation of cysteine-type endopeptidase activity involved in apoptotic process;GO:0045429//positive regulation of nitric oxide biosynthetic process;GO:0045786//negative regulation of cell cycle;GO:0045907//positive regulation of vasoconstriction;GO:0045986//negative regulation of smooth muscle contraction;GO:0045987//positive regulation of smooth muscle contraction;GO:0046697//decidualization;GO:0048661//positive regulation of smooth muscle cell proliferation;GO:0050873//brown fat cell differentiation;GO:0051384//response to glucocorticoid;GO:0051926//negative regulation of calcium ion transport;GO:0051968//positive regulation of synaptic transmission, glutamatergic;GO:0055114//oxidation-reduction process;GO:0070542//response to fatty acid;GO:0071260//cellular response to mechanical stimulus;GO:0071284//cellular response to lead ion;GO:0071318//cellular response to ATP;GO:0071456//cellular response to hypoxia;GO:0071471//cellular response to non-ionic osmotic stress;GO:0071498//cellular response to fluid shear stress;GO:0071636//positive regulation of transforming growth factor beta production;GO:0090050//positive regulation of cell migration involved in sprouting angiogenesis;GO:0090271//positive regulation of fibroblast growth factor production;GO:0090336//positive regulation of brown fat cell differentiation;GO:0090362//positive regulation of platelet-derived growth factor production;GO:0098869//cellular oxidant detoxification;GO:0150077//regulation of neuroinflammatory response;GO:1902219//negative regulation of intrinsic apoptotic signaling pathway in response to osmotic stress;GO:1990776//response to angiotensin
0ENSMUSG00000026180Cxcr211.55273667393280.2272005978639355.6681242039952200chemokine (C-X-C motif) receptor 2 [Source:MGI Symbol;Acc:MGI:105303]Environmental Information Processing;Environmental Information Processing;Cellular Processes;Human Diseases;Organismal Systems;Environmental Information Processing;Human DiseasesSignaling molecules and interaction;Signal transduction;Transport and catabolism;Infectious diseases;Immune system;Signaling molecules and interaction;Infectious diseasesko04060//Cytokine-cytokine receptor interaction;ko04072//Phospholipase D signaling pathway;ko04144//Endocytosis;ko05163//Human cytomegalovirus infection;ko04062//Chemokine signaling pathway;ko04061//Viral protein interaction with cytokine and cytokine receptor;ko05120//Epithelial cell signaling in Helicobacter pylori infectionK05050;K05050;K05050;K05050;K05050;K05050;K05050GO:0005654//nucleoplasm;GO:0005886//plasma membrane;GO:0009897//external side of plasma membrane;GO:0009986//cell surface;GO:0015630//microtubule cytoskeleton;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0042629//mast cell granule;GO:0072686//mitotic spindleGO:0004918//interleukin-8 receptor activity;GO:0004930//G protein-coupled receptor activity;GO:0004950//chemokine receptor activity;GO:0005515//protein binding;GO:0016493//C-C chemokine receptor activity;GO:0016494//C-X-C chemokine receptor activity;GO:0019957//C-C chemokine binding;GO:0019959//interleukin-8 bindingGO:0002438//acute inflammatory response to antigenic stimulus;GO:0002690//positive regulation of leukocyte chemotaxis;GO:0006935//chemotaxis;GO:0006955//immune response;GO:0006968//cellular defense response;GO:0007165//signal transduction;GO:0007166//cell surface receptor signaling pathway;GO:0007186//G protein-coupled receptor signaling pathway;GO:0007200//phospholipase C-activating G protein-coupled receptor signaling pathway;GO:0007204//positive regulation of cytosolic calcium ion concentration;GO:0008284//positive regulation of cell proliferation;GO:0010666//positive regulation of cardiac muscle cell apoptotic process;GO:0019722//calcium-mediated signaling;GO:0030593//neutrophil chemotaxis;GO:0030901//midbrain development;GO:0031623//receptor internalization;GO:0033030//negative regulation of neutrophil apoptotic process;GO:0038112//interleukin-8-mediated signaling pathway;GO:0042119//neutrophil activation;GO:0043066//negative regulation of apoptotic process;GO:0043117//positive regulation of vascular permeability;GO:0045766//positive regulation of angiogenesis;GO:0060326//cell chemotaxis;GO:0070098//chemokine-mediated signaling pathway;GO:0072173//metanephric tubule morphogenesis;GO:0090023//positive regulation of neutrophil chemotaxis
0ENSMUSG00000041754Trem34.236002749162250.08571233103269935.6270569423474200triggering receptor expressed on myeloid cells 3 [Source:MGI Symbol;Acc:MGI:1930003]----GO:0005886//plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0005515//protein bindingGO:0016477//cell migration;GO:0030593//neutrophil chemotaxis;GO:0045089//positive regulation of innate immune response;GO:0070945//neutrophil mediated killing of gram-negative bacterium;GO:0072672//neutrophil extravasation
0ENSMUSG00000022237Ankrd33b5.953213885634490.1269798475984325.5509972714733400ankyrin repeat domain 33B [Source:MGI Symbol;Acc:MGI:1917904]----GO:0005575//cellular_componentGO:0003674//molecular_function;GO:0003677//DNA binding;GO:0005515//protein bindingGO:0008150//biological_process
0ENSMUSG00000056054S100a8296.7442598986836.558856209052995.4996320660538900S100 calcium binding protein A8 (calgranulin A) [Source:MGI Symbol;Acc:MGI:88244]Organismal SystemsImmune systemko04657//IL-17 signaling pathwayK21127GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005737//cytoplasm;GO:0005829//cytosol;GO:0005856//cytoskeleton;GO:0005886//plasma membrane;GO:0016020//membrane;GO:0045111//intermediate filament cytoskeletonGO:0005509//calcium ion binding;GO:0008017//microtubule binding;GO:0008270//zinc ion binding;GO:0016209//antioxidant activity;GO:0035662//Toll-like receptor 4 binding;GO:0046872//metal ion binding;GO:0048306//calcium-dependent protein binding;GO:0050544//arachidonic acid binding;GO:0050786//RAGE receptor bindingGO:0002376//immune system process;GO:0002523//leukocyte migration involved in inflammatory response;GO:0002793//positive regulation of peptide secretion;GO:0006914//autophagy;GO:0006915//apoptotic process;GO:0006919//activation of cysteine-type endopeptidase activity involved in apoptotic process;GO:0006935//chemotaxis;GO:0006954//inflammatory response;GO:0014002//astrocyte development;GO:0018119//peptidyl-cysteine S-nitrosylation;GO:0030593//neutrophil chemotaxis;GO:0032496//response to lipopolysaccharide;GO:0035425//autocrine signaling;GO:0045087//innate immune response;GO:0050727//regulation of inflammatory response;GO:0050729//positive regulation of inflammatory response;GO:0070488//neutrophil aggregation;GO:0098869//cellular oxidant detoxification;GO:2001244//positive regulation of intrinsic apoptotic signaling pathway
0ENSMUSG00000028859Csf3r22.14230798025410.5010261502349555.4657758930589400colony stimulating factor 3 receptor (granulocyte) [Source:MGI Symbol;Acc:MGI:1339755]Human Diseases;Environmental Information Processing;Environmental Information Processing;Organismal Systems;Environmental Information ProcessingCancers;Signal transduction;Signaling molecules and interaction;Immune system;Signal transductionko05200//Pathways in cancer;ko04151//PI3K-Akt signaling pathway;ko04060//Cytokine-cytokine receptor interaction;ko04640//Hematopoietic cell lineage;ko04630//Jak-STAT signaling pathwayK05061;K05061;K05061;K05061;K05061GO:0005886//plasma membrane;GO:0009897//external side of plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0043235//receptor complexGO:0004896//cytokine receptor activity;GO:0005515//protein binding;GO:0019955//cytokine binding;GO:0051916//granulocyte colony-stimulating factor bindingGO:0007155//cell adhesion;GO:0019221//cytokine-mediated signaling pathway;GO:0030593//neutrophil chemotaxis;GO:0045637//regulation of myeloid cell differentiation;GO:0097186//amelogenesis
0ENSMUSG00000027068Dhrs92.57651199255280.05984701506345835.4279962044639500dehydrogenase/reductase (SDR family) member 9 [Source:MGI Symbol;Acc:MGI:2442798]Metabolism;MetabolismGlobal and overview maps;Metabolism of cofactors and vitaminsko01100//Metabolic pathways;ko00830//Retinol metabolismK11149;K11149GO:0005783//endoplasmic reticulum;GO:0005789//endoplasmic reticulum membrane;GO:0016020//membrane;GO:0030176//integral component of endoplasmic reticulum membrane;GO:0043231//intracellular membrane-bounded organelleGO:0004022//alcohol dehydrogenase (NAD) activity;GO:0004745//retinol dehydrogenase activity;GO:0016491//oxidoreductase activity;GO:0047023//androsterone dehydrogenase activity;GO:0047035//testosterone dehydrogenase (NAD+) activity;GO:0047044//androstan-3-alpha,17-beta-diol dehydrogenase activityGO:0002138//retinoic acid biosynthetic process;GO:0006629//lipid metabolic process;GO:0008202//steroid metabolic process;GO:0008209//androgen metabolic process;GO:0042448//progesterone metabolic process;GO:0042904//9-cis-retinoic acid biosynthetic process;GO:0055114//oxidation-reduction process

5.2 上调基因表达分布

图形是比表格更优秀的数据呈现形式。我们使用热图、tSNE图、密度分布图、小提琴图和气泡图来可视化基因在细胞和各个细胞亚群的表达分布情况。(选择亚群上调top5的基因用于绘图)

Fig 5-2-1 标记基因表达热图

Fig 5-2-2 标记基因表达分布气泡图

上调基因表达分布图只展示其中5个基因,其他top5的基因请浏览文件夹3.MarkerGene/Plots/ExpPlot

  • Ace
  • Adgre4
  • Arg1
  • Atp2a1
  • C1qa

Fig 5-2-3 标记基因表达分布图


上调基因表达分布密度图只展示其中5个基因,其他top5的基因请浏览文件夹3.MarkerGene/Plots/DensityPlot

  • Ace
  • Adgre4
  • Arg1
  • Atp2a1
  • C1qa

Fig 5-2-4 标记基因表达分布密度图


上调基因表达分布小提琴图只展示其中5个基因,其他top5的基因请浏览文件夹3.MarkerGene/Plots/ViolinPlot

  • Ace
  • Adgre4
  • Arg1
  • Atp2a1
  • C1qa

Fig 5-2-5 标记基因表达分布小提琴图


5.3 GO富集分析

       Gene Ontology(简称GO)是一个国际标准化的基因功能分类体系,提供了一套动态更新的标准词汇表(controlled vocabulary)来全面描述生物体中基因和基因产物的属性。GO总共有三个ontology(本体),分别描述基因的分子功能(molecular function)、细胞组分(cellular component)、参与的生物过程(biological process)。GO的基本单位是term(词条、节点),每个term都对应一个属性。 GO功能分析一方面给出基因的GO功能分类注释;另一方面给出基因的GO功能显著性富集分析。 首先,我们将基因向GO数据库(http://www.geneontology.org/)的各term映射,并计算每个term的基因数,从而得到具有某个GO功能的基因列表及基因数目统计。然后应用超几何检验,找出与整个基因组背景相比,在基因中显著富集的GO条目。

基因集 细胞组分 分子功能 生物学过程 GO 分类表
Cluster_0 Cluster_0.C.html Cluster_0.F.html Cluster_0.P.html Cluster_0.Level2.xls
Cluster_1 Cluster_1.C.html Cluster_1.F.html Cluster_1.P.html Cluster_1.Level2.xls
Cluster_2 Cluster_2.C.html Cluster_2.F.html Cluster_2.P.html Cluster_2.Level2.xls
Cluster_3 Cluster_3.C.html Cluster_3.F.html Cluster_3.P.html Cluster_3.Level2.xls
Cluster_4 Cluster_4.C.html Cluster_4.F.html Cluster_4.P.html Cluster_4.Level2.xls
Cluster_5 Cluster_5.C.html Cluster_5.F.html Cluster_5.P.html Cluster_5.Level2.xls
Cluster_6 Cluster_6.C.html Cluster_6.F.html Cluster_6.P.html Cluster_6.Level2.xls
Cluster_7 Cluster_7.C.html Cluster_7.F.html Cluster_7.P.html Cluster_7.Level2.xls
Cluster_8 Cluster_8.C.html Cluster_8.F.html Cluster_8.P.html Cluster_8.Level2.xls
Cluster_9 Cluster_9.C.html Cluster_9.F.html Cluster_9.P.html Cluster_9.Level2.xls
Cluster_10 Cluster_10.C.html Cluster_10.F.html Cluster_10.P.html Cluster_10.Level2.xls
Cluster_11 Cluster_11.C.html Cluster_11.F.html Cluster_11.P.html Cluster_11.Level2.xls
Cluster_12 Cluster_12.C.html Cluster_12.F.html Cluster_12.P.html Cluster_12.Level2.xls
Cluster_13 Cluster_13.C.html Cluster_13.F.html Cluster_13.P.html Cluster_13.Level2.xls
Cluster_14 Cluster_14.C.html Cluster_14.F.html Cluster_14.P.html Cluster_14.Level2.xls
Cluster_15 Cluster_15.C.html Cluster_15.F.html Cluster_15.P.html Cluster_15.Level2.xls
Cluster_16 Cluster_16.C.html Cluster_16.F.html Cluster_16.P.html Cluster_16.Level2.xls
Cluster_17 Cluster_17.C.html Cluster_17.F.html Cluster_17.P.html Cluster_17.Level2.xls
Cluster_18 Cluster_18.C.html Cluster_18.F.html Cluster_18.P.html Cluster_18.Level2.xls
Cluster_19 Cluster_19.C.html Cluster_19.F.html Cluster_19.P.html Cluster_19.Level2.xls
Cluster_20 Cluster_20.C.html Cluster_20.F.html Cluster_20.P.html Cluster_20.Level2.xls
Cluster_21 Cluster_21.C.html Cluster_21.F.html Cluster_21.P.html Cluster_21.Level2.xls
Cluster_22 Cluster_22.C.html Cluster_22.F.html Cluster_22.P.html Cluster_22.Level2.xls

GO富集圈图:(第一圈:富集前20的GOterm,圈外为基因数目的坐标尺。不同的颜色代表不同的Ontology; 第二圈:背景基因中该GOterm的数目以及Q值。基因越多条形越长,Q值越小颜色越红; 第三圈:该GOterm差异基因数量 第四圈:各GOterm的RichFactor值(该GOterm中差异数量除以所有数量),背景网格线,每一格代表0.1)

  • Cluster_0 富集圈图
  • Cluster_1 富集圈图
  • Cluster_2 富集圈图
  • Cluster_3 富集圈图
  • Cluster_4 富集圈图
  • Cluster_5 富集圈图
  • Cluster_6 富集圈图
  • Cluster_7 富集圈图
  • Cluster_8 富集圈图
  • Cluster_9 富集圈图
  • Cluster_10 富集圈图
  • Cluster_11 富集圈图
  • Cluster_12 富集圈图
  • Cluster_13 富集圈图
  • Cluster_14 富集圈图
  • Cluster_15 富集圈图
  • Cluster_16 富集圈图
  • Cluster_17 富集圈图
  • Cluster_18 富集圈图
  • Cluster_19 富集圈图
  • Cluster_20 富集圈图
  • Cluster_21 富集圈图
  • Cluster_22 富集圈图

Fig 5-3-1 GO 富集圈图


GO 富集分类柱状图:(横坐标为二级GOterm,纵坐标为该term里的基因数量,不同颜色表色不同类型GOterm)

  • Cluster_0
  • Cluster_1
  • Cluster_2
  • Cluster_3
  • Cluster_4
  • Cluster_5
  • Cluster_6
  • Cluster_7
  • Cluster_8
  • Cluster_9
  • Cluster_10
  • Cluster_11
  • Cluster_12
  • Cluster_13
  • Cluster_14
  • Cluster_15
  • Cluster_16
  • Cluster_17
  • Cluster_18
  • Cluster_19
  • Cluster_20
  • Cluster_21
  • Cluster_22

Fig 5-3-2 GO富集分类柱状图


GO富集气泡图:(利用Q值最小的前20个GOterm来作图,纵坐标为GOterm,横坐标为富集因子(该GOterm中差异数量除以所有数量),大小表示数量多少,颜色越红Q值越小)");

  • Cluster_0.C 气泡图
  • Cluster_0.F 气泡图
  • Cluster_0.P 气泡图
  • Cluster_1.C 气泡图
  • Cluster_1.F 气泡图
  • Cluster_1.P 气泡图
  • Cluster_2.C 气泡图
  • Cluster_2.F 气泡图
  • Cluster_2.P 气泡图
  • Cluster_3.C 气泡图
  • Cluster_3.F 气泡图
  • Cluster_3.P 气泡图
  • Cluster_4.C 气泡图
  • Cluster_4.F 气泡图
  • Cluster_4.P 气泡图
  • Cluster_5.C 气泡图
  • Cluster_5.F 气泡图
  • Cluster_5.P 气泡图
  • Cluster_6.C 气泡图
  • Cluster_6.F 气泡图
  • Cluster_6.P 气泡图
  • Cluster_7.C 气泡图
  • Cluster_7.F 气泡图
  • Cluster_7.P 气泡图
  • Cluster_8.C 气泡图
  • Cluster_8.F 气泡图
  • Cluster_8.P 气泡图
  • Cluster_9.C 气泡图
  • Cluster_9.F 气泡图
  • Cluster_9.P 气泡图
  • Cluster_10.C 气泡图
  • Cluster_10.F 气泡图
  • Cluster_10.P 气泡图
  • Cluster_11.C 气泡图
  • Cluster_11.F 气泡图
  • Cluster_11.P 气泡图
  • Cluster_12.C 气泡图
  • Cluster_12.F 气泡图
  • Cluster_12.P 气泡图
  • Cluster_13.C 气泡图
  • Cluster_13.F 气泡图
  • Cluster_13.P 气泡图
  • Cluster_14.C 气泡图
  • Cluster_14.F 气泡图
  • Cluster_14.P 气泡图
  • Cluster_15.C 气泡图
  • Cluster_15.F 气泡图
  • Cluster_15.P 气泡图
  • Cluster_16.C 气泡图
  • Cluster_16.F 气泡图
  • Cluster_16.P 气泡图
  • Cluster_17.C 气泡图
  • Cluster_17.F 气泡图
  • Cluster_17.P 气泡图
  • Cluster_18.C 气泡图
  • Cluster_18.F 气泡图
  • Cluster_18.P 气泡图
  • Cluster_19.C 气泡图
  • Cluster_19.F 气泡图
  • Cluster_19.P 气泡图
  • Cluster_20.C 气泡图
  • Cluster_20.F 气泡图
  • Cluster_20.P 气泡图
  • Cluster_21.C 气泡图
  • Cluster_21.F 气泡图
  • Cluster_21.P 气泡图
  • Cluster_22.C 气泡图
  • Cluster_22.F 气泡图
  • Cluster_22.P 气泡图

Fig 5-3-3 GO富集气泡图


GO富集条形图:(利用Q值最小的前20个GOterm来作图,纵坐标为GOterm,横坐标为该GOterm数目占所有差异数目的百分比,颜色越深Q值越小,柱子上的数值为该GOterm数量及Q值");

  • Cluster_0.C 富集柱形图
  • Cluster_0.F 富集柱形图
  • Cluster_0.P 富集柱形图
  • Cluster_1.C 富集柱形图
  • Cluster_1.F 富集柱形图
  • Cluster_1.P 富集柱形图
  • Cluster_2.C 富集柱形图
  • Cluster_2.F 富集柱形图
  • Cluster_2.P 富集柱形图
  • Cluster_3.C 富集柱形图
  • Cluster_3.F 富集柱形图
  • Cluster_3.P 富集柱形图
  • Cluster_4.C 富集柱形图
  • Cluster_4.F 富集柱形图
  • Cluster_4.P 富集柱形图
  • Cluster_5.C 富集柱形图
  • Cluster_5.F 富集柱形图
  • Cluster_5.P 富集柱形图
  • Cluster_6.C 富集柱形图
  • Cluster_6.F 富集柱形图
  • Cluster_6.P 富集柱形图
  • Cluster_7.C 富集柱形图
  • Cluster_7.F 富集柱形图
  • Cluster_7.P 富集柱形图
  • Cluster_8.C 富集柱形图
  • Cluster_8.F 富集柱形图
  • Cluster_8.P 富集柱形图
  • Cluster_9.C 富集柱形图
  • Cluster_9.F 富集柱形图
  • Cluster_9.P 富集柱形图
  • Cluster_10.C 富集柱形图
  • Cluster_10.F 富集柱形图
  • Cluster_10.P 富集柱形图
  • Cluster_11.C 富集柱形图
  • Cluster_11.F 富集柱形图
  • Cluster_11.P 富集柱形图
  • Cluster_12.C 富集柱形图
  • Cluster_12.F 富集柱形图
  • Cluster_12.P 富集柱形图
  • Cluster_13.C 富集柱形图
  • Cluster_13.F 富集柱形图
  • Cluster_13.P 富集柱形图
  • Cluster_14.C 富集柱形图
  • Cluster_14.F 富集柱形图
  • Cluster_14.P 富集柱形图
  • Cluster_15.C 富集柱形图
  • Cluster_15.F 富集柱形图
  • Cluster_15.P 富集柱形图
  • Cluster_16.C 富集柱形图
  • Cluster_16.F 富集柱形图
  • Cluster_16.P 富集柱形图
  • Cluster_17.C 富集柱形图
  • Cluster_17.F 富集柱形图
  • Cluster_17.P 富集柱形图
  • Cluster_18.C 富集柱形图
  • Cluster_18.F 富集柱形图
  • Cluster_18.P 富集柱形图
  • Cluster_19.C 富集柱形图
  • Cluster_19.F 富集柱形图
  • Cluster_19.P 富集柱形图
  • Cluster_20.C 富集柱形图
  • Cluster_20.F 富集柱形图
  • Cluster_20.P 富集柱形图
  • Cluster_21.C 富集柱形图
  • Cluster_21.F 富集柱形图
  • Cluster_21.P 富集柱形图
  • Cluster_22.C 富集柱形图
  • Cluster_22.F 富集柱形图
  • Cluster_22.P 富集柱形图

Fig 5-3-4 GO富集条形图


5.4 KO富集分析

       在生物体内,不同基因相互协调行使其生物学,基于Pathway的分析有助于更进一步了解基因的生物学功能。KEGG是有关Pathway的主要公共数据库。 Pathway显著性富集分析以KEGG Pathway为单位,应用超几何检验,找出与整个基因组背景相比,在基因中显著性富集的Pathway。通过Pathway显著性富集能确定基因参与的最主要生化代谢途径和信号转导途径。

所有趋势pathway统计如下所示:

Tab 5-4-1 所有趋势pathway统计表
PathwayPathway_IDKEGG_A_classKEGG_B_classCluster_0(276)Cluster_0_PvalueCluster_0_QvalueCluster_1(772)Cluster_1_PvalueCluster_1_QvalueCluster_10(436)Cluster_10_PvalueCluster_10_QvalueCluster_11(361)Cluster_11_PvalueCluster_11_QvalueCluster_12(268)Cluster_12_PvalueCluster_12_QvalueCluster_13(377)Cluster_13_PvalueCluster_13_QvalueCluster_14(374)Cluster_14_PvalueCluster_14_QvalueCluster_15(850)Cluster_15_PvalueCluster_15_QvalueCluster_16(524)Cluster_16_PvalueCluster_16_QvalueCluster_17(291)Cluster_17_PvalueCluster_17_QvalueCluster_18(168)Cluster_18_PvalueCluster_18_QvalueCluster_19(634)Cluster_19_PvalueCluster_19_QvalueCluster_2(296)Cluster_2_PvalueCluster_2_QvalueCluster_20(271)Cluster_20_PvalueCluster_20_QvalueCluster_21(209)Cluster_21_PvalueCluster_21_QvalueCluster_22(167)Cluster_22_PvalueCluster_22_QvalueCluster_3(270)Cluster_3_PvalueCluster_3_QvalueCluster_4(325)Cluster_4_PvalueCluster_4_QvalueCluster_5(344)Cluster_5_PvalueCluster_5_QvalueCluster_6(398)Cluster_6_PvalueCluster_6_QvalueCluster_7(522)Cluster_7_PvalueCluster_7_QvalueCluster_8(238)Cluster_8_PvalueCluster_8_QvalueCluster_9(351)Cluster_9_PvalueCluster_9_Qvalue
2-Oxocarboxylic acid metabolismko01210MetabolismGlobal and overview maps01150.022231781.359559e-0120.24540079.743313e-0101101140.0080529368.407265e-0201150.032249561.737315e-0110.69188239.937946e-0101101110.76151841.000000e+0001101101161.042245e-061.643540e-0501101110.53466867.668253e-0101101101120.17688515.117647e-01
ABC transportersko02010Environmental Information ProcessingMembrane transport01150.52973999.736352e-0101120.66250461.000000e+0001101101130.90867821.000000e+0001101101110.98332671.000000e+0001120.50543697.414926e-0101101101101130.36108196.079542e-0110.92077461.000000e+0001120.43752537.712048e-0130.37305917.357263e-01
AGE-RAGE signaling pathway in diabetic complicationsko04933Human DiseasesEndocrine and metabolic diseases80.012241974.252474e-02130.090592333.552775e-0160.36480899.968682e-0160.21912267.435025e-01110.00019821593.780260e-0310.98704021.000000e+0030.79716081.000000e+00130.15544195.191759e-01100.068993663.219747e-0180.016393027.683508e-0260.011430416.803815e-0290.2811779.060017e-0130.6508769.998671e-0190.003330061.816091e-0250.085774172.757027e-0120.56269329.999696e-0150.18647298.123856e-0130.71241999.812043e-01120.00045120886.111828e-0350.46415439.343373e-0180.23139154.264863e-0150.12917964.274084e-0190.016876511.963583e-01
AMPK signaling pathwayko04152Environmental Information ProcessingSignal transduction70.088170961.923730e-01130.26998097.055091e-0130.94490179.968682e-0160.38331559.997647e-0140.51096797.651735e-0120.96952471.000000e+0030.89600591.000000e+00140.2864297.416069e-0170.59370269.456923e-0140.57648377.948874e-0120.67978058.321853e-0190.51876771.000000e+0030.78253549.998671e-0130.72939058.852148e-0120.78981368.997876e-0110.90453189.999696e-0140.51684589.999999e-0140.66370289.812043e-0170.19921844.271013e-0150.65063319.791958e-01130.028277828.782589e-0210.96529739.923214e-0140.72175199.282627e-01
Acute myeloid leukemiako05221Human DiseasesCancers90.00031790692.797581e-0390.15787675.071191e-0140.45941589.968682e-0150.15926047.072900e-0190.00025564694.266108e-0320.80813551.000000e+0040.34704071.000000e+0090.231976.403139e-01120.00079749879.333688e-0330.40958916.503915e-0140.044532261.568037e-0160.39191271.000000e+0020.68774869.998671e-01105.143303e-055.281777e-0450.025145191.357840e-0101140.16734927.898882e-0110.9288549.812043e-01100.00036459955.432533e-0340.39087949.200698e-0150.40189156.350345e-0170.0027484653.166710e-0280.0065095321.184885e-01
Adherens junctionko04520Cellular ProcessesCellular community - eukaryotes105.3179e-055.849690e-04110.03719121.935996e-0130.67198169.968682e-0140.31435029.061541e-0130.35161976.298840e-0110.95181161.000000e+00011100.12626624.584012e-0110.98574471.000000e+0080.0019103271.780918e-0270.00032968665.494777e-03100.025454912.882321e-0140.20236659.812866e-0140.16315653.485023e-0160.0058224734.912712e-0201140.16165137.898882e-0170.013454091.702151e-0160.052432011.860088e-0160.091438084.531596e-01100.0072353533.000543e-0250.038657622.008680e-0160.056772043.192088e-01
Adipocytokine signaling pathwayko04920Organismal SystemsEndocrine system60.019624366.167656e-0250.72409911.000000e+0010.96888139.968682e-0120.77507061.000000e+0060.017229178.480070e-0201120.79186261.000000e+0090.20828826.124830e-0160.21830046.029887e-0120.66375188.409648e-0110.7320728.434009e-0140.73246081.000000e+0020.67295549.998671e-0180.001108077.215968e-0310.80644629.072520e-0110.72994379.999696e-0120.62287069.999999e-0101120.75137318.932500e-0110.95760321.000000e+0010.98462291.000000e+0001150.13489264.659478e-01
Adrenergic signaling in cardiomyocytesko04261Organismal SystemsCirculatory system70.15437052.890341e-01140.35397718.078413e-0140.92407239.968682e-0150.69207471.000000e+0080.066051582.260996e-0150.72801391.000000e+0060.55754211.000000e+0090.93619871.000000e+0080.60751759.456923e-01110.006999493.890101e-0240.28277435.236561e-01110.43730941.000000e+0030.8583759.998671e-0160.26762224.828049e-0160.11896753.417152e-01117.288991e-058.301351e-0440.63282649.999999e-0170.26535229.681769e-0190.10077762.945201e-0170.4556089.343373e-01180.0016664099.720753e-0360.18253154.885944e-0160.49491558.148333e-01
African trypanosomiasisko05143Human DiseasesInfectious diseases10.99230931.000000e+0030.99989781.000000e+0030.98253239.968682e-0101101110.99875331.000000e+0001101110.9999151.000000e+0010.99412261.000000e+0001130.99907531.000000e+0030.88935379.998671e-0130.85068739.456345e-0110.97458029.929685e-0101101130.92286399.812043e-0120.98354811.000000e+0001120.99905631.000000e+0010.98483229.923214e-0110.99800441.000000e+00
Alanine, aspartate and glutamate metabolismko00250MetabolismAmino acid metabolism01120.84137151.000000e+0020.54512299.968682e-0101120.30521395.854740e-0120.46667721.000000e+0010.79495711.000000e+0040.47330119.580762e-0110.89344421.000000e+0001101110.93454281.000000e+0001120.30983945.269243e-0101120.15164976.476706e-0101101101101101110.63236618.972937e-0120.42981537.762725e-01

基因集 Pathway 富集结果 Pathway 注释表
Cluster_0 Cluster_0.htm Cluster_0.path.xls
Cluster_1 Cluster_1.htm Cluster_1.path.xls
Cluster_2 Cluster_2.htm Cluster_2.path.xls
Cluster_3 Cluster_3.htm Cluster_3.path.xls
Cluster_4 Cluster_4.htm Cluster_4.path.xls
Cluster_5 Cluster_5.htm Cluster_5.path.xls
Cluster_6 Cluster_6.htm Cluster_6.path.xls
Cluster_7 Cluster_7.htm Cluster_7.path.xls
Cluster_8 Cluster_8.htm Cluster_8.path.xls
Cluster_9 Cluster_9.htm Cluster_9.path.xls
Cluster_10 Cluster_10.htm Cluster_10.path.xls
Cluster_11 Cluster_11.htm Cluster_11.path.xls
Cluster_12 Cluster_12.htm Cluster_12.path.xls
Cluster_13 Cluster_13.htm Cluster_13.path.xls
Cluster_14 Cluster_14.htm Cluster_14.path.xls
Cluster_15 Cluster_15.htm Cluster_15.path.xls
Cluster_16 Cluster_16.htm Cluster_16.path.xls
Cluster_17 Cluster_17.htm Cluster_17.path.xls
Cluster_18 Cluster_18.htm Cluster_18.path.xls
Cluster_19 Cluster_19.htm Cluster_19.path.xls
Cluster_20 Cluster_20.htm Cluster_20.path.xls
Cluster_21 Cluster_21.htm Cluster_21.path.xls
Cluster_22 Cluster_22.htm Cluster_22.path.xls

KO富集圈图:(第一圈:富集前20的pathway,圈外为基因数目的坐标尺。不同的颜色代表不同的A class; 第二圈:背景基因中该pathway的数目以及Q值。基因越多条形越长,Q值越小颜色越红; 第三圈:该pathway差异基因数量 第四圈:各pathway的RichFactor值(该pathway中差异数量除以所有数量),背景网格线,每一格代表0.1)

  • Cluster_0 富集圈图
  • Cluster_1 富集圈图
  • Cluster_2 富集圈图
  • Cluster_3 富集圈图
  • Cluster_4 富集圈图
  • Cluster_5 富集圈图
  • Cluster_6 富集圈图
  • Cluster_7 富集圈图
  • Cluster_8 富集圈图
  • Cluster_9 富集圈图
  • Cluster_10 富集圈图
  • Cluster_11 富集圈图
  • Cluster_12 富集圈图
  • Cluster_13 富集圈图
  • Cluster_14 富集圈图
  • Cluster_15 富集圈图
  • Cluster_16 富集圈图
  • Cluster_17 富集圈图
  • Cluster_18 富集圈图
  • Cluster_19 富集圈图
  • Cluster_20 富集圈图
  • Cluster_21 富集圈图
  • Cluster_22 富集圈图

Fig 5-4-1 KO 富集圈图


KO富集气泡图:(利用Q值最小的前20个pathway来作图,纵坐标为pathway,横坐标为富集因子(该pathway中差异数量除以所有数量),大小表示数量多少,颜色越红Q值越小)");

  • Cluster_0
  • Cluster_1
  • Cluster_2
  • Cluster_3
  • Cluster_4
  • Cluster_5
  • Cluster_6
  • Cluster_7
  • Cluster_8
  • Cluster_9
  • Cluster_10
  • Cluster_11
  • Cluster_12
  • Cluster_13
  • Cluster_14
  • Cluster_15
  • Cluster_16
  • Cluster_17
  • Cluster_18
  • Cluster_19
  • Cluster_20
  • Cluster_21
  • Cluster_22

Fig 5-4-2 KO富集气泡图


KO富集条形图:(利用Q值最小的前20个pathway来作图,纵坐标为pathway,横坐标为该pathway数目占所有差异数目的百分比,颜色越深Q值越小,柱子上的数值为该pathway数量及Q值");

  • Cluster_0 富集柱形图
  • Cluster_1 富集柱形图
  • Cluster_2 富集柱形图
  • Cluster_3 富集柱形图
  • Cluster_4 富集柱形图
  • Cluster_5 富集柱形图
  • Cluster_6 富集柱形图
  • Cluster_7 富集柱形图
  • Cluster_8 富集柱形图
  • Cluster_9 富集柱形图
  • Cluster_10 富集柱形图
  • Cluster_11 富集柱形图
  • Cluster_12 富集柱形图
  • Cluster_13 富集柱形图
  • Cluster_14 富集柱形图
  • Cluster_15 富集柱形图
  • Cluster_16 富集柱形图
  • Cluster_17 富集柱形图
  • Cluster_18 富集柱形图
  • Cluster_19 富集柱形图
  • Cluster_20 富集柱形图
  • Cluster_21 富集柱形图
  • Cluster_22 富集柱形图

Fig 5-4-3 KO富集条形图


5.5 Reactome富集分析

       Reactome数据库汇集了部分物种各项反应及生物学通路。我们将基因向Reactome数据库(https://reactome.org/)的各term映射,并计算每个term的基因数,从而得到具有某个Reactome功能的基因列表及基因数目统计。然后应用超几何检验,找出与整个基因组背景相比,在基因中显著富集的Reactome条目。

基因集 Reactome 富集结果 Reactome 富集表
Cluster_0 Cluster_0.reactome.html Cluster_0.reactome.xls
Cluster_1 Cluster_1.reactome.html Cluster_1.reactome.xls
Cluster_2 Cluster_2.reactome.html Cluster_2.reactome.xls
Cluster_3 Cluster_3.reactome.html Cluster_3.reactome.xls
Cluster_4 Cluster_4.reactome.html Cluster_4.reactome.xls
Cluster_5 Cluster_5.reactome.html Cluster_5.reactome.xls
Cluster_6 Cluster_6.reactome.html Cluster_6.reactome.xls
Cluster_7 Cluster_7.reactome.html Cluster_7.reactome.xls
Cluster_8 Cluster_8.reactome.html Cluster_8.reactome.xls
Cluster_9 Cluster_9.reactome.html Cluster_9.reactome.xls
Cluster_10 Cluster_10.reactome.html Cluster_10.reactome.xls
Cluster_11 Cluster_11.reactome.html Cluster_11.reactome.xls
Cluster_12 Cluster_12.reactome.html Cluster_12.reactome.xls
Cluster_13 Cluster_13.reactome.html Cluster_13.reactome.xls
Cluster_14 Cluster_14.reactome.html Cluster_14.reactome.xls
Cluster_15 Cluster_15.reactome.html Cluster_15.reactome.xls
Cluster_16 Cluster_16.reactome.html Cluster_16.reactome.xls
Cluster_17 Cluster_17.reactome.html Cluster_17.reactome.xls
Cluster_18 Cluster_18.reactome.html Cluster_18.reactome.xls
Cluster_19 Cluster_19.reactome.html Cluster_19.reactome.xls
Cluster_20 Cluster_20.reactome.html Cluster_20.reactome.xls
Cluster_21 Cluster_21.reactome.html Cluster_21.reactome.xls
Cluster_22 Cluster_22.reactome.html Cluster_22.reactome.xls

Reactome富集圈图:(第一圈:富集前20的Reactome通路,圈外为基因数目的坐标尺。 第二圈:背景基因中该Reactome通路的数目以及Q值。基因越多条形越长,Q值越小颜色越红; 第三圈:该Reactome通路差异基因数量 第四圈:各Reactome通路的RichFactor值(该Reactome通路中差异数量除以所有数量),背景网格线,每一格代表0.1)

  • Cluster_0 富集圈图
  • Cluster_1 富集圈图
  • Cluster_2 富集圈图
  • Cluster_3 富集圈图
  • Cluster_4 富集圈图
  • Cluster_5 富集圈图
  • Cluster_6 富集圈图
  • Cluster_7 富集圈图
  • Cluster_8 富集圈图
  • Cluster_9 富集圈图
  • Cluster_10 富集圈图
  • Cluster_11 富集圈图
  • Cluster_12 富集圈图
  • Cluster_13 富集圈图
  • Cluster_14 富集圈图
  • Cluster_15 富集圈图
  • Cluster_16 富集圈图
  • Cluster_17 富集圈图
  • Cluster_18 富集圈图
  • Cluster_19 富集圈图
  • Cluster_20 富集圈图
  • Cluster_21 富集圈图
  • Cluster_22 富集圈图

Fig 5-5-1 Reactome 富集圈图


Reactome富集气泡图:(利用Q值最小的前20个Reactome通路来作图,纵坐标为Reactome通路,横坐标为富集因子(该Reactome通路中差异数量除以所有数量),大小表示数量多少,颜色越红Q值越小)");

  • Cluster_0
  • Cluster_1
  • Cluster_2
  • Cluster_3
  • Cluster_4
  • Cluster_5
  • Cluster_6
  • Cluster_7
  • Cluster_8
  • Cluster_9
  • Cluster_10
  • Cluster_11
  • Cluster_12
  • Cluster_13
  • Cluster_14
  • Cluster_15
  • Cluster_16
  • Cluster_17
  • Cluster_18
  • Cluster_19
  • Cluster_20
  • Cluster_21
  • Cluster_22

Fig 5-5-2 Reactome富集气泡图


Reactome富集条形图:(利用Q值最小的前20个Reactome通路来作图,纵坐标为Reactome通路,横坐标为该Reactome通路数目占所有差异数目的百分比,颜色越深Q值越小,柱子上的数值为该Reactome通路数量及Q值");

  • Cluster_0 富集柱形图
  • Cluster_1 富集柱形图
  • Cluster_2 富集柱形图
  • Cluster_3 富集柱形图
  • Cluster_4 富集柱形图
  • Cluster_5 富集柱形图
  • Cluster_6 富集柱形图
  • Cluster_7 富集柱形图
  • Cluster_8 富集柱形图
  • Cluster_9 富集柱形图
  • Cluster_10 富集柱形图
  • Cluster_11 富集柱形图
  • Cluster_12 富集柱形图
  • Cluster_13 富集柱形图
  • Cluster_14 富集柱形图
  • Cluster_15 富集柱形图
  • Cluster_16 富集柱形图
  • Cluster_17 富集柱形图
  • Cluster_18 富集柱形图
  • Cluster_19 富集柱形图
  • Cluster_20 富集柱形图
  • Cluster_21 富集柱形图
  • Cluster_22 富集柱形图

Fig 5-5-3 Reactome富集条形图


5.6 上调基因蛋白质互作网络分析

通过string数据库[11],我们可以获得上调基因构建蛋白质互作关系信息(3.MarkerGene/String),然后利用Cytoscape构建蛋白质互作网络图。

String蛋白质互作调控网络图

注:依据系统配置及浏览器不同,如果标记基因数量过多该图可能不能正常加载,请使用桌面版Cytoscape软件


Cytoscape官方手册:http://manual.cytoscape.org/en/stable/index.html
Cytoscape使用教程:http://www.omicshare.com/class/home/index/classdetail?id=14

广州基迪奥生物科技有限公司




6 GSVA分析

基于传统的超几何检验的富集分析,往往需要用到显著差异基因集数据。当单个基因变化较为微弱时,基于传统富集分析得到结果可能会很少,甚至没有结果。GSVA分析(Gene Set Variation Analysis)[12]能够有效弥补传统富集分析对微效基因的有效信息挖据不足等问题,更为全面地对某一功能单位的调节作用进行解释。GSVA分析反映了某一个细胞亚群相对于所有细胞过表达的通路信息。GSVA原理如下:

我们对MSigDB数据库的八个数据集的每个通路进行GSVA分析

MSigDB数据库各数据集在各细胞富集程度表:

  • H
  • C1
  • C2
  • C3
  • C4
  • C5
  • C6
  • C7

Fig 6-0-1 各个亚群中的富集分数热图


广州基迪奥生物科技有限公司




7 转录因子注释

转录因子是调控基因表达的重要元件,其表达情况与细胞的下游基因表达和上游表观调控息息相关。为了方便转录因子分析的进行,我们使用animalTFDB(对动物样本)或者plantTFDB(对植物样本)对样本中所有有表达的转录因子进行注释。

Tab 7-0-1 转录因子注释表
gene_idTF_familyGene_nameCluster 0Cluster 1Cluster 2Cluster 3Cluster 4Cluster 5Cluster 6Cluster 7Cluster 8Cluster 9Cluster 10Cluster 11Cluster 12Cluster 13Cluster 14Cluster 15Cluster 16Cluster 17Cluster 18Cluster 19Cluster 20Cluster 21Cluster 22DescriptionKEGG_A_classKEGG_B_classPathwayK_IDGO ComponentGO FunctionGO Process
ENSMUSG00000052713OthersZfp6081.0631030015941.09405235611161.329964012577490.05762155860752141.670492820147921.291305069041820.03672909177622880.05568668763332380.5097002175112490.5677839578551580.1024379197820430.05496655271585970.9328501581307640.3730384918993660.4028346901381720.3125712341188710.5902617443412551.894252968420070.8387116335933960.728689820360740.2698227735730520.7096743984001390.163990647626932zinc finger protein 608 [Source:MGI Symbol;Acc:MGI:2442338]----GO:0005634//nucleusGO:0046872//metal ion bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0006357//regulation of transcription by RNA polymerase II;GO:0033085//negative regulation of T cell differentiation in thymus
ENSMUSG00000021359AP-2Tfap2a0009.54049721636913e-04000.00394260385947877000.00585998846754270.001549378699141640.0072481730979706600000000000transcription factor AP-2, alpha [Source:MGI Symbol;Acc:MGI:104671]----GO:0005634//nucleusGO:0003700//DNA-binding transcription factor activityGO:0006355//regulation of transcription, DNA-templated
ENSMUSG00000033863zf-C2H2Klf90.03918886673642250.3958891942612290.1012418117979950.05455105521755890.1017894837512040.3434840159405240.1701672196827770.1386937290598750.2678347561907760.08403468474544480.08165536668483170.06498465238639880.3950453094790440.125403958853610.7441209859603370.8811191157038090.5926500326882940.3771239141153031.765898077232740.189713301818008001.07533983232231Kruppel-like factor 9 [Source:MGI Symbol;Acc:MGI:1333856]----GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005829//cytosol;GO:0005886//plasma membrane-GO:0007623//circadian rhythm;GO:0010839//negative regulation of keratinocyte proliferation;GO:0071387//cellular response to cortisol stimulus;GO:0097067//cellular response to thyroid hormone stimulus
ENSMUSG00000021514zf-C2H2Zfp3690.01019124048547930.1254471143347830.1760313520582460.2715938054600940.1783309715599880.1618976119149040.2049830079865450.1945537366440230.2173419436773590.1706032402197280.1403947832888730.2247059153449420.1558505460174820.06212363860202730.2623179131593280.1087180039906780.03368828669364660.1837787024696450.1470069425476160.08834266159260120.1121770931761750.02957757310097190.143820741827386zinc finger protein 369 [Source:MGI Symbol;Acc:MGI:2176229]Organismal SystemsNervous systemko04722//Neurotrophin signaling pathwayK12458GO:0005634//nucleusGO:0003677//DNA binding;GO:0046872//metal ion bindingGO:0006355//regulation of transcription, DNA-templated
ENSMUSG00000045903bHLHNpas400.001254807050717180000.001820407348911650.002000055601545720000000.0052231049269548800.002270655299766240000000neuronal PAS domain protein 4 [Source:MGI Symbol;Acc:MGI:2664186]----GO:0005634//nucleus;GO:0005667//transcription factor complex;GO:0098794//postsynapseGO:0000977//RNA polymerase II regulatory region sequence-specific DNA binding;GO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0005515//protein binding;GO:0044877//protein-containing complex binding;GO:0046982//protein heterodimerization activity;GO:0046983//protein dimerization activityGO:0006357//regulation of transcription by RNA polymerase II;GO:0007399//nervous system development;GO:0007612//learning;GO:0007614//short-term memory;GO:0007616//long-term memory;GO:0030154//cell differentiation;GO:0032228//regulation of synaptic transmission, GABAergic;GO:0035176//social behavior;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0048167//regulation of synaptic plasticity;GO:0060079//excitatory postsynaptic potential;GO:0060080//inhibitory postsynaptic potential;GO:0071386//cellular response to corticosterone stimulus;GO:1904862//inhibitory synapse assembly
ENSMUSG00000021362GCMGcm200000000000000000000.0707731885763849000glial cells missing homolog 2 [Source:MGI Symbol;Acc:MGI:1861438]Organismal SystemsEndocrine systemko04928//Parathyroid hormone synthesis, secretion and actionK21598GO:0005634//nucleusGO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0043565//sequence-specific DNA binding;GO:1990837//sequence-specific double-stranded DNA bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0006366//transcription by RNA polymerase II;GO:0006874//cellular calcium ion homeostasis;GO:0007275//multicellular organism development;GO:0030643//cellular phosphate ion homeostasis;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0060017//parathyroid gland development
ENSMUSG00000036721zf-C2H2Zscan120.003495404227260970.06918258152246450.01620580943902520.08257616560623130.04073345136609770.07570426261353630.05233485531886870.02866477539273160.1305950689242460.09825945715060730.02327769975900430.06174563462617280.1253030928423550.05893896221496040.05476183470661670.04355552801450420.03112459850308640.07329038026416370.01193680654614470.0338467864283964000zinc finger and SCAN domain containing 12 [Source:MGI Symbol;Acc:MGI:1099444]----GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0046872//metal ion bindingGO:0006357//regulation of transcription by RNA polymerase II
ENSMUSG00000042320HPDProx200.00746341615386970.03013911178191750.010428096872270.00606689273824870.007900640858976580.02172793166648230.014833971040851600.002838319124707790.02467208185086430.02034881809685120.0024974326392468500.02682973155983110.002413298626881350000.014216823620399400.04570216811085520prospero homeobox 2 [Source:MGI Symbol;Acc:MGI:1920672]----GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0003677//DNA bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II
ENSMUSG00000090641zf-C2H2Zfp7120.006903135785794910.02512940972271430.0670253753940170.09012082499323410.05672050958880740.02465911605977180.0800134435363850.09947304702984760.05564387519099320.02248971785806370.0418295299050360.08067288007564130.01564512395982790.0194344604510210.129040809312680.01572034112891350000.13444453924901000zinc finger protein 712 [Source:MGI Symbol;Acc:MGI:1925501]Human DiseasesInfectious diseasesko05168//Herpes simplex infectionK09228GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0046872//metal ion bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II
ENSMUSG00000058883zf-C2H2Zfp708000000000.0021477732423966100000000000000zinc finger protein 708 [Source:MGI Symbol;Acc:MGI:3040674]Human DiseasesInfectious diseasesko05168//Herpes simplex infectionK09228GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0046872//metal ion bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II
ENSMUSG00000021327zf-C2H2Zkscan30.1491035838083040.740635043725020.8088005478165220.8744459102187060.6943383292733420.7845670911541610.9287712701354910.8742588607793131.031542095994260.8744774187963020.7382525288297251.012947491123940.8168691651612490.6314129425005161.012515328516580.4352880825152990.7447957502758780.3208681113118380.5904871136420671.274286261266032.06540357980860.1411700489431520.533876259599236zinc finger with KRAB and SCAN domains 3 [Source:MGI Symbol;Acc:MGI:1919989]----GO:0005634//nucleusGO:0003677//DNA binding;GO:0046872//metal ion bindingGO:0006355//regulation of transcription, DNA-templated
ENSMUSG00000100235zf-C2H2Gm285577.3345605520049e-040.009298080708386250.007612673114857720.01019037174501770.02098894789782660.02145294270209550.008909885371951950.007190588418315250.01166776059590370.003290699036614950.001274770381984940.009872566594591340.01957221453076780.0159656668581070.01891051779361530.01201533914557870.01311828829320280.01742403122386400000predicted gene 28557 [Source:MGI Symbol;Acc:MGI:5579263]Human DiseasesInfectious diseasesko05168//Herpes simplex infectionK09228GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0046872//metal ion bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II
ENSMUSG00000074824zf-C2H2Rslcan1800.0026069266276578400000.002663193152610820.00283065520326793000.00375242191866155000.00280565901423171000000000regulator of sex-limitation candidate 18 [Source:MGI Symbol;Acc:MGI:5433745]Human DiseasesInfectious diseasesko05168//Herpes simplex infectionK09228GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0046872//metal ion bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II
ENSMUSG00000057396zf-C2H2Zfp7590.005502506668249710.02787584853370640.03094929275411740.05371741430920720.02895488086379630.03228264933567560.05849255218458770.0801411352536950.0533595013715620.04457036701723760.01144120940420480.04498676438118750.03792616381544410.02918291997339840.05412709414395820.02306740776650370.01011081452721830.02743634767339770.03432368608929650.140566097003865000zinc finger protein 759 [Source:MGI Symbol;Acc:MGI:2446280]Human DiseasesInfectious diseasesko05168//Herpes simplex infectionK09228GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:0046872//metal ion bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0045944//positive regulation of transcription by RNA polymerase II
ENSMUSG00000022676zf-C2H2Snai201.56581654624024e-0400000.0027067861679815900.012078455490980800000.013520456450609800.037976759422333000.112275286593240000snail family zinc finger 2 [Source:MGI Symbol;Acc:MGI:1096393]Environmental Information Processing;Cellular ProcessesSignal transduction;Cellular community - eukaryotesko04390//Hippo signaling pathway;ko04520//Adherens junctionK05706;K05706GO:0000785//chromatin;GO:0005634//nucleus;GO:0005737//cytoplasmGO:0000977//RNA polymerase II regulatory region sequence-specific DNA binding;GO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001227//DNA-binding transcription repressor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003682//chromatin binding;GO:0005515//protein binding;GO:0043565//sequence-specific DNA binding;GO:0046872//metal ion binding;GO:0070888//E-box binding;GO:1990837//sequence-specific double-stranded DNA bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0001837//epithelial to mesenchymal transition;GO:0003198//epithelial to mesenchymal transition involved in endocardial cushion formation;GO:0003273//cell migration involved in endocardial cushion formation;GO:0006355//regulation of transcription, DNA-templated;GO:0006929//substrate-dependent cell migration;GO:0006933//negative regulation of cell adhesion involved in substrate-bound cell migration;GO:0007219//Notch signaling pathway;GO:0007275//multicellular organism development;GO:0007605//sensory perception of sound;GO:0009314//response to radiation;GO:0010839//negative regulation of keratinocyte proliferation;GO:0010957//negative regulation of vitamin D biosynthetic process;GO:0014032//neural crest cell development;GO:0016477//cell migration;GO:0030335//positive regulation of cell migration;GO:0032331//negative regulation of chondrocyte differentiation;GO:0032642//regulation of chemokine production;GO:0033629//negative regulation of cell adhesion mediated by integrin;GO:0035066//positive regulation of histone acetylation;GO:0035921//desmosome disassembly;GO:0036120//cellular response to platelet-derived growth factor stimulus;GO:0042981//regulation of apoptotic process;GO:0043066//negative regulation of apoptotic process;GO:0043473//pigmentation;GO:0043518//negative regulation of DNA damage response, signal transduction by p53 class mediator;GO:0044319//wound healing, spreading of cells;GO:0045600//positive regulation of fat cell differentiation;GO:0045667//regulation of osteoblast differentiation;GO:0050872//white fat cell differentiation;GO:0060021//roof of mouth development;GO:0060429//epithelium development;GO:0060536//cartilage morphogenesis;GO:0060693//regulation of branching involved in salivary gland morphogenesis;GO:0070563//negative regulation of vitamin D receptor signaling pathway;GO:0071364//cellular response to epidermal growth factor stimulus;GO:0071479//cellular response to ionizing radiation;GO:0090090//negative regulation of canonical Wnt signaling pathway;GO:1902230//negative regulation of intrinsic apoptotic signaling pathway in response to DNA damage;GO:2000647//negative regulation of stem cell proliferation;GO:2000810//regulation of bicellular tight junction assembly;GO:2000811//negative regulation of anoikis;GO:2001028//positive regulation of endothelial cell chemotaxis;GO:2001240//negative regulation of extrinsic apoptotic signaling pathway in absence of ligand
ENSMUSG00000022228zf-C2H2Zscan260.02417481313651890.5230898070837160.3923231278650970.5689884210657530.1473997049147480.5648321847029060.5214645805090550.5126629373158890.7329030014632180.48499250871020.3921743031998010.4017068096451640.4436907540350480.408602221338710.8951194026659050.3680430152697690.570452923786890.4587381994111640.3872082455849470.3125919585907140.3285056224248030.09235341443274230.18973812230604zinc finger and SCAN domain containing 26 [Source:MGI Symbol;Acc:MGI:3531417]----GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005829//cytosolGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0046872//metal ion bindingGO:0006357//regulation of transcription by RNA polymerase II
ENSMUSG00000024276zf-C2H2Zfp3970.1479643759997670.3606771509861560.7028589621113330.6667378807132050.6671231044454060.5340612659339890.5814096828163690.5656478122213270.5352407924653120.4850332525505720.4049916615549930.5645991585147760.3516550781500960.2806997159467150.8021304261865610.3606718708395610.4185789199233870.2186082598238290.3519544721862380.3989020561429450.5539060645196130.2425355953519640.0770036345715518zinc finger protein 397 [Source:MGI Symbol;Acc:MGI:1916506]----GO:0005634//nucleus;GO:0005730//nucleolus;GO:0005737//cytoplasm;GO:0005829//cytosol;GO:0005886//plasma membrane;GO:0015630//microtubule cytoskeletonGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0042803//protein homodimerization activity;GO:0046982//protein heterodimerization activityGO:0006357//regulation of transcription by RNA polymerase II;GO:0045892//negative regulation of transcription, DNA-templated
ENSMUSG00000058900zf-C2H2Rsl10.002670964475405120.03182255178986670.0492456233812330.01968451231473170.05527155686378810.03121622743492880.05050999547689750.02545442275879760.04008199317576620.01788072388938180.0346713730840680.02702625994123270.0154155692572010.007792536524106010.04116725981610910.00905120338073790.02034261095821140.026213694033763200.02587558634011030.04512268859027700regulator of sex limited protein 1 [Source:MGI Symbol;Acc:MGI:3044162]Human DiseasesInfectious diseasesko05168//Herpes simplex infectionK09228GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0000987//proximal promoter sequence-specific DNA binding;GO:0043565//sequence-specific DNA binding;GO:0046872//metal ion bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0007530//sex determination;GO:0044030//regulation of DNA methylation;GO:0045892//negative regulation of transcription, DNA-templated
ENSMUSG00000063281zf-C2H2Zfp350.02342553572111070.07232727504935190.09849579567711830.08401022530114150.176049589892090.06760235325597630.07674890928611620.09408713831676070.0955466342537820.09767829697887550.07772561887161360.04783550441045660.08045529859418110.1297138253127370.09922853762463130.07057221374551490.03121371456192810.3324752598180360.09519459701823760.085922987065456600.1318905531187360zinc finger protein 35 [Source:MGI Symbol;Acc:MGI:99179]Human DiseasesInfectious diseasesko05168//Herpes simplex infectionK09228GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0046872//metal ion bindingGO:0002437//inflammatory response to antigenic stimulus;GO:0002829//negative regulation of type 2 immune response;GO:0006357//regulation of transcription by RNA polymerase II;GO:0007275//multicellular organism development;GO:0007283//spermatogenesis;GO:0030154//cell differentiation;GO:0045629//negative regulation of T-helper 2 cell differentiation;GO:0045944//positive regulation of transcription by RNA polymerase II
ENSMUSG00000051469zf-C2H2Zfp240.1122760177738210.2273227784649340.3307743508035410.3102121005731560.369012867814670.2315228598253260.2744510140831410.3252996270673820.2776806949393580.2840578190820640.2873152348941240.2657643961794880.1381217063059370.2245647718091470.4745311261699570.1580436488490510.300579680464040.1820176825460290.4919041694356260.448833289230070.074404869528064200zinc finger protein 24 [Source:MGI Symbol;Acc:MGI:1929704]----GO:0005634//nucleus;GO:0005654//nucleoplasmGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0042802//identical protein binding;GO:0043565//sequence-specific DNA binding;GO:0046872//metal ion bindingGO:0006357//regulation of transcription by RNA polymerase II;GO:0042552//myelination;GO:0045892//negative regulation of transcription, DNA-templated;GO:0045944//positive regulation of transcription by RNA polymerase II

8 膜蛋白注释

使用TMHMM预测基因的跨膜结构域。

Tab 8-0-1 膜蛋白注释表
GeneIDLengthExpAAFirst60PredHelTopologyGene_nameCluster 0Cluster 1Cluster 2Cluster 3Cluster 4Cluster 5Cluster 6Cluster 7Cluster 8Cluster 9Cluster 10Cluster 11Cluster 12Cluster 13Cluster 14Cluster 15Cluster 16Cluster 17Cluster 18Cluster 19Cluster 20Cluster 21Cluster 22DescriptionKEGG_A_classKEGG_B_classPathwayK_IDGO ComponentGO FunctionGO Process
ENSMUSG000000000013540.740.000oGnai31.797551570398780.8756645179892950.7342769603792850.690266299241270.8914247107539020.7586928191339270.8482242846083581.277962718692130.6387978438528110.528853570595660.6643582391803850.8410911928520150.5553854979456610.794146823906140.9409786233517530.8886139389040670.749434056927722.17652420989541.022430262865550.8847683681314532.70089419002151.841253524095880.061955565468446guanine nucleotide binding protein (G protein), alpha inhibiting 3 [Source:MGI Symbol;Acc:MGI:95773]Human Diseases;Human Diseases;Environmental Information Processing;Human Diseases;Environmental Information Processing;Human Diseases;Organismal Systems;Organismal Systems;Environmental Information Processing;Human Diseases;Organismal Systems;Human Diseases;Organismal Systems;Organismal Systems;Organismal Systems;Organismal Systems;Environmental Information Processing;Organismal Systems;Organismal Systems;Human Diseases;Organismal Systems;Environmental Information Processing;Organismal Systems;Organismal Systems;Organismal Systems;Organismal Systems;Human Diseases;Organismal Systems;Organismal Systems;Organismal Systems;Human Diseases;Organismal Systems;Organismal Systems;Cellular Processes;Human Diseases;Organismal Systems;Organismal Systems;Organismal Systems;Organismal Systems;Human DiseasesCancers;Infectious disease: viral;Signal transduction;Infectious diseases;Signal transduction;Substance dependence;Immune system;Development;Signal transduction;Endocrine and metabolic diseases;Endocrine system;Infectious diseases;Nervous system;Circulatory system;Nervous system;Endocrine system;Signal transduction;Nervous system;Endocrine system;Neurodegenerative diseases;Immune system;Signal transduction;Nervous system;Immune system;Nervous system;Endocrine system;Infectious diseases;Endocrine system;Environmental adaptation;Endocrine system;Substance dependence;Nervous system;Endocrine system;Cellular community - eukaryotes;Infectious diseases;Endocrine system;Digestive system;Nervous system;Endocrine system;Substance dependenceko05200//Pathways in cancer;ko05170//Human immunodeficiency virus 1 infection;ko04015//Rap1 signaling pathway;ko05163//Human cytomegalovirus infection;ko04024//cAMP signaling pathway;ko05034//Alcoholism;ko04062//Chemokine signaling pathway;ko04360//Axon guidance;ko04022//cGMP - PKG signaling pathway;ko04934//Cushing syndrome;ko04921//Oxytocin signaling pathway;ko05142//Chagas disease (American trypanosomiasis);ko04723//Retrograde endocannabinoid signaling;ko04261//Adrenergic signaling in cardiomyocytes;ko04726//Serotonergic synapse;ko04915//Estrogen signaling pathway;ko04371//Apelin signaling pathway;ko04728//Dopaminergic synapse;ko04926//Relaxin signaling pathway;ko05012//Parkinson disease;ko04611//Platelet activation;ko04071//Sphingolipid signaling pathway;ko04724//Glutamatergic synapse;ko04670//Leukocyte transendothelial migration;ko04725//Cholinergic synapse;ko04935//Growth hormone synthesis, secretion and action;ko05145//Toxoplasmosis;ko04928//Parathyroid hormone synthesis, secretion and action;ko04713//Circadian entrainment;ko04916//Melanogenesis;ko05032//Morphine addiction;ko04727//GABAergic synapse;ko04914//Progesterone-mediated oocyte maturation;ko04540//Gap junction;ko05133//Pertussis;ko04924//Renin secretion;ko04971//Gastric acid secretion;ko04730//Long-term depression;ko04923//Regulation of lipolysis in adipocyte;ko05030//Cocaine addictionK04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630;K04630GO:0000139//Golgi membrane;GO:0005737//cytoplasm;GO:0005789//endoplasmic reticulum membrane;GO:0005794//Golgi apparatus;GO:0005813//centrosome;GO:0005815//microtubule organizing center;GO:0005834//heterotrimeric G-protein complex;GO:0005856//cytoskeleton;GO:0005886//plasma membrane;GO:0016020//membrane;GO:0030496//midbody;GO:0042588//zymogen granule;GO:0045121//membrane raftGO:0000166//nucleotide binding;GO:0001664//G protein-coupled receptor binding;GO:0003924//GTPase activity;GO:0005515//protein binding;GO:0005525//GTP binding;GO:0019001//guanyl nucleotide binding;GO:0019003//GDP binding;GO:0019904//protein domain specific binding;GO:0031683//G-protein beta/gamma-subunit complex binding;GO:0031821//G protein-coupled serotonin receptor binding;GO:0032794//GTPase activating protein binding;GO:0046872//metal ion bindingGO:0006906//vesicle fusion;GO:0007049//cell cycle;GO:0007165//signal transduction;GO:0007186//G protein-coupled receptor signaling pathway;GO:0007188//adenylate cyclase-modulating G protein-coupled receptor signaling pathway;GO:0007193//adenylate cyclase-inhibiting G protein-coupled receptor signaling pathway;GO:0007212//dopamine receptor signaling pathway;GO:0007420//brain development;GO:0016239//positive regulation of macroautophagy;GO:0032930//positive regulation of superoxide anion generation;GO:0033864//positive regulation of NAD(P)H oxidase activity;GO:0046039//GTP metabolic process;GO:0051301//cell division;GO:1904707//positive regulation of vascular smooth muscle cell proliferation;GO:2001234//negative regulation of apoptotic signaling pathway
ENSMUSG000000000031744.634.280oPbsn00000000000000000000000probasin [Source:MGI Symbol;Acc:MGI:1860484]----GO:0005576//extracellular region;GO:0005615//extracellular spaceGO:0005549//odorant binding;GO:0036094//small molecule binding-
ENSMUSG000000000285660.100.060oCdc450.01937648217201870.01526663387505620.01946359233870150.01371874129952720.0211558510276490.009673836182949830.03534682761955590.01211721163431040.007642675118988430.01702125248747840.009935153579101340.01004899902133410.002721353535936290.3007992590295890.04112967657529670.02228794612781480.0092281592747978400.008517727947176460000cell division cycle 45 [Source:MGI Symbol;Acc:MGI:1338073]Cellular ProcessesCell growth and deathko04110//Cell cycleK06628GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005813//centrosome;GO:0031261//DNA replication preinitiation complex;GO:0036064//ciliary basal bodyGO:0003682//chromatin binding;GO:0003688//DNA replication origin binding;GO:0003697//single-stranded DNA bindingGO:0000727//double-strand break repair via break-induced replication;GO:0006260//DNA replication;GO:0006270//DNA replication initiation;GO:0007049//cell cycle;GO:0031938//regulation of chromatin silencing at telomere;GO:1902977//mitotic DNA replication preinitiation complex assembly
ENSMUSG000000000379690.030.010oScml206.51822261781368e-0400.0069729127399994800.0140913972197575000.00804803064690071000.0030748813557028900.00615352427719165000000000Scm polycomb group protein like 2 [Source:MGI Symbol;Acc:MGI:1340042]----GO:0000785//chromatin;GO:0001741//XY body;GO:0005634//nucleusGO:0003682//chromatin binding;GO:0005515//protein binding;GO:0042393//histone bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006915//apoptotic process;GO:0034613//cellular protein localization;GO:0036353//histone H2A-K119 monoubiquitination;GO:0045892//negative regulation of transcription, DNA-templated
ENSMUSG000000000493451.361.360oApoh04.21032323750288e-04000000000000000000000apolipoprotein H [Source:MGI Symbol;Acc:MGI:88058]Organismal SystemsDigestive systemko04979//Cholesterol metabolismK17305GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005737//cytoplasm;GO:0005886//plasma membrane;GO:0009986//cell surface;GO:0034361//very-low-density lipoprotein particle;GO:0034364//high-density lipoprotein particle;GO:0042627//chylomicronGO:0005543//phospholipid binding;GO:0008201//heparin binding;GO:0008289//lipid binding;GO:0042802//identical protein binding;GO:0060230//lipoprotein lipase activator activityGO:0001937//negative regulation of endothelial cell proliferation;GO:0006641//triglyceride metabolic process;GO:0007596//blood coagulation;GO:0007597//blood coagulation, intrinsic pathway;GO:0010596//negative regulation of endothelial cell migration;GO:0010898//positive regulation of triglyceride catabolic process;GO:0016525//negative regulation of angiogenesis;GO:0030193//regulation of blood coagulation;GO:0030195//negative regulation of blood coagulation;GO:0031100//animal organ regeneration;GO:0031639//plasminogen activation;GO:0033033//negative regulation of myeloid cell apoptotic process;GO:0034197//triglyceride transport;GO:0034392//negative regulation of smooth muscle cell apoptotic process;GO:0051006//positive regulation of lipoprotein lipase activity;GO:0051917//regulation of fibrinolysis;GO:0051918//negative regulation of fibrinolysis;GO:0060268//negative regulation of respiratory burst
ENSMUSG000000000564620.950.000oNarf0.2849441144578220.1002561984924360.4171740198310760.2564260510306160.8115404358090030.1723848955638960.258943141645850.1652518350775140.1212984289024410.1798806559582080.24438310447350.3417412038186380.2357968629379080.1024697459882190.3035715567711380.07212312731790040.117346951140780.1108830161154160.152906859312610.1697885347048010.070430471038990300.288832138215002nuclear prelamin A recognition factor [Source:MGI Symbol;Acc:MGI:1914858]----GO:0005634//nucleus;GO:0005638//lamin filament;GO:0005654//nucleoplasm;GO:0005730//nucleolus;GO:0031981//nuclear lumenGO:0005521//lamin bindingGO:0008150//biological_process
ENSMUSG0000000005816226.880.001i92-114oCav20.01911258533462860.5438614511433540.02104110327086750.02707066284564630.02292468860055260.4834172364487650.03815134522405340.01613520354635680.2477221110595110.1155768626592960.003894725567899670.01718857110238520.04554914773119820.2851813350731720.1316938856653110.5428624597630931.002654891166870.07377341944535726.144287802575780.05894110237912730.048953376803931900.030977782734223caveolin 2 [Source:MGI Symbol;Acc:MGI:107571]Cellular Processes;Human Diseases;Cellular Processes;Human Diseases;Human DiseasesTransport and catabolism;Cancers;Cellular community - eukaryotes;Cardiovascular diseases;Infectious diseasesko04144//Endocytosis;ko05205//Proteoglycans in cancer;ko04510//Focal adhesion;ko05418//Fluid shear stress and atherosclerosis;ko05100//Bacterial invasion of epithelial cellsK12958;K12958;K12958;K12958;K12958GO:0000139//Golgi membrane;GO:0002080//acrosomal membrane;GO:0002095//caveolar macromolecular signaling complex;GO:0005634//nucleus;GO:0005635//nuclear envelope;GO:0005637//nuclear inner membrane;GO:0005737//cytoplasm;GO:0005783//endoplasmic reticulum;GO:0005794//Golgi apparatus;GO:0005811//lipid droplet;GO:0005829//cytosol;GO:0005886//plasma membrane;GO:0005887//integral component of plasma membrane;GO:0005901//caveola;GO:0005925//focal adhesion;GO:0009986//cell surface;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0030133//transport vesicle;GO:0031234//extrinsic component of cytoplasmic side of plasma membrane;GO:0031410//cytoplasmic vesicle;GO:0032991//protein-containing complex;GO:0042383//sarcolemma;GO:0045121//membrane raft;GO:0045202//synapse;GO:0048471//perinuclear region of cytoplasmGO:0000149//SNARE binding;GO:0005515//protein binding;GO:0019901//protein kinase binding;GO:0019905//syntaxin binding;GO:0030674//protein binding, bridging;GO:0031435//mitogen-activated protein kinase kinase kinase binding;GO:0031748//D1 dopamine receptor binding;GO:0042803//protein homodimerization activity;GO:0046982//protein heterodimerization activity;GO:0051219//phosphoprotein binding;GO:0060090//molecular adaptor activity;GO:0097110//scaffold protein bindingGO:0001937//negative regulation of endothelial cell proliferation;GO:0001938//positive regulation of endothelial cell proliferation;GO:0006897//endocytosis;GO:0006906//vesicle fusion;GO:0007005//mitochondrion organization;GO:0007029//endoplasmic reticulum organization;GO:0007088//regulation of mitotic nuclear division;GO:0007268//chemical synaptic transmission;GO:0008285//negative regulation of cell proliferation;GO:0008286//insulin receptor signaling pathway;GO:0016050//vesicle organization;GO:0019065//receptor-mediated endocytosis of virus by host cell;GO:0030154//cell differentiation;GO:0030512//negative regulation of transforming growth factor beta receptor signaling pathway;GO:0043410//positive regulation of MAPK cascade;GO:0043547//positive regulation of GTPase activity;GO:0044791//positive regulation by host of viral release from host cell;GO:0044794//positive regulation by host of viral process;GO:0048278//vesicle docking;GO:0048741//skeletal muscle fiber development;GO:0050731//positive regulation of peptidyl-tyrosine phosphorylation;GO:0051480//regulation of cytosolic calcium ion concentration;GO:0060161//positive regulation of dopamine receptor signaling pathway;GO:0070836//caveola assembly;GO:0071711//basement membrane organization;GO:1900182//positive regulation of protein localization to nucleus
ENSMUSG000000000783180.000.000oKlf68.508088100605815.008876584723651.977497398451773.744928378280370.7748435037008676.363783121082634.286463631185733.518242642906867.670338813222984.403521611005031.919990714056712.373742349779795.571086289284083.076869014155294.705764787302833.653773087807727.8219454985592912.45336566362831.631795490709042.0318482817621712.22926833712174.324511726237251.98599853771731Kruppel-like factor 6 [Source:MGI Symbol;Acc:MGI:1346318]----GO:0001650//fibrillar center;GO:0005654//nucleoplasm;GO:0005730//nucleolus;GO:0005829//cytosol;GO:0043231//intracellular membrane-bounded organelleGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:1990837//sequence-specific double-stranded DNA bindingGO:0045944//positive regulation of transcription by RNA polymerase II
ENSMUSG000000000857060.040.020oScmh10.01015913924561540.1068302436900620.244744910148590.1177122313517410.261630712640810.1711467549874140.1375074089435940.1828199327712050.3907000875067670.3540560142841240.1606172590945970.1082208720083020.1652344808540270.2760579838684080.448116474196220.0890564826552540.2122816606588150.0706107362072080.1007168928709480.2712630052315520.5803315555285410.01254642176051390.227190112686296sex comb on midleg homolog 1 [Source:MGI Symbol;Acc:MGI:1352762]----GO:0005634//nucleus;GO:0010369//chromocenterGO:0003682//chromatin binding;GO:0005515//protein binding;GO:0042393//histone bindingGO:0006338//chromatin remodeling;GO:0006355//regulation of transcription, DNA-templated;GO:0007275//multicellular organism development;GO:0007283//spermatogenesis;GO:0009952//anterior/posterior pattern specification;GO:0016458//gene silencing;GO:0045892//negative regulation of transcription, DNA-templated
ENSMUSG000000000881460.010.010oCox5a0.8107808899647335.075633366174092.257134805878251.611069594113043.388780890760812.803289032982443.075497965292093.378641363711121.630439045938881.750938303827362.633149137447521.801525934358512.519438150236514.292551053551512.363101382735614.216753399417783.492735906623211.853596943492392.918670936650942.947426151222092.414821218942261.167065678752521.1694126913861cytochrome c oxidase subunit 5A [Source:MGI Symbol;Acc:MGI:88474]Metabolism;Human Diseases;Human Diseases;Organismal Systems;Human Diseases;Human Diseases;Metabolism;Organismal SystemsGlobal and overview maps;Neurodegenerative diseases;Neurodegenerative diseases;Environmental adaptation;Endocrine and metabolic diseases;Neurodegenerative diseases;Energy metabolism;Circulatory systemko01100//Metabolic pathways;ko05010//Alzheimer disease;ko05016//Huntington disease;ko04714//Thermogenesis;ko04932//Non-alcoholic fatty liver disease (NAFLD);ko05012//Parkinson disease;ko00190//Oxidative phosphorylation;ko04260//Cardiac muscle contractionK02264;K02264;K02264;K02264;K02264;K02264;K02264;K02264GO:0005739//mitochondrion;GO:0005743//mitochondrial inner membrane;GO:0005751//mitochondrial respiratory chain complex IV;GO:0016020//membrane;GO:0043209//myelin sheathGO:0004129//cytochrome-c oxidase activity;GO:0005515//protein binding;GO:0046872//metal ion bindingGO:0006119//oxidative phosphorylation;GO:0006123//mitochondrial electron transport, cytochrome c to oxygen;GO:1902600//proton transmembrane transport
ENSMUSG000000000937112.750.140oTbx20.002153376983338670.005178073469712490.0026143043383071900.002443479270132740.00211733559498960.0021543645270954400.07538673502911380.002783699785934450.0055809800200915300.00629538183379437000.110184562669155000.5947894477400440000.030977782734223T-box 2 [Source:MGI Symbol;Acc:MGI:98494]Environmental Information ProcessingSignal transductionko04013//MAPK signaling pathway - flyK10176GO:0005634//nucleus;GO:0005667//transcription factor complexGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001227//DNA-binding transcription repressor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0005515//protein binding;GO:0043565//sequence-specific DNA binding;GO:1990837//sequence-specific double-stranded DNA bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0001708//cell fate specification;GO:0001947//heart looping;GO:0003007//heart morphogenesis;GO:0003148//outflow tract septum morphogenesis;GO:0003151//outflow tract morphogenesis;GO:0003203//endocardial cushion morphogenesis;GO:0003256//regulation of transcription from RNA polymerase II promoter involved in myocardial precursor cell differentiation;GO:0003272//endocardial cushion formation;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0007219//Notch signaling pathway;GO:0007275//multicellular organism development;GO:0007521//muscle cell fate determination;GO:0007569//cell aging;GO:0008016//regulation of heart contraction;GO:0008284//positive regulation of cell proliferation;GO:0035050//embryonic heart tube development;GO:0035909//aorta morphogenesis;GO:0036302//atrioventricular canal development;GO:0042733//embryonic digit morphogenesis;GO:0045892//negative regulation of transcription, DNA-templated;GO:0048596//embryonic camera-type eye morphogenesis;GO:0048738//cardiac muscle tissue development;GO:0051145//smooth muscle cell differentiation;GO:0060021//roof of mouth development;GO:0060045//positive regulation of cardiac muscle cell proliferation;GO:0060465//pharynx development;GO:0060560//developmental growth involved in morphogenesis;GO:0060596//mammary placode formation;GO:0072105//ureteric peristalsis;GO:0090398//cellular senescence;GO:1901208//negative regulation of heart looping;GO:1901211//negative regulation of cardiac chamber formation;GO:1905072//cardiac jelly development;GO:1905222//atrioventricular canal morphogenesis
ENSMUSG000000000945520.000.000oTbx400.002518385598820150000000.02433527991908330.0206892970682163000000.05907103047551830000000T-box 4 [Source:MGI Symbol;Acc:MGI:102556]----GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0005515//protein bindingGO:0001525//angiogenesis;GO:0001708//cell fate specification;GO:0002009//morphogenesis of an epithelium;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0007275//multicellular organism development;GO:0030324//lung development;GO:0030326//embryonic limb morphogenesis;GO:0035108//limb morphogenesis;GO:0035116//embryonic hindlimb morphogenesis;GO:0048705//skeletal system morphogenesis;GO:1990401//embryonic lung development
ENSMUSG0000000012042739.7917.912i13-31o253-275iNgfr07.27747462082611e-050000000.01756098800813140.0396519994149050000000000000nerve growth factor receptor (TNFR superfamily, member 16) [Source:MGI Symbol;Acc:MGI:97323]Environmental Information Processing;Human Diseases;Environmental Information Processing;Environmental Information Processing;Environmental Information Processing;Environmental Information Processing;Organismal Systems;Cellular ProcessesSignal transduction;Cancers;Signaling molecules and interaction;Signal transduction;Signal transduction;Signal transduction;Nervous system;Cell growth and deathko04151//PI3K-Akt signaling pathway;ko05202//Transcriptional misregulation in cancers;ko04060//Cytokine-cytokine receptor interaction;ko04010//MAPK signaling pathway;ko04014//Ras signaling pathway;ko04015//Rap1 signaling pathway;ko04722//Neurotrophin signaling pathway;ko04215//Apoptosis - multiple speciesK02583;K02583;K02583;K02583;K02583;K02583;K02583;K02583GO:0005634//nucleus;GO:0005635//nuclear envelope;GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0005739//mitochondrion;GO:0005791//rough endoplasmic reticulum;GO:0005794//Golgi apparatus;GO:0005886//plasma membrane;GO:0005911//cell-cell junction;GO:0009897//external side of plasma membrane;GO:0009986//cell surface;GO:0014069//postsynaptic density;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0030054//cell junction;GO:0030135//coated vesicle;GO:0030426//growth cone;GO:0032590//dendrite membrane;GO:0032809//neuronal cell body membrane;GO:0042995//cell projection;GO:0043197//dendritic spine;GO:0043204//perikaryon;GO:0045121//membrane raft;GO:0045202//synapse;GO:0045334//clathrin-coated endocytic vesicleGO:0001540//amyloid-beta binding;GO:0005035//death receptor activity;GO:0005168//neurotrophin TRKA receptor binding;GO:0005515//protein binding;GO:0005516//calmodulin binding;GO:0015026//coreceptor activity;GO:0031267//small GTPase binding;GO:0031625//ubiquitin protein ligase binding;GO:0042802//identical protein binding;GO:0043121//neurotrophin binding;GO:0044877//protein-containing complex binding;GO:0048406//nerve growth factor binding;GO:0070678//preprotein bindingGO:0001678//cellular glucose homeostasis;GO:0006886//intracellular protein transport;GO:0006915//apoptotic process;GO:0006919//activation of cysteine-type endopeptidase activity involved in apoptotic process;GO:0007165//signal transduction;GO:0007266//Rho protein signal transduction;GO:0007275//multicellular organism development;GO:0007399//nervous system development;GO:0007411//axon guidance;GO:0007417//central nervous system development;GO:0007623//circadian rhythm;GO:0009611//response to wounding;GO:0010468//regulation of gene expression;GO:0010941//regulation of cell death;GO:0010976//positive regulation of neuron projection development;GO:0010977//negative regulation of neuron projection development;GO:0016048//detection of temperature stimulus;GO:0016525//negative regulation of angiogenesis;GO:0019233//sensory perception of pain;GO:0021675//nerve development;GO:0030154//cell differentiation;GO:0031069//hair follicle morphogenesis;GO:0031643//positive regulation of myelination;GO:0032224//positive regulation of synaptic transmission, cholinergic;GO:0032922//circadian regulation of gene expression;GO:0034599//cellular response to oxidative stress;GO:0035025//positive regulation of Rho protein signal transduction;GO:0035907//dorsal aorta development;GO:0040037//negative regulation of fibroblast growth factor receptor signaling pathway;GO:0042488//positive regulation of odontogenesis of dentin-containing tooth;GO:0042593//glucose homeostasis;GO:0043065//positive regulation of apoptotic process;GO:0043066//negative regulation of apoptotic process;GO:0043410//positive regulation of MAPK cascade;GO:0043524//negative regulation of neuron apoptotic process;GO:0043588//skin development;GO:0045666//positive regulation of neuron differentiation;GO:0048146//positive regulation of fibroblast proliferation;GO:0048511//rhythmic process;GO:0051402//neuron apoptotic process;GO:0051799//negative regulation of hair follicle development;GO:0051897//positive regulation of protein kinase B signaling;GO:0051902//negative regulation of mitochondrial depolarization;GO:0051968//positive regulation of synaptic transmission, glutamatergic;GO:0061000//negative regulation of dendritic spine development;GO:1900182//positive regulation of protein localization to nucleus;GO:1901216//positive regulation of neuron death;GO:1902895//positive regulation of pri-miRNA transcription by RNA polymerase II;GO:1903588//negative regulation of blood vessel endothelial cell proliferation involved in sprouting angiogenesis;GO:1904646//cellular response to amyloid-beta;GO:2000179//positive regulation of neural precursor cell proliferation;GO:2000377//regulation of reactive oxygen species metabolic process;GO:2000463//positive regulation of excitatory postsynaptic potential;GO:2001235//positive regulation of apoptotic signaling pathway
ENSMUSG0000000012535515.3215.261i5-27oWnt30000.0086264761315481000.009097923671478140000000000000000wingless-type MMTV integration site family, member 3 [Source:MGI Symbol;Acc:MGI:98955]Human Diseases;Human Diseases;Human Diseases;Human Diseases;Human Diseases;Human Diseases;Environmental Information Processing;Human Diseases;Environmental Information Processing;Environmental Information Processing;Human Diseases;Human Diseases;Cellular Processes;Organismal Systems;Human DiseasesCancers;Infectious diseases;Neurodegenerative diseases;Cancers;Cancers;Cancers;Signal transduction;Endocrine and metabolic diseases;Signal transduction;Signal transduction;Cancers;Cancers;Cellular community - eukaryotes;Endocrine system;Cancersko05200//Pathways in cancer;ko05165//Human papillomavirus infection;ko05010//Alzheimer disease;ko05205//Proteoglycans in cancer;ko05225//Hepatocellular carcinoma;ko05206//MicroRNAs in cancer;ko04310//Wnt signaling pathway;ko04934//Cushing syndrome;ko04150//mTOR signaling pathway;ko04390//Hippo signaling pathway;ko05226//Gastric cancer;ko05224//Breast cancer;ko04550//Signaling pathways regulating pluripotency of stem cells;ko04916//Melanogenesis;ko05217//Basal cell carcinomaK00312;K00312;K00312;K00312;K00312;K00312;K00312;K00312;K00312;K00312;K00312;K00312;K00312;K00312;K00312GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005737//cytoplasm;GO:0005788//endoplasmic reticulum lumen;GO:0031012//extracellular matrixGO:0005102//signaling receptor binding;GO:0005109//frizzled binding;GO:0005125//cytokine activity;GO:0005515//protein binding;GO:0019904//protein domain specific binding;GO:0048018//receptor ligand activityGO:0000902//cell morphogenesis;GO:0001707//mesoderm formation;GO:0007165//signal transduction;GO:0007267//cell-cell signaling;GO:0007275//multicellular organism development;GO:0007276//gamete generation;GO:0007411//axon guidance;GO:0009887//animal organ morphogenesis;GO:0009948//anterior/posterior axis specification;GO:0009950//dorsal/ventral axis specification;GO:0009952//anterior/posterior pattern specification;GO:0010628//positive regulation of gene expression;GO:0016055//Wnt signaling pathway;GO:0030177//positive regulation of Wnt signaling pathway;GO:0030182//neuron differentiation;GO:0035115//embryonic forelimb morphogenesis;GO:0035116//embryonic hindlimb morphogenesis;GO:0044338//canonical Wnt signaling pathway involved in mesenchymal stem cell differentiation;GO:0044339//canonical Wnt signaling pathway involved in osteoblast differentiation;GO:0045165//cell fate commitment;GO:0048646//anatomical structure formation involved in morphogenesis;GO:0048697//positive regulation of collateral sprouting in absence of injury;GO:0048843//negative regulation of axon extension involved in axon guidance;GO:0050767//regulation of neurogenesis;GO:0060064//Spemann organizer formation at the anterior end of the primitive streak;GO:0060070//canonical Wnt signaling pathway;GO:0060173//limb development;GO:0060174//limb bud formation;GO:0060323//head morphogenesis;GO:0061180//mammary gland epithelium development;GO:0072089//stem cell proliferation;GO:1904954//canonical Wnt signaling pathway involved in midbrain dopaminergic neuron differentiation;GO:1905474//canonical Wnt signaling pathway involved in stem cell proliferation
ENSMUSG0000000012636518.3018.031i7-29oWnt9a000004.23592725421858e-0400000000000000000wingless-type MMTV integration site family, member 9A [Source:MGI Symbol;Acc:MGI:2446084]Human Diseases;Human Diseases;Human Diseases;Human Diseases;Human Diseases;Environmental Information Processing;Human Diseases;Environmental Information Processing;Environmental Information Processing;Human Diseases;Human Diseases;Cellular Processes;Organismal Systems;Human DiseasesCancers;Infectious diseases;Neurodegenerative diseases;Cancers;Cancers;Signal transduction;Endocrine and metabolic diseases;Signal transduction;Signal transduction;Cancers;Cancers;Cellular community - eukaryotes;Endocrine system;Cancersko05200//Pathways in cancer;ko05165//Human papillomavirus infection;ko05010//Alzheimer disease;ko05205//Proteoglycans in cancer;ko05225//Hepatocellular carcinoma;ko04310//Wnt signaling pathway;ko04934//Cushing syndrome;ko04150//mTOR signaling pathway;ko04390//Hippo signaling pathway;ko05226//Gastric cancer;ko05224//Breast cancer;ko04550//Signaling pathways regulating pluripotency of stem cells;ko04916//Melanogenesis;ko05217//Basal cell carcinomaK01064;K01064;K01064;K01064;K01064;K01064;K01064;K01064;K01064;K01064;K01064;K01064;K01064;K01064GO:0005576//extracellular region;GO:0005615//extracellular spaceGO:0005102//signaling receptor binding;GO:0005109//frizzled binding;GO:0005125//cytokine activity;GO:0005515//protein binding;GO:0048018//receptor ligand activityGO:0007275//multicellular organism development;GO:0008285//negative regulation of cell proliferation;GO:0016055//Wnt signaling pathway;GO:0030182//neuron differentiation;GO:0032331//negative regulation of chondrocyte differentiation;GO:0035115//embryonic forelimb morphogenesis;GO:0043154//negative regulation of cysteine-type endopeptidase activity involved in apoptotic process;GO:0045165//cell fate commitment;GO:0045597//positive regulation of cell differentiation;GO:0045880//positive regulation of smoothened signaling pathway;GO:0048704//embryonic skeletal system morphogenesis;GO:0048856//anatomical structure development;GO:0060070//canonical Wnt signaling pathway;GO:0060548//negative regulation of cell death;GO:0061037//negative regulation of cartilage development
ENSMUSG000000001278230.150.000oFer0.03799003062902340.1414610511242730.01490459784762510.00238452652945160.04980586643683890.134316306744840.007598790345815070.1145450269447160.2981321945930450.005265005451829650.018278648712306300.01153323064591150.0495605220641360.09913949236901960.1343362856291660.07879201746271680.2578013988728770.155022224327430.026874785001720.04166597223379610.1939772931704160fer (fms/fps related) protein kinase [Source:MGI Symbol;Acc:MGI:105917]Cellular ProcessesCellular community - eukaryotesko04520//Adherens junctionK08889GO:0000785//chromatin;GO:0005634//nucleus;GO:0005737//cytoplasm;GO:0005829//cytosol;GO:0005856//cytoskeleton;GO:0005886//plasma membrane;GO:0005938//cell cortex;GO:0015629//actin cytoskeleton;GO:0015630//microtubule cytoskeleton;GO:0016020//membrane;GO:0030027//lamellipodium;GO:0030054//cell junction;GO:0031234//extrinsic component of cytoplasmic side of plasma membrane;GO:0042995//cell projectionGO:0000166//nucleotide binding;GO:0003779//actin binding;GO:0004672//protein kinase activity;GO:0004713//protein tyrosine kinase activity;GO:0004714//transmembrane receptor protein tyrosine kinase activity;GO:0004715//non-membrane spanning protein tyrosine kinase activity;GO:0005102//signaling receptor binding;GO:0005154//epidermal growth factor receptor binding;GO:0005515//protein binding;GO:0005524//ATP binding;GO:0008092//cytoskeletal protein binding;GO:0008157//protein phosphatase 1 binding;GO:0008289//lipid binding;GO:0016301//kinase activity;GO:0016740//transferase activity;GO:0019901//protein kinase binding;GO:0031267//small GTPase binding;GO:0045295//gamma-catenin binding;GO:0045296//cadherin binding;GO:0050839//cell adhesion molecule bindingGO:0000226//microtubule cytoskeleton organization;GO:0001932//regulation of protein phosphorylation;GO:0006468//protein phosphorylation;GO:0006935//chemotaxis;GO:0007155//cell adhesion;GO:0007165//signal transduction;GO:0007169//transmembrane receptor protein tyrosine kinase signaling pathway;GO:0007260//tyrosine phosphorylation of STAT protein;GO:0008283//cell proliferation;GO:0008284//positive regulation of cell proliferation;GO:0010591//regulation of lamellipodium assembly;GO:0010762//regulation of fibroblast migration;GO:0016310//phosphorylation;GO:0018108//peptidyl-tyrosine phosphorylation;GO:0019221//cytokine-mediated signaling pathway;GO:0030154//cell differentiation;GO:0030335//positive regulation of cell migration;GO:0030838//positive regulation of actin filament polymerization;GO:0031532//actin cytoskeleton reorganization;GO:0032496//response to lipopolysaccharide;GO:0032869//cellular response to insulin stimulus;GO:0033007//negative regulation of mast cell activation involved in immune response;GO:0034446//substrate adhesion-dependent cell spreading;GO:0034614//cellular response to reactive oxygen species;GO:0035426//extracellular matrix-cell signaling;GO:0036006//cellular response to macrophage colony-stimulating factor stimulus;GO:0036119//response to platelet-derived growth factor;GO:0038028//insulin receptor signaling pathway via phosphatidylinositol 3-kinase;GO:0038095//Fc-epsilon receptor signaling pathway;GO:0038109//Kit signaling pathway;GO:0042058//regulation of epidermal growth factor receptor signaling pathway;GO:0043304//regulation of mast cell degranulation;GO:0044331//cell-cell adhesion mediated by cadherin;GO:0045087//innate immune response;GO:0046777//protein autophosphorylation;GO:0048008//platelet-derived growth factor receptor signaling pathway;GO:0050904//diapedesis;GO:0051092//positive regulation of NF-kappaB transcription factor activity;GO:0070102//interleukin-6-mediated signaling pathway
ENSMUSG0000000013111250.450.030oXpo61.064629555441320.3658552906999460.6131393042752650.9047374625926930.4808377460518260.5173170536309750.600361920853550.5441396774366690.45628340736910.4099344371608780.4892981125541280.6172412206227950.5906039401492290.3367521962915090.5783971866062940.2002012624005060.3967621502766731.119086083585660.2116804928143870.4085314361864560.165209496922520.4004920967961060.471894890317997exportin 6 [Source:MGI Symbol;Acc:MGI:2429950]----GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005730//nucleolus;GO:0005737//cytoplasm;GO:0005829//cytosol;GO:0005886//plasma membrane;GO:0032991//protein-containing complexGO:0005049//nuclear export signal receptor activity;GO:0031267//small GTPase bindingGO:0006611//protein export from nucleus;GO:0006886//intracellular protein transport;GO:0015031//protein transport
ENSMUSG000000001345720.010.000oTfe30.5084482747698010.4433365737656340.2379169174788470.2115282069273590.1264549960723850.6062887266433410.2370824149438770.1195584916219881.308663303577970.4493160549411730.1359291101207990.1128775175207451.109452246145170.3846904935721550.342801980261370.2593903225112190.528720184592230.1327724499880350.05348215655099480.160744899256970.5311816849170220.4997924062626650.89256479774068transcription factor E3 [Source:MGI Symbol;Acc:MGI:98511]Human Diseases;Human Diseases;Cellular ProcessesCancers;Cancers;Transport and catabolismko05202//Transcriptional misregulation in cancers;ko05211//Renal cell carcinoma;ko04137//Mitophagy - animalK09105;K09105;K09105GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005667//transcription factor complex;GO:0005737//cytoplasm;GO:0005829//cytosolGO:0000976//transcription regulatory region sequence-specific DNA binding;GO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0046982//protein heterodimerization activity;GO:0046983//protein dimerization activity;GO:1990837//sequence-specific double-stranded DNA bindingGO:0002250//adaptive immune response;GO:0002376//immune system process;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0006959//humoral immune response;GO:0045670//regulation of osteoclast differentiation;GO:0045785//positive regulation of cell adhesion;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0090336//positive regulation of brown fat cell differentiation;GO:0120163//negative regulation of cold-induced thermogenesis
ENSMUSG000000001428400.010.000oAxin20.001977042244024280.001844617199980630.008085179779185770.28834095499473702.49442520910018e-040.4816527569019860.03272697033255540.04214866530017510.01576616476660080.002527203080821870.2517722499009290.002387503805084190.06321652522961970.4394036349000560.035365687326687300.3217779052484740.03515352422868770.0155176753894061000.257958004436878axin 2 [Source:MGI Symbol;Acc:MGI:1270862]Human Diseases;Human Diseases;Human Diseases;Human Diseases;Environmental Information Processing;Human Diseases;Environmental Information Processing;Human Diseases;Human Diseases;Cellular Processes;Human Diseases;Human Diseases;Human DiseasesCancers;Infectious diseases;Neurodegenerative diseases;Cancers;Signal transduction;Endocrine and metabolic diseases;Signal transduction;Cancers;Cancers;Cellular community - eukaryotes;Cancers;Cancers;Cancersko05200//Pathways in cancer;ko05165//Human papillomavirus infection;ko05010//Alzheimer disease;ko05225//Hepatocellular carcinoma;ko04310//Wnt signaling pathway;ko04934//Cushing syndrome;ko04390//Hippo signaling pathway;ko05226//Gastric cancer;ko05224//Breast cancer;ko04550//Signaling pathways regulating pluripotency of stem cells;ko05210//Colorectal cancer;ko05217//Basal cell carcinoma;ko05213//Endometrial cancerK04385;K04385;K04385;K04385;K04385;K04385;K04385;K04385;K04385;K04385;K04385;K04385;K04385GO:0005634//nucleus;GO:0005737//cytoplasm;GO:0005813//centrosome;GO:0032991//protein-containing complexGO:0005515//protein binding;GO:0008013//beta-catenin binding;GO:0019899//enzyme binding;GO:0019901//protein kinase binding;GO:0031625//ubiquitin protein ligase binding;GO:0070411//I-SMAD bindingGO:0001756//somitogenesis;GO:0001934//positive regulation of protein phosphorylation;GO:0001957//intramembranous ossification;GO:0003139//secondary heart field specification;GO:0003413//chondrocyte differentiation involved in endochondral bone morphogenesis;GO:0008219//cell death;GO:0008283//cell proliferation;GO:0008285//negative regulation of cell proliferation;GO:0010718//positive regulation of epithelial to mesenchymal transition;GO:0010942//positive regulation of cell death;GO:0016055//Wnt signaling pathway;GO:0030111//regulation of Wnt signaling pathway;GO:0030282//bone mineralization;GO:0032423//regulation of mismatch repair;GO:0034613//cellular protein localization;GO:0042476//odontogenesis;GO:0043570//maintenance of DNA repeat elements;GO:0045668//negative regulation of osteoblast differentiation;GO:0045860//positive regulation of protein kinase activity;GO:0048255//mRNA stabilization;GO:0061181//regulation of chondrocyte development;GO:0070602//regulation of centromeric sister chromatid cohesion;GO:0090090//negative regulation of canonical Wnt signaling pathway;GO:2000054//negative regulation of Wnt signaling pathway involved in dorsal/ventral axis specification
ENSMUSG000000001488670.010.010oBrat10.08057391050862680.1263897137444450.1795955942783850.1745275234407830.1408381953798960.1206676693746530.1295382451544260.1329425082570290.1367558920478640.06538780268630460.1286579399898560.1287124502466330.1052419193086230.134410391666670.1515347357196140.09654381292715370.05761905285728080.2388921996822650.2769129212732490.137229084337951000.268730517037515BRCA1-associated ATM activator 1 [Source:MGI Symbol;Acc:MGI:1891679]----GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005737//cytoplasmGO:0005515//protein bindingGO:0001934//positive regulation of protein phosphorylation;GO:0006006//glucose metabolic process;GO:0006915//apoptotic process;GO:0006974//cellular response to DNA damage stimulus;GO:0008283//cell proliferation;GO:0010212//response to ionizing radiation;GO:0016477//cell migration;GO:0030307//positive regulation of cell growth;GO:0051646//mitochondrion localization

9 细胞周期分析

9.1 细胞周期评估

常规样本中,仅干细胞、祖细胞、癌细胞等具备增殖分化潜力的细胞才具有细胞分裂的能力,即处于细胞周期中。我们通过周期蛋白在细胞中的表达情况对细胞所处的细胞周期进行评分,以此直观体现各个样本中的细胞分裂/增殖活性。

Fig 9-1-1 不同样本的细胞周期指数评估

9.2 细胞周期推断

每一个细胞周期都有特征表达的周期蛋白,通过每个细胞周期的特征周期蛋白[22],我们可以对细胞的各个细胞周期进行评分,通过评分,我们可以推测细胞所处的细胞周期:

Tab 9-2-1 各细胞的细胞周期表
CellsSamplesClustersG1SG2MCellCycle.ScorePhase
NP_AAACCCAAGATACATGNP8-0.042145058805713-0.001292466610278110.002691366867349720.003638215266941160.00363821526694116M
NP_AAACCCAAGCACTCATNP80.0745961538603549-0.08336518368097030.05379410170959210.0420426215402310.0745961538603549G1
NP_AAACCCAAGCATTTCGNP10.07135896625075560.0386888974331032-0.07397842899663680.001222129524530460.0713589662507556G1
NP_AAACCCAAGCCGGAATNP10-0.0926926561442492-0.0863266063292219-0.02753962422265450.02533524149525540.0253352414952554M
NP_AAACCCAAGCTTAAGANP0-0.115470447729413-0.09763347230317960.0405116055191659-0.03549919968547120.0405116055191659G2
NP_AAACCCAAGCTTACGTNP0-0.0709996067974205-0.04755463179402360.06736270665012460.002590658350642290.0673627066501246G2
NP_AAACCCACATCAACCANP14-0.0154831399697472-0.0340019759283895-0.026268696653803-0.0654748336085478-0.0154831399697472non-cycling
NP_AAACCCATCAGATTGCNP19-0.00960479283899247-0.0420234053142075-0.0685928073484325-0.080740425497878-0.00960479283899247non-cycling
NP_AAACCCATCATGCCAANP0-0.0413919260518897-0.03434176403446530.0383181318803816-0.1026235928988230.0383181318803816G2
NP_AAACCCATCCCGTGAGNP4-0.1332074273491070.037406489419951-0.0252592704991908-0.02163790378373160.037406489419951S
NP_AAACCCATCCGGACGTNP3-0.0922808402020432-0.0652756262990237-0.0840624651014420.0180398927411980.018039892741198M
NP_AAACGAAAGCAGAAAGNP0-0.0669403797552773-0.05764361062003690.0987431243880099-0.02100526869050120.0987431243880099G2
NP_AAACGAAAGTGTTCACNP110.06869869944278780.002006906386513320.0817484070338133-0.05851884078650340.0817484070338133G2
NP_AAACGAACATCATCTTNP20.0882892355360060.121666862932965-0.0723995378626260.007875781532716980.121666862932965S
NP_AAACGAACATCCTGTCNP0-0.0581795015914253-0.04796083462472220.019768415170054-0.05309101974991270.019768415170054G2
NP_AAACGAAGTACAGAATNP7-0.117139047956149-0.0878542859671115-0.086414536386770.1347200942877250.134720094287725M
NP_AAACGAAGTATAGGATNP30.02423462667708610.21533897435891-0.0465427164399261-0.0602454540439650.21533897435891S
NP_AAACGAAGTTAATGAGNP0-0.0737528068255964-0.05531460511919730.0537150840871653-0.08031198459238760.0537150840871653G2
NP_AAACGAATCATTGGTGNP0-0.0750028697140771-0.06070159101109920.0639188160678494-0.09117341456135520.0639188160678494G2
NP_AAACGAATCGCGCCAANP1-0.02805034726797370.0249418845991424-0.141164555660366-0.01679848664212690.0249418845991424S

Fig 9-2-1 不同样本的不同细胞周期细胞的数量堆叠图 Fig 9-2-2 不同样本的不同细胞周期细胞的数量比例堆叠图

Fig 9-2-3 不同细胞周期细胞的tSNE分布

Fig 9-2-4 Seurat分群与细胞周期推断对应circos图

9.3 周期蛋白基因分布可视化

细胞周期推断是基于每个细胞周期的特征周期蛋白基因在细胞中的表达量,利用软件预测的结果。那么我们就可以通过观察细胞周期特征基因的表达分布,对预测结果进行初步验证,这个过程即为周期蛋白基因分布可视化。

周期蛋白基因:CellCycle.genes.xls

Tab 9-3-1 周期蛋白基因(前20行)
G1SG2M
Ccne1Abcc5Arl4aAhi1
Ccne2Asf1bAurkbAkirin2
Cdc25aAtad2Brd8Anln
Cdca7Brca1Bub3Anp32e
DtlCdkn2aipCasp3Arl6ip1
Ints8CenpqCcdc107Asxl1
Ivns1abpCrebzfCcna2Aurka
Mcm2DonsonCcnfBirc2
Mcm6Dscc1Cdc25cBirc5
NaspE2f8Cdca2Bub1
Plcxd1Exo1Cdca3Ccnb2
Skp2Ezh2Cdca8Cdc20
SlbpFen1Cdk1Cdc25b
UngHellsCdkn1bCdc27
Zranb2MastlCdkn2cCenpa
-Pkmyt1CenplCenpe
-Rbbp8Ckap2Cenpf
-Rfc2Ckap2lCep55
-Rrm2Dcaf7Cit
-Usp1Espl1Ckap5

热图可以同时展示大量基因在每个细胞中的表达量及其在细胞群体中的分布情况。使用热图展示周期蛋白基因在不同细胞周期的表达量,可以看到各时期特征蛋白基因相对集中表达在对应的时期中,是对细胞周期推断结果准确性的验证。

Fig 9-3-1 周期蛋白基因在不同细胞周期的表达量热图

广州基迪奥生物科技有限公司




10 个性化分析推荐

10.1 细胞亚群鉴定

细胞亚群鉴定是进行单细胞转录组分析的最基础一步,是赋予细胞数据以生物学意义的关键过程。细胞亚群鉴定主要借助marker基因在各个细胞亚群的表达情况来判断细胞亚群所属细胞类型。按照marker基因的查询方式,我们通常会遇到三种情况:

(1)人、小鼠常见组织的细胞类型注释。如今已经建立了许多数据库用以收集marker基因信息,例如Cell Marker(人、小鼠,http://bio-bigdata.hrbmu.edu.cn/CellMarker/),panglaoDB(人、小鼠,https://panglaodb.se/),MCA(小鼠,http://bis.zju.edu.cn/MCA/)。根据数据库内容,我们可以快速锁定组织类型和细胞类型,并获取相关的marker基因。

(2)人、小鼠罕见组织、稀有细胞类型及其他模式物种的细胞类型注释。这一类细胞通常没有成型的数据库可以快速查询。此时,我们需要从已有的单细胞文章、细胞生物学文章和分子生物学文章找寻相关细胞类型的marker基因。在找寻marker基因时,优先考虑荧光定量PCR(qPCR)和RNA荧光原位杂交(FISH)的结果,因为这些技术直接体现mRNA的表达水平,更容易在scRNA-seq数据中找到表达量分布情况;次级考虑蛋白免疫印迹(western blot)、流式细胞术(FAC)和免疫荧光(IF)的结果,因为蛋白水平和mRNA水平并不是完全同步的,可能出现蛋白高丰度但是mRNA低丰度的情况,使得scRNA-seq不具有相应marker基因的表达分布。

(3)人和模式物种的新细胞类型及非模式物种的细胞类型注释。这一类细胞通常缺乏前人的研究基础,无可直接利用的marker基因。为了完成细胞亚群注释,我们可以使用同源比对的方式将基因比对到具有marker基因信息的近缘物种上,然后使用同源比对得到的marker基因用于注释细胞亚群。当然,细胞类型最终是与细胞功能相关的,我们也可以通过细胞亚群上调基因的功能注释或富集的功能通路来确定细胞功能,并结合生物学背景推测细胞亚群所属的细胞类型。

10.2 拟时分析

细胞分化相关分析一直是研究人员广泛关注的问题,其与胚胎发育、组织修复、疾病发生等多个研究领域紧密相关。单细胞转录组的特征是获得了大量细胞的转录本“快照”,记录了样本中所有细胞的转录本,其中包含了多功能性较强的干细胞、过渡阶段的中间细胞和发育成熟的功能细胞,这为细胞分化相关分析提供了基础。

拟时分析通过分析关键基因的表达模式,将所有细胞按照发育时间的先后排布在拟时间轴上,模拟发育过程中的细胞分化过程。通过对细胞轨迹的分析,我们可以挖掘出细胞分化过程中经历的细胞类型变化、伴随发育过程变化的动态变化基因、祖细胞不同的分化命运等与生命发育息息相关的信息。

10.3 细胞周期分析

胚胎干细胞、肿瘤细胞、生殖细胞、植物根尖、芽尖等细胞除了具备多功能性的分化能力以外,还需具备自我增殖能力。而在增殖过程中,细胞进程必然涉及细胞周期。

细胞周期分析必然在细胞鉴定之后,对具有增殖能力的细胞进行细胞周期分析才具有切实的生物学意义。通过早期的细胞周期研究,我们已知了若干与细胞周期各个时期相关的周期蛋白基因,通过这些基因的表达情况,我们可以进一步推测细胞所处细胞周期。

对细胞周期的分析,既可以深入探索与细胞周期进行有关的新的标记基因,也可以侧面反映样本的细胞更新活性,对样本的表型做出关联解释。

细胞周期分析目前只能分析人和小鼠,其他物种可以同源比对,但存在一定误差。

10.4 WGCNA

单细胞转录组数据的特点是数据庞大,同一个细胞通常带有细胞类型、样本属性、表型特征等多级注释信息,导致单细胞转录组分析时可以进行比较方式多样而繁复。极高的复杂程度需求有效的简化方式。

权重基因共表达网络分析(weighted gene co-expression network analysis, WGCNA)可以将大量的基因简化成少量的具有相同表达模式的模块,并进一步找到与表型相关性最高的基因模块。这一分析极大得简化了数据挖掘过程,有利于从具有复杂样本设计和细胞组成的样本中快速锁定核心基因。

10.5 转录因子分析

转录因子是重要的基因调控元件,在外界刺激中,表达量一般优先发生变化,并进一步调控下游基因的表达完成对刺激的响应。所以,转录因子与靶基因之间具备潜在的共表达关系。

借助这一个基础,我们可以使用软件SCENIC将转录因子和靶基因构建为一个网络单位,通过对网络单位的表达活性的分析来研究不同细胞类型之间的转录调控差异。这种差异同时体现在转录因子的表达情况和转录因子的功能特性(靶基因的表达情况)上,对细胞的表型变化会有更加全面的解释度。

目前转录因子分析只能做人、小鼠和果蝇,其他物种的分析流程待开发。

10.6 细胞通讯分析

多细胞生命体的正常运转离不开多种细胞类型之间的有序合作,生命体的表型变化也不应该是由单一细胞类型的功能决定的。所以,为了获得更具有解释力的分子机制,我们往往需要从细胞间的分子信号交互去解释表型变化。

细胞通讯分析从细胞的配体-受体表达情况去推测细胞类型之间的互作关系。这些互作关系体现了下游细胞的激活、细胞信号转导和靶向细胞的杀伤,在宿主免疫、肿瘤微环境等领域都有广泛的应用前景。

目前细胞通讯分析理论上只能做人的,小鼠基因可以比对到人数据库进行参考分析,其他物种需要提供配受体信息数据库才能进行。

广州基迪奥生物科技有限公司




11 目录结构

结果文件夹
├── 1.Expression                                            定量结果文件夹
│   ├── barcode_plot                                            有效细胞鉴定图文件夹
│   │   └── *.barcode_plot.{pdf,png}                                有效细胞鉴定图
│   ├── CellRanger_Report                                       Cell Ranger报告文件夹
│   │   └── CellRanger.*.result.html                                Cell Ranger count结果报告
│   ├── expressions                                             表达量结果文件夹
│   │   ├── *                                                       各样本结果文件夹
│   │   │   ├── expression.xls                                          表达量矩阵表
│   │   │   ├── barcodes.tsv                                            细胞barcode ID表
│   │   │   ├── genes.tsv                                               基因ID与名称表
│   │   │   └── matrix.mtx                                              表达量稀疏矩阵
│   │   └── *.expression.demo.xls                                   表达量矩阵示例表
│   ├── samples.align.stat.xls                                  各样本比对结果统计表
│   └── samples.sequence.stat.xls                               各样本测序数据统计表
├── 2.Cluster                                               聚类结果文件夹
│   ├── 1.QC                                                    质控结果文件夹
│   │   ├── AfterFilter.BasicInfo.merge.{pdf,png}                   过滤后各个样本细胞基本信息的分布图
│   │   ├── AfterFilter.BasicInfo.nUMI-nGene.{pdf,png}              过滤后各个样本细胞基本信息的分布散点图
│   │   ├── AfterFilter.BasicInfo.nUMI-pMito.{pdf,png}              过滤后各个样本细胞基本信息的分布散点图
│   │   ├── AfterFilter.BasicInfo.PresetMarker.{pdf,png}            过滤后各个样本细胞中预设标记基因的表达量分布
│   │   ├── BasicInfo.merge.{pdf,png}                               过滤前各个样本细胞基本信息的分布图
│   │   ├── BasicInfo.nUMI-nGene.{pdf,png}                          过滤前各个样本细胞基本信息的分布散点图
│   │   ├── BasicInfo.nUMI-pMito.{pdf,png}                          过滤前各个样本细胞基本信息的分布散点图
│   │   ├── BasicInfo.PresetMarker.{pdf,png}                        过滤前各个样本细胞中预设标记基因的表达量分布
│   │   └── Filter.stat.xls                                         过滤前后各个样本中细胞数据量统计表
│   ├── 2.cluster                                               分群结果文件夹
│   │   ├── AllGene.avg_exp.annot.xls                               基因在各个亚群中表达量的均值表
│   │   ├── Cells.cluster.list.xls                                  细胞与亚群对照表
│   │   ├── tSNE_*.{pdf,png}                                        各样本单细胞亚群分类tSNE图
│   │   └── tSNE.{pdf,png}                                          单细胞亚群分类tSNE图
│   ├── 3.cluster_stat                                          分群结果的统计结果文件夹
│   │   ├── Cluster.stat.inSamples.pct.{pdf,png}                    各亚群中各个样本细胞数量百分比堆叠图
│   │   ├── Cluster.stat.inSamples.{pdf,png}                        各亚群中各个样本细胞数量堆叠图
│   │   ├── Cluster.stat.bySamples.pct.{pdf,png}                    各样本中各亚群细胞数量百分比堆叠图
│   │   ├── Cluster.stat.bySamples.{pdf,png}                        各样本中各亚群细胞数量堆叠图
│   │   ├── Cluster.stat.Sample.xls                                 各样本中各亚群细胞数量统计表
│   │   ├── Cluster.stat.xls                                        细胞亚群分类结果统计表
│   │   ├── Cluster.cor.heatmap.{pdf,png}                           各亚群相关性系数热图
│   │   ├── PresetMarker.Distribution.{pdf,png}                     已知标记基因在各个细胞亚群中的表达分布
│   │   ├── PresetMarker.DotPlot.{pdf,png}                          已知标记基因在各个细胞亚群中的表达分布气泡图
│   │   ├── PresetMarker.Heatmap.{pdf,png}                          已知标记基因在各个亚群的表达量热图
│   │   └── PresetMarker.VlnPlot.{pdf,png}                          已知标记基因在各个细胞亚群中的表达分布小提琴图
│   └── 4.CellAnnotation                                        单细胞亚群鉴定结果文件夹
│       ├── Cell.annotation.stat.xls                                各样本在各个细胞类型中细胞数量统计表
│       ├── Cells.annotation.circos.{pdf,png}                       Seurat分群与singleR细胞鉴定对应circos图
│       ├── Cells.annotation.{pdf,png}                              各细胞类型在tSNE图的分布
│       ├── Cluster.correlation.heatmap.{pdf,png}                   Seurat分群与singleR鉴定细胞类型相关性热图
│       ├── Cluster.sample.singleR.stat.pct.{pdf,png}               各细胞类型中各样本细胞数量百分比堆叠图
│       ├── Cluster.sample.singleR.stat.{pdf,png}                   各细胞类型中各样本细胞数量堆叠图
│       ├── Cluster.stat.sample.singleR.xls                         各样本在各个细胞类型中细胞数量统计表
│       ├── Sample.cluster.singleR.stat.pct.{pdf,png}               各样本中各细胞类型数量百分比堆叠图
│       └── Sample.cluster.singleR.stat.{pdf,png}                   各样本中各细胞类型数量堆叠图
├── 3.MarkerGene                                            亚群上调表达基因分析结果文件夹
│   ├── DeGene.list.xls                                         各亚群差异基因注释表
│   ├── DeGene.stat.{pdf,png}                                   各亚群上调基因数量统计柱状图
│   ├── DeGene.stat.xls                                         各亚群上调基因数量统计表
│   ├── Enrichment                                              上调表达基因富集分析结果文件夹
│   │   ├── GO                                                      GO功能富集分析结果文件夹
│   │   ├── KO                                                      KO功能富集分析结果文件夹
│   │   ├── DO                                                      DO功能富集分析结果文件夹
│   │   └── Reactome                                                Reactome功能富集分析结果文件夹
│   ├── Plots                                                   上调基因表达分布结果文件夹
│   │   ├── Top.DotPlot.{pdf,png}                                   标记基因表达分布气泡图
│   │   ├── Top.Heatmap.{pdf,png}                                   标记基因表达热图
│   │   ├── ExpPlot                                                 标记基因表达分布图文件夹
│   │   ├── DensityPlot                                             标记基因表达分布密度图文件夹
│   │   └── ViolinPlot                                              标记基因表达分布小提琴图文件夹
│   └── String                                                  蛋白质互作网络分析结果文件夹
│       ├── Top.aln.links.xls                                       标记基因与String蛋白对应表及析构关系表
│       ├── Top.edge.tsv                                            Cytoscape绘图文件--连接信息文件
│       └── Top.node.tsv                                            Cytoscape绘图文件--节点信息文件
├── 4.GSVA                                                  GSVA分析
│   ├── *.gsva.xls                                              各细胞富集程度表
│   └── *.heatmap.cluster.{xls,pdf,png}                         各个亚群中的富集分数热图
├── 5.TF                                                    转录因子注释结果文件夹
│   └── TF.annot.xls                                            转录因子注释表
├── 6.CellCycle                                             细胞周期分析
│   ├── CellCycle.annot.xls                                     各细胞的细胞周期表
│   ├── CellCycle.boxplot.{pdf,png}                             不同样本的细胞周期指数评估盒形图
│   ├── CellCycle.DotPlot.{pdf,png}                             周期蛋白基因的分布气泡图
│   ├── CellCycle.Heatmap.{pdf,png}                             周期蛋白基因在不同细胞周期的表达量热图
│   ├── Phase.stat.bySamples.pct.{pdf,png}                      不同样本的不同细胞周期细胞的数量比例堆叠图
│   ├── Phase.stat.bySamples.{pdf,png}                          不同样本的不同细胞周期细胞的数量堆叠图
│   ├── CellCycle.Cluster.tSNE.{pdf,png}                        不同细胞周期细胞与分群的tSNE分布
│   └── CellCycle.Samples.tSNE.{pdf,png}                        不同细胞周期细胞与样本的tSNE分布
├── 7.RNAVelocity                                           RNA速率分析
│   ├── velocity.trajectory.tSNE.{pdf,png}                      RNA速率轨迹图
│   └── velocity.tSNE.{pdf,png}                                 RNA速率分布图
├── index.html                                              单细胞分析结果网页版报告
├── scRNA-seq_method.pdf                                    单细胞分析方法说明(英文版)
└── src                                                     网页版报告系统文件文件夹

广州基迪奥生物科技有限公司




12 参考文献

广州基迪奥生物科技有限公司




13 附录

13.1 分析方法英文文档

scRNA-seq分析方法文档(英文):scRNA-seq_method.pdf

13.2 结果文件查看

*.xls,*.txt :结果数据表格文件,文件以制表符(Tab)分隔。unix/Linux/Mac用户使用 less 或 more 命令查看;windows用户使用高级文本编辑器Notepad++ 等查看,也可以用Microsoft Excel打开。

*.png:结果图像文件,位图,无损压缩。

*.pdf:结果图像文件,矢量图,可以放大和缩小而不失真,方便用户查看和编辑处理,可使用Adobe Illustrator进行图片编辑,用于文章发表等。

13.3 文章引用与致谢

如果您的研究课题使用了基迪奥的测序和分析服务,我们期望您在论文发表时,在Method部分或Acknowledgements部分引用或提及基迪奥公司。以下语句可供参考:

广州基迪奥生物科技有限公司




帮助文档