收起 展开

1 项目概述

项目编号GHR20100418_sstd_2
项目内容Homo_sapiens --- 2 scRNA-seq
参考基因组GRCh38
样品信息AdjacNormal¦FBBST

2 项目流程

2.1 实验流程
Fig 2-1-1 10x单细胞实验流程图

  1. 细胞质检
    取少量单细胞悬液,加入等体积0.4%台盼蓝染液,用 Countess® II Automated Cell Counter 对细胞计数,将活细胞浓度调整到理想浓度(1000~2000个/μL)。
  2. 10X 标记 cDNA 片段
    含有 barcode 信息的凝胶珠与细胞和酶的混合物结合,进入储液器中被油分隔开,形成GEMs(Gel Beads-In-Emulsions)。之后,凝胶珠溶解释放含有Barcode序列的捕获序列,逆转录 cDNA 片段,并对样本进行标记。将凝胶珠破碎并打碎油滴,以 cDNA 为模板进行 PCR 扩增。将所有GEMs的产物混合,构建标准测序文库。
  3. 标准测序文库建库
    首先将 cDNA 酶切打断成 200~300bp 左右的片段,然后经过末端修复、加A尾、测序接头 P5 、P7和sample index等常规二代测序文库构建步骤,最后进行 PCR 扩增得到 DNA 文库。
  4. 文库测序
    利用Illumina测序平台的PE150测序模式对建好的文库进行高通量测序。

2.2 分析流程
Fig 2-2-1 10x单细胞分析流程图

广州基迪奥生物科技有限公司




3 测序数据质控和表达量定量

3.1 测序数据基本质控

使用cellranger[1],我们可以对测序质量进行质控,去除测序质量低的reads,并对每个样本测的reads数和测序质量进行初步统计。

Tab 3-1-1 各样本测序数据统计表
SampleNumber of ReadsValid BarcodesSequencing SaturationQ30 Bases in BarcodeQ30 Bases in RNA ReadQ30 Bases in UMI
AdjacNormal304,261,17197.9%55.0%96.1%91.6%94.9%
FBBST291,896,69397.7%53.9%96.3%92.1%95.2%

3.2 数据定量

使用cellranger,我们将reads与参考基因组进行比对,将reads注释为特定基因;再对UMI进行修正和统计后,获得未过滤的feature-barcode矩阵;根据未过滤的feature-barcode矩阵,cellranger对数据中的细胞和非细胞进行识别和区分,并绘制为rank-plot图,直观体现有效细胞鉴定结果。

Tab 3-2-1 各样本比对结果统计表
SampleEstimated Number of CellsFraction Reads in CellsMean Reads per CellMedian Genes per CellTotal Genes DetectedMedian UMI Counts per CellReads Mapped Confidently to GenomeReads Mapped Confidently to Intergenic RegionsReads Mapped Confidently to Intronic RegionsReads Mapped Confidently to Exonic RegionsReads Mapped Confidently to Transcriptome
AdjacNormal8,96791.2%33,9311,69927,1583,89194.3%4.0%12.5%77.8%72.4%
FBBST11,78892.6%24,7621,66526,8273,53694.1%4.1%19.4%70.6%64.8%

  • AdjacNormal有效细胞
  • FBBST有效细胞

Fig 3-2-1 有效细胞鉴定图


各样本质控、定量结果报告:

3.3 最终鉴定细胞表达量矩阵

基于UMI修正和有效细胞鉴定后的结果,我们可以使用UMI条数对基因进行定量,获得如下的细胞-基因表达量定量结果。

Tab 3-3-1 样本AdjacNormal所有细胞各个基因UMI丰度信息表(示例,前10个细胞,前20个基因)
GeneIDNameAAACCCAAGAAGATCT-1AAACCCAAGACTCGAG-1AAACCCACACTGGATT-1AAACCCACAGAGGCTA-1AAACCCAGTTTGTTCT-1AAACCCATCGGAAGGT-1AAACCCATCTAGCCAA-1AAACGAAAGAGGATCC-1AAACGAAAGCACTCCG-1
ENSG00000284662OR4F16000000000
ENSG00000186827TNFRSF4000000000
ENSG00000186891TNFRSF18002000000
ENSG00000160072ATAD3B000000000
ENSG00000041988THAP3110000000
ENSG00000260179ENSG00000260179000000000
ENSG00000234396ENSG00000234396000000000
ENSG00000228037ENSG00000228037000000000
ENSG00000142611PRDM16000000000
ENSG00000067606PRKCZ000000000
ENSG00000131584ACAP3000001100
ENSG00000227589TP73-AS3000000000
ENSG00000237402CAMTA1-IT1000000000
ENSG00000284616ENSG00000284616000000000
ENSG00000169972PUSL1000000000
ENSG00000157911PEX10000001000
ENSG00000224051CPTP000000000
ENSG00000228750LINC01672000000000
ENSG00000238260ENSG00000238260000000000
ENSG00000260972ENSG00000260972000000000

备注:由于单个细胞在某个瞬间,只有小部分基因表达,因此表中大量基因UMI丰度为0。

UMI定量总表文件:

备注:文件较大,Excel可能难以打开,建议用软件notepad++ 打开,或联系基迪奥技术人员提取您所关心的基因的UMI丰度。

广州基迪奥生物科技有限公司




4 单细胞亚群分类

在cellranger完成基因表达量鉴定后,我们将表达量矩阵转入Seurat[2]进行后续的分析。

4.1 非正常细胞的进一步过滤

cellranger的细胞过滤是根据基因表达量进行自动识别,会有部分非正常细胞残留,所以在进行亚群分类之前,我们首先使用Seurat对非正常细胞进行进一步过滤。

我们进行过滤的指标主要为以下三个:

  1. 单细胞中鉴定到的基因数量(200-7500)。对于同一种细胞来说,表达基因的数量一般维持在一定范围内,如果该值过高,可能是一个GEM中包裹了多种细胞类型,这样的barcode应该剔除
  2. 单细胞中UMI的总数(小于40000)。单个细胞中可以存在的mRNA总量是有限的,如果UMI总数过高,则可能是两个或两个以上的细胞进入同一个GEM中,这样的细胞应该剔除
  3. 单细胞中线粒体基因表达量比例(小于10%)。细胞凋亡通常伴随着线粒体基因的高表达,所以线粒体基因的高表达意味着细胞状态不佳,这些细胞在实验过程中受到了不良刺激,不利于后续分析反应真实的细胞情况,这样的细胞应该剔除
Tab 4-1-1 过滤前后各个样本中细胞数据量统计表
Samplesbefore_filter_numafter_filter_numpctbefore_filter_median_UMI_per_cellafter_filter_median_UMI_per_cellbefore_filter_median_genes_per_cellafter_filter_median_genes_per_cellbefore_filter_median_MT_per_cellafter_filter_median_MT_per_cell
AdjacNormal8967758484.58%38913518169915782.507624534056252.41970054396315
FBBST117881047388.84%35363285166515812.450755349273462.3971669844729

Fig 4-1-1 过滤前后各个样本细胞基本信息的分布图

  • nUMI与nGene的关系
  • nUMI与pMito的关系

Fig 4-1-2 过滤前后各个样本细胞基本信息的分布散点图


4.2 单细胞亚群分类

在去除低质量细胞后,我们需要进行批次效应矫正。首先对所有样本进行典型相关分析(Canonical Correspondence Analysis, CCA),然后寻找细胞间的最近邻接关系(Mutual Nearest Neighbors, MNN),以此构建细胞间的对应关系,最后,多个样本以细胞间对应关系作为锚点(anchors)完成数据整合并完成批次效应矫正[3]

Tab 4-2-1 细胞亚群分类结果统计表
ClusterCells numberMedian Genes per CellMedian UMI Counts per Cell
0289513792991
1268117683646
2190411252055
318803194.510481.5
4169625828168
5155514922853
614264260.513391.5
712481313.52612.5
87742839.57368.5
9516488.5611
10339459122753
1133715413173
12245628919
132301363.52707.5
1415212412808.5
1512721575201
163310191815
171912462474

Tab 4-2-2 各样本在各个亚群中细胞数量统计表
ClusterAdjacNormalFBBST
Total7584 (100%)10473 (100%)
01463 (19.29%)1432 (13.67%)
1504 (6.65%)2177 (20.79%)
2779 (10.27%)1125 (10.74%)
31046 (13.79%)834 (7.96%)
4731 (9.64%)965 (9.21%)
5483 (6.37%)1072 (10.24%)
61074 (14.16%)352 (3.36%)
7602 (7.94%)646 (6.17%)
8209 (2.76%)565 (5.39%)
916 (0.21%)500 (4.77%)
10278 (3.67%)61 (0.58%)
11120 (1.58%)217 (2.07%)
1248 (0.63%)197 (1.88%)
1389 (1.17%)141 (1.35%)
1458 (0.76%)94 (0.9%)
1538 (0.5%)89 (0.85%)
1631 (0.41%)2 (0.02%)
1715 (0.2%)4 (0.04%)

Fig 4-2-1 各样本中各亚群细胞数量堆叠图 Fig 4-2-2 各样本中各亚群细胞数量百分比堆叠图

Fig 4-2-3 各亚群中各个样本细胞数量堆叠图 Fig 4-2-4 各亚群中各个样本细胞数量百分比堆叠图

进一步,我们计算两个细胞亚群之间相关性并绘制成热图。图中具有高度相关性的两个细胞亚群具有比较相似的基因表达模式,可能是同一种细胞类型。这张相关性热图为人工细胞亚群鉴定提供了一定的指导作用。

Fig 4-2-5 各亚群相关性热图

基因在各个亚群中表达量的均值表:2.Cluster/2.cluster/AllGene.avg_exp.annot.xls

Tab 4-2-3 基因在各个亚群中表达量的均值表(前20行)
Gene_IDGene_nameCluster 0Cluster 1Cluster 2Cluster 3Cluster 4Cluster 5Cluster 6Cluster 7Cluster 8Cluster 9Cluster 10Cluster 11Cluster 12Cluster 13Cluster 14Cluster 15Cluster 16Cluster 17DescriptionKEGG_A_classKEGG_B_classPathwayK_IDGO ComponentGO FunctionGO Process
ENSG00000284662OR4F16000000000000000000olfactory receptor family 4 subfamily F member 16 [Source:HGNC Symbol;Acc:HGNC:15079]Organismal SystemsSensory systemko04740//Olfactory transductionK04257GO:0005886//plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0004930//G protein-coupled receptor activity;GO:0004984//olfactory receptor activity;GO:0005515//protein bindingGO:0007165//signal transduction;GO:0007186//G protein-coupled receptor signaling pathway;GO:0007608//sensory perception of smell;GO:0050896//response to stimulus;GO:0050911//detection of chemical stimulus involved in sensory perception of smell
ENSG00000186827TNFRSF40.5713191343106790.8548047759264410.08869456082661070.04118131813246740.04597224102429710.1328497596698740.04557931364103570.1080248776144920.6712059042382650.09509018340142340.029792645898037910.29615922164720.4385041752786410.3014136947279880.01635953806238291.111027512557920.125947756870450.535134098611364TNF receptor superfamily member 4 [Source:HGNC Symbol;Acc:HGNC:11918]Environmental Information ProcessingSignaling molecules and interactionko04060//Cytokine-cytokine receptor interactionK05142GO:0005886//plasma membrane;GO:0005887//integral component of plasma membrane;GO:0009897//external side of plasma membrane;GO:0009986//cell surface;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0001618//virus receptor activity;GO:0005031//tumor necrosis factor-activated receptor activity;GO:0005515//protein bindingGO:0002639//positive regulation of immunoglobulin production;GO:0006954//inflammatory response;GO:0006955//immune response;GO:0016032//viral process;GO:0030890//positive regulation of B cell proliferation;GO:0033209//tumor necrosis factor-mediated signaling pathway;GO:0042098//T cell proliferation;GO:0043433//negative regulation of DNA-binding transcription factor activity;GO:0045892//negative regulation of transcription, DNA-templated;GO:0046718//viral entry into host cell
ENSG00000186891TNFRSF180.4922226314686370.5503034138655350.033063418791920.01476767166196660.02800600127399410.283919436109130.02427975355417730.3620124305730590.9921461106800630.05383133393097170.02366554127706679.837789567836480.2562829659185790.201116530194220.03965347786751743.017459211669690.25189551374090.60916179337232TNF receptor superfamily member 18 [Source:HGNC Symbol;Acc:HGNC:11914]Environmental Information ProcessingSignaling molecules and interactionko04060//Cytokine-cytokine receptor interactionK05154GO:0005576//extracellular region;GO:0005886//plasma membrane;GO:0005887//integral component of plasma membrane;GO:0009897//external side of plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0005031//tumor necrosis factor-activated receptor activity;GO:0005515//protein bindingGO:0002687//positive regulation of leukocyte migration;GO:0006915//apoptotic process;GO:0007165//signal transduction;GO:0033209//tumor necrosis factor-mediated signaling pathway;GO:0042531//positive regulation of tyrosine phosphorylation of STAT protein;GO:0043066//negative regulation of apoptotic process;GO:0045589//regulation of regulatory T cell differentiation;GO:0045785//positive regulation of cell adhesion
ENSG00000160072ATAD3B0.1386327798636340.1594799202160270.1664620622595810.1071871297435990.171137125069470.1761856210619740.1225663492653990.1616004390907280.270535283485970.1148826975400630.2031400506941170.1533686375514290.2069369168271690.0868248265800960.01855840426770280.191024455101930.9552572394766690ATPase family AAA domain containing 3B [Source:HGNC Symbol;Acc:HGNC:24007]----GO:0005739//mitochondrion;GO:0005743//mitochondrial inner membrane;GO:0005886//plasma membrane;GO:0016020//membrane;GO:0030667//secretory granule membrane;GO:0101003//ficolin-1-rich granule membraneGO:0000166//nucleotide binding;GO:0005524//ATP binding;GO:0008270//zinc ion binding;GO:0016887//ATPase activityGO:0007005//mitochondrion organization;GO:0043312//neutrophil degranulation
ENSG00000041988THAP30.2225760901250240.2542181816771490.3247685203337130.1589685706860770.1406697967550020.286249119400280.2293564576210540.2394273606613560.2014222732843170.1743626825328120.2023808905205160.247358123968580.266399747507730.2299208317942150.1688529483774520.06556812067360360.3874738668583250.469087156393658THAP domain containing 3 [Source:HGNC Symbol;Acc:HGNC:20855]-----GO:0003677//DNA binding;GO:0005515//protein binding;GO:0046872//metal ion bindingGO:0045944//positive regulation of transcription by RNA polymerase II
ENSG00000260179ENSG000002601790.001764163142750790.0021610379551740200.001235133163330430000000.002069926676987320000000novel transcript-------
ENSG00000234396ENSG00000234396000000000000000000novel transcript-------
ENSG00000228037ENSG000002280370.04351628052627020.1069236191944470.02860911657485030.004351749168065040.001477359315215230.04759321006883364.34891331528534e-040.02029257051869170.061219080168029400.004150470368071420.02928179153263340.01136628419120360.03117890710679680.07801904580141030.024851369309625100novel transcript-------
ENSG00000142611PRDM1600.006306160448709090000009.10621415338799e-04000000000PR/SET domain 16 [Source:HGNC Symbol;Acc:HGNC:14000]Organismal SystemsEnvironmental adaptationko04714//ThermogenesisK22410GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0005829//cytosol;GO:0016235//aggresome;GO:0017053//transcriptional repressor complexGO:0000976//transcription regulatory region sequence-specific DNA binding;GO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0001227//DNA-binding transcription repressor activity, RNA polymerase II-specific;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003712//transcription coregulator activity;GO:0003713//transcription coactivator activity;GO:0005515//protein binding;GO:0008168//methyltransferase activity;GO:0016740//transferase activity;GO:0033613//activating transcription factor binding;GO:0043565//sequence-specific DNA binding;GO:0046872//metal ion binding;GO:0046974//histone methyltransferase activity (H3-K9 specific)GO:0000122//negative regulation of transcription by RNA polymerase II;GO:0006357//regulation of transcription by RNA polymerase II;GO:0030154//cell differentiation;GO:0030512//negative regulation of transforming growth factor beta receptor signaling pathway;GO:0032259//methylation;GO:0043457//regulation of cellular respiration;GO:0045892//negative regulation of transcription, DNA-templated;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0050873//brown fat cell differentiation;GO:0051567//histone H3-K9 methylation;GO:0070828//heterochromatin organization;GO:0120162//positive regulation of cold-induced thermogenesis
ENSG00000067606PRKCZ0.1648576179073620.1119793178535250.1767157638197620.005468853256106290.01648159906749670.2113317825242520.02348215387550970.1253391091647370.08160662749430330.02457692984870990.01754389495820230.1341844289958940.0404811011225850.07514096668152520.1045194698357060.1702121327743250.3832017290939510protein kinase C zeta [Source:HGNC Symbol;Acc:HGNC:9412]Human Diseases;Cellular Processes;Environmental Information Processing;Organismal Systems;Organismal Systems;Cellular Processes;Environmental Information Processing;Organismal Systems;Human Diseases;Organismal Systems;Organismal Systems;Environmental Information Processing;Human Diseases;Human Diseases;Human DiseasesInfectious diseases;Transport and catabolism;Signal transduction;Immune system;Development;Cellular community - eukaryotes;Signal transduction;Endocrine system;Cardiovascular diseases;Endocrine system;Immune system;Signal transduction;Endocrine and metabolic diseases;Endocrine and metabolic diseases;Endocrine and metabolic diseasesko05165//Human papillomavirus infection;ko04144//Endocytosis;ko04015//Rap1 signaling pathway;ko04062//Chemokine signaling pathway;ko04360//Axon guidance;ko04530//Tight junction;ko04390//Hippo signaling pathway;ko04910//Insulin signaling pathway;ko05418//Fluid shear stress and atherosclerosis;ko04926//Relaxin signaling pathway;ko04611//Platelet activation;ko04071//Sphingolipid signaling pathway;ko04931//Insulin resistance;ko04933//AGE-RAGE signaling pathway in diabetic complications;ko04930//Type II diabetes mellitusK18952;K18952;K18952;K18952;K18952;K18952;K18952;K18952;K18952;K18952;K18952;K18952;K18952;K18952;K18952GO:0001725//stress fiber;GO:0005634//nucleus;GO:0005635//nuclear envelope;GO:0005737//cytoplasm;GO:0005768//endosome;GO:0005815//microtubule organizing center;GO:0005829//cytosol;GO:0005886//plasma membrane;GO:0005911//cell-cell junction;GO:0005923//bicellular tight junction;GO:0005938//cell cortex;GO:0014069//postsynaptic density;GO:0016020//membrane;GO:0016324//apical plasma membrane;GO:0016363//nuclear matrix;GO:0030054//cell junction;GO:0031252//cell leading edge;GO:0031982//vesicle;GO:0035748//myelin sheath abaxonal region;GO:0043203//axon hillock;GO:0043231//intracellular membrane-bounded organelle;GO:0045121//membrane raft;GO:0045179//apical cortex;GO:0048471//perinuclear region of cytoplasm;GO:0070062//extracellular exosome;GO:0098685//Schaffer collateral - CA1 synapse;GO:0098978//glutamatergic synapseGO:0000166//nucleotide binding;GO:0004672//protein kinase activity;GO:0004674//protein serine/threonine kinase activity;GO:0004697//protein kinase C activity;GO:0004698//calcium-dependent protein kinase C activity;GO:0005515//protein binding;GO:0005524//ATP binding;GO:0015459//potassium channel regulator activity;GO:0016301//kinase activity;GO:0016740//transferase activity;GO:0019901//protein kinase binding;GO:0043274//phospholipase binding;GO:0043560//insulin receptor substrate binding;GO:0044877//protein-containing complex binding;GO:0046872//metal ion binding;GO:0071889//14-3-3 protein bindingGO:0000226//microtubule cytoskeleton organization;GO:0001954//positive regulation of cell-matrix adhesion;GO:0006468//protein phosphorylation;GO:0006954//inflammatory response;GO:0007165//signal transduction;GO:0007166//cell surface receptor signaling pathway;GO:0007179//transforming growth factor beta receptor signaling pathway;GO:0007616//long-term memory;GO:0008284//positive regulation of cell proliferation;GO:0016310//phosphorylation;GO:0016477//cell migration;GO:0018105//peptidyl-serine phosphorylation;GO:0030010//establishment of cell polarity;GO:0031333//negative regulation of protein complex assembly;GO:0031584//activation of phospholipase D activity;GO:0032148//activation of protein kinase B activity;GO:0032733//positive regulation of interleukin-10 production;GO:0032736//positive regulation of interleukin-13 production;GO:0032753//positive regulation of interleukin-4 production;GO:0032754//positive regulation of interleukin-5 production;GO:0032869//cellular response to insulin stimulus;GO:0034613//cellular protein localization;GO:0035556//intracellular signal transduction;GO:0043066//negative regulation of apoptotic process;GO:0045630//positive regulation of T-helper 2 cell differentiation;GO:0046627//negative regulation of insulin receptor signaling pathway;GO:0046628//positive regulation of insulin receptor signaling pathway;GO:0047496//vesicle transport along microtubule;GO:0050732//negative regulation of peptidyl-tyrosine phosphorylation;GO:0050806//positive regulation of synaptic transmission;GO:0051092//positive regulation of NF-kappaB transcription factor activity;GO:0051222//positive regulation of protein transport;GO:0051346//negative regulation of hydrolase activity;GO:0051899//membrane depolarization;GO:0060081//membrane hyperpolarization;GO:0060291//long-term synaptic potentiation;GO:0070374//positive regulation of ERK1 and ERK2 cascade;GO:0070528//protein kinase C signaling;GO:0072659//protein localization to plasma membrane;GO:0098696//regulation of neurotransmitter receptor localization to postsynaptic specialization membrane;GO:1990138//neuron projection extension;GO:2000463//positive regulation of excitatory postsynaptic potential;GO:2000553//positive regulation of T-helper 2 cell cytokine production
ENSG00000131584ACAP30.2049858089965990.1475876863150650.178201699153820.1522621828380520.1637198265948890.1552402700780590.2300119990917010.1775540924480750.2584594062064970.0779019287349160.2207740398984790.09059289892447290.1745147480793550.1134141507983490.06035319207182370.1576489997971852.408482582593070.473316038698781ArfGAP with coiled-coil, ankyrin repeat and PH domains 3 [Source:HGNC Symbol;Acc:HGNC:16754]Cellular ProcessesTransport and catabolismko04144//EndocytosisK12489GO:0030426//growth coneGO:0005096//GTPase activator activity;GO:0005515//protein binding;GO:0046872//metal ion bindingGO:0001764//neuron migration;GO:0010975//regulation of neuron projection development;GO:0043547//positive regulation of GTPase activity
ENSG00000227589TP73-AS300002.43796833371691e-040000000000000TP73 antisense RNA 3 [Source:HGNC Symbol;Acc:HGNC:40590]-------
ENSG00000237402CAMTA1-IT10000000.0017643631439072600000000000CAMTA1 intronic transcript 1 [Source:HGNC Symbol;Acc:HGNC:41446]-------
ENSG00000284616ENSG00000284616000000000000000000novel transcript-------
ENSG00000169972PUSL10.1989487603705280.1394978958158010.1508673653077740.1477964358320760.1806071078300790.1363818212202260.2062772235454610.1529281582829080.1742355323422950.226911717887740.21720432818670.1436193626717730.002259789975119710.1099936310126450.1057432950972180.1568729273671930.1638887523149290.221141088014153pseudouridine synthase like 1 [Source:HGNC Symbol;Acc:HGNC:26914]----GO:0005739//mitochondrion;GO:0043231//intracellular membrane-bounded organelleGO:0003723//RNA binding;GO:0009982//pseudouridine synthase activity;GO:0016853//isomerase activity;GO:0106029//tRNA pseudouridine synthase activityGO:0001522//pseudouridine synthesis;GO:0008033//tRNA processing;GO:0009451//RNA modification;GO:0031119//tRNA pseudouridine synthesis
ENSG00000157911PEX100.06523785818754120.1060277345696190.07637722035838470.1215369642538690.1606033819333820.08076928737705810.2788146130572720.08385214874292160.1039662859292090.1527219560790150.1440714866073120.09711293538748150.09304527290698370.07828567658315940.08522488128984940.14756768331785200.259396643407434peroxisomal biogenesis factor 10 [Source:HGNC Symbol;Acc:HGNC:8851]Cellular ProcessesTransport and catabolismko04146//PeroxisomeK13346GO:0005777//peroxisome;GO:0005778//peroxisomal membrane;GO:0005779//integral component of peroxisomal membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0005515//protein binding;GO:0008022//protein C-terminus binding;GO:0008270//zinc ion binding;GO:0046872//metal ion bindingGO:0007031//peroxisome organization;GO:0008104//protein localization;GO:0016558//protein import into peroxisome matrix;GO:0016567//protein ubiquitination
ENSG00000224051CPTP0.1590713055555890.155104613171260.1741706720225550.1042127103522330.08451822363331960.1545394882507180.206631548807590.1557080047823220.204523014383370.2083360494999190.1850090454656860.1273634512954580.3387563419938760.1483935110541770.07068627058124070.09159413766656720.1293343162741370ceramide-1-phosphate transfer protein [Source:HGNC Symbol;Acc:HGNC:28116]----GO:0005634//nucleus;GO:0005640//nuclear outer membrane;GO:0005737//cytoplasm;GO:0005768//endosome;GO:0005794//Golgi apparatus;GO:0005829//cytosol;GO:0005886//plasma membrane;GO:0010008//endosome membrane;GO:0016020//membraneGO:0005543//phospholipid binding;GO:0008289//lipid binding;GO:0120013//intermembrane lipid transfer activity;GO:1902387//ceramide 1-phosphate binding;GO:1902388//ceramide 1-phosphate transporter activityGO:0006687//glycosphingolipid metabolic process;GO:0006869//lipid transport;GO:0010507//negative regulation of autophagy;GO:0032691//negative regulation of interleukin-1 beta production;GO:0035627//ceramide transport;GO:0120009//intermembrane lipid transfer;GO:1900226//negative regulation of NLRP3 inflammasome complex assembly;GO:1902389//ceramide 1-phosphate transport
ENSG00000228750LINC01672000000000000000000long intergenic non-protein coding RNA 1672 [Source:HGNC Symbol;Acc:HGNC:52460]-------
ENSG00000238260ENSG0000023826000.004531107563237330.006245066397545943.72750451028046e-0400.005687395128467160.0020664880373912400000000.00174364511102835000novel transcript-------
ENSG00000260972ENSG00000260972000000000000000000novel transcript-------

细胞与亚群对照表:2.Cluster/2.cluster/Cells.cluster.list.xls

Tab 4-2-4 细胞与亚群对照表(前20行)
CellsClusterSamples
AdjacNormal_AAACCCAAGAAGATCT1AdjacNormal
AdjacNormal_AAACCCAAGACTCGAG5AdjacNormal
AdjacNormal_AAACCCACACTGGATT5AdjacNormal
AdjacNormal_AAACCCACAGAGGCTA2AdjacNormal
AdjacNormal_AAACCCAGTTTGTTCT14AdjacNormal
AdjacNormal_AAACCCATCGGAAGGT6AdjacNormal
AdjacNormal_AAACGAAAGAGGATCC3AdjacNormal
AdjacNormal_AAACGAAAGCACTCCG0AdjacNormal
AdjacNormal_AAACGAACAGTAGATA1AdjacNormal
AdjacNormal_AAACGAAGTAATGCGG4AdjacNormal
AdjacNormal_AAACGAAGTATCCTTT0AdjacNormal
AdjacNormal_AAACGAAGTGGCCACT4AdjacNormal
AdjacNormal_AAACGAAGTTGGCTAT0AdjacNormal
AdjacNormal_AAACGAAGTTTAGTCG5AdjacNormal
AdjacNormal_AAACGAATCGCATAGT0AdjacNormal
AdjacNormal_AAACGCTAGAATCTAG7AdjacNormal
AdjacNormal_AAACGCTAGAGATCGC6AdjacNormal
AdjacNormal_AAACGCTAGATGCGAC2AdjacNormal
AdjacNormal_AAACGCTAGGAACATT0AdjacNormal
AdjacNormal_AAACGCTAGGTTACCT5AdjacNormal

4.3 分类结果可视化

基于细胞亚群分类的结果,进一步利用tSNE(tSNE,t-Distributed Stochastic Neighbor Embedding)非线性聚类的方法对单细胞亚群分类结果进行可视化[4]。tSNE 的方法通常对不同亚群细胞的分类结果有更佳的呈现效果(亚群间的隔离更加清晰)。

对所有样本的亚群分类可视化,结果如下:

Fig 4-3-1 单细胞亚群分类tSNE图

分别对各个样本的亚群分类可视化,结果如下:

  • 样本AdjacNormal单细胞亚群分类tSNE图
  • 样本FBBST单细胞亚群分类tSNE图

Fig 4-3-2 各样本单细胞亚群分类tSNE图


4.4 单细胞亚群鉴定

以上单细胞亚群分类是基于细胞表达特征的相似性进行聚类的,每个亚群不具有生物学意义。所以,细胞鉴定一直是很重要但又比较繁琐的步骤。这里,我们使用singleR[5]对所有细胞进行自动化注释,为后续的人工细胞鉴定工作提供参考。

singleR是通过细胞与参考数据库中细胞类型的相似度来自动化鉴定细胞类型,对于相似度较高的细胞类型的注释准确性会降低。所以,singleR的注释结果只能作为辅助手段,最终的细胞亚群鉴定结果依然需要人工鉴定的确认。

Tab 4-4-1 各样本在各个细胞类型中细胞数量统计表
ClusterAdjacNormalFBBST
Total7584 (100%)10473 (100%)
B_cell51 (0.67%)80 (0.76%)
Chondrocytes120 (1.58%)31 (0.3%)
CMP10 (0.13%)3 (0.03%)
DC23 (0.3%)182 (1.74%)
Endothelial_cells13 (0.17%)5 (0.05%)
Fibroblasts35 (0.46%)44 (0.42%)
GMP1 (0.01%)2 (0.02%)
HSC_-G-CSF0 (0%)4 (0.04%)
iPS_cells1 (0.01%)0 (0%)
Macrophage1600 (21.1%)1123 (10.72%)
Monocyte500 (6.59%)1003 (9.58%)
MSC82 (1.08%)51 (0.49%)
Neurons3 (0.04%)29 (0.28%)
Neutrophils1 (0.01%)6 (0.06%)
NK_cell164 (2.16%)217 (2.07%)
Osteoblasts3 (0.04%)13 (0.12%)
Platelets1 (0.01%)1 (0.01%)
Pre-B_cell_CD34-5 (0.07%)29 (0.28%)
Pro-B_cell_CD34+1 (0.01%)1 (0.01%)
Smooth_muscle_cells59 (0.78%)56 (0.53%)
T_cells4156 (54.8%)7360 (70.28%)
Tissue_stem_cells755 (9.96%)233 (2.22%)

Fig 4-4-1 各样本中各细胞类型细胞数量堆叠图 Fig 4-4-2 各样本中各细胞类型细胞数量百分比堆叠图

Fig 4-4-3 各细胞类型中各个样本细胞数量堆叠图 Fig 4-4-4 各细胞类型中各个样本细胞数量百分比堆叠图

Fig 4-4-5 各细胞类型在tSNE图的分布

各细胞亚群中各个细胞类型数量统计表:4.CellAnnotation/Cell.annotation.stat.xls

Tab 4-4-2 各细胞亚群中各个细胞类型数量统计表
ClusterCell.annotation (Maximum proportion)B_cellCMPChondrocytesDCEndothelial_cellsFibroblastsGMPHSC_-G-CSFMSCMacrophageMonocyteNK_cellNeuronsNeutrophilsOsteoblastsPlateletsPre-B_cell_CD34-Pro-B_cell_CD34+Smooth_muscle_cellsT_cellsTissue_stem_cellsiPS_cells
0T_cells(99.79%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)6(0.21%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)2889(99.79%)0(0.00%)0(0.00%)
1T_cells(99.81%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)5(0.19%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)2676(99.81%)0(0.00%)0(0.00%)
2T_cells(98.63%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1(0.05%)1(0.05%)23(1.21%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1(0.05%)0(0.00%)0(0.00%)1878(98.63%)0(0.00%)0(0.00%)
3Macrophage(85.90%)0(0.00%)0(0.00%)0(0.00%)9(0.48%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1615(85.90%)256(13.62%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
4Monocyte(64.98%)0(0.00%)0(0.00%)0(0.00%)30(1.77%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)554(32.67%)1102(64.98%)1(0.06%)0(0.00%)5(0.29%)0(0.00%)0(0.00%)2(0.12%)0(0.00%)0(0.00%)2(0.12%)0(0.00%)0(0.00%)
5T_cells(99.36%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)10(0.64%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1545(99.36%)0(0.00%)0(0.00%)
6Tissue_stem_cells(65.92%)0(0.00%)0(0.00%)145(10.17%)0(0.00%)6(0.42%)66(4.63%)0(0.00%)0(0.00%)111(7.78%)5(0.35%)2(0.14%)5(0.35%)3(0.21%)0(0.00%)6(0.42%)2(0.14%)0(0.00%)0(0.00%)103(7.22%)31(2.17%)940(65.92%)1(0.07%)
7T_cells(76.20%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)297(23.80%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)951(76.20%)0(0.00%)0(0.00%)
8T_cells(98.32%)1(0.13%)0(0.00%)0(0.00%)1(0.13%)0(0.00%)1(0.13%)3(0.39%)0(0.00%)0(0.00%)1(0.13%)0(0.00%)5(0.65%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1(0.13%)0(0.00%)0(0.00%)761(98.32%)0(0.00%)0(0.00%)
9DC(27.33%)0(0.00%)0(0.00%)6(1.16%)141(27.33%)12(2.33%)12(2.33%)0(0.00%)2(0.39%)22(4.26%)108(20.93%)55(10.66%)4(0.78%)29(5.62%)1(0.19%)10(1.94%)0(0.00%)9(1.74%)0(0.00%)12(2.33%)45(8.72%)48(9.30%)0(0.00%)
10Macrophage(83.19%)0(0.00%)0(0.00%)0(0.00%)2(0.59%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)282(83.19%)45(13.27%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)10(2.95%)0(0.00%)0(0.00%)
11T_cells(99.70%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1(0.30%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)336(99.70%)0(0.00%)0(0.00%)
12Macrophage(50.61%)0(0.00%)0(0.00%)0(0.00%)18(7.35%)0(0.00%)0(0.00%)0(0.00%)2(0.82%)0(0.00%)124(50.61%)35(14.29%)10(4.08%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)15(6.12%)0(0.00%)0(0.00%)41(16.73%)0(0.00%)0(0.00%)
13T_cells(96.09%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)2(0.87%)1(0.43%)5(2.17%)0(0.00%)1(0.43%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)221(96.09%)0(0.00%)0(0.00%)
14B_cell(85.53%)130(85.53%)0(0.00%)0(0.00%)2(1.32%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)3(1.97%)1(0.66%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)6(3.95%)2(1.32%)0(0.00%)8(5.26%)0(0.00%)0(0.00%)
15T_cells(96.06%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)5(3.94%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)122(96.06%)0(0.00%)0(0.00%)
16Macrophage(90.91%)0(0.00%)0(0.00%)0(0.00%)2(6.06%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)30(90.91%)0(0.00%)1(3.03%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
17CMP(68.42%)0(0.00%)13(68.42%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)1(5.26%)3(15.79%)2(10.53%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)

Fig 4-4-6 Seurat分群与singleR细胞鉴定对应circos图

通过singleR,我们可以鉴定细胞的细胞类型;通过Seurat,我们可以得到细胞的聚类信息。通过这两个软件,我们可以将一组细胞按照细胞类型和细胞分群两种方式进行聚类。然后,我们计算各个细胞亚群的细胞与各个细胞类型的细胞之间的相关性,并绘制成热图,作为singleR细胞鉴定结果准确性的一个佐证。

Fig 4-4-7 Seurat分群与singleR鉴定细胞类型相关性热图

广州基迪奥生物科技有限公司




5 亚群上调表达基因分析

5.1 上调表达基因分析

为了了解各个细胞亚群的分子表达特征,我们可以筛选各个细胞亚群上调表达的基因。

采用Seurat的秩和检验分别对不同类细胞群进行基因差异表达分析,筛选亚群上调表达的基因。

上调基因的筛选条件为

  1. 目标亚群或对照亚群中,基因在25%以上的细胞中有表达。
  2. P值 ≤0.01;
  3. 基因表达倍数log2FC≥0.36,即基因上调的倍数≥1.28。

Tab 5-1-1 各亚群上调基因数量统计表
Cluster01234567891011121314151617
Number of DE genes347711402152011915812583281163663421003664502692571201373225

Fig 5-1-1 各亚群上调基因数量统计柱状图

各亚群差异基因注释表: 3.MarkerGene/DeGene.list.xls

Tab 5-1-2 各亚群差异基因注释表(前20行)
Target_ClusterGene IDGene NameTarget_Cluster_meanOther_Cluster_meanLog2FCPvalueQvalueDescriptionKEGG_A_classKEGG_B_classPathwayK_IDGO ComponentGO FunctionGO Process
0ENSG00000102245CD40LG1.913007979255810.06004180686681884.9937316877034200CD40 ligand [Source:HGNC Symbol;Acc:HGNC:11935]Environmental Information Processing;Human Diseases;Environmental Information Processing;Environmental Information Processing;Human Diseases;Human Diseases;Organismal Systems;Human Diseases;Human Diseases;Organismal Systems;Human Diseases;Human Diseases;Human DiseasesSignaling molecules and interaction;Immune diseases;Signal transduction;Signaling molecules and interaction;Cardiovascular diseases;Immune diseases;Immune system;Immune diseases;Infectious diseases;Immune system;Immune diseases;Immune diseases;Infectious diseasesko04060//Cytokine-cytokine receptor interaction;ko05322//Systemic lupus erythematosus;ko04064//NF-kappa B signaling pathway;ko04514//Cell adhesion molecules (CAMs);ko05416//Viral myocarditis;ko05320//Autoimmune thyroid disease;ko04672//Intestinal immune network for IgA production;ko05340//Primary immunodeficiency;ko05145//Toxoplasmosis;ko04660//T cell receptor signaling pathway;ko05330//Allograft rejection;ko05310//Asthma;ko05144//MalariaK03161;K03161;K03161;K03161;K03161;K03161;K03161;K03161;K03161;K03161;K03161;K03161;K03161GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005886//plasma membrane;GO:0005887//integral component of plasma membrane;GO:0009897//external side of plasma membrane;GO:0009986//cell surface;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0005125//cytokine activity;GO:0005164//tumor necrosis factor receptor binding;GO:0005174//CD40 receptor binding;GO:0005178//integrin binding;GO:0005515//protein bindingGO:0002637//regulation of immunoglobulin production;GO:0006954//inflammatory response;GO:0006955//immune response;GO:0007159//leukocyte cell-cell adhesion;GO:0007165//signal transduction;GO:0007229//integrin-mediated signaling pathway;GO:0007257//activation of JUN kinase activity;GO:0023035//CD40 signaling pathway;GO:0030168//platelet activation;GO:0030183//B cell differentiation;GO:0031295//T cell costimulation;GO:0032733//positive regulation of interleukin-10 production;GO:0032735//positive regulation of interleukin-12 production;GO:0032753//positive regulation of interleukin-4 production;GO:0033209//tumor necrosis factor-mediated signaling pathway;GO:0042100//B cell proliferation;GO:0042102//positive regulation of T cell proliferation;GO:0043066//negative regulation of apoptotic process;GO:0045190//isotype switching;GO:0050776//regulation of immune response;GO:0051092//positive regulation of NF-kappaB transcription factor activity;GO:2000353//positive regulation of endothelial cell apoptotic process
0ENSG00000168685IL7R19.85260247497831.71469592469543.5333034783457800interleukin 7 receptor [Source:HGNC Symbol;Acc:HGNC:6024]Human Diseases;Environmental Information Processing;Environmental Information Processing;Organismal Systems;Environmental Information Processing;Environmental Information Processing;Human DiseasesCancers;Signal transduction;Signaling molecules and interaction;Immune system;Signal transduction;Signal transduction;Immune diseasesko05200//Pathways in cancer;ko04151//PI3K-Akt signaling pathway;ko04060//Cytokine-cytokine receptor interaction;ko04640//Hematopoietic cell lineage;ko04630//Jak-STAT signaling pathway;ko04068//FoxO signaling pathway;ko05340//Primary immunodeficiencyK05072;K05072;K05072;K05072;K05072;K05072;K05072GO:0005576//extracellular region;GO:0005654//nucleoplasm;GO:0005829//cytosol;GO:0005886//plasma membrane;GO:0009897//external side of plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0030669//clathrin-coated endocytic vesicle membraneGO:0003823//antigen binding;GO:0004896//cytokine receptor activity;GO:0004917//interleukin-7 receptor activity;GO:0005515//protein bindingGO:0000018//regulation of DNA recombination;GO:0000902//cell morphogenesis;GO:0001915//negative regulation of T cell mediated cytotoxicity;GO:0006955//immune response;GO:0007165//signal transduction;GO:0007166//cell surface receptor signaling pathway;GO:0008284//positive regulation of cell proliferation;GO:0008361//regulation of cell size;GO:0010628//positive regulation of gene expression;GO:0030217//T cell differentiation;GO:0033089//positive regulation of T cell differentiation in thymus;GO:0038111//interleukin-7-mediated signaling pathway;GO:0042100//B cell proliferation;GO:0048535//lymph node development;GO:0048872//homeostasis of number of cells;GO:0050830//defense response to Gram-positive bacterium;GO:0061024//membrane organization;GO:0070233//negative regulation of T cell apoptotic process;GO:1904894//positive regulation of STAT cascade
0ENSG00000165272AQP32.371454840116340.2113857283886763.487822513052100aquaporin 3 (Gill blood group) [Source:HGNC Symbol;Acc:HGNC:636]Organismal SystemsExcretory systemko04962//Vasopressin-regulated water reabsorptionK09876GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0005886//plasma membrane;GO:0005911//cell-cell junction;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0016323//basolateral plasma membraneGO:0005515//protein binding;GO:0015204//urea transmembrane transporter activity;GO:0015250//water channel activity;GO:0015254//glycerol channel activity;GO:0015267//channel activity;GO:0042802//identical protein bindingGO:0002684//positive regulation of immune system process;GO:0003091//renal water homeostasis;GO:0006833//water transport;GO:0015793//glycerol transport;GO:0015840//urea transport;GO:0032526//response to retinoic acid;GO:0033280//response to vitamin D;GO:0042476//odontogenesis;GO:0045616//regulation of keratinocyte differentiation;GO:0051592//response to calcium ion;GO:0055085//transmembrane transport;GO:0070295//renal water absorption;GO:0071456//cellular response to hypoxia;GO:0071918//urea transmembrane transport;GO:0090650//cellular response to oxygen-glucose deprivation
0ENSG00000111796KLRB17.182932910329920.7267486491494553.3050446507422900killer cell lectin like receptor B1 [Source:HGNC Symbol;Acc:HGNC:6373]Human DiseasesInfectious diseasesko05144//MalariaK06543GO:0005886//plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0004888//transmembrane signaling receptor activity;GO:0005515//protein binding;GO:0030246//carbohydrate bindingGO:0007166//cell surface receptor signaling pathway;GO:0050776//regulation of immune response
0ENSG00000215788TNFRSF252.033102662965680.3031316073782512.7456668727798500TNF receptor superfamily member 25 [Source:HGNC Symbol;Acc:HGNC:11910]Environmental Information ProcessingSignaling molecules and interactionko04060//Cytokine-cytokine receptor interactionK05160GO:0005576//extracellular region;GO:0005829//cytosol;GO:0005886//plasma membrane;GO:0005887//integral component of plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0005031//tumor necrosis factor-activated receptor activity;GO:0005515//protein binding;GO:0038023//signaling receptor activityGO:0006915//apoptotic process;GO:0007165//signal transduction;GO:0007166//cell surface receptor signaling pathway;GO:0033209//tumor necrosis factor-mediated signaling pathway;GO:0042981//regulation of apoptotic process;GO:0097190//apoptotic signaling pathway
0ENSG00000069667RORA2.569419208293410.3913329886918892.71497365221100RAR related orphan receptor A [Source:HGNC Symbol;Acc:HGNC:10258]Organismal Systems;Human Diseases;Human Diseases;Organismal SystemsImmune system;Neurodegenerative disease;Immune diseases;Environmental adaptationko04659//Th17 cell differentiation;ko05017//Spinocerebellar ataxia;ko05321//Inflammatiory bowel disease (IBD);ko04710//Circadian rhythmK08532;K08532;K08532;K08532GO:0000785//chromatin;GO:0005634//nucleus;GO:0005654//nucleoplasmGO:0000977//RNA polymerase II regulatory region sequence-specific DNA binding;GO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001222//transcription corepressor binding;GO:0001223//transcription coactivator binding;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0004879//nuclear receptor activity;GO:0005515//protein binding;GO:0008013//beta-catenin binding;GO:0008134//transcription factor binding;GO:0008142//oxysterol binding;GO:0008270//zinc ion binding;GO:0043565//sequence-specific DNA binding;GO:0046872//metal ion binding;GO:0098531//transcription factor activity, direct ligand regulated sequence-specific DNA bindingGO:0001525//angiogenesis;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0006367//transcription initiation from RNA polymerase II promoter;GO:0006805//xenobiotic metabolic process;GO:0006809//nitric oxide biosynthetic process;GO:0007275//multicellular organism development;GO:0007623//circadian rhythm;GO:0008589//regulation of smoothened signaling pathway;GO:0010575//positive regulation of vascular endothelial growth factor production;GO:0010906//regulation of glucose metabolic process;GO:0019218//regulation of steroid metabolic process;GO:0019221//cytokine-mediated signaling pathway;GO:0021702//cerebellar Purkinje cell differentiation;GO:0021930//cerebellar granule cell precursor proliferation;GO:0030522//intracellular receptor signaling pathway;GO:0032922//circadian regulation of gene expression;GO:0036315//cellular response to sterol;GO:0042632//cholesterol homeostasis;GO:0042692//muscle cell differentiation;GO:0042752//regulation of circadian rhythm;GO:0042753//positive regulation of circadian rhythm;GO:0043030//regulation of macrophage activation;GO:0043124//negative regulation of I-kappaB kinase/NF-kappaB signaling;GO:0045599//negative regulation of fat cell differentiation;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0046068//cGMP metabolic process;GO:0048511//rhythmic process;GO:0050728//negative regulation of inflammatory response;GO:0070328//triglyceride homeostasis;GO:0071347//cellular response to interleukin-1;GO:0071356//cellular response to tumor necrosis factor;GO:0071456//cellular response to hypoxia;GO:0072539//T-helper 17 cell differentiation
0ENSG00000117090SLAMF11.544114841172130.2655783214539172.5395707620927600signaling lymphocytic activation molecule family member 1 [Source:HGNC Symbol;Acc:HGNC:10903]Human DiseasesInfectious diseasesko05162//MeaslesK06536GO:0005576//extracellular region;GO:0005886//plasma membrane;GO:0009897//external side of plasma membrane;GO:0009986//cell surface;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0045335//phagocytic vesicle;GO:0070062//extracellular exosomeGO:0001618//virus receptor activity;GO:0003823//antigen binding;GO:0004888//transmembrane signaling receptor activity;GO:0005515//protein binding;GO:0038023//signaling receptor activity;GO:0042169//SH2 domain binding;GO:0042802//identical protein bindingGO:0001779//natural killer cell differentiation;GO:0001787//natural killer cell proliferation;GO:0002232//leukocyte chemotaxis involved in inflammatory response;GO:0002250//adaptive immune response;GO:0002277//myeloid dendritic cell activation involved in immune response;GO:0002376//immune system process;GO:0002725//negative regulation of T cell cytokine production;GO:0006909//phagocytosis;GO:0007155//cell adhesion;GO:0008284//positive regulation of cell proliferation;GO:0010759//positive regulation of macrophage chemotaxis;GO:0016032//viral process;GO:0031338//regulation of vesicle fusion;GO:0032689//negative regulation of interferon-gamma production;GO:0032695//negative regulation of interleukin-12 production;GO:0032715//negative regulation of interleukin-6 production;GO:0032720//negative regulation of tumor necrosis factor production;GO:0032729//positive regulation of interferon-gamma production;GO:0042104//positive regulation of activated T cell proliferation;GO:0045087//innate immune response;GO:0046330//positive regulation of JNK cascade;GO:0046649//lymphocyte activation;GO:0046718//viral entry into host cell;GO:0050790//regulation of catalytic activity;GO:0070374//positive regulation of ERK1 and ERK2 cascade;GO:2000349//negative regulation of CD40 signaling pathway;GO:2000510//positive regulation of dendritic cell chemotaxis;GO:2000556//positive regulation of T-helper 1 cell cytokine production
0ENSG00000227507LTB9.970398807104322.66889628266891.901407971032900lymphotoxin beta [Source:HGNC Symbol;Acc:HGNC:6711]Environmental Information Processing;Environmental Information Processing;Human DiseasesSignaling molecules and interaction;Signal transduction;Immune diseasesko04060//Cytokine-cytokine receptor interaction;ko04064//NF-kappa B signaling pathway;ko05323//Rheumatoid arthritisK03157;K03157;K03157GO:0005575//cellular_component;GO:0005615//extracellular space;GO:0005886//plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0005102//signaling receptor binding;GO:0005125//cytokine activity;GO:0005164//tumor necrosis factor receptor binding;GO:0005515//protein bindingGO:0006955//immune response;GO:0007165//signal transduction;GO:0007267//cell-cell signaling;GO:0010467//gene expression;GO:0032735//positive regulation of interleukin-12 production;GO:0033209//tumor necrosis factor-mediated signaling pathway;GO:0043588//skin development;GO:0048535//lymph node development
0ENSG00000107742SPOCK23.804415421975511.216122263144121.6453865116770200SPARC (osteonectin), cwcv and kazal like domains proteoglycan 2 [Source:HGNC Symbol;Acc:HGNC:13564]----GO:0005576//extracellular region;GO:0031012//extracellular matrixGO:0005509//calcium ion binding;GO:0005515//protein binding;GO:0005539//glycosaminoglycan binding;GO:0008191//metalloendopeptidase inhibitor activity;GO:0050840//extracellular matrix bindingGO:0007416//synapse assembly;GO:0010811//positive regulation of cell-substrate adhesion;GO:0010951//negative regulation of endopeptidase activity;GO:0019800//peptide cross-linking via chondroitin 4-sulfate glycosaminoglycan;GO:0030198//extracellular matrix organization;GO:0045595//regulation of cell differentiation;GO:1990830//cellular response to leukemia inhibitory factor;GO:2000147//positive regulation of cell motility
0ENSG00000115165CYTIP4.092081454345751.479144763770711.468071606943800cytohesin 1 interacting protein [Source:HGNC Symbol;Acc:HGNC:9506]----GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0005768//endosome;GO:0005769//early endosome;GO:0005829//cytosol;GO:0005938//cell cortexGO:0005515//protein bindingGO:0030155//regulation of cell adhesion
0ENSG00000185201IFITM25.702817115921522.234982581791711.3514111796602700interferon induced transmembrane protein 2 [Source:HGNC Symbol;Acc:HGNC:5413]----GO:0005765//lysosomal membrane;GO:0005886//plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0031902//late endosome membrane;GO:0032991//protein-containing complex-GO:0002376//immune system process;GO:0006955//immune response;GO:0009615//response to virus;GO:0034341//response to interferon-gamma;GO:0035455//response to interferon-alpha;GO:0035456//response to interferon-beta;GO:0035458//cellular response to interferon-beta;GO:0045071//negative regulation of viral genome replication;GO:0045087//innate immune response;GO:0046597//negative regulation of viral entry into host cell;GO:0051607//defense response to virus;GO:0060337//type I interferon signaling pathway
0ENSG00000152518ZFP36L211.18336003628294.406201519260071.3437463266353200ZFP36 ring finger protein like 2 [Source:HGNC Symbol;Acc:HGNC:1108]Cellular ProcessesCell growth and deathko04218//Cellular senescenceK18753GO:0005634//nucleus;GO:0005737//cytoplasmGO:0003723//RNA binding;GO:0005515//protein binding;GO:0035925//mRNA 3'-UTR AU-rich region binding;GO:0046872//metal ion bindingGO:0000165//MAPK cascade;GO:0000288//nuclear-transcribed mRNA catabolic process, deadenylation-dependent decay;GO:0006402//mRNA catabolic process;GO:0007275//multicellular organism development;GO:0009611//response to wounding;GO:0030097//hemopoiesis;GO:0033077//T cell differentiation in thymus;GO:0035019//somatic stem cell population maintenance;GO:0043488//regulation of mRNA stability;GO:0044344//cellular response to fibroblast growth factor stimulus;GO:0045577//regulation of B cell differentiation;GO:0045599//negative regulation of fat cell differentiation;GO:0048103//somatic stem cell division;GO:0060216//definitive hemopoiesis;GO:0061158//3'-UTR-mediated mRNA destabilization;GO:0070371//ERK1 and ERK2 cascade;GO:0071356//cellular response to tumor necrosis factor;GO:0071364//cellular response to epidermal growth factor stimulus;GO:0071385//cellular response to glucocorticoid stimulus;GO:0071560//cellular response to transforming growth factor beta stimulus;GO:0097011//cellular response to granulocyte macrophage colony-stimulating factor stimulus;GO:1900153//positive regulation of nuclear-transcribed mRNA catabolic process, deadenylation-dependent decay;GO:1901991//negative regulation of mitotic cell cycle phase transition;GO:2000737//negative regulation of stem cell differentiation
0ENSG00000135046ANXA18.557414435931193.649742064121391.2293804531208900annexin A1 [Source:HGNC Symbol;Acc:HGNC:533]----GO:0001533//cornified envelope;GO:0001891//phagocytic cup;GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0005768//endosome;GO:0005769//early endosome;GO:0005829//cytosol;GO:0005884//actin filament;GO:0005886//plasma membrane;GO:0005912//adherens junction;GO:0005925//focal adhesion;GO:0005929//cilium;GO:0009986//cell surface;GO:0010008//endosome membrane;GO:0016020//membrane;GO:0016323//basolateral plasma membrane;GO:0016324//apical plasma membrane;GO:0016328//lateral plasma membrane;GO:0019898//extrinsic component of membrane;GO:0030659//cytoplasmic vesicle membrane;GO:0031232//extrinsic component of external side of plasma membrane;GO:0031313//extrinsic component of endosome membrane;GO:0031410//cytoplasmic vesicle;GO:0031514//motile cilium;GO:0031901//early endosome membrane;GO:0031966//mitochondrial membrane;GO:0031982//vesicle;GO:0032991//protein-containing complex;GO:0042383//sarcolemma;GO:0042629//mast cell granule;GO:0042995//cell projection;GO:0062023//collagen-containing extracellular matrix;GO:0070062//extracellular exosome;GO:0097060//synaptic membraneGO:0003697//single-stranded DNA binding;GO:0003727//single-stranded RNA binding;GO:0004859//phospholipase inhibitor activity;GO:0005102//signaling receptor binding;GO:0005509//calcium ion binding;GO:0005515//protein binding;GO:0005543//phospholipid binding;GO:0005544//calcium-dependent phospholipid binding;GO:0019834//phospholipase A2 inhibitor activity;GO:0036121//double-stranded DNA-dependent ATP-dependent DNA helicase activity;GO:0042802//identical protein binding;GO:0046872//metal ion binding;GO:0048306//calcium-dependent protein binding;GO:0098641//cadherin binding involved in cell-cell adhesion;GO:1990814//DNA/DNA annealing activityGO:0001780//neutrophil homeostasis;GO:0002250//adaptive immune response;GO:0002376//immune system process;GO:0002548//monocyte chemotaxis;GO:0002685//regulation of leukocyte migration;GO:0006909//phagocytosis;GO:0006954//inflammatory response;GO:0007165//signal transduction;GO:0007166//cell surface receptor signaling pathway;GO:0007186//G protein-coupled receptor signaling pathway;GO:0007187//G protein-coupled receptor signaling pathway, coupled to cyclic nucleotide second messenger;GO:0008360//regulation of cell shape;GO:0009725//response to hormone;GO:0010165//response to X-ray;GO:0014070//response to organic cyclic compound;GO:0014839//myoblast migration involved in skeletal muscle regeneration;GO:0018149//peptide cross-linking;GO:0019221//cytokine-mediated signaling pathway;GO:0030073//insulin secretion;GO:0030216//keratinocyte differentiation;GO:0030850//prostate gland development;GO:0031018//endocrine pancreas development;GO:0031340//positive regulation of vesicle fusion;GO:0031394//positive regulation of prostaglandin biosynthetic process;GO:0031532//actin cytoskeleton reorganization;GO:0031960//response to corticosteroid;GO:0032355//response to estradiol;GO:0032508//DNA duplex unwinding;GO:0032652//regulation of interleukin-1 production;GO:0032717//negative regulation of interleukin-8 production;GO:0032743//positive regulation of interleukin-2 production;GO:0033031//positive regulation of neutrophil apoptotic process;GO:0035924//cellular response to vascular endothelial growth factor stimulus;GO:0042063//gliogenesis;GO:0042102//positive regulation of T cell proliferation;GO:0042127//regulation of cell proliferation;GO:0042493//response to drug;GO:0043065//positive regulation of apoptotic process;GO:0043066//negative regulation of apoptotic process;GO:0043086//negative regulation of catalytic activity;GO:0043434//response to peptide hormone;GO:0044849//estrous cycle;GO:0045087//innate immune response;GO:0045627//positive regulation of T-helper 1 cell differentiation;GO:0045629//negative regulation of T-helper 2 cell differentiation;GO:0045920//negative regulation of exocytosis;GO:0046632//alpha-beta T cell differentiation;GO:0046883//regulation of hormone secretion;GO:0050482//arachidonic acid secretion;GO:0050709//negative regulation of protein secretion;GO:0050727//regulation of inflammatory response;GO:0051384//response to glucocorticoid;GO:0070301//cellular response to hydrogen peroxide;GO:0070365//hepatocyte differentiation;GO:0070459//prolactin secretion;GO:0070555//response to interleukin-1;GO:0071385//cellular response to glucocorticoid stimulus;GO:0071621//granulocyte chemotaxis;GO:0090050//positive regulation of cell migration involved in sprouting angiogenesis;GO:0090303//positive regulation of wound healing;GO:0097350//neutrophil clearance;GO:0098609//cell-cell adhesion;GO:1900087//positive regulation of G1/S transition of mitotic cell cycle;GO:1900138//negative regulation of phospholipase A2 activity
0ENSG00000133112TPT176.018073287977836.26041218261451.067947135205100tumor protein, translationally-controlled 1 [Source:HGNC Symbol;Acc:HGNC:12022]----GO:0000922//spindle pole;GO:0005615//extracellular space;GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0005771//multivesicular body;GO:0005829//cytosol;GO:0005881//cytoplasmic microtubule;GO:0070062//extracellular exosomeGO:0003723//RNA binding;GO:0005509//calcium ion binding;GO:0005515//protein binding;GO:0008134//transcription factor bindingGO:0006816//calcium ion transport;GO:0006874//cellular calcium ion homeostasis;GO:0009615//response to virus;GO:0019827//stem cell population maintenance;GO:0042981//regulation of apoptotic process;GO:0043066//negative regulation of apoptotic process;GO:1902230//negative regulation of intrinsic apoptotic signaling pathway in response to DNA damage;GO:2000384//negative regulation of ectoderm development
0ENSG00000133639BTG122.332072839782211.0752264612921.011780968162400BTG anti-proliferation factor 1 [Source:HGNC Symbol;Acc:HGNC:1130]Genetic Information ProcessingFolding, sorting and degradationko03018//RNA degradationK14443GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005737//cytoplasmGO:0003712//transcription coregulator activity;GO:0005515//protein binding;GO:0019899//enzyme binding;GO:0019900//kinase bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006479//protein methylation;GO:0006979//response to oxidative stress;GO:0007283//spermatogenesis;GO:0008285//negative regulation of cell proliferation;GO:0016477//cell migration;GO:0030308//negative regulation of cell growth;GO:0043434//response to peptide hormone;GO:0045603//positive regulation of endothelial cell differentiation;GO:0045663//positive regulation of myoblast differentiation;GO:0045766//positive regulation of angiogenesis;GO:0045930//negative regulation of mitotic cell cycle;GO:2000271//positive regulation of fibroblast apoptotic process
0ENSG00000109475RPL3434.115705729268417.24148613970720.98455192698873100ribosomal protein L34 [Source:HGNC Symbol;Acc:HGNC:10340]Genetic Information ProcessingTranslationko03010//RibosomeK02915GO:0005737//cytoplasm;GO:0005783//endoplasmic reticulum;GO:0005829//cytosol;GO:0005840//ribosome;GO:0022625//cytosolic large ribosomal subunit;GO:0022626//cytosolic ribosome;GO:0070062//extracellular exosomeGO:0003723//RNA binding;GO:0003735//structural constituent of ribosome;GO:0005515//protein binding;GO:0045296//cadherin bindingGO:0000184//nuclear-transcribed mRNA catabolic process, nonsense-mediated decay;GO:0002181//cytoplasmic translation;GO:0006364//rRNA processing;GO:0006412//translation;GO:0006413//translational initiation;GO:0006614//SRP-dependent cotranslational protein targeting to membrane;GO:0019083//viral transcription
0ENSG00000114942EEF1B218.46082889633249.333045683841930.98404746934845100eukaryotic translation elongation factor 1 beta 2 [Source:HGNC Symbol;Acc:HGNC:3208]----GO:0005737//cytoplasm;GO:0005783//endoplasmic reticulum;GO:0005829//cytosol;GO:0005853//eukaryotic translation elongation factor 1 complexGO:0003746//translation elongation factor activity;GO:0005085//guanyl-nucleotide exchange factor activity;GO:0005515//protein bindingGO:0006412//translation;GO:0006414//translational elongation;GO:0045471//response to ethanol;GO:0050790//regulation of catalytic activity
0ENSG00000112306RPS1275.306667932775938.9084347135060.95269467029297900ribosomal protein S12 [Source:HGNC Symbol;Acc:HGNC:10385]Genetic Information ProcessingTranslationko03010//RibosomeK02951GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0005794//Golgi apparatus;GO:0005829//cytosol;GO:0005840//ribosome;GO:0016020//membrane;GO:0022626//cytosolic ribosome;GO:0022627//cytosolic small ribosomal subunit;GO:0043231//intracellular membrane-bounded organelleGO:0003723//RNA binding;GO:0003735//structural constituent of ribosome;GO:0005515//protein bindingGO:0000184//nuclear-transcribed mRNA catabolic process, nonsense-mediated decay;GO:0002181//cytoplasmic translation;GO:0006412//translation;GO:0006413//translational initiation;GO:0006614//SRP-dependent cotranslational protein targeting to membrane;GO:0019083//viral transcription
0ENSG00000149273RPS371.884673705747138.85257178749730.88767411237762700ribosomal protein S3 [Source:HGNC Symbol;Acc:HGNC:10420]Human Diseases;Human Diseases;Genetic Information ProcessingInfectious diseases;Infectious diseases;Translationko05130//Pathogenic Escherichia coli infection;ko05132//Salmonella infection;ko03010//RibosomeK02985;K02985;K02985GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005730//nucleolus;GO:0005737//cytoplasm;GO:0005739//mitochondrion;GO:0005743//mitochondrial inner membrane;GO:0005759//mitochondrial matrix;GO:0005783//endoplasmic reticulum;GO:0005819//spindle;GO:0005829//cytosol;GO:0005840//ribosome;GO:0005844//polysome;GO:0005856//cytoskeleton;GO:0005886//plasma membrane;GO:0005925//focal adhesion;GO:0014069//postsynaptic density;GO:0015935//small ribosomal subunit;GO:0016020//membrane;GO:0022626//cytosolic ribosome;GO:0022627//cytosolic small ribosomal subunit;GO:0032587//ruffle membrane;GO:0070062//extracellular exosome;GO:0071159//NF-kappaB complex;GO:0072686//mitotic spindle;GO:1990904//ribonucleoprotein complexGO:0000977//RNA polymerase II regulatory region sequence-specific DNA binding;GO:0003677//DNA binding;GO:0003684//damaged DNA binding;GO:0003723//RNA binding;GO:0003729//mRNA binding;GO:0003735//structural constituent of ribosome;GO:0003906//DNA-(apurinic or apyrimidinic site) endonuclease activity;GO:0004520//endodeoxyribonuclease activity;GO:0005515//protein binding;GO:0008017//microtubule binding;GO:0008134//transcription factor binding;GO:0015631//tubulin binding;GO:0016829//lyase activity;GO:0019899//enzyme binding;GO:0019900//kinase binding;GO:0019901//protein kinase binding;GO:0030544//Hsp70 protein binding;GO:0032357//oxidized purine DNA binding;GO:0032358//oxidized pyrimidine DNA binding;GO:0044390//ubiquitin-like protein conjugating enzyme binding;GO:0044877//protein-containing complex binding;GO:0051018//protein kinase A binding;GO:0051536//iron-sulfur cluster binding;GO:0051879//Hsp90 protein binding;GO:0070181//small ribosomal subunit rRNA binding;GO:0097100//supercoiled DNA binding;GO:0140078//class I DNA-(apurinic or apyrimidinic site) endonuclease activityGO:0000184//nuclear-transcribed mRNA catabolic process, nonsense-mediated decay;GO:0002181//cytoplasmic translation;GO:0006281//DNA repair;GO:0006412//translation;GO:0006413//translational initiation;GO:0006417//regulation of translation;GO:0006614//SRP-dependent cotranslational protein targeting to membrane;GO:0006915//apoptotic process;GO:0006974//cellular response to DNA damage stimulus;GO:0007049//cell cycle;GO:0007059//chromosome segregation;GO:0010628//positive regulation of gene expression;GO:0017148//negative regulation of translation;GO:0019083//viral transcription;GO:0031116//positive regulation of microtubule polymerization;GO:0031334//positive regulation of protein complex assembly;GO:0031397//negative regulation of protein ubiquitination;GO:0032079//positive regulation of endodeoxyribonuclease activity;GO:0032743//positive regulation of interleukin-2 production;GO:0034614//cellular response to reactive oxygen species;GO:0042104//positive regulation of activated T cell proliferation;GO:0042769//DNA damage response, detection of DNA damage;GO:0042981//regulation of apoptotic process;GO:0043507//positive regulation of JUN kinase activity;GO:0045738//negative regulation of DNA repair;GO:0045739//positive regulation of DNA repair;GO:0050862//positive regulation of T cell receptor signaling pathway;GO:0051092//positive regulation of NF-kappaB transcription factor activity;GO:0051225//spindle assembly;GO:0051301//cell division;GO:0061481//response to TNF agonist;GO:0070301//cellular response to hydrogen peroxide;GO:0071356//cellular response to tumor necrosis factor;GO:1901224//positive regulation of NIK/NF-kappaB signaling;GO:1902231//positive regulation of intrinsic apoptotic signaling pathway in response to DNA damage;GO:1902546//positive regulation of DNA N-glycosylase activity;GO:1905053//positive regulation of base-excision repair;GO:2001235//positive regulation of apoptotic signaling pathway;GO:2001272//positive regulation of cysteine-type endopeptidase activity involved in execution phase of apoptosis
0ENSG00000100316RPL354.379320308084129.44237987721880.88516382752144100ribosomal protein L3 [Source:HGNC Symbol;Acc:HGNC:10332]Genetic Information ProcessingTranslationko03010//RibosomeK02925GO:0005634//nucleus;GO:0005730//nucleolus;GO:0005737//cytoplasm;GO:0005829//cytosol;GO:0005840//ribosome;GO:0005925//focal adhesion;GO:0022625//cytosolic large ribosomal subunit;GO:0022626//cytosolic ribosome;GO:0032991//protein-containing complex;GO:0070062//extracellular exosomeGO:0003723//RNA binding;GO:0003735//structural constituent of ribosome;GO:0005515//protein bindingGO:0000027//ribosomal large subunit assembly;GO:0000184//nuclear-transcribed mRNA catabolic process, nonsense-mediated decay;GO:0002181//cytoplasmic translation;GO:0006364//rRNA processing;GO:0006412//translation;GO:0006413//translational initiation;GO:0006614//SRP-dependent cotranslational protein targeting to membrane;GO:0019083//viral transcription;GO:0071353//cellular response to interleukin-4

5.2 上调基因表达分布

图形是比表格更优秀的数据呈现形式。我们使用热图、tSNE图、密度分布图、小提琴图和气泡图来可视化基因在细胞和各个细胞亚群的表达分布情况。(选择亚群上调top5的基因用于绘图)

Fig 5-2-1 标记基因表达热图

Fig 5-2-2 标记基因表达分布气泡图

上调基因表达分布图只展示其中5个基因,其他top5的基因请浏览文件夹3.MarkerGene/Plots/ExpPlot

  • ACTN2
  • AK5
  • AKAP6
  • AQP3
  • ASPM

Fig 5-2-3 标记基因表达分布图


上调基因表达分布密度图只展示其中5个基因,其他top5的基因请浏览文件夹3.MarkerGene/Plots/DensityPlot

  • ACTN2
  • AK5
  • AKAP6
  • AQP3
  • ASPM

Fig 5-2-4 标记基因表达分布密度图


上调基因表达分布小提琴图只展示其中5个基因,其他top5的基因请浏览文件夹3.MarkerGene/Plots/ViolinPlot

  • ACTN2
  • AK5
  • AKAP6
  • AQP3
  • ASPM

Fig 5-2-5 标记基因表达分布小提琴图


5.3 GO富集分析

       Gene Ontology(简称GO)是一个国际标准化的基因功能分类体系,提供了一套动态更新的标准词汇表(controlled vocabulary)来全面描述生物体中基因和基因产物的属性。GO总共有三个ontology(本体),分别描述基因的分子功能(molecular function)、细胞组分(cellular component)、参与的生物过程(biological process)。GO的基本单位是term(词条、节点),每个term都对应一个属性。 GO功能分析一方面给出基因的GO功能分类注释;另一方面给出基因的GO功能显著性富集分析。 首先,我们将基因向GO数据库(http://www.geneontology.org/)的各term映射,并计算每个term的基因数,从而得到具有某个GO功能的基因列表及基因数目统计。然后应用超几何检验,找出与整个基因组背景相比,在基因中显著富集的GO条目。

基因集 细胞组分 分子功能 生物学过程 GO 分类表
Cluster_0 Cluster_0.C.html Cluster_0.F.html Cluster_0.P.html Cluster_0.Level2.xls
Cluster_1 Cluster_1.C.html Cluster_1.F.html Cluster_1.P.html Cluster_1.Level2.xls
Cluster_2 Cluster_2.C.html Cluster_2.F.html Cluster_2.P.html Cluster_2.Level2.xls
Cluster_3 Cluster_3.C.html Cluster_3.F.html Cluster_3.P.html Cluster_3.Level2.xls
Cluster_4 Cluster_4.C.html Cluster_4.F.html Cluster_4.P.html Cluster_4.Level2.xls
Cluster_5 Cluster_5.C.html Cluster_5.F.html Cluster_5.P.html Cluster_5.Level2.xls
Cluster_6 Cluster_6.C.html Cluster_6.F.html Cluster_6.P.html Cluster_6.Level2.xls
Cluster_7 Cluster_7.C.html Cluster_7.F.html Cluster_7.P.html Cluster_7.Level2.xls
Cluster_8 Cluster_8.C.html Cluster_8.F.html Cluster_8.P.html Cluster_8.Level2.xls
Cluster_9 Cluster_9.C.html Cluster_9.F.html Cluster_9.P.html Cluster_9.Level2.xls
Cluster_10 Cluster_10.C.html Cluster_10.F.html Cluster_10.P.html Cluster_10.Level2.xls
Cluster_11 Cluster_11.C.html Cluster_11.F.html Cluster_11.P.html Cluster_11.Level2.xls
Cluster_12 Cluster_12.C.html Cluster_12.F.html Cluster_12.P.html Cluster_12.Level2.xls
Cluster_13 Cluster_13.C.html Cluster_13.F.html Cluster_13.P.html Cluster_13.Level2.xls
Cluster_14 Cluster_14.C.html Cluster_14.F.html Cluster_14.P.html Cluster_14.Level2.xls
Cluster_15 Cluster_15.C.html Cluster_15.F.html Cluster_15.P.html Cluster_15.Level2.xls
Cluster_16 Cluster_16.C.html Cluster_16.F.html Cluster_16.P.html Cluster_16.Level2.xls
Cluster_17 Cluster_17.C.html Cluster_17.F.html Cluster_17.P.html Cluster_17.Level2.xls

GO富集圈图:(第一圈:富集前20的GOterm,圈外为基因数目的坐标尺。不同的颜色代表不同的Ontology; 第二圈:背景基因中该GOterm的数目以及Q值。基因越多条形越长,Q值越小颜色越红; 第三圈:该GOterm差异基因数量 第四圈:各GOterm的RichFactor值(该GOterm中差异数量除以所有数量),背景网格线,每一格代表0.1)

  • Cluster_0 富集圈图
  • Cluster_1 富集圈图
  • Cluster_2 富集圈图
  • Cluster_3 富集圈图
  • Cluster_4 富集圈图
  • Cluster_5 富集圈图
  • Cluster_6 富集圈图
  • Cluster_7 富集圈图
  • Cluster_8 富集圈图
  • Cluster_9 富集圈图
  • Cluster_10 富集圈图
  • Cluster_11 富集圈图
  • Cluster_12 富集圈图
  • Cluster_13 富集圈图
  • Cluster_14 富集圈图
  • Cluster_15 富集圈图
  • Cluster_16 富集圈图
  • Cluster_17 富集圈图

Fig 5-3-1 GO 富集圈图


GO 富集分类柱状图:(横坐标为二级GOterm,纵坐标为该term里的基因数量,不同颜色表色不同类型GOterm)

  • Cluster_0
  • Cluster_1
  • Cluster_2
  • Cluster_3
  • Cluster_4
  • Cluster_5
  • Cluster_6
  • Cluster_7
  • Cluster_8
  • Cluster_9
  • Cluster_10
  • Cluster_11
  • Cluster_12
  • Cluster_13
  • Cluster_14
  • Cluster_15
  • Cluster_16
  • Cluster_17

Fig 5-3-2 GO富集分类柱状图


GO富集气泡图:(利用Q值最小的前20个GOterm来作图,纵坐标为GOterm,横坐标为富集因子(该GOterm中差异数量除以所有数量),大小表示数量多少,颜色越红Q值越小)");

  • Cluster_0.C 气泡图
  • Cluster_0.F 气泡图
  • Cluster_0.P 气泡图
  • Cluster_1.C 气泡图
  • Cluster_1.F 气泡图
  • Cluster_1.P 气泡图
  • Cluster_2.C 气泡图
  • Cluster_2.F 气泡图
  • Cluster_2.P 气泡图
  • Cluster_3.C 气泡图
  • Cluster_3.F 气泡图
  • Cluster_3.P 气泡图
  • Cluster_4.C 气泡图
  • Cluster_4.F 气泡图
  • Cluster_4.P 气泡图
  • Cluster_5.C 气泡图
  • Cluster_5.F 气泡图
  • Cluster_5.P 气泡图
  • Cluster_6.C 气泡图
  • Cluster_6.F 气泡图
  • Cluster_6.P 气泡图
  • Cluster_7.C 气泡图
  • Cluster_7.F 气泡图
  • Cluster_7.P 气泡图
  • Cluster_8.C 气泡图
  • Cluster_8.F 气泡图
  • Cluster_8.P 气泡图
  • Cluster_9.C 气泡图
  • Cluster_9.F 气泡图
  • Cluster_9.P 气泡图
  • Cluster_10.C 气泡图
  • Cluster_10.F 气泡图
  • Cluster_10.P 气泡图
  • Cluster_11.C 气泡图
  • Cluster_11.F 气泡图
  • Cluster_11.P 气泡图
  • Cluster_12.C 气泡图
  • Cluster_12.F 气泡图
  • Cluster_12.P 气泡图
  • Cluster_13.C 气泡图
  • Cluster_13.F 气泡图
  • Cluster_13.P 气泡图
  • Cluster_14.C 气泡图
  • Cluster_14.F 气泡图
  • Cluster_14.P 气泡图
  • Cluster_15.C 气泡图
  • Cluster_15.F 气泡图
  • Cluster_15.P 气泡图
  • Cluster_16.C 气泡图
  • Cluster_16.F 气泡图
  • Cluster_16.P 气泡图
  • Cluster_17.C 气泡图
  • Cluster_17.F 气泡图
  • Cluster_17.P 气泡图

Fig 5-3-3 GO富集气泡图


GO富集条形图:(利用Q值最小的前20个GOterm来作图,纵坐标为GOterm,横坐标为该GOterm数目占所有差异数目的百分比,颜色越深Q值越小,柱子上的数值为该GOterm数量及Q值");

  • Cluster_0.C 富集柱形图
  • Cluster_0.F 富集柱形图
  • Cluster_0.P 富集柱形图
  • Cluster_1.C 富集柱形图
  • Cluster_1.F 富集柱形图
  • Cluster_1.P 富集柱形图
  • Cluster_2.C 富集柱形图
  • Cluster_2.F 富集柱形图
  • Cluster_2.P 富集柱形图
  • Cluster_3.C 富集柱形图
  • Cluster_3.F 富集柱形图
  • Cluster_3.P 富集柱形图
  • Cluster_4.C 富集柱形图
  • Cluster_4.F 富集柱形图
  • Cluster_4.P 富集柱形图
  • Cluster_5.C 富集柱形图
  • Cluster_5.F 富集柱形图
  • Cluster_5.P 富集柱形图
  • Cluster_6.C 富集柱形图
  • Cluster_6.F 富集柱形图
  • Cluster_6.P 富集柱形图
  • Cluster_7.C 富集柱形图
  • Cluster_7.F 富集柱形图
  • Cluster_7.P 富集柱形图
  • Cluster_8.C 富集柱形图
  • Cluster_8.F 富集柱形图
  • Cluster_8.P 富集柱形图
  • Cluster_9.C 富集柱形图
  • Cluster_9.F 富集柱形图
  • Cluster_9.P 富集柱形图
  • Cluster_10.C 富集柱形图
  • Cluster_10.F 富集柱形图
  • Cluster_10.P 富集柱形图
  • Cluster_11.C 富集柱形图
  • Cluster_11.F 富集柱形图
  • Cluster_11.P 富集柱形图
  • Cluster_12.C 富集柱形图
  • Cluster_12.F 富集柱形图
  • Cluster_12.P 富集柱形图
  • Cluster_13.C 富集柱形图
  • Cluster_13.F 富集柱形图
  • Cluster_13.P 富集柱形图
  • Cluster_14.C 富集柱形图
  • Cluster_14.F 富集柱形图
  • Cluster_14.P 富集柱形图
  • Cluster_15.C 富集柱形图
  • Cluster_15.F 富集柱形图
  • Cluster_15.P 富集柱形图
  • Cluster_16.C 富集柱形图
  • Cluster_16.F 富集柱形图
  • Cluster_16.P 富集柱形图
  • Cluster_17.C 富集柱形图
  • Cluster_17.F 富集柱形图
  • Cluster_17.P 富集柱形图

Fig 5-3-4 GO富集条形图


5.4 KO富集分析

       在生物体内,不同基因相互协调行使其生物学,基于Pathway的分析有助于更进一步了解基因的生物学功能。KEGG是有关Pathway的主要公共数据库。 Pathway显著性富集分析以KEGG Pathway为单位,应用超几何检验,找出与整个基因组背景相比,在基因中显著性富集的Pathway。通过Pathway显著性富集能确定基因参与的最主要生化代谢途径和信号转导途径。

所有趋势pathway统计如下所示:

Tab 5-4-1 所有趋势pathway统计表
PathwayPathway_IDKEGG_A_classKEGG_B_classCluster_0(201)Cluster_0_PvalueCluster_0_QvalueCluster_1(328)Cluster_1_PvalueCluster_1_QvalueCluster_10(1083)Cluster_10_PvalueCluster_10_QvalueCluster_11(199)Cluster_11_PvalueCluster_11_QvalueCluster_12(264)Cluster_12_PvalueCluster_12_QvalueCluster_13(139)Cluster_13_PvalueCluster_13_QvalueCluster_14(162)Cluster_14_PvalueCluster_14_QvalueCluster_15(603)Cluster_15_PvalueCluster_15_QvalueCluster_16(165)Cluster_16_PvalueCluster_16_QvalueCluster_17(119)Cluster_17_PvalueCluster_17_QvalueCluster_2(176)Cluster_2_PvalueCluster_2_QvalueCluster_3(788)Cluster_3_PvalueCluster_3_QvalueCluster_4(696)Cluster_4_PvalueCluster_4_QvalueCluster_5(294)Cluster_5_PvalueCluster_5_QvalueCluster_6(1132)Cluster_6_PvalueCluster_6_QvalueCluster_7(147)Cluster_7_PvalueCluster_7_QvalueCluster_8(829)Cluster_8_PvalueCluster_8_QvalueCluster_9(357)Cluster_9_PvalueCluster_9_Qvalue
2-Oxocarboxylic acid metabolismko01210MetabolismGlobal and overview maps01110.53503829.616227e-01114.86387e-061.018925e-0401101110.27441026.698837e-0101110.76129711.000000e+0001101101120.54966718.361133e-0120.48070767.780525e-0110.49589427.921402e-0140.25390366.597495e-0101130.29306348.320907e-01011
ABC transportersko02010Environmental Information ProcessingMembrane transport01110.84387969.999999e-0140.86710481.000000e+0001101120.17924424.959090e-0101120.85685321.000000e+0020.23188340.520643860410.48579890.8043555601140.64634648.961024e-0120.90744011.000000e+0010.810079.485722e-0120.99032411.000000e+0001130.85106211.000000e+00011
AGE-RAGE signaling pathway in diabetic complicationsko04933Human DiseasesEndocrine and metabolic diseases60.038371273.001181e-0170.11196573.918800e-01120.70560861.000000e+0060.03681991.185817e-0140.4146349.999996e-0110.82585249.622773e-0120.60036029.858982e-01120.068758593.246238e-0150.053782580.227345658640.05998640.4178363020.64566939.871110e-01170.016246269.931676e-02140.047520371.695613e-0180.029142889.854895e-02160.3251267.298747e-0150.035560731.667403e-01100.58481761.000000e+0080.074728255.624284e-01
AMPK signaling pathwayko04152Environmental Information ProcessingSignal transduction01120.96230429.999999e-01180.37529769.739866e-0101150.37147939.999996e-0110.88248279.622773e-0110.91783289.858982e-0160.90302651.000000e+0040.24073730.525646581710.83972070.8880815820.75110179.871110e-01120.53834228.266487e-01100.6181448.903542e-0110.98966811.000000e+00240.053169072.580264e-0120.65715079.022570e-0190.89310181.000000e+0040.79645429.999142e-01
Acute myeloid leukemiako05221Human DiseasesCancers20.52314369.390856e-0150.15421484.507817e-01140.078938593.796017e-0150.028722369.677226e-0230.40197939.999996e-0101101180.14906425.061250e-0150.0139380.112286813330.083911740.5009561510.78724329.871110e-01120.036978381.788208e-01140.0022565531.816814e-0230.47117747.900987e-0190.66139361.000000e+0020.36496757.968263e-0170.58666311.000000e+0010.95819149.999142e-01
Adherens junctionko04520Cellular ProcessesCellular community - eukaryotes20.52314369.390856e-0140.31628997.294964e-0170.84540631.000000e+0060.0073548773.158271e-0210.90309899.999996e-0110.70461749.622773e-0110.75907649.858982e-0140.77681021.000000e+0040.054448330.227345658601140.065896763.001964e-0190.24007135.557630e-0160.56686218.579730e-0150.11063372.796203e-01261.200854e-064.403131e-0530.1345694.368317e-0140.93783921.000000e+0060.088119776.146891e-01
Adipocytokine signaling pathwayko04920Organismal SystemsEndocrine system20.50802859.365971e-0130.52595659.517308e-0180.70801091.000000e+0050.02580459.264239e-0230.38469519.999996e-0101101180.13284474.730567e-0110.75572290.826602473120.2650160.72342205011110.0639432.622472e-01110.029996311.308172e-0110.92046059.764918e-01100.4877499.613114e-0101170.55586431.000000e+0030.58414019.999142e-01
Adrenergic signaling in cardiomyocytesko04261Organismal SystemsCirculatory system40.50376389.365971e-0160.55869869.853248e-01100.99667171.000000e+0040.49590637.388310e-0150.5325529.999996e-0110.92476469.622773e-0130.57249229.858982e-01140.21235995.814853e-0110.95385580.970161027430.37161660.7664993920.83684279.871110e-01190.12816223.844866e-01130.51092648.068303e-0150.62891298.691192e-01170.84228081.000000e+0030.5065158.358562e-01110.90334781.000000e+0070.48048019.999142e-01
African trypanosomiasisko05143Human DiseasesInfectious diseases01130.80547059.999999e-0140.9997791.000000e+0010.92823199.744691e-0101110.84010889.622773e-0170.0050427443.112590e-0250.90079551.000000e+0010.88691950.917379345510.79144620.8784183101170.89749471.000000e+0050.95441831.000000e+0040.53467888.228291e-0130.99997981.000000e+0001150.98664511.000000e+00011
Alanine, aspartate and glutamate metabolismko00250MetabolismAmino acid metabolism01101170.26012237.563270e-0110.62152718.507152e-0110.72587039.999996e-0101101130.56281019.284777e-0101110.43907740.7877254601120.90397681.000000e+0030.66264089.085992e-0101150.6516441.000000e+0001130.77628511.000000e+0010.8279919.999142e-01

基因集 Pathway 富集结果 Pathway 注释表
Cluster_0 Cluster_0.htm Cluster_0.path.xls
Cluster_1 Cluster_1.htm Cluster_1.path.xls
Cluster_2 Cluster_2.htm Cluster_2.path.xls
Cluster_3 Cluster_3.htm Cluster_3.path.xls
Cluster_4 Cluster_4.htm Cluster_4.path.xls
Cluster_5 Cluster_5.htm Cluster_5.path.xls
Cluster_6 Cluster_6.htm Cluster_6.path.xls
Cluster_7 Cluster_7.htm Cluster_7.path.xls
Cluster_8 Cluster_8.htm Cluster_8.path.xls
Cluster_9 Cluster_9.htm Cluster_9.path.xls
Cluster_10 Cluster_10.htm Cluster_10.path.xls
Cluster_11 Cluster_11.htm Cluster_11.path.xls
Cluster_12 Cluster_12.htm Cluster_12.path.xls
Cluster_13 Cluster_13.htm Cluster_13.path.xls
Cluster_14 Cluster_14.htm Cluster_14.path.xls
Cluster_15 Cluster_15.htm Cluster_15.path.xls
Cluster_16 Cluster_16.htm Cluster_16.path.xls
Cluster_17 Cluster_17.htm Cluster_17.path.xls

KO富集圈图:(第一圈:富集前20的pathway,圈外为基因数目的坐标尺。不同的颜色代表不同的A class; 第二圈:背景基因中该pathway的数目以及Q值。基因越多条形越长,Q值越小颜色越红; 第三圈:该pathway差异基因数量 第四圈:各pathway的RichFactor值(该pathway中差异数量除以所有数量),背景网格线,每一格代表0.1)

  • Cluster_0 富集圈图
  • Cluster_1 富集圈图
  • Cluster_2 富集圈图
  • Cluster_3 富集圈图
  • Cluster_4 富集圈图
  • Cluster_5 富集圈图
  • Cluster_6 富集圈图
  • Cluster_7 富集圈图
  • Cluster_8 富集圈图
  • Cluster_9 富集圈图
  • Cluster_10 富集圈图
  • Cluster_11 富集圈图
  • Cluster_12 富集圈图
  • Cluster_13 富集圈图
  • Cluster_14 富集圈图
  • Cluster_15 富集圈图
  • Cluster_16 富集圈图
  • Cluster_17 富集圈图

Fig 5-4-1 KO 富集圈图


KO富集气泡图:(利用Q值最小的前20个pathway来作图,纵坐标为pathway,横坐标为富集因子(该pathway中差异数量除以所有数量),大小表示数量多少,颜色越红Q值越小)");

  • Cluster_0
  • Cluster_1
  • Cluster_2
  • Cluster_3
  • Cluster_4
  • Cluster_5
  • Cluster_6
  • Cluster_7
  • Cluster_8
  • Cluster_9
  • Cluster_10
  • Cluster_11
  • Cluster_12
  • Cluster_13
  • Cluster_14
  • Cluster_15
  • Cluster_16
  • Cluster_17

Fig 5-4-2 KO富集气泡图


KO富集条形图:(利用Q值最小的前20个pathway来作图,纵坐标为pathway,横坐标为该pathway数目占所有差异数目的百分比,颜色越深Q值越小,柱子上的数值为该pathway数量及Q值");

  • Cluster_0 富集柱形图
  • Cluster_1 富集柱形图
  • Cluster_2 富集柱形图
  • Cluster_3 富集柱形图
  • Cluster_4 富集柱形图
  • Cluster_5 富集柱形图
  • Cluster_6 富集柱形图
  • Cluster_7 富集柱形图
  • Cluster_8 富集柱形图
  • Cluster_9 富集柱形图
  • Cluster_10 富集柱形图
  • Cluster_11 富集柱形图
  • Cluster_12 富集柱形图
  • Cluster_13 富集柱形图
  • Cluster_14 富集柱形图
  • Cluster_15 富集柱形图
  • Cluster_16 富集柱形图
  • Cluster_17 富集柱形图

Fig 5-4-3 KO富集条形图


5.5 DO富集分析

       DO(Disease Ontology)是描述基因功能与疾病相关的数据库。我们将基因向DO数据库(http://disease-ontology.org/)的各term映射,并计算每个term的基因数,从而得到具有某个DO功能的基因列表及基因数目统计。然后应用超几何检验,找出与整个基因组背景相比,在基因中显著富集的DO条目。

基因集 DO 富集结果 DO 富集表
Cluster_0 Cluster_0.do.html Cluster_0.do.xls
Cluster_1 Cluster_1.do.html Cluster_1.do.xls
Cluster_2 Cluster_2.do.html Cluster_2.do.xls
Cluster_3 Cluster_3.do.html Cluster_3.do.xls
Cluster_4 Cluster_4.do.html Cluster_4.do.xls
Cluster_5 Cluster_5.do.html Cluster_5.do.xls
Cluster_6 Cluster_6.do.html Cluster_6.do.xls
Cluster_7 Cluster_7.do.html Cluster_7.do.xls
Cluster_8 Cluster_8.do.html Cluster_8.do.xls
Cluster_9 Cluster_9.do.html Cluster_9.do.xls
Cluster_10 Cluster_10.do.html Cluster_10.do.xls
Cluster_11 Cluster_11.do.html Cluster_11.do.xls
Cluster_12 Cluster_12.do.html Cluster_12.do.xls
Cluster_13 Cluster_13.do.html Cluster_13.do.xls
Cluster_14 Cluster_14.do.html Cluster_14.do.xls
Cluster_15 Cluster_15.do.html Cluster_15.do.xls
Cluster_16 Cluster_16.do.html Cluster_16.do.xls
Cluster_17 Cluster_17.do.html Cluster_17.do.xls

DO富集圈图:(第一圈:富集前20的DOterm,圈外为基因数目的坐标尺。 第二圈:背景基因中该DOterm的数目以及Q值。基因越多条形越长,Q值越小颜色越红; 第三圈:该DOterm差异基因数量 第四圈:各DOterm的RichFactor值(该DOterm中差异数量除以所有数量),背景网格线,每一格代表0.1)

  • Cluster_0 富集圈图
  • Cluster_1 富集圈图
  • Cluster_2 富集圈图
  • Cluster_3 富集圈图
  • Cluster_4 富集圈图
  • Cluster_5 富集圈图
  • Cluster_6 富集圈图
  • Cluster_7 富集圈图
  • Cluster_8 富集圈图
  • Cluster_9 富集圈图
  • Cluster_10 富集圈图
  • Cluster_11 富集圈图
  • Cluster_12 富集圈图
  • Cluster_13 富集圈图
  • Cluster_14 富集圈图
  • Cluster_15 富集圈图
  • Cluster_16 富集圈图
  • Cluster_17 富集圈图

Fig 5-5-1 DO 富集圈图


DO富集气泡图:(利用Q值最小的前20个DOterm来作图,纵坐标为DOterm,横坐标为富集因子(该DOterm中差异数量除以所有数量),大小表示数量多少,颜色越红Q值越小)");

  • Cluster_0
  • Cluster_1
  • Cluster_2
  • Cluster_3
  • Cluster_4
  • Cluster_5
  • Cluster_6
  • Cluster_7
  • Cluster_8
  • Cluster_9
  • Cluster_10
  • Cluster_11
  • Cluster_12
  • Cluster_13
  • Cluster_14
  • Cluster_15
  • Cluster_16
  • Cluster_17

Fig 5-5-2 DO富集气泡图


DO富集条形图:(利用Q值最小的前20个DOterm来作图,纵坐标为DOterm,横坐标为该DOterm数目占所有差异数目的百分比,颜色越深Q值越小,柱子上的数值为该DOterm数量及Q值");

  • Cluster_0 富集柱形图
  • Cluster_1 富集柱形图
  • Cluster_2 富集柱形图
  • Cluster_3 富集柱形图
  • Cluster_4 富集柱形图
  • Cluster_5 富集柱形图
  • Cluster_6 富集柱形图
  • Cluster_7 富集柱形图
  • Cluster_8 富集柱形图
  • Cluster_9 富集柱形图
  • Cluster_10 富集柱形图
  • Cluster_11 富集柱形图
  • Cluster_12 富集柱形图
  • Cluster_13 富集柱形图
  • Cluster_14 富集柱形图
  • Cluster_15 富集柱形图
  • Cluster_16 富集柱形图
  • Cluster_17 富集柱形图

Fig 5-5-3 DO富集条形图


5.6 Reactome富集分析

       Reactome数据库汇集了部分物种各项反应及生物学通路。我们将基因向Reactome数据库(https://reactome.org/)的各term映射,并计算每个term的基因数,从而得到具有某个Reactome功能的基因列表及基因数目统计。然后应用超几何检验,找出与整个基因组背景相比,在基因中显著富集的Reactome条目。

基因集 Reactome 富集结果 Reactome 富集表
Cluster_0 Cluster_0.reactome.html Cluster_0.reactome.xls
Cluster_1 Cluster_1.reactome.html Cluster_1.reactome.xls
Cluster_2 Cluster_2.reactome.html Cluster_2.reactome.xls
Cluster_3 Cluster_3.reactome.html Cluster_3.reactome.xls
Cluster_4 Cluster_4.reactome.html Cluster_4.reactome.xls
Cluster_5 Cluster_5.reactome.html Cluster_5.reactome.xls
Cluster_6 Cluster_6.reactome.html Cluster_6.reactome.xls
Cluster_7 Cluster_7.reactome.html Cluster_7.reactome.xls
Cluster_8 Cluster_8.reactome.html Cluster_8.reactome.xls
Cluster_9 Cluster_9.reactome.html Cluster_9.reactome.xls
Cluster_10 Cluster_10.reactome.html Cluster_10.reactome.xls
Cluster_11 Cluster_11.reactome.html Cluster_11.reactome.xls
Cluster_12 Cluster_12.reactome.html Cluster_12.reactome.xls
Cluster_13 Cluster_13.reactome.html Cluster_13.reactome.xls
Cluster_14 Cluster_14.reactome.html Cluster_14.reactome.xls
Cluster_15 Cluster_15.reactome.html Cluster_15.reactome.xls
Cluster_16 Cluster_16.reactome.html Cluster_16.reactome.xls
Cluster_17 Cluster_17.reactome.html Cluster_17.reactome.xls

Reactome富集圈图:(第一圈:富集前20的Reactome通路,圈外为基因数目的坐标尺。 第二圈:背景基因中该Reactome通路的数目以及Q值。基因越多条形越长,Q值越小颜色越红; 第三圈:该Reactome通路差异基因数量 第四圈:各Reactome通路的RichFactor值(该Reactome通路中差异数量除以所有数量),背景网格线,每一格代表0.1)

  • Cluster_0 富集圈图
  • Cluster_1 富集圈图
  • Cluster_2 富集圈图
  • Cluster_3 富集圈图
  • Cluster_4 富集圈图
  • Cluster_5 富集圈图
  • Cluster_6 富集圈图
  • Cluster_7 富集圈图
  • Cluster_8 富集圈图
  • Cluster_9 富集圈图
  • Cluster_10 富集圈图
  • Cluster_11 富集圈图
  • Cluster_12 富集圈图
  • Cluster_13 富集圈图
  • Cluster_14 富集圈图
  • Cluster_15 富集圈图
  • Cluster_16 富集圈图
  • Cluster_17 富集圈图

Fig 5-6-1 Reactome 富集圈图


Reactome富集气泡图:(利用Q值最小的前20个Reactome通路来作图,纵坐标为Reactome通路,横坐标为富集因子(该Reactome通路中差异数量除以所有数量),大小表示数量多少,颜色越红Q值越小)");

  • Cluster_0
  • Cluster_1
  • Cluster_2
  • Cluster_3
  • Cluster_4
  • Cluster_5
  • Cluster_6
  • Cluster_7
  • Cluster_8
  • Cluster_9
  • Cluster_10
  • Cluster_11
  • Cluster_12
  • Cluster_13
  • Cluster_14
  • Cluster_15
  • Cluster_16
  • Cluster_17

Fig 5-6-2 Reactome富集气泡图


Reactome富集条形图:(利用Q值最小的前20个Reactome通路来作图,纵坐标为Reactome通路,横坐标为该Reactome通路数目占所有差异数目的百分比,颜色越深Q值越小,柱子上的数值为该Reactome通路数量及Q值");

  • Cluster_0 富集柱形图
  • Cluster_1 富集柱形图
  • Cluster_2 富集柱形图
  • Cluster_3 富集柱形图
  • Cluster_4 富集柱形图
  • Cluster_5 富集柱形图
  • Cluster_6 富集柱形图
  • Cluster_7 富集柱形图
  • Cluster_8 富集柱形图
  • Cluster_9 富集柱形图
  • Cluster_10 富集柱形图
  • Cluster_11 富集柱形图
  • Cluster_12 富集柱形图
  • Cluster_13 富集柱形图
  • Cluster_14 富集柱形图
  • Cluster_15 富集柱形图
  • Cluster_16 富集柱形图
  • Cluster_17 富集柱形图

Fig 5-6-3 Reactome富集条形图


5.7 上调基因蛋白质互作网络分析

通过string数据库[10],我们可以获得上调基因构建蛋白质互作关系信息(3.MarkerGene/String),然后利用Cytoscape构建蛋白质互作网络图。

String蛋白质互作调控网络图

注:依据系统配置及浏览器不同,如果标记基因数量过多该图可能不能正常加载,请使用桌面版Cytoscape软件


Cytoscape官方手册:http://manual.cytoscape.org/en/stable/index.html
Cytoscape使用教程:http://www.omicshare.com/class/home/index/classdetail?id=14

广州基迪奥生物科技有限公司




6 GSVA分析

基于传统的超几何检验的富集分析,往往需要用到显著差异基因集数据。当单个基因变化较为微弱时,基于传统富集分析得到结果可能会很少,甚至没有结果。GSVA分析(Gene Set Variation Analysis)[11]能够有效弥补传统富集分析对微效基因的有效信息挖据不足等问题,更为全面地对某一功能单位的调节作用进行解释。GSVA分析反映了某一个细胞亚群相对于所有细胞过表达的通路信息。GSVA原理如下:

我们对MSigDB数据库的八个数据集的每个通路进行GSVA分析

MSigDB数据库各数据集在各细胞富集程度表:

  • H
  • C1
  • C2
  • C3
  • C4
  • C5
  • C6
  • C7

Fig 6-0-1 各个亚群中的富集分数热图


广州基迪奥生物科技有限公司




7 转录因子注释

转录因子是调控基因表达的重要元件,其表达情况与细胞的下游基因表达和上游表观调控息息相关。为了方便转录因子分析的进行,我们使用animalTFDB(对动物样本)或者plantTFDB(对植物样本)对样本中所有有表达的转录因子进行注释。

Tab 7-0-1 转录因子注释表
gene_idTF_familyGene_nameCluster 0Cluster 1Cluster 2Cluster 3Cluster 4Cluster 5Cluster 6Cluster 7Cluster 8Cluster 9Cluster 10Cluster 11Cluster 12Cluster 13Cluster 14Cluster 15Cluster 16Cluster 17DescriptionKEGG_A_classKEGG_B_classPathwayK_IDGO ComponentGO FunctionGO Process
ENSG00000184895HMGSRY0000000.0019056021604784300000000000sex determining region Y [Source:HGNC Symbol;Acc:HGNC:11311]----GO:0000785//chromatin;GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0016607//nuclear speckGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0005516//calmodulin binding;GO:0008134//transcription factor bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0006355//regulation of transcription, DNA-templated;GO:0007548//sex differentiation;GO:0009653//anatomical structure morphogenesis;GO:0010628//positive regulation of gene expression;GO:0030154//cell differentiation;GO:0030238//male sex determination;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:2000020//positive regulation of male gonad development
ENSG00000067646zf-C2H2ZFY0.227914270739110.2476157522933650.3216541623253710.1654353131372010.1685205793690560.2961476487425350.08326459176114520.3037902404655560.2382707184807880.145831944345320.1043614071059290.2271245747371630.296870742779830.2739022686845960.09762259864781910.2511871252084590.1540570935588730.351837832358251zinc finger protein Y-linked [Source:HGNC Symbol;Acc:HGNC:12870]----GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005730//nucleolusGO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0005515//protein binding;GO:0043565//sequence-specific DNA binding;GO:0046872//metal ion bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II
ENSG00000189403HMGHMGB16.091108637246439.124097288074217.323281303043924.628037918873484.195002656746499.536960510284238.081331285943166.6783386152073623.59169283121799.2862852530292310.14799074216556.097967139105649.128508806815797.586431805263775.573812853577059.153243013670613.676445561122517.95279003891913high mobility group box 1 [Source:HGNC Symbol;Acc:HGNC:4983]Cellular Processes;Cellular Processes;Genetic Information ProcessingCell growth and death;Transport and catabolism;Replication and repairko04217//Necroptosis;ko04140//Autophagy - animal;ko03410//Base excision repairK10802;K10802;K10802GO:0000793//condensed chromosome;GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005694//chromosome;GO:0005737//cytoplasm;GO:0005768//endosome;GO:0005769//early endosome;GO:0005793//endoplasmic reticulum-Golgi intermediate compartment;GO:0005886//plasma membrane;GO:0009986//cell surface;GO:0016020//membrane;GO:0017053//transcriptional repressor complex;GO:0034774//secretory granule lumen;GO:0035868//alphav-beta3 integrin-HMGB1 complex;GO:0043005//neuron projection;GO:1904813//ficolin-1-rich granule lumenGO:0000400//four-way junction DNA binding;GO:0000405//bubble DNA binding;GO:0000976//transcription regulatory region sequence-specific DNA binding;GO:0001530//lipopolysaccharide binding;GO:0001786//phosphatidylserine binding;GO:0003677//DNA binding;GO:0003684//damaged DNA binding;GO:0003690//double-stranded DNA binding;GO:0003697//single-stranded DNA binding;GO:0003713//transcription coactivator activity;GO:0003723//RNA binding;GO:0003725//double-stranded RNA binding;GO:0003727//single-stranded RNA binding;GO:0005125//cytokine activity;GO:0005178//integrin binding;GO:0005515//protein binding;GO:0008134//transcription factor binding;GO:0008301//DNA binding, bending;GO:0010858//calcium-dependent protein kinase regulator activity;GO:0016829//lyase activity;GO:0019958//C-X-C chemokine binding;GO:0030295//protein kinase activator activity;GO:0042056//chemoattractant activity;GO:0050786//RAGE receptor binding;GO:0070182//DNA polymerase binding;GO:0070491//repressing transcription factor binding;GO:0097100//supercoiled DNA bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0001654//eye development;GO:0001773//myeloid dendritic cell activation;GO:0001934//positive regulation of protein phosphorylation;GO:0001935//endothelial cell proliferation;GO:0002218//activation of innate immune response;GO:0002224//toll-like receptor signaling pathway;GO:0002250//adaptive immune response;GO:0002270//plasmacytoid dendritic cell activation;GO:0002281//macrophage activation involved in immune response;GO:0002376//immune system process;GO:0002407//dendritic cell chemotaxis;GO:0002437//inflammatory response to antigenic stimulus;GO:0002643//regulation of tolerance induction;GO:0002840//regulation of T cell mediated immune response to tumor cell;GO:0006265//DNA topological change;GO:0006281//DNA repair;GO:0006284//base-excision repair;GO:0006303//double-strand break repair via nonhomologous end joining;GO:0006309//apoptotic DNA fragmentation;GO:0006310//DNA recombination;GO:0006342//chromatin silencing;GO:0006357//regulation of transcription by RNA polymerase II;GO:0006914//autophagy;GO:0006935//chemotaxis;GO:0006954//inflammatory response;GO:0006974//cellular response to DNA damage stimulus;GO:0007165//signal transduction;GO:0007204//positive regulation of cytosolic calcium ion concentration;GO:0010508//positive regulation of autophagy;GO:0016032//viral process;GO:0017055//negative regulation of RNA polymerase II transcriptional preinitiation complex assembly;GO:0030324//lung development;GO:0031175//neuron projection development;GO:0031497//chromatin assembly;GO:0032072//regulation of restriction endodeoxyribonuclease activity;GO:0032147//activation of protein kinase activity;GO:0032392//DNA geometric change;GO:0032425//positive regulation of mismatch repair;GO:0032689//negative regulation of interferon-gamma production;GO:0032727//positive regulation of interferon-alpha production;GO:0032728//positive regulation of interferon-beta production;GO:0032731//positive regulation of interleukin-1 beta production;GO:0032732//positive regulation of interleukin-1 production;GO:0032733//positive regulation of interleukin-10 production;GO:0032735//positive regulation of interleukin-12 production;GO:0032755//positive regulation of interleukin-6 production;GO:0032757//positive regulation of interleukin-8 production;GO:0032760//positive regulation of tumor necrosis factor production;GO:0033151//V(D)J recombination;GO:0034137//positive regulation of toll-like receptor 2 signaling pathway;GO:0034145//positive regulation of toll-like receptor 4 signaling pathway;GO:0034165//positive regulation of toll-like receptor 9 signaling pathway;GO:0035711//T-helper 1 cell activation;GO:0035767//endothelial cell chemotaxis;GO:0042104//positive regulation of activated T cell proliferation;GO:0043065//positive regulation of apoptotic process;GO:0043277//apoptotic cell clearance;GO:0043280//positive regulation of cysteine-type endopeptidase activity involved in apoptotic process;GO:0043312//neutrophil degranulation;GO:0043371//negative regulation of CD4-positive, alpha-beta T cell differentiation;GO:0043388//positive regulation of DNA binding;GO:0043410//positive regulation of MAPK cascade;GO:0043536//positive regulation of blood vessel endothelial cell migration;GO:0043537//negative regulation of blood vessel endothelial cell migration;GO:0045063//T-helper 1 cell differentiation;GO:0045087//innate immune response;GO:0045089//positive regulation of innate immune response;GO:0045639//positive regulation of myeloid cell differentiation;GO:0045819//positive regulation of glycogen catabolic process;GO:0045859//regulation of protein kinase activity;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0046330//positive regulation of JNK cascade;GO:0050918//positive chemotaxis;GO:0051106//positive regulation of DNA ligation;GO:0051384//response to glucocorticoid;GO:0070374//positive regulation of ERK1 and ERK2 cascade;GO:0071222//cellular response to lipopolysaccharide;GO:0071639//positive regulation of monocyte chemotactic protein-1 production;GO:0090026//positive regulation of monocyte chemotaxis;GO:0090303//positive regulation of wound healing;GO:0097350//neutrophil clearance;GO:0098761//cellular response to interleukin-7;GO:1901224//positive regulation of NIK/NF-kappaB signaling;GO:1903672//positive regulation of sprouting angiogenesis;GO:1905564//positive regulation of vascular endothelial cell proliferation;GO:2000343//positive regulation of chemokine (C-X-C motif) ligand 2 production;GO:2000426//negative regulation of apoptotic cell clearance;GO:2000819//regulation of nucleotide-excision repair;GO:2001200//positive regulation of dendritic cell differentiation
ENSG00000120669bHLHSOHLH20003.06384939587018e-04002.92424115795739e-0400000000000spermatogenesis and oogenesis specific basic helix-loop-helix 2 [Source:HGNC Symbol;Acc:HGNC:26026]----GO:0000785//chromatin;GO:0005634//nucleus;GO:0005737//cytoplasmGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0042803//protein homodimerization activity;GO:0046982//protein heterodimerization activity;GO:0046983//protein dimerization activity;GO:1990837//sequence-specific double-stranded DNA bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0007275//multicellular organism development;GO:0007283//spermatogenesis;GO:0009994//oocyte differentiation;GO:0030154//cell differentiation;GO:0048477//oogenesis
ENSG00000136169MBDSETDB20.2989655953105270.2574386996466510.248768687560270.3099532383857540.2651812406232820.2631179570979970.05800968971859410.2819315880751050.1361563061908230.1372650698641010.2449711402372450.3691017501910040.4960491522846230.2945067389058060.2719646466564620.09232248278004850.7793496031414440.384201717228608SET domain bifurcated histone lysine methyltransferase 2 [Source:HGNC Symbol;Acc:HGNC:20263]Metabolism;MetabolismGlobal and overview maps;Amino acid metabolismko01100//Metabolic pathways;ko00310//Lysine degradationK18494;K18494GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005694//chromosome;GO:0005829//cytosolGO:0003677//DNA binding;GO:0005515//protein binding;GO:0008168//methyltransferase activity;GO:0008270//zinc ion binding;GO:0016740//transferase activity;GO:0018024//histone-lysine N-methyltransferase activity;GO:0046872//metal ion binding;GO:0046974//histone methyltransferase activity (H3-K9 specific)GO:0000278//mitotic cell cycle;GO:0001947//heart looping;GO:0006325//chromatin organization;GO:0007049//cell cycle;GO:0007059//chromosome segregation;GO:0007275//multicellular organism development;GO:0010629//negative regulation of gene expression;GO:0032259//methylation;GO:0034968//histone lysine methylation;GO:0045892//negative regulation of transcription, DNA-templated;GO:0051301//cell division;GO:0051567//histone H3-K9 methylation;GO:0070828//heterochromatin organization;GO:0070986//left/right axis specification;GO:0090309//positive regulation of methylation-dependent chromatin silencing
ENSG00000122034zf-C2H2GTF3A2.440912556497242.261667345332292.063153461319251.30868599670331.484202745147042.259554893516252.140395045971842.624058296031182.741023147865233.477485225847982.086277465303631.749815393692492.857071253942042.322501881498622.062210918452672.854061454803850.3370654130895831.44715693577479general transcription factor IIIA [Source:HGNC Symbol;Acc:HGNC:4662]----GO:0005634//nucleus;GO:0005654//nucleoplasmGO:0003677//DNA binding;GO:0003723//RNA binding;GO:0008097//5S rRNA binding;GO:0046872//metal ion bindingGO:0006383//transcription by RNA polymerase III;GO:0009303//rRNA transcription;GO:0042254//ribosome biogenesis;GO:0042273//ribosomal large subunit biogenesis
ENSG00000120690ETSELF11.891714292154272.376407407230662.125151909466431.15586800046521.11729006286592.430885678729870.4938360485747211.944330711984431.453218846532741.373912640684180.6972040749296242.157426406125461.703342601028483.490351691959021.168364600208452.719230042798110.2471598780825713.39915663392579E74 like ETS transcription factor 1 [Source:HGNC Symbol;Acc:HGNC:3316]Cellular ProcessesCell growth and deathko04214//Apoptosis - flyK09428GO:0000785//chromatin;GO:0005634//nucleus;GO:0005654//nucleoplasmGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0005515//protein binding;GO:0043565//sequence-specific DNA binding;GO:1990837//sequence-specific double-stranded DNA bindingGO:0001959//regulation of cytokine-mediated signaling pathway;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0030154//cell differentiation;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0050855//regulation of B cell receptor signaling pathway
ENSG00000102804TSC22TSC22D10.1970605343863520.5425540647124210.3157847347336590.1358275314310840.2105803031901021.630468868789262.41968867652960.2787892084355330.4765344825334831.512026336770240.1864677403862420.1541521252127260.2948402943567090.4729239806142820.1532255616444620.5772375270491930.5506239608560732.75204179339859TSC22 domain family member 1 [Source:HGNC Symbol;Acc:HGNC:16826]----GO:0005634//nucleus;GO:0005737//cytoplasmGO:0005515//protein bindingGO:0006357//regulation of transcription by RNA polymerase II;GO:0006366//transcription by RNA polymerase II
ENSG00000276644DACHDACH10000008.67041413489545e-0400000000.0141482739105829000.575366896364155dachshund family transcription factor 1 [Source:HGNC Symbol;Acc:HGNC:2663]----GO:0005634//nucleus;GO:0005667//transcription factor complex;GO:0005737//cytoplasmGO:0000977//RNA polymerase II regulatory region sequence-specific DNA binding;GO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001227//DNA-binding transcription repressor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0005515//protein bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0001967//suckling behavior;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0007275//multicellular organism development;GO:0007585//respiratory gaseous exchange;GO:0008283//cell proliferation;GO:0010944//negative regulation of transcription by competitive promoter binding;GO:0030336//negative regulation of cell migration;GO:0033262//regulation of nuclear cell cycle DNA replication;GO:0045892//negative regulation of transcription, DNA-templated;GO:0046545//development of primary female sexual characteristics;GO:0048147//negative regulation of fibroblast proliferation;GO:0060244//negative regulation of cell proliferation involved in contact inhibition;GO:2000279//negative regulation of DNA biosynthetic process
ENSG00000102554zf-C2H2KLF50.008017429490297456.45879049459803e-040.006518595522307345.36908139312629e-040.0033170529656090.00161255470591840.050610846502446200.0043618842273577800.00214456743538686000.01606059402264630000Kruppel like factor 5 [Source:HGNC Symbol;Acc:HGNC:6349]----GO:0000785//chromatin;GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005667//transcription factor complex;GO:0005794//Golgi apparatus;GO:0043231//intracellular membrane-bounded organelleGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0005515//protein binding;GO:0008134//transcription factor binding;GO:0043426//MRF binding;GO:0043565//sequence-specific DNA binding;GO:0046872//metal ion binding;GO:1990837//sequence-specific double-stranded DNA bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0001525//angiogenesis;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0008284//positive regulation of cell proliferation;GO:0010468//regulation of gene expression;GO:0014816//skeletal muscle satellite cell differentiation;GO:0014901//satellite cell activation involved in skeletal muscle regeneration;GO:0014908//myotube differentiation involved in skeletal muscle regeneration;GO:0030033//microvillus assembly;GO:0032534//regulation of microvillus assembly;GO:0035914//skeletal muscle cell differentiation;GO:0043403//skeletal muscle tissue regeneration;GO:0045600//positive regulation of fat cell differentiation;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0060576//intestinal epithelial cell development;GO:0061586//positive regulation of transcription by transcription factor localization;GO:0071407//cellular response to organic cyclic compound;GO:0099156//cell-cell signaling via exosome;GO:1901653//cellular response to peptide;GO:1902895//positive regulation of pri-miRNA transcription by RNA polymerase II;GO:1990830//cellular response to leukemia inhibitory factor
ENSG00000169548OthersZNF280A0004.4552717448448e-04000.029657066488480600000000000zinc finger protein 280A [Source:HGNC Symbol;Acc:HGNC:18597]----GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0005515//protein binding;GO:0046872//metal ion bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II
ENSG00000159086GCFCPAXBP10.4793297436658660.6697621048946830.7295905342281720.4035280104085590.3253704494622470.7051367790038750.2145713036124950.5468760229132450.5989088912247670.5235252905614220.2790541040555010.5245541906633720.5771855199159270.5448888466628580.4977491234163880.6289516159573242.633316115569250.537819848705888PAX3 and PAX7 binding protein 1 [Source:HGNC Symbol;Acc:HGNC:13579]----GO:0005634//nucleus;GO:0005829//cytosolGO:0003677//DNA binding;GO:0008134//transcription factor bindingGO:0000398//mRNA splicing, via spliceosome;GO:0006355//regulation of transcription, DNA-templated;GO:0007517//muscle organ development;GO:0014842//regulation of skeletal muscle satellite cell proliferation;GO:0031062//positive regulation of histone methylation;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:2000288//positive regulation of myoblast proliferation
ENSG00000152192PouPOU4F100.004673021442994790.00237866885884789000007.37858174804504e-04000.0127903407346772000000POU class 4 homeobox 1 [Source:HGNC Symbol;Acc:HGNC:9218]----GO:0000785//chromatin;GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0043005//neuron projection;GO:0090575//RNA polymerase II transcription factor complexGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003682//chromatin binding;GO:0003697//single-stranded DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0005515//protein binding;GO:0043565//sequence-specific DNA binding;GO:0051020//GTPase binding;GO:1990837//sequence-specific double-stranded DNA bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0001967//suckling behavior;GO:0003223//ventricular compact myocardium morphogenesis;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0007275//multicellular organism development;GO:0007399//nervous system development;GO:0007409//axonogenesis;GO:0007416//synapse assembly;GO:0007498//mesoderm development;GO:0007507//heart development;GO:0010628//positive regulation of gene expression;GO:0010629//negative regulation of gene expression;GO:0021535//cell migration in hindbrain;GO:0021559//trigeminal nerve development;GO:0021953//central nervous system neuron differentiation;GO:0021986//habenula development;GO:0030182//neuron differentiation;GO:0031175//neuron projection development;GO:0043065//positive regulation of apoptotic process;GO:0043066//negative regulation of apoptotic process;GO:0043069//negative regulation of programmed cell death;GO:0043524//negative regulation of neuron apoptotic process;GO:0045672//positive regulation of osteoclast differentiation;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0048665//neuron fate specification;GO:0048880//sensory system development;GO:0048934//peripheral nervous system neuron differentiation;GO:0048935//peripheral nervous system neuron development;GO:0050767//regulation of neurogenesis;GO:0051090//regulation of DNA-binding transcription factor activity;GO:0051355//proprioception involved in equilibrioception;GO:0060384//innervation;GO:0071158//positive regulation of cell cycle arrest;GO:0071345//cellular response to cytokine stimulus;GO:0071392//cellular response to estradiol stimulus;GO:0072332//intrinsic apoptotic signaling pathway by p53 class mediator;GO:1901796//regulation of signal transduction by p53 class mediator;GO:2000679//positive regulation of transcription regulatory region DNA binding;GO:2001208//negative regulation of transcription elongation by RNA polymerase I
ENSG00000173404zf-C2H2INSM100.00201180785842880000009.19565597211877e-040000000.0064435480753121900INSM transcriptional repressor 1 [Source:HGNC Symbol;Acc:HGNC:6090]----GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0017053//transcriptional repressor complexGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0001227//DNA-binding transcription repressor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0030332//cyclin binding;GO:0031490//chromatin DNA binding;GO:0042826//histone deacetylase binding;GO:0046872//metal ion bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0001933//negative regulation of protein phosphorylation;GO:0003309//type B pancreatic cell differentiation;GO:0003310//pancreatic A cell differentiation;GO:0003323//type B pancreatic cell development;GO:0003358//noradrenergic neuron development;GO:0007049//cell cycle;GO:0007275//multicellular organism development;GO:0007399//nervous system development;GO:0008284//positive regulation of cell proliferation;GO:0008285//negative regulation of cell proliferation;GO:0010468//regulation of gene expression;GO:0010564//regulation of cell cycle process;GO:0030154//cell differentiation;GO:0030182//neuron differentiation;GO:0030335//positive regulation of cell migration;GO:0031018//endocrine pancreas development;GO:0035270//endocrine system development;GO:0042421//norepinephrine biosynthetic process;GO:0043254//regulation of protein complex assembly;GO:0045597//positive regulation of cell differentiation;GO:0060290//transdifferentiation;GO:0061104//adrenal chromaffin cell differentiation;GO:0061549//sympathetic ganglion development;GO:0071158//positive regulation of cell cycle arrest;GO:2000179//positive regulation of neural precursor cell proliferation
ENSG00000275004OthersZNF280B0.03402427750073660.04966425656416940.04215661084157310.008789076657331710.01802346864248960.02792754626579950.05368117136688450.03748751927721370.03980787373489360.10100781068660.01051421409592820.01431341262294390.06288759053217720.06545393437459550.03875856639709370.012121329661378500zinc finger protein 280B [Source:HGNC Symbol;Acc:HGNC:23022]----GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0046872//metal ion bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II
ENSG00000088876zf-C2H2ZNF3430.009946647276286060.02860106001727010.009932344127069220.01209267858252350.0100735420367970.01520187700273680.01665772020907510.01283792710836370.018225719143891100.02008471833318010.006410367358922240.1043091212063330.06978854072161350.0424296144728360.062221501222402300zinc finger protein 343 [Source:HGNC Symbol;Acc:HGNC:16017]Human DiseasesInfectious diseasesko05168//Herpes simplex infectionK09228GO:0005634//nucleusGO:0000977//RNA polymerase II regulatory region sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001227//DNA-binding transcription repressor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0005515//protein binding;GO:0046872//metal ion binding;GO:1990837//sequence-specific double-stranded DNA bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II
ENSG00000100219TF_bZIPXBP11.600788380087840.4146195337079690.6477540864900070.9806167119857021.644194393817040.8883310769650330.9308531178577481.949021563849770.5108477304826461.625231981125231.029272445112110.3698230004199121.492055716140610.9309393857070751.488509167530131.563535137315220.2162957195077111.97800515414804X-box binding protein 1 [Source:HGNC Symbol;Acc:HGNC:12801]Human Diseases;Genetic Information Processing;Human Diseases;Human DiseasesNeurodegenerative diseases;Folding, sorting and degradation;Endocrine and metabolic diseases;Neurodegenerative diseaseko05010//Alzheimer disease;ko04141//Protein processing in endoplasmic reticulum;ko04932//Non-alcoholic fatty liver disease (NAFLD);ko05017//Spinocerebellar ataxiaK09027;K09027;K09027;K09027GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0005783//endoplasmic reticulum;GO:0005789//endoplasmic reticulum membrane;GO:0005829//cytosol;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0030176//integral component of endoplasmic reticulum membraneGO:0000976//transcription regulatory region sequence-specific DNA binding;GO:0000977//RNA polymerase II regulatory region sequence-specific DNA binding;GO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0000987//proximal promoter sequence-specific DNA binding;GO:0002020//protease binding;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0005515//protein binding;GO:0019901//protein kinase binding;GO:0030331//estrogen receptor binding;GO:0031490//chromatin DNA binding;GO:0031625//ubiquitin protein ligase binding;GO:0042803//protein homodimerization activity;GO:0046982//protein heterodimerization activity;GO:1990837//sequence-specific double-stranded DNA bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0001525//angiogenesis;GO:0001558//regulation of cell growth;GO:0001889//liver development;GO:0001934//positive regulation of protein phosphorylation;GO:0001935//endothelial cell proliferation;GO:0002639//positive regulation of immunoglobulin production;GO:0006355//regulation of transcription, DNA-templated;GO:0006366//transcription by RNA polymerase II;GO:0006511//ubiquitin-dependent protein catabolic process;GO:0006629//lipid metabolic process;GO:0006633//fatty acid biosynthetic process;GO:0006914//autophagy;GO:0006915//apoptotic process;GO:0006955//immune response;GO:0006986//response to unfolded protein;GO:0006990//positive regulation of transcription from RNA polymerase II promoter involved in unfolded protein response;GO:0006996//organelle organization;GO:0007275//multicellular organism development;GO:0007517//muscle organ development;GO:0008284//positive regulation of cell proliferation;GO:0010508//positive regulation of autophagy;GO:0010832//negative regulation of myotube differentiation;GO:0014065//phosphatidylinositol 3-kinase signaling;GO:0015031//protein transport;GO:0030154//cell differentiation;GO:0030335//positive regulation of cell migration;GO:0030512//negative regulation of transforming growth factor beta receptor signaling pathway;GO:0030968//endoplasmic reticulum unfolded protein response;GO:0031062//positive regulation of histone methylation;GO:0031647//regulation of protein stability;GO:0031648//protein destabilization;GO:0031670//cellular response to nutrient;GO:0032008//positive regulation of TOR signaling;GO:0032755//positive regulation of interleukin-6 production;GO:0032869//cellular response to insulin stimulus;GO:0034599//cellular response to oxidative stress;GO:0034976//response to endoplasmic reticulum stress;GO:0035356//cellular triglyceride homeostasis;GO:0035470//positive regulation of vascular wound healing;GO:0035924//cellular response to vascular endothelial growth factor stimulus;GO:0036498//IRE1-mediated unfolded protein response;GO:0036500//ATF6-mediated unfolded protein response;GO:0042149//cellular response to glucose starvation;GO:0042307//positive regulation of protein import into nucleus;GO:0042632//cholesterol homeostasis;GO:0043066//negative regulation of apoptotic process;GO:0045348//positive regulation of MHC class II biosynthetic process;GO:0045579//positive regulation of B cell differentiation;GO:0045582//positive regulation of T cell differentiation;GO:0045600//positive regulation of fat cell differentiation;GO:0045766//positive regulation of angiogenesis;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0048010//vascular endothelial growth factor receptor signaling pathway;GO:0048666//neuron development;GO:0051897//positive regulation of protein kinase B signaling;GO:0055089//fatty acid homeostasis;GO:0055092//sterol homeostasis;GO:0060394//negative regulation of pathway-restricted SMAD protein phosphorylation;GO:0060612//adipose tissue development;GO:0070373//negative regulation of ERK1 and ERK2 cascade;GO:0071073//positive regulation of phospholipid biosynthetic process;GO:0071222//cellular response to lipopolysaccharide;GO:0071230//cellular response to amino acid stimulus;GO:0071332//cellular response to fructose stimulus;GO:0071333//cellular response to glucose stimulus;GO:0071353//cellular response to interleukin-4;GO:0071375//cellular response to peptide hormone stimulus;GO:0071498//cellular response to fluid shear stress;GO:0071499//cellular response to laminar fluid shear stress;GO:1900100//positive regulation of plasma cell differentiation;GO:1900102//negative regulation of endoplasmic reticulum unfolded protein response;GO:1900103//positive regulation of endoplasmic reticulum unfolded protein response;GO:1901985//positive regulation of protein acetylation;GO:1902236//negative regulation of endoplasmic reticulum stress-induced intrinsic apoptotic signaling pathway;GO:1903071//positive regulation of ER-associated ubiquitin-dependent protein catabolic process;GO:1903489//positive regulation of lactation;GO:1904707//positive regulation of vascular smooth muscle cell proliferation;GO:1904754//positive regulation of vascular associated smooth muscle cell migration;GO:1990418//response to insulin-like growth factor stimulus;GO:1990440//positive regulation of transcription from RNA polymerase II promoter in response to endoplasmic reticulum stress;GO:2000347//positive regulation of hepatocyte proliferation;GO:2000353//positive regulation of endothelial cell apoptotic process
ENSG00000187792zf-C2H2ZNF700.01267415735619020.03566643233475650.01573020642347470.02946273266864040.01344547259752770.01851838500254070.07475127658887610.02614760095240840.02592269938193780.08680763060914650.003807871185274260000.02676621586278930.03136867483195300zinc finger protein 70 [Source:HGNC Symbol;Acc:HGNC:13140]----GO:0005634//nucleusGO:0000977//RNA polymerase II regulatory region sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0046872//metal ion bindingGO:0006357//regulation of transcription by RNA polymerase II
ENSG00000125285HMGSOX21000000009.05388692419543e-04000000000SRY-box transcription factor 21 [Source:HGNC Symbol;Acc:HGNC:11197]----GO:0000785//chromatin;GO:0005575//cellular_component;GO:0005634//nucleusGO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001228//DNA-binding transcription activator activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activityGO:0001942//hair follicle development;GO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0009653//anatomical structure morphogenesis;GO:0030154//cell differentiation;GO:0042633//hair cycle;GO:0043588//skin development;GO:0045944//positive regulation of transcription by RNA polymerase II;GO:0048863//stem cell differentiation
ENSG00000215397zf-C2H2SCRT20000.0016346850806270700000000000000scratch family transcriptional repressor 2 [Source:HGNC Symbol;Acc:HGNC:15952]----GO:0000785//chromatin;GO:0005634//nucleusGO:0000977//RNA polymerase II regulatory region sequence-specific DNA binding;GO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0001227//DNA-binding transcription repressor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0046872//metal ion binding;GO:0070888//E-box binding;GO:1990837//sequence-specific double-stranded DNA bindingGO:0000122//negative regulation of transcription by RNA polymerase II;GO:0006355//regulation of transcription, DNA-templated;GO:1902042//negative regulation of extrinsic apoptotic signaling pathway via death domain receptors;GO:2001222//regulation of neuron migration

8 膜蛋白注释

使用TMHMM预测基因的跨膜结构域。

Tab 8-0-1 膜蛋白注释表
GeneIDLengthExpAAFirst60PredHelTopologyGene_nameCluster 0Cluster 1Cluster 2Cluster 3Cluster 4Cluster 5Cluster 6Cluster 7Cluster 8Cluster 9Cluster 10Cluster 11Cluster 12Cluster 13Cluster 14Cluster 15Cluster 16Cluster 17DescriptionKEGG_A_classKEGG_B_classPathwayK_IDGO ComponentGO FunctionGO Process
ENSG0000000000324590.5625.324i20-42o57-79i92-114o211-233iTSPAN60.002080367384513330.004187887400501050.0021220609455903600.0020029671080453400.33884726781575800.003309538798740910.06799004735348890.001230386864389840000000.120273260848648tetraspanin 6 [Source:HGNC Symbol;Acc:HGNC:11858]----GO:0005887//integral component of plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0070062//extracellular exosomeGO:0005515//protein bindingGO:0039532//negative regulation of viral-induced cytoplasmic pattern recognition receptor signaling pathway;GO:0043123//positive regulation of I-kappaB kinase/NF-kappaB signaling;GO:1901223//negative regulation of NIK/NF-kappaB signaling
ENSG0000000000531720.4220.381i31-50oTNMD0000000.0090541484296863600000000000tenomodulin [Source:HGNC Symbol;Acc:HGNC:17757]----GO:0005634//nucleus;GO:0005635//nuclear envelope;GO:0005737//cytoplasm;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0005515//protein bindingGO:0001886//endothelial cell morphogenesis;GO:0001937//negative regulation of endothelial cell proliferation;GO:0016525//negative regulation of angiogenesis;GO:0030948//negative regulation of vascular endothelial growth factor receptor signaling pathway;GO:0035990//tendon cell differentiation;GO:0071773//cellular response to BMP stimulus
ENSG000000004192950.020.000oDPM10.2741102717614130.3796265942995890.2620579828505310.2456976308620070.2943285866713240.3471118185724350.2683969936499140.2487569566069390.4474127359122930.3319344744952520.3334110407119850.3246826799096250.3899383352496620.1925017077224020.1971014856452340.4996051152296350.1293343162741370.281472583811492dolichyl-phosphate mannosyltransferase subunit 1, catalytic [Source:HGNC Symbol;Acc:HGNC:3005]Metabolism;MetabolismGlobal and overview maps;Glycan biosynthesis and metabolismko01100//Metabolic pathways;ko00510//N-Glycan biosynthesisK00721;K00721GO:0005634//nucleus;GO:0005783//endoplasmic reticulum;GO:0005789//endoplasmic reticulum membrane;GO:0016020//membrane;GO:0033185//dolichol-phosphate-mannose synthase complex;GO:0043231//intracellular membrane-bounded organelleGO:0004169//dolichyl-phosphate-mannose-protein mannosyltransferase activity;GO:0004582//dolichyl-phosphate beta-D-mannosyltransferase activity;GO:0005515//protein binding;GO:0005537//mannose binding;GO:0016740//transferase activity;GO:0016757//transferase activity, transferring glycosyl groups;GO:0043178//alcohol bindingGO:0006486//protein glycosylation;GO:0006506//GPI anchor biosynthetic process;GO:0018279//protein N-linked glycosylation via asparagine;GO:0019348//dolichol metabolic process;GO:0019673//GDP-mannose metabolic process;GO:0035268//protein mannosylation;GO:0035269//protein O-linked mannosylation;GO:0097502//mannosylation
ENSG000000004577420.630.030oSCYL30.1939493970223630.2118418155292680.1474687934603870.09183551730563560.07348314078982220.1908890556216270.09923413108914070.1405979132230820.1693278287477890.2185091024425170.07419232001343610.2019953806902270.15179817861490.1606924326408260.2091740200516570.107779806920490.2623638987275350.208938384070538SCY1 like pseudokinase 3 [Source:HGNC Symbol;Acc:HGNC:19285]----GO:0000139//Golgi membrane;GO:0005737//cytoplasm;GO:0005794//Golgi apparatus;GO:0030027//lamellipodium;GO:0042995//cell projectionGO:0004672//protein kinase activity;GO:0005515//protein binding;GO:0005524//ATP binding;GO:0042802//identical protein bindingGO:0006468//protein phosphorylation;GO:0006954//inflammatory response;GO:0016477//cell migration;GO:0021522//spinal cord motor neuron differentiation;GO:0034613//cellular protein localization;GO:0048666//neuron development
ENSG000000004608530.170.000oC1orf1120.03590235674766430.06083663636701550.05586915134340080.0549100778454240.02895799371895820.03991263974078480.04461553641954680.02835620812283270.3246391714635010.09503084725942590.07848113325348470.03579477339369190.07148218306587080.007896523950157140.03546926734117710.04664424990382070.3302783568297880chromosome 1 open reading frame 112 [Source:HGNC Symbol;Acc:HGNC:25565]-----GO:0005515//protein binding-
ENSG000000009385290.120.000oFGR0.008490889366176910.02759384086086490.2873010070284360.2616840571846610.7823867395767120.1111864019429690.01058111522800510.5130011240059480.0211978966020450.1074457510572010.22598895773940.008006905155005680.2676205384514970.1887941114755470.1031940214995260.01102031595245840.3947505897722190.221141088014153FGR proto-oncogene, Src family tyrosine kinase [Source:HGNC Symbol;Acc:HGNC:3697]Organismal SystemsImmune systemko04062//Chemokine signaling pathwayK08891GO:0005576//extracellular region;GO:0005737//cytoplasm;GO:0005739//mitochondrion;GO:0005743//mitochondrial inner membrane;GO:0005758//mitochondrial intermembrane space;GO:0005829//cytosol;GO:0005856//cytoskeleton;GO:0005886//plasma membrane;GO:0015629//actin cytoskeleton;GO:0016020//membrane;GO:0016235//aggresome;GO:0031234//extrinsic component of cytoplasmic side of plasma membrane;GO:0032587//ruffle membrane;GO:0034774//secretory granule lumen;GO:0042995//cell projection;GO:0070062//extracellular exosomeGO:0000166//nucleotide binding;GO:0001784//phosphotyrosine residue binding;GO:0004672//protein kinase activity;GO:0004713//protein tyrosine kinase activity;GO:0004714//transmembrane receptor protein tyrosine kinase activity;GO:0004715//non-membrane spanning protein tyrosine kinase activity;GO:0005102//signaling receptor binding;GO:0005515//protein binding;GO:0005524//ATP binding;GO:0016301//kinase activity;GO:0016740//transferase activity;GO:0019901//protein kinase binding;GO:0034987//immunoglobulin receptor binding;GO:0034988//Fc-gamma receptor I complex bindingGO:0001819//positive regulation of cytokine production;GO:0002376//immune system process;GO:0002768//immune response-regulating cell surface receptor signaling pathway;GO:0002862//negative regulation of inflammatory response to antigenic stimulus;GO:0006468//protein phosphorylation;GO:0007169//transmembrane receptor protein tyrosine kinase signaling pathway;GO:0007229//integrin-mediated signaling pathway;GO:0008360//regulation of cell shape;GO:0009615//response to virus;GO:0014068//positive regulation of phosphatidylinositol 3-kinase signaling;GO:0016310//phosphorylation;GO:0018108//peptidyl-tyrosine phosphorylation;GO:0030154//cell differentiation;GO:0030282//bone mineralization;GO:0030335//positive regulation of cell migration;GO:0032815//negative regulation of natural killer cell activation;GO:0038096//Fc-gamma receptor signaling pathway involved in phagocytosis;GO:0043306//positive regulation of mast cell degranulation;GO:0043312//neutrophil degranulation;GO:0043552//positive regulation of phosphatidylinositol 3-kinase activity;GO:0045087//innate immune response;GO:0045088//regulation of innate immune response;GO:0045859//regulation of protein kinase activity;GO:0046777//protein autophosphorylation;GO:0048705//skeletal system morphogenesis;GO:0050764//regulation of phagocytosis;GO:0050830//defense response to Gram-positive bacterium
ENSG0000000097112310.010.010oCFH0.3301856077471520.003326584883571030.08306715065143380.1377927477475070.0237949130773800.2007177975803910.1016983949539120.006063845715519970.2248050161675610.032181464463446100.3125531905249620.3014169890974670000complement factor H [Source:HGNC Symbol;Acc:HGNC:4883]Human Diseases;Organismal SystemsInfectious diseases;Immune systemko05150//Staphylococcus aureus infection;ko04610//Complement and coagulation cascadesK04004;K04004GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0070062//extracellular exosome;GO:0072562//blood microparticleGO:0005515//protein binding;GO:0008201//heparin binding;GO:0042802//identical protein binding;GO:0043395//heparan sulfate proteoglycan bindingGO:0002376//immune system process;GO:0006956//complement activation;GO:0006957//complement activation, alternative pathway;GO:0016032//viral process;GO:0030449//regulation of complement activation;GO:0045087//innate immune response;GO:1903659//regulation of complement-dependent cytotoxicity
ENSG000000010364679.475.190oFUCA20.19963605398950.1603496859806950.1762618008939121.06534591659250.8795152392078750.1020434570224671.583152831205330.2017711592789430.1629844830917090.9066560334912891.128354664808030.556743090721280.7490244827985920.2281229747220180.1162501605241420.2194481172547910.5230648014054130.212395395267831alpha-L-fucosidase 2 [Source:HGNC Symbol;Acc:HGNC:4008]Cellular Processes;MetabolismTransport and catabolism;Glycan biosynthesis and metabolismko04142//Lysosome;ko00511//Other glycan degradationK01206;K01206GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005764//lysosome;GO:0005788//endoplasmic reticulum lumen;GO:0035578//azurophil granule lumen;GO:0070062//extracellular exosomeGO:0004560//alpha-L-fucosidase activity;GO:0005515//protein binding;GO:0016787//hydrolase activity;GO:0016798//hydrolase activity, acting on glycosyl bondsGO:0005975//carbohydrate metabolic process;GO:0006004//fucose metabolic process;GO:0008152//metabolic process;GO:0009617//response to bacterium;GO:0016139//glycoside catabolic process;GO:0043312//neutrophil degranulation;GO:0043687//post-translational protein modification;GO:0044267//cellular protein metabolic process;GO:2000535//regulation of entry of bacterium into host cell
ENSG000000010846390.240.000oGCLC0.1547490251828310.1437838191474520.1679611172143340.5796729024101750.1627986327376190.1538469754193840.07081098663619960.2251756550663280.09514965421591410.369887326674040.306066624312050.1498695029704730.5369727789089180.1549780495431540.1186296300906040.08391689837796070.8894955750365451.09937363575016glutamate-cysteine ligase catalytic subunit [Source:HGNC Symbol;Acc:HGNC:4311]Metabolism;Metabolism;Metabolism;Cellular ProcessesGlobal and overview maps;Metabolism of other amino acids;Amino acid metabolism;Cell growth and deathko01100//Metabolic pathways;ko00480//Glutathione metabolism;ko00270//Cysteine and methionine metabolism;ko04216//FerroptosisK11204;K11204;K11204;K11204GO:0005739//mitochondrion;GO:0005829//cytosol;GO:0017109//glutamate-cysteine ligase complexGO:0000166//nucleotide binding;GO:0000287//magnesium ion binding;GO:0003824//catalytic activity;GO:0004357//glutamate-cysteine ligase activity;GO:0005515//protein binding;GO:0005524//ATP binding;GO:0016595//glutamate binding;GO:0016874//ligase activity;GO:0043531//ADP binding;GO:0044877//protein-containing complex bindingGO:0006534//cysteine metabolic process;GO:0006536//glutamate metabolic process;GO:0006749//glutathione metabolic process;GO:0006750//glutathione biosynthetic process;GO:0006979//response to oxidative stress;GO:0007568//aging;GO:0007584//response to nutrient;GO:0009408//response to heat;GO:0009410//response to xenobiotic stimulus;GO:0009725//response to hormone;GO:0014823//response to activity;GO:0019852//L-ascorbic acid metabolic process;GO:0031397//negative regulation of protein ubiquitination;GO:0032436//positive regulation of proteasomal ubiquitin-dependent protein catabolic process;GO:0032869//cellular response to insulin stimulus;GO:0035729//cellular response to hepatocyte growth factor stimulus;GO:0043066//negative regulation of apoptotic process;GO:0043524//negative regulation of neuron apoptotic process;GO:0044344//cellular response to fibroblast growth factor stimulus;GO:0044752//response to human chorionic gonadotropin;GO:0045454//cell redox homeostasis;GO:0045892//negative regulation of transcription, DNA-templated;GO:0046685//response to arsenic-containing substance;GO:0046686//response to cadmium ion;GO:0051409//response to nitrosative stress;GO:0051900//regulation of mitochondrial depolarization;GO:0070555//response to interleukin-1;GO:0071260//cellular response to mechanical stimulus;GO:0071333//cellular response to glucose stimulus;GO:0071372//cellular response to follicle-stimulating hormone stimulus;GO:0097069//cellular response to thyroxine stimulus;GO:0097746//regulation of blood vessel diameter;GO:1901029//negative regulation of mitochondrial outer membrane permeabilization involved in apoptotic signaling pathway;GO:2000490//negative regulation of hepatic stellate cell activation;GO:2001237//negative regulation of extrinsic apoptotic signaling pathway
ENSG000000011673470.470.000oNFYA0.1173595440534950.110388822683930.1417211965273480.09710555253478440.09335590211798370.09373968360032060.1156485891897510.1053311466219550.1664444068597440.2272246576493460.06794281743892940.04632219807922870.1270312080076910.1639318763133210.1205802010765820.04555049755748490.2700804839842270nuclear transcription factor Y subunit alpha [Source:HGNC Symbol;Acc:HGNC:7804]Human Diseases;Human Diseases;Organismal SystemsInfectious diseases;Neurodegenerative disease;Immune systemko05152//Tuberculosis;ko05017//Spinocerebellar ataxia;ko04612//Antigen processing and presentationK08064;K08064;K08064GO:0000785//chromatin;GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0016602//CCAAT-binding factor complex;GO:0032993//protein-DNA complex;GO:0090575//RNA polymerase II transcription factor complexGO:0000976//transcription regulatory region sequence-specific DNA binding;GO:0000978//RNA polymerase II proximal promoter sequence-specific DNA binding;GO:0000981//DNA-binding transcription factor activity, RNA polymerase II-specific;GO:0003677//DNA binding;GO:0003700//DNA-binding transcription factor activity;GO:0005515//protein bindingGO:0006355//regulation of transcription, DNA-templated;GO:0006357//regulation of transcription by RNA polymerase II;GO:0006366//transcription by RNA polymerase II;GO:0019216//regulation of lipid metabolic process;GO:0045893//positive regulation of transcription, DNA-templated;GO:0048511//rhythmic process
ENSG000000014603340.000.000oSTPG10.04626333412883950.1227123557529850.04779463318910910.01458218692245470.01983955498769340.08207025413671390.0446488087932910.03933081930864250.1109544924859220.1758796300078260.0479643658998380.01645001011723170.06780120686148210.06721305650716810.08336265461328890.1227576680497230.1081864702000371.12299963381962sperm tail PG-rich repeat containing 1 [Source:HGNC Symbol;Acc:HGNC:28070]----GO:0005634//nucleus;GO:0005737//cytoplasm;GO:0005739//mitochondrionGO:0003674//molecular_functionGO:0006915//apoptotic process;GO:0043065//positive regulation of apoptotic process;GO:1902110//positive regulation of mitochondrial membrane permeability involved in apoptotic process
ENSG00000001461406196.5920.369o34-53i75-97o102-121i134-156o171-193i202-224o239-261i274-296o301-320iNIPAL30.4479245656811040.6708946896920380.5532283432345440.0886589864285450.05582080451187260.6452559186744460.2074838179804250.4388737522565440.4421110658665060.1359372922500920.2298636662760650.3496783734895560.2820254513658850.4590673057186210.3759199440015990.4038695465394410.5168081831324620.503973177775445NIPA like domain containing 3 [Source:HGNC Symbol;Acc:HGNC:25233]----GO:0016020//membrane;GO:0016021//integral component of membraneGO:0005515//protein binding;GO:0015095//magnesium ion transmembrane transporter activityGO:0015693//magnesium ion transport;GO:1903830//magnesium ion transmembrane transport
ENSG000000014977340.010.010oLAS1L0.2204979402403640.3229841828063610.3152274965413310.1448413309618840.1872260662851590.2605185332785090.2300131153334580.220938638582610.4571194973568330.4364853673026790.2128535742631260.3927650998829730.1780062431533370.3083925339424230.2026648687964080.43074682285262500.17526333315807LAS1 like ribosome biogenesis factor [Source:HGNC Symbol;Acc:HGNC:25726]----GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005730//nucleolus;GO:0005737//cytoplasm;GO:0016020//membrane;GO:0030687//preribosome, large subunit precursor;GO:0071339//MLL1 complex;GO:0090730//Las1 complexGO:0003723//RNA binding;GO:0004519//endonuclease activity;GO:0005515//protein bindingGO:0000460//maturation of 5.8S rRNA;GO:0000470//maturation of LSU-rRNA;GO:0006364//rRNA processing;GO:0090305//nucleic acid phosphodiester bond hydrolysis
ENSG0000000156145321.650.021o406-428iENPP40.224763633862870.06626891917434260.2569305485526250.2442215198520280.09941361106420920.07381498963644040.1579723034237020.4263756070328520.04836005172595080.4280599479008290.08357346577315510.1052216870378350.2013365973134420.207355259669640.1339479588801610.044063168014338800ectonucleotide pyrophosphatase/phosphodiesterase 4 [Source:HGNC Symbol;Acc:HGNC:3359]Metabolism;MetabolismGlobal and overview maps;Nucleotide metabolismko01100//Metabolic pathways;ko00230//Purine metabolismK18424;K18424GO:0005886//plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0070062//extracellular exosome;GO:0101003//ficolin-1-rich granule membraneGO:0003824//catalytic activity;GO:0005515//protein binding;GO:0016787//hydrolase activity;GO:0046872//metal ion binding;GO:0047710//bis(5'-adenosyl)-triphosphatase activityGO:0007596//blood coagulation;GO:0007599//hemostasis;GO:0030194//positive regulation of blood coagulation;GO:0043312//neutrophil degranulation;GO:0046130//purine ribonucleoside catabolic process
ENSG000000016177850.530.520oSEMA3F0.0017323126547171700.001770172174026332.85576556220885e-040.0063861288427766500.00879826520630746000.09302729702230320.002913283408359080000000semaphorin 3F [Source:HGNC Symbol;Acc:HGNC:10728]Organismal SystemsDevelopmentko04360//Axon guidanceK06840GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005887//integral component of plasma membrane;GO:0098978//glutamatergic synapseGO:0005515//protein binding;GO:0030215//semaphorin receptor binding;GO:0045499//chemorepellent activityGO:0001755//neural crest cell migration;GO:0007411//axon guidance;GO:0021612//facial nerve structural organization;GO:0021637//trigeminal nerve structural organization;GO:0021675//nerve development;GO:0021785//branchiomotor neuron axon guidance;GO:0030335//positive regulation of cell migration;GO:0036486//ventral trunk neural crest cell migration;GO:0048843//negative regulation of axon extension involved in axon guidance;GO:0048846//axon extension involved in axon guidance;GO:0050919//negative chemotaxis;GO:0061549//sympathetic ganglion development;GO:0071526//semaphorin-plexin signaling pathway;GO:0097490//sympathetic neuron projection extension;GO:0097491//sympathetic neuron projection guidance;GO:0099175//regulation of postsynapse organization;GO:1901166//neural crest cell migration involved in autonomic nervous system development;GO:1902285//semaphorin-plexin signaling pathway involved in neuron projection guidance;GO:1902287//semaphorin-plexin signaling pathway involved in axon guidance
ENSG000000016261480241.520.0011i84-106o121-143i196-215o219-241i304-326o859-881i901-923o986-1008i1015-1034o1098-1120i1129-1148oCFTR0000000.0012111194767563600000000000CF transmembrane conductance regulator [Source:HGNC Symbol;Acc:HGNC:1884]Environmental Information Processing;Cellular Processes;Environmental Information Processing;Organismal Systems;Organismal Systems;Organismal Systems;Human Diseases;Environmental Information ProcessingSignal transduction;Cellular community - eukaryotes;Signal transduction;Digestive system;Digestive system;Digestive system;Infectious diseases;Membrane transportko04024//cAMP signaling pathway;ko04530//Tight junction;ko04152//AMPK signaling pathway;ko04972//Pancreatic secretion;ko04971//Gastric acid secretion;ko04976//Bile secretion;ko05110//Vibrio cholerae infection;ko02010//ABC transportersK05031;K05031;K05031;K05031;K05031;K05031;K05031;K05031GO:0005634//nucleus;GO:0005737//cytoplasm;GO:0005765//lysosomal membrane;GO:0005768//endosome;GO:0005769//early endosome;GO:0005783//endoplasmic reticulum;GO:0005789//endoplasmic reticulum membrane;GO:0005829//cytosol;GO:0005886//plasma membrane;GO:0005887//integral component of plasma membrane;GO:0009986//cell surface;GO:0010008//endosome membrane;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0016324//apical plasma membrane;GO:0030660//Golgi-associated vesicle membrane;GO:0030669//clathrin-coated endocytic vesicle membrane;GO:0031901//early endosome membrane;GO:0032991//protein-containing complex;GO:0034707//chloride channel complex;GO:0055037//recycling endosome;GO:0055038//recycling endosome membraneGO:0000166//nucleotide binding;GO:0005254//chloride channel activity;GO:0005260//intracellularly ATP-gated chloride channel activity;GO:0005515//protein binding;GO:0005524//ATP binding;GO:0015106//bicarbonate transmembrane transporter activity;GO:0015108//chloride transmembrane transporter activity;GO:0016853//isomerase activity;GO:0016887//ATPase activity;GO:0019869//chloride channel inhibitor activity;GO:0019899//enzyme binding;GO:0030165//PDZ domain binding;GO:0042626//ATPase activity, coupled to transmembrane movement of substances;GO:0051087//chaperone binding;GO:0106138//Sec61 translocon complex bindingGO:0006695//cholesterol biosynthetic process;GO:0006811//ion transport;GO:0006821//chloride transport;GO:0006904//vesicle docking involved in exocytosis;GO:0015698//inorganic anion transport;GO:0015701//bicarbonate transport;GO:0016579//protein deubiquitination;GO:0030301//cholesterol transport;GO:0034220//ion transmembrane transport;GO:0034976//response to endoplasmic reticulum stress;GO:0035377//transepithelial water transport;GO:0035774//positive regulation of insulin secretion involved in cellular response to glucose stimulus;GO:0035973//aggrephagy;GO:0045921//positive regulation of exocytosis;GO:0048240//sperm capacitation;GO:0050891//multicellular organismal water homeostasis;GO:0051454//intracellular pH elevation;GO:0055085//transmembrane transport;GO:0060081//membrane hyperpolarization;GO:0061024//membrane organization;GO:0071320//cellular response to cAMP;GO:1902161//positive regulation of cyclic nucleotide-gated ion channel activity;GO:1902476//chloride transmembrane transport;GO:1902943//positive regulation of voltage-gated chloride channel activity;GO:1904322//cellular response to forskolin
ENSG0000000162910890.020.000oANKIB10.3432658114782820.5476450279690730.5486436339735070.3414690241805920.2971449359052080.5025706270975080.481685388530480.4166669772348650.4359413181686260.6818771252609610.2284189123011110.4818931477196020.2057103363667540.6343973350062830.1658120988094730.5296422545422930.1580752754461670.4721110932069ankyrin repeat and IBR domain containing 1 [Source:HGNC Symbol;Acc:HGNC:22215]----GO:0000151//ubiquitin ligase complex;GO:0005737//cytoplasmGO:0004842//ubiquitin-protein transferase activity;GO:0005515//protein binding;GO:0016740//transferase activity;GO:0031624//ubiquitin conjugating enzyme binding;GO:0046872//metal ion binding;GO:0061630//ubiquitin protein ligase activityGO:0000209//protein polyubiquitination;GO:0006511//ubiquitin-dependent protein catabolic process;GO:0016567//protein ubiquitination;GO:0032436//positive regulation of proteasomal ubiquitin-dependent protein catabolic process
ENSG0000000163050940.0733.712o4-21i28-50oCYP51A10.01142215299030860.0352612887440840.01738150456707590.05071150751987710.0551111738521820.02224154961001960.1189278865804840.0389272780874850.07551694027784590.14828574014330.04842808395815750.007398052980416610.2286110361525190.006960976764259560.09477236341309640.12679234837798100.208938384070538cytochrome P450 family 51 subfamily A member 1 [Source:HGNC Symbol;Acc:HGNC:2649]Metabolism;MetabolismGlobal and overview maps;Lipid metabolismko01100//Metabolic pathways;ko00100//Steroid biosynthesisK05917;K05917GO:0005783//endoplasmic reticulum;GO:0005789//endoplasmic reticulum membrane;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0043231//intracellular membrane-bounded organelleGO:0004497//monooxygenase activity;GO:0005506//iron ion binding;GO:0008398//sterol 14-demethylase activity;GO:0016491//oxidoreductase activity;GO:0016705//oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen;GO:0020037//heme binding;GO:0046872//metal ion bindingGO:0006629//lipid metabolic process;GO:0006694//steroid biosynthetic process;GO:0006695//cholesterol biosynthetic process;GO:0007399//nervous system development;GO:0008202//steroid metabolic process;GO:0008203//cholesterol metabolic process;GO:0016125//sterol metabolic process;GO:0016126//sterol biosynthetic process;GO:0019216//regulation of lipid metabolic process;GO:0033488//cholesterol biosynthetic process via 24,25-dihydrolanosterol;GO:0042177//negative regulation of protein catabolic process;GO:0050709//negative regulation of protein secretion;GO:0055114//oxidation-reduction process;GO:0070988//demethylation;GO:1900222//negative regulation of amyloid-beta clearance
ENSG000000016317360.120.000oKRIT10.2139284811145480.2623287841824780.2661915031384460.1020193447753830.0989697349801810.2288622934868020.1642010165412060.2300037079619670.1814944890173140.2217438300567010.0828076602610420.1755681210131730.1333573126516010.2629819203991440.2946810650853870.107892705574420.6123578045448750.208938384070538KRIT1 ankyrin repeat containing [Source:HGNC Symbol;Acc:HGNC:1573]Environmental Information ProcessingSignal transductionko04015//Rap1 signaling pathwayK17705GO:0005615//extracellular space;GO:0005737//cytoplasm;GO:0005856//cytoskeleton;GO:0005886//plasma membrane;GO:0005911//cell-cell junction;GO:0016020//membrane;GO:0030054//cell junctionGO:0005515//protein binding;GO:0005546//phosphatidylinositol-4,5-bisphosphate binding;GO:0008017//microtubule binding;GO:0030695//GTPase regulator activityGO:0001525//angiogenesis;GO:0001937//negative regulation of endothelial cell proliferation;GO:0007264//small GTPase mediated signal transduction;GO:0010596//negative regulation of endothelial cell migration;GO:0016525//negative regulation of angiogenesis;GO:0045454//cell redox homeostasis;GO:0050790//regulation of catalytic activity;GO:2000114//regulation of establishment of cell polarity;GO:2000352//negative regulation of endothelial cell apoptotic process
ENSG000000020164180.000.000oRAD520.03978359649140880.037170038650960.03093144822222480.01764574486560850.01516027954775820.03775472041123020.03224199757161380.02697288350614340.03774990898547860.05388507126941020.025200349286116200.06222000995520160.02721168047621360.01099704573837050.03235010578484590.1834323868222170RAD52 homolog, DNA repair protein [Source:HGNC Symbol;Acc:HGNC:9824]Genetic Information ProcessingReplication and repairko03440//Homologous recombinationK10873GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0032991//protein-containing complex;GO:0032993//protein-DNA complexGO:0003677//DNA binding;GO:0003697//single-stranded DNA binding;GO:0005515//protein binding;GO:0042802//identical protein bindingGO:0000724//double-strand break repair via homologous recombination;GO:0000730//DNA recombinase assembly;GO:0006281//DNA repair;GO:0006302//double-strand break repair;GO:0006310//DNA recombination;GO:0006312//mitotic recombination;GO:0006974//cellular response to DNA damage stimulus;GO:0010792//DNA double-strand break processing involved in repair via single-strand annealing;GO:0034599//cellular response to oxidative stress;GO:0045002//double-strand break repair via single-strand annealing;GO:2000819//regulation of nucleotide-excision repair

9 细胞周期分析

9.1 细胞周期评估

常规样本中,仅干细胞、祖细胞、癌细胞等具备增殖分化潜力的细胞才具有细胞分裂的能力,即处于细胞周期中。我们通过周期蛋白在细胞中的表达情况对细胞所处的细胞周期进行评分,以此直观体现各个样本中的细胞分裂/增殖活性。

Fig 9-1-1 不同样本的细胞周期指数评估

9.2 细胞周期推断

每一个细胞周期都有特征表达的周期蛋白,通过每个细胞周期的特征周期蛋白[21],我们可以对细胞的各个细胞周期进行评分,通过评分,我们可以推测细胞所处的细胞周期:

Tab 9-2-1 各细胞的细胞周期表
CellsSamplesClustersG1SG2MCellCycle.ScorePhase
AdjacNormal_AAACCCAAGAAGATCTAdjacNormal10.0116144488108683-0.0424193077599221-0.0660583741165098-0.02636083563465510.0116144488108683G1
AdjacNormal_AAACCCAAGACTCGAGAdjacNormal5-0.04344398083700120.0481355966792152-0.02732990692905470.01876917595325170.0481355966792152S
AdjacNormal_AAACCCACACTGGATTAdjacNormal5-0.0404035085419784-0.08509224680737930.03052284850598170.002524627446861520.0305228485059817G2
AdjacNormal_AAACCCACAGAGGCTAAdjacNormal20.07160728042201740.0359509411103350.01283455096239750.03596519488190740.0716072804220174G1
AdjacNormal_AAACCCAGTTTGTTCTAdjacNormal14-0.100669333403082-0.0797571949551742-0.0218257307389038-0.0318318186320522-0.0218257307389038non-cycling
AdjacNormal_AAACCCATCGGAAGGTAdjacNormal6-0.184635854661247-0.216029363283324-0.10688898571373-0.0777946857335721-0.0777946857335721non-cycling
AdjacNormal_AAACGAAAGAGGATCCAdjacNormal3-0.0417624993915903-0.0333476092557671-0.0725326456513034-0.100543559462892-0.0333476092557671non-cycling
AdjacNormal_AAACGAAAGCACTCCGAdjacNormal00.04974522425650370.0288078982973114-0.0564401978253098-0.03607054874128170.0497452242565037G1
AdjacNormal_AAACGAACAGTAGATAAdjacNormal1-0.08715076638848060.0299758126369763-0.0599967580361364-0.06588876791153040.0299758126369763S
AdjacNormal_AAACGAAGTAATGCGGAdjacNormal4-0.11012409998223-0.06354411533088850.0459487373612010.04927543532779240.0492754353277924M
AdjacNormal_AAACGAAGTATCCTTTAdjacNormal0-0.1921764738812010.019461554177533-0.116390747271062-0.1162417659843390.019461554177533S
AdjacNormal_AAACGAAGTGGCCACTAdjacNormal4-0.153908798913884-0.0971036266040841-0.1043081548039360.001046032792229870.00104603279222987M
AdjacNormal_AAACGAAGTTGGCTATAdjacNormal0-0.01723224260958880.01491500853415380.0293337504152403-0.1001706037542870.0293337504152403G2
AdjacNormal_AAACGAAGTTTAGTCGAdjacNormal5-0.0567825855829962-0.0753566120962961-0.0833774827833383-0.123560252018268-0.0567825855829962non-cycling
AdjacNormal_AAACGAATCGCATAGTAdjacNormal0-0.0916990623077417-0.08279377195455970.00231204752637706-0.07942408954515960.00231204752637706G2
AdjacNormal_AAACGCTAGAATCTAGAdjacNormal7-0.109148004867408-0.0803670224961999-0.00986313780847131-0.163963734957107-0.00986313780847131non-cycling
AdjacNormal_AAACGCTAGAGATCGCAdjacNormal6-0.114586948428964-0.204654822111741-0.0441475728652551-0.138470617729536-0.0441475728652551non-cycling
AdjacNormal_AAACGCTAGATGCGACAdjacNormal2-0.133572387513075-0.0413708063962026-0.0546673545839012-0.0684203654998439-0.0413708063962026non-cycling
AdjacNormal_AAACGCTAGGAACATTAdjacNormal0-0.0622745380266054-0.0561355663282031-0.0170165641130481-0.0205165586392129-0.0170165641130481non-cycling
AdjacNormal_AAACGCTAGGTTACCTAdjacNormal5-0.08697903778178160.148230255908973-0.0175133530884733-0.08719338833878120.148230255908973S

Fig 9-2-1 不同样本的不同细胞周期细胞的数量堆叠图 Fig 9-2-2 不同样本的不同细胞周期细胞的数量比例堆叠图

Fig 9-2-3 不同细胞周期细胞的tSNE分布

Fig 9-2-4 Seurat分群与细胞周期推断对应circos图

9.3 周期蛋白基因分布可视化

细胞周期推断是基于每个细胞周期的特征周期蛋白基因在细胞中的表达量,利用软件预测的结果。那么我们就可以通过观察细胞周期特征基因的表达分布,对预测结果进行初步验证,这个过程即为周期蛋白基因分布可视化。

周期蛋白基因:CellCycle.genes.xls

Tab 9-3-1 周期蛋白基因(前20行)
G1SG2M
CCNE1ABCC5ARL4AAHI1
CCNE2ASF1BAURKBAKIRIN2
CDC25AATAD2BRD8ANLN
CDCA7BRCA1BUB3ANP32E
DTLCDKN2AIPCASP3ARL6IP1
INTS8CENPQCCDC107ASXL1
IVNS1ABPCREBZFCCNA2AURKA
MCM2DONSONCCNFBIRC2
MCM6DSCC1CDC25CBIRC5
NASPE2F8CDCA2BUB1
PLCXD1EXO1CDCA3CCNB2
SKP2EZH2CDCA8CDC20
SLBPFEN1CDK1CDC25B
UNGHELLSCDKN1BCDC27
ZNF367MASTLCDKN2CCENPA
ZRANB2PKMYT1CENPLCENPE
-RBBP8CKAP2CENPF
-RFC2CKAP2LCEP55
-RRM2DCAF7CIT
-USP1ESPL1CKAP5

热图可以同时展示大量基因在每个细胞中的表达量及其在细胞群体中的分布情况。使用热图展示周期蛋白基因在不同细胞周期的表达量,可以看到各时期特征蛋白基因相对集中表达在对应的时期中,是对细胞周期推断结果准确性的验证。

Fig 9-3-1 周期蛋白基因在不同细胞周期的表达量热图

广州基迪奥生物科技有限公司




10 个性化分析推荐

10.1 细胞亚群鉴定

细胞亚群鉴定是进行单细胞转录组分析的最基础一步,是赋予细胞数据以生物学意义的关键过程。细胞亚群鉴定主要借助marker基因在各个细胞亚群的表达情况来判断细胞亚群所属细胞类型。按照marker基因的查询方式,我们通常会遇到三种情况:

(1)人、小鼠常见组织的细胞类型注释。如今已经建立了许多数据库用以收集marker基因信息,例如Cell Marker(人、小鼠,http://bio-bigdata.hrbmu.edu.cn/CellMarker/),panglaoDB(人、小鼠,https://panglaodb.se/),MCA(小鼠,http://bis.zju.edu.cn/MCA/)。根据数据库内容,我们可以快速锁定组织类型和细胞类型,并获取相关的marker基因。

(2)人、小鼠罕见组织、稀有细胞类型及其他模式物种的细胞类型注释。这一类细胞通常没有成型的数据库可以快速查询。此时,我们需要从已有的单细胞文章、细胞生物学文章和分子生物学文章找寻相关细胞类型的marker基因。在找寻marker基因时,优先考虑荧光定量PCR(qPCR)和RNA荧光原位杂交(FISH)的结果,因为这些技术直接体现mRNA的表达水平,更容易在scRNA-seq数据中找到表达量分布情况;次级考虑蛋白免疫印迹(western blot)、流式细胞术(FAC)和免疫荧光(IF)的结果,因为蛋白水平和mRNA水平并不是完全同步的,可能出现蛋白高丰度但是mRNA低丰度的情况,使得scRNA-seq不具有相应marker基因的表达分布。

(3)人和模式物种的新细胞类型及非模式物种的细胞类型注释。这一类细胞通常缺乏前人的研究基础,无可直接利用的marker基因。为了完成细胞亚群注释,我们可以使用同源比对的方式将基因比对到具有marker基因信息的近缘物种上,然后使用同源比对得到的marker基因用于注释细胞亚群。当然,细胞类型最终是与细胞功能相关的,我们也可以通过细胞亚群上调基因的功能注释或富集的功能通路来确定细胞功能,并结合生物学背景推测细胞亚群所属的细胞类型。

10.2 拟时分析

细胞分化相关分析一直是研究人员广泛关注的问题,其与胚胎发育、组织修复、疾病发生等多个研究领域紧密相关。单细胞转录组的特征是获得了大量细胞的转录本“快照”,记录了样本中所有细胞的转录本,其中包含了多功能性较强的干细胞、过渡阶段的中间细胞和发育成熟的功能细胞,这为细胞分化相关分析提供了基础。

拟时分析通过分析关键基因的表达模式,将所有细胞按照发育时间的先后排布在拟时间轴上,模拟发育过程中的细胞分化过程。通过对细胞轨迹的分析,我们可以挖掘出细胞分化过程中经历的细胞类型变化、伴随发育过程变化的动态变化基因、祖细胞不同的分化命运等与生命发育息息相关的信息。

10.3 细胞周期分析

胚胎干细胞、肿瘤细胞、生殖细胞、植物根尖、芽尖等细胞除了具备多功能性的分化能力以外,还需具备自我增殖能力。而在增殖过程中,细胞进程必然涉及细胞周期。

细胞周期分析必然在细胞鉴定之后,对具有增殖能力的细胞进行细胞周期分析才具有切实的生物学意义。通过早期的细胞周期研究,我们已知了若干与细胞周期各个时期相关的周期蛋白基因,通过这些基因的表达情况,我们可以进一步推测细胞所处细胞周期。

对细胞周期的分析,既可以深入探索与细胞周期进行有关的新的标记基因,也可以侧面反映样本的细胞更新活性,对样本的表型做出关联解释。

细胞周期分析目前只能分析人和小鼠,其他物种可以同源比对,但存在一定误差。

10.4 WGCNA

单细胞转录组数据的特点是数据庞大,同一个细胞通常带有细胞类型、样本属性、表型特征等多级注释信息,导致单细胞转录组分析时可以进行比较方式多样而繁复。极高的复杂程度需求有效的简化方式。

权重基因共表达网络分析(weighted gene co-expression network analysis, WGCNA)可以将大量的基因简化成少量的具有相同表达模式的模块,并进一步找到与表型相关性最高的基因模块。这一分析极大得简化了数据挖掘过程,有利于从具有复杂样本设计和细胞组成的样本中快速锁定核心基因。

10.5 转录因子分析

转录因子是重要的基因调控元件,在外界刺激中,表达量一般优先发生变化,并进一步调控下游基因的表达完成对刺激的响应。所以,转录因子与靶基因之间具备潜在的共表达关系。

借助这一个基础,我们可以使用软件SCENIC将转录因子和靶基因构建为一个网络单位,通过对网络单位的表达活性的分析来研究不同细胞类型之间的转录调控差异。这种差异同时体现在转录因子的表达情况和转录因子的功能特性(靶基因的表达情况)上,对细胞的表型变化会有更加全面的解释度。

目前转录因子分析只能做人、小鼠和果蝇,其他物种的分析流程待开发。

10.6 细胞通讯分析

多细胞生命体的正常运转离不开多种细胞类型之间的有序合作,生命体的表型变化也不应该是由单一细胞类型的功能决定的。所以,为了获得更具有解释力的分子机制,我们往往需要从细胞间的分子信号交互去解释表型变化。

细胞通讯分析从细胞的配体-受体表达情况去推测细胞类型之间的互作关系。这些互作关系体现了下游细胞的激活、细胞信号转导和靶向细胞的杀伤,在宿主免疫、肿瘤微环境等领域都有广泛的应用前景。

目前细胞通讯分析理论上只能做人的,小鼠基因可以比对到人数据库进行参考分析,其他物种需要提供配受体信息数据库才能进行。

广州基迪奥生物科技有限公司




11 目录结构

结果文件夹
├── 1.Expression                                            定量结果文件夹
│   ├── barcode_plot                                            有效细胞鉴定图文件夹
│   │   └── *.barcode_plot.{pdf,png}                                有效细胞鉴定图
│   ├── CellRanger_Report                                       Cell Ranger报告文件夹
│   │   └── CellRanger.*.result.html                                Cell Ranger count结果报告
│   ├── expressions                                             表达量结果文件夹
│   │   ├── *                                                       各样本结果文件夹
│   │   │   ├── expression.xls                                          表达量矩阵表
│   │   │   ├── barcodes.tsv                                            细胞barcode ID表
│   │   │   ├── genes.tsv                                               基因ID与名称表
│   │   │   └── matrix.mtx                                              表达量稀疏矩阵
│   │   └── *.expression.demo.xls                                   表达量矩阵示例表
│   ├── samples.align.stat.xls                                  各样本比对结果统计表
│   └── samples.sequence.stat.xls                               各样本测序数据统计表
├── 2.Cluster                                               聚类结果文件夹
│   ├── 1.QC                                                    质控结果文件夹
│   │   ├── AfterFilter.BasicInfo.merge.{pdf,png}                   过滤后各个样本细胞基本信息的分布图
│   │   ├── AfterFilter.BasicInfo.nUMI-nGene.{pdf,png}              过滤后各个样本细胞基本信息的分布散点图
│   │   ├── AfterFilter.BasicInfo.nUMI-pMito.{pdf,png}              过滤后各个样本细胞基本信息的分布散点图
│   │   ├── AfterFilter.BasicInfo.PresetMarker.{pdf,png}            过滤后各个样本细胞中预设标记基因的表达量分布
│   │   ├── BasicInfo.merge.{pdf,png}                               过滤前各个样本细胞基本信息的分布图
│   │   ├── BasicInfo.nUMI-nGene.{pdf,png}                          过滤前各个样本细胞基本信息的分布散点图
│   │   ├── BasicInfo.nUMI-pMito.{pdf,png}                          过滤前各个样本细胞基本信息的分布散点图
│   │   ├── BasicInfo.PresetMarker.{pdf,png}                        过滤前各个样本细胞中预设标记基因的表达量分布
│   │   └── Filter.stat.xls                                         过滤前后各个样本中细胞数据量统计表
│   ├── 2.cluster                                               分群结果文件夹
│   │   ├── AllGene.avg_exp.annot.xls                               基因在各个亚群中表达量的均值表
│   │   ├── Cells.cluster.list.xls                                  细胞与亚群对照表
│   │   ├── tSNE_*.{pdf,png}                                        各样本单细胞亚群分类tSNE图
│   │   └── tSNE.{pdf,png}                                          单细胞亚群分类tSNE图
│   ├── 3.cluster_stat                                          分群结果的统计结果文件夹
│   │   ├── Cluster.stat.inSamples.pct.{pdf,png}                    各亚群中各个样本细胞数量百分比堆叠图
│   │   ├── Cluster.stat.inSamples.{pdf,png}                        各亚群中各个样本细胞数量堆叠图
│   │   ├── Cluster.stat.bySamples.pct.{pdf,png}                    各样本中各亚群细胞数量百分比堆叠图
│   │   ├── Cluster.stat.bySamples.{pdf,png}                        各样本中各亚群细胞数量堆叠图
│   │   ├── Cluster.stat.Sample.xls                                 各样本中各亚群细胞数量统计表
│   │   ├── Cluster.stat.xls                                        细胞亚群分类结果统计表
│   │   ├── Cluster.cor.heatmap.{pdf,png}                           各亚群相关性系数热图
│   │   ├── PresetMarker.Distribution.{pdf,png}                     已知标记基因在各个细胞亚群中的表达分布
│   │   ├── PresetMarker.DotPlot.{pdf,png}                          已知标记基因在各个细胞亚群中的表达分布气泡图
│   │   ├── PresetMarker.Heatmap.{pdf,png}                          已知标记基因在各个亚群的表达量热图
│   │   └── PresetMarker.VlnPlot.{pdf,png}                          已知标记基因在各个细胞亚群中的表达分布小提琴图
│   └── 4.CellAnnotation                                        单细胞亚群鉴定结果文件夹
│       ├── Cell.annotation.stat.xls                                各样本在各个细胞类型中细胞数量统计表
│       ├── Cells.annotation.circos.{pdf,png}                       Seurat分群与singleR细胞鉴定对应circos图
│       ├── Cells.annotation.{pdf,png}                              各细胞类型在tSNE图的分布
│       ├── Cluster.correlation.heatmap.{pdf,png}                   Seurat分群与singleR鉴定细胞类型相关性热图
│       ├── Cluster.sample.singleR.stat.pct.{pdf,png}               各细胞类型中各样本细胞数量百分比堆叠图
│       ├── Cluster.sample.singleR.stat.{pdf,png}                   各细胞类型中各样本细胞数量堆叠图
│       ├── Cluster.stat.sample.singleR.xls                         各样本在各个细胞类型中细胞数量统计表
│       ├── Sample.cluster.singleR.stat.pct.{pdf,png}               各样本中各细胞类型数量百分比堆叠图
│       └── Sample.cluster.singleR.stat.{pdf,png}                   各样本中各细胞类型数量堆叠图
├── 3.MarkerGene                                            亚群上调表达基因分析结果文件夹
│   ├── DeGene.list.xls                                         各亚群差异基因注释表
│   ├── DeGene.stat.{pdf,png}                                   各亚群上调基因数量统计柱状图
│   ├── DeGene.stat.xls                                         各亚群上调基因数量统计表
│   ├── Enrichment                                              上调表达基因富集分析结果文件夹
│   │   ├── GO                                                      GO功能富集分析结果文件夹
│   │   ├── KO                                                      KO功能富集分析结果文件夹
│   │   ├── DO                                                      DO功能富集分析结果文件夹
│   │   └── Reactome                                                Reactome功能富集分析结果文件夹
│   ├── Plots                                                   上调基因表达分布结果文件夹
│   │   ├── Top.DotPlot.{pdf,png}                                   标记基因表达分布气泡图
│   │   ├── Top.Heatmap.{pdf,png}                                   标记基因表达热图
│   │   ├── ExpPlot                                                 标记基因表达分布图文件夹
│   │   ├── DensityPlot                                             标记基因表达分布密度图文件夹
│   │   └── ViolinPlot                                              标记基因表达分布小提琴图文件夹
│   └── String                                                  蛋白质互作网络分析结果文件夹
│       ├── Top.aln.links.xls                                       标记基因与String蛋白对应表及析构关系表
│       ├── Top.edge.tsv                                            Cytoscape绘图文件--连接信息文件
│       └── Top.node.tsv                                            Cytoscape绘图文件--节点信息文件
├── 4.GSVA                                                  GSVA分析
│   ├── *.gsva.xls                                              各细胞富集程度表
│   └── *.heatmap.cluster.{xls,pdf,png}                         各个亚群中的富集分数热图
├── 5.TF                                                    转录因子注释结果文件夹
│   └── TF.annot.xls                                            转录因子注释表
├── 6.CellCycle                                             细胞周期分析
│   ├── CellCycle.annot.xls                                     各细胞的细胞周期表
│   ├── CellCycle.boxplot.{pdf,png}                             不同样本的细胞周期指数评估盒形图
│   ├── CellCycle.DotPlot.{pdf,png}                             周期蛋白基因的分布气泡图
│   ├── CellCycle.Heatmap.{pdf,png}                             周期蛋白基因在不同细胞周期的表达量热图
│   ├── Phase.stat.bySamples.pct.{pdf,png}                      不同样本的不同细胞周期细胞的数量比例堆叠图
│   ├── Phase.stat.bySamples.{pdf,png}                          不同样本的不同细胞周期细胞的数量堆叠图
│   ├── CellCycle.Cluster.tSNE.{pdf,png}                        不同细胞周期细胞与分群的tSNE分布
│   └── CellCycle.Samples.tSNE.{pdf,png}                        不同细胞周期细胞与样本的tSNE分布
├── 7.RNAVelocity                                           RNA速率分析
│   ├── velocity.trajectory.tSNE.{pdf,png}                      RNA速率轨迹图
│   └── velocity.tSNE.{pdf,png}                                 RNA速率分布图
├── index.html                                              单细胞分析结果网页版报告
├── scRNA-seq_method.pdf                                    单细胞分析方法说明(英文版)
└── src                                                     网页版报告系统文件文件夹

广州基迪奥生物科技有限公司




12 参考文献

广州基迪奥生物科技有限公司




13 附录

13.1 分析方法英文文档

scRNA-seq分析方法文档(英文):scRNA-seq_method.pdf

13.2 结果文件查看

*.xls,*.txt :结果数据表格文件,文件以制表符(Tab)分隔。unix/Linux/Mac用户使用 less 或 more 命令查看;windows用户使用高级文本编辑器Notepad++ 等查看,也可以用Microsoft Excel打开。

*.png:结果图像文件,位图,无损压缩。

*.pdf:结果图像文件,矢量图,可以放大和缩小而不失真,方便用户查看和编辑处理,可使用Adobe Illustrator进行图片编辑,用于文章发表等。

13.3 文章引用与致谢

如果您的研究课题使用了基迪奥的测序和分析服务,我们期望您在论文发表时,在Method部分或Acknowledgements部分引用或提及基迪奥公司。以下语句可供参考:

广州基迪奥生物科技有限公司




帮助文档