收起 展开

1 项目概述

项目编号GHR180993_sstd_42
项目内容Mus_musculus --- 2 scRNA-seq
参考基因组mm10
样品信息Control¦DDC

2 项目流程

2.1 实验流程
Fig 2-1-1 10x单细胞实验流程图

  1. 细胞质检
    取少量单细胞悬液,加入等体积0.4%台盼蓝染液,用 Countess® II Automated Cell Counter 对细胞计数,将活细胞浓度调整到理想浓度(1000~2000个/μL)。
  2. 10X 标记 cDNA 片段
    含有 barcode 信息的凝胶珠与细胞和酶的混合物结合,进入储液器中被油分隔开,形成GEMs(Gel Beads-In-Emulsions)。之后,凝胶珠溶解释放含有Barcode序列的捕获序列,逆转录 cDNA 片段,并对样本进行标记。将凝胶珠破碎并打碎油滴,以 cDNA 为模板进行 PCR 扩增。将所有GEMs的产物混合,构建标准测序文库。
  3. 标准测序文库建库
    首先将 cDNA 酶切打断成 200~300bp 左右的片段,然后经过末端修复、加A尾、测序接头 P5 、P7和sample index等常规二代测序文库构建步骤,最后进行 PCR 扩增得到 DNA 文库。
  4. 文库测序
    利用Illumina测序平台的PE150测序模式对建好的文库进行高通量测序。

2.2 分析流程
Fig 2-2-1 10x单细胞分析流程图

广州基迪奥生物科技有限公司




3 测序数据质控和表达量定量

3.1 测序数据基本质控

使用cellranger[1],我们可以对测序质量进行质控,去除测序质量低的reads,并对每个样本测的reads数和测序质量进行初步统计。

Tab 3-1-1 各样本测序数据统计表
SampleNumber of ReadsValid BarcodesSequencing SaturationQ30 Bases in BarcodeQ30 Bases in RNA ReadQ30 Bases in UMI
Control304,237,95895.3%42.2%96.7%92.5%94.7%
DDC336,368,96696.1%41.9%96.5%91.9%94.5%

3.2 数据定量

使用cellranger,我们将reads与参考基因组进行比对,将reads注释为特定基因;再对UMI进行修正和统计后,获得未过滤的feature-barcode矩阵;根据未过滤的feature-barcode矩阵,cellranger对数据中的细胞和非细胞进行识别和区分,并绘制为rank-plot图,直观体现有效细胞鉴定结果。

Tab 3-2-1 各样本比对结果统计表
SampleEstimated Number of CellsFraction Reads in CellsMean Reads per CellMedian Genes per CellTotal Genes DetectedMedian UMI Counts per CellReads Mapped Confidently to GenomeReads Mapped Confidently to Intergenic RegionsReads Mapped Confidently to Intronic RegionsReads Mapped Confidently to Exonic RegionsReads Mapped Confidently to Transcriptome
Control16,75570.0%18,1581,79724,2803,21881.6%8.4%0.0%73.3%58.9%
DDC13,75571.1%24,4542,38624,6564,46180.7%7.0%0.0%73.7%61.1%

  • Control有效细胞
  • DDC有效细胞

Fig 3-2-1 有效细胞鉴定图


各样本质控、定量结果报告:

3.3 最终鉴定细胞表达量矩阵

基于UMI修正和有效细胞鉴定后的结果,我们可以使用UMI条数对基因进行定量,获得如下的细胞-基因表达量定量结果。

Tab 3-3-1 样本Control所有细胞各个基因UMI丰度信息表(示例,前10个细胞,前20个基因)
GeneIDNameAAACCCAAGAAACTCA-1AAACCCAAGAAGTATC-1AAACCCAAGAGGGCGA-1AAACCCAAGCCATTGT-1AAACCCAAGGGCTAAC-1AAACCCAAGTGCTCAT-1AAACCCACACATTACG-1AAACCCACACTCACTC-1AAACCCAGTATAGCTC-1
ENSMUSG00000051951Xkr4000000000
ENSMUSG00000089699Gm1992000000000
ENSMUSG00000102331Gm19938000000000
ENSMUSG00000102343Gm37381000000000
ENSMUSG00000025900Rp1000000000
ENSMUSG00000025902Sox17000000010
ENSMUSG00000104238Gm37587000000000
ENSMUSG00000104328Gm37323000000000
ENSMUSG00000033845Mrpl15100000000
ENSMUSG00000025903Lypla1000000000
ENSMUSG00000104217Gm37988000010100
ENSMUSG00000033813Tcea1000020000
ENSMUSG00000002459Rgs20000000000
ENSMUSG00000085623Gm16041000000000
ENSMUSG00000033793Atp6v1h010000130
ENSMUSG00000025905Oprk1000000000
ENSMUSG00000033774Npbwr1000000000
ENSMUSG00000025907Rb1cc1101020401
ENSMUSG000000900314732440D04Rik000000001
ENSMUSG00000087247Alkal1000000000

备注:由于单个细胞在某个瞬间,只有小部分基因表达,因此表中大量基因UMI丰度为0。

UMI定量总表文件:

备注:文件较大,Excel可能难以打开,建议用软件notepad++ 打开,或联系基迪奥技术人员提取您所关心的基因的UMI丰度。

广州基迪奥生物科技有限公司




4 单细胞亚群分类

在cellranger完成基因表达量鉴定后,我们将表达量矩阵转入Seurat[2]进行后续的分析。

4.1 非正常细胞的进一步过滤

cellranger的细胞过滤是根据基因表达量进行自动识别,会有部分非正常细胞残留,所以在进行亚群分类之前,我们首先使用Seurat对非正常细胞进行进一步过滤。

我们进行过滤的指标主要为以下三个:

  1. 单细胞中鉴定到的基因数量(500-3800)。对于同一种细胞来说,表达基因的数量一般维持在一定范围内,如果该值过高,可能是一个GEM中包裹了多种细胞类型,这样的barcode应该剔除
  2. 单细胞中UMI的总数(600-9000)。单个细胞中可以存在的mRNA总量是有限的,如果UMI总数过高,则可能是两个或两个以上的细胞进入同一个GEM中,这样的细胞应该剔除
  3. 单细胞中线粒体基因表达量比例(小于20%)。细胞凋亡通常伴随着线粒体基因的高表达,所以线粒体基因的高表达意味着细胞状态不佳,这些细胞在实验过程中受到了不良刺激,不利于后续分析反应真实的细胞情况,这样的细胞应该剔除
Tab 4-1-1 过滤前后各个样本中细胞数据量统计表
Samplesbefore_filter_numafter_filter_numpctbefore_filter_median_UMI_per_cellafter_filter_median_UMI_per_cellbefore_filter_median_genes_per_cellafter_filter_median_genes_per_cellbefore_filter_median_MT_per_cellafter_filter_median_MT_per_cell
Control167551592795.06%32183116179717540.4678362573099420.482758620689655
DDC137551183986.07%44614050238622400.3792667509481670.41232112539413

Fig 4-1-1 过滤前后各个样本细胞基本信息的分布图

  • nUMI与nGene的关系
  • nUMI与pMito的关系

Fig 4-1-2 过滤前后各个样本细胞基本信息的分布散点图


Fig 4-1-3 过滤前各个样本细胞中预设标记基因的表达量分布 Fig 4-1-4 过滤后各个样本细胞中预设标记基因的表达量分布

4.2 单细胞亚群分类

在去除低质量细胞后,我们需要进行批次效应矫正。首先对所有样本进行典型相关分析(Canonical Correspondence Analysis, CCA),然后寻找细胞间的最近邻接关系(Mutual Nearest Neighbors, MNN),以此构建细胞间的对应关系,最后,多个样本以细胞间对应关系作为锚点(anchors)完成数据整合并完成批次效应矫正[3]

Tab 4-2-1 细胞亚群分类结果统计表
ClusterCells numberMedian Genes per CellMedian UMI Counts per Cell
0390416982993.5
1332718603303
2291920923749
3287518303305
4272916892903
5263322134034
6191819813608.5
7115923624177
8103317762979
996519493245
1071917832984
1169817493080
1241919193455
1321720683670
1411320773771
15682110.53572.5
164122853941

Tab 4-2-2 各样本在各个亚群中细胞数量统计表
ClusterControlDDC
Total14016 (100%)11721 (100%)
03179 (22.68%)725 (6.19%)
12626 (18.74%)701 (5.98%)
248 (0.34%)2871 (24.49%)
32444 (17.44%)431 (3.68%)
42038 (14.54%)691 (5.9%)
515 (0.11%)2618 (22.34%)
61128 (8.05%)790 (6.74%)
720 (0.14%)1139 (9.72%)
8709 (5.06%)324 (2.76%)
9406 (2.9%)559 (4.77%)
10521 (3.72%)198 (1.69%)
11664 (4.74%)34 (0.29%)
12116 (0.83%)303 (2.59%)
1387 (0.62%)130 (1.11%)
141 (0.01%)112 (0.96%)
157 (0.05%)61 (0.52%)
167 (0.05%)34 (0.29%)

Fig 4-2-1 各样本中各亚群细胞数量堆叠图 Fig 4-2-2 各样本中各亚群细胞数量百分比堆叠图

Fig 4-2-3 各亚群中各个样本细胞数量堆叠图 Fig 4-2-4 各亚群中各个样本细胞数量百分比堆叠图

进一步,我们计算两个细胞亚群之间相关性并绘制成热图。图中具有高度相关性的两个细胞亚群具有比较相似的基因表达模式,可能是同一种细胞类型。这张相关性热图为人工细胞亚群鉴定提供了一定的指导作用。

Fig 4-2-5 各亚群相关性热图

基因在各个亚群中表达量的均值表:2.Cluster/2.cluster/AllGene.avg_exp.annot.xls

Tab 4-2-3 基因在各个亚群中表达量的均值表(前20行)
Gene_IDGene_nameCluster 0Cluster 1Cluster 2Cluster 3Cluster 4Cluster 5Cluster 6Cluster 7Cluster 8Cluster 9Cluster 10Cluster 11Cluster 12Cluster 13Cluster 14Cluster 15Cluster 16DescriptionKEGG_A_classKEGG_B_classPathwayK_IDGO ComponentGO FunctionGO Process
ENSMUSG00000051951Xkr40.02890919285595120.01231703549699910.0182772669189120.02303409765591920.02225237565118470.02501239564491380.02589560098134970.1137031099369650.02775229237257810.03563868454645680.03368241071116710.02898258432034520.5476967445329340.0912295915815388000X-linked Kx blood group related 4 [Source:MGI Symbol;Acc:MGI:3528744]----GO:0005886//plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0003674//molecular_functionGO:0043652//engulfment of apoptotic cell;GO:0070782//phosphatidylserine exposure on apoptotic cell surface;GO:1902742//apoptotic process involved in development
ENSMUSG00000089699Gm19920.001492467275318470000.001834006964091247.74459442798116e-046.600537204522e-040.0020690953708129300000.01312780442722080000predicted gene 1992 [Source:MGI Symbol;Acc:MGI:3780162]-------
ENSMUSG00000102331Gm1993800000000000000000predicted gene, 19938 [Source:MGI Symbol;Acc:MGI:5012123]-------
ENSMUSG00000102343Gm37381006.54533963309182e-0409.95746172351713e-04000.0047757965409947700000.003554184430184270000predicted gene, 37381 [Source:MGI Symbol;Acc:MGI:5610609]-------
ENSMUSG00000025900Rp10.005778080630430530.01041089464486280.009070404619805640.007388083831060260.01076227273040090.008085853380113080.009739674987903160.007839122940753890.007588384118931380.01202713393376930.017170302355310.006184792349322080.02239607095977190.00934935064085124000retinitis pigmentosa 1 (human) [Source:MGI Symbol;Acc:MGI:1341105]----GO:0001750//photoreceptor outer segment;GO:0001917//photoreceptor inner segment;GO:0005737//cytoplasm;GO:0005856//cytoskeleton;GO:0005875//microtubule associated complex;GO:0005929//cilium;GO:0005930//axoneme;GO:0032391//photoreceptor connecting cilium;GO:0042995//cell projection;GO:0097542//ciliary tipGO:0004096//catalase activity;GO:0005515//protein binding;GO:0008017//microtubule binding;GO:0020037//heme bindingGO:0007026//negative regulation of microtubule depolymerization;GO:0007601//visual perception;GO:0030030//cell projection organization;GO:0035082//axoneme assembly;GO:0035556//intracellular signal transduction;GO:0035845//photoreceptor cell outer segment organization;GO:0042461//photoreceptor cell development;GO:0045494//photoreceptor cell maintenance;GO:0046548//retinal rod cell development;GO:0046549//retinal cone cell development;GO:0046785//microtubule polymerization;GO:0050896//response to stimulus;GO:0055114//oxidation-reduction process;GO:0060041//retina development in camera-type eye;GO:0060042//retina morphogenesis in camera-type eye;GO:0071482//cellular response to light stimulus;GO:0098869//cellular oxidant detoxification;GO:1902857//positive regulation of nonmotile primary cilium assembly
ENSMUSG00000025902Sox170.003988177975392570.008907762005414360.002384190745271910.002817386264481480.008362966332588910.007698168334864310.008635169433743170.002786862950998030.2577218193741390.01755990954894890.01789250969604990.003627917843623680.0189408678569760000SRY (sex determining region Y)-box 17 [Source:MGI Symbol;Acc:MGI:107543]Environmental Information ProcessingSignal transductionko04310//Wnt signaling pathwayK04495GO:0005634//nucleus;GO:0005667//transcription factor complex;GO:0044798//nuclear transcription factor complexGO:0000981//RNA polymerase II transcription factor activity, sequence-specific DNA binding;GO:0001228//transcriptional activator activity, RNA polymerase II transcription regulatory region sequence-specific binding;GO:0003677//DNA binding;GO:0003700//transcription factor activity, sequence-specific DNA binding;GO:0003713//transcription coactivator activity;GO:0005515//protein binding;GO:0008013//beta-catenin binding;GO:0008134//transcription factor binding;GO:0043565//sequence-specific DNA binding;GO:0044212//transcription regulatory region DNA bindingGO:0001525//angiogenesis;GO:0001570//vasculogenesis;GO:0001656//metanephros development;GO:0001706//endoderm formation;GO:0001828//inner cell mass cellular morphogenesis;GO:0001947//heart looping;GO:0003142//cardiogenic plate morphogenesis;GO:0003143//embryonic heart tube morphogenesis;GO:0003151//outflow tract morphogenesis;GO:0003308//negative regulation of Wnt signaling pathway involved in heart development;GO:0006355//regulation of transcription, DNA-templated;GO:0007283//spermatogenesis;GO:0007369//gastrulation;GO:0007492//endoderm development;GO:0007493//endodermal cell fate determination;GO:0010628//positive regulation of gene expression;GO:0016055//Wnt signaling pathway;GO:0021903//rostrocaudal neural tube patterning;GO:0023019//signal transduction involved in regulation of gene expression;GO:0030154//cell differentiation;GO:0030178//negative regulation of Wnt signaling pathway;GO:0030308//negative regulation of cell growth;GO:0031648//protein destabilization;GO:0035050//embryonic heart tube development;GO:0042074//cell migration involved in gastrulation;GO:0042789//mRNA transcription from RNA polymerase II promoter;GO:0045595//regulation of cell differentiation;GO:0045597//positive regulation of cell differentiation;GO:0045732//positive regulation of protein catabolic process;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045944//positive regulation of transcription from RNA polymerase II promoter;GO:0045995//regulation of embryonic development;GO:0048568//embryonic organ development;GO:0048617//embryonic foregut morphogenesis;GO:0048863//stem cell differentiation;GO:0048866//stem cell fate specification;GO:0050821//protein stabilization;GO:0060214//endocardium formation;GO:0060807//regulation of transcription from RNA polymerase II promoter involved in definitive endodermal cell fate specification;GO:0060913//cardiac cell fate determination;GO:0060956//endocardial cell differentiation;GO:0061009//common bile duct development;GO:0061010//gall bladder development;GO:0061031//endodermal digestive tract morphogenesis;GO:0072091//regulation of stem cell proliferation;GO:0072189//ureter development;GO:0090090//negative regulation of canonical Wnt signaling pathway;GO:1990830//cellular response to leukemia inhibitory factor;GO:2000035//regulation of stem cell division;GO:2000043//regulation of cardiac cell fate specification
ENSMUSG00000104238Gm37587000000000.0036327865670670500000000predicted gene, 37587 [Source:MGI Symbol;Acc:MGI:5610815]-------
ENSMUSG00000104328Gm3732300000000000000000predicted gene, 37323 [Source:MGI Symbol;Acc:MGI:5610551]-------
ENSMUSG00000033845Mrpl150.4396052213890180.4147880560626460.4968386541896760.4019531442181290.3827515628035960.5260903012094090.4554654476504940.3336226341886950.4672515369285750.4320624832452180.3773159229133350.4028590596588170.4229829174483950.3614552096059160.6367718072265920.4070582120293620.607850833082405mitochondrial ribosomal protein L15 [Source:MGI Symbol;Acc:MGI:1351639]Genetic Information ProcessingTranslationko03010//RibosomeK02876GO:0005739//mitochondrion;GO:0005762//mitochondrial large ribosomal subunit;GO:0005840//ribosome;GO:0015934//large ribosomal subunitGO:0003735//structural constituent of ribosomeGO:0000002//mitochondrial genome maintenance;GO:0006412//translation;GO:1990830//cellular response to leukemia inhibitory factor
ENSMUSG00000025903Lypla1000.00170779062727190.001470012033976840.001403077772082386.85921818219606e-040.0062547989817879000.0029472964449710300.0094385861035427700000lysophospholipase 1 [Source:MGI Symbol;Acc:MGI:1344588]Human Diseases;MetabolismCancers;Lipid metabolismko05231//Choline metabolism in cancer;ko00564//Glycerophospholipid metabolismK06128;K06128GO:0005737//cytoplasm;GO:0005739//mitochondrion;GO:0005829//cytosolGO:0004622//lysophospholipase activity;GO:0008474//palmitoyl-(protein) hydrolase activity;GO:0016298//lipase activity;GO:0016787//hydrolase activity;GO:0052689//carboxylic ester hydrolase activityGO:0002084//protein depalmitoylation;GO:0006629//lipid metabolic process;GO:0006631//fatty acid metabolic process;GO:0042997//negative regulation of Golgi to plasma membrane protein transport
ENSMUSG00000104217Gm379880.2540589392480730.228392488188550.2410185055126050.2382318127124690.2299399323936010.205226948168230.2079104969464850.1220365870539640.1892158258260380.2275573967267030.1016240430171410.2392062303107670.181866925374340.1661409034190520.142771219163870.3394449133030960.337666406382047predicted gene, 37988 [Source:MGI Symbol;Acc:MGI:5611216]-------
ENSMUSG00000033813Tcea10.2581751720011220.2414640384506910.2749010222727090.2076330673570960.2731074810093680.2272275341663440.2553426786477030.2240443107381870.2120365359494020.2719440883192090.2748902544050970.195552007595390.2222690186911060.2527204646420340.2226478896620280.1551539830976670.377301561235658transcription elongation factor A (SII) 1 [Source:MGI Symbol;Acc:MGI:1196624]----GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005669//transcription factor TFIID complex;GO:0005730//nucleolusGO:0003676//nucleic acid binding;GO:0003677//DNA binding;GO:0005515//protein binding;GO:0008270//zinc ion binding;GO:0046872//metal ion bindingGO:0006351//transcription, DNA-templated;GO:0006355//regulation of transcription, DNA-templated;GO:0030218//erythrocyte differentiation;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045944//positive regulation of transcription from RNA polymerase II promoter;GO:1901919//positive regulation of exoribonuclease activity
ENSMUSG00000002459Rgs200.01369749333036580.02122562672588370.01713067152476170.01415275750321870.02287128141499920.007750193359252240.007413366576864290.02518369443381940.02034991129305350.01963603133823310.03002683517415640.02392481836055510.03629137294819890000regulator of G-protein signaling 20 [Source:MGI Symbol;Acc:MGI:1929866]----GO:0005634//nucleus;GO:0005737//cytoplasm;GO:0005794//Golgi apparatus;GO:0005802//trans-Golgi network;GO:0016020//membraneGO:0005096//GTPase activator activityGO:0007186//G-protein coupled receptor signaling pathway;GO:0009968//negative regulation of signal transduction;GO:0043547//positive regulation of GTPase activity
ENSMUSG00000085623Gm1604100000000000000000predicted gene 16041 [Source:MGI Symbol;Acc:MGI:3801909]-------
ENSMUSG00000033793Atp6v1h0.5762016930668420.5609180353808350.7901952152790220.6446823028737420.6056250195631760.8200604338483860.798784866316611.051426859922830.646065107280660.6816160524196470.7351591374743440.5472416875781880.7637662536530170.6314944739616570.6369471049008660.8093776013870210.644650343075006ATPase, H+ transporting, lysosomal V1 subunit H [Source:MGI Symbol;Acc:MGI:1914864]Metabolism;Human Diseases;Human Diseases;Cellular Processes;Environmental Information Processing;Metabolism;Cellular Processes;Human Diseases;Organismal SystemsGlobal and overview maps;Infectious diseases;Infectious diseases;Transport and catabolism;Signal transduction;Energy metabolism;Transport and catabolism;Immune diseases;Nervous systemko01100//Metabolic pathways;ko05165//Human papillomavirus infection;ko05152//Tuberculosis;ko04145//Phagosome;ko04150//mTOR signaling pathway;ko00190//Oxidative phosphorylation;ko04142//Lysosome;ko05323//Rheumatoid arthritis;ko04721//Synaptic vesicle cycleK02144;K02144;K02144;K02144;K02144;K02144;K02144;K02144;K02144GO:0000221//vacuolar proton-transporting V-type ATPase, V1 domainGO:0046961//proton-transporting ATPase activity, rotational mechanismGO:0006811//ion transport;GO:0006897//endocytosis;GO:1902600//hydrogen ion transmembrane transport
ENSMUSG00000025905Oprk100005.50119488703544e-0400.0012956670819711600.004133450943790850000.004370325663557470000opioid receptor, kappa 1 [Source:MGI Symbol;Acc:MGI:97439]Environmental Information ProcessingSignaling molecules and interactionko04080//Neuroactive ligand-receptor interactionK04214GO:0005634//nucleus;GO:0005829//cytosol;GO:0005886//plasma membrane;GO:0005887//integral component of plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0030285//integral component of synaptic vesicle membrane;GO:0030425//dendrite;GO:0043025//neuronal cell body;GO:0043204//perikaryon;GO:0043679//axon terminus;GO:0045202//synapseGO:0004930//G-protein coupled receptor activity;GO:0004985//opioid receptor activity;GO:0033612//receptor serine/threonine kinase binding;GO:0038048//dynorphin receptor activity;GO:0042277//peptide binding;GO:0042923//neuropeptide bindingGO:0006955//immune response;GO:0007165//signal transduction;GO:0007186//G-protein coupled receptor signaling pathway;GO:0007200//phospholipase C-activating G-protein coupled receptor signaling pathway;GO:0007218//neuropeptide signaling pathway;GO:0007610//behavior;GO:0007626//locomotory behavior;GO:0009314//response to radiation;GO:0019233//sensory perception of pain;GO:0031635//adenylate cyclase-inhibiting opioid receptor signaling pathway;GO:0032868//response to insulin;GO:0033603//positive regulation of dopamine secretion;GO:0033685//negative regulation of luteinizing hormone secretion;GO:0038003//opioid receptor signaling pathway;GO:0040017//positive regulation of locomotion;GO:0042220//response to cocaine;GO:0042711//maternal behavior;GO:0042755//eating behavior;GO:0043278//response to morphine;GO:0043627//response to estrogen;GO:0044849//estrous cycle;GO:0045471//response to ethanol;GO:0046877//regulation of saliva secretion;GO:0048148//behavioral response to cocaine;GO:0050951//sensory perception of temperature stimulus;GO:0051607//defense response to virus;GO:0051930//regulation of sensory perception of pain;GO:0071222//cellular response to lipopolysaccharide;GO:0071333//cellular response to glucose stimulus;GO:1900745//positive regulation of p38MAPK cascade;GO:1901381//positive regulation of potassium ion transmembrane transport;GO:1903937//response to acrylamide;GO:1904000//positive regulation of eating behavior;GO:1990708//conditioned place preference
ENSMUSG00000033774Npbwr1005.96938624143624e-0400000000000000neuropeptides B/W receptor 1 [Source:MGI Symbol;Acc:MGI:891989]Environmental Information ProcessingSignaling molecules and interactionko04080//Neuroactive ligand-receptor interactionK05268GO:0005886//plasma membrane;GO:0005887//integral component of plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membraneGO:0004930//G-protein coupled receptor activity;GO:0008188//neuropeptide receptor activity;GO:0042277//peptide binding;GO:0042923//neuropeptide bindingGO:0007165//signal transduction;GO:0007186//G-protein coupled receptor signaling pathway;GO:0007218//neuropeptide signaling pathway;GO:0019222//regulation of metabolic process
ENSMUSG00000025907Rb1cc13.303283762085633.047993480814743.456507449865523.059939327721953.092775475271673.443570767569733.11563146249523.284961311158263.415092983078323.360246112734273.018181240519532.823620237970782.66019414358943.434852532529283.055178696610453.591759373541022.86506491844575RB1-inducible coiled-coil 1 [Source:MGI Symbol;Acc:MGI:1341850]Cellular Processes;Organismal SystemsTransport and catabolism;Agingko04140//Autophagy - animal;ko04211//Longevity regulating pathway - mammalK17589;K17589GO:0000407//pre-autophagosomal structure;GO:0005634//nucleus;GO:0005737//cytoplasm;GO:0005764//lysosome;GO:0005829//cytosol;GO:0019898//extrinsic component of membrane;GO:0031965//nuclear membrane;GO:0034045//pre-autophagosomal structure membrane;GO:1990316//ATG1/ULK1 kinase complexGO:0005515//protein binding;GO:0019901//protein kinase bindingGO:0000045//autophagosome assembly;GO:0000422//mitophagy;GO:0001889//liver development;GO:0001934//positive regulation of protein phosphorylation;GO:0006914//autophagy;GO:0007049//cell cycle;GO:0007507//heart development;GO:0030242//pexophagy;GO:0034727//piecemeal microautophagy of nucleus;GO:0043066//negative regulation of apoptotic process;GO:0045793//positive regulation of cell size;GO:0046330//positive regulation of JNK cascade;GO:0061709//reticulophagy;GO:0061723//glycophagy;GO:2001237//negative regulation of extrinsic apoptotic signaling pathway
ENSMUSG000000900314732440D04Rik0.1228774235290320.07039522485607680.1563177999605770.08321922073199090.1196800658176660.1182714625983770.07977449949999330.09235385805820540.1291212234199390.1360009741140120.1137966218350620.07659975227453030.09799332957475860.1224746661337860.04415314922168930.1881738305319670.0618884646090815RIKEN cDNA 4732440D04 gene [Source:MGI Symbol;Acc:MGI:3604103]-------
ENSMUSG00000087247Alkal10.003454899609706297.0440844870311e-040.0011177261872627305.87705840297403e-0400.00477895205860330.001172459260264970.01276822420382110.007914488536683720.002810874260599530.004562626612888510.006340687685623630000ALK and LTK ligand 1 [Source:MGI Symbol;Acc:MGI:3645495]----GO:0005575//cellular_component;GO:0005576//extracellular regionGO:0030298//receptor signaling protein tyrosine kinase activator activity;GO:0030971//receptor tyrosine kinase bindingGO:0010976//positive regulation of neuron projection development;GO:0061098//positive regulation of protein tyrosine kinase activity;GO:0070374//positive regulation of ERK1 and ERK2 cascade;GO:0070378//positive regulation of ERK5 cascade

细胞与亚群对照表:2.Cluster/2.cluster/Cells.cluster.list.xls

Tab 4-2-4 细胞与亚群对照表(前20行)
CellsClusterSamples
Control_AAACCCAAGAAGTATC3Control
Control_AAACCCAAGAGGGCGA1Control
Control_AAACCCAAGCCATTGT3Control
Control_AAACCCAAGTGCTCAT4Control
Control_AAACCCACACATTACG0Control
Control_AAACCCACACTCACTC3Control
Control_AAACCCAGTATAGCTC11Control
Control_AAACCCAGTCATGACT10Control
Control_AAACCCAGTCCGGCAT3Control
Control_AAACCCAGTCTACTGA4Control
Control_AAACCCAGTGGTTCTA1Control
Control_AAACCCAGTGTCCTAA6Control
Control_AAACCCATCACCACAA0Control
Control_AAACCCATCCTCTCGA3Control
Control_AAACCCATCGAAATCC4Control
Control_AAACGAAAGATTGAGT0Control
Control_AAACGAAAGGCTAACG4Control
Control_AAACGAACACCTATCC1Control
Control_AAACGAACATGAAGGC6Control
Control_AAACGAACATGTAACC3Control

4.3 分类结果可视化

基于细胞亚群分类的结果,进一步利用tSNE(tSNE,t-Distributed Stochastic Neighbor Embedding)非线性聚类的方法对单细胞亚群分类结果进行可视化[4]。tSNE 的方法通常对不同亚群细胞的分类结果有更佳的呈现效果(亚群间的隔离更加清晰)。

对所有样本的亚群分类可视化,结果如下:

Fig 4-3-1 单细胞亚群分类tSNE图

分别对各个样本的亚群分类可视化,结果如下:

  • 样本Control单细胞亚群分类tSNE图
  • 样本DDC单细胞亚群分类tSNE图

Fig 4-3-2 各样本单细胞亚群分类tSNE图


4.4 基于已知标记基因的分类效果评估

如果客户提前提供已知的在特定类型细胞表达的标记基因,那么我们可以分析标记基因在各个细胞亚群中的表达分布,从而:1)判断识别出的细胞亚群属于哪一类细胞;2)细胞亚群分类结果是否合理。

Fig 4-4-1 已知标记基因在各个细胞亚群中的表达分布小提琴图


Fig 4-4-2 已知标记基因在各个细胞亚群中的表达分布


Fig 4-4-3 已知标记基因在各个细胞亚群中的表达分布气泡图


Fig 4-4-4 已知标记基因在各个亚群的表达量热图

4.5 单细胞亚群鉴定

以上单细胞亚群分类是基于细胞表达特征的相似性进行聚类的,每个亚群不具有生物学意义。所以,细胞鉴定一直是很重要但又比较繁琐的步骤。这里,我们使用singleR[5]对所有细胞进行自动化注释,为后续的人工细胞鉴定工作提供参考。

singleR是通过细胞与参考数据库中细胞类型的相似度来自动化鉴定细胞类型,对于相似度较高的细胞类型的注释准确性会降低。所以,singleR的注释结果只能作为辅助手段,最终的细胞亚群鉴定结果依然需要人工鉴定的确认。

Tab 4-5-1 各样本在各个细胞类型中细胞数量统计表
ClusterControlDDC
Total14016 (100%)11721 (100%)
B cells0 (0%)2 (0.02%)
Endothelial cells26 (0.19%)14 (0.12%)
Hepatocytes13988 (99.8%)11697 (99.8%)
Kupffer cells1 (0.01%)0 (0%)
Macrophages0 (0%)2 (0.02%)
Red pulp macrophages1 (0.01%)5 (0.04%)
T memory cells0 (0%)1 (0.01%)

Fig 4-5-1 各样本中各细胞类型细胞数量堆叠图 Fig 4-5-2 各样本中各细胞类型细胞数量百分比堆叠图

Fig 4-5-3 各细胞类型中各个样本细胞数量堆叠图 Fig 4-5-4 各细胞类型中各个样本细胞数量百分比堆叠图

Fig 4-5-5 各细胞类型在tSNE图的分布

各细胞亚群中各个细胞类型数量统计表:4.CellAnnotation/Cell.annotation.stat.xls

Tab 4-5-2 各细胞亚群中各个细胞类型数量统计表
ClusterCell.annotation (Maximum proportion)B cellsEndothelial cellsHepatocytesKupffer cellsMacrophagesRed pulp macrophagesT memory cells
0Hepatocytes(100.00%)0(0.00%)0(0.00%)3904(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
1Hepatocytes(100.00%)0(0.00%)0(0.00%)3327(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
2Hepatocytes(100.00%)0(0.00%)0(0.00%)2919(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
3Hepatocytes(100.00%)0(0.00%)0(0.00%)2875(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
4Hepatocytes(100.00%)0(0.00%)0(0.00%)2729(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
5Hepatocytes(100.00%)0(0.00%)0(0.00%)2633(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
6Hepatocytes(100.00%)0(0.00%)0(0.00%)1918(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
7Hepatocytes(100.00%)0(0.00%)0(0.00%)1159(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
8Hepatocytes(96.42%)0(0.00%)37(3.58%)996(96.42%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
9Hepatocytes(98.86%)2(0.21%)0(0.00%)954(98.86%)1(0.10%)2(0.21%)5(0.52%)1(0.10%)
10Hepatocytes(99.44%)0(0.00%)3(0.42%)715(99.44%)0(0.00%)0(0.00%)1(0.14%)0(0.00%)
11Hepatocytes(100.00%)0(0.00%)0(0.00%)698(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
12Hepatocytes(100.00%)0(0.00%)0(0.00%)419(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
13Hepatocytes(100.00%)0(0.00%)0(0.00%)217(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
14Hepatocytes(100.00%)0(0.00%)0(0.00%)113(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
15Hepatocytes(100.00%)0(0.00%)0(0.00%)68(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)
16Hepatocytes(100.00%)0(0.00%)0(0.00%)41(100.00%)0(0.00%)0(0.00%)0(0.00%)0(0.00%)

Fig 4-5-6 Seurat分群与singleR细胞鉴定对应circos图

通过singleR,我们可以鉴定细胞的细胞类型;通过Seurat,我们可以得到细胞的聚类信息。通过这两个软件,我们可以将一组细胞按照细胞类型和细胞分群两种方式进行聚类。然后,我们计算各个细胞亚群的细胞与各个细胞类型的细胞之间的相关性,并绘制成热图,作为singleR细胞鉴定结果准确性的一个佐证。

Fig 4-5-7 Seurat分群与singleR鉴定细胞类型相关性热图

广州基迪奥生物科技有限公司




5 亚群上调表达基因分析

5.1 上调表达基因分析

为了了解各个细胞亚群的分子表达特征,我们可以筛选各个细胞亚群上调表达的基因。

采用Seurat的秩和检验分别对不同类细胞群进行基因差异表达分析,筛选亚群上调表达的基因。

上调基因的筛选条件为

  1. 目标亚群或对照亚群中,基因在25%以上的细胞中有表达。
  2. P值 ≤0.01;
  3. 基因表达倍数log2FC≥0.36,即基因上调的倍数≥1.28。

Tab 5-1-1 各亚群上调基因数量统计表
Cluster012345678910111213141516
Number of DE genes2792334072822524272971084172217194411428209474336614

Fig 5-1-1 各亚群上调基因数量统计柱状图

各亚群差异基因注释表: 3.MarkerGene/DeGene.list.xls

Tab 5-1-2 各亚群差异基因注释表(前20行)
Target_ClusterGene IDGene NameTarget_Cluster_meanOther_Cluster_meanLog2FCPvalueQvalueDescriptionKEGG_A_classKEGG_B_classPathwayK_IDGO ComponentGO FunctionGO Process
0ENSMUSG00000093662Gm206495.391797836865631.219299245345982.1447141628469100predicted gene 20649 [Source:MGI Symbol;Acc:MGI:5313096]-------
0ENSMUSG00000024694Keg17.55248571603432.186488430100381.7883358360412100kidney expressed gene 1 [Source:MGI Symbol;Acc:MGI:1928492]MetabolismAmino acid metabolismko00360//Phenylalanine metabolismK00628GO:0005737//cytoplasm;GO:0005739//mitochondrion;GO:0005813//centrosome;GO:0005815//microtubule organizing center;GO:0005856//cytoskeleton;GO:0005874//microtubuleGO:0016740//transferase activity;GO:0016746//transferase activity, transferring acyl groups;GO:0047961//glycine N-acyltransferase activity-
0ENSMUSG00000020122Egfr84.168152367130224.6694067900991.7705514236316200epidermal growth factor receptor [Source:MGI Symbol;Acc:MGI:95294]Human Diseases;Human Diseases;Environmental Information Processing;Environmental Information Processing;Cellular Processes;Human Diseases;Environmental Information Processing;Cellular Processes;Environmental Information Processing;Human Diseases;Cellular Processes;Environmental Information Processing;Human Diseases;Environmental Information Processing;Human Diseases;Organismal Systems;Human Diseases;Human Diseases;Environmental Information Processing;Human Diseases;Organismal Systems;Human Diseases;Environmental Information Processing;Organismal Systems;Organismal Systems;Environmental Information Processing;Human Diseases;Human Diseases;Human Diseases;Organismal Systems;Human Diseases;Cellular Processes;Environmental Information Processing;Human Diseases;Human Diseases;Cellular Processes;Human Diseases;Human Diseases;Human Diseases;Human Diseases;Human Diseases;Human DiseasesCancers;Infectious diseases;Signal transduction;Signal transduction;Transport and catabolism;Infectious diseases;Signal transduction;Cell motility;Signal transduction;Cancers;Cellular community - eukaryotes;Signal transduction;Cancers;Signal transduction;Endocrine and metabolic diseases;Endocrine system;Cancers;Cancers;Signal transduction;Cancers;Endocrine system;Infectious diseases;Signal transduction;Endocrine system;Endocrine system;Signal transduction;Cancers;Cancers;Drug resistance;Endocrine system;Cancers;Cellular community - eukaryotes;Signal transduction;Drug resistance;Cancers;Cellular community - eukaryotes;Cancers;Cancers;Cancers;Cancers;Cancers;Cancersko05200//Pathways in cancer;ko05165//Human papillomavirus infection;ko04151//PI3K-Akt signaling pathway;ko04010//MAPK signaling pathway;ko04144//Endocytosis;ko05163//Human cytomegalovirus infection;ko04014//Ras signaling pathway;ko04810//Regulation of actin cytoskeleton;ko04015//Rap1 signaling pathway;ko05205//Proteoglycans in cancer;ko04510//Focal adhesion;ko04020//Calcium signaling pathway;ko05225//Hepatocellular carcinoma;ko04630//Jak-STAT signaling pathway;ko04934//Cushing syndrome;ko04921//Oxytocin signaling pathway;ko05226//Gastric cancer;ko05224//Breast cancer;ko04072//Phospholipase D signaling pathway;ko05206//MicroRNAs in cancer;ko04915//Estrogen signaling pathway;ko05160//Hepatitis C;ko04068//FoxO signaling pathway;ko04926//Relaxin signaling pathway;ko04928//Parathyroid hormone synthesis, secretion and action;ko04066//HIF-1 signaling pathway;ko05231//Choline metabolism in cancer;ko05215//Prostate cancer;ko01522//Endocrine resistance;ko04912//GnRH signaling pathway;ko05210//Colorectal cancer;ko04540//Gap junction;ko04012//ErbB signaling pathway;ko01521//EGFR tyrosine kinase inhibitor resistance;ko05212//Pancreatic cancer;ko04520//Adherens junction;ko05218//Melanoma;ko05214//Glioma;ko05223//Non-small cell lung cancer;ko05230//Central carbon metabolism in cancer;ko05213//Endometrial cancer;ko05219//Bladder cancerK04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361;K04361GO:0000139//Golgi membrane;GO:0005634//nucleus;GO:0005737//cytoplasm;GO:0005768//endosome;GO:0005783//endoplasmic reticulum;GO:0005789//endoplasmic reticulum membrane;GO:0005794//Golgi apparatus;GO:0005886//plasma membrane;GO:0005887//integral component of plasma membrane;GO:0009925//basal plasma membrane;GO:0009986//cell surface;GO:0010008//endosome membrane;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0016323//basolateral plasma membrane;GO:0016324//apical plasma membrane;GO:0030139//endocytic vesicle;GO:0031901//early endosome membrane;GO:0031965//nuclear membrane;GO:0032991//macromolecular complex;GO:0043235//receptor complex;GO:0045121//membrane raft;GO:0045202//synapse;GO:0048471//perinuclear region of cytoplasm;GO:0097489//multivesicular body, internal vesicle lumenGO:0000166//nucleotide binding;GO:0003682//chromatin binding;GO:0004672//protein kinase activity;GO:0004713//protein tyrosine kinase activity;GO:0004714//transmembrane receptor protein tyrosine kinase activity;GO:0004888//transmembrane signaling receptor activity;GO:0005006//epidermal growth factor-activated receptor activity;GO:0005102//receptor binding;GO:0005178//integrin binding;GO:0005515//protein binding;GO:0005516//calmodulin binding;GO:0005524//ATP binding;GO:0016301//kinase activity;GO:0016740//transferase activity;GO:0019899//enzyme binding;GO:0019901//protein kinase binding;GO:0019903//protein phosphatase binding;GO:0030235//nitric-oxide synthase regulator activity;GO:0031625//ubiquitin protein ligase binding;GO:0042802//identical protein binding;GO:0046982//protein heterodimerization activity;GO:0048408//epidermal growth factor binding;GO:0051015//actin filament bindingGO:0000186//activation of MAPKK activity;GO:0000902//cell morphogenesis;GO:0001889//liver development;GO:0001892//embryonic placenta development;GO:0001934//positive regulation of protein phosphorylation;GO:0001942//hair follicle development;GO:0006412//translation;GO:0006468//protein phosphorylation;GO:0006970//response to osmotic stress;GO:0006979//response to oxidative stress;GO:0007165//signal transduction;GO:0007166//cell surface receptor signaling pathway;GO:0007169//transmembrane receptor protein tyrosine kinase signaling pathway;GO:0007173//epidermal growth factor receptor signaling pathway;GO:0007275//multicellular organism development;GO:0007435//salivary gland morphogenesis;GO:0007494//midgut development;GO:0007611//learning or memory;GO:0007623//circadian rhythm;GO:0008283//cell proliferation;GO:0008284//positive regulation of cell proliferation;GO:0008544//epidermis development;GO:0010750//positive regulation of nitric oxide mediated signal transduction;GO:0010960//magnesium ion homeostasis;GO:0014066//regulation of phosphatidylinositol 3-kinase signaling;GO:0014070//response to organic cyclic compound;GO:0016101//diterpenoid metabolic process;GO:0016310//phosphorylation;GO:0018108//peptidyl-tyrosine phosphorylation;GO:0021795//cerebral cortex cell migration;GO:0030154//cell differentiation;GO:0030307//positive regulation of cell growth;GO:0030324//lung development;GO:0030335//positive regulation of cell migration;GO:0032355//response to estradiol;GO:0032930//positive regulation of superoxide anion generation;GO:0033138//positive regulation of peptidyl-serine phosphorylation;GO:0033590//response to cobalamin;GO:0033594//response to hydroxyisoflavone;GO:0033993//response to lipid;GO:0034614//cellular response to reactive oxygen species;GO:0035556//intracellular signal transduction;GO:0035690//cellular response to drug;GO:0038083//peptidyl-tyrosine autophosphorylation;GO:0042060//wound healing;GO:0042127//regulation of cell proliferation;GO:0042177//negative regulation of protein catabolic process;GO:0042327//positive regulation of phosphorylation;GO:0042698//ovulation cycle;GO:0042743//hydrogen peroxide metabolic process;GO:0043066//negative regulation of apoptotic process;GO:0043406//positive regulation of MAP kinase activity;GO:0043586//tongue development;GO:0045737//positive regulation of cyclin-dependent protein serine/threonine kinase activity;GO:0045739//positive regulation of DNA repair;GO:0045740//positive regulation of DNA replication;GO:0045780//positive regulation of bone resorption;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045907//positive regulation of vasoconstriction;GO:0045930//negative regulation of mitotic cell cycle;GO:0045944//positive regulation of transcription from RNA polymerase II promoter;GO:0046328//regulation of JNK cascade;GO:0046777//protein autophosphorylation;GO:0048143//astrocyte activation;GO:0048146//positive regulation of fibroblast proliferation;GO:0048546//digestive tract morphogenesis;GO:0048661//positive regulation of smooth muscle cell proliferation;GO:0048812//neuron projection morphogenesis;GO:0050679//positive regulation of epithelial cell proliferation;GO:0050729//positive regulation of inflammatory response;GO:0050730//regulation of peptidyl-tyrosine phosphorylation;GO:0050999//regulation of nitric-oxide synthase activity;GO:0051592//response to calcium ion;GO:0051897//positive regulation of protein kinase B signaling;GO:0051968//positive regulation of synaptic transmission, glutamatergic;GO:0060571//morphogenesis of an epithelial fold;GO:0061029//eyelid development in camera-type eye;GO:0070141//response to UV-A;GO:0070372//regulation of ERK1 and ERK2 cascade;GO:0070374//positive regulation of ERK1 and ERK2 cascade;GO:0071230//cellular response to amino acid stimulus;GO:0071260//cellular response to mechanical stimulus;GO:0071276//cellular response to cadmium ion;GO:0071363//cellular response to growth factor stimulus;GO:0071364//cellular response to epidermal growth factor stimulus;GO:0071392//cellular response to estradiol stimulus;GO:0071549//cellular response to dexamethasone stimulus;GO:0090263//positive regulation of canonical Wnt signaling pathway;GO:0097421//liver regeneration;GO:0098609//cell-cell adhesion;GO:1900020//positive regulation of protein kinase C activity;GO:1900087//positive regulation of G1/S transition of mitotic cell cycle;GO:1901224//positive regulation of NIK/NF-kappaB signaling;GO:1902722//positive regulation of prolactin secretion;GO:1902966//positive regulation of protein localization to early endosome;GO:1903078//positive regulation of protein localization to plasma membrane;GO:1903800//positive regulation of production of miRNAs involved in gene silencing by miRNA
0ENSMUSG00000041237Pklr5.04230768079121.494203782638061.7547072345846400pyruvate kinase liver and red blood cell [Source:MGI Symbol;Acc:MGI:97604]Metabolism;Metabolism;Human Diseases;Organismal Systems;Metabolism;Metabolism;Metabolism;Human Diseases;Metabolism;Human DiseasesGlobal and overview maps;Nucleotide metabolism;Endocrine and metabolic diseases;Endocrine system;Global and overview maps;Global and overview maps;Carbohydrate metabolism;Endocrine and metabolic diseases;Carbohydrate metabolism;Endocrine and metabolic diseasesko01100//Metabolic pathways;ko00230//Purine metabolism;ko04932//Non-alcoholic fatty liver disease (NAFLD);ko04910//Insulin signaling pathway;ko01200//Carbon metabolism;ko01230//Biosynthesis of amino acids;ko00010//Glycolysis / Gluconeogenesis;ko04930//Type II diabetes mellitus;ko00620//Pyruvate metabolism;ko04950//Maturity onset diabetes of the youngK12406;K12406;K12406;K12406;K12406;K12406;K12406;K12406;K12406;K12406GO:0005829//cytosolGO:0000287//magnesium ion binding;GO:0003824//catalytic activity;GO:0004743//pyruvate kinase activity;GO:0016301//kinase activity;GO:0016740//transferase activity;GO:0030955//potassium ion bindingGO:0001666//response to hypoxia;GO:0005975//carbohydrate metabolic process;GO:0006096//glycolytic process;GO:0006754//ATP biosynthetic process;GO:0007584//response to nutrient;GO:0009408//response to heat;GO:0009749//response to glucose;GO:0010038//response to metal ion;GO:0010226//response to lithium ion;GO:0016310//phosphorylation;GO:0032869//cellular response to insulin stimulus;GO:0033198//response to ATP;GO:0042866//pyruvate biosynthetic process;GO:0051591//response to cAMP;GO:0071872//cellular response to epinephrine stimulus
0ENSMUSG00000113543Gm362644.249231655808661.313699317495911.6935668912488200predicted gene, 36264 [Source:MGI Symbol;Acc:MGI:5595423]-------
0ENSMUSG00000062181Ces3b18.57896118344785.779727759610081.6845963930009200carboxylesterase 3B [Source:MGI Symbol;Acc:MGI:3644960]----GO:0005615//extracellular space;GO:0005783//endoplasmic reticulum;GO:0005788//endoplasmic reticulum lumenGO:0004771//sterol esterase activity;GO:0004806//triglyceride lipase activity;GO:0016787//hydrolase activity;GO:0052689//carboxylic ester hydrolase activity;GO:0080030//methyl indole-3-acetate esterase activityGO:0016042//lipid catabolic process
0ENSMUSG00000020053Igf119.87760031960126.285167018935191.6611206134312600insulin-like growth factor 1 [Source:MGI Symbol;Acc:MGI:96432]Human Diseases;Environmental Information Processing;Environmental Information Processing;Environmental Information Processing;Environmental Information Processing;Human Diseases;Cellular Processes;Human Diseases;Environmental Information Processing;Human Diseases;Cellular Processes;Environmental Information Processing;Environmental Information Processing;Organismal Systems;Cellular Processes;Environmental Information Processing;Human Diseases;Human Diseases;Organismal Systems;Organismal Systems;Human Diseases;Human Diseases;Human Diseases;Human Diseases;Cellular Processes;Human Diseases;Organismal Systems;Organismal Systems;Organismal Systems;Organismal SystemsCancers;Signal transduction;Signal transduction;Signal transduction;Signal transduction;Cancers;Cellular community - eukaryotes;Cancers;Signal transduction;Cancers;Cellular community - eukaryotes;Signal transduction;Signal transduction;Sensory system;Cell growth and death;Signal transduction;Cancers;Drug resistance;Aging;Endocrine system;Cardiovascular diseases;Cardiovascular diseases;Drug resistance;Cancers;Cell growth and death;Cancers;Aging;Nervous system;Endocrine system;Excretory systemko05200//Pathways in cancer;ko04151//PI3K-Akt signaling pathway;ko04010//MAPK signaling pathway;ko04014//Ras signaling pathway;ko04015//Rap1 signaling pathway;ko05205//Proteoglycans in cancer;ko04510//Focal adhesion;ko05202//Transcriptional misregulation in cancers;ko04150//mTOR signaling pathway;ko05224//Breast cancer;ko04550//Signaling pathways regulating pluripotency of stem cells;ko04068//FoxO signaling pathway;ko04152//AMPK signaling pathway;ko04750//Inflammatory mediator regulation of TRP channels;ko04114//Oocyte meiosis;ko04066//HIF-1 signaling pathway;ko05215//Prostate cancer;ko01522//Endocrine resistance;ko04211//Longevity regulating pathway - mammal;ko04914//Progesterone-mediated oocyte maturation;ko05414//Dilated cardiomyopathy (DCM);ko05410//Hypertrophic cardiomyopathy (HCM);ko01521//EGFR tyrosine kinase inhibitor resistance;ko05218//Melanoma;ko04115//p53 signaling pathway;ko05214//Glioma;ko04213//Longevity regulating pathway - multiple species;ko04730//Long-term depression;ko04913//Ovarian Steroidogenesis;ko04960//Aldosterone-regulated sodium reabsorptionK05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459;K05459GO:0005576//extracellular region;GO:0005614//interstitial matrix;GO:0005615//extracellular space;GO:0005737//cytoplasm;GO:0005886//plasma membrane;GO:0031091//platelet alpha granule;GO:0035867//alphav-beta3 integrin-IGF-1-IGF1R complex;GO:0042567//insulin-like growth factor ternary complex;GO:0043025//neuronal cell body;GO:0070382//exocytic vesicleGO:0005158//insulin receptor binding;GO:0005159//insulin-like growth factor receptor binding;GO:0005178//integrin binding;GO:0005179//hormone activity;GO:0005496//steroid binding;GO:0005515//protein binding;GO:0008083//growth factor activity;GO:0043539//protein serine/threonine kinase activator activityGO:0000187//activation of MAPK activity;GO:0001649//osteoblast differentiation;GO:0001775//cell activation;GO:0001932//regulation of protein phosphorylation;GO:0001974//blood vessel remodeling;GO:0002683//negative regulation of immune system process;GO:0006417//regulation of translation;GO:0007165//signal transduction;GO:0007399//nervous system development;GO:0007608//sensory perception of smell;GO:0007613//memory;GO:0007623//circadian rhythm;GO:0008283//cell proliferation;GO:0008284//positive regulation of cell proliferation;GO:0008285//negative regulation of cell proliferation;GO:0009408//response to heat;GO:0010001//glial cell differentiation;GO:0010468//regulation of gene expression;GO:0010560//positive regulation of glycoprotein biosynthetic process;GO:0010613//positive regulation of cardiac muscle hypertrophy;GO:0010628//positive regulation of gene expression;GO:0010629//negative regulation of gene expression;GO:0010656//negative regulation of muscle cell apoptotic process;GO:0014065//phosphatidylinositol 3-kinase signaling;GO:0014068//positive regulation of phosphatidylinositol 3-kinase signaling;GO:0014834//skeletal muscle satellite cell maintenance involved in skeletal muscle regeneration;GO:0014896//muscle hypertrophy;GO:0014904//myotube cell development;GO:0014911//positive regulation of smooth muscle cell migration;GO:0021940//positive regulation of cerebellar granule cell precursor proliferation;GO:0030104//water homeostasis;GO:0030166//proteoglycan biosynthetic process;GO:0030307//positive regulation of cell growth;GO:0030324//lung development;GO:0030335//positive regulation of cell migration;GO:0030879//mammary gland development;GO:0031017//exocrine pancreas development;GO:0031643//positive regulation of myelination;GO:0032148//activation of protein kinase B activity;GO:0032691//negative regulation of interleukin-1 beta production;GO:0032720//negative regulation of tumor necrosis factor production;GO:0032878//regulation of establishment or maintenance of cell polarity;GO:0034392//negative regulation of smooth muscle cell apoptotic process;GO:0035264//multicellular organism growth;GO:0035630//bone mineralization involved in bone maturation;GO:0040014//regulation of multicellular organism growth;GO:0042104//positive regulation of activated T cell proliferation;GO:0042127//regulation of cell proliferation;GO:0042531//positive regulation of tyrosine phosphorylation of STAT protein;GO:0043066//negative regulation of apoptotic process;GO:0043154//negative regulation of cysteine-type endopeptidase activity involved in apoptotic process;GO:0043388//positive regulation of DNA binding;GO:0043410//positive regulation of MAPK cascade;GO:0043491//protein kinase B signaling;GO:0043536//positive regulation of blood vessel endothelial cell migration;GO:0043568//positive regulation of insulin-like growth factor receptor signaling pathway;GO:0045428//regulation of nitric oxide biosynthetic process;GO:0045445//myoblast differentiation;GO:0045471//response to ethanol;GO:0045600//positive regulation of fat cell differentiation;GO:0045669//positive regulation of osteoblast differentiation;GO:0045725//positive regulation of glycogen biosynthetic process;GO:0045821//positive regulation of glycolytic process;GO:0045840//positive regulation of mitotic nuclear division;GO:0045893//positive regulation of transcription, DNA-templated;GO:0045944//positive regulation of transcription from RNA polymerase II promoter;GO:0046326//positive regulation of glucose import;GO:0046579//positive regulation of Ras protein signal transduction;GO:0048009//insulin-like growth factor receptor signaling pathway;GO:0048015//phosphatidylinositol-mediated signaling;GO:0048146//positive regulation of fibroblast proliferation;GO:0048286//lung alveolus development;GO:0048468//cell development;GO:0048661//positive regulation of smooth muscle cell proliferation;GO:0048754//branching morphogenesis of an epithelial tube;GO:0048839//inner ear development;GO:0050650//chondroitin sulfate proteoglycan biosynthetic process;GO:0050679//positive regulation of epithelial cell proliferation;GO:0050714//positive regulation of protein secretion;GO:0050731//positive regulation of peptidyl-tyrosine phosphorylation;GO:0050732//negative regulation of peptidyl-tyrosine phosphorylation;GO:0050821//protein stabilization;GO:0050974//detection of mechanical stimulus involved in sensory perception;GO:0051246//regulation of protein metabolic process;GO:0051450//myoblast proliferation;GO:0051897//positive regulation of protein kinase B signaling;GO:0051924//regulation of calcium ion transport;GO:0060252//positive regulation of glial cell proliferation;GO:0060283//negative regulation of oocyte development;GO:0060426//lung vasculature development;GO:0060463//lung lobe morphogenesis;GO:0060509//Type I pneumocyte differentiation;GO:0060510//Type II pneumocyte differentiation;GO:0060527//prostate epithelial cord arborization involved in prostate glandular acinus morphogenesis;GO:0060736//prostate gland growth;GO:0060740//prostate gland epithelium morphogenesis;GO:0060741//prostate gland stromal morphogenesis;GO:0060766//negative regulation of androgen receptor signaling pathway;GO:0061051//positive regulation of cell growth involved in cardiac muscle cell development;GO:0070371//ERK1 and ERK2 cascade;GO:0070373//negative regulation of ERK1 and ERK2 cascade;GO:0070886//positive regulation of calcineurin-NFAT signaling cascade;GO:0071333//cellular response to glucose stimulus;GO:0071902//positive regulation of protein serine/threonine kinase activity;GO:0090031//positive regulation of steroid hormone biosynthetic process;GO:0090201//negative regulation of release of cytochrome c from mitochondria;GO:0097192//extrinsic apoptotic signaling pathway in absence of ligand;GO:1900142//negative regulation of oligodendrocyte apoptotic process;GO:1901215//negative regulation of neuron death;GO:1902430//negative regulation of beta-amyloid formation;GO:1904075//positive regulation of trophectodermal cell proliferation;GO:1904193//negative regulation of cholangiocyte apoptotic process;GO:1904646//cellular response to beta-amyloid;GO:1904707//positive regulation of vascular smooth muscle cell proliferation;GO:1990314//cellular response to insulin-like growth factor stimulus;GO:2000288//positive regulation of myoblast proliferation;GO:2000679//positive regulation of transcription regulatory region DNA binding;GO:2001237//negative regulation of extrinsic apoptotic signaling pathway
0ENSMUSG00000022762Ncam23.73701641407951.216811498723591.6187812089395700neural cell adhesion molecule 2 [Source:MGI Symbol;Acc:MGI:97282]Environmental Information Processing;Human DiseasesSignaling molecules and interaction;Neurodegenerative diseasesko04514//Cell adhesion molecules (CAMs);ko05020//Prion diseasesK06491;K06491GO:0005886//plasma membrane;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0016604//nuclear body;GO:0030424//axon;GO:0031225//anchored component of membraneGO:0005515//protein binding;GO:0042802//identical protein bindingGO:0007155//cell adhesion;GO:0007413//axonal fasciculation;GO:0007608//sensory perception of smell
0ENSMUSG00000020593Lpin19.271391321106633.559611436169091.3810660880934500lipin 1 [Source:MGI Symbol;Acc:MGI:1891340]Metabolism;Environmental Information Processing;Metabolism;MetabolismGlobal and overview maps;Signal transduction;Lipid metabolism;Lipid metabolismko01100//Metabolic pathways;ko04150//mTOR signaling pathway;ko00564//Glycerophospholipid metabolism;ko00561//Glycerolipid metabolismK15728;K15728;K15728;K15728-GO:0008195//phosphatidate phosphatase activityGO:0006357//regulation of transcription from RNA polymerase II promoter;GO:0009062//fatty acid catabolic process;GO:0016311//dephosphorylation;GO:0019432//triglyceride biosynthetic process;GO:0031100//organ regeneration;GO:0032869//cellular response to insulin stimulus
0ENSMUSG00000026365Cfh20.10880676805228.099822780214771.3118652283673400complement component factor h [Source:MGI Symbol;Acc:MGI:88385]Organismal Systems;Human DiseasesImmune system;Infectious diseasesko04610//Complement and coagulation cascades;ko05150//Staphylococcus aureus infectionK04004;K04004GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005634//nucleus;GO:0005737//cytoplasm;GO:0005886//plasma membraneGO:0001851//complement component C3b binding;GO:0005515//protein binding;GO:0008201//heparin binding;GO:0043395//heparan sulfate proteoglycan bindingGO:0002376//immune system process;GO:0006956//complement activation;GO:0006957//complement activation, alternative pathway;GO:0030449//regulation of complement activation;GO:0045087//innate immune response;GO:0045919//positive regulation of cytolysis;GO:1903659//regulation of complement-dependent cytotoxicity
0ENSMUSG00000074882Cyp2c687.714517430316363.340023358507981.2077177218197900cytochrome P450, family 2, subfamily c, polypeptide 68 [Source:MGI Symbol;Acc:MGI:3612287]Metabolism;Organismal Systems;Organismal Systems;Human Diseases;Metabolism;Metabolism;Metabolism;MetabolismGlobal and overview maps;Nervous system;Sensory system;Cancers;Metabolism of cofactors and vitamins;Lipid metabolism;Lipid metabolism;Lipid metabolismko01100//Metabolic pathways;ko04726//Serotonergic synapse;ko04750//Inflammatory mediator regulation of TRP channels;ko05204//Chemical carcinogenesis;ko00830//Retinol metabolism;ko00590//Arachidonic acid metabolism;ko00140//Steroid hormone biosynthesis;ko00591//Linoleic acid metabolismK07413;K07413;K07413;K07413;K07413;K07413;K07413;K07413GO:0005737//cytoplasm;GO:0043231//intracellular membrane-bounded organelleGO:0004497//monooxygenase activity;GO:0005506//iron ion binding;GO:0008392//arachidonic acid epoxygenase activity;GO:0008395//steroid hydroxylase activity;GO:0016491//oxidoreductase activity;GO:0016705//oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen;GO:0016712//oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, reduced flavin or flavoprotein as one donor, and incorporation of one atom of oxygen;GO:0020037//heme binding;GO:0046872//metal ion bindingGO:0006082//organic acid metabolic process;GO:0006805//xenobiotic metabolic process;GO:0019373//epoxygenase P450 pathway;GO:0042738//exogenous drug catabolic process;GO:0055114//oxidation-reduction process
0ENSMUSG00000028603Scp28.137137295084663.592606198937231.1794905290148400sterol carrier protein 2, liver [Source:MGI Symbol;Acc:MGI:98254]Metabolism;Organismal Systems;Cellular Processes;MetabolismGlobal and overview maps;Endocrine system;Transport and catabolism;Lipid metabolismko01100//Metabolic pathways;ko03320//PPAR signaling pathway;ko04146//Peroxisome;ko00120//Primary bile acid biosynthesisK08764;K08764;K08764;K08764GO:0005634//nucleus;GO:0005654//nucleoplasm;GO:0005737//cytoplasm;GO:0005739//mitochondrion;GO:0005777//peroxisome;GO:0005782//peroxisomal matrix;GO:0005783//endoplasmic reticulum;GO:0005829//cytosol;GO:0016020//membrane;GO:0019898//extrinsic component of membrane;GO:0031315//extrinsic component of mitochondrial outer membrane;GO:0032991//macromolecular complex;GO:0043231//intracellular membrane-bounded organelleGO:0000062//fatty-acyl-CoA binding;GO:0003824//catalytic activity;GO:0003988//acetyl-CoA C-acyltransferase activity;GO:0005102//receptor binding;GO:0005515//protein binding;GO:0008289//lipid binding;GO:0008525//phosphatidylcholine transporter activity;GO:0008526//phosphatidylinositol transporter activity;GO:0015485//cholesterol binding;GO:0016740//transferase activity;GO:0016746//transferase activity, transferring acyl groups;GO:0016747//transferase activity, transferring acyl groups other than amino-acyl groups;GO:0033814//propanoyl-CoA C-acyltransferase activity;GO:0036042//long-chain fatty acyl-CoA binding;GO:0042803//protein homodimerization activity;GO:0046982//protein heterodimerization activity;GO:0050632//propionyl-CoA C2-trimethyltridecanoyltransferase activity;GO:0070538//oleic acid binding;GO:1904121//phosphatidylethanolamine transporter activityGO:0006637//acyl-CoA metabolic process;GO:0006694//steroid biosynthetic process;GO:0006701//progesterone biosynthetic process;GO:0006869//lipid transport;GO:0007031//peroxisome organization;GO:0007568//aging;GO:0010893//positive regulation of steroid biosynthetic process;GO:0015914//phospholipid transport;GO:0015918//sterol transport;GO:0032385//positive regulation of intracellular cholesterol transport;GO:0032959//inositol trisphosphate biosynthetic process;GO:0043065//positive regulation of apoptotic process;GO:0045542//positive regulation of cholesterol biosynthetic process;GO:0045940//positive regulation of steroid metabolic process;GO:0072659//protein localization to plasma membrane;GO:1901373//lipid hydroperoxide transport;GO:1904109//positive regulation of cholesterol import
0ENSMUSG00000031767Nudt79.526958017485224.327945334593151.1383333544898700nudix (nucleoside diphosphate linked moiety X)-type motif 7 [Source:MGI Symbol;Acc:MGI:1914778]Cellular ProcessesTransport and catabolismko04146//PeroxisomeK17879GO:0005777//peroxisomeGO:0000287//magnesium ion binding;GO:0003723//RNA binding;GO:0003986//acetyl-CoA hydrolase activity;GO:0016787//hydrolase activity;GO:0016818//hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides;GO:0030145//manganese ion binding;GO:0030515//snoRNA binding;GO:0046872//metal ion bindingGO:0009132//nucleoside diphosphate metabolic process;GO:0015938//coenzyme A catabolic process;GO:0046356//acetyl-CoA catabolic process;GO:0050873//brown fat cell differentiation
0ENSMUSG00000022875Kng117.47352871888328.212664288365911.089248754067100kininogen 1 [Source:MGI Symbol;Acc:MGI:1097705]Organismal SystemsImmune systemko04610//Complement and coagulation cascadesK03898GO:0005576//extracellular region;GO:0005615//extracellular space;GO:0005623//cellGO:0004869//cysteine-type endopeptidase inhibitor activity;GO:0005102//receptor binding;GO:0030414//peptidase inhibitor activityGO:0006954//inflammatory response;GO:0007204//positive regulation of cytosolic calcium ion concentration;GO:0007596//blood coagulation;GO:0007599//hemostasis;GO:0010466//negative regulation of peptidase activity;GO:0010951//negative regulation of endopeptidase activity;GO:0030195//negative regulation of blood coagulation;GO:0042311//vasodilation
0ENSMUSG00000052974Cyp2f26.529611396062353.166311272096011.0441940434642300cytochrome P450, family 2, subfamily f, polypeptide 2 [Source:MGI Symbol;Acc:MGI:88608]MetabolismXenobiotics biodegradation and metabolismko00980//Metabolism of xenobiotics by cytochrome P450K07416GO:0005737//cytoplasm;GO:0005783//endoplasmic reticulum;GO:0005789//endoplasmic reticulum membrane;GO:0016020//membrane;GO:0031090//organelle membrane;GO:0043231//intracellular membrane-bounded organelleGO:0004497//monooxygenase activity;GO:0005506//iron ion binding;GO:0008392//arachidonic acid epoxygenase activity;GO:0008395//steroid hydroxylase activity;GO:0016491//oxidoreductase activity;GO:0016705//oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen;GO:0016712//oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, reduced flavin or flavoprotein as one donor, and incorporation of one atom of oxygen;GO:0019825//oxygen binding;GO:0020037//heme binding;GO:0046872//metal ion bindingGO:0006082//organic acid metabolic process;GO:0006805//xenobiotic metabolic process;GO:0009636//response to toxic substance;GO:0018931//naphthalene metabolic process;GO:0018979//trichloroethylene metabolic process;GO:0019373//epoxygenase P450 pathway;GO:0042738//exogenous drug catabolic process;GO:0055114//oxidation-reduction process
0ENSMUSG00000016756Cmah19.259700502295610.34578107406110.89654270048092900cytidine monophospho-N-acetylneuraminic acid hydroxylase [Source:MGI Symbol;Acc:MGI:103227]MetabolismCarbohydrate metabolismko00520//Amino sugar and nucleotide sugar metabolismK08080GO:0005737//cytoplasm;GO:0005783//endoplasmic reticulumGO:0016491//oxidoreductase activity;GO:0030338//CMP-N-acetylneuraminate monooxygenase activity;GO:0046872//metal ion binding;GO:0051536//iron-sulfur cluster binding;GO:0051537//2 iron, 2 sulfur cluster bindingGO:0006054//N-acetylneuraminate metabolic process;GO:0046381//CMP-N-acetylneuraminate metabolic process;GO:0055114//oxidation-reduction process
0ENSMUSG00000052520Cyp2j518.908220190540310.5365415233420.84361217079849500cytochrome P450, family 2, subfamily j, polypeptide 5 [Source:MGI Symbol;Acc:MGI:1270149]Metabolism;Organismal Systems;Organismal Systems;Metabolism;Organismal Systems;MetabolismGlobal and overview maps;Nervous system;Sensory system;Lipid metabolism;Endocrine system;Lipid metabolismko01100//Metabolic pathways;ko04726//Serotonergic synapse;ko04750//Inflammatory mediator regulation of TRP channels;ko00590//Arachidonic acid metabolism;ko04913//Ovarian Steroidogenesis;ko00591//Linoleic acid metabolismK07418;K07418;K07418;K07418;K07418;K07418GO:0005737//cytoplasm;GO:0005783//endoplasmic reticulum;GO:0005789//endoplasmic reticulum membrane;GO:0016020//membrane;GO:0016021//integral component of membrane;GO:0031090//organelle membrane;GO:0043231//intracellular membrane-bounded organelleGO:0004497//monooxygenase activity;GO:0005506//iron ion binding;GO:0008395//steroid hydroxylase activity;GO:0016491//oxidoreductase activity;GO:0016705//oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen;GO:0016712//oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, reduced flavin or flavoprotein as one donor, and incorporation of one atom of oxygen;GO:0020037//heme binding;GO:0046872//metal ion binding;GO:0070330//aromatase activityGO:0001990//regulation of systemic arterial blood pressure by hormone;GO:0001998//angiotensin mediated vasoconstriction involved in regulation of systemic arterial blood pressure;GO:0006082//organic acid metabolic process;GO:0006805//xenobiotic metabolic process;GO:0042738//exogenous drug catabolic process;GO:0055114//oxidation-reduction process;GO:0097254//renal tubular secretion;GO:2000863//positive regulation of estrogen secretion
0ENSMUSG000000861419030622O22Rik13.50017265314518.259742662240180.70880911862394400RIKEN cDNA 9030622O22 gene [Source:MGI Symbol;Acc:MGI:1918820]-------
0ENSMUSG00000024424Ttc39c26.35135163814917.14994768188480.61967278854385400tetratricopeptide repeat domain 39C [Source:MGI Symbol;Acc:MGI:1919997]-----GO:0005515//protein binding-
0ENSMUSG00000022353Mtss121.728963671479214.35057890011210.59851043293654700MTSS I-BAR domain containing 1 [Source:MGI Symbol;Acc:MGI:2384818]----GO:0015629//actin cytoskeletonGO:0003779//actin binding;GO:0003785//actin monomer binding;GO:0005102//receptor binding;GO:0042802//identical protein bindingGO:0007009//plasma membrane organization;GO:0030036//actin cytoskeleton organization

5.2 上调基因表达分布

图形是比表格更优秀的数据呈现形式。我们使用热图、tSNE图、密度分布图、小提琴图和气泡图来可视化基因在细胞和各个细胞亚群的表达分布情况。(选择亚群上调top5的基因用于绘图)

Fig 5-2-1 标记基因表达热图

Fig 5-2-2 标记基因表达分布气泡图

上调基因表达分布图只展示其中5个基因,其他top5的基因请浏览文件夹3.MarkerGene/Plots/ExpPlot

  • 1810064F22Rik
  • Abcc4
  • Adgre1
  • Adgrl4
  • Adgrv1

Fig 5-2-3 标记基因表达分布图


上调基因表达分布密度图只展示其中5个基因,其他top5的基因请浏览文件夹3.MarkerGene/Plots/DensityPlot

  • 1810064F22Rik
  • Abcc4
  • Adgre1
  • Adgrl4
  • Adgrv1

Fig 5-2-4 标记基因表达分布密度图


上调基因表达分布小提琴图只展示其中5个基因,其他top5的基因请浏览文件夹3.MarkerGene/Plots/ViolinPlot

  • 1810064F22Rik
  • Abcc4
  • Adgre1
  • Adgrl4
  • Adgrv1

Fig 5-2-5 标记基因表达分布小提琴图


5.3 GO富集分析

       Gene Ontology(简称GO)是一个国际标准化的基因功能分类体系,提供了一套动态更新的标准词汇表(controlled vocabulary)来全面描述生物体中基因和基因产物的属性。GO总共有三个ontology(本体),分别描述基因的分子功能(molecular function)、细胞组分(cellular component)、参与的生物过程(biological process)。GO的基本单位是term(词条、节点),每个term都对应一个属性。 GO功能分析一方面给出基因的GO功能分类注释;另一方面给出基因的GO功能显著性富集分析。 首先,我们将基因向GO数据库(http://www.geneontology.org/)的各term映射,并计算每个term的基因数,从而得到具有某个GO功能的基因列表及基因数目统计。然后应用超几何检验,找出与整个基因组背景相比,在基因中显著富集的GO条目。

基因集 细胞组分 分子功能 生物学过程 GO 分类表
Cluster_0 Cluster_0.C.html Cluster_0.F.html Cluster_0.P.html Cluster_0.Level2.xls
Cluster_1 Cluster_1.C.html Cluster_1.F.html Cluster_1.P.html Cluster_1.Level2.xls
Cluster_2 Cluster_2.C.html Cluster_2.F.html Cluster_2.P.html Cluster_2.Level2.xls
Cluster_3 Cluster_3.C.html Cluster_3.F.html Cluster_3.P.html Cluster_3.Level2.xls
Cluster_4 Cluster_4.C.html Cluster_4.F.html Cluster_4.P.html Cluster_4.Level2.xls
Cluster_5 Cluster_5.C.html Cluster_5.F.html Cluster_5.P.html Cluster_5.Level2.xls
Cluster_6 Cluster_6.C.html Cluster_6.F.html Cluster_6.P.html Cluster_6.Level2.xls
Cluster_7 Cluster_7.C.html Cluster_7.F.html Cluster_7.P.html Cluster_7.Level2.xls
Cluster_8 Cluster_8.C.html Cluster_8.F.html Cluster_8.P.html Cluster_8.Level2.xls
Cluster_9 Cluster_9.C.html Cluster_9.F.html Cluster_9.P.html Cluster_9.Level2.xls
Cluster_10 Cluster_10.C.html Cluster_10.F.html Cluster_10.P.html Cluster_10.Level2.xls
Cluster_11 Cluster_11.C.html Cluster_11.F.html Cluster_11.P.html Cluster_11.Level2.xls
Cluster_12 Cluster_12.C.html Cluster_12.F.html Cluster_12.P.html Cluster_12.Level2.xls
Cluster_13 Cluster_13.C.html Cluster_13.F.html Cluster_13.P.html Cluster_13.Level2.xls
Cluster_14 Cluster_14.C.html Cluster_14.F.html Cluster_14.P.html Cluster_14.Level2.xls
Cluster_15 Cluster_15.C.html Cluster_15.F.html Cluster_15.P.html Cluster_15.Level2.xls
Cluster_16 Cluster_16.C.html Cluster_16.F.html Cluster_16.P.html Cluster_16.Level2.xls

GO富集圈图:(第一圈:富集前20的GOterm,圈外为基因数目的坐标尺。不同的颜色代表不同的Ontology; 第二圈:背景基因中该GOterm的数目以及Q值。基因越多条形越长,Q值越小颜色越红; 第三圈:该GOterm差异基因数量 第四圈:各GOterm的RichFactor值(该GOterm中差异数量除以所有数量),背景网格线,每一格代表0.1)

  • Cluster_0 富集圈图
  • Cluster_1 富集圈图
  • Cluster_2 富集圈图
  • Cluster_3 富集圈图
  • Cluster_4 富集圈图
  • Cluster_5 富集圈图
  • Cluster_6 富集圈图
  • Cluster_7 富集圈图
  • Cluster_8 富集圈图
  • Cluster_9 富集圈图
  • Cluster_10 富集圈图
  • Cluster_11 富集圈图
  • Cluster_12 富集圈图
  • Cluster_13 富集圈图
  • Cluster_14 富集圈图
  • Cluster_15 富集圈图
  • Cluster_16 富集圈图

Fig 5-3-1 GO 富集圈图


GO 富集分类柱状图:(横坐标为二级GOterm,纵坐标为该term里的基因数量,不同颜色表色不同类型GOterm)

  • Cluster_0
  • Cluster_1
  • Cluster_2
  • Cluster_3
  • Cluster_4
  • Cluster_5
  • Cluster_6
  • Cluster_7
  • Cluster_8
  • Cluster_9
  • Cluster_10
  • Cluster_11
  • Cluster_12
  • Cluster_13
  • Cluster_14
  • Cluster_15
  • Cluster_16

Fig 5-3-2 GO富集分类柱状图


GO富集气泡图:(利用Q值最小的前20个GOterm来作图,纵坐标为GOterm,横坐标为富集因子(该GOterm中差异数量除以所有数量),大小表示数量多少,颜色越红Q值越小)");

  • Cluster_0.C 气泡图
  • Cluster_0.F 气泡图
  • Cluster_0.P 气泡图
  • Cluster_1.C 气泡图
  • Cluster_1.F 气泡图
  • Cluster_1.P 气泡图
  • Cluster_2.C 气泡图
  • Cluster_2.F 气泡图
  • Cluster_2.P 气泡图
  • Cluster_3.C 气泡图
  • Cluster_3.F 气泡图
  • Cluster_3.P 气泡图
  • Cluster_4.C 气泡图
  • Cluster_4.F 气泡图
  • Cluster_4.P 气泡图
  • Cluster_5.C 气泡图
  • Cluster_5.F 气泡图
  • Cluster_5.P 气泡图
  • Cluster_6.C 气泡图
  • Cluster_6.F 气泡图
  • Cluster_6.P 气泡图
  • Cluster_7.C 气泡图
  • Cluster_7.F 气泡图
  • Cluster_7.P 气泡图
  • Cluster_8.C 气泡图
  • Cluster_8.F 气泡图
  • Cluster_8.P 气泡图
  • Cluster_9.C 气泡图
  • Cluster_9.F 气泡图
  • Cluster_9.P 气泡图
  • Cluster_10.C 气泡图
  • Cluster_10.F 气泡图
  • Cluster_10.P 气泡图
  • Cluster_11.C 气泡图
  • Cluster_11.F 气泡图
  • Cluster_11.P 气泡图
  • Cluster_12.C 气泡图
  • Cluster_12.F 气泡图
  • Cluster_12.P 气泡图
  • Cluster_13.C 气泡图
  • Cluster_13.F 气泡图
  • Cluster_13.P 气泡图
  • Cluster_14.C 气泡图
  • Cluster_14.F 气泡图
  • Cluster_14.P 气泡图
  • Cluster_15.C 气泡图
  • Cluster_15.F 气泡图
  • Cluster_15.P 气泡图
  • Cluster_16.C 气泡图
  • Cluster_16.F 气泡图
  • Cluster_16.P 气泡图

Fig 5-3-3 GO富集气泡图


GO富集条形图:(利用Q值最小的前20个GOterm来作图,纵坐标为GOterm,横坐标为该GOterm数目占所有差异数目的百分比,颜色越深Q值越小,柱子上的数值为该GOterm数量及Q值");

  • Cluster_0.C 富集柱形图
  • Cluster_0.F 富集柱形图
  • Cluster_0.P 富集柱形图
  • Cluster_1.C 富集柱形图
  • Cluster_1.F 富集柱形图
  • Cluster_1.P 富集柱形图
  • Cluster_2.C 富集柱形图
  • Cluster_2.F 富集柱形图
  • Cluster_2.P 富集柱形图
  • Cluster_3.C 富集柱形图
  • Cluster_3.F 富集柱形图
  • Cluster_3.P 富集柱形图
  • Cluster_4.C 富集柱形图
  • Cluster_4.F 富集柱形图
  • Cluster_4.P 富集柱形图
  • Cluster_5.C 富集柱形图
  • Cluster_5.F 富集柱形图
  • Cluster_5.P 富集柱形图
  • Cluster_6.C 富集柱形图
  • Cluster_6.F 富集柱形图
  • Cluster_6.P 富集柱形图
  • Cluster_7.C 富集柱形图
  • Cluster_7.F 富集柱形图
  • Cluster_7.P 富集柱形图
  • Cluster_8.C 富集柱形图
  • Cluster_8.F 富集柱形图
  • Cluster_8.P 富集柱形图
  • Cluster_9.C 富集柱形图
  • Cluster_9.F 富集柱形图
  • Cluster_9.P 富集柱形图
  • Cluster_10.C 富集柱形图
  • Cluster_10.F 富集柱形图
  • Cluster_10.P 富集柱形图
  • Cluster_11.C 富集柱形图
  • Cluster_11.F 富集柱形图
  • Cluster_11.P 富集柱形图
  • Cluster_12.C 富集柱形图
  • Cluster_12.F 富集柱形图
  • Cluster_12.P 富集柱形图
  • Cluster_13.C 富集柱形图
  • Cluster_13.F 富集柱形图
  • Cluster_13.P 富集柱形图
  • Cluster_14.C 富集柱形图
  • Cluster_14.F 富集柱形图
  • Cluster_14.P 富集柱形图
  • Cluster_15.C 富集柱形图
  • Cluster_15.F 富集柱形图
  • Cluster_15.P 富集柱形图
  • Cluster_16.C 富集柱形图
  • Cluster_16.F 富集柱形图
  • Cluster_16.P 富集柱形图

Fig 5-3-4 GO富集条形图


5.4 KO富集分析

       在生物体内,不同基因相互协调行使其生物学,基于Pathway的分析有助于更进一步了解基因的生物学功能。KEGG是有关Pathway的主要公共数据库。 Pathway显著性富集分析以KEGG Pathway为单位,应用超几何检验,找出与整个基因组背景相比,在基因中显著性富集的Pathway。通过Pathway显著性富集能确定基因参与的最主要生化代谢途径和信号转导途径。

所有趋势pathway统计如下所示:

Tab 5-4-1 所有趋势pathway统计表
PathwayPathway_IDKEGG_A_classKEGG_B_classCluster_0(157)Cluster_0_PvalueCluster_0_QvalueCluster_1(121)Cluster_1_PvalueCluster_1_QvalueCluster_10(86)Cluster_10_PvalueCluster_10_QvalueCluster_11(202)Cluster_11_PvalueCluster_11_QvalueCluster_12(242)Cluster_12_PvalueCluster_12_QvalueCluster_13(103)Cluster_13_PvalueCluster_13_QvalueCluster_14(284)Cluster_14_PvalueCluster_14_QvalueCluster_15(176)Cluster_15_PvalueCluster_15_QvalueCluster_16(219)Cluster_16_PvalueCluster_16_QvalueCluster_2(200)Cluster_2_PvalueCluster_2_QvalueCluster_3(152)Cluster_3_PvalueCluster_3_QvalueCluster_4(124)Cluster_4_PvalueCluster_4_QvalueCluster_5(212)Cluster_5_PvalueCluster_5_QvalueCluster_6(170)Cluster_6_PvalueCluster_6_QvalueCluster_7(507)Cluster_7_PvalueCluster_7_QvalueCluster_8(77)Cluster_8_PvalueCluster_8_QvalueCluster_9(108)Cluster_9_PvalueCluster_9_Qvalue
2-Oxocarboxylic acid metabolismko01210MetabolismGlobal and overview maps20.053348172.405319e-0110.25336857.568349e-0101110.38756889.851618e-0110.44507967.601418e-0101110.49993999.997430e-0101101120.081608253.498268e-0110.30772317.151241e-0101120.090191794.322399e-0110.33754069.380489e-0120.34479636.277908e-01011011
ABC transportersko02010Environmental Information ProcessingMembrane transport30.065432692.703728e-0130.03445912.190614e-0120.091722810.30870130620.33702119.851618e-0101101101120.27989625.008669e-0101120.33265947.602019e-0120.22700826.568130e-0110.52391299.463171e-0170.00022254364.037577e-0350.0031487742.894450e-0260.076811742.129780e-01011011
AGE-RAGE signaling pathway in diabetic complicationsko04933Human DiseasesEndocrine and metabolic diseases20.59231158.945946e-0110.78685559.699723e-0130.094096120.30870130620.72766459.851618e-0140.36383696.746749e-0150.0094950293.708436e-0220.87834619.997430e-0140.18126153.911442e-0120.76804349.938591e-0140.24456896.571154e-0110.85711119.471165e-0101140.27795147.992310e-0110.8868039.983327e-01140.0043010942.475158e-0220.25305780.414569528220.39610310.5706351898
AMPK signaling pathwayko04152Environmental Information ProcessingSignal transduction70.011321161.077971e-0110.85335359.941709e-0120.39005360.64548693190.0042599396.191834e-0240.52619218.027681e-0130.2174573.601088e-0140.65329999.997430e-0120.76598968.249119e-0130.67021899.938591e-01100.0011080141.314306e-0260.031497172.112163e-0180.0006857061.660932e-0250.23676657.424530e-0150.12685165.512279e-01100.26785495.410314e-01011011
Acute myeloid leukemiako05221Human DiseasesCancers10.74951999.706835e-0110.65509159.699723e-0120.17158390.41734344350.030683672.371011e-0110.88300599.404372e-0130.060108681.439150e-0120.71310969.997430e-0130.19677634.072414e-0110.85612569.938591e-0110.82939989.945835e-0120.38224337.781381e-0120.29317127.890287e-0130.28004557.992310e-01011100.011654985.642490e-0210.49109360.656074926640.014237690.0759299959
Adherens junctionko04520Cellular ProcessesCellular community - eukaryotes20.42538047.763192e-0101150.0011185960.01494124760.010430581.108249e-01111.127976e-054.221853e-0474.074162e-057.027929e-0430.4879829.997430e-0150.022618621.055536e-0120.60281989.938591e-0130.2791056.826972e-0120.40935557.839524e-0130.10558024.603297e-0130.30938488.271973e-0120.46598019.983327e-01130.00063068945.829099e-0375.978495e-060.000992430250.0030630770.0284366062
Adipocytokine signaling pathwayko04920Organismal SystemsEndocrine system40.051589782.405319e-0120.29480637.686886e-0101160.0091486721.036289e-0110.89009179.435683e-0110.60601176.817632e-0110.92560959.997430e-0101120.58760689.938591e-0160.0087326496.162755e-0230.15461625.184190e-0130.099162774.503642e-0150.04072822.248905e-0120.45164959.983327e-01100.014129776.245768e-0201120.25155690.4541999583
Adrenergic signaling in cardiomyocytesko04261Organismal SystemsCirculatory system20.78278789.706835e-0120.64911169.699723e-0130.20469170.44661014330.71478169.851618e-0180.072847422.848660e-0150.038967061.052553e-0150.59108949.997430e-0150.21510124.338482e-0120.91147169.938591e-0140.49362098.673859e-0130.52373738.212434e-0120.66234629.463171e-0130.74501289.968878e-0120.81887339.983327e-0190.57653198.333755e-0130.16365110.328949200040.13260480.3237653250
African trypanosomiasisko05143Human DiseasesInfectious diseases10.48344328.208427e-0101101101110.64077228.053797e-0101101120.16558493.717850e-0101110.5699369.322794e-0101101101110.51123059.983327e-0120.63229358.421376e-0120.040400780.139719364210.36427030.5424225687
Alanine, aspartate and glutamate metabolismko00250MetabolismAmino acid metabolism30.033656382.007375e-0110.42547439.016003e-0101110.60546059.851618e-0110.67276818.236694e-0101120.37096889.997430e-0101101182.330427e-067.195193e-0501140.0022700672.911027e-0230.07041763.805547e-0120.1797026.607504e-0120.67952098.730479e-01011011

基因集 Pathway 富集结果 Pathway 注释表
Cluster_0 Cluster_0.htm Cluster_0.path.xls
Cluster_1 Cluster_1.htm Cluster_1.path.xls
Cluster_2 Cluster_2.htm Cluster_2.path.xls
Cluster_3 Cluster_3.htm Cluster_3.path.xls
Cluster_4 Cluster_4.htm Cluster_4.path.xls
Cluster_5 Cluster_5.htm Cluster_5.path.xls
Cluster_6 Cluster_6.htm Cluster_6.path.xls
Cluster_7 Cluster_7.htm Cluster_7.path.xls
Cluster_8 Cluster_8.htm Cluster_8.path.xls
Cluster_9 Cluster_9.htm Cluster_9.path.xls
Cluster_10 Cluster_10.htm Cluster_10.path.xls
Cluster_11 Cluster_11.htm Cluster_11.path.xls
Cluster_12 Cluster_12.htm Cluster_12.path.xls
Cluster_13 Cluster_13.htm Cluster_13.path.xls
Cluster_14 Cluster_14.htm Cluster_14.path.xls
Cluster_15 Cluster_15.htm Cluster_15.path.xls
Cluster_16 Cluster_16.htm Cluster_16.path.xls

KO富集圈图:(第一圈:富集前20的pathway,圈外为基因数目的坐标尺。不同的颜色代表不同的A class; 第二圈:背景基因中该pathway的数目以及Q值。基因越多条形越长,Q值越小颜色越红; 第三圈:该pathway差异基因数量 第四圈:各pathway的RichFactor值(该pathway中差异数量除以所有数量),背景网格线,每一格代表0.1)

  • Cluster_0 富集圈图
  • Cluster_1 富集圈图
  • Cluster_2 富集圈图
  • Cluster_3 富集圈图
  • Cluster_4 富集圈图
  • Cluster_5 富集圈图
  • Cluster_6 富集圈图
  • Cluster_7 富集圈图
  • Cluster_8 富集圈图
  • Cluster_9 富集圈图
  • Cluster_10 富集圈图
  • Cluster_11 富集圈图
  • Cluster_12 富集圈图
  • Cluster_13 富集圈图
  • Cluster_14 富集圈图
  • Cluster_15 富集圈图
  • Cluster_16 富集圈图

Fig 5-4-1 KO 富集圈图


KO富集气泡图:(利用Q值最小的前20个pathway来作图,纵坐标为pathway,横坐标为富集因子(该pathway中差异数量除以所有数量),大小表示数量多少,颜色越红Q值越小)");

  • Cluster_0
  • Cluster_1
  • Cluster_2
  • Cluster_3
  • Cluster_4
  • Cluster_5
  • Cluster_6
  • Cluster_7
  • Cluster_8
  • Cluster_9
  • Cluster_10
  • Cluster_11
  • Cluster_12
  • Cluster_13
  • Cluster_14
  • Cluster_15
  • Cluster_16

Fig 5-4-2 KO富集气泡图


KO富集条形图:(利用Q值最小的前20个pathway来作图,纵坐标为pathway,横坐标为该pathway数目占所有差异数目的百分比,颜色越深Q值越小,柱子上的数值为该pathway数量及Q值");

  • Cluster_0 富集柱形图
  • Cluster_1 富集柱形图
  • Cluster_2 富集柱形图
  • Cluster_3 富集柱形图
  • Cluster_4 富集柱形图
  • Cluster_5 富集柱形图
  • Cluster_6 富集柱形图
  • Cluster_7 富集柱形图
  • Cluster_8 富集柱形图
  • Cluster_9 富集柱形图
  • Cluster_10 富集柱形图
  • Cluster_11 富集柱形图
  • Cluster_12 富集柱形图
  • Cluster_13 富集柱形图
  • Cluster_14 富集柱形图
  • Cluster_15 富集柱形图
  • Cluster_16 富集柱形图

Fig 5-4-3 KO富集条形图


5.5 上调基因蛋白质互作网络分析

通过string数据库[10],我们可以获得上调基因构建蛋白质互作关系信息(3.MarkerGene/String),然后利用Cytoscape构建蛋白质互作网络图。

String蛋白质互作调控网络图

注:依据系统配置及浏览器不同,如果标记基因数量过多该图可能不能正常加载,请使用桌面版Cytoscape软件


Cytoscape官方手册:http://manual.cytoscape.org/en/stable/index.html
Cytoscape使用教程:http://www.omicshare.com/class/home/index/classdetail?id=14

广州基迪奥生物科技有限公司




6 GSVA分析

基于传统的超几何检验的富集分析,往往需要用到显著差异基因集数据。当单个基因变化较为微弱时,基于传统富集分析得到结果可能会很少,甚至没有结果。GSVA分析(Gene Set Variation Analysis)[11]能够有效弥补传统富集分析对微效基因的有效信息挖据不足等问题,更为全面地对某一功能单位的调节作用进行解释。GSVA分析反映了某一个细胞亚群相对于所有细胞过表达的通路信息。GSVA原理如下:

我们对MSigDB数据库的八个数据集的每个通路进行GSVA分析

MSigDB数据库各数据集在各细胞富集程度表:

  • H
  • C1
  • C2
  • C3
  • C4
  • C5
  • C6
  • C7

Fig 6-0-1 各个亚群中的富集分数热图


广州基迪奥生物科技有限公司




7 细胞周期分析

7.1 细胞周期评估

常规样本中,仅干细胞、祖细胞、癌细胞等具备增殖分化潜力的细胞才具有细胞分裂的能力,即处于细胞周期中。我们通过周期蛋白在细胞中的表达情况对细胞所处的细胞周期进行评分,以此直观体现各个样本中的细胞分裂/增殖活性。

Fig 7-1-1 不同样本的细胞周期指数评估

7.2 细胞周期推断

每一个细胞周期都有特征表达的周期蛋白,通过每个细胞周期的特征周期蛋白[21],我们可以对细胞的各个细胞周期进行评分,通过评分,我们可以推测细胞所处的细胞周期:

Tab 7-2-1 各细胞的细胞周期表
CellsSamplesClustersG1SG2MCellCycle.ScorePhase
Control_AAACCCAAGAAGTATCControl3-0.0587845964814063-0.0154632076371087-0.0169571564744789-0.0281281203767242-0.0154632076371087non-cycling
Control_AAACCCAAGAGGGCGAControl1-0.00860483687417736-0.0322890945327648-0.0308494228933196-0.108124598212906-0.00860483687417736non-cycling
Control_AAACCCAAGCCATTGTControl3-0.01209429112929590.0355842192808410.0498732573098604-0.01360146214339620.0498732573098604G2
Control_AAACCCAAGTGCTCATControl4-0.136221876399368-0.00660768728117632-0.0121575619226615-0.00655686501751004-0.00655686501751004non-cycling
Control_AAACCCACACATTACGControl0-0.03743021315999290.05458192531344130.0276070181394694-0.03331662128041910.0545819253134413S
Control_AAACCCACACTCACTCControl3-0.04757001857335270.0281975610263254-0.003960838435927640.000102393257577910.0281975610263254S
Control_AAACCCAGTATAGCTCControl11-0.0243249522713944-0.0198546033798266-0.03471759134105630.004953524324009060.00495352432400906M
Control_AAACCCAGTCATGACTControl10-0.0969536176476925-0.02277710430872960.000184471258750277-0.02105564427434210.000184471258750277G2
Control_AAACCCAGTCCGGCATControl3-0.161162588698223-0.0539907637980259-0.0460324682277105-0.0475894211147426-0.0460324682277105non-cycling
Control_AAACCCAGTCTACTGAControl40.0201754196218080.0223676897765450.00273748406828021-0.02345670407405920.022367689776545S
Control_AAACCCAGTGGTTCTAControl1-0.134569355846460.1552233975785830.0240005500574801-0.01006051935795020.155223397578583S
Control_AAACCCAGTGTCCTAAControl6-0.1120936755027730.0199814692626372-0.0202568189255216-0.1048123210544370.0199814692626372S
Control_AAACCCATCACCACAAControl00.0973202952167085-0.03294300116845850.0262933603116692-0.0245261245611540.0973202952167085G1
Control_AAACCCATCCTCTCGAControl3-0.08158575741486710.0399115765832385-0.0164252360976582-0.01654886334532530.0399115765832385S
Control_AAACCCATCGAAATCCControl4-0.1277763256623230.0687253713588798-0.00470390901353283-0.02304733183031130.0687253713588798S
Control_AAACGAAAGATTGAGTControl00.00662842174196435-0.0391323625825285-0.0528332388410231-0.08144566641098390.00662842174196435G1
Control_AAACGAAAGGCTAACGControl4-0.0341706093177366-0.0371504014711734-0.04336186357689440.03735409071218240.0373540907121824M
Control_AAACGAACACCTATCCControl10.0159118755454719-0.042606472549617-0.0402880100639022-0.05531258221836270.0159118755454719G1
Control_AAACGAACATGAAGGCControl6-0.0284494260429285-0.06451558829326610.0108237958929478-0.0003956451173834640.0108237958929478G2
Control_AAACGAACATGTAACCControl3-0.0222209279500079-0.1028901103569420.01414193058109940.009985738301947210.0141419305810994G2

Fig 7-2-1 不同样本的不同细胞周期细胞的数量堆叠图 Fig 7-2-2 不同样本的不同细胞周期细胞的数量比例堆叠图

Fig 7-2-3 不同细胞周期细胞的tSNE分布

Fig 7-2-4 Seurat分群与细胞周期推断对应circos图

7.3 周期蛋白基因分布可视化

细胞周期推断是基于每个细胞周期的特征周期蛋白基因在细胞中的表达量,利用软件预测的结果。那么我们就可以通过观察细胞周期特征基因的表达分布,对预测结果进行初步验证,这个过程即为周期蛋白基因分布可视化。

周期蛋白基因:CellCycle.genes.xls

Tab 7-3-1 周期蛋白基因(前20行)
G1SG2M
Ccne1Abcc5Arl4aAhi1
Ccne2Asf1bAurkbAkirin2
Cdc25aAtad2Brd8Anln
Cdca7Brca1Bub3Anp32e
DtlCdkn2aipCasp3Arl6ip1
Ints8CenpqCcdc107Asxl1
Ivns1abpCrebzfCcna2Aurka
Mcm2DonsonCcnfBirc2
Mcm6Dscc1Cdc25cBirc5
NaspE2f8Cdca2Bub1
Plcxd1Exo1Cdca3Ccnb2
Skp2Ezh2Cdca8Cdc20
SlbpFen1Cdk1Cdc25b
UngHellsCdkn1bCdc27
Zranb2MastlCdkn2cCenpa
-Pkmyt1CenplCenpe
-Rbbp8Ckap2Cenpf
-Rfc2Ckap2lCep55
-Rrm2Dcaf7Cit
-Usp1Espl1Ckap5

热图可以同时展示大量基因在每个细胞中的表达量及其在细胞群体中的分布情况。使用热图展示周期蛋白基因在不同细胞周期的表达量,可以看到各时期特征蛋白基因相对集中表达在对应的时期中,是对细胞周期推断结果准确性的验证。

Fig 7-3-1 周期蛋白基因在不同细胞周期的表达量热图

广州基迪奥生物科技有限公司




8 个性化分析推荐

8.1 细胞亚群鉴定

细胞亚群鉴定是进行单细胞转录组分析的最基础一步,是赋予细胞数据以生物学意义的关键过程。细胞亚群鉴定主要借助marker基因在各个细胞亚群的表达情况来判断细胞亚群所属细胞类型。按照marker基因的查询方式,我们通常会遇到三种情况:

(1)人、小鼠常见组织的细胞类型注释。如今已经建立了许多数据库用以收集marker基因信息,例如Cell Marker(人、小鼠,http://bio-bigdata.hrbmu.edu.cn/CellMarker/),panglaoDB(人、小鼠,https://panglaodb.se/),MCA(小鼠,http://bis.zju.edu.cn/MCA/)。根据数据库内容,我们可以快速锁定组织类型和细胞类型,并获取相关的marker基因。

(2)人、小鼠罕见组织、稀有细胞类型及其他模式物种的细胞类型注释。这一类细胞通常没有成型的数据库可以快速查询。此时,我们需要从已有的单细胞文章、细胞生物学文章和分子生物学文章找寻相关细胞类型的marker基因。在找寻marker基因时,优先考虑荧光定量PCR(qPCR)和RNA荧光原位杂交(FISH)的结果,因为这些技术直接体现mRNA的表达水平,更容易在scRNA-seq数据中找到表达量分布情况;次级考虑蛋白免疫印迹(western blot)、流式细胞术(FAC)和免疫荧光(IF)的结果,因为蛋白水平和mRNA水平并不是完全同步的,可能出现蛋白高丰度但是mRNA低丰度的情况,使得scRNA-seq不具有相应marker基因的表达分布。

(3)人和模式物种的新细胞类型及非模式物种的细胞类型注释。这一类细胞通常缺乏前人的研究基础,无可直接利用的marker基因。为了完成细胞亚群注释,我们可以使用同源比对的方式将基因比对到具有marker基因信息的近缘物种上,然后使用同源比对得到的marker基因用于注释细胞亚群。当然,细胞类型最终是与细胞功能相关的,我们也可以通过细胞亚群上调基因的功能注释或富集的功能通路来确定细胞功能,并结合生物学背景推测细胞亚群所属的细胞类型。

8.2 拟时分析

细胞分化相关分析一直是研究人员广泛关注的问题,其与胚胎发育、组织修复、疾病发生等多个研究领域紧密相关。单细胞转录组的特征是获得了大量细胞的转录本“快照”,记录了样本中所有细胞的转录本,其中包含了多功能性较强的干细胞、过渡阶段的中间细胞和发育成熟的功能细胞,这为细胞分化相关分析提供了基础。

拟时分析通过分析关键基因的表达模式,将所有细胞按照发育时间的先后排布在拟时间轴上,模拟发育过程中的细胞分化过程。通过对细胞轨迹的分析,我们可以挖掘出细胞分化过程中经历的细胞类型变化、伴随发育过程变化的动态变化基因、祖细胞不同的分化命运等与生命发育息息相关的信息。

8.3 细胞周期分析

胚胎干细胞、肿瘤细胞、生殖细胞、植物根尖、芽尖等细胞除了具备多功能性的分化能力以外,还需具备自我增殖能力。而在增殖过程中,细胞进程必然涉及细胞周期。

细胞周期分析必然在细胞鉴定之后,对具有增殖能力的细胞进行细胞周期分析才具有切实的生物学意义。通过早期的细胞周期研究,我们已知了若干与细胞周期各个时期相关的周期蛋白基因,通过这些基因的表达情况,我们可以进一步推测细胞所处细胞周期。

对细胞周期的分析,既可以深入探索与细胞周期进行有关的新的标记基因,也可以侧面反映样本的细胞更新活性,对样本的表型做出关联解释。

细胞周期分析目前只能分析人和小鼠,其他物种可以同源比对,但存在一定误差。

8.4 WGCNA

单细胞转录组数据的特点是数据庞大,同一个细胞通常带有细胞类型、样本属性、表型特征等多级注释信息,导致单细胞转录组分析时可以进行比较方式多样而繁复。极高的复杂程度需求有效的简化方式。

权重基因共表达网络分析(weighted gene co-expression network analysis, WGCNA)可以将大量的基因简化成少量的具有相同表达模式的模块,并进一步找到与表型相关性最高的基因模块。这一分析极大得简化了数据挖掘过程,有利于从具有复杂样本设计和细胞组成的样本中快速锁定核心基因。

8.5 转录因子分析

转录因子是重要的基因调控元件,在外界刺激中,表达量一般优先发生变化,并进一步调控下游基因的表达完成对刺激的响应。所以,转录因子与靶基因之间具备潜在的共表达关系。

借助这一个基础,我们可以使用软件SCENIC将转录因子和靶基因构建为一个网络单位,通过对网络单位的表达活性的分析来研究不同细胞类型之间的转录调控差异。这种差异同时体现在转录因子的表达情况和转录因子的功能特性(靶基因的表达情况)上,对细胞的表型变化会有更加全面的解释度。

目前转录因子分析只能做人、小鼠和果蝇,其他物种的分析流程待开发。

8.6 细胞通讯分析

多细胞生命体的正常运转离不开多种细胞类型之间的有序合作,生命体的表型变化也不应该是由单一细胞类型的功能决定的。所以,为了获得更具有解释力的分子机制,我们往往需要从细胞间的分子信号交互去解释表型变化。

细胞通讯分析从细胞的配体-受体表达情况去推测细胞类型之间的互作关系。这些互作关系体现了下游细胞的激活、细胞信号转导和靶向细胞的杀伤,在宿主免疫、肿瘤微环境等领域都有广泛的应用前景。

目前细胞通讯分析理论上只能做人的,小鼠基因可以比对到人数据库进行参考分析,其他物种需要提供配受体信息数据库才能进行。

广州基迪奥生物科技有限公司




9 目录结构

结果文件夹
├── 1.Expression                                            定量结果文件夹
│   ├── barcode_plot                                            有效细胞鉴定图文件夹
│   │   └── *.barcode_plot.{pdf,png}                                有效细胞鉴定图
│   ├── CellRanger_Report                                       Cell Ranger报告文件夹
│   │   └── CellRanger.*.result.html                                Cell Ranger count结果报告
│   ├── expressions                                             表达量结果文件夹
│   │   ├── *                                                       各样本结果文件夹
│   │   │   ├── expression.xls                                          表达量矩阵表
│   │   │   ├── barcodes.tsv                                            细胞barcode ID表
│   │   │   ├── genes.tsv                                               基因ID与名称表
│   │   │   └── matrix.mtx                                              表达量稀疏矩阵
│   │   └── *.expression.demo.xls                                   表达量矩阵示例表
│   ├── samples.align.stat.xls                                  各样本比对结果统计表
│   └── samples.sequence.stat.xls                               各样本测序数据统计表
├── 2.Cluster                                               聚类结果文件夹
│   ├── 1.QC                                                    质控结果文件夹
│   │   ├── AfterFilter.BasicInfo.merge.{pdf,png}                   过滤后各个样本细胞基本信息的分布图
│   │   ├── AfterFilter.BasicInfo.nUMI-nGene.{pdf,png}              过滤后各个样本细胞基本信息的分布散点图
│   │   ├── AfterFilter.BasicInfo.nUMI-pMito.{pdf,png}              过滤后各个样本细胞基本信息的分布散点图
│   │   ├── AfterFilter.BasicInfo.PresetMarker.{pdf,png}            过滤后各个样本细胞中预设标记基因的表达量分布
│   │   ├── BasicInfo.merge.{pdf,png}                               过滤前各个样本细胞基本信息的分布图
│   │   ├── BasicInfo.nUMI-nGene.{pdf,png}                          过滤前各个样本细胞基本信息的分布散点图
│   │   ├── BasicInfo.nUMI-pMito.{pdf,png}                          过滤前各个样本细胞基本信息的分布散点图
│   │   ├── BasicInfo.PresetMarker.{pdf,png}                        过滤前各个样本细胞中预设标记基因的表达量分布
│   │   └── Filter.stat.xls                                         过滤前后各个样本中细胞数据量统计表
│   ├── 2.cluster                                               分群结果文件夹
│   │   ├── AllGene.avg_exp.annot.xls                               基因在各个亚群中表达量的均值表
│   │   ├── Cells.cluster.list.xls                                  细胞与亚群对照表
│   │   ├── tSNE_*.{pdf,png}                                        各样本单细胞亚群分类tSNE图
│   │   └── tSNE.{pdf,png}                                          单细胞亚群分类tSNE图
│   ├── 3.cluster_stat                                          分群结果的统计结果文件夹
│   │   ├── Cluster.stat.inSamples.pct.{pdf,png}                    各亚群中各个样本细胞数量百分比堆叠图
│   │   ├── Cluster.stat.inSamples.{pdf,png}                        各亚群中各个样本细胞数量堆叠图
│   │   ├── Cluster.stat.bySamples.pct.{pdf,png}                    各样本中各亚群细胞数量百分比堆叠图
│   │   ├── Cluster.stat.bySamples.{pdf,png}                        各样本中各亚群细胞数量堆叠图
│   │   ├── Cluster.stat.Sample.xls                                 各样本中各亚群细胞数量统计表
│   │   ├── Cluster.stat.xls                                        细胞亚群分类结果统计表
│   │   ├── Cluster.cor.heatmap.{pdf,png}                           各亚群相关性系数热图
│   │   ├── PresetMarker.Distribution.{pdf,png}                     已知标记基因在各个细胞亚群中的表达分布
│   │   ├── PresetMarker.DotPlot.{pdf,png}                          已知标记基因在各个细胞亚群中的表达分布气泡图
│   │   ├── PresetMarker.Heatmap.{pdf,png}                          已知标记基因在各个亚群的表达量热图
│   │   └── PresetMarker.VlnPlot.{pdf,png}                          已知标记基因在各个细胞亚群中的表达分布小提琴图
│   └── 4.CellAnnotation                                        单细胞亚群鉴定结果文件夹
│       ├── Cell.annotation.stat.xls                                各样本在各个细胞类型中细胞数量统计表
│       ├── Cells.annotation.circos.{pdf,png}                       Seurat分群与singleR细胞鉴定对应circos图
│       ├── Cells.annotation.{pdf,png}                              各细胞类型在tSNE图的分布
│       ├── Cluster.correlation.heatmap.{pdf,png}                   Seurat分群与singleR鉴定细胞类型相关性热图
│       ├── Cluster.sample.singleR.stat.pct.{pdf,png}               各细胞类型中各样本细胞数量百分比堆叠图
│       ├── Cluster.sample.singleR.stat.{pdf,png}                   各细胞类型中各样本细胞数量堆叠图
│       ├── Cluster.stat.sample.singleR.xls                         各样本在各个细胞类型中细胞数量统计表
│       ├── Sample.cluster.singleR.stat.pct.{pdf,png}               各样本中各细胞类型数量百分比堆叠图
│       └── Sample.cluster.singleR.stat.{pdf,png}                   各样本中各细胞类型数量堆叠图
├── 3.MarkerGene                                            亚群上调表达基因分析结果文件夹
│   ├── DeGene.list.xls                                         各亚群差异基因注释表
│   ├── DeGene.stat.{pdf,png}                                   各亚群上调基因数量统计柱状图
│   ├── DeGene.stat.xls                                         各亚群上调基因数量统计表
│   ├── Enrichment                                              上调表达基因富集分析结果文件夹
│   │   ├── GO                                                      GO功能富集分析结果文件夹
│   │   ├── KO                                                      KO功能富集分析结果文件夹
│   │   ├── DO                                                      DO功能富集分析结果文件夹
│   │   └── Reactome                                                Reactome功能富集分析结果文件夹
│   ├── Plots                                                   上调基因表达分布结果文件夹
│   │   ├── Top.DotPlot.{pdf,png}                                   标记基因表达分布气泡图
│   │   ├── Top.Heatmap.{pdf,png}                                   标记基因表达热图
│   │   ├── ExpPlot                                                 标记基因表达分布图文件夹
│   │   ├── DensityPlot                                             标记基因表达分布密度图文件夹
│   │   └── ViolinPlot                                              标记基因表达分布小提琴图文件夹
│   └── String                                                  蛋白质互作网络分析结果文件夹
│       ├── Top.aln.links.xls                                       标记基因与String蛋白对应表及析构关系表
│       ├── Top.edge.tsv                                            Cytoscape绘图文件--连接信息文件
│       └── Top.node.tsv                                            Cytoscape绘图文件--节点信息文件
├── 4.GSVA                                                  GSVA分析
│   ├── *.gsva.xls                                              各细胞富集程度表
│   └── *.heatmap.cluster.{xls,pdf,png}                         各个亚群中的富集分数热图
├── 5.TF                                                    转录因子注释结果文件夹
│   └── TF.annot.xls                                            转录因子注释表
├── 6.CellCycle                                             细胞周期分析
│   ├── CellCycle.annot.xls                                     各细胞的细胞周期表
│   ├── CellCycle.boxplot.{pdf,png}                             不同样本的细胞周期指数评估盒形图
│   ├── CellCycle.DotPlot.{pdf,png}                             周期蛋白基因的分布气泡图
│   ├── CellCycle.Heatmap.{pdf,png}                             周期蛋白基因在不同细胞周期的表达量热图
│   ├── Phase.stat.bySamples.pct.{pdf,png}                      不同样本的不同细胞周期细胞的数量比例堆叠图
│   ├── Phase.stat.bySamples.{pdf,png}                          不同样本的不同细胞周期细胞的数量堆叠图
│   ├── CellCycle.Cluster.tSNE.{pdf,png}                        不同细胞周期细胞与分群的tSNE分布
│   └── CellCycle.Samples.tSNE.{pdf,png}                        不同细胞周期细胞与样本的tSNE分布
├── 7.RNAVelocity                                           RNA速率分析
│   ├── velocity.trajectory.tSNE.{pdf,png}                      RNA速率轨迹图
│   └── velocity.tSNE.{pdf,png}                                 RNA速率分布图
├── index.html                                              单细胞分析结果网页版报告
├── scRNA-seq_method.pdf                                    单细胞分析方法说明(英文版)
└── src                                                     网页版报告系统文件文件夹

广州基迪奥生物科技有限公司




10 参考文献

广州基迪奥生物科技有限公司




11 附录

11.1 分析方法英文文档

scRNA-seq分析方法文档(英文):scRNA-seq_method.pdf

11.2 结果文件查看

*.xls,*.txt :结果数据表格文件,文件以制表符(Tab)分隔。unix/Linux/Mac用户使用 less 或 more 命令查看;windows用户使用高级文本编辑器Notepad++ 等查看,也可以用Microsoft Excel打开。

*.png:结果图像文件,位图,无损压缩。

*.pdf:结果图像文件,矢量图,可以放大和缩小而不失真,方便用户查看和编辑处理,可使用Adobe Illustrator进行图片编辑,用于文章发表等。

11.3 文章引用与致谢

如果您的研究课题使用了基迪奥的测序和分析服务,我们期望您在论文发表时,在Method部分或Acknowledgements部分引用或提及基迪奥公司。以下语句可供参考:

广州基迪奥生物科技有限公司




帮助文档