1 项目概述

项目编号GDMR20030248_sup_18
项目内容Rosa_chinensis_12_WGCNA
参考基因组Rosa_chinensis

2 流程介绍

WGCNA[1](weighted gene co-expression network analysis,权重基因共表达网络分析)是一种分析多个样本基因表达模式的分析方法,可将表达模式相似的基因进行聚类,并分析模块与特定性状或表型之间的关联关系,因此在疾病以及其他性状与基因关联分析等方面的研究中被广泛应用。

WGCNA算法是构建基因共表达网络的常用算法,使用R语言包[2]进行分析。WGCNA算法首先假定基因网络服从无尺度分布,并定义基因共表达相关矩阵、基因网络形成的邻接函数,然后计算不同节点的相异系数,并据此构建分层聚类树(hierarchical clustering tree),该聚类树的不同分支代表不同的基因模块(module),模块内基因共表达程度高,而分属不同模块的基因共表达程度低。最后,探索模块与特定表型或疾病的关联关系,最终达到鉴定疾病治疗的靶点基因、基因网络的目的。

WGCNA 与趋势分析都为基因共表达分析方法,与趋势分析相比,WGCNA 有以下优势:
1.聚类方法:使用权重共表达策略(无尺度分布),更加符合生物学现象;
2.基因间的关系:能呈现基因间的相互作用关系,且能找出处于调控网络中心的 核心基因(hub gene);
3.样本数:适合于大样本量,且越多越好。而趋势分析 5 个点以上就会使结果非常复杂,准确性 下降,且最多只能分析 8 个点;
4.与表型的关联:可以利用模块特征值和 hub 基因与特定性状、表型进行关联分析,更准确地分析生物学问题。

Fig 2-0-1 WGCNA总体分析流程图

3 数据过滤

在进行WGCNA分析之前,我们需要对选用的基因集进行筛选过滤,把低质量的对结果造成不稳定影响的基因或样品从中去掉,提高网络构建的精度。

4 模块划分

基因间具有相互诱导、阻遏表达或协同作用,这些作用都会导致相关基因的表达量间存在相关性,在大样本的情况下,基因的表达分类更加有规律,利用WGCNA分析可以将具有相似表达模式的基于进行归类,将成千上万的基因划分成数个至数十个的模块。基于模块的分析更利于对相似基因进行分析,并且结合表型信息能更好的找出与特定性状相关联的模块。

4.1 样本层次聚类树

首先我们根据所有基因的表达量对所有样本进程层次聚类,将每个样本视为一个簇,计算各个簇之间的距离,最近的两个簇聚合成一个新的簇,重复以上过程直至最后只有一个簇。通过层次聚类可以查看是否存在离群样品,以便于分析样本情况。

下图所示,关系越相近的样本越容易被归为一个簇,若存在离群样本,则无法归为任何一个簇。

Fig 4-1-1 样本层次聚类树

4.2 Power值曲线

对于power值的选取,一般情况下,我们会取相关系数达到平台期(或大于0.8)时最小的Power值作为后续分析的参数(如图4-2-1左图),同时我们会统计在不同power值下,基因平均连通性的变化(如图4-2-1右图)。有时即使选择较大的power值也无法使相关系数大于0.8,可能是由于部分样本与其他样本差别太大,可以通过样本聚类图查看分组信息,以确认有无异常样本,若样本情况正常,也可取经验值进行后续分析。

本次分析选用的Power值参数为:10

本次分析选用的特征参数如下图所示。左图:横坐标表示power值,纵坐标表示相关系数,蓝色横线表示相关系数0.8,红色横线表示相关系数0.9。右图:横坐标表示power值,纵坐标表示基因的平均连接度。

Fig 4-2-1 Power值曲线图

4.3 模块划分

根据基因间表达量的相关性构建基因聚类树,并根据基因间的聚类关系进行基因模块的划分,表达模式相似的基因将被划入同一个模块,对聚类树的分支进行剪切区分,产生不同的模块,每个颜色代表一个模块,灰色表示无法归入任何一个模块的基因。在进行初步的模块划分之后,获得初步划分的模块结果Dynamic Tree Cut,由于有些模块非常相似,我们还会再根据模块特征值的相似度对表达模式相近的模块再进行合并,获得最终划分的模块Merged dynamic。

模块特征值(module eigengene):模块中的所有基因进行PCA分析,得到的主成分1(PC1)的值,PC1相当于模块中所有基因表达量的加权,可代表该模块内基因的整体表达模式。

相似度:合并模块的参数,由模块数量决定。本次分析选用的相似度为:0.85

模块最少基因数:指每个模块至少需要包含的基因数目。本次分析选用的最少基因数为:50

下图所示,红线和蓝线为帮助模块合并的辅助线;Height指的是模块之间不相关的程度,即:不相关 = 1 - 相关性。

Fig 4-3-1 模块特征值聚类图

下图所示,1)Dynamic Tree Cut为根据聚类结果划分的模块;2)Merged dynamic为根据模块相似度对表达模式相似的模块进行合并后的模块划分,之后的分析按照合并后的模块进行;3)对于树图,纵向距离代表两个节点间(基因间)的距离,横向距离无意义。

Fig 4-3-2 模块层级聚类图

Tab 4-3-1 基因-模块对应关系列表 (仅展示前10行)
GeneModuleCK-1CK-2CK-3T36-1T36-2T36-3T60-1T60-2T60-3T72-1T72-2T72-3
MSTRG.10pink1.481.101.452.861.471.691.291.442.422.412.311.43
MSTRG.10026orange28.5229.6428.6733.7733.8332.7828.9333.4734.7030.9222.0732.68
MSTRG.10033pink43.4643.278.3011.025.755.153.686.324.480.921.382.37
MSTRG.1005darkturquoise4.986.328.768.329.889.849.729.3011.106.547.099.04
MSTRG.10074darkturquoise3.012.873.793.793.804.595.174.415.593.332.356.03
MSTRG.10085pink4.474.391.970.440.710.551.000.530.631.451.601.35
MSTRG.10087pink12.2512.417.771.321.631.221.221.001.464.771.343.76
MSTRG.10094brown42.934.8514.5812.7226.0111.3222.2224.9022.479.0111.6714.41
MSTRG.1011cyan0.000.001.1611.1324.149.4738.5223.4426.5238.5431.6440.38
MSTRG.1012honeydew10.211.813.363.805.852.616.594.174.102.744.484.19


下图中,横坐标表示每个模块,纵坐标表示基因数量。

Fig 4-3-3 各模块基因数柱状图

5 模块概况

每个模块具有特定的表达模式,我们对获得的模块两两之间或是样品与模块之间进行相关性和聚类分析,了解模块的具体情况。若有性状等表型信息,还可以对分析模块与性状间的关系,找出与性状最相关的模块,并进一步寻找hub基因,同时,我们也提供TF因子的信息,以便于挖掘模块内调控关系的规律。

5.1 模块间相关性分析

将所有模块两两间进行相关性分析,并绘制热图。

如下图,每一行和列代表一个模块,方块里的数字为两个模块的pearson相关性系数,括号里数字为P值。方块颜色越深(越红或越绿),相关性越强;方块颜色越浅,相关性越弱。两个模块相关性的检验值P值通过student`s T test计算,P值越小,说明两个模块间相似度越高。

Fig 5-1-1 模块间相关性热图

5.2 模块基因相关性分析

将模块内的基因进行聚类,并绘制热图。

如下图,每一行和列代表一个基因,每个点的颜色越深(白→黄→红)代表行和列对应的两个基因间的连通性越强,pearson相关性越强。P值采用student's t test计算所得,P值越小代表基因与模块相关性的显著性越强。

Fig 5-2-1 模块基因相关性热图

5.3 样本表达模式分析

将模块基因在各个样本中的表达模式用模块特征值来展示,并绘制样本表达模式热图。通过样本表达模式热图,我们可以找出与特定样本显著相关的模块,从而后续可选择相应的模块进行进一步的研究。模块特征值相当于模块中所有基因表达量的加权综合值。因此,模块特征值在各个样本中的数值,反映了模块中所有基因在各个样本中的综合表达水平。

如下图,横坐标为样本,纵坐标为模块,用模块特征值作图。红色代表高表达量,绿色代表低表达量。该图能直观反映各模块在各个样本中的表达模式。

Fig 5-3-1 样本表达模式热图

5.4 各模块hub基因挖掘

针对每个模块,我们还可以在模块中进一步挖掘hub基因,hub基因可以通过模块相关性程度MM和模块内连通性K.in(即表中All.kWithin)进行寻找,通常MM、K.in的绝对值越高,表明该基因具有重要意义。我们对所有模块的基因都计算MM和K.in值,后续可以针对该结果进一步寻找到各模块中的hub基因。

Tab 5-4-1 hub相关性结果 (仅展示前10行)
GenemoduleColorsAll.kWithinMM.paleturquoiseMM.cyanMM.pinkMM.grey60MM.bisque4MM.palevioletred3MM.antiquewhite4MM.lightcyanMM.darkturquoiseMM.lavenderblush3MM.orangeMM.darkgreenMM.brown4MM.honeydew1MM.coral1MM.darkmagentaMM.darkseagreen4MM.sienna3MM.greyMM.paleturquoise.pvalueMM.cyan.pvalueMM.pink.pvalueMM.grey60.pvalueMM.bisque4.pvalueMM.palevioletred3.pvalueMM.antiquewhite4.pvalueMM.lightcyan.pvalueMM.darkturquoise.pvalueMM.lavenderblush3.pvalueMM.orange.pvalueMM.darkgreen.pvalueMM.brown4.pvalueMM.honeydew1.pvalueMM.coral1.pvalueMM.darkmagenta.pvalueMM.darkseagreen4.pvalueMM.sienna3.pvalueMM.grey.pvalue
MSTRG.10pink5.46397549215724-0.16719700747642-0.337580314122616-0.4107549586010170.3696125427392770.4432859946524530.0983641816950861-0.137602990708348-0.1548946797375290.003629189667920920.1487087282522590.3737339048397570.0629565573129345-0.06938422916253790.01344378441468140.1230765372572780.3067341007747730.3459454029820860.385902317244637-0.05987695927853950.6034943169764650.2832083676356610.1847095771437960.2370090764879960.1489214469222660.7610276719260530.6697724117261380.6307476340911490.9910689478938890.644615277808630.2314129675257210.8458837026376240.8303410385388330.9669236581680250.7031509107411260.3321754082654320.2706826858551860.2153547079452470.853348675182016
MSTRG.10026orange15.0273452469243-0.6458009763233270.131561590518916-0.250961048364513-0.0244180167850260.0065053394191954-0.1678300436438160.6300579264356620.4982340762410730.5952545984166740.6003382521673670.714450514933406-0.3624739629993060.2960974924775940.03666713727644170.3921789023057160.2571898240324080.4316867733079030.5970983916905530.01066215065392830.02330099607869010.6835915358530540.4314109390489340.9399565328670560.9839916695744310.6021040907249020.02809536619277690.09923614427511880.04115722970055890.03902340059511310.009036144156414310.2468902404142890.3500582229684490.9099260309503050.2073428429608890.4196734814281060.1611250101692290.04037397214424890.973765090384632
MSTRG.10033pink310.7217907682170.1801041538248940.7260964688563070.907405462063653-0.4348462249112510.0503031683381370.2166907504266880.3349865794293240.129007092616242-0.389334393098407-0.245774819690225-0.509206680244224-0.511025161331683-0.587460469539468-0.7266972173909-0.644457669576175-0.765250443857159-0.541450188752566-0.673949866879735-0.1824887057530390.5753933821450070.007498615565293674.58087694737703e-050.1577401843841160.8766237590047950.4987437016781540.2871579468476450.6894619031439470.2109509201068720.441304242966260.09086068386733480.08952031436262610.0445876182948430.007425021823639310.02368501964852950.003726911631866150.06904159895713550.01625159682034750.570259463364988
MSTRG.1005darkturquoise261.811981147371-0.192361817973044-0.197611363472253-0.643451975047021-0.190192561337912-0.532252061895802-0.6571722315847830.1984411844368110.4816861385549170.8491924716693780.5457150485005650.6756981899504740.1118169091558410.7077544156774430.6467553177454210.7655139025935160.8011328338843910.4170369876827820.5327727627923310.2484973065692220.5492012447628990.5381379696847760.02397552051422440.5538002197538620.07485237616887340.02022915691643060.5363978030377560.1128194759208110.0004742035333096970.06645589247757460.01587308795610940.7293616104467150.010020312363220.02303093280659030.003707830139055590.001735484466011210.1774188501705320.07451478947137370.436097249754895
MSTRG.10074darkturquoise18.8990747719072-0.504690467846375-0.262411884123078-0.5044073363161240.0401769759903475-0.335926003763676-0.3666688026991150.1524318820036340.2140895622516410.5407242564155190.268448657056150.4643075815101980.1769723901443040.4269075599628020.5014323108234340.6682081162810760.3951299981615370.4609241667831650.5384790619373110.2895810361021610.09424789150719340.4099562629670050.09446302956797680.9013394639596430.2857238605970430.2410548779982260.6362559339591140.5040372818642490.06948851543234610.3988648387177750.1283510494702030.5821637534916790.1663320234996530.09674359255276120.01754142920985810.203639574031020.1315287840474740.07088335589957120.361261880528056
MSTRG.10085pink154.2182642422740.3609710930017290.5187107307596130.896531929363748-0.1798401614564140.1455977132591580.4504567962539570.0290847950126714-0.187311551016691-0.671434250208853-0.521898291693249-0.779290277378527-0.268902872719401-0.686667405801078-0.676590412797026-0.758837142181421-0.90570357894288-0.565339061437244-0.7256218914577080.03280790623528910.2490008821272430.08400298173600417.83211965504143e-050.5759628816890610.6516293659048160.1416821379840070.9285048062798140.5599325281742190.01680781070264410.08178403729352950.002808967083970810.3980365054369020.0136444517660920.01568244244789110.004214823057954695.00288739287409e-050.05541725984169340.007557138255787350.919377552089288
MSTRG.10087pink182.1351141206530.345178138632060.6267848466897940.943326283024287-0.1491054328364830.1912900993853110.535302864699740.19713315608285-0.0559289699240517-0.599216127367693-0.380636401708537-0.679650612252391-0.421990289118142-0.754754473138781-0.789175782845866-0.798071607400294-0.942391214003754-0.534265860099897-0.623842186338930.1513027254129660.2718180489888850.02917514468719984.18493600945728e-060.6437227632340150.5514713822621170.07288928801677130.5391418692653510.862934736973440.03948747516297670.2222188104411540.01504134150367280.1717990235713350.004549675836365150.002274204547074580.001862925708657514.53452825374845e-060.0735525548081430.0301711629185310.638787094099933
MSTRG.10094brown448.6980122455299-0.166015783849706-0.298894833987972-0.653937038058816-0.169030149374153-0.430069569859702-0.6535098563772040.09033704992785590.3471206433497560.7831806338480580.2479764630865130.5010472604970910.2037753310479140.9273113192786250.7461838827922470.8298044201937130.8165969765899810.537471320876270.4176844255520320.1158573172037460.6060916677314710.3453063514837520.02107106689559940.5994718481585820.1628752478119750.02118411342753960.7800874747256950.2689489223403080.002588202061507550.4370910975842320.0970414343080270.5252691483981591.41328705003176e-050.005317726632860920.0008387876978246780.0011904196136480.07151563429244330.176677848243430.719923945087724
MSTRG.1011cyan248.330335851192-0.414840200751593-0.943551273927608-0.8356989879416110.6500562424247470.1936425987454670.00112913184395867-0.553509801030212-0.5733321806172220.0165900468858819-0.2992692270197240.1558998987948940.7345365887094340.5802861826240150.7577878918483330.6515012357311180.4328495541662020.6484415508151650.5736241661707130.1492706116096820.1799475684709584.10409582552642e-060.000710666741440550.02211435625010680.5464933637981710.9972212818263540.06190432480544870.05131274619220390.9591879102076910.3446730020151550.6285042872166210.006513475060501970.04791948105194250.004299036860190030.02172159703995490.1598739767896850.02255932894835840.0511669756001080.643351267730829
MSTRG.1012honeydew136.0257712857298-0.0499803544167186-0.549219140676997-0.7576630267065780.00475773068891355-0.366277973642613-0.577904760825604-0.2569045306815790.02512669570977120.5331995743063140.09503669910971540.3386494993590580.5055057451787090.87347278134770.8771743817312880.7718873189376380.8052071171469910.3581458752141850.3646575118223610.1864071867767560.8774102245821070.06438209456269670.004309142809716090.9882918755030470.2415950742791260.04906316850834350.4202076071387730.9382167956720420.0742388532707830.7689143500135540.2815893278121190.09363024244382730.0002057390359131860.0001785063460886670.003268391438291530.001576362212622070.2529972168447260.2438424720921720.5618631782611

5.5 各模块TF因子分析

转录因子(TF)对于转录调控起到重要作用,我们寻找各模块中的转录因子,若分析的物种是植物或者动物,我们将预测的蛋白序列同相应的TF数据库(plant TFdb[3]/animal TFdb[4])进行 hmmscan 比对,获得该物种的TF因子的信息,便于在后续构建共表达网络图时找到关键的TF因子,以便于进一步了解TF因子调控的机制。

Tab 5-5-1 TF预测结果 (仅展示前10行)
GeneTF familyModule
ncbi_112163987GRAShoneydew1
ncbi_112164284AP2-EREBPcyan
ncbi_112164486WRKYdarkgreen
ncbi_112164521NACcoral1
ncbi_112164550WRKYdarkturquoise
ncbi_112164583WRKYlightcyan
ncbi_112164600ARR-Bdarkturquoise
ncbi_112164616bZIPpink
ncbi_112164651TCPlightcyan
ncbi_112164653ARR-Bcyan

Tab 5-5-2 TF因子统计表
ModuleTF number
lightcyan178
cyan167
darkturquoise122
pink90
darkgreen71
honeydew139
antiquewhite430
coral124
orange21
grey6020
bisque418
darkmagenta17
sienna313
palevioletred312
brown410
darkseagreen45
paleturquoise3
lavenderblush33

下图中,横坐标表示每个模块,纵坐标表示转录因子的数量。

Fig 5-5-1 各模块TF因子的柱状图

6 表达模式

6.1 模块基因总表
Tab 6-1-1 模块基因总表 (仅展示前10行)
GeneIDModuleAll.kTotalAll.kWithinMM.paleturquoiseMM.cyanMM.pinkMM.grey60MM.bisque4MM.palevioletred3MM.antiquewhite4MM.lightcyanMM.darkturquoiseMM.lavenderblush3MM.orangeMM.darkgreenMM.brown4MM.honeydew1MM.coral1MM.darkmagentaMM.darkseagreen4MM.sienna3MM.greyMM.paleturquoise.pvalueMM.cyan.pvalueMM.pink.pvalueMM.grey60.pvalueMM.bisque4.pvalueMM.palevioletred3.pvalueMM.antiquewhite4.pvalueMM.lightcyan.pvalueMM.darkturquoise.pvalueMM.lavenderblush3.pvalueMM.orange.pvalueMM.darkgreen.pvalueMM.brown4.pvalueMM.honeydew1.pvalueMM.coral1.pvalueMM.darkmagenta.pvalueMM.darkseagreen4.pvalueMM.sienna3.pvalueMM.grey.pvalueCK-1CK-2CK-3T36-1T36-2T36-3T60-1T60-2T60-3T72-1T72-2T72-3SymbolDescriptionKEGG_A_classKEGG_B_classPathwayK_IDGO ComponentGO FunctionGO ProcessTF_family
MSTRG.10786antiquewhite42058.76098088245234.3327295821440.192345033091137-0.789847074877546-0.4707997450890720.373754253919778-0.0731085988142983-0.056268194902974-0.976706028892179-0.787296318218886-0.435300104723269-0.589069378651897-0.4138602401994770.9308428839792970.1729940471503310.6053430020105050.09333325781481550.1840021329869110.0191571607700432-0.139934342498874-0.05557954646932740.5492367689632460.002240948884254150.1223947753918050.2313855343505030.8213603228900450.862110384806445.1943629264324e-080.002369320678801310.1572576742277020.04386352156400250.1810828119219521.10842870019211e-050.5908090258043160.03700076611680080.7729596284678990.5670106295248810.9528784862607060.6644644349786540.8637840037347822.894.023.054.894.873.079.236.106.348.0426.7217.03--PREDICTED: muscle M-line assembly protein unc-89-like [Pyrus x bretschneideri]----GO:0043231//intracellular membrane-bounded organelle---
MSTRG.11431antiquewhite454.64502960043148.40825285571775-0.48283639682843-0.0482615400924883-0.09706493462643940.3467106315864290.3459296748720810.3077067328394910.404662071358880.06998004427745380.1716916781987120.09795401593588440.27944367203074-0.2055224164234140.192146750458971-0.02194870627371340.287103200619694-0.1264364639627610.4184177538186410.5576898968241690.5200708371369290.1118376424804240.8815994379517030.7641046584480940.2695530595936050.2707059321447320.3305654125508460.1919559036140340.8289030465590410.5936499185966870.7619987200028630.3790602285614020.5216459518864780.5496564944056820.9460202855520570.3655709399578360.6953852483094310.1758408857737880.05955479302644550.08305123754142812.853.322.942.863.772.693.772.773.504.061.563.55MPT2mitochondrial phosphate carrier protein 2, mitochondrial [Rosa chinensis]----GO:0009536//plastid;GO:0019866//organelle inner membrane;GO:0030312//external encapsulating structure;GO:0031224//intrinsic component of membrane-GO:0006006//glucose metabolic process;GO:0006739//NADP metabolic process;GO:0006970//response to osmotic stress;GO:0009725//response to hormone;GO:0051234//establishment of localization-
MSTRG.1162antiquewhite41706.03916799444118.5878246396640.179932787607858-0.679308198650942-0.2553685288071250.5476911796414090.03843017532920930.273827251303733-0.839157147927077-0.817188465938962-0.575345161015403-0.69522865780465-0.5724007706612890.822946268423824-0.02367188326290480.415860139405769-0.0531341540958046-0.176296814886630.0260540306027714-0.08648312921240160.3486748634112760.575763037745420.01511209928582090.4230890660281770.06528078826243940.9056117259410130.3891122684766070.0006428955449103120.001172588077162840.0503136868614090.01207272902746850.05177968920508120.001009541076334040.9417884777349830.1787707388328190.8697311402036550.5836283270015380.935940655159240.7892786187273840.2666659344440980.450.750.890.180.500.150.930.460.651.131.572.02Os04g0119400pyruvate dehydrogenase E1 component subunit alpha-3, chloroplastic [Rosa chinensis]Metabolism;Metabolism;Metabolism;Metabolism;Metabolism;MetabolismGlobal and overview maps;Global and overview maps;Global and overview maps;Carbohydrate metabolism;Carbohydrate metabolism;Carbohydrate metabolismko01100//Metabolic pathways;ko01110//Biosynthesis of secondary metabolites;ko01200//Carbon metabolism;ko00010//Glycolysis / Gluconeogenesis;ko00620//Pyruvate metabolism;ko00020//Citrate cycle (TCA cycle)K00161;K00161;K00161;K00161;K00161;K00161GO:0009526//plastid envelope;GO:0009532//plastid stromaGO:0004738//pyruvate dehydrogenase activityGO:0006090//pyruvate metabolic process-
MSTRG.12109antiquewhite4101.46366293740212.4997149849126-0.3490812891244990.169869684499217-0.03773939746638210.2365549773859880.2537718381214850.2924144179674660.51813404781820.3115663652408010.2493490908447330.5165263363506050.533962605355671-0.453173350257988-0.183008151086383-0.3104359445483920.0442351336205528-0.0558596333443090.3469280285529460.5727878515671760.1755534574730460.2660707907799710.59763307065230.9073018454174770.4591577458894380.4260946442347340.3563675020536520.08440874555050910.3242185371686030.4344742798830690.08554697609448960.0737472955331660.1390000488052710.5691435435807710.3260704770423350.8914236161986660.8631032475864440.2692326358267410.05158527392742370.585241509713342.983.833.423.243.324.572.783.514.364.192.483.45--zinc finger MYM-type protein 1-like [Rosa chinensis]----GO:0043231//intracellular membrane-bounded organelleGO:0005515//protein binding--
MSTRG.12245antiquewhite4199.37656453135724.8978312639893-0.2169034739680140.217451434199096-0.07011105831859550.03168206989890830.2902596928226120.05424011415175990.6289881754744890.4786031961582170.4760903487830.4557109908047610.554974921069137-0.5261599150995080.151056924669248-0.1686291413173150.3068835351689030.1905587037455780.3215418098853760.5152849734501220.46182840657140.498311898399620.4972003753064430.8285869103851270.9221366588875910.3600863591467920.8670406864019680.02844504625850330.1154790622579050.1176771850280870.1365244494965370.06107362390540290.07888018249149580.6393385607097170.6003509085556190.3319277735696350.5530228618886450.308126947120260.08643292450586490.1306745548867161.061.031.491.711.571.051.680.871.601.650.460.73rpsEprobable 37S ribosomal protein S5, mitochondrial [Rosa chinensis]Genetic Information ProcessingTranslationko03010//RibosomeK02988----
MSTRG.12586antiquewhite4534.83766505828982.80793632929450.532157647367151-0.235325898843630.10631366105142-0.096605966602373-0.438685545991436-0.121888036420232-0.781420143875696-0.534778271512551-0.535521475598063-0.589018477583406-0.7601611394765630.680827404714321-0.1042348276128710.28796564840339-0.364278393315044-0.194457035574626-0.508725974001952-0.679693939692834-0.06308670459642350.07491370004449570.4615631004325240.7422716487928810.765192376734570.1536883102373960.7059041340639950.002686399045087330.07322430204031940.07274999041567960.04388630151068050.004110347487197660.01480002434844730.7471644733631010.3640680600461220.2443700404600890.5447743455356250.09121725295508930.01503240556976090.8455684764013072.142.462.181.431.921.681.611.921.611.163.503.05--hypothetical protein RchiOBHm_Chr3g0490531 [Rosa chinensis]--------
MSTRG.12762antiquewhite41222.72257370144136.197126289498-0.2259581035386440.7377811602451190.444387769032224-0.3833877822406770.08415407638427990.02286232064636310.8806346252534320.6808217163987840.3490377845518910.558223253274970.376895267555669-0.835887858690047-0.0984493316622386-0.55325013030362-0.172182861866582-0.183847440626352-0.09043954272528670.0662107342203426-0.06123954185700980.4800885991693070.006161909723439250.1477940557665170.2186161895913230.794845187019960.9437764496518630.0001556774144521040.01480118390428090.2661344593404350.05925953659057720.2271743079145410.0007068285826878190.7608261230204250.06205236757511680.5925778689734370.567342360976410.7798433887576280.8380081826615540.85004436593103345.0344.4040.3043.4246.6643.7436.5041.2642.4540.0229.7334.37MLO2MLO-like protein 3 [Rosa chinensis]----GO:0005911//cell-cell junction;GO:0016020//membrane;GO:0043231//intracellular membrane-bounded organelle-GO:0002252//immune effector process;GO:0006605//protein targeting;GO:0006812//cation transport;GO:0009404//toxin metabolic process;GO:0009617//response to bacterium;GO:0009696//salicylic acid metabolic process;GO:0009755//hormone-mediated signaling pathway;GO:0009863//salicylic acid mediated signaling pathway;GO:0010053//root epidermal cell differentiation;GO:0010243//response to organonitrogen compound;GO:0010260//organ senescence;GO:0015833//peptide transport;GO:0031347//regulation of defense response;GO:0035556//intracellular signal transduction;GO:0042743//hydrogen peroxide metabolic process;GO:0043067//regulation of programmed cell death;GO:0046942//carboxylic acid transport;GO:0048468//cell development;GO:0050832//defense response to fungus-
MSTRG.1286antiquewhite42894.1564520347237.50157245664610.03071506424516340.8607994538981540.503201140974355-0.671421618288681-0.227492604915585-0.2488781875435060.9238459235392880.9100539047506610.5659569680073690.4775062191871790.308310657005444-0.887543820099833-0.0732315754851193-0.4861388848141250.00907806939420662-0.0492413984809363-0.0479206802563757-0.0876165537469658-0.01013129885651710.9245071468599490.0003242970059153430.0953832586276760.01681063846646880.4770309188862920.4353711653024541.7732950070359e-053.98032992463316e-050.05509204727773140.1164352875918810.3295678777755170.000116944821140520.8210641142534070.1090501979349770.9776618932390460.8792109113267670.8824305394754060.7865729220004030.97507091860683119.9020.4821.1718.3117.3718.7617.3320.5120.0612.136.919.40PFP-BETAprobable LRR receptor-like serine/threonine-protein kinase At1g53430 isoform X5 [Rosa chinensis]Metabolism;Metabolism;Metabolism;Metabolism;MetabolismGlobal and overview maps;Global and overview maps;Carbohydrate metabolism;Carbohydrate metabolism;Carbohydrate metabolismko01100//Metabolic pathways;ko01110//Biosynthesis of secondary metabolites;ko00010//Glycolysis / Gluconeogenesis;ko00051//Fructose and mannose metabolism;ko00030//Pentose phosphate pathwayK00895;K00895;K00895;K00895;K00895GO:0030312//external encapsulating structure;GO:0044445//cytosolic partGO:0008443//phosphofructokinase activity;GO:0032550//purine ribonucleoside binding;GO:0043169//cation bindingGO:0006084//acetyl-CoA metabolic process;GO:0006090//pyruvate metabolic process;GO:0016310//phosphorylation;GO:0016458//gene silencing;GO:0034968//histone lysine methylation-
MSTRG.13315antiquewhite42230.14826120019208.5093479238820.207952719685512-0.764746414329568-0.3792112276958550.5507052723150670.16703078294930.217328626197036-0.963016965585853-0.87409840410392-0.602533924009522-0.584400715629885-0.4427397038837820.839248711562158-0.01512898519667420.404701926417307-0.09577083277040470.0251721883218937-0.000213766793618697-0.0995826091495116-0.03900489313853350.5166239221293970.003763624437911050.2240988592763240.06351625190251450.603859567723020.4974493915722425.12040056464765e-070.0002009225218510060.03812658949181650.04598791729494850.149482476773540.0006411718722582680.9627798730166270.1919079421798210.7671725765592540.9381051241706920.9994739333133810.7581449733730720.9042058431141520.000.110.130.922.090.4517.624.182.88101.52247.51120.04--glu S.griseus protease inhibitor [Prunus persica]------GO:0006950//response to stress;GO:0010466//negative regulation of peptidase activity-
MSTRG.13789antiquewhite41751.157652632106.8088416519360.15049777746640.749320615428110.378456940723274-0.472200483175366-0.115684117180504-0.1131011259789250.8437381711317030.8424875892767660.5267105233209820.6222301606080320.426897222686915-0.844868475572642-0.168871237099333-0.494024580691575-0.04083539753970360.0118829667069112-0.04431939816636660.09748017942065840.2971756842272510.6405936603280390.005026099364735530.2250977447628320.1211338814020450.7203277837890150.726358104735060.0005609487081336930.0005824656412362980.07851020572582420.03072704898169290.166343399723780.0005420360865360810.5998201453426560.1025818425295910.8997297380265250.9707622666380480.8912178678167090.7631208978860840.3482225779850190.861.151.301.010.951.230.780.701.420.720.270.21--PREDICTED: transposase, partial [Prunus dulcis]--------

6.2 模块基因表达模式

将每个模块包含的各个基因的表达模式用热图展示,并用柱状图呈现模块特征值在不同样本的变化(相当于模块表达模式)。图中,上图为模块中基因在不同样本中的表达量热图,红色为上调,绿色为下调;下图为在不同样本中的模块特征值。

  • antiquewhite4
  • bisque4
  • brown4
  • coral1
  • cyan
  • darkgreen
  • darkmagenta
  • darkseagreen4
  • darkturquoise
  • grey60
  • honeydew1
  • lavenderblush3
  • lightcyan
  • orange
  • paleturquoise
  • palevioletred3
  • pink
  • sienna3

Fig 6-2-1 各模块基因表达模式热图


6.3 各模块网络信息(可用于cytoscape作图)

我们提供各模块节点文件和边界文件,可用于导入cytoscape软件[5],构建基因共表达网络图。也可挑选感兴趣的一部分基因(如连通性较高的基因),构建基因共表达网络图。基因调控关系网络图能帮助我们获取在调控关系网络中处于枢纽位置的核心基因(hub gene),并且可以利用已知基因的功能预测未知基因功能(直线相连的基因功能潜在相关)。我们还提供TF因子的信息,在绘制调控网络图时,可以将TF因子在图中标注出来。

注:依据系统配置及浏览器不同,如果关系对数量过多该图可能不能正常加载,请使用桌面版Cytoscape软件

下图展示的为示例图,可以根据个人喜好使用桌面版Cytoscape软件调整图形。

Fig 6-3-1 网络图例图

7 富集分析

7.1 GO富集分析

我们将各模块基因向GO数据库[6](http://www.geneontology.org/)的各term映射,并计算每个term的基因数,从而得到具有某个GO功能的基因列表及基因数目统计。然后应用超几何检验,找出与整个基因组背景相比,在基因中显著富集的GO条目。

基因集 细胞组分 分子功能 生物学过程 GO 分类表
antiquewhite4 antiquewhite4.C.html antiquewhite4.F.html antiquewhite4.P.html antiquewhite4.Level2.xls
bisque4 bisque4.C.html bisque4.F.html bisque4.P.html bisque4.Level2.xls
brown4 brown4.C.html brown4.F.html brown4.P.html brown4.Level2.xls
coral1 coral1.C.html coral1.F.html coral1.P.html coral1.Level2.xls
cyan cyan.C.html cyan.F.html cyan.P.html cyan.Level2.xls
darkgreen darkgreen.C.html darkgreen.F.html darkgreen.P.html darkgreen.Level2.xls
darkmagenta darkmagenta.C.html darkmagenta.F.html darkmagenta.P.html darkmagenta.Level2.xls
darkseagreen4 darkseagreen4.C.html darkseagreen4.F.html darkseagreen4.P.html darkseagreen4.Level2.xls
darkturquoise darkturquoise.C.html darkturquoise.F.html darkturquoise.P.html darkturquoise.Level2.xls
grey60 grey60.C.html grey60.F.html grey60.P.html grey60.Level2.xls
honeydew1 honeydew1.C.html honeydew1.F.html honeydew1.P.html honeydew1.Level2.xls
lavenderblush3 lavenderblush3.C.html lavenderblush3.F.html lavenderblush3.P.html lavenderblush3.Level2.xls
lightcyan lightcyan.C.html lightcyan.F.html lightcyan.P.html lightcyan.Level2.xls
orange orange.C.html orange.F.html orange.P.html orange.Level2.xls
paleturquoise paleturquoise.C.html paleturquoise.F.html paleturquoise.P.html paleturquoise.Level2.xls
palevioletred3 palevioletred3.C.html palevioletred3.F.html palevioletred3.P.html palevioletred3.Level2.xls
pink pink.C.html pink.F.html pink.P.html pink.Level2.xls
sienna3 sienna3.C.html sienna3.F.html sienna3.P.html sienna3.Level2.xls

GO 富集分类柱状图:(横坐标为二级GOterm,纵坐标为该term里的基因数量,红色表示上调,绿色表示下调)

  • antiquewhite4
  • bisque4
  • brown4
  • coral1
  • cyan
  • darkgreen
  • darkmagenta
  • darkseagreen4
  • darkturquoise
  • grey60
  • honeydew1
  • lavenderblush3
  • lightcyan
  • orange
  • paleturquoise
  • palevioletred3
  • pink
  • sienna3

Fig 7-1-1 GO富集分类柱状图


GO富集气泡图:(利用P值(或Q值)最小的前20个GOterm来作图,纵坐标为GOterm,横坐标为富集因子(该GOterm中差异数量除以所有数量),大小表示数量多少,颜色越红P/Q值越小)

  • antiquewhite4.C 气泡图
  • antiquewhite4.F 气泡图
  • antiquewhite4.P 气泡图
  • bisque4.C 气泡图
  • bisque4.F 气泡图
  • bisque4.P 气泡图
  • brown4.C 气泡图
  • brown4.F 气泡图
  • brown4.P 气泡图
  • coral1.C 气泡图
  • coral1.F 气泡图
  • coral1.P 气泡图
  • cyan.C 气泡图
  • cyan.F 气泡图
  • cyan.P 气泡图
  • darkgreen.C 气泡图
  • darkgreen.F 气泡图
  • darkgreen.P 气泡图
  • darkmagenta.C 气泡图
  • darkmagenta.F 气泡图
  • darkmagenta.P 气泡图
  • darkseagreen4.C 气泡图
  • darkseagreen4.F 气泡图
  • darkseagreen4.P 气泡图
  • darkturquoise.C 气泡图
  • darkturquoise.F 气泡图
  • darkturquoise.P 气泡图
  • grey60.C 气泡图
  • grey60.F 气泡图
  • grey60.P 气泡图
  • honeydew1.C 气泡图
  • honeydew1.F 气泡图
  • honeydew1.P 气泡图
  • lavenderblush3.C 气泡图
  • lavenderblush3.F 气泡图
  • lavenderblush3.P 气泡图
  • lightcyan.C 气泡图
  • lightcyan.F 气泡图
  • lightcyan.P 气泡图
  • orange.C 气泡图
  • orange.F 气泡图
  • orange.P 气泡图
  • paleturquoise.C 气泡图
  • paleturquoise.F 气泡图
  • paleturquoise.P 气泡图
  • palevioletred3.C 气泡图
  • palevioletred3.F 气泡图
  • palevioletred3.P 气泡图
  • pink.C 气泡图
  • pink.F 气泡图
  • pink.P 气泡图
  • sienna3.C 气泡图
  • sienna3.F 气泡图
  • sienna3.P 气泡图

Fig 7-1-2 GO富集气泡图


GO 富集条形图:(利用P值(或Q值)最小的前20个term来作图,纵坐标为GOterm,横坐标为该GOterm数目占所有差异数目的百分比,颜色越深P/Q值越小,柱子上的数值为该GOterm数量及P/Q值

  • antiquewhite4.C 富集柱形图
  • antiquewhite4.F 富集柱形图
  • antiquewhite4.P 富集柱形图
  • bisque4.C 富集柱形图
  • bisque4.F 富集柱形图
  • bisque4.P 富集柱形图
  • brown4.C 富集柱形图
  • brown4.F 富集柱形图
  • brown4.P 富集柱形图
  • coral1.C 富集柱形图
  • coral1.F 富集柱形图
  • coral1.P 富集柱形图
  • cyan.C 富集柱形图
  • cyan.F 富集柱形图
  • cyan.P 富集柱形图
  • darkgreen.C 富集柱形图
  • darkgreen.F 富集柱形图
  • darkgreen.P 富集柱形图
  • darkmagenta.C 富集柱形图
  • darkmagenta.F 富集柱形图
  • darkmagenta.P 富集柱形图
  • darkseagreen4.C 富集柱形图
  • darkseagreen4.F 富集柱形图
  • darkseagreen4.P 富集柱形图
  • darkturquoise.C 富集柱形图
  • darkturquoise.F 富集柱形图
  • darkturquoise.P 富集柱形图
  • grey60.C 富集柱形图
  • grey60.F 富集柱形图
  • grey60.P 富集柱形图
  • honeydew1.C 富集柱形图
  • honeydew1.F 富集柱形图
  • honeydew1.P 富集柱形图
  • lavenderblush3.C 富集柱形图
  • lavenderblush3.F 富集柱形图
  • lavenderblush3.P 富集柱形图
  • lightcyan.C 富集柱形图
  • lightcyan.F 富集柱形图
  • lightcyan.P 富集柱形图
  • orange.C 富集柱形图
  • orange.F 富集柱形图
  • orange.P 富集柱形图
  • paleturquoise.C 富集柱形图
  • paleturquoise.F 富集柱形图
  • paleturquoise.P 富集柱形图
  • palevioletred3.C 富集柱形图
  • palevioletred3.F 富集柱形图
  • palevioletred3.P 富集柱形图
  • pink.C 富集柱形图
  • pink.F 富集柱形图
  • pink.P 富集柱形图
  • sienna3.C 富集柱形图
  • sienna3.F 富集柱形图
  • sienna3.P 富集柱形图

Fig 7-1-3 GO富集条形图


7.2 KO富集分析

在生物体内,不同基因相互协调行使其生物学,基于Pathway的分析有助于更进一步了解基因的生物学功能。KEGG[7]是有关Pathway的主要公共数据库。Pathway显著性富集分析以KEGG Pathway为单位,应用超几何检验,找出与整个基因组背景相比,在基因中显著性富集的Pathway。通过Pathway显著性富集能确定基因参与的最主要生化代谢途径和信号转导途径。

基因集 Pathway 富集结果 Pathway 注释表
antiquewhite4 antiquewhite4.htm antiquewhite4.path.xls
bisque4 bisque4.htm bisque4.path.xls
brown4 brown4.htm brown4.path.xls
coral1 coral1.htm coral1.path.xls
cyan cyan.htm cyan.path.xls
darkgreen darkgreen.htm darkgreen.path.xls
darkmagenta darkmagenta.htm darkmagenta.path.xls
darkseagreen4 darkseagreen4.htm darkseagreen4.path.xls
darkturquoise darkturquoise.htm darkturquoise.path.xls
grey60 grey60.htm grey60.path.xls
honeydew1 honeydew1.htm honeydew1.path.xls
lavenderblush3 lavenderblush3.htm lavenderblush3.path.xls
lightcyan lightcyan.htm lightcyan.path.xls
orange orange.htm orange.path.xls
paleturquoise paleturquoise.htm paleturquoise.path.xls
palevioletred3 palevioletred3.htm palevioletred3.path.xls
pink pink.htm pink.path.xls
sienna3 sienna3.htm sienna3.path.xls

KO 富集气泡图:(利用P值(或Q值)最小的前20个pathway来作图,纵坐标为Pathway,横坐标为富集因子(该Pathway中差异数量除以所有数量),大小表示数量多少,颜色越红P/Q值越小)

  • antiquewhite4
  • bisque4
  • brown4
  • coral1
  • cyan
  • darkgreen
  • darkmagenta
  • darkseagreen4
  • darkturquoise
  • grey60
  • honeydew1
  • lavenderblush3
  • lightcyan
  • orange
  • paleturquoise
  • palevioletred3
  • pink
  • sienna3

Fig 7-2-1 KO富集气泡图


KO 富集条形图:(利用P值(或Q值)最小的前20个Pathway来作图,纵坐标为Pathway,横坐标为该pathway数目占所有差异数目的百分比,颜色越深P/Q值越小,柱子上的数值为该pathway数量及P/Q值

  • antiquewhite4 富集柱形图
  • bisque4 富集柱形图
  • brown4 富集柱形图
  • coral1 富集柱形图
  • cyan 富集柱形图
  • darkgreen 富集柱形图
  • darkmagenta 富集柱形图
  • darkseagreen4 富集柱形图
  • darkturquoise 富集柱形图
  • grey60 富集柱形图
  • honeydew1 富集柱形图
  • lavenderblush3 富集柱形图
  • lightcyan 富集柱形图
  • orange 富集柱形图
  • paleturquoise 富集柱形图
  • palevioletred3 富集柱形图
  • pink 富集柱形图
  • sienna3 富集柱形图

Fig 7-2-2 KO富集条形图


8 参考文献

[1] Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis[J]. Statistical applications in genetics and molecular biology, 2005, 4(1).     返回
[2] Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis[J]. BMC bioinformatics, 2008, 9(1): 559.     返回
[3] Jin J, Zhang H, Kong L, et al. PlantTFDB 3.0: a portal for the functional and evolutionary study of plant transcription factors[J]. Nucleic acids research, 2013, 42(D1): D1182-D1187.     返回
[4] Zhang H M, Chen H, Liu W, et al. AnimalTFDB: a comprehensive animal transcription factor database[J]. Nucleic acids research, 2011, 40(D1): D144-D149.     返回
[5] Shannon P, Markiel A, Ozier O, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks[J]. Genome research, 2003, 13(11): 2498-2504.     返回
[6] Ashburner M, Ball C A, Blake J A, et al. Gene ontology: tool for the unification of biology[J]. Nature genetics, 2000, 25(1): 25.     返回
[7] Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes[J]. Nucleic acids research, 2000, 28(1): 27-30.     返回

9 附录

9.1 分析方法英文文档
9.2 结果目录结构
result
├── 1.Filter                                          数据过滤
│   |-- removeGene.xls                                    过滤掉的基因列表
│   |-- removeSample.xls                                  过滤掉的样本列表
├── 2.module_construction                             模块划分
│   |-- 1.sampleClustering.pdf/png                        样本层次聚类树图
│   |-- 2.softPower.pdf/png                               Power值曲线图
│   |-- 3.eigengeneClustering.pdf/png                     模块特征值聚类图
│   |-- 4.ModuleTree.pdf/png                              模块层次聚类图
│   |-- 4.netcolor2gene.xls                               基因-模块对应关系列表
│   |-- 5.module_gene.xls                                 各模块基因统计表
│   |-- 5.module_gene.pdf/png                             各模块基因数柱状图
├── 3.basic_info                                      模块相关信息
│   |-- 1.ModuleModuleMembership.xls/pdf/png              模块-模块相关性结果
│   |-- 2.geneModuleMembership.xls/pdf/png                模块-基因相关性结果
│   |-- 3.SampleExpressionPattern.xls/pdf/png             样本表达模式分析
│   |-- 4.hubGene.xls                                     Hub基因相关结果
│   |-- 5.ModuleTraitRelation.xls/pdf/png                 性状关联相关性分析
│   |-- TF.xls                                            TF相关性结果
│   |-- TFmodule.xls/pdf/png                              TF因子统计结果
│   |-- MM-GS-*-Scatter.png                               某性状最正相关和最负相关模块的MM 和 GS相关性散点图
│   |-- K.in-GS-*-Scatter.png                             某性状最正相关和最负相关模块的K.in 和 GS相关性散点图
│   |-- moduleGS-*-histogram.png                          各模块与某性状的GS的柱状图
├── 4.modules                                         各模块结果
│   |-- all.glist.xls                                     模块基因总表
│   |-- *.glist.xls                                       各模块基因信息表格
│   |-- *_Express.pdf/png                                 各模块基因表达模式热图
│   |-- cytoscape                                         网络图所需文件目录
├── 5.enrichment                                      富集分析 
│   |-- *.glist                                           各模块基因列表
│   |-- KO                                                各模块基因KO富集分析
│   |-- GO                                                各模块基因GO富集分析
├── src                                               结果报告内容                                    
│   |-- css                                               结题报告css脚本
│   |-- js                                                结题报告js脚本
│   |-- doc                                               结题报告说明文档
│   |-- image                                             结题报告图片
├── index.html                                        网页版结题报告
9.3 文章引用与致谢

论文引用:

如果您的研究课题使用了基迪奥的测序和分析服务,我们期望您在论文发表时,在Method部分或Acknowledgements部分引用或提及基迪奥公司。以下语句可供参考:

Method部分:

The cDNA / DNA / Small RNA libraries were sequenced on the Illumina sequencing platform by Genedenovo Biotechnology Co., Ltd (Guangzhou, China).

Acknowledgements部分:

We are grateful to/thank Guangzhou Genedenovo Biotechnology Co., Ltd for assisting in sequencing and/or bioinformatics analysis.

帮助文档