Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_081180.1:
Cluster 1Saccharide53463725502901156.53Glyco_hydro_1, adh_short_C2, p450---
Cluster 2Saccharide200617572013957877.822OG-FeII_Oxy, DIOX_N, Glyco_hydro_1, Peptidase_S10---
Cluster 3Saccharide505232315057911355.88Epimerase, UDPGT_2, p450flavonoid--
Cluster 4Saccharide6002300960201524178.51UDPGT_2, p450, polyprenyl_synt*saccharide--
The following clusters are from record NC_081181.1:
Cluster 5Cyclopeptide75593616783962452802.63BURP---
Cluster 6Fatty_acid-Saccharide9189393592303762409.83FA_hydroxylase, Methyltransf_11, UDPGT_2flavonoid, oleananes--
Cluster 7Saccharide106323187106763597440.41ADH_N, ADH_zinc_N, Epimerase, Methyltransf_7, UDPGT_2flavonoid, oleananes--
The following clusters are from record NC_081182.1:
Cluster 8Cyclopeptide12838717143798831541.17BURP---
Cluster 9Cyclopeptide13423719144711711047.45BURP---
Cluster 10Fatty_acid4874037549028434288.06ADH_N, ADH_zinc_N, FA_desaturase_2, Peptidase_S10---
Cluster 11Fatty_acid-Saccharide6507289365598999526.11FA_desaturase_2, NAD_binding_1, Peptidase_S10, UDPGT_2cyanogenic glucoside-4, monoterpenoid-4--
Cluster 12Saccharide211940764212187324246.56Aminotran_1_2, Cellulose_synt, Epimerase---
The following clusters are from record NC_081183.1:
Cluster 13Saccharide1823692518580598343.672OG-FeII_Oxy, DIOX_N, Glycos_transf_1---
Cluster 14Cyclopeptide1202113411214594371248.10BURP---
Cluster 15Fatty_acid-Terpene-Alkaloid140501508140942755441.25BBE, Epimerase, FAD_binding_4, FA_hydroxylase, Terpene_synth, Terpene_synth_C---
Cluster 16Cyclopeptide178275720179082296806.58BURP---
Cluster 17Cyclopeptide1783240111797787101454.70BURP---
Cluster 18Polyketide197100185197239334139.153Beta_HSD, AMP-binding, Chal_sti_synt_C, Oxidored_FMN---
The following clusters are from record NC_081184.1:
Cluster 19Cyclopeptide75899235792332493334.01BURP---
Cluster 20Lignan-Saccharide107003866107293244289.38Dirigent, UDPGT_2, p450*saccharide--
Cluster 21Saccharide112839245113146412307.17AMP-binding, Amino_oxidase, Glyco_hydro_1, Lipoxygenase---
Cluster 22Terpene139867633140142542274.91Epimerase, Terpene_synth, Terpene_synth_C---
Cluster 23Fatty_acid-Alkaloid14752420114761549791.30BBE, FAD_binding_4, FA_hydroxylase---
Cluster 24Alkaloid147796592148042492245.90BBE, FAD_binding_4, Methyltransf_11---
Cluster 25Terpene-Polyketide157593453157715705122.25Chal_sti_synt_C, Chal_sti_synt_N, Methyltransf_11, Prenyltrans, Transferase---
Cluster 26Cyclopeptide159082940159926238843.30BURP---
The following clusters are from record NC_081185.1:
Cluster 27Phenolamide-Saccharide-Alkaloid2302861823315537286.92Glyco_hydro_1, Pyridoxal_deC, Transferase---
Cluster 28Fatty_acid7485400874968967114.96AMP-binding, Transferase, p450---
Cluster 29Lignan-Saccharide101845139102109089263.95Aldo_ket_red, Dirigent, Methyltransf_11, UDPGT_2flavonoid-4, oleananes-4--
Cluster 30Saccharide102177382102593696416.31Aldo_ket_red, Epimerase, Methyltransf_11, UDPGT_2, adh_short_C2flavonoid-5, oleananes-5--
Cluster 31Cyclopeptide1152798041164744291194.62BURP---
Cluster 32Fatty_acid130728644130841739113.09CER1-like_C, FA_hydroxylase, Lycopene_cycl, Methyltransf_11---
Cluster 33Saccharide15708284415716451281.67Acetyltransf_1, Epimerase, UDPGT_2small phenolic--
The following clusters are from record NC_081186.1:
Cluster 34Fatty_acid3783359238035634202.04FA_desaturase, Methyltransf_11, Transferase---
Cluster 35Cyclopeptide50474192520649761590.78BURP---
Cluster 36Cyclopeptide5126964351945586675.94BURP---
Cluster 37Cyclopeptide7498954875629424639.88BURP---
Cluster 38Saccharide880782908812544147.15Acetyltransf_1, Lyase_aromatic, UDPGT_2---
Cluster 39Terpene8997536990375791400.42Terpene_synth, Terpene_synth_C, p450---
Cluster 40Saccharide984093349846500255.67UDPGT_2, p450small phenolic-4--
Cluster 41Fatty_acid-Saccharide100070768100246005175.24CER1-like_C, FA_hydroxylase, Methyltransf_11, UDPGT_2small phenolic--
Cluster 42Cyclopeptide103769367104272802503.44BURP---
Cluster 43Putative117897938118218863320.93ADH_N, ADH_zinc_N, AMP-binding, Abhydrolase_3, p450---
Cluster 44Alkaloid127926819128392737465.92Bet_v_1, NAD_binding_1, adh_short---
The following clusters are from record NW_026704941.1:
Cluster 45Saccharide10601321270765210.63AMP-binding, Amino_oxidase, Glyco_hydro_1, UDPGT_2---
The following clusters are from record NW_026704977.1:
Cluster 46Fatty_acid12143471345981131.63CER1-like_C, FA_hydroxylase, Lycopene_cycl, Methyltransf_11---
The following clusters are from record NW_026704997.1:
Cluster 47Cyclopeptide23769117991121561.42BURP---
The following clusters are from record NW_026705087.1:
Cluster 48Saccharide205247463519258.27Cellulose_synt, Transferase---
The following clusters are from record NW_026705146.1:
Cluster 49Cyclopeptide111704911170.49BURP---
The following clusters are from record NW_026705156.1:
Cluster 50Polyketide7432071117375374.17Chal_sti_synt_C, Chal_sti_synt_N, adh_short---
The following clusters are from record NW_026705200.1:
Cluster 51Cyclopeptide1699046699.04BURP---
The following clusters are from record NW_026705238.1:
Cluster 52Fatty_acid-Saccharide260194915933655.74ADH_N, ADH_zinc_N, FA_desaturase_2, Lipoxygenase, NAD_binding_1, Peptidase_S10, UDPGT_2, adh_shortcyanogenic glucoside-8, monoterpenoid-8--
The following clusters are from record NW_026705502.1:
Cluster 53Cyclopeptide1437028437.03BURP---
The following clusters are from record NW_026705654.1:
Cluster 54Saccharide219592319723100.13Cellulose_synt, Transferase---
The following clusters are from record NW_026705869.1:
Cluster 55Fatty_acid-Alkaloid163682342477178.79BBE, FAD_binding_4, FA_hydroxylase---
The following clusters are from record NW_026706016.1:
Cluster 56Polyketide-Alkaloid1303682303.68Chal_sti_synt_C, Pyridoxal_deC, p450---

NC_081180 - Cluster 1 - Saccharide

Gene cluster description

NC_081180 - Gene Cluster 1. Type = saccharide. Location: 5346372 - 5502901 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081180 - Cluster 2 - Saccharide

Gene cluster description

NC_081180 - Gene Cluster 2. Type = saccharide. Location: 20061757 - 20139578 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081180 - Cluster 3 - Saccharide

Gene cluster description

NC_081180 - Gene Cluster 3. Type = saccharide. Location: 50523231 - 50579113 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081180 - Cluster 4 - Saccharide

Gene cluster description

NC_081180 - Gene Cluster 4. Type = saccharide. Location: 60023009 - 60201524 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081181 - Cluster 5 - Cyclopeptide

Gene cluster description

NC_081181 - Gene Cluster 5. Type = cyclopeptide. Location: 75593616 - 78396245 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081181 - Cluster 6 - Fatty_acid-saccharide

Gene cluster description

NC_081181 - Gene Cluster 6. Type = fatty_acid-saccharide. Location: 91893935 - 92303762 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081181 - Cluster 7 - Saccharide

Gene cluster description

NC_081181 - Gene Cluster 7. Type = saccharide. Location: 106323187 - 106763597 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081182 - Cluster 8 - Cyclopeptide

Gene cluster description

NC_081182 - Gene Cluster 8. Type = cyclopeptide. Location: 12838717 - 14379883 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC131660367
Repeat occurs 23 times in a sequence of 761 amino acids
Location between 13471307 and 13474113
Coverage of 30.22 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTQ | YLDGWLKDTR | YLDGWLKDTL | YLDGWLKDTQ | YLDGWLKDIR
YLDGWLKDSH | YLDGWLKVSH | YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[IVDN][ISTF][HKLQPR]
MAHRVVLSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKDQISSPDSNQVYLDGWL
KDTR
DEKAKVNPDSNQVYLDGWLKDTRDEKVKVNPDSNQVYLDGWLKDTRDEKAKVNSNLEQVY
LDGWLKDTR
AEKAQANPDSNQVYLDGWLKDTRAEKEKVNPDSHQVYLDGWLKDTRAEKTKANLN
SNQLYLDGWLKDTRAENAKSNSDSNQVYLDGWLKDTRVENAKYNPDSNQVYLDGWLKDTRAEKE
KVNPDSNQVYLDGWLKDTRDENAKSNPDSNQVYLDGWLKDTRAENANSNQIYLDGWLKDTRAEK
VKANLDSNQVYLDGWLKDTRAEKEKVNPDSNQVYLDGWLKDTRNENAKSNPDSNQVYLDGWLKD
TQ
AKNANSNQVYLDGWLKDTRAENAKSNQVYLDGWLKDTLAENAKSNLDSNQVYLDGWLKDTQA
LKEKSTLDSNQLYLDGWLKDIRVGKEMSSPNSNQVYLDGWLKDSHAENSKQAYLDGWLKVSHAK
NHIKIGQDLAESNRKLPSKVDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYAKFLPKKVADDIP
VSKSQIPSLLDLFSLTKDSPQGEDMIGIINQCEYEPQKGETKACSTSLESMVEFVHSVIGADTK
YDIHSTSYPTTSAVPLQNYTVLDISEDIYAPKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKG
QYGDTMDALGICHLDTSAMNPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131654939
Repeat occurs 23 times in a sequence of 554 amino acids
Location between 13690775 and 13693039
Coverage of 24.91 %
Instances:
WLKNPQ | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTR | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTR | WLKDTR | WLKDTR | WLKDSR | WLKDTR
WLKDTR | WLKDTR | WLKDIR | WLKDTR | WLKDTR
WLKDMR | WLKDTR | WLKDTR |
pattern: WLK[DN][IPSMT][QR]
MAYRVVLSLLPFLMLSLINDHGSFARDLNQVDQPYLDGWLKNPQLRNQQPSPDSNQVYLDGWL
KDTR
VEKAKSNPNSNQVYLDGWLKDTRAEKAKVNLDSNQVYLDGWLKDTRGEKEKANTDSTQVY
LDGWLKDTRVEKAKINPDSTQVYLDGWLKDTRAEKSKVIPDSKQVYIDGWLKDTRAEKEIANPN
SNQIYLDGWLKDTRAEKVNAHPKSNQVYLDGWLKDTRANPDSKQVYLDGWLKDTRAEKAKSNSD
SNQVYLDGWLKDTRAEKEKSNSDSNIVYLDGWLKDTRGEKAKTNPDSTQVYLDGWLKDTRSDQV
KVNPNSNQVYLDGWLKDSRAKKANPDSKQVYLDGWLKDTRAEKENSTPDSNQVYLDGWLKDTRA
KKAKSNSDSNQVYLDGWLKDTRAEKVKSTQESNQVYLDGWLKDIRSEKEKSTPDSNQVYLDGWL
KDTR
AEKAKANSESNQVYLDGWLKDTRAKKAESNFDSNQVYLDGWLKDMRAEKAKVNPNSNQVY
LDGWLKDTRDLKEKSTLDSKQVYLDGWLKDTRVEKENLSPNSN
Repeat found in LOC131660373
Repeat occurs 16 times in a sequence of 604 amino acids
Location between 13525543 and 13527875
Coverage of 26.49 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTG | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKNTQ | YLDGWLKDIR | YLDGWLKDSH | YLDGWLKDSH
YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[IDN][ISTF][GHKQPR]
MTHRVVLSLLPFLLLWLINNHGSFARDMNQVDQPYLDGWLKNTPLKNQIPSPDSNQVYLDGWL
KDTR
GEKEKVNFDSNQVYLDGWLKDTGSEKAKANPDSNQVYLDGWLKDTRVENAKSNPDSNQVY
LDGWLKDTR
DENAKSNPDSNQVYLDGWLKDTRAENANSNHVYLDGWLKDTRDEKVIANLDSNQV
YLDGWLKDTR
AEKEKVNPNSNQVYLDGWLKDTRNEKAKANPESNQVYLDGWLKDTRAENTDSNQ
VYLDGWLKDTRAENEKSNLVSNQVYLDGWLKNTQALKEKPTLDSNQLYLDGWLKDIRVGKEISS
PNSNQVYLDGWLKDSHVENSKQAYLDGWLKDSHANNHIKTGQAYAESDRKLPSKVDHTEAFKVF
FFGIEDLYVGSVMTLQFPIVEYANFLPKKVADDIPVSKSQIPSLLDLFSLTKDSPKGEDMIGII
NQCEYEPQKGETKACPTSLESMVEFVHSVIGADTKYDIHSTSYPTTSAVPLQNYTVLDISEDIY
APKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKGQYGDIMNALGICHLDTSAMNPTHFYFDLL
GMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131654936
Repeat occurs 14 times in a sequence of 581 amino acids
Location between 13619611 and 13621851
Coverage of 14.46 %
Instances:
WLKNTP | WLKDTR | WLKDTR | WLKDTR | WLKDIR
WLKDTK | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTQ | WLKVRQ | WLKDSH | WLKNSN |
pattern: WLK[VDN][ISRT][HKNQPR]
MAHKVILSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKNQIPSPDSNQVYLDGWL
KDTR
GEKAKVNPDSNQVYLDGWLKDTRDEKAKANPDSNQVYLDGWLKDTRDEKAKANPDSNQVY
LDGWLKDIRAEKVKANPDSNHVFLDGWLKDTKAEKVKDNLDSNQVYLDGWLKDTRAKKEKVNPN
SNQVYLDGWLKDTRDENAKFNSDSTQVYLDGWLKDTRAENADSNQVYLDGWLKDTRAGNANSNQ
VYLDGWLKDTQALKEKSTLDSNQLYLDGWLKVRQVGKEISSPNSNQVYLDGWLKDSHAENSKQA
YLDGWLKNSNAKNHIKIGQDLVESNRKQPSKVDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYA
NFLPKKVADDIPVSKSQIPSLLDLFSLTKDSPQGEDMIGIINQCEYEPQKGETKACPTSLESMV
EFVHSVIGADTKYDIHSTSYPTTSATPLQNYTVLEISEDIYAPKWVACHPRPYPYLLHYCHYLD
IGSKIFKVLLKGEYGDTMDALGICHLDTSAMNPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPR
PPVATK
Repeat found in LOC131654944
Repeat occurs 6 times in a sequence of 210 amino acids
Location between 13829284 and 13830780
Coverage of 20.0 %
Instances:
EPRPNIS | EPRPNVS | EPRPNIS | EPRPNIS | EPRPNIS
EPRPSLT |
pattern: EPRP[SN][ILV][ST]
The following known motifs were found:
FEPR was found 6 times in this sequence
MRPALALLPLFLLLIVAVVESRKDVGEYWKLVMKEQNMPEEIQGLLDANNKNNLKTLKQSFDD
KEKKNTVKDFEPRPNISAYEDEDVDVEEKKIGVEDFEPRPNVSAYGDKSIDAEEKKKGEEDFEP
R
PNISAYGDNGIEDSKKNNVDQDFEPRPNISSYGNNDIGNKEKEKAVEDFEPRPNISAYGNNDI
HAEFKEDFEPRPSLTKYDA
Repeat found in LOC131657576
Repeat occurs 4 times in a sequence of 616 amino acids
Location between 13441381 and 13443924
Coverage of 5.19 %
Instances:
QGVRDMQG | QGVRDMQG | QGVRDMQG | QGVHENTQ |
pattern: QGV[HR][ED][MN][QT][QG]
MQGVRDMQGVRDMQGVRDMQGVHENTQNNVMEQEPPAATAPAATTPIEVCTNQPPIGRKRRRA
NANAIRKRSEVWDHFNLIPDSDPATAACKYCHSKYMCDSKKHGTSNLKSHMKTCPKYPLNLSTD
PTQTVLTYSTIPGVGLVPTSSRFDPVGCRKGLAYFIILDEKPFRTVEGEGFKYFCYQMQPQFTI
PSRRTIARDCFQIYLDEKVRLKALFGSDCSRVAITTDCWTSVQNLNYLTLTAHFIDRDWQYQKR
IISFAIIPNHRGKTVGKKVEDVLKEWGLRNVSTITVDNAASNDVAVKYLEQKIRNMNGLFMDGF
GFHMRCCAHILNLVVRDGLKVASTSISSVRNAIRYVRSSPHRALKFNECVGYANITCKKSVCLD
VSTRWNSTYLMLDAAEKYETAFDKLEDEVEDYRDFFEGDSPPSSEDWENVRVFINFLKNYYEAT
TVFSVSTKASLHTAFPFLAGIYVELKSLNLDLNGLFASVAKEMLDKFKKYWIDITKMNQLLYLG
VIFDPRYKLRFVDWCFTDMYGPLSETKKSLLEEINNNLSKIFMVYKQAYDNVEGPVPVMAATVS
NGETAANIEVPAHVARQNAFQQHLKSMDSVEERIPSSVPNC
Repeat found in LOC131660368
Repeat occurs 22 times in a sequence of 742 amino acids
Location between 13454674 and 13457362
Coverage of 29.65 %
Instances:
YLDGWLKNTP | YLDGWLKNTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDNR
YLDGWLKDTQ | YLDGWLKDTR | YLDGWLKDTQ | YLDGWLKDIR | YLDGWLKDSH
YLDGWLKDSH | YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[IDN][ISFTN][QPKRH]
MAHKVVMSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKNQISSTDSNQVYLDGWL
KNTR
DEKAKVNPDSNQVYLDGWLKDTRDEKVKVNPDSNQVYLDGWLKDTRDEKAKANPNLEQVY
LDGWLKDTR
AEKAQANPDSNQVYLDGWLKDTRAKKEKVNPDSHQVYLDGWLKDTRADKTKVNPN
SNQVYLDGWLKDTRAENAKSNSDSNQVYLDGWLKDTRVENVKSNPDSNQVYLDGWLKDTRAEKE
KVNPDSNQVYLDGWLKDTRDENAKSNPDSNQVYLDGWLKDTRAENANSNQVYLDGWLKDTRAEK
VKANLDSNQVYLDGWLKDTRAEKEKFNPDSNQVYLDGWLKDNRDENAKSNPDSNQVYLDGWLKD
TQ
AKNADSNQVYLDGWLKDTRAENAKSNLDSNQVYLDGWLKDTQALKEKLTLDSNQLYLDGWLK
DIR
VGKEMSSPNSNQVYLDGWLKDSHAENSKQAYLDGWLKDSHAKNHIKIGQDLAESNRKLPSK
VDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYAKFLPKKVADDIPVSKSQIPSLLDLFSLTKDS
PQGEDMIGIINQCEYEPQKGETKACPTSLESMVEFVHSVIGADTKYDIHSTSYPTTSAVPLQNY
TVLDISEDIYAPKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKGQYGDTMDALGICHLDTSAM
NPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131660374
Repeat occurs 19 times in a sequence of 493 amino acids
Location between 13532827 and 13534870
Coverage of 38.54 %
Instances:
YLDGWLKNTQ | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWFKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDMR | YLDGWLKDTR |
pattern: YLDGW[LF]K[DN][IMT][QR]
MAHRVVLSLLPFLMLSLINDHGSIARDLNQVDQPYLDGWLKNTQHQQHSPDSNQVYLDGWLKD
TR
VEKVKSNPHSNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRAEKAKLNPDSKQVYLD
GWLKDTR
SEKAKVNPNSNQVYLNGWLKDTRAEREKFNSDSNQVYLDGWLKDTRAEKEKSNSNSN
QVYLDGWLKDTRGEKAKADPGSNQVYLDGWFKDTRSEQVKVNANSNQVYLDGWLKDTRSKKVNP
YSKQVYLDGWLKDTRAEKEKSNSDSNQVYLDGWLKDTRDEKANANPGSTQVYLDGWLKDIRSKQ
VKINPNSNQVYLDGWLKDTRAKNVNPDSKPVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDTR
AEKVKSAPESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRAEKEKANSDSNQVYLDGW
LKDMR
AEKAKVNPNSNQVYLDGWLKDTRVKIKNQTLTPTKFSLMDG
Repeat found in LOC131654943
Repeat occurs 14 times in a sequence of 420 amino acids
Location between 13814614 and 13817152
Coverage of 26.67 %
Instances:
FEPRPNLS | FEPKPNVS | FEPRPNLS | FEPRPNLS | FEPRPNLS
FEPRPNLS | FEPRPNLS | FEPRPNLS | FEPRPNVS | FEPRPNLS
FEPRPNLS | FEPKPVVS | FEPRPNVS | FEPRPSLT |
pattern: FEP[KR]P[SVN][LV][ST]
The following known motifs were found:
FEPR was found 12 times in this sequence
VS[AI]Y was found 2 times in this sequence
MMRPALALLPLLLLLIVGIVESRIDLGEYWKVVVKKQNMVEEIQELLDANTKHNFKTLKQSFD
GKEKKKALKDFEPRPNLSPNLSSYGENDIDDKENKGSIEDFEPKPNVSSYGENDIDAKEKKGAI
ENFEPRPNLSPNLSSYGENDIDDKENKGPIENFEPRPNLSPNLSSYGENKIDAKEKKGAIEDFE
PR
PNLSPNLSLCGENNIDDKENKGPIEDFEPRPNLSPNLSSYGENDIDDKEKKGVIEDFEPRPN
LS
PNLSLCGENNIDDKENKGPIEDFEPRPNLSPNLSSYGENDIDDKEKKGVIEDFEPRPNVSPN
VSSYGENDIDDKENKGPIEDFEPRPNLSPNLSSYGENDINAKEKKRDAIEDFEPRPNLSPNLSS
YAENEIHAKKRKGAIEDFEPKPVVSAYGDNNIDAEEKNKAFKDFEPRPNVSPNVSAYENNDIDV
EYTKDFEPRPSLTKDDA
Repeat found in LOC131654938
Repeat occurs 18 times in a sequence of 435 amino acids
Location between 13654539 and 13656416
Coverage of 41.38 %
Instances:
YLDGWLKNTQ | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLRDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDTR | YLDGWVKDMR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR |
pattern: YLDGW[LV][KR][DN][IMT][QR]
MTHRVVLSLLPFLMLSLINDHGSFARDLNQADQLYLDGWLKNTQLKNQQHSPGSNQVYLDGWL
KDTR
VEKVKSNPDFNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRVEKAKVNHDSKQVY
LDGWLKDTR
SEKAKVNPNSNQVYLDGWLKDTRAEKPKFNSDSNQVYLDGWLKDTRAEKEKSNSD
SNQVYLDGWLRDTRGEKEKSNPNSNQVYLDGWLKDTRAKKANIDSKQVYLDGWLKDTRVEKAES
NSDSSQVYLDGWLKDTRAEKVKSVPESNQVYLDGWLKDIRDEKEKSTPNSNQVYLDGWLKDTRA
EKVKANSELNQVYLDGWLKDTRAEKEKANSNSNHVYLDGWVKDMRAEKEKVNPNSNQVYLDGWL
KDTR
AGNTKSNTGSNQVYLDGWLKDIRAEKEKSAFDSKQVYLDGWLKDTRAE
Repeat found in LOC131654935
Repeat occurs 20 times in a sequence of 483 amino acids
Location between 13578340 and 13580345
Coverage of 24.84 %
Instances:
WLKNIQ | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTR | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTR | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTR | WLKDIR | WLKDTR | WLKDTR | WLKDMR

pattern: WLK[DN][IMT][QR]
MTHRVVMSLLPFLLLLLINGHGIFGRDLNQVDQPYLDGWLKNIQLKKQKPSPDSNQVYLDGWL
KDTR
AEKVKSNSDSNQVYLDGWLKDTRTEKAKVNLDSNQVYLDGWLKDTRVEKAKSNPEPNQVY
LDGWLKDTRTEKAKVNLDSNQVYLDGWLKDTRAEEAKANSDSKQVYLDGWLKDTRSEQVKVNPN
SNQVYLDGWLKDTRATKSNPDSKQVYLDGWLKDTRAEKTKSNSNSNQVYLDGWLKDTRAEKEKS
NSDSNQVYIDGWLKDTRGDKAKANPDSTQVYLDGWLKDTRSEQVKVNPNSNQVYLDGWLKDTRA
KKAKANPNSKQVYLDGWLKDTRDEKEKSTPNSNQVYLDGWLKDTRAEKAKSNSESNQVYIDGWL
KDTR
AEKVKSNPESNQVYLDGWLKDIRTEKEKSTPDSNQVYLDGWLKDTRDEKAKVNLDSNQVY
LDGWLKDTRAEKAKANSDSNQVYLDGWLKDMRAEKA
Repeat found in LOC131654957
Repeat occurs 22 times in a sequence of 531 amino acids
Location between 14321941 and 14324104
Coverage of 24.86 %
Instances:
WLKNPE | WLKDTR | WLKDTR | WLKDTR | WLKDSR
WLKDTR | WLKDTR | WLKDTR | WLKDTR | WLKDTQ
WLKDTR | WLKDTR | WLKDSR | WLKDTR | WLKDTR
WLKDTR | WLKDIL | WLKDTR | WLKDTR | WLKDMR
WLKDTR | WLKDTL |
pattern: WLK[DN][IPSMT][QREL]
MAYRVVLSLLPFLMLSLINDHGSFARHLNQVDQPYLDGWLKNPELRNQQPSPDSNQVYLDGWL
KDTR
VEKAKSNPNSNQVYLDGWLKDTRAEKAKVNLDSNQVYLDGWLKDTRAEKAKINLDSTQVY
LDGWLKDSRAEKSKVIPDSKQVYIDGWLKDTRAEKEIANPNSNQVYLDGWLKDTRAEKVNANPK
SNQVYLDGWLKDTRANPDSKQVYLDGWLKDTRDEKAKSNSDSNQVYLDGWLKDTQTEKEKSNSD
SNIVYLDGWLKDTRGEKAKTNPDSTQVYLDGWLKDTRSDQVKVNPNSNQVYLDGWLKDSRAKKA
NPDTKQGYLDGWLKDTRAKKENSIPDSNQVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDTRD
EKVKSTHESNQVYLDGWLKDILSEKEKSTPDSNQVYLDGWLKDTRVEKAKANSESNQVYLDGWL
KDTR
AKKVESNFDSNQVYLDGWLKDMRAEKAKVNPNSNQVYLDGWLKDTRDLKEKSTLDSKQVY
LDRWLKDTLAVKENSSPNSN
Repeat found in LOC131660374
Repeat occurs 18 times in a sequence of 470 amino acids
Location between 13532827 and 13534870
Coverage of 38.3 %
Instances:
YLDGWLKNTQ | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWFKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDMR | YLDGWLKDTR |
pattern: YLDGW[LF]K[DN][IMT][QR]
MAHRVVLSLLPFLMLSLINDHGSIARDLNQVDQPYLDGWLKNTQHQQHSPDSNQVYLDGWLKD
TR
VEKVKSNPHSNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRAEKAKLNPDSKQVYLD
GWLKDTR
SEKAKVNPNSNQVYLNGWLKDTRAEREKFNSDSNQVYLDGWLKDTRAEKEKSNSNSN
QVYLDGWLKDTRGEKAKADPGSNQVYLDGWFKDTRSEQVKVNANSNQVYLDGWLKDTRSKKVNP
YSKQVYLDGWLKDTRAEKEKSNSDSNQVYLDGWLKDTRDEKANANPGSTQVYLDGWLKDIRSKQ
VKINPNSNQVYLDGWLKDTRAKNVNPDSKPVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDIR
AEKEKSTPDSNQVYLDGWLKDTRAEKEKANSDSNQVYLDGWLKDMRAEKAKVNPNSNQVYLDGW
LKDTR
VKIKNQTLTPTKFSLMDG

Similar gene clusters

NC_081182 - Cluster 9 - Cyclopeptide

Gene cluster description

NC_081182 - Gene Cluster 9. Type = cyclopeptide. Location: 13423719 - 14471171 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC131660367
Repeat occurs 23 times in a sequence of 761 amino acids
Location between 13471307 and 13474113
Coverage of 30.22 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTQ | YLDGWLKDTR | YLDGWLKDTL | YLDGWLKDTQ | YLDGWLKDIR
YLDGWLKDSH | YLDGWLKVSH | YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[IVDN][ISTF][HKLQPR]
MAHRVVLSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKDQISSPDSNQVYLDGWL
KDTR
DEKAKVNPDSNQVYLDGWLKDTRDEKVKVNPDSNQVYLDGWLKDTRDEKAKVNSNLEQVY
LDGWLKDTR
AEKAQANPDSNQVYLDGWLKDTRAEKEKVNPDSHQVYLDGWLKDTRAEKTKANLN
SNQLYLDGWLKDTRAENAKSNSDSNQVYLDGWLKDTRVENAKYNPDSNQVYLDGWLKDTRAEKE
KVNPDSNQVYLDGWLKDTRDENAKSNPDSNQVYLDGWLKDTRAENANSNQIYLDGWLKDTRAEK
VKANLDSNQVYLDGWLKDTRAEKEKVNPDSNQVYLDGWLKDTRNENAKSNPDSNQVYLDGWLKD
TQ
AKNANSNQVYLDGWLKDTRAENAKSNQVYLDGWLKDTLAENAKSNLDSNQVYLDGWLKDTQA
LKEKSTLDSNQLYLDGWLKDIRVGKEMSSPNSNQVYLDGWLKDSHAENSKQAYLDGWLKVSHAK
NHIKIGQDLAESNRKLPSKVDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYAKFLPKKVADDIP
VSKSQIPSLLDLFSLTKDSPQGEDMIGIINQCEYEPQKGETKACSTSLESMVEFVHSVIGADTK
YDIHSTSYPTTSAVPLQNYTVLDISEDIYAPKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKG
QYGDTMDALGICHLDTSAMNPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131660374
Repeat occurs 19 times in a sequence of 493 amino acids
Location between 13532827 and 13534870
Coverage of 38.54 %
Instances:
YLDGWLKNTQ | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWFKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDMR | YLDGWLKDTR |
pattern: YLDGW[LF]K[DN][IMT][QR]
MAHRVVLSLLPFLMLSLINDHGSIARDLNQVDQPYLDGWLKNTQHQQHSPDSNQVYLDGWLKD
TR
VEKVKSNPHSNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRAEKAKLNPDSKQVYLD
GWLKDTR
SEKAKVNPNSNQVYLNGWLKDTRAEREKFNSDSNQVYLDGWLKDTRAEKEKSNSNSN
QVYLDGWLKDTRGEKAKADPGSNQVYLDGWFKDTRSEQVKVNANSNQVYLDGWLKDTRSKKVNP
YSKQVYLDGWLKDTRAEKEKSNSDSNQVYLDGWLKDTRDEKANANPGSTQVYLDGWLKDIRSKQ
VKINPNSNQVYLDGWLKDTRAKNVNPDSKPVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDTR
AEKVKSAPESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRAEKEKANSDSNQVYLDGW
LKDMR
AEKAKVNPNSNQVYLDGWLKDTRVKIKNQTLTPTKFSLMDG
Repeat found in LOC131660373
Repeat occurs 16 times in a sequence of 604 amino acids
Location between 13525543 and 13527875
Coverage of 26.49 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTG | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKNTQ | YLDGWLKDIR | YLDGWLKDSH | YLDGWLKDSH
YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[IDN][ISTF][GHKQPR]
MTHRVVLSLLPFLLLWLINNHGSFARDMNQVDQPYLDGWLKNTPLKNQIPSPDSNQVYLDGWL
KDTR
GEKEKVNFDSNQVYLDGWLKDTGSEKAKANPDSNQVYLDGWLKDTRVENAKSNPDSNQVY
LDGWLKDTR
DENAKSNPDSNQVYLDGWLKDTRAENANSNHVYLDGWLKDTRDEKVIANLDSNQV
YLDGWLKDTR
AEKEKVNPNSNQVYLDGWLKDTRNEKAKANPESNQVYLDGWLKDTRAENTDSNQ
VYLDGWLKDTRAENEKSNLVSNQVYLDGWLKNTQALKEKPTLDSNQLYLDGWLKDIRVGKEISS
PNSNQVYLDGWLKDSHVENSKQAYLDGWLKDSHANNHIKTGQAYAESDRKLPSKVDHTEAFKVF
FFGIEDLYVGSVMTLQFPIVEYANFLPKKVADDIPVSKSQIPSLLDLFSLTKDSPKGEDMIGII
NQCEYEPQKGETKACPTSLESMVEFVHSVIGADTKYDIHSTSYPTTSAVPLQNYTVLDISEDIY
APKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKGQYGDIMNALGICHLDTSAMNPTHFYFDLL
GMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131660374
Repeat occurs 18 times in a sequence of 470 amino acids
Location between 13532827 and 13534870
Coverage of 38.3 %
Instances:
YLDGWLKNTQ | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWFKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDMR | YLDGWLKDTR |
pattern: YLDGW[LF]K[DN][IMT][QR]
MAHRVVLSLLPFLMLSLINDHGSIARDLNQVDQPYLDGWLKNTQHQQHSPDSNQVYLDGWLKD
TR
VEKVKSNPHSNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRAEKAKLNPDSKQVYLD
GWLKDTR
SEKAKVNPNSNQVYLNGWLKDTRAEREKFNSDSNQVYLDGWLKDTRAEKEKSNSNSN
QVYLDGWLKDTRGEKAKADPGSNQVYLDGWFKDTRSEQVKVNANSNQVYLDGWLKDTRSKKVNP
YSKQVYLDGWLKDTRAEKEKSNSDSNQVYLDGWLKDTRDEKANANPGSTQVYLDGWLKDIRSKQ
VKINPNSNQVYLDGWLKDTRAKNVNPDSKPVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDIR
AEKEKSTPDSNQVYLDGWLKDTRAEKEKANSDSNQVYLDGWLKDMRAEKAKVNPNSNQVYLDGW
LKDTR
VKIKNQTLTPTKFSLMDG
Repeat found in LOC131654939
Repeat occurs 23 times in a sequence of 554 amino acids
Location between 13690775 and 13693039
Coverage of 24.91 %
Instances:
WLKNPQ | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTR | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTR | WLKDTR | WLKDTR | WLKDSR | WLKDTR
WLKDTR | WLKDTR | WLKDIR | WLKDTR | WLKDTR
WLKDMR | WLKDTR | WLKDTR |
pattern: WLK[DN][IPSMT][QR]
MAYRVVLSLLPFLMLSLINDHGSFARDLNQVDQPYLDGWLKNPQLRNQQPSPDSNQVYLDGWL
KDTR
VEKAKSNPNSNQVYLDGWLKDTRAEKAKVNLDSNQVYLDGWLKDTRGEKEKANTDSTQVY
LDGWLKDTRVEKAKINPDSTQVYLDGWLKDTRAEKSKVIPDSKQVYIDGWLKDTRAEKEIANPN
SNQIYLDGWLKDTRAEKVNAHPKSNQVYLDGWLKDTRANPDSKQVYLDGWLKDTRAEKAKSNSD
SNQVYLDGWLKDTRAEKEKSNSDSNIVYLDGWLKDTRGEKAKTNPDSTQVYLDGWLKDTRSDQV
KVNPNSNQVYLDGWLKDSRAKKANPDSKQVYLDGWLKDTRAEKENSTPDSNQVYLDGWLKDTRA
KKAKSNSDSNQVYLDGWLKDTRAEKVKSTQESNQVYLDGWLKDIRSEKEKSTPDSNQVYLDGWL
KDTR
AEKAKANSESNQVYLDGWLKDTRAKKAESNFDSNQVYLDGWLKDMRAEKAKVNPNSNQVY
LDGWLKDTRDLKEKSTLDSKQVYLDGWLKDTRVEKENLSPNSN
Repeat found in LOC131654935
Repeat occurs 20 times in a sequence of 483 amino acids
Location between 13578340 and 13580345
Coverage of 24.84 %
Instances:
WLKNIQ | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTR | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTR | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTR | WLKDIR | WLKDTR | WLKDTR | WLKDMR

pattern: WLK[DN][IMT][QR]
MTHRVVMSLLPFLLLLLINGHGIFGRDLNQVDQPYLDGWLKNIQLKKQKPSPDSNQVYLDGWL
KDTR
AEKVKSNSDSNQVYLDGWLKDTRTEKAKVNLDSNQVYLDGWLKDTRVEKAKSNPEPNQVY
LDGWLKDTRTEKAKVNLDSNQVYLDGWLKDTRAEEAKANSDSKQVYLDGWLKDTRSEQVKVNPN
SNQVYLDGWLKDTRATKSNPDSKQVYLDGWLKDTRAEKTKSNSNSNQVYLDGWLKDTRAEKEKS
NSDSNQVYIDGWLKDTRGDKAKANPDSTQVYLDGWLKDTRSEQVKVNPNSNQVYLDGWLKDTRA
KKAKANPNSKQVYLDGWLKDTRDEKEKSTPNSNQVYLDGWLKDTRAEKAKSNSESNQVYIDGWL
KDTR
AEKVKSNPESNQVYLDGWLKDIRTEKEKSTPDSNQVYLDGWLKDTRDEKAKVNLDSNQVY
LDGWLKDTRAEKAKANSDSNQVYLDGWLKDMRAEKA
Repeat found in LOC131657576
Repeat occurs 4 times in a sequence of 616 amino acids
Location between 13441381 and 13443924
Coverage of 5.19 %
Instances:
QGVRDMQG | QGVRDMQG | QGVRDMQG | QGVHENTQ |
pattern: QGV[HR][ED][MN][QT][QG]
MQGVRDMQGVRDMQGVRDMQGVHENTQNNVMEQEPPAATAPAATTPIEVCTNQPPIGRKRRRA
NANAIRKRSEVWDHFNLIPDSDPATAACKYCHSKYMCDSKKHGTSNLKSHMKTCPKYPLNLSTD
PTQTVLTYSTIPGVGLVPTSSRFDPVGCRKGLAYFIILDEKPFRTVEGEGFKYFCYQMQPQFTI
PSRRTIARDCFQIYLDEKVRLKALFGSDCSRVAITTDCWTSVQNLNYLTLTAHFIDRDWQYQKR
IISFAIIPNHRGKTVGKKVEDVLKEWGLRNVSTITVDNAASNDVAVKYLEQKIRNMNGLFMDGF
GFHMRCCAHILNLVVRDGLKVASTSISSVRNAIRYVRSSPHRALKFNECVGYANITCKKSVCLD
VSTRWNSTYLMLDAAEKYETAFDKLEDEVEDYRDFFEGDSPPSSEDWENVRVFINFLKNYYEAT
TVFSVSTKASLHTAFPFLAGIYVELKSLNLDLNGLFASVAKEMLDKFKKYWIDITKMNQLLYLG
VIFDPRYKLRFVDWCFTDMYGPLSETKKSLLEEINNNLSKIFMVYKQAYDNVEGPVPVMAATVS
NGETAANIEVPAHVARQNAFQQHLKSMDSVEERIPSSVPNC
Repeat found in LOC131654936
Repeat occurs 14 times in a sequence of 581 amino acids
Location between 13619611 and 13621851
Coverage of 14.46 %
Instances:
WLKNTP | WLKDTR | WLKDTR | WLKDTR | WLKDIR
WLKDTK | WLKDTR | WLKDTR | WLKDTR | WLKDTR
WLKDTQ | WLKVRQ | WLKDSH | WLKNSN |
pattern: WLK[VDN][ISRT][HKNQPR]
MAHKVILSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKNQIPSPDSNQVYLDGWL
KDTR
GEKAKVNPDSNQVYLDGWLKDTRDEKAKANPDSNQVYLDGWLKDTRDEKAKANPDSNQVY
LDGWLKDIRAEKVKANPDSNHVFLDGWLKDTKAEKVKDNLDSNQVYLDGWLKDTRAKKEKVNPN
SNQVYLDGWLKDTRDENAKFNSDSTQVYLDGWLKDTRAENADSNQVYLDGWLKDTRAGNANSNQ
VYLDGWLKDTQALKEKSTLDSNQLYLDGWLKVRQVGKEISSPNSNQVYLDGWLKDSHAENSKQA
YLDGWLKNSNAKNHIKIGQDLVESNRKQPSKVDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYA
NFLPKKVADDIPVSKSQIPSLLDLFSLTKDSPQGEDMIGIINQCEYEPQKGETKACPTSLESMV
EFVHSVIGADTKYDIHSTSYPTTSATPLQNYTVLEISEDIYAPKWVACHPRPYPYLLHYCHYLD
IGSKIFKVLLKGEYGDTMDALGICHLDTSAMNPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPR
PPVATK
Repeat found in LOC131654943
Repeat occurs 14 times in a sequence of 420 amino acids
Location between 13814614 and 13817152
Coverage of 26.67 %
Instances:
FEPRPNLS | FEPKPNVS | FEPRPNLS | FEPRPNLS | FEPRPNLS
FEPRPNLS | FEPRPNLS | FEPRPNLS | FEPRPNVS | FEPRPNLS
FEPRPNLS | FEPKPVVS | FEPRPNVS | FEPRPSLT |
pattern: FEP[KR]P[SVN][LV][ST]
The following known motifs were found:
FEPR was found 12 times in this sequence
VS[AI]Y was found 2 times in this sequence
MMRPALALLPLLLLLIVGIVESRIDLGEYWKVVVKKQNMVEEIQELLDANTKHNFKTLKQSFD
GKEKKKALKDFEPRPNLSPNLSSYGENDIDDKENKGSIEDFEPKPNVSSYGENDIDAKEKKGAI
ENFEPRPNLSPNLSSYGENDIDDKENKGPIENFEPRPNLSPNLSSYGENKIDAKEKKGAIEDFE
PR
PNLSPNLSLCGENNIDDKENKGPIEDFEPRPNLSPNLSSYGENDIDDKEKKGVIEDFEPRPN
LS
PNLSLCGENNIDDKENKGPIEDFEPRPNLSPNLSSYGENDIDDKEKKGVIEDFEPRPNVSPN
VSSYGENDIDDKENKGPIEDFEPRPNLSPNLSSYGENDINAKEKKRDAIEDFEPRPNLSPNLSS
YAENEIHAKKRKGAIEDFEPKPVVSAYGDNNIDAEEKNKAFKDFEPRPNVSPNVSAYENNDIDV
EYTKDFEPRPSLTKDDA
Repeat found in LOC131654957
Repeat occurs 22 times in a sequence of 531 amino acids
Location between 14321941 and 14324104
Coverage of 24.86 %
Instances:
WLKNPE | WLKDTR | WLKDTR | WLKDTR | WLKDSR
WLKDTR | WLKDTR | WLKDTR | WLKDTR | WLKDTQ
WLKDTR | WLKDTR | WLKDSR | WLKDTR | WLKDTR
WLKDTR | WLKDIL | WLKDTR | WLKDTR | WLKDMR
WLKDTR | WLKDTL |
pattern: WLK[DN][IPSMT][QREL]
MAYRVVLSLLPFLMLSLINDHGSFARHLNQVDQPYLDGWLKNPELRNQQPSPDSNQVYLDGWL
KDTR
VEKAKSNPNSNQVYLDGWLKDTRAEKAKVNLDSNQVYLDGWLKDTRAEKAKINLDSTQVY
LDGWLKDSRAEKSKVIPDSKQVYIDGWLKDTRAEKEIANPNSNQVYLDGWLKDTRAEKVNANPK
SNQVYLDGWLKDTRANPDSKQVYLDGWLKDTRDEKAKSNSDSNQVYLDGWLKDTQTEKEKSNSD
SNIVYLDGWLKDTRGEKAKTNPDSTQVYLDGWLKDTRSDQVKVNPNSNQVYLDGWLKDSRAKKA
NPDTKQGYLDGWLKDTRAKKENSIPDSNQVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDTRD
EKVKSTHESNQVYLDGWLKDILSEKEKSTPDSNQVYLDGWLKDTRVEKAKANSESNQVYLDGWL
KDTR
AKKVESNFDSNQVYLDGWLKDMRAEKAKVNPNSNQVYLDGWLKDTRDLKEKSTLDSKQVY
LDRWLKDTLAVKENSSPNSN
Repeat found in LOC131660368
Repeat occurs 22 times in a sequence of 742 amino acids
Location between 13454674 and 13457362
Coverage of 29.65 %
Instances:
YLDGWLKNTP | YLDGWLKNTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDNR
YLDGWLKDTQ | YLDGWLKDTR | YLDGWLKDTQ | YLDGWLKDIR | YLDGWLKDSH
YLDGWLKDSH | YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[IDN][ISFTN][QPKRH]
MAHKVVMSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKNQISSTDSNQVYLDGWL
KNTR
DEKAKVNPDSNQVYLDGWLKDTRDEKVKVNPDSNQVYLDGWLKDTRDEKAKANPNLEQVY
LDGWLKDTR
AEKAQANPDSNQVYLDGWLKDTRAKKEKVNPDSHQVYLDGWLKDTRADKTKVNPN
SNQVYLDGWLKDTRAENAKSNSDSNQVYLDGWLKDTRVENVKSNPDSNQVYLDGWLKDTRAEKE
KVNPDSNQVYLDGWLKDTRDENAKSNPDSNQVYLDGWLKDTRAENANSNQVYLDGWLKDTRAEK
VKANLDSNQVYLDGWLKDTRAEKEKFNPDSNQVYLDGWLKDNRDENAKSNPDSNQVYLDGWLKD
TQ
AKNADSNQVYLDGWLKDTRAENAKSNLDSNQVYLDGWLKDTQALKEKLTLDSNQLYLDGWLK
DIR
VGKEMSSPNSNQVYLDGWLKDSHAENSKQAYLDGWLKDSHAKNHIKIGQDLAESNRKLPSK
VDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYAKFLPKKVADDIPVSKSQIPSLLDLFSLTKDS
PQGEDMIGIINQCEYEPQKGETKACPTSLESMVEFVHSVIGADTKYDIHSTSYPTTSAVPLQNY
TVLDISEDIYAPKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKGQYGDTMDALGICHLDTSAM
NPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131654938
Repeat occurs 18 times in a sequence of 435 amino acids
Location between 13654539 and 13656416
Coverage of 41.38 %
Instances:
YLDGWLKNTQ | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLRDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDTR | YLDGWVKDMR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR |
pattern: YLDGW[LV][KR][DN][IMT][QR]
MTHRVVLSLLPFLMLSLINDHGSFARDLNQADQLYLDGWLKNTQLKNQQHSPGSNQVYLDGWL
KDTR
VEKVKSNPDFNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRVEKAKVNHDSKQVY
LDGWLKDTR
SEKAKVNPNSNQVYLDGWLKDTRAEKPKFNSDSNQVYLDGWLKDTRAEKEKSNSD
SNQVYLDGWLRDTRGEKEKSNPNSNQVYLDGWLKDTRAKKANIDSKQVYLDGWLKDTRVEKAES
NSDSSQVYLDGWLKDTRAEKVKSVPESNQVYLDGWLKDIRDEKEKSTPNSNQVYLDGWLKDTRA
EKVKANSELNQVYLDGWLKDTRAEKEKANSNSNHVYLDGWVKDMRAEKEKVNPNSNQVYLDGWL
KDTR
AGNTKSNTGSNQVYLDGWLKDIRAEKEKSAFDSKQVYLDGWLKDTRAE
Repeat found in LOC131654944
Repeat occurs 6 times in a sequence of 210 amino acids
Location between 13829284 and 13830780
Coverage of 20.0 %
Instances:
EPRPNIS | EPRPNVS | EPRPNIS | EPRPNIS | EPRPNIS
EPRPSLT |
pattern: EPRP[SN][ILV][ST]
The following known motifs were found:
FEPR was found 6 times in this sequence
MRPALALLPLFLLLIVAVVESRKDVGEYWKLVMKEQNMPEEIQGLLDANNKNNLKTLKQSFDD
KEKKNTVKDFEPRPNISAYEDEDVDVEEKKIGVEDFEPRPNVSAYGDKSIDAEEKKKGEEDFEP
R
PNISAYGDNGIEDSKKNNVDQDFEPRPNISSYGNNDIGNKEKEKAVEDFEPRPNISAYGNNDI
HAEFKEDFEPRPSLTKYDA

Similar gene clusters

NC_081182 - Cluster 10 - Fatty_acid

Gene cluster description

NC_081182 - Gene Cluster 10. Type = fatty_acid. Location: 48740375 - 49028434 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081182 - Cluster 11 - Fatty_acid-saccharide

Gene cluster description

NC_081182 - Gene Cluster 11. Type = fatty_acid-saccharide. Location: 65072893 - 65598999 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081182 - Cluster 12 - Saccharide

Gene cluster description

NC_081182 - Gene Cluster 12. Type = saccharide. Location: 211940764 - 212187324 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081183 - Cluster 13 - Saccharide

Gene cluster description

NC_081183 - Gene Cluster 13. Type = saccharide. Location: 18236925 - 18580598 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081183 - Cluster 14 - Cyclopeptide

Gene cluster description

NC_081183 - Gene Cluster 14. Type = cyclopeptide. Location: 120211341 - 121459437 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC131599572
Repeat occurs 3 times in a sequence of 304 amino acids
Location between 120834520 and 120836257
Coverage of 5.92 %
Instances:
QPFGFW | QPFGFW | QPFVAL |
pattern: QPF[GV][AF][LW]
MEFKNLSVLALFFLAFVGINGSKSGEEYWKSVWPNTTIPKALSDMLQSDIETSMPIKSHEENQ
YWTIFFEHDLYPGKTMNLGVQKHYESSEYDKKETKKENQPFGFWAWARKETEKENQPFGFWAWS
KKETRKEIQPFVALTSDENEAHILNNYCRLPAATGEDKHCALSLESMIDFVISKLGKNIKVMSS
SFVQNQDKYVVEEVNKIGDKAVMCHRLNLKKVVLYCHTVNATTTYIVPLVASDGTKSKALTSCH
HDTRGMDPSMLKEVLNVEPGTVPVCHFIDNKAIAWVPDMSESGGHSCVI

Similar gene clusters

NC_081183 - Cluster 15 - Fatty_acid-terpene-alkaloid

Gene cluster description

NC_081183 - Gene Cluster 15. Type = fatty_acid-terpene-alkaloid. Location: 140501508 - 140942755 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081183 - Cluster 16 - Cyclopeptide

Gene cluster description

NC_081183 - Gene Cluster 16. Type = cyclopeptide. Location: 178275720 - 179082296 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC131596177
Repeat occurs 10 times in a sequence of 589 amino acids
Location between 178941060 and 178943303
Coverage of 10.19 %
Instances:
ASEPDV | ASEPDV | ASEPDV | ASEHDV | ASESDV
ASEPEV | ASEPGL | ASEPNV | ASEFNV | ASEKPK

pattern: ASE[PSFKH][PEDGN][KLV]
MEFHFFRIITFLMLLLVATNAARLPPQHYWKSVIPNSPMPKAITNLLLPSETDVSSKELSNWF
RYDHTGVDVNSKKLPNWFRYDRAASGSDVNSEELPNWFRYDHAVSGADVSSEELPNWFRYDHAV
SGADVSSEELPNWFRYNHAASEPDVNSEELSNVFKYDHAASEPDVSFKKFPNVFKYDHAARKPD
VSFKKLPNVFKYDHAASEPDVSFRKLPNVFKYDHVASEHDVSSRYFPNVFKYDHAASESDVSSK
YFSNVFKYDHAASEPEVNFRESPNGFSYHHAASEPGLNSRESLNGFFYRNAASEPNVNFRKSPN
GFSYHQAASVPDVNSGETSILSYNPPASEFNVNSGKLSNAFSYDHSASEKPKSTIFFFEKDLHH
GTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARIVNNTITDCEVE
SIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTEMYKESDFQQYKVITQGVKRLGEKNKAVVCH
KQNYPYAVFFCHTTDTTKVYSAPLKGVDGSIVKAIVVCHTDTSQWNPKHFAFLELKVKVGSGPI
CHLLPQDHVAWISK
Repeat found in LOC131596171
Repeat occurs 11 times in a sequence of 383 amino acids
Location between 178795056 and 178796447
Coverage of 25.85 %
Instances:
NVFQYPHAA | NVFQYPNEA | NVFQYPHAA | NVFQYPHAA | NVFRYNHAA
NVFQYPHAA | NVFRYDHAA | NVFQYPHAT | NVFRYDHAA | NVFQYPHAT
NVFRYDHAA |
pattern: NVF[QR]Y[PDN][HN][AE][AT]
MEFHFFHFITFLMLVTVATNAARLPPQLYWKSVLPNTPMPKAISNLLPPSIDVGVKKSYSHGG
KANENSEGLANVFQYPHAASEVEVNSGGLPNVFQYPNEASEVDVKSGELPNVFQYPHAASEIDV
KSGEFPNVFQYPHAASEADVSSKELPNVFRYNHAANVPGINFGELPNGFGYHHSARETAVNSGE
LSNGFGYHHDASEPSVNSREFSNVFQYPHAANKADVSSKKLPNVFRYDHAANEPGINSGELSNG
FGYHHDASEPGVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINSGELSNGFGYH
HDASEPSVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINFGELSNGFGYHHAAK
Repeat found in LOC131596180
Repeat occurs 21 times in a sequence of 728 amino acids
Location between 179037704 and 179040213
Coverage of 23.08 %
Instances:
NGFGYHDA | NGFGYHHA | NGFGYHHA | NGFGYHLA | NGFGYHHA
NGFGYHHA | NGFRYHLA | NGFVYHHA | NGFGYHHA | NGFGYHHV
NGFGYHHG | NGFGYHHA | NGFGYNHA | NGFGYHHG | NGFGYHHA
NGFGYHHA | NGFGYNHA | NGFRYHHA | NGFRYHHA | NGFRYHHA
NGFRYHHI |
pattern: NGF[RGV]Y[HN][HDL][AIGV]
MKFHFFHIITFLMLLLVATNAETLPPQLYWKSILPNSPMPKVITNLLLSDSPMAKTITNLLLP
GLDDGVKKSYSDGGEVEGKTDVIPKKLSNGFGYHDAVSEPNVNFKELSNGFGYHHAASEADGNS
EELPNGFGYHHAASEADVNSKKLPNGFGYHLAASEPNVNSKELSNGFGYHHAASEADGNSRELP
NGFGYHHA
ASDADVNSKILPNGFRYHLAASEADENSKELSNGFVYHHAASEADANSRELPNGFG
YHHA
GSESDINSREISNGFGYHHVASETDVNSRELPNGFGYHHGASEAYVNSRKLSNGFGYHHA
ASESGVTSGELPNGFGYNHAISGVDGNSKELPNGFGYHHGASEPDVNFRELPNGFGYHHATSEA
GVKSEELPNGFGYHHAASEPDVNSGELPNGFGYNHAISGVDGNSKELPNGFRYHHAASKPDVNS
GDLPNGFRYHHAASKSDVNSGDLSNGFRYHHAASKPNVNSGDSHNGFRYHHIASEAQLHDKPKA
TIFFFEKELHHGTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARI
VNNTITDCEVESIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTETYKESDFQQYKVITQGVKR
LGEKNKAVVCHKQNYPYAVFFCHTTDTTKVYSVPLEAVDGSRVKAIVVCHTDTSQWNPKHFAFL
ELKVKVGTAPICHLLPQDHVAWISK
Repeat found in LOC131596166
Repeat occurs 33 times in a sequence of 776 amino acids
Location between 178698592 and 178701415
Coverage of 29.77 %
Instances:
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKR | NAFYYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFFCGH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFYYKH | NAFYYKH
NAFSYKH | NAFSYKH | NAFSYKH |
pattern: NAF[YSF][YC][KG][HR]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGVNSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
VNSGELPNAFSYKHVAGNSDVNSKELPNAFSYKHAAGESYVNFGELSNAFSYKHAAQEPGVNYE
ELPNAFSYKHAASKPDVNFGESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPN
AFSYKH
AASEHEVTSGELPNAFSYKRAASERDVNSGELPNAFYYKHDASHLDLNSGELPNAFSY
KH
GASEPNVNSAELPNAFSYKHGASVPAVNSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAA
GESYVNFRELSNAFSYKHAAGEPSVNYEELPNAFFCGHVAGEPGVNYEELPNAFSYKHAASKPD
VNFGELPNAFSYKHAASESDVNFRESPNAFSYKHAKSEPGVNFGDSPNAFSYKHAASVPDVNSE
ELPNVFSYKYAASEREVNSGELPNAFSYKHAASERDVNSGELPNAFYYKHGASQLDVNSGELPN
AFYYKH
GASESDVNYGELPNAFSYKHATRATDVNHGELPNAFSYKHATRAADVNYGELPNAFSY
KH
AASEPDV
Repeat found in LOC131596171
Repeat occurs 11 times in a sequence of 383 amino acids
Location between 178795056 and 178796447
Coverage of 25.85 %
Instances:
NVFQYPHAA | NVFQYPNEA | NVFQYPHAA | NVFQYPHAA | NVFRYNHAA
NVFQYPHAA | NVFRYDHAA | NVFQYPHAT | NVFRYDHAA | NVFQYPHAT
NVFRYDHAA |
pattern: NVF[QR]Y[PDN][HN][AE][AT]
MEFHFFHFITFLMLVTVATNAARLPPQLYWKSVLPNTPMPKAISNLLPPSIDVGVKKSYSHGG
KANENSEGLANVFQYPHAASEVEVNSGGLPNVFQYPNEASEVDVKSGELPNVFQYPHAASEIDV
KSGEFPNVFQYPHAASEADVSSKELPNVFRYNHAANVPGINFGELPNGFGYHHSARETAVNSGE
LSNGFGYHHDASEPSVNSREFSNVFQYPHAANKADVSSKKLPNVFRYDHAANEPGINSGELSNG
FGYHHDASEPGVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINSGELSNGFGYH
HDASEPSVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINFGELSNGFGYHHAAK
Repeat found in LOC131596166
Repeat occurs 32 times in a sequence of 756 amino acids
Location between 178698592 and 178701415
Coverage of 29.63 %
Instances:
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKR
NAFYYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFFCGH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFYYKH | NAFYYKH | NAFSYKH
NAFSYKH | NAFSYKH |
pattern: NAF[YSF][YC][KG][HR]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGVNSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
VNSGELPNAFSYKHVAGESYVNFGELSNAFSYKHAAQEPGVNYEELPNAFSYKHAASKPDVNFG
ESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPNAFSYKHAASEHEVTSGELPN
AFSYKR
AASERDVNSGELPNAFYYKHDASHLDLNSGELPNAFSYKHGASEPNVNSAELPNAFSY
KH
GASVPAVNSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAAGESYVNFRELSNAFSYKHAA
GEPSVNYEELPNAFFCGHVAGEPGVNYEELPNAFSYKHAASKPDVNFGELPNAFSYKHAASESD
VNFRESPNAFSYKHAKSEPGVNFGDSPNAFSYKHAASVPDVNSEELPNVFSYKYAASEREVNSG
ELPNAFSYKHAASERDVNSGELPNAFYYKHGASQLDVNSGELPNAFYYKHGASESDVNYGELPN
AFSYKH
ATRATDVNHGELPNAFSYKHATRAADVNYGELPNAFSYKHAASEPDV
Repeat found in LOC131596166
Repeat occurs 31 times in a sequence of 736 amino acids
Location between 178698592 and 178701415
Coverage of 29.48 %
Instances:
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKR | NAFYYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFYYKH | NAFYYKH | NAFSYKH | NAFSYKH
NAFSYKH |
pattern: NAF[YS]YK[HR]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGVNSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
VNSGELPNAFSYKHVAGNSDVNSKELPNAFSYKHAAGESYVNFGELSNAFSYKHAAQEPGVNYE
ELPNAFSYKHAASKPDVNFGESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPN
AFSYKH
AASEHEVTSGELPNAFSYKRAASERDVNSGELPNAFYYKHDASHLDLNSGELPNAFSY
KH
GASEPNVNSAELPNAFSYKHGASVPAVNSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAA
GEPGVNYEELPNAFSYKHAASKPDVNFGELPNAFSYKHAASESDVNFRESPNAFSYKHAKSEPG
VNFGDSPNAFSYKHAASVPDVNSEELPNVFSYKYAASEREVNSGELPNAFSYKHAASERDVNSG
ELPNAFYYKHGASQLDVNSGELPNAFYYKHGASESDVNYGELPNAFSYKHATRATDVNHGELPN
AFSYKH
ATRAADVNYGELPNAFSYKHAASEPDV
Repeat found in LOC131596178
Repeat occurs 10 times in a sequence of 589 amino acids
Location between 178972989 and 178975235
Coverage of 10.19 %
Instances:
ASEPDV | ASEPDV | ASEPDV | ASEHDV | ASESDV
ASEPEV | ASEPGL | ASEPNV | ASEFNV | ASEKPK

pattern: ASE[PSFKH][PEDGN][KLV]
MEFHFFRIITFLMLLLVATNAARLPPQRYWKSVIPNSPMPKAITNLLLPSETDVSSKELSNWF
RYDHTGVDVNSKKLPNWFRYDRAASGSDVNSEELPNWFRYDHAVSGADVSSEELPNWFRYDHAV
SGADVSSEELPNWFRYNHAASEPDVNSEELSNVFKYDHAASEPDVSFKKFPNVFKYDHAARKPD
VSFKKLPNVFKYDHAASEPDVSFRKLPNVFKYDHVASEHDVSSRYFPNVFKYDHAASESDVSSK
YFSNVFKYDHAASEPEVNFRESPNGFSYHHAASEPGLNSRESLNGFFYRNAASEPNVNFRKSPN
GFSYHQAASVPDVNSGETSILSYNPPASEFNVNSGKLSNAFSYDHSASEKPKSTIFFFEKDLHH
GTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARIVNNTITDCEVE
SIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTEMYKESDFQQYKVITQGVKRLGEKNKAVVCH
KQNYPYAVFFCHTTDTTKVYSAPLKGVDGSIVKAIVVCHTDTSQWNPKHFAFLELKVKVGSGPI
CHLLPQDHVAWISK

Similar gene clusters

NC_081183 - Cluster 17 - Cyclopeptide

Gene cluster description

NC_081183 - Gene Cluster 17. Type = cyclopeptide. Location: 178324011 - 179778710 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC131596171
Repeat occurs 11 times in a sequence of 383 amino acids
Location between 178795056 and 178796447
Coverage of 25.85 %
Instances:
NVFQYPHAA | NVFQYPNEA | NVFQYPHAA | NVFQYPHAA | NVFRYNHAA
NVFQYPHAA | NVFRYDHAA | NVFQYPHAT | NVFRYDHAA | NVFQYPHAT
NVFRYDHAA |
pattern: NVF[QR]Y[PDN][HN][AE][AT]
MEFHFFHFITFLMLVTVATNAARLPPQLYWKSVLPNTPMPKAISNLLPPSIDVGVKKSYSHGG
KANENSEGLANVFQYPHAASEVEVNSGGLPNVFQYPNEASEVDVKSGELPNVFQYPHAASEIDV
KSGEFPNVFQYPHAASEADVSSKELPNVFRYNHAANVPGINFGELPNGFGYHHSARETAVNSGE
LSNGFGYHHDASEPSVNSREFSNVFQYPHAANKADVSSKKLPNVFRYDHAANEPGINSGELSNG
FGYHHDASEPGVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINSGELSNGFGYH
HDASEPSVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINFGELSNGFGYHHAAK
Repeat found in LOC131596166
Repeat occurs 31 times in a sequence of 736 amino acids
Location between 178698592 and 178701415
Coverage of 29.48 %
Instances:
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKR | NAFYYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFYYKH | NAFYYKH | NAFSYKH | NAFSYKH
NAFSYKH |
pattern: NAF[YS]YK[HR]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGVNSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
VNSGELPNAFSYKHVAGNSDVNSKELPNAFSYKHAAGESYVNFGELSNAFSYKHAAQEPGVNYE
ELPNAFSYKHAASKPDVNFGESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPN
AFSYKH
AASEHEVTSGELPNAFSYKRAASERDVNSGELPNAFYYKHDASHLDLNSGELPNAFSY
KH
GASEPNVNSAELPNAFSYKHGASVPAVNSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAA
GEPGVNYEELPNAFSYKHAASKPDVNFGELPNAFSYKHAASESDVNFRESPNAFSYKHAKSEPG
VNFGDSPNAFSYKHAASVPDVNSEELPNVFSYKYAASEREVNSGELPNAFSYKHAASERDVNSG
ELPNAFYYKHGASQLDVNSGELPNAFYYKHGASESDVNYGELPNAFSYKHATRATDVNHGELPN
AFSYKH
ATRAADVNYGELPNAFSYKHAASEPDV
Repeat found in LOC131596181
Repeat occurs 10 times in a sequence of 495 amino acids
Location between 179063717 and 179106096
Coverage of 24.24 %
Instances:
YHHAASELDVNS | YHHAASKPDVNS | YHHTASESGVNS | YHHAASQPDVNS | YHHAASEPDVNF
YHHAGSKPDVNS | YHHAASKPDVNS | YHHAASKPDINS | YHHAASKPNVNS | YHHASSKPNVNS

pattern: YHH[AT][ASG]S[QKE][PSL][DGN][IV]N[SF]
MTFRFFHIITFLMLLLVATNAETLPMTKVITNLLLPSSDGGVKKDYYDGGEARDKTDEISKKL
PNGFRYHHAASELDVNSRDLLNGFRYHHAASKPDVNSRDLPNGFRYHHTASESGVNSGDLLNGF
RYHHAASQPDVNSGDLPNGFRYHHAASEPDVNFGDLSNGFRYHHAGSKPDVNSGDLSNGFPYHH
AASKPDVNS
GDLSNGFPYHHAASKPDINSGDLPNGFRYHHAASKPNVNSRDLPNEFRYHHASSK
PNVNS
RDLHNTASETQLHDKPKATIFFFEKDLHHGTKSYFRFAKTNPNNEAKFLPREFANSIPF
SSKKLEYILNKLNIRKGSKAARIVNNTITDCEVESIKGEEKLCVTSLESMIDFSISKIGKNLEA
VSTEMYKESDFQQYTVITQGVKRLGENNKAVVCHKQNYPYAVFFCHTTDTTKIYSVPLEGVDGS
RVKAIAVCHGDTSQWNPKHFAFLELKVKVGTAPICHLLPQDHVAWIAK
Repeat found in LOC131598408
Repeat occurs 12 times in a sequence of 597 amino acids
Location between 179158108 and 179161660
Coverage of 12.06 %
Instances:
AFSYHH | AFSYYH | AFSYYA | AFSYHH | AFSYYA
AFSYHH | AFSYHH | AFSYHH | AFSYNH | AFSYHH
AFSFHH | AFSYHH |
pattern: AFS[YF][YHN][AH]
MDWQATNAETLPPQFYWKSVLPNSPIPKAITNLLPNSPMSKAITNILLPGVDVMSVELPNAFS
YHH
ATNEVAVKSAESSNAFSYYHAASKTNLNSRETPNAFSYYAASEPDVNSGELSNAFSYHHAA
SKTNLNSREPPNAFSYYAASEPDVNSGELSNAFSYHHDASKPDVNSEELSNAFSYHHAASEPDV
NSGELSNAFSYHHDASKPDVNSEELSNAFSYNHAASESDVNSRELSNAFSYHHAANEAGVKSAE
LPNAFSFHHTASKPDVSSEQLPNAFSYHHAASKPDASSGELSNNYQMHSYHHAASKPDVNSKEL
PNGFGYHHAASETGVKSEESPNGFGYHHSASEANLNSRKLPNGFGYHHVAGETQLHHKPHATIF
FFEKELHHGTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGTKGAHIVNN
TITDCEVDSIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTEMYKESDFQQYKIITQGVKRLGE
KNKVVVCHKQNYPYAVFFCHTTDTTKVYSVPLEGVDGSRVKAIVVCHTDTSQWNPKHFAFLELK
VKVGTAPICHLLPQDHVAWIAK
Repeat found in LOC131596171
Repeat occurs 11 times in a sequence of 383 amino acids
Location between 178795056 and 178796447
Coverage of 25.85 %
Instances:
NVFQYPHAA | NVFQYPNEA | NVFQYPHAA | NVFQYPHAA | NVFRYNHAA
NVFQYPHAA | NVFRYDHAA | NVFQYPHAT | NVFRYDHAA | NVFQYPHAT
NVFRYDHAA |
pattern: NVF[QR]Y[PDN][HN][AE][AT]
MEFHFFHFITFLMLVTVATNAARLPPQLYWKSVLPNTPMPKAISNLLPPSIDVGVKKSYSHGG
KANENSEGLANVFQYPHAASEVEVNSGGLPNVFQYPNEASEVDVKSGELPNVFQYPHAASEIDV
KSGEFPNVFQYPHAASEADVSSKELPNVFRYNHAANVPGINFGELPNGFGYHHSARETAVNSGE
LSNGFGYHHDASEPSVNSREFSNVFQYPHAANKADVSSKKLPNVFRYDHAANEPGINSGELSNG
FGYHHDASEPGVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINSGELSNGFGYH
HDASEPSVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINFGELSNGFGYHHAAK
Repeat found in LOC131596166
Repeat occurs 32 times in a sequence of 756 amino acids
Location between 178698592 and 178701415
Coverage of 29.63 %
Instances:
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKR
NAFYYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFFCGH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFYYKH | NAFYYKH | NAFSYKH
NAFSYKH | NAFSYKH |
pattern: NAF[YSF][YC][KG][HR]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGVNSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
VNSGELPNAFSYKHVAGESYVNFGELSNAFSYKHAAQEPGVNYEELPNAFSYKHAASKPDVNFG
ESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPNAFSYKHAASEHEVTSGELPN
AFSYKR
AASERDVNSGELPNAFYYKHDASHLDLNSGELPNAFSYKHGASEPNVNSAELPNAFSY
KH
GASVPAVNSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAAGESYVNFRELSNAFSYKHAA
GEPSVNYEELPNAFFCGHVAGEPGVNYEELPNAFSYKHAASKPDVNFGELPNAFSYKHAASESD
VNFRESPNAFSYKHAKSEPGVNFGDSPNAFSYKHAASVPDVNSEELPNVFSYKYAASEREVNSG
ELPNAFSYKHAASERDVNSGELPNAFYYKHGASQLDVNSGELPNAFYYKHGASESDVNYGELPN
AFSYKH
ATRATDVNHGELPNAFSYKHATRAADVNYGELPNAFSYKHAASEPDV
Repeat found in LOC131596178
Repeat occurs 10 times in a sequence of 589 amino acids
Location between 178972989 and 178975235
Coverage of 10.19 %
Instances:
ASEPDV | ASEPDV | ASEPDV | ASEHDV | ASESDV
ASEPEV | ASEPGL | ASEPNV | ASEFNV | ASEKPK

pattern: ASE[PSFKH][PEDGN][KLV]
MEFHFFRIITFLMLLLVATNAARLPPQRYWKSVIPNSPMPKAITNLLLPSETDVSSKELSNWF
RYDHTGVDVNSKKLPNWFRYDRAASGSDVNSEELPNWFRYDHAVSGADVSSEELPNWFRYDHAV
SGADVSSEELPNWFRYNHAASEPDVNSEELSNVFKYDHAASEPDVSFKKFPNVFKYDHAARKPD
VSFKKLPNVFKYDHAASEPDVSFRKLPNVFKYDHVASEHDVSSRYFPNVFKYDHAASESDVSSK
YFSNVFKYDHAASEPEVNFRESPNGFSYHHAASEPGLNSRESLNGFFYRNAASEPNVNFRKSPN
GFSYHQAASVPDVNSGETSILSYNPPASEFNVNSGKLSNAFSYDHSASEKPKSTIFFFEKDLHH
GTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARIVNNTITDCEVE
SIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTEMYKESDFQQYKVITQGVKRLGEKNKAVVCH
KQNYPYAVFFCHTTDTTKVYSAPLKGVDGSIVKAIVVCHTDTSQWNPKHFAFLELKVKVGSGPI
CHLLPQDHVAWISK
Repeat found in LOC131596166
Repeat occurs 33 times in a sequence of 776 amino acids
Location between 178698592 and 178701415
Coverage of 29.77 %
Instances:
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKR | NAFYYKH | NAFSYKH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFFCGH | NAFSYKH | NAFSYKH
NAFSYKH | NAFSYKH | NAFSYKH | NAFYYKH | NAFYYKH
NAFSYKH | NAFSYKH | NAFSYKH |
pattern: NAF[YSF][YC][KG][HR]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGVNSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
VNSGELPNAFSYKHVAGNSDVNSKELPNAFSYKHAAGESYVNFGELSNAFSYKHAAQEPGVNYE
ELPNAFSYKHAASKPDVNFGESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPN
AFSYKH
AASEHEVTSGELPNAFSYKRAASERDVNSGELPNAFYYKHDASHLDLNSGELPNAFSY
KH
GASEPNVNSAELPNAFSYKHGASVPAVNSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAA
GESYVNFRELSNAFSYKHAAGEPSVNYEELPNAFFCGHVAGEPGVNYEELPNAFSYKHAASKPD
VNFGELPNAFSYKHAASESDVNFRESPNAFSYKHAKSEPGVNFGDSPNAFSYKHAASVPDVNSE
ELPNVFSYKYAASEREVNSGELPNAFSYKHAASERDVNSGELPNAFYYKHGASQLDVNSGELPN
AFYYKH
GASESDVNYGELPNAFSYKHATRATDVNHGELPNAFSYKHATRAADVNYGELPNAFSY
KH
AASEPDV
Repeat found in LOC131596181
Repeat occurs 10 times in a sequence of 495 amino acids
Location between 179104348 and 179106096
Coverage of 24.24 %
Instances:
YHHAASELDVNS | YHHAASKPDVNS | YHHTASESGVNS | YHHAASQPDVNS | YHHAASEPDVNF
YHHAGSKPDVNS | YHHAASKPDVNS | YHHAASKPDINS | YHHAASKPNVNS | YHHASSKPNVNS

pattern: YHH[AT][ASG]S[QKE][PSL][DGN][IV]N[SF]
MKFHFFHIITFLMLLLVATNAETLPMTKVITNLLLPSSDGGVKKDYYDGGEARDKTDEISKKL
PNGFRYHHAASELDVNSRDLLNGFRYHHAASKPDVNSRDLPNGFRYHHTASESGVNSGDLLNGF
RYHHAASQPDVNSGDLPNGFRYHHAASEPDVNFGDLSNGFRYHHAGSKPDVNSGDLSNGFPYHH
AASKPDVNS
GDLSNGFPYHHAASKPDINSGDLPNGFRYHHAASKPNVNSRDLPNEFRYHHASSK
PNVNS
RDLHNTASETQLHDKPKATIFFFEKDLHHGTKSYFRFAKTNPNNEAKFLPREFANSIPF
SSKKLEYILNKLNIRKGSKAARIVNNTITDCEVESIKGEEKLCVTSLESMIDFSISKIGKNLEA
VSTEMYKESDFQQYTVITQGVKRLGENNKAVVCHKQNYPYAVFFCHTTDTTKIYSVPLEGVDGS
RVKAIAVCHGDTSQWNPKHFAFLELKVKVGTAPICHLLPQDHVAWIAK
Repeat found in LOC131596177
Repeat occurs 10 times in a sequence of 589 amino acids
Location between 178941060 and 178943303
Coverage of 10.19 %
Instances:
ASEPDV | ASEPDV | ASEPDV | ASEHDV | ASESDV
ASEPEV | ASEPGL | ASEPNV | ASEFNV | ASEKPK

pattern: ASE[PSFKH][PEDGN][KLV]
MEFHFFRIITFLMLLLVATNAARLPPQHYWKSVIPNSPMPKAITNLLLPSETDVSSKELSNWF
RYDHTGVDVNSKKLPNWFRYDRAASGSDVNSEELPNWFRYDHAVSGADVSSEELPNWFRYDHAV
SGADVSSEELPNWFRYNHAASEPDVNSEELSNVFKYDHAASEPDVSFKKFPNVFKYDHAARKPD
VSFKKLPNVFKYDHAASEPDVSFRKLPNVFKYDHVASEHDVSSRYFPNVFKYDHAASESDVSSK
YFSNVFKYDHAASEPEVNFRESPNGFSYHHAASEPGLNSRESLNGFFYRNAASEPNVNFRKSPN
GFSYHQAASVPDVNSGETSILSYNPPASEFNVNSGKLSNAFSYDHSASEKPKSTIFFFEKDLHH
GTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARIVNNTITDCEVE
SIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTEMYKESDFQQYKVITQGVKRLGEKNKAVVCH
KQNYPYAVFFCHTTDTTKVYSAPLKGVDGSIVKAIVVCHTDTSQWNPKHFAFLELKVKVGSGPI
CHLLPQDHVAWISK
Repeat found in LOC131596180
Repeat occurs 21 times in a sequence of 728 amino acids
Location between 179037704 and 179040213
Coverage of 23.08 %
Instances:
NGFGYHDA | NGFGYHHA | NGFGYHHA | NGFGYHLA | NGFGYHHA
NGFGYHHA | NGFRYHLA | NGFVYHHA | NGFGYHHA | NGFGYHHV
NGFGYHHG | NGFGYHHA | NGFGYNHA | NGFGYHHG | NGFGYHHA
NGFGYHHA | NGFGYNHA | NGFRYHHA | NGFRYHHA | NGFRYHHA
NGFRYHHI |
pattern: NGF[RGV]Y[HN][HDL][AIGV]
MKFHFFHIITFLMLLLVATNAETLPPQLYWKSILPNSPMPKVITNLLLSDSPMAKTITNLLLP
GLDDGVKKSYSDGGEVEGKTDVIPKKLSNGFGYHDAVSEPNVNFKELSNGFGYHHAASEADGNS
EELPNGFGYHHAASEADVNSKKLPNGFGYHLAASEPNVNSKELSNGFGYHHAASEADGNSRELP
NGFGYHHA
ASDADVNSKILPNGFRYHLAASEADENSKELSNGFVYHHAASEADANSRELPNGFG
YHHA
GSESDINSREISNGFGYHHVASETDVNSRELPNGFGYHHGASEAYVNSRKLSNGFGYHHA
ASESGVTSGELPNGFGYNHAISGVDGNSKELPNGFGYHHGASEPDVNFRELPNGFGYHHATSEA
GVKSEELPNGFGYHHAASEPDVNSGELPNGFGYNHAISGVDGNSKELPNGFRYHHAASKPDVNS
GDLPNGFRYHHAASKSDVNSGDLSNGFRYHHAASKPNVNSGDSHNGFRYHHIASEAQLHDKPKA
TIFFFEKELHHGTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARI
VNNTITDCEVESIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTETYKESDFQQYKVITQGVKR
LGEKNKAVVCHKQNYPYAVFFCHTTDTTKVYSVPLEAVDGSRVKAIVVCHTDTSQWNPKHFAFL
ELKVKVGTAPICHLLPQDHVAWISK

Similar gene clusters

NC_081183 - Cluster 18 - Polyketide

Gene cluster description

NC_081183 - Gene Cluster 18. Type = polyketide. Location: 197100185 - 197239334 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 19 - Cyclopeptide

Gene cluster description

NC_081184 - Gene Cluster 19. Type = cyclopeptide. Location: 75899235 - 79233249 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 20 - Lignan-saccharide

Gene cluster description

NC_081184 - Gene Cluster 20. Type = lignan-saccharide. Location: 107003866 - 107293244 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 21 - Saccharide

Gene cluster description

NC_081184 - Gene Cluster 21. Type = saccharide. Location: 112839245 - 113146412 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 22 - Terpene

Gene cluster description

NC_081184 - Gene Cluster 22. Type = terpene. Location: 139867633 - 140142542 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 23 - Fatty_acid-alkaloid

Gene cluster description

NC_081184 - Gene Cluster 23. Type = fatty_acid-alkaloid. Location: 147524201 - 147615497 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 24 - Alkaloid

Gene cluster description

NC_081184 - Gene Cluster 24. Type = alkaloid. Location: 147796592 - 148042492 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 25 - Terpene-polyketide

Gene cluster description

NC_081184 - Gene Cluster 25. Type = terpene-polyketide. Location: 157593453 - 157715705 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 26 - Cyclopeptide

Gene cluster description

NC_081184 - Gene Cluster 26. Type = cyclopeptide. Location: 159082940 - 159926238 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC131607969
Repeat occurs 7 times in a sequence of 318 amino acids
Location between 159504110 and 159505067
Coverage of 33.02 %
Instances:
FWAWSRKEIEKENQP | FWAWSRKETEKENQP | FWAWSRKETEKENQP | FWAWSRKETEKENQP | FWAWSRKETEKENQP
FWAWSRKETEKENQP | FWAWSRKANRPFLRD |
pattern: FWAWSRK[AE][ITN][RE][PK][EF][LN][QR][PD]
MSLGIQKHSESSESDKKETTKENHPFGFWAWSRKEIEKENQPFGFWAWSRKETEKENQPFGFW
AWSRKETEKENQP
FGFWAWSRKETEKENQPFGFWAWSRKETEKENQPFGFWAWSRKETEKENQP
FGFWAWSRKANRPFLRDTKKETQNFLALTSDEKEAHIINNYCRTPSAIGEDKHCALSLESMMDF
AISKLGKNIKVMSSSLVQNQDKYVVEEVNKIGDKVVMCHRLNFKKVVFYCHAVNATTTYMVPFV
ASDGTKSKALTICHHDTRGMNPNVLSEVLNVKPGTVPVCHFIGNKAIAWVPDMSESGGHACVI

Similar gene clusters

NC_081185 - Cluster 27 - Phenolamide-saccharide-alkaloid

Gene cluster description

NC_081185 - Gene Cluster 27. Type = phenolamide-saccharide-alkaloid. Location: 23028618 - 23315537 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/phenolamide: (minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081185 - Cluster 28 - Fatty_acid

Gene cluster description

NC_081185 - Gene Cluster 28. Type = fatty_acid. Location: 74854008 - 74968967 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081185 - Cluster 29 - Lignan-saccharide

Gene cluster description

NC_081185 - Gene Cluster 29. Type = lignan-saccharide. Location: 101845139 - 102109089 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081185 - Cluster 30 - Saccharide

Gene cluster description

NC_081185 - Gene Cluster 30. Type = saccharide. Location: 102177382 - 102593696 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081185 - Cluster 31 - Cyclopeptide

Gene cluster description

NC_081185 - Gene Cluster 31. Type = cyclopeptide. Location: 115279804 - 116474429 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC131612148
Repeat occurs 8 times in a sequence of 413 amino acids
Location between 115876046 and 115878186
Coverage of 13.56 %
Instances:
NQPFGFW | NQPFGFW | NQPFGFW | NQPFGFW | NQPFGFW
NQPFGFW | NQPFGFW | NQPFVRD |
pattern: NQPF[GV][RF][DW]
MEFQNLSVLALFFLALVGINGSKSGEEYWKSVWPNTTMPKALSDLLPSDIETSMPIKSHEENQ
YWTIFFEHDLYPGKKMNLGIQKPSESSEYDKKETSKASHPFGFWAWSRKETEKENQPFGFWAWS
RKETEKENQPFGFWIWQRKESEKENQPFGFWIWQRKETEKENQPFGFWIWQRKETEKKNQPFGF
W
AWSKKESEEQNQPFGFWAWSRKESEKENQPFGFWAWSRKANQPFVRDTQKESQHSVALTSDEK
EAHMINEYCRNPSAIGEDKYCALSLESMVDFVISKLGKNIKVMSSSLGQNQENYVVEEVNKIGD
KAVMCHRLNFKKVVFYCHEVNATTTYMVPLVAPDGTKSKALTICHHDTRGMNANMLKEVLNVKP
GTVPVCHFIGNKAIAWVPDMSEPSDHPCAI

Similar gene clusters

NC_081185 - Cluster 32 - Fatty_acid

Gene cluster description

NC_081185 - Gene Cluster 32. Type = fatty_acid. Location: 130728644 - 130841739 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081185 - Cluster 33 - Saccharide

Gene cluster description

NC_081185 - Gene Cluster 33. Type = saccharide. Location: 157082844 - 157164512 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 34 - Fatty_acid

Gene cluster description

NC_081186 - Gene Cluster 34. Type = fatty_acid. Location: 37833592 - 38035634 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 35 - Cyclopeptide

Gene cluster description

NC_081186 - Gene Cluster 35. Type = cyclopeptide. Location: 50474192 - 52064976 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 36 - Cyclopeptide

Gene cluster description

NC_081186 - Gene Cluster 36. Type = cyclopeptide. Location: 51269643 - 51945586 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 37 - Cyclopeptide

Gene cluster description

NC_081186 - Gene Cluster 37. Type = cyclopeptide. Location: 74989548 - 75629424 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 38 - Saccharide

Gene cluster description

NC_081186 - Gene Cluster 38. Type = saccharide. Location: 88078290 - 88125441 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 39 - Terpene

Gene cluster description

NC_081186 - Gene Cluster 39. Type = terpene. Location: 89975369 - 90375791 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 40 - Saccharide

Gene cluster description

NC_081186 - Gene Cluster 40. Type = saccharide. Location: 98409334 - 98465002 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 41 - Fatty_acid-saccharide

Gene cluster description

NC_081186 - Gene Cluster 41. Type = fatty_acid-saccharide. Location: 100070768 - 100246005 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 42 - Cyclopeptide

Gene cluster description

NC_081186 - Gene Cluster 42. Type = cyclopeptide. Location: 103769367 - 104272802 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 43 - Putative

Gene cluster description

NC_081186 - Gene Cluster 43. Type = putative. Location: 117897938 - 118218863 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 44 - Alkaloid

Gene cluster description

NC_081186 - Gene Cluster 44. Type = alkaloid. Location: 127926819 - 128392737 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026704941 - Cluster 45 - Saccharide

Gene cluster description

NW_026704941 - Gene Cluster 45. Type = saccharide. Location: 1060132 - 1270765 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026704977 - Cluster 46 - Fatty_acid

Gene cluster description

NW_026704977 - Gene Cluster 46. Type = fatty_acid. Location: 1214347 - 1345981 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026704997 - Cluster 47 - Cyclopeptide

Gene cluster description

NW_026704997 - Gene Cluster 47. Type = cyclopeptide. Location: 237691 - 1799112 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC131623527
Repeat occurs 3 times in a sequence of 311 amino acids
Location between 764025 and 765666
Coverage of 5.79 %
Instances:
GIGLSV | GIGIGV | GIGVGI |
pattern: GIG[ILV][SG][IV]
MENSYKVNGNGSAENGYSIARHTHSYQPSLKGSLPWLDIRVFYVRVCKCEFDNTTPEVLTLNH
VPLNPDTLLEVNGVRSSIYSDGMSTLLKRDRVDRKSEEVTFVSTDSIRMSGSVKFEVFDKDNLL
LFGALELCNSNGVVRESNSNGQSWSMKCESNIIPGTRFFKEKQLLLPETTLPTVEVYIAGSFSG
TPIILTKTLHLSSQKRHSRKGVLKAIPENDANENGKDPSSALALQGPDYMLDKHEDEDYHSLYT
RTAYADGEDGELSWFNAGVRVGVGIGLSVCLGIGIGVGIMVKTYQGATGQFRRRLL
Repeat found in LOC131623552
Repeat occurs 6 times in a sequence of 169 amino acids
Location between 756731 and 757241
Coverage of 46.15 %
Instances:
NVEDANGEDENVE | NVEEANGEDENVE | NVEEANGKDENVE | NVEEANGEDENVE | NVEEANGEDENVE
NVEGVNSEDSETD |
pattern: NVE[EDG][AV]N[SG][KE]D[SE][EN][TV][ED]
MKDIVIRCLWYWNPKFAFSRGLWPLNDDKDVLRFMEDIRGYKLIDIYVEHKVEEVNIEDVRFN
DATDEEVNVEDANGEDENVEEANGEDENVEEANGKDENVEEANGEDENVEEANGEDENVEGVNS
EDSETDPDYNMSNEEYEGDEDDEDDEGSEDDELNLGHDIGVD
The following known motifs were found in CDS LOC131623490
Location between 1037244 and 1038700
FEPR was found 5 times in this sequence
Sequence:
MKPALALLPFLFLFLFVVTTESRKDLKEYWQTVMKDEEMPEGIKGLLQLKSETEPLKNSKVQE
QLVKGKCDQHLVTNTLDVKEKKVVTGEFEPRPSATKYDGNGGYKNMKLSVNGEFEPIPSVTKYD
GDEGYKNMKPSVNGEFKPIPSVTKYDGDEGYKNIKLPVNGEFEPIPSVTKYDGDESYKNMKLSV
NSEFEPIPSVTKYDGDESYKNMKLPVNGEFEPIPSVTKYDGSYKNTKLPVNGEFEPIPSVTRYD
GNDGYKSMKLPVNDEFEPRPSATKYDGDEGYKTMKVNVNDEFEPRPSATKYDGDEGYKNIKLQV
NGEFEPIPSVTRYDGDDGYKNIKLPMNNEFEPRPSATKYDGDEGYKTMKVNVNDEFEPRPSATK
YND
Repeat found in LOC131623496
Repeat occurs 9 times in a sequence of 591 amino acids
Location between 1300324 and 1302399
Coverage of 22.84 %
Instances:
SYGNHEAEKPDYNTG | SYGNHEAEKPDYNTG | SYGNREAEKPDYNTG | SYGNREAEKPDYNTG | SYGNHEAEKPNYNTG
SYGNHEAEKPDYDTG | SYGNHEAEKPDYDTG | SYGNHEAEKPDYNTG | SYGKHEPNHPYITQY |
pattern: SYG[KN][HR]E[AP][EN][HK]P[YDN][YI][DTN][QT][YG]
MAQTLALQFLTLLLFFFMNGQGITARDLKTELRDVDSNEQPYITQYGHTKPDYAASYNTHDSN
GPYKTSYGNHEAEKPDYNTGYRTHTHHSNELYKTSYGNHEAEKPDYNTGYKARTHDSNEPYKTS
YGNREAEKPDYNTG
YKARTHDSNEPYKTSYGNREAEKPDYNTGYRAHTHDSNEPYRTSYGNHEA
EKPNYNTG
YRAHSHTHDSNKPYKTSYGNHEAEKPDYDTGYRAHTHDSNEPYKTSYGNHEAEKPD
YDTG
YRAHTHDSNEPYKTSYGNHEAEKPDYNTGYRTHTQNSNKPYLTYANHETDESNGPYIASY
GKHEPNHPYITQY
GPTSLDLKDLTSPNSKNLKGSASSNLDRTEAFKTGFFNLDDLYVGHVMTLQ
FPVQEVSHYLPKKVADSIPLSKSQLPSVLQLFSISEDTSQAKSMRGTLEECEGETITGETKVCA
NSLESMLEFVDTIIGSDTKHSILSTSQPSPTAIPLQKYTILEVSHDIHAPKWVACHPLPYPYAI
HYCHYIATGSKVFKVTLVGDENGDKMEALGICHLDTSDWNPDHIVFKQLGIKAGKNTPVCHFFP
VNHLLWVPVEPSKATM
Repeat found in LOC131623527
Repeat occurs 3 times in a sequence of 311 amino acids
Location between 764025 and 765666
Coverage of 5.79 %
Instances:
GIGLSV | GIGIGV | GIGVGI |
pattern: GIG[ILV][SG][IV]
MENSYKVNGNGSAENGYSIARHTHSYQPSLKGSLPWLDIRVFYVRVCKCEFDNTTPEVLTLNH
VPLNPDTLLEVNGVRSSIYSDGMSTLLKRDRVDRKSEEVTFVSTDSIRMSGSVKFEVFDKDNLL
LFGALELCNSNGVVRESNSNGQSWSMKCESNIIPGTRFFKEKQLLLPETTLPTVEVYIAGSFSG
TPIILTKTLHLSSQKRHSRKGVLKAIPENDANENGKDPSSALALQGPDYMLDKHEDEDYHSLYT
RTAYADGEDGELSWFNAGVRVGVGIGLSVCLGIGIGVGIMVKTYQGATGQFRRRLL

Similar gene clusters

NW_026705087 - Cluster 48 - Saccharide

Gene cluster description

NW_026705087 - Gene Cluster 48. Type = saccharide. Location: 205247 - 463519 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026705146 - Cluster 49 - Cyclopeptide

Gene cluster description

NW_026705146 - Gene Cluster 49. Type = cyclopeptide. Location: 1 - 1170491 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC131626855
Repeat occurs 23 times in a sequence of 347 amino acids
Location between 532362 and 538144
Coverage of 99.42 %
Instances:
AAGTRLRVPGCGYPA | AAGTRLRVPGCGYPG | AAGTRLRVPGCGYPA | AAGTRLRVPGCGYPA | AAGTRLRVPGCGYPA
AAGYPAAGTRLRVPG | AAGTRLRVPGCGYPA | AAGTPAAGTRLRVPG | AAGTRLRVPGCGYPA | AAGTRLRVPGCGYPA
AAGTRLRVPGCGYPA | AAGTRLRVPGCGYPA | AAGTRLRVPGCGYPA | AAGTRLRVPGCGYPA | AAGTRLRVPGCGYPA
AAGTRLRVPGCGYPA | AAGTRLRVPGCGYPA | AAGTRLRVPGCGYPA | AAGTRLRVPRLRVPG | AAGTRLRVPGCGYPA
AAGTRLRVPGCGYPA | AAGTRLRVPGCGYPA | AAGTRLRVPGCGGLT |
pattern: AAG[YT][PR][AL][AR][GV][PT][RG][CL][RG][YGV][PL][ATG]
MTLPICKYDSELHSEYPNHNFSVAGTRNPAAGTRLRVPGCGYPAAGTRLRVPGCGYPGCGYPA
AGTRLRVPGCGYPA
AGTRLRVPGCGYPAAGTRLRVPGCGYPAAGYPAAGTRLRVPGCGYPAAGT
PAAGTRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLR
VPGCGYPA
AGTRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLRVPGCGYPA
AGTRLRVPGCGYPAAGTRLRVPRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLRVPGCGYPAAGT
RLRVPGCGYPA
AGTRLRVPGCGGLTICY

Similar gene clusters

NW_026705156 - Cluster 50 - Polyketide

Gene cluster description

NW_026705156 - Gene Cluster 50. Type = polyketide. Location: 743207 - 1117375 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026705200 - Cluster 51 - Cyclopeptide

Gene cluster description

NW_026705200 - Gene Cluster 51. Type = cyclopeptide. Location: 1 - 699046 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1123 amino acids
Location between 608914 and 612530
Coverage of 6.23 %
Instances:
APTQDAPTQE | APTQEAPTQD | APTQDAPTQE | APTQEAPTQE | APTQEAPTQE
APTQEAPTQA | APTQASQTSK |
pattern: APTQ[AED][AS][QP]T[QS][AKED]
MSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERVKMWW
KHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGKACAS
EKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVPNETD
NNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEFRSLK
QFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSKHKCG
RQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIVEGDS
SQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFIGLDG
CHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISDQQKG
LVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKEANVD
AYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMGRFAT
LREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTCSCNY
WELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPIIMPPL
FKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAARTSQEA
PTQDAPTQE
APTQDAPTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKTKKSK
TVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDDVQDA
VPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVKTFVG
KAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIPGPGK
KPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1123 amino acids
Location between 608914 and 612530
Coverage of 6.23 %
Instances:
APTQDAPTQE | APTQEAPTQD | APTQDAPTQE | APTQEAPTQE | APTQEAPTQE
APTQEAPTQA | APTQASQTSK |
pattern: APTQ[AED][AS][QP]T[QS][AKED]
MSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERVKMWW
KHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGKACAS
EKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVPNETD
NNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEFRSLK
QFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSKHKCG
RQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIVEGDS
SQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFIGLDG
CHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISDQQKG
LVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKEANVD
AYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMGRFAT
LREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTCSCNY
WELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPIIMPPL
FKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAARTSQEA
PTQDAPTQE
APTQDAPTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKTKKSK
TVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDDVQDA
VPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVKTFVG
KAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIPGPGK
KPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1123 amino acids
Location between 608914 and 612530
Coverage of 6.23 %
Instances:
APTQDAPTQE | APTQEAPTQD | APTQDAPTQE | APTQEAPTQE | APTQEAPTQE
APTQEAPTQA | APTQASQTSK |
pattern: APTQ[AED][AS][QP]T[QS][AKED]
MSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERVKMWW
KHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGKACAS
EKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVPNETD
NNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEFRSLK
QFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSKHKCG
RQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIVEGDS
SQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFIGLDG
CHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISDQQKG
LVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKEANVD
AYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMGRFAT
LREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTCSCNY
WELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPIIMPPL
FKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAARTSQEA
PTQDAPTQE
APTQDAPTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKTKKSK
TVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDDVQDA
VPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVKTFVG
KAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIPGPGK
KPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1123 amino acids
Location between 608914 and 612530
Coverage of 6.23 %
Instances:
APTQDAPTQE | APTQEAPTQD | APTQDAPTQE | APTQEAPTQE | APTQEAPTQE
APTQEAPTQA | APTQASQTSK |
pattern: APTQ[AED][AS][QP]T[QS][AKED]
MSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERVKMWW
KHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGKACAS
EKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVPNETD
NNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEFRSLK
QFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSKHKCG
RQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIVEGDS
SQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFIGLDG
CHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISDQQKG
LVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKEANVD
AYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMGRFAT
LREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTCSCNY
WELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPIIMPPL
FKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAARTSQEA
PTQDAPTQE
APTQDAPTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKTKKSK
TVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDDVQDA
VPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVKTFVG
KAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIPGPGK
KPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1123 amino acids
Location between 608914 and 612530
Coverage of 6.23 %
Instances:
APTQDAPTQE | APTQEAPTQD | APTQDAPTQE | APTQEAPTQE | APTQEAPTQE
APTQEAPTQA | APTQASQTSK |
pattern: APTQ[AED][AS][QP]T[QS][AKED]
MSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERVKMWW
KHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGKACAS
EKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVPNETD
NNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEFRSLK
QFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSKHKCG
RQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIVEGDS
SQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFIGLDG
CHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISDQQKG
LVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKEANVD
AYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMGRFAT
LREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTCSCNY
WELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPIIMPPL
FKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAARTSQEA
PTQDAPTQE
APTQDAPTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKTKKSK
TVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDDVQDA
VPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVKTFVG
KAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIPGPGK
KPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1127 amino acids
Location between 604998 and 612530
Coverage of 6.21 %
Instances:
APTQDAPTQE | APTQEAPTQD | APTQDAPTQE | APTQEAPTQE | APTQEAPTQE
APTQEAPTQA | APTQASQTSK |
pattern: APTQ[AED][AS][QP]T[QS][AKED]
MLQEMSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERV
KMWWKHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGK
ACASEKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVP
NETDNNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEF
RSLKQFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSK
HKCGRQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIV
EGDSSQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFI
GLDGCHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISD
QQKGLVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKE
ANVDAYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMG
RFATLREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTC
SCNYWELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPII
MPPLFKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAART
SQEAPTQDAPTQEAPTQDAPTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKT
KKSKTVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDD
VQDAVPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVK
TFVGKAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIP
GPGKKPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628110
Repeat occurs 8 times in a sequence of 413 amino acids
Location between 312549 and 314700
Coverage of 13.56 %
Instances:
NQPFGFW | NQPFGFW | NQPFGFW | NQPFGFW | NQPFGFW
NQPFGFW | NQPFGFW | NQPFVRD |
pattern: NQPF[GV][RF][DW]
MEFQNLSVLALFFLALVGINGSKSGEEYWKSVWPNTTMPKALSDLLPSDIETSMPIKSHEENQ
YWTIFFEHDLYPGKKMNLGIQKPSESSEYDKKETSKASHPFGFWAWSRKETEKENQPFGFWAWS
RKETEKENQPFGFWIWQRKESEKENQPFGFWIWQRKETEKENQPFGFWIWQRKETEKKNQPFGF
W
AWSRKESEEQNQPFGFWAWSRKESEKENQPFGFWAWSRKANQPFVRDTQKESQHSVALTSDEK
EAHMINEYCRKPSAIGEDKYCALSLESMVDFVISKLGKNIKVMSSSLGQNQENYVVEEVNKIGD
KAVMCHRLNFKKVVFYCHEVNATTTYMVPLVAPDGTKSKALTICHHDTRGMNANMLKEVLNVKP
GTVPVCHFIGNKAIAWVPDMSEPSDHPCAI
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1127 amino acids
Location between 604998 and 612530
Coverage of 6.21 %
Instances:
APTQDAPTQE | APTQEAPTQD | APTQDAPTQE | APTQEAPTQE | APTQEAPTQE
APTQEAPTQA | APTQASQTSK |
pattern: APTQ[AED][AS][QP]T[QS][AKED]
MLQEMSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERV
KMWWKHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGK
ACASEKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVP
NETDNNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEF
RSLKQFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSK
HKCGRQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIV
EGDSSQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFI
GLDGCHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISD
QQKGLVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKE
ANVDAYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMG
RFATLREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTC
SCNYWELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPII
MPPLFKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAART
SQEAPTQDAPTQEAPTQDAPTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKT
KKSKTVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDD
VQDAVPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVK
TFVGKAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIP
GPGKKPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ

Similar gene clusters

NW_026705238 - Cluster 52 - Fatty_acid-saccharide

Gene cluster description

NW_026705238 - Gene Cluster 52. Type = fatty_acid-saccharide. Location: 260194 - 915933 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026705502 - Cluster 53 - Cyclopeptide

Gene cluster description

NW_026705502 - Gene Cluster 53. Type = cyclopeptide. Location: 1 - 437028 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC131633585
Repeat occurs 10 times in a sequence of 647 amino acids
Location between 92680 and 94923
Coverage of 23.18 %
Instances:
DSNEQPYITQYGHTK | DSNGPYKTSYGNHEA | DSNEPYKTSYGNREA | DSNEPYRTSYGNHEA | DSNEPYKTSYGNHEA
DSNKPYKTSYGNHEA | DSNEPYKTSYGNREA | DSNEPYRTSYGNHEA | DSNKPYKTSYGNHEA | DSNEPYKTSYGNHEA

pattern: DSN[KEG][QP][YP][YKR][IT][ST][QY][YG][GN][HR][ET][AK]
MAQTLAPKFLTLLLFFFMNGQGITARDLKTELRDVDSNEQPYITQYGHTKPDYAASYNTHDSN
GPYKTSYGNHEA
EKPDYNTGYRTHTHHSNELYKTAYGNHEAEKPDYNTGYKARTHDSNEPYKTS
YGNREA
EKPDYNTGYRAHTHDSNEPYRTSYGNHEAEKPNYNTGYRAHTHDSNEPYKTSYGNHEA
EKPDYDTGYRAHTHDSNKPYKTSYGNHEAEKPDYDTGYRAHTHDSNEPYKTSYGNREAEKPDYN
TGYRAHTHDSNEPYRTSYGNHEAEKPDYDTGYRAHTHDSNKPYKTSYGNHEAEKPDYDTGYRAH
THDSNEPYKTSYGNHEAEKPDYDTGYRTHTHNSNKPYLTYAKHETDESNGPYIASYGKHEPNHP
YITQYGPTSLDLKDLTSPNSKDLEGSASSNLDRTEAFKTGFFNLDDLYVGHEMTLQFPVQEVSH
YLPKKAADSIPLSKSQLPSVLQLFSISEDSTQAKAMRGTLEECEGETITGETKICANSLESMLE
FVDTIIGTDTKHSILSTSNLSPTATPLQKYTILEVSHDIHAPKWVACHPLPYPYAIHYCHYIAT
GSKVFKVTLVGDENGDKMEALGICHLDTSDWNPDHIVFKQLGIKAGMNTPVCHFFPVNHLLWVP
EEPSKATM

Similar gene clusters

NW_026705654 - Cluster 54 - Saccharide

Gene cluster description

NW_026705654 - Gene Cluster 54. Type = saccharide. Location: 219592 - 319723 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026705869 - Cluster 55 - Fatty_acid-alkaloid

Gene cluster description

NW_026705869 - Gene Cluster 55. Type = fatty_acid-alkaloid. Location: 163682 - 342477 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026706016 - Cluster 56 - Polyketide-alkaloid

Gene cluster description

NW_026706016 - Gene Cluster 56. Type = polyketide-alkaloid. Location: 1 - 303682 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/polyketide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters