Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_031989.1:
Cluster 1Cyclopeptide49097515647530737.78BURP---
Cluster 2Cyclopeptide60743024623277941584.77BURP---
Cluster 3Polyketide10049997810059035690.382OG-FeII_Oxy, Chal_sti_synt_C, Chal_sti_synt_N, DIOX_N---
The following clusters are from record NC_031990.1:
Cluster 4Putative61497356896024746.292OG-FeII_Oxy, Aldo_ket_red, DIOX_N, Methyltransf_11, p450---
Cluster 5Fatty_acid-Alkaloid-Saccharide4505308545185180132.09BBE, FAD_binding_4, FA_hydroxylase, Methyltransf_11, UDPGT_2*saccharide--
The following clusters are from record NC_031993.1:
Cluster 6Saccharide-Lignan1429050814427739137.23Dirigent, UDPGT_2alkaloid-2, cytokinin-2--
Cluster 7Saccharide1810206618408375306.312OG-FeII_Oxy, ADH_N_2, ADH_zinc_N, DIOX_N, Dimerisation, Methyltransf_2, UDPGT_2---
The following clusters are from record NC_031994.1:
Cluster 8Alkaloid-Saccharide53360475910675574.63Bet_v_1, ECH_2, Glycos_transf_1, HAD_RAM2_N, Pyridoxal_deC---
Cluster 9Alkaloid65753716859858284.49AMP-binding, Pyridoxal_deC, adh_short---
Cluster 10Saccharide88440829126395282.312OG-FeII_Oxy, DIOX_N, Epimerase, UDPGT_2flavonoidalpha-chaconine/alpha-solanine (22% of genes show similarity)BGC0002722.2_c1
Cluster 11Saccharide3264777632948617300.84Epimerase, UDPGT_2, p450alkaloid, cytokinin--
Cluster 12Alkaloid8705806387288737230.672OG-FeII_Oxy, Bet_v_1, DIOX_N---
The following clusters are from record NC_031995.1:
Cluster 13Saccharide38393753979456140.08Aldo_ket_red, Dimerisation, Glycos_transf_1, Methyltransf_2---
Cluster 14Saccharide41744344274438100.00Amino_oxidase, ECH_2, UDPGT_2*saccharide--
Cluster 15Alkaloid8087340981179205305.80Cu_amine_oxid, Methyltransf_11, p450---
The following clusters are from record NC_031997.1:
Cluster 16Terpene37490213882071133.05Terpene_synth, Terpene_synth_C, p450---
Cluster 17Saccharide91206089973347852.74Epimerase, Glyco_hydro_1, Methyltransf_3---
The following clusters are from record NC_032000.1:
Cluster 18Alkaloid1220354312419981216.44AMP-binding, Cu_amine_oxid, Str_synth---
Cluster 19Cyclopeptide21296380240230502726.67BURP-QS-21 (11% of genes show similarity)BGCMANUAL2.1_c1
The following clusters are from record NW_017670279.1:
Cluster 20Cyclopeptide113274841327.48BURP---
The following clusters are from record NW_017670303.1:
Cluster 21Saccharide7250801030319305.24UDPGT_2, p450flavonoid-2, oleananes-2--
The following clusters are from record NW_017670315.1:
Cluster 22Cyclopeptide110303221030.32BURP---
The following clusters are from record NW_017670343.1:
Cluster 23Fatty_acid58217432358374.142OG-FeII_Oxy, AMP-binding, DIOX_N, FA_desaturase---
The following clusters are from record NW_017670473.1:
Cluster 24Saccharide287779663437375.66Glyco_hydro_1, Lycopene_cycl, Transferase---
The following clusters are from record NW_017670748.1:
Cluster 25Cyclopeptide1467702467.70BURP---
The following clusters are from record NW_017670779.1:
Cluster 26Alkaloid96846289108192.26Bet_v_1, Cu_amine_oxid, p450---
The following clusters are from record NW_017671029.1:
Cluster 27Putative19326627049777.23Transferase, adh_short, p450---
The following clusters are from record NW_017672839.1:
Cluster 28Cyclopeptide1165968165.97BURP---
The following clusters are from record NW_017673468.1:
Cluster 29Saccharide4784811217164.32Cellulose_synt, UDPGT_2oleananes-3, flavonoid-2--

NC_031989 - Cluster 1 - Cyclopeptide

Gene cluster description

NC_031989 - Gene Cluster 1. Type = cyclopeptide. Location: 4909751 - 5647530 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_031989 - Cluster 2 - Cyclopeptide

Gene cluster description

NC_031989 - Gene Cluster 2. Type = cyclopeptide. Location: 60743024 - 62327794 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_031989 - Cluster 3 - Polyketide

Gene cluster description

NC_031989 - Gene Cluster 3. Type = polyketide. Location: 100499978 - 100590356 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031990 - Cluster 4 - Putative

Gene cluster description

NC_031990 - Gene Cluster 4. Type = putative. Location: 6149735 - 6896024 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031990 - Cluster 5 - Fatty_acid-alkaloid-saccharide

Gene cluster description

NC_031990 - Gene Cluster 5. Type = fatty_acid-alkaloid-saccharide. Location: 45053085 - 45185180 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031993 - Cluster 6 - Saccharide-lignan

Gene cluster description

NC_031993 - Gene Cluster 6. Type = saccharide-lignan. Location: 14290508 - 14427739 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031993 - Cluster 7 - Saccharide

Gene cluster description

NC_031993 - Gene Cluster 7. Type = saccharide. Location: 18102066 - 18408375 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031994 - Cluster 8 - Alkaloid-saccharide

Gene cluster description

NC_031994 - Gene Cluster 8. Type = alkaloid-saccharide. Location: 5336047 - 5910675 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031994 - Cluster 9 - Alkaloid

Gene cluster description

NC_031994 - Gene Cluster 9. Type = alkaloid. Location: 6575371 - 6859858 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031994 - Cluster 10 - Saccharide

Gene cluster description

NC_031994 - Gene Cluster 10. Type = saccharide. Location: 8844082 - 9126395 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_031994 - Cluster 11 - Saccharide

Gene cluster description

NC_031994 - Gene Cluster 11. Type = saccharide. Location: 32647776 - 32948617 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031994 - Cluster 12 - Alkaloid

Gene cluster description

NC_031994 - Gene Cluster 12. Type = alkaloid. Location: 87058063 - 87288737 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031995 - Cluster 13 - Saccharide

Gene cluster description

NC_031995 - Gene Cluster 13. Type = saccharide. Location: 3839375 - 3979456 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031995 - Cluster 14 - Saccharide

Gene cluster description

NC_031995 - Gene Cluster 14. Type = saccharide. Location: 4174434 - 4274438 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031995 - Cluster 15 - Alkaloid

Gene cluster description

NC_031995 - Gene Cluster 15. Type = alkaloid. Location: 80873409 - 81179205 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031997 - Cluster 16 - Terpene

Gene cluster description

NC_031997 - Gene Cluster 16. Type = terpene. Location: 3749021 - 3882071 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_031997 - Cluster 17 - Saccharide

Gene cluster description

NC_031997 - Gene Cluster 17. Type = saccharide. Location: 9120608 - 9973347 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_032000 - Cluster 18 - Alkaloid

Gene cluster description

NC_032000 - Gene Cluster 18. Type = alkaloid. Location: 12203543 - 12419981 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_032000 - Cluster 19 - Cyclopeptide

Gene cluster description

NC_032000 - Gene Cluster 19. Type = cyclopeptide. Location: 21296380 - 24023050 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC109235412
Repeat occurs 3 times in a sequence of 536 amino acids
Location between 21830011 and 21833848
Coverage of 3.36 %
Instances:
TLETWD | TLELGG | TLETGG |
pattern: TLE[LT][GW][GD]
MAARVFASRVSRSLTSSSHLLSRGSKSHLGRIAAYQYSTAAAFEEPIKPAVNVEHTKLFINGE
FVDAASGKTFPTLDPRTGEVIAHIAEGDAEDINRAVAAARKAFDEGPWPKMNAYERSKILLRLA
DLIEKHNDQIATLETWDTGKPYEQAAKIEVPMVVRLLRYYAGWADKIHGMTIPADGPYHVQTLH
EPIGVAGQIIPWNFPLLMFSWKIGPALACGNTVVLKTAEQTPLSAFYVAHLLQEAGLPEGVLNI
ISGFGPTAGASLCSHMDVDKLAFTGSTDTGKAILTLAAKSNLKPVTLELGGKSPFIVCEDADID
TAVEQAHFALFFNQGQCCCAGSRTYVHEKVYDEFLEKAKARALKRTVGDPFKSGTEQGPQIDSK
QFDKIMNYIRCGINSGATLETGGERLGERGYYIKPTVFSNVKDDMLIAQDEIFGPVQSILKFKD
LDDVIRRANNSRYGLAAGVFTQNIDTANTLTRALRVGTVWVNCFDTFDATIPFGGYKMSGHGRE
KGEYSLKNYLQVKAVVTPLKNPAWL
Repeat found in LOC109235424
Repeat occurs 3 times in a sequence of 403 amino acids
Location between 22791162 and 22800969
Coverage of 4.47 %
Instances:
AKLIEG | AKLVID | AKLVEG |
pattern: AKL[VI][EI][GD]
MQELTKRPTTPSILLTHLESKMKQFQSLMAVMLIFIEIASPIPFYGWQVWPISPAEAVLYSPE
TNIPRTGELALRRAIPANPNMKTIQDSLEDISYLLRIPQRKPFGTMEGNVKKALKIATDEKASI
LASIPAELRDEGSVLYAKLIEGKGGLQNLLQYIKDKDPDKVSVGLASTLDTVAQLELLQAPGLS
FLLPQQYLKYPRLTGRGIVEFTVEKVDGSTFSPEAAGVAKSTAKIQVVLDGYSAPLTAGNFAKL
VID
GAYDGMKLTSANQAILSDSELGKATGYSVPLEIKPSGQFEPLYRTTLSVQDGELPVLPLSV
YGAVAMAHSDVSEELSSPSQFFFYLYDKRNSGLGGLSFEEGQFSVFGYTTIGRDILPQIKTGDI
IRSAKLVEGQDRLVLPPQEN

Similar gene clusters

Similar known gene clusters

NW_017670279 - Cluster 20 - Cyclopeptide

Gene cluster description

NW_017670279 - Gene Cluster 20. Type = cyclopeptide. Location: 1 - 1327484 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC109237165
Repeat occurs 18 times in a sequence of 1157 amino acids
Location between 133403 and 138779
Coverage of 23.34 %
Instances:
SKEDPVKEKDAAESK | SKEDHVKVEDAQESK | SKEDHVKEEDAEDSK | SKEVPLKEEDAEDSK | SKEDPVKEEGLQDLK
SKEDPVKEKGLQDLK | SKEEAIKEKDVEESK | SKEDVVMGEDAEESE | SKEDAVMEKNVEESK | SKEDLVKEEDAEDSK
SKEYHVKEEDAEGSK | SKEDAVMEKDAEESK | SKEDLVKKEDAEYSK | SKEYPVKEDDAEGSK | SKEDAVKENDAEDSK
SKECPVMEEDGEESK | SKEEPVKGEDAEESK | SKEDPVKEKDAEDLN |
pattern: SKE[DVECY][VLHPA][ILV][MK][EGVK][EDKN][GDN][GLVA][EQA][EDGY][LS][EKN]
MRTRSAEKKSAGKQQIAAEPAAEPAAATKRTPPSRARQVRASGSTIPSPAEEAKASDADEAVQ
STPEAKPASGRKTTRRVVRKVVKKVPASSRTTPDKTPRSEDAGTAVPEDTDQDMVAEDSKEDPV
KEKDAAESK
KCVKKGDAEDSKEDHVKVEDAQESKKSIEEETENSKEDHVKEEDAEDSKEVPLKE
EDAEDSKEDPVKEEGLQDLKEDAAEEKDTEDLKEDIVKEEDAEDSKEDPVKEKGLQDLKEDATM
EKDAEDLKEDIVKEEDIVESKEEAIKEKDVEESKEDVVMGEDAEESEEYPAKEEDAEGSKEDAV
MEKNVEESK
EDLVKEEDAEDSKEYHVKEEDAEGSKEDAVMEKDAEESKEDLVKKEDAEYSKEYP
VKEDDAEGSKEDAVKENDAEDSKECPVMEEDGEESKEEPVKGEDAEESKEDPVKEKDAEDLNDA
GPTLMYEMDPEEMEESSNNLVGTVQDVGDSVMEDQNGNEEEQTGDAQEEPAKNTLMSLDEVNVG
SDMRSSEEIRHDMGTSENQGSTTTYVQSEDLIIGDMKSSDNQEHVSTEQKIQEGGGSNKAKEGS
ELMGGNDSNSTPTGDNINANCAEDRMEEDTDKKMKGKDISVDEVGEDKIEAFADQENGEEFVED
GVPEDCEEAEALDDERAQLAAHAKERMKRKELEVFVGGLDRDAVEEDLKRVFQHVGEVIDVRMH
REVSTNKNKGYAFVKFATKEQASRALSEMRNPVIRGKRCGTAPSEDNDTLFLGNICNTWTKEAV
RQKLKDYGVEGVQSVNLVADPKHEGLSRGFAFLEFPCHTDAMTAYKILQRPDVVFGHSERTAKV
AFAEPLRDPDPEVMAQVKSVFIDGLPPYWDEDRAREKFKCFGDISRITLARNMSTAKRKDFGFV
DFSTHEAAVACVAGINNTELGDGNLKAKVRARLSNPQPKTQAVKGGLSGGFRISRGPMGRGFPR
GGHTFGRANFPRGRGFYPRVPGHGGRMGFVEHQFGSSYPPIRGRSNFGRAGGRWNLSGAQPVSV
EGPMFLDRMRHGDRGHADDAFFRRQQFPVEGLNRPFMGRHFEDRYYHDNTDHGLKRPFPMTDPD
PDYSGPSRRPRFDHSESANSLHGDRYRDNFPPGGDHYTRDYYGSDRGRGPYPSFNGGDRPFGRG
YGRGYY
Repeat found in LOC109237165
Repeat occurs 18 times in a sequence of 1156 amino acids
Location between 133403 and 138779
Coverage of 23.36 %
Instances:
SKEDPVKEKDAAESK | SKEDHVKVEDAQESK | SKEDHVKEEDAEDSK | SKEVPLKEEDAEDSK | SKEDPVKEEGLQDLK
SKEDPVKEKGLQDLK | SKEEAIKEKDVEESK | SKEDVVMGEDAEESE | SKEDAVMEKNVEESK | SKEDLVKEEDAEDSK
SKEYHVKEEDAEGSK | SKEDAVMEKDAEESK | SKEDLVKKEDAEYSK | SKEYPVKEDDAEGSK | SKEDAVKENDAEDSK
SKECPVMEEDGEESK | SKEEPVKGEDAEESK | SKEDPVKEKDAEDLN |
pattern: SKE[DVECY][VLHPA][ILV][MK][EGVK][EDKN][GDN][GLVA][EQA][EDGY][LS][EKN]
MRTRSAEKKSAGKQQIAAEPAAEPAAATKRTPPSRARQVRASGSTIPSPAEEAKASDADEAVQ
STPEAKPASGRKTTRRVVRKVVKKVPASSRTTPDKTPRSEDAGTAVPEDTDQDMVAEDSKEDPV
KEKDAAESK
KCVKKGDAEDSKEDHVKVEDAQESKKSIEEETENSKEDHVKEEDAEDSKEVPLKE
EDAEDSKEDPVKEEGLQDLKEDAAEEKDTEDLKEDIVKEEDAEDSKEDPVKEKGLQDLKEDATM
EKDAEDLKEDIVKEEDIVESKEEAIKEKDVEESKEDVVMGEDAEESEEYPAKEEDAEGSKEDAV
MEKNVEESK
EDLVKEEDAEDSKEYHVKEEDAEGSKEDAVMEKDAEESKEDLVKKEDAEYSKEYP
VKEDDAEGSKEDAVKENDAEDSKECPVMEEDGEESKEEPVKGEDAEESKEDPVKEKDAEDLNDA
GPTLMYEMDPEEMEESSNNLVGTVQDVGDSVMEDQNGNEEEQTGDAQEEPAKNTLMSLDEVNVG
SDMRSSEEIRHDMGTSENQGSTTTYVQSEDLIIGDMKSSDNQEHVSTEQKIQEGGGSNKAKEGS
ELMGGNDSNSTPTGDNINANCAEDRMEEDTDKKMKGKDISVDEVGEDKIEAFADQENGEEFVED
GVPEDCEEAEALDDERAQLAAHAKERMKRKELEVFVGGLDRDAVEEDLKRVFQHVGEVIDVRMH
REVSTNKNKGYAFVKFATKEQASRALSEMRNPVIRGKRCGTAPSEDNDTLFLGNICNTWTKEAV
RQKLKDYGVEGVQSVNLVADPKHEGLSRGFAFLEFPCHTDAMTAYKILQRPDVVFGHSERTAKV
AFAEPLRDPDPEVMAQVKSVFIDGLPPYWDEDRAREKFKCFGDISRITLARNMSTAKRKDFGFV
DFSTHEAAVACVAGINNTELGDGNLKAKVRARLSNPQPKTQAVKGGLSGGFRISRGPMGRGFPR
GGHTFGRANFPRGRGFYPRVPGHGGRMGFVEHQFGSSYPPIRGRSNFGRGGRWNLSGAQPVSVE
GPMFLDRMRHGDRGHADDAFFRRQQFPVEGLNRPFMGRHFEDRYYHDNTDHGLKRPFPMTDPDP
DYSGPSRRPRFDHSESANSLHGDRYRDNFPPGGDHYTRDYYGSDRGRGPYPSFNGGDRPFGRGY
GRGYY
Repeat found in LOC109237168
Repeat occurs 10 times in a sequence of 680 amino acids
Location between 202221 and 206805
Coverage of 22.06 %
Instances:
DEVRELKQDNTANKE | DEVHGLKQDNTATKE | DEVHELKQDNTATKE | DEVRELKQDNTATKK | DEVHELKQANTATKD
DEVHELKQDNTATKE | DEVRELKQDNTATKE | DEVHELKHDNTVTKE | DEVRGLKRESTATKE | DEVRHLKRESHYMHH

pattern: DEV[RH][EGH]LK[RQH][EDA][SN][TH][YVA][TMN][HK][EDHK]
MKLQLLYSLTILWLAFVTSHAAISPEVYWKVKLPNTQIPKVIKDCLPQADNEISQLKQDNTAN
KEKVYYGLHKPYGYDRAATEDEVRELKQDNTANKEKVYYGLHEPYGYDRTATEDEVHGLKQDNT
ATKE
KVYYGLRKPYGYDRAASEDEVHELKQDNTATKEKIYNGLHKPYGYDRAAKEDEVRELKQD
NTATKK
KVYYGLHKPYGYDRAATDDEVHELKQANTATKDKVYYGLHKPYGYDRVATEDEVHELK
QDNTATKE
KVYYNLHKLYSYDRDASQNEVHELKQDNTATKEKLYYGLHKPYGYDHAALEDEVRE
LKQDNTATKE
KVYYGLHKPYGYDRAATEDEVHELKHDNTVTKEKVYYSLQKSYGYDRAAIEDEV
RGLKRESTATKE
KVYHGLHQRFGVHTLRRAAAEDEVRHLKRESHYMHHATTVNNLQQVNEGSSA
KSDLKDNYLYKPYFFEKNLEKGKIINFPSLKNKNKAPFLSRQFVESIPFSLEKVLEILNYFSID
SISKDAQTIEETIRHCEEPAMKGEKKICATSLESMVDISLTMLGTNNVHAVTTEVEGETQMLQK
YTIKKVQKIADGDNLICHKLSYTYAVYNCHVGGRTKIFMVSMVGADGTKVKAVLVCHKDTSFWN
PKGLPFVLLKVKPGTTPVCHFLQDDQIAFLPSKDATKLSDN
Repeat found in LOC109237168
Repeat occurs 13 times in a sequence of 746 amino acids
Location between 202221 and 206805
Coverage of 26.14 %
Instances:
DEVRELKQDNTANKE | DEVHGLKQDNTATKE | DEVHEMKQDNTATKE | DEVHELKQDNTATKE | DEVHELKQDNTDTKD
DEVHELKQDNTATKE | DEVRELKQDNTATKK | DEVHELKQANTATKD | DEVHELKQDNTATKE | DEVRELKQDNTATKE
DEVHELKHDNTVTKE | DEVRGLKRESTATKE | DEVRHLKRESHYMHH |
pattern: DEV[RH][EGH][LM]K[RQH][EDA][SN][TH][DYVA][TMN][HK][EDHK]
MKLQLLYSLTILWLAFVTSHAAISPEVYWKVKLPNTQIPKVIKDCLPQADNEISQLKQDNTAN
KEKVYYGLHKPYGYDRAATEDEVRELKQDNTANKEKVYYGLHEPYGYDRTATEDEVHGLKQDNT
ATKE
KVYYGLHKPYGYDRTATEDEVHEMKQDNTATKEKVYYGLHKPYGYDRAATEDEVHELKQD
NTATKE
KVYYGLHKPYGYNRAATEDEVHELKQDNTDTKDKVYYGLRKPYGYDRAASEDEVHELK
QDNTATKE
KIYNGLHKPYGYDRAAKEDEVRELKQDNTATKKKVYYGLHKPYGYDRAATDDEVHE
LKQANTATKD
KVYYGLHKPYGYDRVATEDEVHELKQDNTATKEKLYYGLHKPYGYDHAALEDEV
RELKQDNTATKE
KVYYGLHKPYGYDRAATEDEVHELKHDNTVTKEKVYYSLQKSYGYDRAAIED
EVRGLKRESTATKE
KVYHGLHQRFGVHTLRRAAAEDEVRHLKRESHYMHHATTVNNLQQVNEGS
SAKSDLKDNYLYKPYFFEKNLEKGKIINFPSLKNKNKAPFLSRQFVESIPFSLEKVLEILNYFS
IDSISKDAQTIEETIRHCEEPAMKGEKKICATSLESMVDISLTMLGTNNVHAVTTEVEGETQML
QKYTIKKVQKIADGDNLICHKLSYTYAVYNCHVGGRTKIFMVSMVGADGTKVKAVLVCHKDTSF
WNPKGLPFVLLKVKPGTTPVCHFLQDDQIAFLPSKDATKLSDN
Repeat found in LOC109237168
Repeat occurs 13 times in a sequence of 779 amino acids
Location between 202221 and 206805
Coverage of 25.03 %
Instances:
DEVRELKQDNTANKE | DEVHGLKQDNTATKE | DEVHEMKQDNTATKE | DEVHELKQDNTATKE | DEVHELKQDNTDTKD
DEVHELKQDNTATKE | DEVRELKQDNTATKK | DEVHELKQANTATKD | DEVHELKQDNTATKE | DEVRELKQDNTATKE
DEVHELKHDNTVTKE | DEVRGLKRESTATKE | DEVRHLKRESHYMHH |
pattern: DEV[RH][EGH][LM]K[RQH][EDA][SN][TH][DYVA][TMN][HK][EDHK]
MKLQLLYSLTILWLAFVTSHAAISPEVYWKVKLPNTQIPKVIKDCLPQADNEISQLKQDNTAN
KEKVYYGLHKPYGYDRAATEDEVRELKQDNTANKEKVYYGLHEPYGYDRTATEDEVHGLKQDNT
ATKE
KVYYGLHKPYGYDRTATEDEVHEMKQDNTATKEKVYYGLHKPYGYDRAATEDEVHELKQD
NTATKE
KVYYGLHKPYGYNRAATEDEVHELKQDNTDTKDKVYYGLRKPYGYDRAASEDEVHELK
QDNTATKE
KIYNGLHKPYGYDRAAKEDEVRELKQDNTATKKKVYYGLHKPYGYDRAATDDEVHE
LKQANTATKD
KVYYGLHKPYGYDRVATEDEVHELKQDNTATKEKVYYNLHKLYSYDRDASQNEV
HELKQDNTATKEKLYYGLHKPYGYDHAALEDEVRELKQDNTATKEKVYYGLHKPYGYDRAATED
EVHELKHDNTVTKE
KVYYSLQKSYGYDRAAIEDEVRGLKRESTATKEKVYHGLHQRFGVHTLRR
AAAEDEVRHLKRESHYMHHATTVNNLQQVNEGSSAKSDLKDNYLYKPYFFEKNLEKGKIINFPS
LKNKNKAPFLSRQFVESIPFSLEKVLEILNYFSIDSISKDAQTIEETIRHCEEPAMKGEKKICA
TSLESMVDISLTMLGTNNVHAVTTEVEGETQMLQKYTIKKVQKIADGDNLICHKLSYTYAVYNC
HVGGRTKIFMVSMVGADGTKVKAVLVCHKDTSFWNPKGLPFVLLKVKPGTTPVCHFLQDDQIAF
LPSKDATKLSDN

Similar gene clusters

NW_017670303 - Cluster 21 - Saccharide

Gene cluster description

NW_017670303 - Gene Cluster 21. Type = saccharide. Location: 725080 - 1030319 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_017670315 - Cluster 22 - Cyclopeptide

Gene cluster description

NW_017670315 - Gene Cluster 22. Type = cyclopeptide. Location: 1 - 1030322 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NW_017670343 - Cluster 23 - Fatty_acid

Gene cluster description

NW_017670343 - Gene Cluster 23. Type = fatty_acid. Location: 58217 - 432358 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_017670473 - Cluster 24 - Saccharide

Gene cluster description

NW_017670473 - Gene Cluster 24. Type = saccharide. Location: 287779 - 663437 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_017670748 - Cluster 25 - Cyclopeptide

Gene cluster description

NW_017670748 - Gene Cluster 25. Type = cyclopeptide. Location: 1 - 467702 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC109242957
Repeat occurs 3 times in a sequence of 264 amino acids
Location between 146841 and 147636
Coverage of 6.82 %
Instances:
GKRQNN | GKRRYN | GKRENN |
pattern: GKR[ERQ][YN]N
MIFLRHHLDEGLKVEYLTVKDPLELWNGLKERYDHLKATVLPRARYVWIHLRLQDFKTVSEYN
SAVFRITSQLKLCGEVMNDEDLLEKILTTFHASNMVLQQQYRERGFKKYSELISCLLVAEQQNT
LLLKNHEARPTGSAPLPETNMAARRDKSGKRQNNNYGHMNVHGHGNGKRRYNSRHRGGHGKREN
N
MGSQNNPSRGKSGNCHRCGMKGHWKIECRAPEHFVRLYQNSIKIKANNVGASSANAPVESHLT
FKNDFEAGP

Similar gene clusters

NW_017670779 - Cluster 26 - Alkaloid

Gene cluster description

NW_017670779 - Gene Cluster 26. Type = alkaloid. Location: 96846 - 289108 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_017671029 - Cluster 27 - Putative

Gene cluster description

NW_017671029 - Gene Cluster 27. Type = putative. Location: 193266 - 270497 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_017672839 - Cluster 28 - Cyclopeptide

Gene cluster description

NW_017672839 - Gene Cluster 28. Type = cyclopeptide. Location: 1 - 165968 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NW_017673468 - Cluster 29 - Saccharide

Gene cluster description

NW_017673468 - Gene Cluster 29. Type = saccharide. Location: 47848 - 112171 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters