Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_065570.1:
Cluster 1Saccharide-Fatty_acid5209011752321453231.34DIOX_N, ECH_2, LTP_2, Transferase, UDPGT_2cyanogenic glucoside-2, monoterpenoid-2--
Cluster 2Cyclopeptide5273074253073629342.89BURP---
Cluster 3Saccharide583046635839489790.23Aminotran_1_2, Glycos_transf_2, SE---
Cluster 4Terpene-Fatty_acid6730396267412102108.142OG-FeII_Oxy, DIOX_N, FA_desaturase, Terpene_synth, Terpene_synth_C, Transferase, p450---
Cluster 5Cyclopeptide7012015570376857256.70ADH_N, ADH_zinc_N, BURP, FA_desaturase_2, Peptidase_S10, p450---
Cluster 6Fatty_acid7028058670402715122.13ADH_N, ADH_zinc_N, FA_desaturase_2, Peptidase_S10, p450---
Cluster 7Putative707216777081344191.76Abhydrolase_3, Dimerisation, Epimerase, Methyltransf_11, Methyltransf_2---
Cluster 8Saccharide-Fatty_acid731617977318651924.72AMP-binding, Glycos_transf_2, Transferase---
The following clusters are from record NC_065571.1:
Cluster 9Alkaloid979553103966060.11Bet_v_1, Transferase---
Cluster 10Saccharide-Polyketide2216220230016183.94Cellulose_synt, Chal_sti_synt_C, Chal_sti_synt_N, SE---
Cluster 11Cyclopeptide71917887576760384.97BURP---
Cluster 12Cyclopeptide90680579587077519.02BURP---
Cluster 13Cyclopeptide92223109694710472.40BURP---
Cluster 14Saccharide-Terpene1047917210758654279.48Glyco_hydro_1, Terpene_synth, Terpene_synth_C, p450---
Cluster 15Alkaloid1183095812002352171.39Amino_oxidase, Bet_v_1, Methyltransf_11---
Cluster 16Cyclopeptide15540148166506731110.53BURP---
Cluster 17Saccharide-Terpene4389930344040042140.74Glycos_transf_1, Prenyltrans, SQHop_cyclase_C, SQHop_cyclase_N, p450Cycloartenoltirucalla (21% of genes show similarity)BGC0001314.3_c1
Cluster 18Terpene4733749047463827126.34Epimerase, Terpene_synth_C, Transferase, p450---
Cluster 19Saccharide491799104925836978.46Glycos_transf_2, UDPGT_2, p450---
Cluster 20Cyclopeptide5024394950595412351.46BURP-arabidiol/baruol (13% of genes show similarity)BGC0002906.1_c1
The following clusters are from record NC_065572.1:
Cluster 21Alkaloid-Fatty_acid5667928557201527522.24BBE, FAD_binding_4, FA_hydroxylase---
Cluster 22Cyclopeptide6737721167901756524.54BURP---
Cluster 23Transporter_associated6891831069103722185.41LTP_2, p450---
Cluster 24Putative694408496951745976.61Transferase, adh_short---
Cluster 25Terpene727070167274168834.67Terpene_synth, Terpene_synth_C, p450-casbene/5a-hydroxy-casbene/5-keto-casbene/5-keto-7,8-epoxy... (47% of genes show similarity)BGC0002393.2_c1
Cluster 26Cyclopeptide7410583274355763249.93BURP---
The following clusters are from record NC_065573.1:
Cluster 27Saccharide249929532507360780.652OG-FeII_Oxy, DIOX_N, Dimerisation, Methyltransf_2, UDPGT_2, p450flavonoid, oleananes--
Cluster 28Alkaloid287442812880459560.31AMP-binding, Str_synth---
Cluster 29Lignan305005753055536554.79Dirigent, Methyltransf_11, p450---
Cluster 30Putative3065840730765798107.392OG-FeII_Oxy, DIOX_N, Methyltransf_7, Transferase, p450---
Cluster 31Terpene312871483131368026.53Aldo_ket_red, Amino_oxidase, PRISE---
Cluster 32Saccharide362889813637632687.34AMP-binding, Amino_oxidase, Glyco_hydro_1, Glycos_transf_1, SQS_PSY---
Cluster 33Alkaloid377323713780440872.04Cu_amine_oxid, Epimerase, LTP_2, p450---
The following clusters are from record NC_065574.1:
Cluster 34Cyclopeptide54584385751141292.70BURP---
Cluster 35Terpene74253487552412127.06ABC2_membrane, ABC_tran, Aminotran_3, SQHop_cyclase_C, SQHop_cyclase_N, p450triterpene-4lupeol (11% of genes show similarity)BGC0001317.3_c1
Cluster 36Putative91140219261441147.422OG-FeII_Oxy, Aminotran_1_2, DIOX_N---
Cluster 37Phenolamide1058212810714408132.28Acetyltransf_1, MatE, Orn_Arg_deC_N, Orn_DAP_Arg_deC, Transferase, p450---
Cluster 38Saccharide120976091219454596.942OG-FeII_Oxy, DIOX_N, Lycopene_cycl, UDPGT_2---
Cluster 39Saccharide1619209916374525182.432OG-FeII_Oxy, DIOX_N, Glyco_hydro_1---
Cluster 40Terpene4249591242910700414.792OG-FeII_Oxy, DIOX_N, SQHop_cyclase_C, SQHop_cyclase_N, Terpene_synth, Terpene_synth_C, Transferase, p450triterpene-3--
Cluster 41Saccharide-Terpene4303229743211187178.89Glycos_transf_1, SQHop_cyclase_C, SQHop_cyclase_N, Terpene_synth, Terpene_synth_Ctriterpene--
Cluster 42Cyclopeptide46934471479546651020.19BURP---
The following clusters are from record NC_065575.1:
Cluster 43Saccharide-Terpene35292640557252.65PRISE, Prenyltransf, UDPGT_2---
Cluster 44Cyclopeptide23531512991554638.40BURP---
Cluster 45Cyclopeptide35175803869861352.28BURP---
Cluster 46Terpene419683584204201173.65Acetyltransf_1, Chalcone_3, Terpene_synth, Terpene_synth_C---
Cluster 47Alkaloid436653894372757262.18Abhydrolase_3, Acetyltransf_1, Bet_v_1---
Cluster 48Putative455151124560418889.082OG-FeII_Oxy, Abhydrolase_3, DIOX_N, p450---
The following clusters are from record NC_065576.1:
Cluster 49Terpene1316455913269223104.662OG-FeII_Oxy, AMP-binding, DIOX_N, Terpene_synth, Terpene_synth_C---
Cluster 50Cyclopeptide31282959328865441603.59BURP---
Cluster 51Alkaloid4000703640125374118.34Epimerase, Pyridoxal_deC, p450---
Cluster 52Saccharide-Fatty_acid4624099546343953102.96ADH_N_2, ADH_zinc_N, FA_desaturase_2, Glyco_hydro_1---
Cluster 53Lignan4674513746858876113.74Dirigent, Peptidase_S10, p450---
Cluster 54Putative4817316648370665197.50Abhydrolase_3, Transferase, p450---
Cluster 55Cyclopeptide4940770249952896545.19BURP---
The following clusters are from record NC_065577.1:
Cluster 56Saccharide72345281940695.95Methyltransf_3, UDPGT_2flavonoid-3, oleananes-3--
Cluster 57Saccharide1122141116079838.66ADH_N, ADH_zinc_N, UDPGT_2hydroxycinnamate--
Cluster 58Cyclopeptide18304352508685678.25BURP---
Cluster 59Cyclopeptide67797887067514287.73BURP---
Cluster 60Saccharide-Terpene85201918651367131.18Glyco_hydro_1, Terpene_synth, Terpene_synth_C, adh_short, p450---
Cluster 61Cyclopeptide88286429523848695.21BURP---
Cluster 62Cyclopeptide17864569208220192957.45BURP---
Cluster 63Cyclopeptide29769063310831101314.05BURP---
Cluster 64Putative3384050734121947281.44Amino_oxidase, Methyltransf_7, Peptidase_S10, Transferase---
Cluster 65Putative3626028536515762255.48Chalcone, Transferase, p450---

NC_065570 - Cluster 1 - Saccharide-fatty_acid

Gene cluster description

NC_065570 - Gene Cluster 1. Type = saccharide-fatty_acid. Location: 52090117 - 52321453 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065570 - Cluster 2 - Cyclopeptide

Gene cluster description

NC_065570 - Gene Cluster 2. Type = cyclopeptide. Location: 52730742 - 53073629 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_065570 - Cluster 3 - Saccharide

Gene cluster description

NC_065570 - Gene Cluster 3. Type = saccharide. Location: 58304663 - 58394897 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065570 - Cluster 4 - Terpene-fatty_acid

Gene cluster description

NC_065570 - Gene Cluster 4. Type = terpene-fatty_acid. Location: 67303962 - 67412102 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065570 - Cluster 5 - Cyclopeptide

Gene cluster description

NC_065570 - Gene Cluster 5. Type = cyclopeptide. Location: 70120155 - 70376857 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_065570 - Cluster 6 - Fatty_acid

Gene cluster description

NC_065570 - Gene Cluster 6. Type = fatty_acid. Location: 70280586 - 70402715 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065570 - Cluster 7 - Putative

Gene cluster description

NC_065570 - Gene Cluster 7. Type = putative. Location: 70721677 - 70813441 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065570 - Cluster 8 - Saccharide-fatty_acid

Gene cluster description

NC_065570 - Gene Cluster 8. Type = saccharide-fatty_acid. Location: 73161797 - 73186519 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065571 - Cluster 9 - Alkaloid

Gene cluster description

NC_065571 - Gene Cluster 9. Type = alkaloid. Location: 979553 - 1039660 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065571 - Cluster 10 - Saccharide-polyketide

Gene cluster description

NC_065571 - Gene Cluster 10. Type = saccharide-polyketide. Location: 2216220 - 2300161 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065571 - Cluster 11 - Cyclopeptide

Gene cluster description

NC_065571 - Gene Cluster 11. Type = cyclopeptide. Location: 7191788 - 7576760 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126666875
Repeat occurs 26 times in a sequence of 670 amino acids
Location between 7382887 and 7385660
Coverage of 58.21 %
Instances:
YVYSEPSKSESYVYR | YVYRHPSKPSDGYVY | YVYSQSSKPTEEGYV | YVYSQSTKPTEGYVY | YVYGQSSKPTEGYVY
YVYGQSSKPSEDYVY | YVYSQSSKPTEGYVY | YVYGQSSKPTEGYVY | YVYGQSSKPSEGYVY | YVYSQSSKPTEDYVY
YVYGQSSKPTEGYVY | YVYGQSSKPTEGYIY | YVYSQSSKPTEGYVY | YVYGQSSKPTEGYVY | YVYSQSSKPSEGYVY
YVYSQSSKPSEGYVY | YVYSQSSKLSKPSEG | YVYSQSSKPTEGYVY | YVYSQSSEPKEGYVY | YVYRQSFKPTESYVY
YVYSQSSKPSEGYVY | YVYSQSLKSNQEPSR | YVYNQPSKSIKGYVY | YVYSQASKSNQGYVY | YVYSQSSSHGYVYSQ
YVYSQPSKKKHGETI |
pattern: YVY[RNGS][QHE][APS][LFTS][SKE][SPHLK][GSTNKIE][DSYQHKE][GDYSPVE][GSYPEV][SYTEVI][GYQIRV]
MVSMEFHLLPIFVLLCVMFVRSNASVPAEVYWHSKLPNTAMPQKLHNLLYCDLGEKFPFWDMG
AKPSQRYVNQDRGLKSYTYSQPFRTKENYVYSEPSKSESYVYRHPSKPSDGYVYSQSSKPTEEG
YV
YSQSTKPTEGYVYGQSSKPTEGYVYGQSSKPSEDYVYSQSSKPTEGYVYGQSSKPTEGYVYG
QSSKPSEGYVY
SQSSKPTEDYVYGQSSKPTEGYVYGQSSKPTEGYIYGQSSKPSEGYVYSQSSK
PTEGYVY
GQSSKPTEGYVYSQSSKPSEGYVYSQSSKPSEGYVYSQSSKLSKPSEGYVYSQSSKP
TEGYVY
SQSSEPKEGYVYRQSFKPTESYVYSQSSKPSEGYVYSQSLKSNQEPSRAYNSRPLKTT
EGYVYNQPSKSIKGYVYSQASKSNQGYVYSQSSSHGYVYSQPSKKKHGETIDHSAIFFFLHSDL
HEGKTMRFEITESTNKARIIPRQVSESIPFSAEKLPEIFMKFSTSADSPQGEMIEKTVKDCGFP
GIKGEDKLCTTSLESLIDFVVAHIGNNAQVFYNEMDEVKTTEQEYSIMGVNMIGEDPVVCHKQK
YPYAVYYCHEIKDTKVYKAPLMGADGTKVEAIVVCHFDTSNWNPGNIAFLLLNVKPGQETICHV
IKSDTLVWLPTDQVMMGDRDRSEINHLSTTA

Similar gene clusters

NC_065571 - Cluster 12 - Cyclopeptide

Gene cluster description

NC_065571 - Gene Cluster 12. Type = cyclopeptide. Location: 9068057 - 9587077 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126669360
Repeat occurs 3 times in a sequence of 369 amino acids
Location between 9161985 and 9163181
Coverage of 5.69 %
Instances:
VASKGKH | VASKGKQ | VASMPLE |
pattern: VAS[KM][PG][LK][QEH]
MSEANLGSNPDASAIVPVKRKRGRPRKYPKLDRGVYFHGPRSLNPDHGGSRVPPGFLGVSENQ
PRQVDPVNDATGVMVGQTVHGVIEAAFDAGYLLTVRVSNSETTLRGVIFKPGHYIPVSTDNDVA
PGVHMIRRNEVPFPRESYAQVHSHNSRSRERNGAIHTARVSNSVASKGKHVPSMATDVFSPANS
RGNVVPVVLQPINISNGVAGESSSVVTQPDHAVASKGKQEPDDSRPLNGSTTTNQFHSQNNNQV
TPSSIPSLAGSAQSLQEAEAKSMKLAGMPFEQLLAEVSKRNQVPAQSTETDTSSAGKLSTDDVN
DVDQALSVEPLQAVQPDLNNHSAVASMPLENYRTGRMTELLQKVLQEKIN
Repeat found in LOC126669360
Repeat occurs 3 times in a sequence of 369 amino acids
Location between 9161985 and 9163181
Coverage of 5.69 %
Instances:
VASKGKH | VASKGKQ | VASMPLE |
pattern: VAS[KM][PG][LK][QEH]
MSEANLGSNPDASAIVPVKRKRGRPRKYPKLDRGVYFHGPRSLNPDHGGSRVPPGFLGVSENQ
PRQVDPVNDATGVMVGQTVHGVIEAAFDAGYLLTVRVSNSETTLRGVIFKPGHYIPVSTDNDVA
PGVHMIRRNEVPFPRESYAQVHSHNSRSRERNGAIHTARVSNSVASKGKHVPSMATDVFSPANS
RGNVVPVVLQPINISNGVAGESSSVVTQPDHAVASKGKQEPDDSRPLNGSTTTNQFHSQNNNQV
TPSSIPSLAGSAQSLQEAEAKSMKLAGMPFEQLLAEVSKRNQVPAQSTETDTSSAGKLSTDDVN
DVDQALSVEPLQAVQPDLNNHSAVASMPLENYRTGRMTELLQKVLQEKIN
Repeat found in LOC126669360
Repeat occurs 3 times in a sequence of 369 amino acids
Location between 9161985 and 9163181
Coverage of 5.69 %
Instances:
VASKGKH | VASKGKQ | VASMPLE |
pattern: VAS[KM][PG][LK][QEH]
MSEANLGSNPDASAIVPVKRKRGRPRKYPKLDRGVYFHGPRSLNPDHGGSRVPPGFLGVSENQ
PRQVDPVNDATGVMVGQTVHGVIEAAFDAGYLLTVRVSNSETTLRGVIFKPGHYIPVSTDNDVA
PGVHMIRRNEVPFPRESYAQVHSHNSRSRERNGAIHTARVSNSVASKGKHVPSMATDVFSPANS
RGNVVPVVLQPINISNGVAGESSSVVTQPDHAVASKGKQEPDDSRPLNGSTTTNQFHSQNNNQV
TPSSIPSLAGSAQSLQEAEAKSMKLAGMPFEQLLAEVSKRNQVPAQSTETDTSSAGKLSTDDVN
DVDQALSVEPLQAVQPDLNNHSAVASMPLENYRTGRMTELLQKVLQEKIN
Repeat found in LOC126669360
Repeat occurs 3 times in a sequence of 368 amino acids
Location between 9161985 and 9163181
Coverage of 5.71 %
Instances:
VASKGKH | VASKGKQ | VASMPLE |
pattern: VAS[KM][PG][LK][QEH]
MSEANLGSNPDASAIVPVKRKRGRPRKYPKLDRGVYFHGPRSLNPDHGGSRVPPGFLGVSENQ
PRQVDPVNDATGVMVGQTVHGVIEAAFDAGYLLTVRVSNSETTLRGVIFKPGHYIPVSTDNDVA
PGVHMIRRNEVPFPRESYAQVHSHNSRSRERNGAIHTARVSNSVASKGKHVPSMATDVFSPANS
RGNVVPVVLQPINISNGVAGESSSVVTQPDHAVASKGKQEPDDSRPLNGSTTTNQFHSQNNNQV
TPSSIPSLAGSAQSLQEAEAKSMKLAGMPFEQLLAEVSKRNQVPAQSTETDTSSAGKLSTDDVN
DVDQALSVEPLQAVQPDLNNHSAVASMPLENYRTGRMTELLQVLQEKIN
Repeat found in LOC126668814
Repeat occurs 8 times in a sequence of 604 amino acids
Location between 9164049 and 9170294
Coverage of 19.87 %
Instances:
ARPEVGISPFVARPE | ARPEVGISPFVARPE | ARPEVGISSFVARPE | ARPEVGISPFVARPE | ARPEVGISPFVARPE
ARPEVGIGPFVARPS | ARPSVTAKFNPYANL | ARPLLTPHHVQYPKH |
pattern: ARP[SLE][LV][GT][PAI][KHGS][PHFS][FNV][PQV][AY][RAP][KPN][SLHE]
MYTHKNTQRLKMGLKSRREVESETCLSESLLFATMCLIGLPVDVHVRDGSVYSGIFHTASVDK
DYAIVLKEAKLAKKGKLVANVVNGSVIETLVIRSGDLVQVVAKEVLFPSDGVNGNVASDHVEAV
AVKVHCFESFDSEAKESNKYGVDKKKINDNRNSAKSKIISADGFVPRKAGKELDGRKVSHRSEI
ATKVELPKQDVVDVSKSFPDASVSGRQIVDERSQGEHDHHKQKFQLEREMNDDEVQSSSSISSL
CLSEAKASEEGQQTRKLLPNGVFCRNSPCSSISTVTSPRVEVTQESHSGALSTSAELVTPQSLE
STRTSKDFKLNPGAKIFCPSFATPISANTAAPAVSSMAYVPGNSPMVPAVAVSARPEVGISPFV
ARPE
VGISPFVARPEVGISSFVARPEVGISPFVARPEVGISPFVARPEVGIGPFVARPSVTAKF
NPYANL
TTVNGGHGPQFSQQVVAHMGNRTQQLRYAGQYHAVQAAPAYVPPNSQAVMIGRLGQVV
YMQPIPHDLVHSTASISPVSARPLLTPHHVQYPKHQGSAAGHASPICAPPPLMAGVQQPFAMPN
HIPLLQPHIPANCPIPVPGCNGFFATKFQ

Similar gene clusters

NC_065571 - Cluster 13 - Cyclopeptide

Gene cluster description

NC_065571 - Gene Cluster 13. Type = cyclopeptide. Location: 9222310 - 9694710 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_065571 - Cluster 14 - Saccharide-terpene

Gene cluster description

NC_065571 - Gene Cluster 14. Type = saccharide-terpene. Location: 10479172 - 10758654 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065571 - Cluster 15 - Alkaloid

Gene cluster description

NC_065571 - Gene Cluster 15. Type = alkaloid. Location: 11830958 - 12002352 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065571 - Cluster 16 - Cyclopeptide

Gene cluster description

NC_065571 - Gene Cluster 16. Type = cyclopeptide. Location: 15540148 - 16650673 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_065571 - Cluster 17 - Saccharide-terpene

Gene cluster description

NC_065571 - Gene Cluster 17. Type = saccharide-terpene. Location: 43899303 - 44040042 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_065571 - Cluster 18 - Terpene

Gene cluster description

NC_065571 - Gene Cluster 18. Type = terpene. Location: 47337490 - 47463827 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065571 - Cluster 19 - Saccharide

Gene cluster description

NC_065571 - Gene Cluster 19. Type = saccharide. Location: 49179910 - 49258369 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065571 - Cluster 20 - Cyclopeptide

Gene cluster description

NC_065571 - Gene Cluster 20. Type = cyclopeptide. Location: 50243949 - 50595412 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126669599
Repeat occurs 4 times in a sequence of 186 amino acids
Location between 50249452 and 50250013
Coverage of 30.11 %
Instances:
NPNPNPNPTPNPNN | NPNPNPTPNPNNRS | NPNPTPNPNNRSNR | NPNNRSNRAQPNRR |
pattern: NPN[PN][RNT][PS][NT][RP][ANT][PNQ][RPN][PNS][RN][RNS]
MGVAILNPQDCLQNPLSSRRGLISLPPEMKLYRNPNPNPNPTPNPNNRSNRAQPNRRKRSPNT
SPPARAAVVAKVEAEKIPVLGQVKILKRGEQLPKLSPAKAVSQPINIEKKKDVDLDLGSTHRLG
PDPDRVQTQICFRSTGFYAGSAFFTSPPPSSLPVPAFFTKKNNIDDATCDLRRMLGLSL
Repeat found in LOC126667990
Repeat occurs 8 times in a sequence of 517 amino acids
Location between 50417008 and 50422352
Coverage of 23.21 %
Instances:
EQEKSLLNDFQPKPN | EQEKALLNDFQPKPN | EQEKALLNDFQPKPN | EQEKALLNDFQPKPN | EQEKPLLNDFQPKPN
EQEKALLNDFQPKPN | EQEKALLNDFQPKPN | EQEKSFRSDGMDGNS |
pattern: EQEK[PAS][LF][RL][NS]D[FG][MQ][PD][KG][PN][NS]
MSFFASMKFSLLSVFALLCLMLVVSNASMPNDNYLRSKVPNTPMPRKFGNRQQPAEIRGQDAF
WDMGFEERKIEYGSDGEQEKSLLNDFQPKPNLFPYFGYHNSRNDEQEKALLNDFQPKPNLFPYI
GYHNGRNDEQEKALLNDFQPKPNLFPYFGYHNGRNDEQEKALLNDFQPKPNLFPYFGYHNGRND
EQEKPLLNDFQPKPN
LFPYFGYHNGRNDEQEKALLNDFQPKPNLFPYFGYHNGRNDEQEKALLN
DFQPKPN
IFPYFGYRNGRNDEQEKSFRSDGMDGNSPFIVFFRRNDLKVNNKLPISLPPLNNDHS
AKDPPLLPREQAESYPFSSHKFPHILRLFSISPHSPQAKAMEEAITLCEGKSLTGETKLCATSY
EAMLDFVGSTFGLDTKRFKAITSSHLTRSKNKVQNYTVVQEPQEILAPKLVGCHPMSYPYTIFY
CHSATNTKGFKVSLVGDNGDMIEAIAVCHFDTSEWSRDHPAFRELGAEPGSTDVCHFFPADHMV
WFPSDA
Repeat found in LOC126667990
Repeat occurs 8 times in a sequence of 518 amino acids
Location between 50417008 and 50422352
Coverage of 23.17 %
Instances:
EQEKSLLNDFQPKPN | EQEKALLNDFQPKPN | EQEKALLNDFQPKPN | EQEKALLNDFQPKPN | EQEKPLLNDFQPKPN
EQEKALLNDFQPKPN | EQEKALLNDFQPKPN | EQEKSFRSDGMDGNS |
pattern: EQEK[PAS][LF][RL][NS]D[FG][MQ][PD][KG][PN][NS]
MSFFASMKFSLLSVFALLCLMLVVSNASMPNDNYLRSKVPNTPMPRKFGNRQQPAVEIRGQDA
FWDMGFEERKIEYGSDGEQEKSLLNDFQPKPNLFPYFGYHNSRNDEQEKALLNDFQPKPNLFPY
IGYHNGRNDEQEKALLNDFQPKPNLFPYFGYHNGRNDEQEKALLNDFQPKPNLFPYFGYHNGRN
DEQEKPLLNDFQPKPNLFPYFGYHNGRNDEQEKALLNDFQPKPNLFPYFGYHNGRNDEQEKALL
NDFQPKPN
IFPYFGYRNGRNDEQEKSFRSDGMDGNSPFIVFFRRNDLKVNNKLPISLPPLNNDH
SAKDPPLLPREQAESYPFSSHKFPHILRLFSISPHSPQAKAMEEAITLCEGKSLTGETKLCATS
YEAMLDFVGSTFGLDTKRFKAITSSHLTRSKNKVQNYTVVQEPQEILAPKLVGCHPMSYPYTIF
YCHSATNTKGFKVSLVGDNGDMIEAIAVCHFDTSEWSRDHPAFRELGAEPGSTDVCHFFPADHM
VWFPSDA
Repeat found in LOC126668664
Repeat occurs 4 times in a sequence of 148 amino acids
Location between 50591459 and 50591906
Coverage of 29.73 %
Instances:
PTKPTKPSNPT | PTKPSNPTKPT | PTKPTKPSNPT | PTKPSNPTPMP |
pattern: PTKP[ST][KN]P[TS][KPN][MP][PT]
MASTNNLSASIFILSVLILSTISNACVPCEPTHPTKPTKPSNPTKPTKPSNPTPMPPTPSKQT
CPIDALKLGVCADLLGVVNVVIGGSPSGSKCCALIQGLADLDVGLCLCTAIKANVLGINLNVPV
SLGLLVNACDKSLPSGFKCPS

Similar gene clusters

Similar known gene clusters

NC_065572 - Cluster 21 - Alkaloid-fatty_acid

Gene cluster description

NC_065572 - Gene Cluster 21. Type = alkaloid-fatty_acid. Location: 56679285 - 57201527 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065572 - Cluster 22 - Cyclopeptide

Gene cluster description

NC_065572 - Gene Cluster 22. Type = cyclopeptide. Location: 67377211 - 67901756 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126673833
Repeat occurs 3 times in a sequence of 190 amino acids
Location between 67445104 and 67448415
Coverage of 9.47 %
Instances:
KACDKT | KACFRC | KACFRC |
pattern: KAC[FD][RK][CT]
MATFGGTTQKCKACDKTVYLVDQLTADNKVYHKACFRCHHCKGTLKLSNYSSFEGVLYCKPHF
DQLFKMTGSLDKSFEGAPKTVRDRSADQANSNSRVSSMFAGTQDKCVACKRTVYPIEKVAVDGT
SYHKACFRCTHGGCVISPSNYVAHEHRLYCRHHHNQLFKQKGNFSQLDKHEDVTTSAEIPSTE
Repeat found in LOC126672896
Repeat occurs 7 times in a sequence of 1150 amino acids
Location between 67453969 and 67458514
Coverage of 9.13 %
Instances:
ADVDDCADVDDCADV | ADVDDCADVDDCADV | ADVDDCADVDDCADV | ADVDDCADVDDCADV | ADVDDCADVDDCADV
ADVDDCADVDDGDDF | ADVDDGDDFSDEDEE |
pattern: ADVDD[GC][AD]D[FV][SD]D[GCE][AD][ED][FEV]
MTTNALPALKIYWDGTILSTPGGVDYVSGKSEMVELTRVVNFAQLQVVIRRVIGLNDADEIVK
IYLRVPRFDEHGRFQKYDGFPMETDVHMEAMWRNVSRTPQMRVVEFYIVYNPLSQVRADVDDCA
DVDDCADV
DDCADVDDCADVDDCADVDDCADVDDGDDFSDEDEEEEHFYGAVADDNEDDDDDDD
DIGGENGDGTHGENVGGGSEQNTDHFESRLPHEYTDQNIDDMRVNMNLWPKENAIWEAGKEFEL
GMVFSSRYAVQTCAASYHIAANKEYKSSETTGKTIVLVCVKNDTCNWRLRASLLKGESDWMITR
YNGPHQCSGQSMNQDHRNLRARQIADHIDTQLHLQRDIRIKTLQEGIFQKLRVRPGYKKTWYAK
EKAIANRFGQWDDSFKEICNFMTNVTVANPRTVWHATGTPIENNPQFRTFKHMFWTYEPVVKGF
QYCKPVVYIDGTHTYGKYEMTLLIASAIEGNNHIMPIAFAIVKSETAASWRYFMWMLKRYVLGE
RKVCIISDRGSGIMSAMEGPEWGGGGGGDTHKWCIRHLVSNFHNAFKKKYLKKLAEKAGRAYQE
HKRDRYMSLIEADSPEGYAYLDRLDVTKWSMSGDTSGMRHGVMTTNYAESVNAMLKNIRGLPIT
AMLEAIFNKVNSVFIKHVNDYKTWLASGFMWTPVCAHRMETWENKSRTHTASQFNVAQKVFNVL
TQCDNVRQKGGNTQQVRLLDGTCTCGKFQQWKIPCSHAIAACNQYGENYRDYISWYYKCEYGIL
AWSSVSFEPLWNRKRWYFFKLYADFVERVRRDAPVDTELRRFAASTQRGTREDRRDVTSPPIEP
SIPPHRLPVIPDAPIDPTTLRHRRRQRRHPPAPPQRPTDPMPPPVVFHPFRGHYYYAGSSSAPP
PFSSGPPSSSGQFYGDSAHHYFQGSCSYPVPPGPSSPFVPPPPAYVPPTVPPFQVQWDQPTQGT
QDFPASQGLPQTPGTTDFLAYGSSWLGLDSMEAMMFRQQGEFVTPPPATTSTAIPQDQQGDDGA
DDEEGDAGEGDGDGRPGRRYLTISTGRRANRNRNNLRSNLPVTSRYDDRTPSKPRNTYAVCHVG
SANRVSVTRSAFAYVAPPARSHYINTRGEQEPNQTEKPDQIKPENYGNRTELTEVFRLKKIGV

Similar gene clusters

NC_065572 - Cluster 23 - Transporter_associated

Gene cluster description

NC_065572 - Gene Cluster 23. Type = transporter_associated. Location: 68918310 - 69103722 nt. Click on genes for more information.
Show pHMM detection rules used
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065572 - Cluster 24 - Putative

Gene cluster description

NC_065572 - Gene Cluster 24. Type = putative. Location: 69440849 - 69517459 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065572 - Cluster 25 - Terpene

Gene cluster description

NC_065572 - Gene Cluster 25. Type = terpene. Location: 72707016 - 72741688 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_065572 - Cluster 26 - Cyclopeptide

Gene cluster description

NC_065572 - Gene Cluster 26. Type = cyclopeptide. Location: 74105832 - 74355763 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126675296
Repeat occurs 6 times in a sequence of 1783 amino acids
Location between 74205661 and 74213799
Coverage of 2.02 %
Instances:
RGRPKK | RGRLAK | RGRRRK | RGRPPR | RGRKKG
RGRPKK |
pattern: RGR[RLPK][RKAP][RKG]
METEFIGKLVKKEFKERGVGVVSGIVQSYDVSSGYFEILYEDGDSERLDFSGVASLLRQSEEP
ADHERRRGRPKKRRRVDLSKVSKGGFDSNCGSLLGGETLGNGNFDLNDGLVEGYSGNLRNDVDN
GVKEVLDLNAGFNLNLNEGFDLNDEGVRCVNGSDVSGDMKMKNRECIDLNLEVNGDIDESLSKD
LGANLKEREIGFDLNLQIDEEEANGERGRLAKEISSSEVLEGIANGACFIANGMLQEVHVTDDL
SVQLAKGILKKGTVSIEGSGGVDSVNVQDANTFKEDSPEVINEKQSDVGSVHQDQSGDGPKRRG
RRRK
TLTHVDNLNSTPETVVVREAKIINGNQEDVRSLDKEGSGNLSKRRRGRPPRTLNSTTDTA
INAETHFIKEDCNVVTDEKQGDIESVYTVIGGKNRRLSDHVNATPERTVLRRSARRGLANNDVL
PVPFSVANEFSVSPAVSALTEEIPLKLRHEWDKEPLAFPAKVQLPPSSRKLDLAGIPVVDFFSV
YACLRSFSTLLFLSPFELEEFVAALRCSSSSSLFDSIHVCILQALKKHMDYLSNEGSESASNCL
RSLNWGFLDVITWPVFLAEYLLFLGSDFGTGIDPSLLMFLKIDYYKQPVSLKIEILRYLCDTII
EADAFRSELARRSSGADADVDFDRNINSGALRKRRSGMDASIVSALTEDAVDDSSDWNSDECCL
CKMDGSLICCDGCPAAYHSKCVGVVNDSLPEGDWFCPECAIERQKPWMKTRKSLRGAELLGADP
CGRLYFSSCGYLLVSDSCEPESSFSFYHKDDFNGVIQVLRSSEVIYGSILEVLLKHWDIPVNIN
GASNLDTLNHDMCVLPSVVASSEAPGDIHIVGQTCLSSEGSAETIQTSLENHNSKTELPSCSNK
STEPLGDDCLTPKKEIVIGSSTDGCTLTALNGTNADASQGKVGTGYLNYYSFGHTASLVAEDFM
RKPSDKKTEVIIMSAEEIISAQMKIMSKSCAKFHWPNIPSLSVNIQKENCGWCFSCRASSDDSG
CLFNMSLAPVGKGSLAEVVGLQSKRNKIGHLIDIVGHILVIEDRLQGLLLGPWLNPHYSKLWRK
SILKASDIVSVKHLLLTLESNISRLALSAEWLKHVDVSTTVGSASHIIIASSRASSKNGIGKKK
ARYPESDSNPSSNSSSGLSMLWWRGGRVSRRLFSWKAVPCSLVSKAARQAGSKKILGVVYPENS
DFAKRSKYIAWRAAVELSTTVEQLALQVREFDSNIKWDEILNTISLPMADKDCRKSIRLFKKAI
IRRKAVEAEVTKYLLDFGKRRCIPEIVIKKGSAVEESASERKKYWLNESYVPLHLLKSFEEKRI
ARRSSKMSSGKLSDADLLTKKPLKERGFSYLLAKAERPEYHQCRHCNKDVPIREAVCCQYCQGS
FHKRHVRKSMGSVSAQCKYTCHRCVDGNYMKVDSETAKNDAKKAKKKNRSSKNQYQKSKKVSEG
TSSVHPKNSKKTLRNSRSLRSEKNKKVTIVVPLRRSPRKAKLNALQNKKARGRKKGKPGRGRPK
K
VTGQQPTKATSWRKKRTQAYHSYWLNGLLLTRKPEDERVMHFRKKRFVAPSQSVIYDQPTCRL
CSEAGYTSTVNYISCEMCGGWFHGDALGLDAENINKLIGFRCHMCRNNTPPVCPFASLTKDHES
SMDVVENSVANEFCAEGTGVKYQTEVNLLQESHVNEDNQGSLHADNSPQGLDDKSFVPESKLEI
GNEIDQMEPTSCSIGVDVMESESIELHPQLSMESAELLDEGGNQVPSLMGSPQLEK

Similar gene clusters

NC_065573 - Cluster 27 - Saccharide

Gene cluster description

NC_065573 - Gene Cluster 27. Type = saccharide. Location: 24992953 - 25073607 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065573 - Cluster 28 - Alkaloid

Gene cluster description

NC_065573 - Gene Cluster 28. Type = alkaloid. Location: 28744281 - 28804595 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065573 - Cluster 29 - Lignan

Gene cluster description

NC_065573 - Gene Cluster 29. Type = lignan. Location: 30500575 - 30555365 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065573 - Cluster 30 - Putative

Gene cluster description

NC_065573 - Gene Cluster 30. Type = putative. Location: 30658407 - 30765798 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065573 - Cluster 31 - Terpene

Gene cluster description

NC_065573 - Gene Cluster 31. Type = terpene. Location: 31287148 - 31313680 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065573 - Cluster 32 - Saccharide

Gene cluster description

NC_065573 - Gene Cluster 32. Type = saccharide. Location: 36288981 - 36376326 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065573 - Cluster 33 - Alkaloid

Gene cluster description

NC_065573 - Gene Cluster 33. Type = alkaloid. Location: 37732371 - 37804408 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065574 - Cluster 34 - Cyclopeptide

Gene cluster description

NC_065574 - Gene Cluster 34. Type = cyclopeptide. Location: 5458438 - 5751141 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126680865
Repeat occurs 6 times in a sequence of 474 amino acids
Location between 5603753 and 5605825
Coverage of 11.39 %
Instances:
YQNSGNSEQ | YQNGADGEK | YQNGADGEK | YQNGADGEK | YQNGADGEK
YQNRGNDII |
pattern: YQN[RGS][AG][ND][DGS][IE][KQI]
MKVIFIVLALLSLHWLAFKLDAAKDLRETVPNGGEGITSQNDFQSNSKTPVLSEYQNSGNSEQ
EKSPLNDSQPEPNPLDIWKSQNGVDGEKEKIPQNYNFQPEPNQLFFWRYQNGADGEKVKILQNH
NFQPEPNQLFFWRYQNGADGEKVKILQNHNFQPEPNQLFFWKYQNGADGEKVKILQNHNFQPEP
NQLFFWRYQNGADGEKVKILQNHNFQPESNQLFFWKYQNRGNDIIRLPSMKKDGDDNPPFITFF
TKNDLKAGNKLPINLRPFDRSSKDPPLLSKKQAESYPFSYKQFEHLLHLFSVEPKSPQAQAMDE
ALKYCEAEPLRIETKFCATSFEAMLDLLTRTFGLDSKNLKAISTMHLTKPKNKVQNYTITEEPK
ELATPKLIACHIMPYPYIVFYCHSIEHTKGFRVSLVGENGDAIETIAVCHSDTSEWSPEHVAFR
EVGGKPGSTEVCHFSPSGHLLWIPLQA

Similar gene clusters

NC_065574 - Cluster 35 - Terpene

Gene cluster description

NC_065574 - Gene Cluster 35. Type = terpene. Location: 7425348 - 7552412 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_065574 - Cluster 36 - Putative

Gene cluster description

NC_065574 - Gene Cluster 36. Type = putative. Location: 9114021 - 9261441 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065574 - Cluster 37 - Phenolamide

Gene cluster description

NC_065574 - Gene Cluster 37. Type = phenolamide. Location: 10582128 - 10714408 nt. Click on genes for more information.
Show pHMM detection rules used
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065574 - Cluster 38 - Saccharide

Gene cluster description

NC_065574 - Gene Cluster 38. Type = saccharide. Location: 12097609 - 12194545 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065574 - Cluster 39 - Saccharide

Gene cluster description

NC_065574 - Gene Cluster 39. Type = saccharide. Location: 16192099 - 16374525 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065574 - Cluster 40 - Terpene

Gene cluster description

NC_065574 - Gene Cluster 40. Type = terpene. Location: 42495912 - 42910700 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065574 - Cluster 41 - Saccharide-terpene

Gene cluster description

NC_065574 - Gene Cluster 41. Type = saccharide-terpene. Location: 43032297 - 43211187 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065574 - Cluster 42 - Cyclopeptide

Gene cluster description

NC_065574 - Gene Cluster 42. Type = cyclopeptide. Location: 46934471 - 47954665 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126682891
Repeat occurs 3 times in a sequence of 220 amino acids
Location between 47343510 and 47344240
Coverage of 8.18 %
Instances:
DGDAEE | DGDAED | DGDQLQ |
pattern: DGD[AQ][LE][QDE]
MASTVVLDHAAAEKREEGYLILPDSPDRKKLKMDGDAEEREEDSPIIPDSRNPETGGDTDDEI
KMDGDAEDYTSYVIDDDEITMDDLLPMYRTADNLQALKKYFDQIKESGGFDFDPYHGSPVIGGI
SPLDLGQDTIHVRGVKKALDFAIRRENAKRKDGDQLQLVRLIKSNYICGGTFYITFEAKSVDIQ
VYQAKVAYCVYGAIDEQTQAEVLLFRRKP
Repeat found in LOC126680041
Repeat occurs 3 times in a sequence of 127 amino acids
Location between 47352365 and 47353193
Coverage of 25.98 %
Instances:
DFEPRPNLSVY | DFEPRPNLSVY | DFEPRPNLSVY |
pattern: DFEPRPNLSVY
The following known motifs were found:
FEPR was found 3 times in this sequence
MKSSTFITFFLFFVIANTTINARKDVGVNWSSNVTEDDQIAETSANYEQKQVIMSEDKTSFSD
D
FEPRPNLSVYPNLSVYHDDASDGKDGKSFVKDFEPRPNLSVYPNLSVYDDDVGLKEEKPFVND
FEPR
PNLSVYSD

Similar gene clusters

NC_065575 - Cluster 43 - Saccharide-terpene

Gene cluster description

NC_065575 - Gene Cluster 43. Type = saccharide-terpene. Location: 352926 - 405572 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065575 - Cluster 44 - Cyclopeptide

Gene cluster description

NC_065575 - Gene Cluster 44. Type = cyclopeptide. Location: 2353151 - 2991554 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126653790
Repeat occurs 3 times in a sequence of 507 amino acids
Location between 2389602 and 2393397
Coverage of 4.73 %
Instances:
SSSSEPFF | SSSEPFFG | SSSDEPGF |
pattern: SSS[EDS][PE][PF][FG][FG]
MTNGGDDKRSKCRRDSHTAEEVITSPLKKGPWTSTEDSILADYVSQHGEGNWNAVQKHSGLSR
CGKSCRLRWANHLRPDLKKGAFTTEEERRIIELHAKMGNKWARMAAELPGRTDNEIKNYWNTRI
KRLQRAGLPIYPPEVCMQVLNGSHESYSMGTPQTTDTHNSDLIQTDHFEIPKVEFENLELNRGH
LSYSPHMLEISGSNMPKQGAHPSDCNNYFQHLHPAKRLRESDHIFPGLDGSIGSLPAFIQSVDC
SSEKITDSFGYSSLYDSHPFNYYQQPLAYLSGSHALLNDNSSSSEPFFGAVKLELPSLQYSDTQ
QDSWGRPTSPLPSLESVDTLIQSPSTEQSQSDCLSTRCDGTLEAVIYESQSLKNSKKCSKHQTS
VTSVAPRDVEDCPSIPRGTEWEVHGDSNSPVGNSAASLFNACTPVSRSSSDEPGFDIKPKIANS
GLSTFMEDKEAPCKFDSTRPDVLLGTFWCGFSSGHVKNENFRNEDVQVPFDAESFDGDWK
Repeat found in LOC126686626
Repeat occurs 3 times in a sequence of 249 amino acids
Location between 2505403 and 2506153
Coverage of 9.64 %
Instances:
EDKTLVEI | EDKTLVEI | EDKSVVEL |
pattern: EDK[ST][LV]VE[LI]
MSNEDKTLVEISEENNRVDLANEDKTLVEILEENNPIDLAKYISYVSAPQAGGIATFSGTTRD
TFEDKSVVELRYEAYVPMAIRELKSICASARTSWDIHSIAVAHRLGSVPVGETSVFVAVSAVHR
GDALDACKFIIDDLKASVPIWKKEVYANGEVWKENSEFMERKPEIGKSASYGVHGKKSCCGTKV
KVNEESPDISNQDSGIRVKVSEAGRKISSTHDGIRMSGGGRKTSAQNGGVTIKVSEVS
Repeat found in LOC130014734
Repeat occurs 11 times in a sequence of 502 amino acids
Location between 2642780 and 2645474
Coverage of 32.87 %
Instances:
DKREKYGKRYDAAFA | DKREKYGKRYDAAFA | DKREKYGKRYDAAFA | DKREKYGKRYDAAFA | DKREKYGKRYDAAFA
DKREKYGKRYDAAFA | DKREKYGKRYDAAFA | DKRKKYGKRYDAAFA | DKREKYGKRYDAAFA | DKREKYGKRYDAAFA
DKREKYGKRYDAEFK |
pattern: DKR[KE]KYGKRYDA[AE]F[KA]
MEFLLLPLFAVFCMVLVGINASVPSEDYWHSKLPNTPIPAELQNLIQPNAEIGGKFFFWDMGF
NNDHSLLQSADDKREKYGKRYDAAFAEGTNDKREKYGKRYDAAFAEGTNDKREKYGKRYDAAFA
KETNDKREKYGKRYDAAFAEGTNDKREKYGKRYDAAFAEGTNDKREKYGKRYDAAFAEGTNDKR
EKYGKRYDAAFA
EGTNDKRKKYGKRYDAAFAEGINDKREKYGKRYDAAFAEGTNDKREKYGKRY
DAAFA
EGTNDKREKYGKRYDAEFKKHSLPNSTVFFLQKDLQIGQKMKLHITKSSNKAKFLPREA
AESMPFSSKNLQKILQKFSIKPDSKQAKMIKQTIEDCESPGIKGEERYCPTSLESLLDFALPVV
GNKTKILYNEIDRPTRIQEYTIMGVELAGENQVVCHKQKYPYAVYYCHSVSATKVYQAPLVGAD
GIKAKAVAICHSDTSHWNPQHLAFLMLNIKPGEATVCHFIRSDTIVWTSNKAISN
Repeat found in LOC126686681
Repeat occurs 13 times in a sequence of 592 amino acids
Location between 2675304 and 2680230
Coverage of 32.94 %
Instances:
GSKFFFWDMGFNCGQ | GSKKKQRYKIGYDDA | GSKKRQRYKIGYDDA | GSKKKQRYKIGYENA | GSKKKQRYKIGYDDA
GSKKKQRYKIGYDGG | GSKKKQRYKIEYDDA | GSKKKQRYKIGYNDA | GSKKKQKYKIGYDDA | GSKKKQKYKIGYDDA
GSKKKQRYKIGYDDA | GSKKKQRYKIGYDDV | GSKKKQRYKIGYDAA |
pattern: GSK[KF][RKF][FQ][RWK][YD][MK][GI][FGE][NY][ENCD][ANGD][AQGV]
MTSMEFLLPLFAVFCLILVGSNASVPSEDYWHSKLPNTPIPDELQKLIQPTETGSKFFFWDMG
FNCGQ
SLLQAMDDKKSKDGDATFVEVDGSKKKQRYKIGYDDAGFEEADGSKKRQRYKIGYDDAE
FEEADGSKKKQRYKIGYENAGFEEADGSKKKQRYKIGYDDAEFEEADGSKKKQRYKIGYDGGFE
EADGSKKKQRYKIEYDDADFEEADGSKKKQRYKIGYNDADFEEADGSKKKQKYKIGYDDADFEE
ADGSKKKQKYKIGYDDADFEEADGSKKKQRYKIGYDDADFEEADGSKKKQRYKIGYDDVEFEEA
DGSKKKQRYKIGYDAALKEGDSAEKNKKYNIAYNAAFEEGGDARSKRRYKIGLNKHALPNSTVF
FLQKDLQAGQKMKLHITKSSNKAKFLPRQDSEAIPFSSKNLQAILQKFSIKPDSKQAKIIKQTI
IDCESQGIKGEEKNCPTSLESLIDFSIPVVGKKLQILYNEVARPTRIQEYTIMGVEMIGENQVV
CHKQKYPYAVYYCHSVSATKVYQAPLVGADGIKAKAVAICHSDTSDWNPKHLAFLMLNVKPGEG
TVCHFISSDTVVWTSKK
Repeat found in LOC126686681
Repeat occurs 13 times in a sequence of 593 amino acids
Location between 2675304 and 2680230
Coverage of 32.88 %
Instances:
GSKFFFWDMGFNCGQ | GSKKKQRYKIGYDDA | GSKKRQRYKIGYDDA | GSKKKQRYKIGYENA | GSKKKQRYKIGYDDA
GSKKKQRYKIGYDGG | GSKKKQRYKIEYDDA | GSKKKQRYKIGYNDA | GSKKKQKYKIGYDDA | GSKKKQKYKIGYDDA
GSKKKQRYKIGYDDA | GSKKKQRYKIGYDDV | GSKKKQRYKIGYDAA |
pattern: GSK[KF][RKF][FQ][RWK][YD][MK][GI][FGE][NY][ENCD][ANGD][AQGV]
MTSMEFLLPLFAVFCLILVGSNASVPSEDYWHSKLPNTPIPDELQKLIQPTAETGSKFFFWDM
GFNCGQ
SLLQAMDDKKSKDGDATFVEVDGSKKKQRYKIGYDDAGFEEADGSKKRQRYKIGYDDA
EFEEADGSKKKQRYKIGYENAGFEEADGSKKKQRYKIGYDDAEFEEADGSKKKQRYKIGYDGGF
EEADGSKKKQRYKIEYDDADFEEADGSKKKQRYKIGYNDADFEEADGSKKKQKYKIGYDDADFE
EADGSKKKQKYKIGYDDADFEEADGSKKKQRYKIGYDDADFEEADGSKKKQRYKIGYDDVEFEE
ADGSKKKQRYKIGYDAALKEGDSAEKNKKYNIAYNAAFEEGGDARSKRRYKIGLNKHALPNSTV
FFLQKDLQAGQKMKLHITKSSNKAKFLPRQDSEAIPFSSKNLQAILQKFSIKPDSKQAKIIKQT
IIDCESQGIKGEEKNCPTSLESLIDFSIPVVGKKLQILYNEVARPTRIQEYTIMGVEMIGENQV
VCHKQKYPYAVYYCHSVSATKVYQAPLVGADGIKAKAVAICHSDTSDWNPKHLAFLMLNVKPGE
GTVCHFISSDTVVWTSKK
Repeat found in LOC126687263
Repeat occurs 15 times in a sequence of 579 amino acids
Location between 2692185 and 2701924
Coverage of 38.86 %
Instances:
KYGKRYDASFSESVD | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN
KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN
KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYKIGFKKHAL

pattern: KYGKRY[KD][AI][AGS]F[KES][KE][HGS][ATV][LND]
MNLMASMEFLILPLFAVFCMILGGSNASVPSEEYWHSKLPNTPIPAELQKLIQPAELGGKFSF
WDMGFNNDQSLLQAMDDKRSKYGKRYDASFSESVDDKRSKYGKRYDAAFEEGTNDKREKYGKRY
DAAFEEGTN
DKREKYGKRYDAAFEEGTNDKREKYGKRYDAAFEEGTNDKREKYGKRYDAAFEEG
TN
DKREKYGKRYDAAFEEGTNDKREKYGKRYDAAFEEGTNDKREKYGKRYDAAFEEGTNDKREK
YGKRYDAAFEEGTN
DKREKYGKRYDAAFEEGTNDKREKYGKRYDAAFEEGTNDKREKYGKRYDA
AFEEGTN
DKREKYGKRYDAAFEEGTNDRREKYGKRYKIGFKKHALPNSTVFFLQKDLQVGQKMK
LHITKSSNKAKFLPRQDSESMPFSSKILPQILQKFSVKPDSMQAKIIKQTIDDCESSAIKGEER
YCPKSLESLLDFAIPVVGNKTQVLYNEVERPSRIQEYTIMGVKMVGENQVVCHKQKYPYAVYYC
HSVSATKVYQAPLVGADGLKAKAVAICHSDTSNWNPQHLAFLMLNVKPGEATICHFIRSDTIVW
TSNK
Repeat found in LOC126687263
Repeat occurs 15 times in a sequence of 580 amino acids
Location between 2692185 and 2701924
Coverage of 38.79 %
Instances:
KYGKRYDASFSESVD | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN
KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN
KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYDAAFEEGTN | KYGKRYKIGFKKHAL

pattern: KYGKRY[KD][AI][AGS]F[KES][KE][HGS][ATV][LND]
MNLMASMEFLILPLFAVFCMILGGSNASVPSEEYWHSKLPNTPIPAELQKLIQPAAELGGKFS
FWDMGFNNDQSLLQAMDDKRSKYGKRYDASFSESVDDKRSKYGKRYDAAFEEGTNDKREKYGKR
YDAAFEEGTN
DKREKYGKRYDAAFEEGTNDKREKYGKRYDAAFEEGTNDKREKYGKRYDAAFEE
GTN
DKREKYGKRYDAAFEEGTNDKREKYGKRYDAAFEEGTNDKREKYGKRYDAAFEEGTNDKRE
KYGKRYDAAFEEGTN
DKREKYGKRYDAAFEEGTNDKREKYGKRYDAAFEEGTNDKREKYGKRYD
AAFEEGTN
DKREKYGKRYDAAFEEGTNDRREKYGKRYKIGFKKHALPNSTVFFLQKDLQVGQKM
KLHITKSSNKAKFLPRQDSESMPFSSKILPQILQKFSVKPDSMQAKIIKQTIDDCESSAIKGEE
RYCPKSLESLLDFAIPVVGNKTQVLYNEVERPSRIQEYTIMGVKMVGENQVVCHKQKYPYAVYY
CHSVSATKVYQAPLVGADGLKAKAVAICHSDTSNWNPQHLAFLMLNVKPGEATICHFIRSDTIV
WTSNK

Similar gene clusters

NC_065575 - Cluster 45 - Cyclopeptide

Gene cluster description

NC_065575 - Gene Cluster 45. Type = cyclopeptide. Location: 3517580 - 3869861 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126653674
Repeat occurs 4 times in a sequence of 695 amino acids
Location between 3583614 and 3585702
Coverage of 3.45 %
Instances:
EALELF | EALELF | EALNVF | EALKLF |
pattern: EAL[KNE][LV]F
MDTAKLSTILLKCITSNSLLRGKLCHQKIITLGLQNNVALCKNLIKFYFSFQTYDYAKLVFKT
IDNPLDISLWNGLMAAYTKNYMYIEALELFQSLLQFPHLKPDSFTYPSLLKACAGLGKVSYGRM
VHAHVVKSGFVSDTVVVSSLVRLYGKCDLFVCAVKLFDEMPERDVVSWNTVISCCYQDGRAGEA
LELF
GRMREFGFEPNSVTVTVAISCCARVLDLERGREIHRDVVDNGLVFDGFVSSALVDMYGKC
GCLDVAKDIFERMTKKTPVAWNSLIAGYSLVGDSKECIQLLKRMIMEGTQPTLTTLTSILLVCS
RVSHFRHGRSIHGYLIRHKVDFDIFVGSGLIDLYFKCGRIPSAENIFNMLLPDANAVSWNIMIS
GYVSVGDYFKTIGLFDEMKGAKIKPDAVTFSTVLSACSQLAALVKGKEIHNCVEEQGLETNEIV
MGALLDMYAKCGAVDEALNVFNKLPEKDLISWTSMITAYGSHGQAFEALKLFEDMQQSDVKPDA
VTFIAVLSACGHAGLVDEGYGYFNQMTTDYGIKPGIEHCSCLIDLLGRAGRLHEAYNILQNSPE
IIDDVGLLSTLLSACLLHKDVELGEEIAELLIRKDPDDSATYITLSNLYASVKNWEKMGMLRLK
MKELGLKKNPGCSWIEIDKRIQPFFVKDWSHPEPDMVHDCLVILYNHMEKDELPLP
Repeat found in LOC126653674
Repeat occurs 4 times in a sequence of 695 amino acids
Location between 3583614 and 3585702
Coverage of 3.45 %
Instances:
EALELF | EALELF | EALNVF | EALKLF |
pattern: EAL[KNE][LV]F
MDTAKLSTILLKCITSNSLLRGKLCHQKIITLGLQNNVALCKNLIKFYFSFQTYDYAKLVFKT
IDNPLDISLWNGLMAAYTKNYMYIEALELFQSLLQFPHLKPDSFTYPSLLKACAGLGKVSYGRM
VHAHVVKSGFVSDTVVVSSLVRLYGKCDLFVCAVKLFDEMPERDVVSWNTVISCCYQDGRAGEA
LELF
GRMREFGFEPNSVTVTVAISCCARVLDLERGREIHRDVVDNGLVFDGFVSSALVDMYGKC
GCLDVAKDIFERMTKKTPVAWNSLIAGYSLVGDSKECIQLLKRMIMEGTQPTLTTLTSILLVCS
RVSHFRHGRSIHGYLIRHKVDFDIFVGSGLIDLYFKCGRIPSAENIFNMLLPDANAVSWNIMIS
GYVSVGDYFKTIGLFDEMKGAKIKPDAVTFSTVLSACSQLAALVKGKEIHNCVEEQGLETNEIV
MGALLDMYAKCGAVDEALNVFNKLPEKDLISWTSMITAYGSHGQAFEALKLFEDMQQSDVKPDA
VTFIAVLSACGHAGLVDEGYGYFNQMTTDYGIKPGIEHCSCLIDLLGRAGRLHEAYNILQNSPE
IIDDVGLLSTLLSACLLHKDVELGEEIAELLIRKDPDDSATYITLSNLYASVKNWEKMGMLRLK
MKELGLKKNPGCSWIEIDKRIQPFFVKDWSHPEPDMVHDCLVILYNHMEKDELPLP
Repeat found in LOC126687462
Repeat occurs 3 times in a sequence of 352 amino acids
Location between 3744747 and 3745912
Coverage of 5.11 %
Instances:
KEIEDH | KEIKSR | KEIQDR |
pattern: KEI[KQE][SD][RH]
MEGKTVSSELKLSELSRQNIRQSINQLHDRASSVLVLTLQWKEIEDHFNSIQRGIEQRAMELN
SVQESAEQRLKEIKSREDELEVVKESVSWRIREAEEREKEFKFIQKKEIQDRKVEMEWIEKSRN
QLDAESSAMGADVSFRVTMDGEALQLLLNDYCNDRDSIRQELLISLGFSPNPAKLVLDAVKGFY
QGGLEFGEGIVRSSCVFLLEILLQIRPEISPEVRNEAMQLSLDWMKQMRKDSEHSTEVLGCLLL
LGSYRLASAFDADELFRCLKIVSHHSQASQLLRALGLVDKLSGFIQNLVKQNKHIEAIRFIYDF
QLLNEFPLEPLLEDNISSCRNAITNNNVIAEWE

Similar gene clusters

NC_065575 - Cluster 46 - Terpene

Gene cluster description

NC_065575 - Gene Cluster 46. Type = terpene. Location: 41968358 - 42042011 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065575 - Cluster 47 - Alkaloid

Gene cluster description

NC_065575 - Gene Cluster 47. Type = alkaloid. Location: 43665389 - 43727572 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065575 - Cluster 48 - Putative

Gene cluster description

NC_065575 - Gene Cluster 48. Type = putative. Location: 45515112 - 45604188 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065576 - Cluster 49 - Terpene

Gene cluster description

NC_065576 - Gene Cluster 49. Type = terpene. Location: 13164559 - 13269223 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

No significant ClusterBlast hits found.

NC_065576 - Cluster 50 - Cyclopeptide

Gene cluster description

NC_065576 - Gene Cluster 50. Type = cyclopeptide. Location: 31282959 - 32886544 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126655831
Repeat occurs 8 times in a sequence of 524 amino acids
Location between 32083771 and 32085731
Coverage of 22.9 %
Instances:
PLFFYQSGGADEQAK | PLFFYQSGSADEQKK | PLFFYASRDVNKQEK | PLFFIQSGGADEEEK | PLFFYQSGDGHEQEK
PLFFYQSGGADEKEK | PLFFYQSRGADEQEK | PLFFYQNGGVDEQEK |
pattern: PLFF[IY][AQ][NS][RG][DGS][AGV][HND][KE][KQE][KAE]K
MRATFAVFAIFSLYSLAFKLDARKELEENTPLKKSESKPNIPSYLKYQKKSPQNDFQPISILP
IYLKYKSRDDSDKEKSQSNNFEPKRKFPINFKYQNGGAGEQEKSPFNDFQHKPNNPLFFYQSGG
ADEQAK
SPLKDFQHKPNNPLFFYQSGSADEQKKSPLNDFQDKPNNPLFFYASRDVNKQEKVILN
DFQLKLGNPGFFYQSGGADEQKKLPLNDFQYKPNDPLFFIQSGGADEEEKSPVNNFQHEPNNPL
FFYQSGDGHEQEK
ITLNDFQLKPNNPLFFYQSGGADEKEKLPLNNFQHKSNNPLFFYQSRGADE
QEK
SPLDEFQQKHNNPLFFYQNGGVDEQEKSLIKDVQPKPKFPIYFKYKNKVAPNSPQAKSMEE
ALVFCESKPISGETKFCATSFEAMIDFVSNTFGLASENFKAVTSTQLTKSKNKKQNYTITEVPK
EIQTAKQISCHPMSYPYKVFYCHTPKDTKVFRVSLVGENGDVIDAIAACHLREVGGKPTDVCHF
FPSDHMVWIPTEA
Repeat found in LOC126655831
Repeat occurs 8 times in a sequence of 607 amino acids
Location between 32083771 and 32085731
Coverage of 19.77 %
Instances:
PLFFYQSGGADEQAK | PLFFYQSGSADEQKK | PLFFYASRDVNKQEK | PLFFIQSGGADEEEK | PLFFYQSGDGHEQEK
PLFFYQSGGADEKEK | PLFFYQSRGADEQEK | PLFFYQNGGVDEQEK |
pattern: PLFF[IY][AQ][NS][RG][DGS][AGV][HND][KE][KQE][KAE]K
MRATFAVFAIFSLYSLAFKLDARKELEENTPLKKSESKPNIPSYLKYQKKSPQNDFQPISILP
IYLKYKSRDDSDKEKSQSNNFEPKRKFPINFKYQNGGAGEQEKSPFNDFQHKPNNPLFFYQSGG
ADEQAK
SPLKDFQHKPNNPLFFYQSGSADEQKKSPLNDFQDKPNNPLFFYASRDVNKQEKVILN
DFQLKLGNPGFFYQSGGADEQKKLPLNDFQYKPNDPLFFIQSGGADEEEKSPVNNFQHEPNNPL
FFYQSGDGHEQEK
ITLNDFQLKPNNPLFFYQSGGADEKEKLPLNNFQHKSNNPLFFYQSRGADE
QEK
SPLDEFQQKHNNPLFFYQNGGVDEQEKSLIKDVQPKPKFPIYFKYKNKGNNLLSMHDMNSN
SKHDSSVHGHHNLALIIPFTKNDLKVGNKLPVYLPAINHSSKDPPLLPREQAESYPFSYKQFHY
ILDLFSVAPNSPQAKSMEEALVFCESKPISGETKFCATSFEAMIDFVSNTFGLASENFKAVTST
QLTKSKNKKQNYTITEVPKEIQTAKQISCHPMSYPYKVFYCHTPKDTKVFRVSLVGENGDVIDA
IAACHLREVGGKPTDVCHFFPSDHMVWIPTEA

Similar gene clusters

NC_065576 - Cluster 51 - Alkaloid

Gene cluster description

NC_065576 - Gene Cluster 51. Type = alkaloid. Location: 40007036 - 40125374 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065576 - Cluster 52 - Saccharide-fatty_acid

Gene cluster description

NC_065576 - Gene Cluster 52. Type = saccharide-fatty_acid. Location: 46240995 - 46343953 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065576 - Cluster 53 - Lignan

Gene cluster description

NC_065576 - Gene Cluster 53. Type = lignan. Location: 46745137 - 46858876 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065576 - Cluster 54 - Putative

Gene cluster description

NC_065576 - Gene Cluster 54. Type = putative. Location: 48173166 - 48370665 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065576 - Cluster 55 - Cyclopeptide

Gene cluster description

NC_065576 - Gene Cluster 55. Type = cyclopeptide. Location: 49407702 - 49952896 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_065577 - Cluster 56 - Saccharide

Gene cluster description

NC_065577 - Gene Cluster 56. Type = saccharide. Location: 723452 - 819406 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065577 - Cluster 57 - Saccharide

Gene cluster description

NC_065577 - Gene Cluster 57. Type = saccharide. Location: 1122141 - 1160798 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065577 - Cluster 58 - Cyclopeptide

Gene cluster description

NC_065577 - Gene Cluster 58. Type = cyclopeptide. Location: 1830435 - 2508685 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126660335
Repeat occurs 5 times in a sequence of 948 amino acids
Location between 1836012 and 1839159
Coverage of 3.69 %
Instances:
AGDDIDV | AGDNIDE | AGDNIDE | AGDQIDE | AGDEEGD

pattern: AGD[QNED][EI][GD][EDV]
MVTFLLLCLSLYLYYLFVYLGNDRLEAKIVQMILFYGGRIQMNEGTLAYVGGIQEIGDPMFSL
DKLSYFNILSYLHYFEIEKGDLYLKSDRIEFTKFENDRCVLPFWKLLLDNKENKFFVYVDYSKI
LDSQGIQSSREGGCEASGSVQAAQVIDLGEDLTFDDICIDIEDHSDDDEVELQEARKNVLESRQ
DLAGLANARAGDDIDVLGAGDNIDEVAGDNIDEVAGDQIDEDSGVMDEGEGEREVPEPRTEVAG
DEEGD
SDSADLTYENESVYYSSSDVGSQFSENEDEYGDDSQREVSIRVQFDTTTEIPQFAVGMF
FSNLAEVRDAIARYAVMKGMCISYVKNDQQRVRAKCIVGCPWLIQVSPHNADHNWTIKTYKPDH
RCTRSNRVSFCDSNFLAKRYKTMVTNQHYIKLSDFRSIVRSELKQSVSLNVCRHAKTKIIAELM
GFYREEYAMLNDYAEIICHTNPGTHCFVKSTSENPEGKQEFHRFYVCFAACKKGWLQGCRKVIG
IDGCFLKGICKGQLLIAVGRDGNNQMFPIAWGVVLVENKDNWSWFMRMIQYDLELEDGEGFAVI
SDMQKGLESALKDILPKAEHRRCARHVYANWAKKWRGDERKKEFWSCAKATIGSDLKVRLKHLG
TLGKDIAKDALSYDIETWCKVYFDTSIKCDVVDNNLAETFNGWILEPRCKSIISMLEDIRIKVM
NRLWNKRDSIRGWISDISPRALQVIEKNKLLSFEWEVECNGDDGFEVAWLQDRRNKHTVDLVKR
TCTCKEWDLTGIPCKHSVTAIYAKRGNPEAYVHAYYSRDTYMKAYAYTIQPVPGKQHWLQSEKG
TIEPPPFKKLPGRPKKNRRKEPFEVKKKAKLSLHGRVMTCGICKGTGHNYRSCPQKGTNTQYKP
RQNKKKQPQASQPEATPSQQNTSRKRKRNQPSVDASTTVAARAKLRSRIRQTQ
Repeat found in LOC126662342
Repeat occurs 6 times in a sequence of 93 amino acids
Location between 1993840 and 1995376
Coverage of 45.16 %
Instances:
PPQGYPP | PPQGYPP | PPQGYPQ | PPQGYPP | PPQGGYP
PPQYAPQ |
pattern: PPQ[GY][AGY][PY][PQ]
MSYYNQQQPPVGVPPPQGYPPEGYAKDAYPPQGYPPQGYPQGYPPQGYPPQGGYPPQYAPQYA
QPPPRQNNSSGCMEGCLAALCCCCLLDACF

Similar gene clusters

NC_065577 - Cluster 59 - Cyclopeptide

Gene cluster description

NC_065577 - Gene Cluster 59. Type = cyclopeptide. Location: 6779788 - 7067514 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126660914
Repeat occurs 4 times in a sequence of 373 amino acids
Location between 6891111 and 6894173
Coverage of 6.43 %
Instances:
QQQQKP | QQQKPP | QQQQPS | QQQPSS |
pattern: QQQ[KQP][KPS][PS]
MANSKGSANVRNFMYSGKHALLPPKIPFPSVSPSYVDYVPSSVIGSKAVQRPKEGNSHHLRTS
SETLVIEEQPSWLDDLLDEPETPVRRGGHRRSSSDSFAYIDVANSYNTDYAAQDDYRYKNMIIP
SWGSQDYDYHKDARQASLYADVNPTKQRNRTLISSPNPLTRQSGLLSARESAVTQNSGCALQEA
DGIPSSLSEKQESAESSPYDSKASSERKDCSHAKSSIAETDTKRAKQQFAQRSRVRKLQYIAEL
ERNVQALQAEGSEVSAEVEFLNQQNLILNMENKALKQRLENLAQEQLIKYLEHEVLEREIGRLR
ALYQQQQKPPQQQQPSSSHRRTNSRDLESQFGNLSLKHKDANSGRDPVTGSLRT
Repeat found in LOC126660374
Repeat occurs 10 times in a sequence of 438 amino acids
Location between 6917285 and 6919895
Coverage of 18.26 %
Instances:
GPPPQNNG | GPPRYPPQ | GPPAQQNY | GPPQNYPQ | GPPQNHQP
GPPQNQPP | GPPQNQPP | GPPKNQPP | GPPQNQPP | GPPGQWER

pattern: GPP[GPQKRA][QNY][YPQNHW][PNEQ][GYPQR]
MALQLTRLRRALTSLPTLHRSLSSPIITPTSPPLPLSQNPIESYLNSPLIFSHFQTRLFRVSG
PSLSSSKKEYRVYKEGDEITEDMVLFEGCDFNHWLITVDFPKDPAPTPQEMVATYERICAEGLR
ISIEEAKTKIYACSTTTYQGFQAVMTEEESEKFKDINGVVFVLPDSYIDPQNKQYGGDLYENGV
ITHRPPPIQYKRGGGRFRDGGNRNPEPRRYPMPNQSGSSQNNQQGPPPQNNGPPRYPPQQNYGP
PAQQNY
GPPQNYPQQQNYGPPQNHQPQQNYGPPQNQPPQNYGPPQNQPPHQNYGPPKNQPPHQN
YGPPQNQPPHQSYGPPGQWERTPMNSGNHEANRGPYQTNQYTENPRGFAQGGQRDFMRENQNFS
LSQTGANGQGTTNAGYGQVHSGEGQRFSQMEQRNTQGEQANYAPAGQSEANRGRF
Repeat found in LOC126661132
Repeat occurs 3 times in a sequence of 399 amino acids
Location between 6969905 and 6971538
Coverage of 5.26 %
Instances:
SKSATIK | SKSTGRN | SKSTGRR |
pattern: SKS[AT][GT][RI][RKN]
MGDPPRSTTVDFYGILGISKSATIKEMCKAYKSLVTLWHPDKNPSNKDEAQVKFRQINEAYKA
LNEKKIQETPMKIAYEPKTPPPRDFPSRGGSSHHNKSMDESFFSRPSVQTNINKKSRISSRSPT
QLSRNASRRSTSPAQDTSRSKSTGRNMRRTTSPNPKNYLSTSPGSSPALPNKSRSKSTGRRRAS
DTEIPSSIISPTRMGATPIVYSQSTAWRIPSPVERKLECTLEELCHGCVKKIKITRDIISNGII
KKVDETLKIKVKPGWKQGTKIKFEGKGDEKPGYLPADIIFLIDEKRHPLFTRNGDDLEYGLEIQ
LVQSLTGCSISAPLLGGERMWLSFDEIIYPGYVKIIQGQGMPTKTEGKRGDLRITFFVEFPSEL
SDEQRTEAASILQDCS

Similar gene clusters

NC_065577 - Cluster 60 - Saccharide-terpene

Gene cluster description

NC_065577 - Gene Cluster 60. Type = saccharide-terpene. Location: 8520191 - 8651367 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065577 - Cluster 61 - Cyclopeptide

Gene cluster description

NC_065577 - Gene Cluster 61. Type = cyclopeptide. Location: 8828642 - 9523848 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126662183
Repeat occurs 4 times in a sequence of 269 amino acids
Location between 9011311 and 9012121
Coverage of 11.9 %
Instances:
PPPPPPIS | PPPPPISP | PPPPISPP | PPPISPPA |
pattern: PPP[PI][SPI][SPI][SPI][APS]
MGLLTKELVPSFLNKFVSKSALTTTTQPSSNHSVAFYDRLLPAFAPKKPRSVLVKKRLYIKIL
GQTIYTGNRLNQAYSKNGLKRKFVEEVKSNNHNSRKKLLANPPPPPPISPPAALEAKINAMGGY
DVRFVIEKMLYKTDVDKHEDRLSIPVRQIRDAWFLDEDEQKIIYKKGEELSVKVLEPSDDRCDM
KLSKWNKPNSLTLRTKWKEVLGRNMDIVDKETNVVVEAKYFKKGDIIQLWSFRKQSVEGLKPEH
WLALINMTRIHEEP
Repeat found in LOC126660955
Repeat occurs 3 times in a sequence of 489 amino acids
Location between 9443161 and 9445611
Coverage of 3.68 %
Instances:
AYGESG | AYGNSG | AYGQAG |
pattern: AYG[QNE][AS]G
The following known motifs were found:
VS[AI]Y was found 2 times in this sequence
MAIHSQPQPISYTSFGWSHSISTPPTNHLIKFLTELNFHVHPVILAVKSSKLGSKVAPVGLQR
HPKKDLSRFLRTDAAIQAIEQKANSDKYNRLWPKAVLEALDDAMRENRWESALKIFELLRKQHW
YEPRSRTYTKLLMMLGKCRQPEEAGFLFETMQTEGLQPTIDVYTALVSAYGESGQLDKAFLLVN
EMKSVSDCKPDVYTYSILINICTKLRHFDLIGRILDEMSYLGVECSTVTFNTIIKGYGKAKMFR
EMENTLTEMIESGNSLPDLFTFNSVIGAYGNSGRLEKMEKWYDEFQLMGIDPDIKTFNILIKSY
GKEGMYEKINSVMKFMNKRFYSPTIVTYNIIIETFGKAGNIEKMDEYFKIMKHQGMKPNAITYC
SLVSAYSKAGLIMKVDSILRHVENSDVVLDTTFFNCIIHAYGQAGDIEKMTELFVKMSERECKP
DNITFATMIQAYIAQGMTEAAQELENKILGANTTSGTKMIEA
Repeat found in LOC126660955
Repeat occurs 3 times in a sequence of 489 amino acids
Location between 9443161 and 9445611
Coverage of 3.68 %
Instances:
AYGESG | AYGNSG | AYGQAG |
pattern: AYG[QNE][AS]G
The following known motifs were found:
VS[AI]Y was found 2 times in this sequence
MAIHSQPQPISYTSFGWSHSISTPPTNHLIKFLTELNFHVHPVILAVKSSKLGSKVAPVGLQR
HPKKDLSRFLRTDAAIQAIEQKANSDKYNRLWPKAVLEALDDAMRENRWESALKIFELLRKQHW
YEPRSRTYTKLLMMLGKCRQPEEAGFLFETMQTEGLQPTIDVYTALVSAYGESGQLDKAFLLVN
EMKSVSDCKPDVYTYSILINICTKLRHFDLIGRILDEMSYLGVECSTVTFNTIIKGYGKAKMFR
EMENTLTEMIESGNSLPDLFTFNSVIGAYGNSGRLEKMEKWYDEFQLMGIDPDIKTFNILIKSY
GKEGMYEKINSVMKFMNKRFYSPTIVTYNIIIETFGKAGNIEKMDEYFKIMKHQGMKPNAITYC
SLVSAYSKAGLIMKVDSILRHVENSDVVLDTTFFNCIIHAYGQAGDIEKMTELFVKMSERECKP
DNITFATMIQAYIAQGMTEAAQELENKILGANTTSGTKMIEA

Similar gene clusters

NC_065577 - Cluster 62 - Cyclopeptide

Gene cluster description

NC_065577 - Gene Cluster 62. Type = cyclopeptide. Location: 17864569 - 20822019 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_065577 - Cluster 63 - Cyclopeptide

Gene cluster description

NC_065577 - Gene Cluster 63. Type = cyclopeptide. Location: 29769063 - 31083110 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_065577 - Cluster 64 - Putative

Gene cluster description

NC_065577 - Gene Cluster 64. Type = putative. Location: 33840507 - 34121947 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065577 - Cluster 65 - Putative

Gene cluster description

NC_065577 - Gene Cluster 65. Type = putative. Location: 36260285 - 36515762 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters