Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_044904.1:
Cluster 1Cyclopeptide1748199718330280848.28BURP---
Cluster 2Fatty_acid2772487428042730317.86FA_hydroxylase, adh_short, adh_short_C2---
The following clusters are from record NC_044905.1:
Cluster 3Saccharide1404031114260182219.87Glycos_transf_2, Transferase, p450---
Cluster 4Fatty_acid-Alkaloid2525776225670015412.25ADH_N, ADH_zinc_N, BBE, FAD_binding_4, FA_desaturase_2, Peptidase_S10---
Cluster 5Fatty_acid3523971435579411339.702OG-FeII_Oxy, DIOX_N, FA_hydroxylase, p450---
Cluster 6Saccharide6397816964232175254.01AMP-binding, Amino_oxidase, Glycos_transf_1, SQS_PSY, UDPGT_2---
Cluster 7Fatty_acid8598946386234690245.23DIOX_N, ECH_2, Transferase---
Cluster 8Terpene9802635398271995245.64Peptidase_S10, Terpene_synth, Terpene_synth_C, Transferase---
The following clusters are from record NC_044906.1:
Cluster 9Cyclopeptide59820935612314181410.48BURP---
Cluster 10Cyclopeptide60268399613445041076.11BURP---
Cluster 11Cyclopeptide6073530261679166943.86BURP---
The following clusters are from record NC_044907.1:
Cluster 12Saccharide39013644305197403.83Glyco_hydro_1, UDPGT_2*saccharide-2, small phenolic--
Cluster 13Cyclopeptide3286634133649590783.25BURP---
The following clusters are from record NC_044908.1:
Cluster 14Terpene40382834199788161.50Lipoxygenase, Prenyltrans, p450---
Cluster 15Phenolamide5445016254614463164.30Aminotran_1_2, Pyridoxal_deC, Transferase---
Cluster 16Alkaloid5855972358770855211.13Bet_v_1, p450---
Cluster 17Terpene7717334477839320665.98Epimerase, Terpene_synth, Terpene_synth_C---
Cluster 18Saccharide8309364683368937275.29GMC_oxred_C, GMC_oxred_N, Glycos_transf_2, Transferase, UDPGT_2---
Cluster 19Saccharide-Alkaloid-Phenolamide-Terpene8537201686257425885.41Glyco_hydro_1, Prenyltrans, Pyridoxal_deC, SQHop_cyclase_C, SQHop_cyclase_N, Transferase, p450beta-amyrin-4, triterpene-4, Cycloartenol-3yossoside I/yossoside II/yossoside III/yossoside IV/yossos... (100% of genes show similarity)BGC0002402.2_c1
Cluster 20Terpene-Alkaloid8698354787209086225.54Epimerase, Prenyltrans, Pyridoxal_deC, SQHop_cyclase_C, SQHop_cyclase_NCycloartenol-3alpha-chaconine/alpha-solanine (9% of genes show similarity)BGC0002722.2_c1
The following clusters are from record NC_044909.1:
Cluster 21Saccharide1311704313229860112.822OG-FeII_Oxy, DIOX_N, Glyco_hydro_1, Peptidase_S10---
The following clusters are from record NC_044910.1:
Cluster 22Saccharide27936343010196216.56Transferase, UDPGT_2alkaloid-2, cytokinin-2--
Cluster 23Putative1154453311883826339.29HMGL-like, Peptidase_S10, adh_short---
Cluster 24Phenolamide-Alkaloid1388134914073355192.01Methyltransf_11, Methyltransf_7, Pyridoxal_deC, Transferase---
Cluster 25Putative2512431625457061332.75ADH_N, ADH_zinc_N, Epimerase, NAD_binding_4, Transferase---
Cluster 26Putative4138429841541375157.082OG-FeII_Oxy, DIOX_N, Dimerisation, Methyltransf_2, Peptidase_S10, p450---
The following clusters are from record NC_044911.1:
Cluster 27Saccharide1950784319770724262.88Glyco_hydro_1, adh_short, adh_short_C2---
Cluster 28Saccharide-Polyketide4076385140919452155.60Chal_sti_synt_C, Chal_sti_synt_N, Dimerisation, Methyltransf_2, UDPGT_2*saccharide--
Cluster 29Putative5147570251843481367.78ADH_N, ADH_zinc_N, Abhydrolase_3, p450---
Cluster 30Putative5390671854326354419.64Amino_oxidase, Peptidase_S10, Transferase, p450---
Cluster 31Saccharide6201534662339139323.79LTP_2, UDPGT_2cytokinin-4--
The following clusters are from record NC_044912.1:
Cluster 32Alkaloid2304784623235472187.63Acetyltransf_1, Bet_v_1---
Cluster 33Saccharide-Alkaloid237423902380885166.46AMP-binding, BBE, Cellulose_synt, FAD_binding_4---
Cluster 34Terpene239256682399901973.35Methyltransf_7, Terpene_synth, Terpene_synth_C---
Cluster 35Saccharide2761293627755290142.352OG-FeII_Oxy, DIOX_N, Methyltransf_11, UDPGT_2---
The following clusters are from record NC_044913.1:
Cluster 36Cyclopeptide1164953612219019569.48BURP---
The following clusters are from record NC_044914.1:
Cluster 37Cyclopeptide8995328100507011055.37BURP---
Cluster 38Cyclopeptide4239248142871199478.72BURP---
Cluster 39Cyclopeptide4282465943516042691.38BURP---
Cluster 40Cyclopeptide4295136143651281699.92BURP---
The following clusters are from record NC_044915.1:
Cluster 41Saccharide344148824282480.13AMP-binding, Epimerase, Methyltransf_11, UDPGT_2cyanogenic glucoside-2, monoterpenoid-2--
Cluster 42Saccharide2550867325623594114.92Epimerase, Glycos_transf_2, UDPGT_2---
Cluster 43Cyclopeptide2580809126609489801.40BURP---

NC_044904 - Cluster 1 - Cyclopeptide

Gene cluster description

NC_044904 - Gene Cluster 1. Type = cyclopeptide. Location: 17481997 - 18330280 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC115981568
Repeat occurs 3 times in a sequence of 74 amino acids
Location between 17489455 and 17493109
Coverage of 28.38 %
Instances:
AVGGAVG | AVGGAVG | AVGAVYG |
pattern: AVG[GA][VA][YV]G
MARDSCLARVTAGVAVGGAVGGAVGAVYGTYEAIRIKVPGLMKIRYIGQTTLGSAAIFGLFLG
AGSLIHCGKSY

Similar gene clusters

NC_044904 - Cluster 2 - Fatty_acid

Gene cluster description

NC_044904 - Gene Cluster 2. Type = fatty_acid. Location: 27724874 - 28042730 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044905 - Cluster 3 - Saccharide

Gene cluster description

NC_044905 - Gene Cluster 3. Type = saccharide. Location: 14040311 - 14260182 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044905 - Cluster 4 - Fatty_acid-alkaloid

Gene cluster description

NC_044905 - Gene Cluster 4. Type = fatty_acid-alkaloid. Location: 25257762 - 25670015 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044905 - Cluster 5 - Fatty_acid

Gene cluster description

NC_044905 - Gene Cluster 5. Type = fatty_acid. Location: 35239714 - 35579411 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044905 - Cluster 6 - Saccharide

Gene cluster description

NC_044905 - Gene Cluster 6. Type = saccharide. Location: 63978169 - 64232175 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044905 - Cluster 7 - Fatty_acid

Gene cluster description

NC_044905 - Gene Cluster 7. Type = fatty_acid. Location: 85989463 - 86234690 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044905 - Cluster 8 - Terpene

Gene cluster description

NC_044905 - Gene Cluster 8. Type = terpene. Location: 98026353 - 98271995 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044906 - Cluster 9 - Cyclopeptide

Gene cluster description

NC_044906 - Gene Cluster 9. Type = cyclopeptide. Location: 59820935 - 61231418 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC115979177
Repeat occurs 4 times in a sequence of 662 amino acids
Location between 59984443 and 60000018
Coverage of 3.63 %
Instances:
EENVTE | EENVTE | EENVTE | EENPHE |
pattern: EEN[VP][TH]E
MAGGGEENVTEIVINNEAHDFSLLLSSSKRDFLVRNNGDRKMAGGSEENVTEIVINDEAHDFS
SLLSPSERDFLVYNNGDQKMIGGGEENVTEIVINDEAHDFSSLLSSFERDFLVRNNGDQVKIET
LKGKKLALYFSASWCGPCRRFTPTLIEVYNELSSKGDFEVIFVSGYEDEESFKGYFSKMPWLAI
PFSDSETRDRLDELFQVRGIPHCVILDEDGKVVSDNGVEIIQEYGIEVYPFTQQKIKDLKDQEE
AARKNQSLRSILVSRSRDFVISSDGKKVPVSELEGKTVGLYFSLFSYQACDDFTPKLDEVYEKL
KAKGENFEIVLIPLEEDEESFNQGFKSLHWFALPVKDKSCQKLARYFELATLPTLVIIGPDGKT
LQSNVAEAIDEHGVLAYPFTPERFTELAEVEKAKQEAQTLESILVSGDQDFVIGKDGVKIPVTD
LVGKNILLYFSAHWCPQCRAFLPKLMDAYHKIKAKDDAFEVIFISYDWDQASFDEFFSGMPWLA
LPFGDPRKASLSRTFKVCGIPLLVAIGPSGKTVTKEARDIIMIHGADAYPFTDERLKEIEAEIE
ELAKGWPEKLKHKLHEEHELVLARRRVYNCDGCDEEGHVWSYYCDKCNFDLHPKCALKEDKGAN
DDSKEEENPHEGWNCDGEICVKV
Repeat found in LOC115979177
Repeat occurs 4 times in a sequence of 662 amino acids
Location between 59984443 and 60000018
Coverage of 3.63 %
Instances:
EENVTE | EENITE | EENVTE | EENPHE |
pattern: EEN[IVP][TH]E
MAGGGEENVTEIVINNEAHDFSLLLSSSKRDFLVRNNGDRKMAGGGEENITEIVINDEAHNFS
SLLSSSERDFLVRNNGDQKMAGGSEENVTEIVINDEAHDFSSLLSPSERDFLVYNNGDQVKIET
LKGKKLALYFSASWCGPCRRFTPTLIEVYNELSSKGDFEVIFVSGYEDEESFKGYFSKMPWLAI
PFSDSETRDRLDELFQVRGIPHCVILDEDGKVVSDNGVEIIQEYGIEVYPFTQQKIKDLKDQEE
AARKNQSLRSILVSRSRDFVISSDGKKVPVSELEGKTVGLYFSLFSYQACDDFTPKLDEVYEKL
KAKGENFEIVLIPLEEDEESFNQGFKSLHWFALPVKDKSCQKLARYFELATLPTLVIIGPDGKT
LQSNVAEAIDEHGVLAYPFTPERFTELAEVEKAKQEAQTLESILVSGDQDFVIGKDGVKIPVTD
LVGKNILLYFSAHWCPQCRAFLPKLMDAYHKIKAKDDAFEVIFISYDWDQASFDEFFSGMPWLA
LPFGDPRKASLSRTFKVCGIPLLVAIGPSGKTVTKEARDIIMIHGADAYPFTDERLKEIEAEIE
ELAKGWPEKLKHKLHEEHELVLARRRVYNCDGCDEEGHVWSYYCDKCNFDLHPKCALKEDKGAN
DDSKEEENPHEGWNCDGEICVKV
Repeat found in LOC115979177
Repeat occurs 5 times in a sequence of 703 amino acids
Location between 59984443 and 60000018
Coverage of 10.67 %
Instances:
EENVTEIVINNEAHD | EENITEIVINDEAHN | EENVTEIVINDEAHD | EENVTEIVINDEAHD | EENPHEGWNCDGEIC

pattern: EEN[IVP][TH]E[IG][WV][NI][NC][DN][GE][EA][HI][DNC]
MAGGGEENVTEIVINNEAHDFSLLLSSSKRDFLVRNNGDRKMAGGGEENITEIVINDEAHNFS
SLLSSSERDFLVRNNGDQKMAGGSEENVTEIVINDEAHDFSSLLSPSERDFLVYNNGDQKMIGG
GEENVTEIVINDEAHDFSSLLSSFERDFLVRNNGDQVKIETLKGKKLALYFSASWCGPCRRFTP
TLIEVYNELSSKGDFEVIFVSGYEDEESFKGYFSKMPWLAIPFSDSETRDRLDELFQVRGIPHC
VILDEDGKVVSDNGVEIIQEYGIEVYPFTQQKIKDLKDQEEAARKNQSLRSILVSRSRDFVISS
DGKKVPVSELEGKTVGLYFSLFSYQACDDFTPKLDEVYEKLKAKGENFEIVLIPLEEDEESFNQ
GFKSLHWFALPVKDKSCQKLARYFELATLPTLVIIGPDGKTLQSNVAEAIDEHGVLAYPFTPER
FTELAEVEKAKQEAQTLESILVSGDQDFVIGKDGVKIPVTDLVGKNILLYFSAHWCPQCRAFLP
KLMDAYHKIKAKDDAFEVIFISYDWDQASFDEFFSGMPWLALPFGDPRKASLSRTFKVCGIPLL
VAIGPSGKTVTKEARDIIMIHGADAYPFTDERLKEIEAEIEELAKGWPEKLKHKLHEEHELVLA
RRRVYNCDGCDEEGHVWSYYCDKCNFDLHPKCALKEDKGANDDSKEEENPHEGWNCDGEICVKV

Similar gene clusters

NC_044906 - Cluster 10 - Cyclopeptide

Gene cluster description

NC_044906 - Gene Cluster 10. Type = cyclopeptide. Location: 60268399 - 61344504 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_044906 - Cluster 11 - Cyclopeptide

Gene cluster description

NC_044906 - Gene Cluster 11. Type = cyclopeptide. Location: 60735302 - 61679166 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC115982652
Repeat occurs 4 times in a sequence of 605 amino acids
Location between 61408175 and 61422297
Coverage of 5.29 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DPA][DPA][PEA][ELP]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTAAMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGVSNGFREESTSSQANSGSTFASRLCQKMEENQIICGETEAFKEELSVLSRCDRRKMTS
KEELSQSRRRRGRSSSKQWSSQHHNKLSKHALLNANKKRRASPTYSLHQNGEHWSSSFAKNTRA
HGVFCRLKVVWLPLNLKDAFQVHPNGSRHKKIQEQTVVLIDEDESQSLIEEDESESLEITERVD
NAECMKDAKITYPSRDDPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYH
FFNTYFYSKLKEAVSYKGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKE
DETGPIILHLDSLRLHSSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAI
AVPQQKNDSDCGLFVLFFMERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFK
KASDVNCDSESSPFTKGDDPANIDHQTDSD
Repeat found in LOC115982652
Repeat occurs 4 times in a sequence of 586 amino acids
Location between 61408175 and 61422297
Coverage of 5.46 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DPA][DPA][PEA][ELP]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTAAMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENIICGETEAFKEELSVLSRCDRRKM
TSKEELSQSRRRRGRSSSKQWSSQHHNKLSKHALLNANKKRRASPTYSLHQNGEHWSSSFAKKK
DAFQVHPNGSRHKKEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMKDAKITYPSRDDPE
SIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYFYSKLKEAVSYKGS
DKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPIILHLDSLRLHSSR
SIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAVPQQKNDSDCGLFVLFFM
ERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFKKASDVNCDSESSPFTKGDD
PANIDHQTDSD
Repeat found in LOC115982652
Repeat occurs 4 times in a sequence of 607 amino acids
Location between 61408175 and 61422297
Coverage of 5.27 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DPA][DPA][PEA][ELP]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTAAMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENIICGETEAFKEELSVLSRCDRRKM
TSKEELSQSRRRRGRSSSKQWSSQHHNKLSKHALLNANKKRRASPTYSLHQNGEHWSSSFAKNT
RAHGVFCRLKVVWLPLNLKDAFQVHPNGSRHKKIQEQTVVLIDEDESQSLIEEDESESLEITER
VDNAECMKDAKITYPSRDDPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCD
YHFFNTYFYSKLKEAVSYKGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPD
KEDETGPIILHLDSLRLHSSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEK
AIAVPQQKNDSDCGLFVLFFMERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLE
FKKASDVNCDSESSPFTKGDDPANIDHQTDSD
Repeat found in LOC115982652
Repeat occurs 4 times in a sequence of 587 amino acids
Location between 61408175 and 61422297
Coverage of 5.45 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DPA][DPA][PEA][ELP]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTAAMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENQIICGETEAFKEELSVLSRCDRRK
MTSKEELSQSRRRRGRSSSKQWSSQHHNKLSKHALLNANKKRRASPTYSLHQNGEHWSSSFAKK
KDAFQVHPNGSRHKKEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMKDAKITYPSRDDP
ESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYFYSKLKEAVSYKG
SDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPIILHLDSLRLHSS
RSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAVPQQKNDSDCGLFVLFF
MERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFKKASDVNCDSESSPFTKGD
DPANIDHQTDSD
Repeat found in LOC115982652
Repeat occurs 4 times in a sequence of 589 amino acids
Location between 61408175 and 61422297
Coverage of 5.43 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DPA][DPA][PEA][ELP]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTAAMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENQIICGETEAFKEELSVLSRCDRRK
MTSKEELSQSRRRRGRSSSKQWSSQHHNKLSKHALLNANKKRRASPTYSLHQNGEHWSSSFAKK
KDAFQVHPNGSRHKKIQEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMKDAKITYPSRD
DPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYFYSKLKEAVSY
KGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPIILHLDSLRLH
SSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAVPQQKNDSDCGLFVL
FFMERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFKKASDVNCDSESSPFTK
GDDPANIDHQTDSD
Repeat found in LOC115982652
Repeat occurs 4 times in a sequence of 597 amino acids
Location between 61408175 and 61422297
Coverage of 5.36 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DPA][DPA][PEA][ELP]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTAAMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENQIICGETEAFKEELSVLSRCDRRK
MTSKEELSQSRRRRGRSSSKQWSSQHHNKLSKHALLNANKKRRASPTYSLHQNGEHWSSSFAKK
LKVVWLPLNLKDAFQVHPNGSRHKKEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMKDA
KITYPSRDDPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYFYS
KLKEAVSYKGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPIIL
HLDSLRLHSSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAVPQQKND
SDCGLFVLFFMERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFKKASDVNCD
SESSPFTKGDDPANIDHQTDSD
Repeat found in LOC115982652
Repeat occurs 4 times in a sequence of 599 amino acids
Location between 61408175 and 61422297
Coverage of 5.34 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DPA][DPA][PEA][ELP]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTAAMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENQIICGETEAFKEELSVLSRCDRRK
MTSKEELSQSRRRRGRSSSKQWSSQHHNKLSKHALLNANKKRRASPTYSLHQNGEHWSSSFAKK
LKVVWLPLNLKDAFQVHPNGSRHKKIQEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMK
DAKITYPSRDDPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYF
YSKLKEAVSYKGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPI
ILHLDSLRLHSSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAVPQQK
NDSDCGLFVLFFMERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFKKASDVN
CDSESSPFTKGDDPANIDHQTDSD
Repeat found in LOC115982652
Repeat occurs 4 times in a sequence of 606 amino acids
Location between 61408175 and 61422297
Coverage of 5.28 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DPA][DPA][PEA][ELP]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTAAMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENQIICGETEAFKEELSVLSRCDRRK
MTSKEELSQSRRRRGRSSSKQWSSQHHNKLSKHALLNANKKRRASPTYSLHQNGEHWSSSFAKN
TRAHGVFCRLKVVWLPLNLKDAFQVHPNGSRHKKEQTVVLIDEDESQSLIEEDESESLEITERV
DNAECMKDAKITYPSRDDPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDY
HFFNTYFYSKLKEAVSYKGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDK
EDETGPIILHLDSLRLHSSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKA
IAVPQQKNDSDCGLFVLFFMERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEF
KKASDVNCDSESSPFTKGDDPANIDHQTDSD
Repeat found in LOC115982652
Repeat occurs 4 times in a sequence of 608 amino acids
Location between 61408175 and 61422297
Coverage of 5.26 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DPA][DPA][PEA][ELP]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTAAMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENQIICGETEAFKEELSVLSRCDRRK
MTSKEELSQSRRRRGRSSSKQWSSQHHNKLSKHALLNANKKRRASPTYSLHQNGEHWSSSFAKN
TRAHGVFCRLKVVWLPLNLKDAFQVHPNGSRHKKIQEQTVVLIDEDESQSLIEEDESESLEITE
RVDNAECMKDAKITYPSRDDPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMC
DYHFFNTYFYSKLKEAVSYKGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIP
DKEDETGPIILHLDSLRLHSSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEE
KAIAVPQQKNDSDCGLFVLFFMERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILML
EFKKASDVNCDSESSPFTKGDDPANIDHQTDSD
Repeat found in LOC115982655
Repeat occurs 6 times in a sequence of 789 amino acids
Location between 61435280 and 61440816
Coverage of 4.56 %
Instances:
RDRDRE | RDRERD | RDRDRG | RDRGDE | RDRDRG
RDRGQG |
pattern: RDR[DGE][DQR][DGE]
MLLMCGIPRYPSPYPAMVRPTFPMRLPGAIGVPPIVSRPPIPGVPGVVRPIISPVIRPAVVPT
VTPAEKPQTTVYIGKIASTVDNDFMLSLLKLCGPVKSWKRAQDPTDGTPKGFGFCEFESAEGVL
RALRLLSKFNIDGQELVLNVNQATKEYLERYVDKQTENSKKLKDAQAAAPEKEEGSAVGVEKSE
HLKADVENSNKEDNSSGSKENHDFANFGIVTDEDKDTDREAVEKLTSMIEERLKTRPLPPPPPP
ITDGGSGNSNSDVPAKSRDGDSEMRNDAAEDKNDDETTSETKPTSEHDRPETSSPDRNRKYERS
RDRDRE
RDLKREKDREIDRYERETERERVRKEREQRRKIEEAERKYEECLKEWEYRERDKEKQR
QNEKDREKERERKRKKEIKHDEEDEDEDSKRWRRRTLEEKRKKRLREKEEDLADRRKEEEEIAD
AKRRAEEELQQKLQRDALKVSSNPVMNGVAKTVSVEPSSVEMKENAVEQGYEGDPAQELHTGDG
VVRNGTGEESTMASVAVSDTRHTGNAPAKKLGFGLVGSGKRTAVPSVFHEEEDDDAHKDKKMRP
LVPIDYSTEELEAVQPTAPGAPPSNLVAATEFAKRISNVNPKEDKFDAERERSRRSNDKSSQRD
RDRG
DEDVNRTRDENKEKSLDRDRDRGQGMDKLKTSENTKLLSLKQLIDMIPRTKEELFSYEVN
WAVYDKHELHERMRPWISKKITEFLGEEETTLVDYIVSSTQEHVKATQMLELLQSILDDEAEMF
VLKMWRMLIYEIKRVEAGLISK

Similar gene clusters

NC_044907 - Cluster 12 - Saccharide

Gene cluster description

NC_044907 - Gene Cluster 12. Type = saccharide. Location: 3901364 - 4305197 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044907 - Cluster 13 - Cyclopeptide

Gene cluster description

NC_044907 - Gene Cluster 13. Type = cyclopeptide. Location: 32866341 - 33649590 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC115987493
Repeat occurs 3 times in a sequence of 475 amino acids
Location between 32905959 and 32907387
Coverage of 4.42 %
Instances:
KKEQDNK | KKEQDHK | KKELESM |
pattern: KKE[QL][DE][NSH][KM]
MKKMSTSDVVCSNGKEVQCSEITIDLEDVKPAQRHPKCCIYKVPNKLRQVKEEAYTPKLISIG
PVHHNKKEQDNKGQDSKKEQDHKKELESMESLKLRYFKEFFGHTTCKGLKEFAYIIEKREKDIR
HCYAEEISMPEKEDFVKMILVDSIFIIELFLRTDARQDYEQGDYILSKPWLEEGIRHDLILLEN
QLPFFILEELHHQIECSPGIRKSFIDLSCKYFLPNDNCKNIPEVKDVKHFTDLQRYFYYSPNLV
SGDIIVHLHSATKLDTAGLNFKKWKKKSSEDIRLLDIQIKKPDLSEICPHFICSWLLHCLPCLK
YFRCLKRMQTSLVIPQFVINHGTEHIFRNIMALEQCHYPFQAYICNYIVLLDFLINTRDDAELL
VDKKIIVHSLGSSKEVAKMVNKLGKEIVEKNSCFHNVAVELNKHYKNWWNKNIASLKTVYFRDI
WRGTATFVGIIVLLATIGNFLRPFVYHK
Repeat found in LOC115987493
Repeat occurs 3 times in a sequence of 475 amino acids
Location between 32905959 and 32907387
Coverage of 4.42 %
Instances:
KKEQDNK | KKEQDHK | KKELESM |
pattern: KKE[QL][DE][NSH][KM]
MKKMSTSDVVCSNGKEVQCSEITIDLEDVKPAQRHPKCCIYKVPNKLRQVKEEAYTPKLISIG
PVHHNKKEQDNKGQDSKKEQDHKKELESMESLKLRYFKEFFGHTTCKGLKEFAYIIEKREKDIR
HCYAEEISMPEKEDFVKMILVDSIFIIELFLRTDARQDYEQGDYILSKPWLEEGIRHDLILLEN
QLPFFILEELHHQIECSPGIRKSFIDLSCKYFLPNDNCKNIPEVKDVKHFTDLQRYFYYSPNLV
SGDIIVHLHSATKLDTAGLNFKKWKKKSSEDIRLLDIQIKKPDLSEICPHFICSWLLHCLPCLK
YFRCLKRMQTSLVIPQFVINHGTEHIFRNIMALEQCHYPFQAYICNYIVLLDFLINTRDDAELL
VDKKIIVHSLGSSKEVAKMVNKLGKEIVEKNSCFHNVAVELNKHYKNWWNKNIASLKTVYFRDI
WRGTATFVGIIVLLATIGNFLRPFVYHK
Repeat found in LOC115984197
Repeat occurs 3 times in a sequence of 200 amino acids
Location between 33362487 and 33363354
Coverage of 9.0 %
Instances:
DLPRLK | DLPQLE | DLPQLK |
pattern: DLP[QR]L[KE]
MKQIISDQPICLPKLETLWVTNCRELEYIFPISVARGLQQLERLGLGDLPRLKKVFGQNREGE
VGDCEIESHHQPTGFPKLKTIEVVNCGNLEYLFPISIARDLPQLESLSLKDLPQLKQVFGHEEG
GDDGDGNNSVLSKLRNLRLKNLPELVSLGGGNSSSVWPSLERLEVVNCPKVKLADVEANVPALQ
KVDKKEIKN

Similar gene clusters

NC_044908 - Cluster 14 - Terpene

Gene cluster description

NC_044908 - Gene Cluster 14. Type = terpene. Location: 4038283 - 4199788 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044908 - Cluster 15 - Phenolamide

Gene cluster description

NC_044908 - Gene Cluster 15. Type = phenolamide. Location: 54450162 - 54614463 nt. Click on genes for more information.
Show pHMM detection rules used
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044908 - Cluster 16 - Alkaloid

Gene cluster description

NC_044908 - Gene Cluster 16. Type = alkaloid. Location: 58559723 - 58770855 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044908 - Cluster 17 - Terpene

Gene cluster description

NC_044908 - Gene Cluster 17. Type = terpene. Location: 77173344 - 77839320 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044908 - Cluster 18 - Saccharide

Gene cluster description

NC_044908 - Gene Cluster 18. Type = saccharide. Location: 83093646 - 83368937 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044908 - Cluster 19 - Saccharide-alkaloid-phenolamide-terpene

Gene cluster description

NC_044908 - Gene Cluster 19. Type = saccharide-alkaloid-phenolamide-terpene. Location: 85372016 - 86257425 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_044908 - Cluster 20 - Terpene-alkaloid

Gene cluster description

NC_044908 - Gene Cluster 20. Type = terpene-alkaloid. Location: 86983547 - 87209086 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_044909 - Cluster 21 - Saccharide

Gene cluster description

NC_044909 - Gene Cluster 21. Type = saccharide. Location: 13117043 - 13229860 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044910 - Cluster 22 - Saccharide

Gene cluster description

NC_044910 - Gene Cluster 22. Type = saccharide. Location: 2793634 - 3010196 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044910 - Cluster 23 - Putative

Gene cluster description

NC_044910 - Gene Cluster 23. Type = putative. Location: 11544533 - 11883826 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044910 - Cluster 24 - Phenolamide-alkaloid

Gene cluster description

NC_044910 - Gene Cluster 24. Type = phenolamide-alkaloid. Location: 13881349 - 14073355 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044910 - Cluster 25 - Putative

Gene cluster description

NC_044910 - Gene Cluster 25. Type = putative. Location: 25124316 - 25457061 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044910 - Cluster 26 - Putative

Gene cluster description

NC_044910 - Gene Cluster 26. Type = putative. Location: 41384298 - 41541375 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044911 - Cluster 27 - Saccharide

Gene cluster description

NC_044911 - Gene Cluster 27. Type = saccharide. Location: 19507843 - 19770724 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044911 - Cluster 28 - Saccharide-polyketide

Gene cluster description

NC_044911 - Gene Cluster 28. Type = saccharide-polyketide. Location: 40763851 - 40919452 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044911 - Cluster 29 - Putative

Gene cluster description

NC_044911 - Gene Cluster 29. Type = putative. Location: 51475702 - 51843481 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044911 - Cluster 30 - Putative

Gene cluster description

NC_044911 - Gene Cluster 30. Type = putative. Location: 53906718 - 54326354 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044911 - Cluster 31 - Saccharide

Gene cluster description

NC_044911 - Gene Cluster 31. Type = saccharide. Location: 62015346 - 62339139 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044912 - Cluster 32 - Alkaloid

Gene cluster description

NC_044912 - Gene Cluster 32. Type = alkaloid. Location: 23047846 - 23235472 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044912 - Cluster 33 - Saccharide-alkaloid

Gene cluster description

NC_044912 - Gene Cluster 33. Type = saccharide-alkaloid. Location: 23742390 - 23808851 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044912 - Cluster 34 - Terpene

Gene cluster description

NC_044912 - Gene Cluster 34. Type = terpene. Location: 23925668 - 23999019 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044912 - Cluster 35 - Saccharide

Gene cluster description

NC_044912 - Gene Cluster 35. Type = saccharide. Location: 27612936 - 27755290 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044913 - Cluster 36 - Cyclopeptide

Gene cluster description

NC_044913 - Gene Cluster 36. Type = cyclopeptide. Location: 11649536 - 12219019 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC115962457
Repeat occurs 4 times in a sequence of 842 amino acids
Location between 11848718 and 11851247
Coverage of 3.8 %
Instances:
KCGAVEDA | KCGDVRTA | KCGDPETG | KCGDIDSS |
pattern: KCG[DA][IVP][DRE][DTS][SGA]
MAILTPLGSLNGHCTEHSYTLRPFKVTHHSLSHALYSKRASSLCYSTKFQNCSTSIQFQENPS
QKSIVKTHLLHNQPSNSVSSASKLIKECVDDGLFENAIRVYLKMFECDGFQAEKFRFFPCLIKA
FGGLCDVDKVKQVHGHVLNLGVLNDVSVGNSLLGVYLKCGAVEDAVQMFEEMGERDPVSWNSMI
SGFYQSGDYMGALITFSMMIWEYGVYPNRVACLSALSSCSSIKSLIHGREIHGFVVKSGLNVEE
FLVSGLIEMYMKCGDVRTAEHVFKGILDNESSRGNAVIWNVMILGYVSNGCLLQALDLFLDMLA
MGTKPDSSTLVAVLVLCSQLSDLAVGKQMHGCIVSFGLDYNVRVETALIDMYFKCGDPETGLKI
FRSSQNRNLVMWGAVISNCAQGGCAAEALELFYTFLSKYGFADSVMLLAVLRACSFLTLNSRGM
EIHGLAIKTGFDDDMFIGGALVDMYAKCQDIESAQKVFQRLPARDLVLWNSLISGYAQNECADK
ALKAFCEMQSEQIAPNSVTASCILSVCARFSVMILCKEIHCYLLRRGIESNVSVSNSLIATYAK
CGDIDSS
TTLFEKMLERDKVSWNSIILGFGMHGHTDEMFVLFKKMTETGMKPDHATFTALLSAC
SHTGRVDMGLKCFKSMVEDHKLEPRVEQYTCIVDLLGRAGHLKQAYDLIMTMPCVPDDLIWGSL
LGSCKTHGNEKLAELVSKHIFELDPTCISYRVLLSNIYEGFGKQNEVTRVRSEIKDLRLKKQPG
CSWIEVDHKIHSFISGDHSHHQSKEIYGAIESLTKEMKRVDYNPRTQNVIGSDESNVEGINSSK
DIDLLYNKSVC

Similar gene clusters

NC_044914 - Cluster 37 - Cyclopeptide

Gene cluster description

NC_044914 - Gene Cluster 37. Type = cyclopeptide. Location: 8995328 - 10050701 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_044914 - Cluster 38 - Cyclopeptide

Gene cluster description

NC_044914 - Gene Cluster 38. Type = cyclopeptide. Location: 42392481 - 42871199 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC115967407
Repeat occurs 5 times in a sequence of 441 amino acids
Location between 42631048 and 42632631
Coverage of 9.07 %
Instances:
QKEMDQPY | QKEAITYT | QKEMDQPY | QKEAITYT | QKEMDQPY

pattern: QKE[AM][DI][QT][YP][TY]
MDHQEHDGVEDVTNQKEMDQPYIAGYNRQKEAITYTRQDAKNSYITQYGDASKNSRMDHQEHD
GVEDVTNQKEMDQPYIAGYNRQKEAITYTRQDAKNSFITQYGDASKNSRMDHQEHDGVEDVTKQ
KEMDQPY
LAGYRKDAKDNYMAQYGTRQEAKESPIQESDQSYIAGYRYKQEVKESNYITSYGDRE
DSKSSVIEHENNMKTKPSSHMDHLGAFKSGFFTMDDLFVGNIMPLYFPIQELSEFLPREEADSI
PFSMPQLQNVLQIFSIPIDSPNARAMEDTLQQCEATPITGETKLCATSLESMLDFVHSIMRSWA
NLNVLTTKHPTMSTALTQNYDVLRVSKELYAPKWVACHPLPYPYKIFFCHFIEKTKIFKVLLGG
ENGHKVEAVAVCHMDTSDWDPNHILFRQLGVKPGSSPVCHFLPKYHLVWVPSPTTASI

Similar gene clusters

NC_044914 - Cluster 39 - Cyclopeptide

Gene cluster description

NC_044914 - Gene Cluster 39. Type = cyclopeptide. Location: 42824659 - 43516042 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC115966243
Repeat occurs 13 times in a sequence of 397 amino acids
Location between 43320026 and 43321588
Coverage of 22.92 %
Instances:
DFEPRPS | DFEPRPS | DFEPRPS | DFEPRPS | DFEPRPS
DFEPRPS | DFEPRPS | DFEPRPS | DFEPRPS | DFEPRPS
DFEPRPS | DFEQRPN | DFEPRPI |
pattern: DFE[QP]RP[NSI]
The following known motifs were found:
FEPR was found 12 times in this sequence
MTVDMDSRFAWLTLFSLLLFANTIESRKDPGEYWTSVMKDQPMPKAIQGLVHLDSSPSKLSKN
DNCHTSEGAIGKDQGEKTYVKDFEPRPSPSVTAYVNDVGLETKNFVKDFEPRPSPSVTAYNNDV
GLKTQKKFVKDFEPRPSPSTTAYVNDVGLETKNNFVKDFEPRPSPSVTTYINDVGLKNQNKFVK
D
FEPRPSPSATAYVNDVGLETKKKFVNDFEPRPSPSTTAYVNDVGLETKKKFVNDFEPRPSPST
TTYVNDVGLETKNNFVKDFEPRPSPSVTTYINDVGLKNQNKYVKDFEPRPSPSATAYVNDIGLE
TKKNFVKDFEPRPSPSATAYVNDVGLETKKKFVNDFEPRPSPSVTAYVNVVDPETKKKFVNDFE
QRPN
ISACNDDDVGSKDEKPFVEDFEPRPIASVYSE

Similar gene clusters

NC_044914 - Cluster 40 - Cyclopeptide

Gene cluster description

NC_044914 - Gene Cluster 40. Type = cyclopeptide. Location: 42951361 - 43651281 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC115966243
Repeat occurs 13 times in a sequence of 397 amino acids
Location between 43320026 and 43321588
Coverage of 22.92 %
Instances:
DFEPRPS | DFEPRPS | DFEPRPS | DFEPRPS | DFEPRPS
DFEPRPS | DFEPRPS | DFEPRPS | DFEPRPS | DFEPRPS
DFEPRPS | DFEQRPN | DFEPRPI |
pattern: DFE[QP]RP[NSI]
The following known motifs were found:
FEPR was found 12 times in this sequence
MTVDMDSRFAWLTLFSLLLFANTIESRKDPGEYWTSVMKDQPMPKAIQGLVHLDSSPSKLSKN
DNCHTSEGAIGKDQGEKTYVKDFEPRPSPSVTAYVNDVGLETKNFVKDFEPRPSPSVTAYNNDV
GLKTQKKFVKDFEPRPSPSTTAYVNDVGLETKNNFVKDFEPRPSPSVTTYINDVGLKNQNKFVK
D
FEPRPSPSATAYVNDVGLETKKKFVNDFEPRPSPSTTAYVNDVGLETKKKFVNDFEPRPSPST
TTYVNDVGLETKNNFVKDFEPRPSPSVTTYINDVGLKNQNKYVKDFEPRPSPSATAYVNDIGLE
TKKNFVKDFEPRPSPSATAYVNDVGLETKKKFVNDFEPRPSPSVTAYVNVVDPETKKKFVNDFE
QRPN
ISACNDDDVGSKDEKPFVEDFEPRPIASVYSE

Similar gene clusters

NC_044915 - Cluster 41 - Saccharide

Gene cluster description

NC_044915 - Gene Cluster 41. Type = saccharide. Location: 344148 - 824282 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044915 - Cluster 42 - Saccharide

Gene cluster description

NC_044915 - Gene Cluster 42. Type = saccharide. Location: 25508673 - 25623594 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_044915 - Cluster 43 - Cyclopeptide

Gene cluster description

NC_044915 - Gene Cluster 43. Type = cyclopeptide. Location: 25808091 - 26609489 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC115970715
Repeat occurs 3 times in a sequence of 287 amino acids
Location between 26083141 and 26085345
Coverage of 7.32 %
Instances:
SSSSSHE | SSSSHEE | SSSHEEI |
pattern: SSS[HS][HSE][HE][IE]
MASNVREARRRKIMERGSDRLALITGQIQSLPSSSSSHEEITTDSDNPSSQPLISHTAAVSPH
GEDEGSGSFMVKHDPITDACQTDADIGESNLAPPLRKCEISDEALGARASEFNGKAQSPLVSSI
DQNSSASTLNSEQHLEQQRPQHRFFTPNQISSAIASSERTRFLCSVTVALLVVLSYLGFPLLGS
YVIKSMLSFRPLYLLLLTNVTIVLARLLVDKQTGFERAARDENETTSLNGYGWAEQVGNTLETG
LVMQKALDAIFMDCSVYAIIVICGLSLASLFS
Repeat found in LOC115970713
Repeat occurs 3 times in a sequence of 395 amino acids
Location between 26093198 and 26097595
Coverage of 4.56 %
Instances:
VSSNEV | VSSDYV | VSSDEM |
pattern: VSS[DN][YE][VM]
MVSLEVAQVMENRTKKRTRSAWDITPAQPQAKRGEVVSSNEVISRRRQASPPRRDDDREGHYV
FNLGENLTPRYKILSKMGEGTFGRVLECWDRQTREYVAIKVVRSIRKYRNAAMIEVDVLQHLAK
NDKGDSHCVQIRSWFDYRNHICIVFEKLGPSLFDFLKRNKYCPFPVDLVREFGRQLLESVAYMH
DLRLIHTDLKPENILLVSSDYVSSDEMLSRCLPKSSAIKLIDFGSAAFDNQNHSSIVSTRHYRA
PEVILGLGWSYPCDLWSVGCILVELCSGEALFQTHENLEHLAMMERVFGPLPEHMIRRADKGAE
KYFRRGLRLNWPEGAVSRESIRAVNKLHRLQDLVSQHVESSRFSLTDLLNGLLMYEPSERLTAR
QALNHPFFNNLT
Repeat found in LOC115970713
Repeat occurs 3 times in a sequence of 402 amino acids
Location between 26093198 and 26097595
Coverage of 4.48 %
Instances:
VSSNEV | VSSDYV | VSSDEM |
pattern: VSS[DN][YE][VM]
MVSLEVAQVMENRTKKRTRSAWDITPAQPQAKRGEVVSSNEVISRRRQASPPRRDDDREGHYV
FNLGENLTPRYKILSKMGEGTFGRVLECWDRQTREYVAIKVVRSIRKYRNAAMIEVDVLQHLAK
NDKGDSHCVQIRSWFDYRNHICIVFEKLGPSLFDFLKRNKYCPFPVDLVREFGRQLLESVAYMH
DLRLIHTDLKPENILLVSSDYVRLPGYKVSSDEMLSRCLPKSSAIKLIDFGSAAFDNQNHSSIV
STRHYRAPEVILGLGWSYPCDLWSVGCILVELCSGEALFQTHENLEHLAMMERVFGPLPEHMIR
RADKGAEKYFRRGLRLNWPEGAVSRESIRAVNKLHRLQDLVSQHVESSRFSLTDLLNGLLMYEP
SERLTARQALNHPFFNNLT
Repeat found in LOC115970713
Repeat occurs 3 times in a sequence of 403 amino acids
Location between 26093198 and 26097595
Coverage of 4.47 %
Instances:
VSSNEV | VSSDYV | VSSDEM |
pattern: VSS[DN][YE][VM]
MVSLEVAQVMENRTKKRTRSAWDITPAQPQAKRGEVVSSNEVISRRRQASPPRRDDDREGHYV
FNLGENLTPRYKILSKMGEGTFGRVLECWDRQTREYVAIKVVRSIRKYRNAAMIEVDVLQHLAK
NDKGDSHCVQIRSWFDYRNHICIVFEKLGPSLFDFLKRNKYCPFPVDLVREFGRQLLESVAYMH
DLRLIHTDLKPENILLVSSDYVRLPGYKKVSSDEMLSRCLPKSSAIKLIDFGSAAFDNQNHSSI
VSTRHYRAPEVILGLGWSYPCDLWSVGCILVELCSGEALFQTHENLEHLAMMERVFGPLPEHMI
RRADKGAEKYFRRGLRLNWPEGAVSRESIRAVNKLHRLQDLVSQHVESSRFSLTDLLNGLLMYE
PSERLTARQALNHPFFNNLT
Repeat found in LOC115970450
Repeat occurs 4 times in a sequence of 819 amino acids
Location between 26213035 and 26217764
Coverage of 2.93 %
Instances:
TQADCA | TQADGA | TQADGA | TQAEAT |
pattern: TQA[DE][GCA][TA]
MCGFRLCWLGSVPTKVGGFGWARLWVLMDGSAVGGYGFEVLVVEAFVVGRIESCPSGGFSLQI
HNSNTPSFFKVMDSSTNAPFVSTQADCAAATQADGAAATATQADGATATATQEAAAATQAEATD
GELPLVPPSVVSKTGTGSGRKKSLAWNHFEKVKVDDGVTMAVCNYCKKSYLADSKSCGTSNLLA
HVTICPKNPNREDKGQKTLAFEPKNDGDEGFKLVSTTFSVEASRKALAEMIIIDELPFRCVEGY
GFKKYVTTLQPKLRVKDIPSRQTVARDVIGIYNSEREKLRKSLKGCRVCLTTDTWTSLQNLNYM
CLTCHFIDDTWKLHKRILNFCQVEDHKGETIGRKIEMSLREWGIDGIFTLTVDNASSNLTTVKF
LQRVTKDWNGTVLENELMHMRCCAHILNLIVGEGLKEIDASVARVREAVRYVKSLPNRNQTFRN
FMERLGMESKSLLCLDVPTRWNSTYLMLETAEKFEKVFLRMDFEDDGYSSYFRSKEDSGGLGSP
CMSDFQNCRAFVTFLRLFYNATKKFSGSLYVTSNAFFDEIFVIQESISHLVKSQNTLLKNTATN
MQTKFEKYWGEGDKINPLLYVAVVLDPRKKLRFLKFSFSEIYGIEVGSVMVDKVKALLMKLYTF
FCSVNSPNVEEPSGGERTLMVVGDASDPYVMVHSRYELFLEAEQSIGCSNEVDKYLAENCDGRR
DGNFEVLGWWKDNSSRYPMLSKVAKDVLAVPVSTVASESAFSTGGRIVDPFRSSLSPLMVQNLV
CAQNWLQATVPISHRQSRDEVEALEEEFHDLGGITYFVKEAEPFGNILEDFL
Repeat found in LOC115970788
Repeat occurs 3 times in a sequence of 637 amino acids
Location between 26387334 and 26391259
Coverage of 2.83 %
Instances:
IISQVE | IISGYC | IISGWC |
pattern: IIS[QG][YWV][CE]
MGESGNEKAISKTTRKHQVRTTTIKKGWVQALPIKEHSLNDKNQLYPTSKSTNSKNQPYCIIC
KTKNSCRTVRSRTKLMNILIERGKPQEVESIFDSLIEGGHRPSLVTYTTLLAALTIQKRFESIH
LIISQVEENEMKPDSIFFNAVINAFSESGNIEEAMETFWKMKENGLKPTTCTYNTLIKGYGIAG
KPTESLKLLDLMSQEGNVKPNLRTYNMLVRAWCKKNNITEAWNVVCKMVDSGMQPDAVTYNTIS
TAYAQNGETDRAERMILEMQNNKVQPNERTCGIIISGYCKEGKLKEALRFVHRMKDLGVHPNLV
IFNSLIKGFVEIMDRDGADEVMTLMAEFGVKPDVITYSTIMNAWSTAGFMEKCREIFDDMMKAG
IEPDAHAYSILAKGYVRAQEPEKAKELLTAMIKSGVRPNVVIFTTIISGWCSSGKMDHAIKVFN
EMCEYKVSPNLKTFETLIWGYGEAKQPWKAEEMLQIMKEFGIQPEKSTILLVAEAWRATGLTKE
ANRILGTMKGKGMTRLIPTEEEIQEESLEKIYQRQPTGASYSNLLQIPAIFTSDQKGSAAATKK
SRMVLRDGDFSLESSCLATKTVYLTHTCKFGDRSPIICRKQSQGQLGTYAPLALSCTVGFLN
Repeat found in LOC115970788
Repeat occurs 3 times in a sequence of 637 amino acids
Location between 26387334 and 26391259
Coverage of 2.83 %
Instances:
IISQVE | IISGYC | IISGWC |
pattern: IIS[QG][YWV][CE]
MGESGNEKAISKTTRKHQVRTTTIKKGWVQALPIKEHSLNDKNQLYPTSKSTNSKNQPYCIIC
KTKNSCRTVRSRTKLMNILIERGKPQEVESIFDSLIEGGHRPSLVTYTTLLAALTIQKRFESIH
LIISQVEENEMKPDSIFFNAVINAFSESGNIEEAMETFWKMKENGLKPTTCTYNTLIKGYGIAG
KPTESLKLLDLMSQEGNVKPNLRTYNMLVRAWCKKNNITEAWNVVCKMVDSGMQPDAVTYNTIS
TAYAQNGETDRAERMILEMQNNKVQPNERTCGIIISGYCKEGKLKEALRFVHRMKDLGVHPNLV
IFNSLIKGFVEIMDRDGADEVMTLMAEFGVKPDVITYSTIMNAWSTAGFMEKCREIFDDMMKAG
IEPDAHAYSILAKGYVRAQEPEKAKELLTAMIKSGVRPNVVIFTTIISGWCSSGKMDHAIKVFN
EMCEYKVSPNLKTFETLIWGYGEAKQPWKAEEMLQIMKEFGIQPEKSTILLVAEAWRATGLTKE
ANRILGTMKGKGMTRLIPTEEEIQEESLEKIYQRQPTGASYSNLLQIPAIFTSDQKGSAAATKK
SRMVLRDGDFSLESSCLATKTVYLTHTCKFGDRSPIICRKQSQGQLGTYAPLALSCTVGFLN
Repeat found in LOC115970788
Repeat occurs 3 times in a sequence of 637 amino acids
Location between 26387334 and 26391259
Coverage of 2.83 %
Instances:
IISQVE | IISGYC | IISGWC |
pattern: IIS[QG][YWV][CE]
MGESGNEKAISKTTRKHQVRTTTIKKGWVQALPIKEHSLNDKNQLYPTSKSTNSKNQPYCIIC
KTKNSCRTVRSRTKLMNILIERGKPQEVESIFDSLIEGGHRPSLVTYTTLLAALTIQKRFESIH
LIISQVEENEMKPDSIFFNAVINAFSESGNIEEAMETFWKMKENGLKPTTCTYNTLIKGYGIAG
KPTESLKLLDLMSQEGNVKPNLRTYNMLVRAWCKKNNITEAWNVVCKMVDSGMQPDAVTYNTIS
TAYAQNGETDRAERMILEMQNNKVQPNERTCGIIISGYCKEGKLKEALRFVHRMKDLGVHPNLV
IFNSLIKGFVEIMDRDGADEVMTLMAEFGVKPDVITYSTIMNAWSTAGFMEKCREIFDDMMKAG
IEPDAHAYSILAKGYVRAQEPEKAKELLTAMIKSGVRPNVVIFTTIISGWCSSGKMDHAIKVFN
EMCEYKVSPNLKTFETLIWGYGEAKQPWKAEEMLQIMKEFGIQPEKSTILLVAEAWRATGLTKE
ANRILGTMKGKGMTRLIPTEEEIQEESLEKIYQRQPTGASYSNLLQIPAIFTSDQKGSAAATKK
SRMVLRDGDFSLESSCLATKTVYLTHTCKFGDRSPIICRKQSQGQLGTYAPLALSCTVGFLN
Repeat found in LOC115970788
Repeat occurs 3 times in a sequence of 637 amino acids
Location between 26387334 and 26391259
Coverage of 2.83 %
Instances:
IISQVE | IISGYC | IISGWC |
pattern: IIS[QG][YWV][CE]
MGESGNEKAISKTTRKHQVRTTTIKKGWVQALPIKEHSLNDKNQLYPTSKSTNSKNQPYCIIC
KTKNSCRTVRSRTKLMNILIERGKPQEVESIFDSLIEGGHRPSLVTYTTLLAALTIQKRFESIH
LIISQVEENEMKPDSIFFNAVINAFSESGNIEEAMETFWKMKENGLKPTTCTYNTLIKGYGIAG
KPTESLKLLDLMSQEGNVKPNLRTYNMLVRAWCKKNNITEAWNVVCKMVDSGMQPDAVTYNTIS
TAYAQNGETDRAERMILEMQNNKVQPNERTCGIIISGYCKEGKLKEALRFVHRMKDLGVHPNLV
IFNSLIKGFVEIMDRDGADEVMTLMAEFGVKPDVITYSTIMNAWSTAGFMEKCREIFDDMMKAG
IEPDAHAYSILAKGYVRAQEPEKAKELLTAMIKSGVRPNVVIFTTIISGWCSSGKMDHAIKVFN
EMCEYKVSPNLKTFETLIWGYGEAKQPWKAEEMLQIMKEFGIQPEKSTILLVAEAWRATGLTKE
ANRILGTMKGKGMTRLIPTEEEIQEESLEKIYQRQPTGASYSNLLQIPAIFTSDQKGSAAATKK
SRMVLRDGDFSLESSCLATKTVYLTHTCKFGDRSPIICRKQSQGQLGTYAPLALSCTVGFLN
Repeat found in LOC115970788
Repeat occurs 3 times in a sequence of 637 amino acids
Location between 26387334 and 26391259
Coverage of 2.83 %
Instances:
IISQVE | IISGYC | IISGWC |
pattern: IIS[QG][YWV][CE]
MGESGNEKAISKTTRKHQVRTTTIKKGWVQALPIKEHSLNDKNQLYPTSKSTNSKNQPYCIIC
KTKNSCRTVRSRTKLMNILIERGKPQEVESIFDSLIEGGHRPSLVTYTTLLAALTIQKRFESIH
LIISQVEENEMKPDSIFFNAVINAFSESGNIEEAMETFWKMKENGLKPTTCTYNTLIKGYGIAG
KPTESLKLLDLMSQEGNVKPNLRTYNMLVRAWCKKNNITEAWNVVCKMVDSGMQPDAVTYNTIS
TAYAQNGETDRAERMILEMQNNKVQPNERTCGIIISGYCKEGKLKEALRFVHRMKDLGVHPNLV
IFNSLIKGFVEIMDRDGADEVMTLMAEFGVKPDVITYSTIMNAWSTAGFMEKCREIFDDMMKAG
IEPDAHAYSILAKGYVRAQEPEKAKELLTAMIKSGVRPNVVIFTTIISGWCSSGKMDHAIKVFN
EMCEYKVSPNLKTFETLIWGYGEAKQPWKAEEMLQIMKEFGIQPEKSTILLVAEAWRATGLTKE
ANRILGTMKGKGMTRLIPTEEEIQEESLEKIYQRQPTGASYSNLLQIPAIFTSDQKGSAAATKK
SRMVLRDGDFSLESSCLATKTVYLTHTCKFGDRSPIICRKQSQGQLGTYAPLALSCTVGFLN

Similar gene clusters