Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_037254.2:
Cluster 1Cyclopeptide26333233283673650.35BURP---
Cluster 2Cyclopeptide1136566011778048412.39BURP---
Cluster 3Saccharide-Fatty_acid1959224819720551128.30CER1-like_C, FA_hydroxylase, Glycos_transf_1, Transferase---
Cluster 4Polyketide2027906620555128276.06Abhydrolase_3, Chal_sti_synt_C, Chal_sti_synt_N, Lyase_aromatic, p450---
Cluster 5Saccharide2275829623091821333.52ABC2_membrane, ABC_tran, Aminotran_1_2, Epimerase, Glyco_hydro_1---
The following clusters are from record NC_037255.2:
Cluster 6Cyclopeptide1340768813940620532.93BURP---
The following clusters are from record NC_037256.2:
Cluster 7Alkaloid1217100712312669141.66ECH_2, His_biosynth, Pyridoxal_deC, Trp_syntA-benzoxazinone DIMBOA (17% of genes show similarity)BGC0000810.3_c1
Cluster 8Cyclopeptide13835300151077861272.49BURP---
The following clusters are from record NC_037261.2:
Cluster 9Saccharide84292491313170.21ADH_N, ADH_zinc_N, Glyco_transf_28, Methyltransf_11, UDPGT*saccharide-2--
The following clusters are from record NC_037263.2:
Cluster 10Polyketide1708295177957771.28Acetyltransf_1, Chal_sti_synt_C, Chal_sti_synt_N, Epimerase---
The following clusters are from record NC_037264.2:
Cluster 11Cyclopeptide42507934968104717.31BURP---
The following clusters are from record NC_037265.2:
Cluster 12Cyclopeptide1121227312051162838.89BURP---
Cluster 13Saccharide150214901507660155.11ADH_N, ADH_zinc_N, Abhydrolase_3, Glyco_hydro_1---
The following clusters are from record NC_037266.2:
Cluster 14Terpene25441142653501109.39Aminotran_1_2, Aminotran_3, Prenyltrans, SQS_PSY---
The following clusters are from record NC_037268.2:
Cluster 15Cyclopeptide11978351822552624.72BURP---
Cluster 16Cyclopeptide1190877712778598869.82BURP---
The following clusters are from record NC_037269.2:
Cluster 17Cyclopeptide47730595222143449.08BURP---
The following clusters are from record NC_037270.2:
Cluster 18Saccharide-Fatty_acid1262211213006844384.732OG-FeII_Oxy, Acetyltransf_1, CER1-like_C, FA_hydroxylase, Glycos_transf_1---
Cluster 19Terpene142479231432828780.36Abhydrolase_3, SQHop_cyclase_C, SQHop_cyclase_N, adh_short_C2---
The following clusters are from record NC_037273.2:
Cluster 20Polyketide49171385028041110.90Chal_sti_synt_C, Chal_sti_synt_N, Lyase_aromatic, adh_short_C2---
Cluster 21Saccharide-Fatty_acid104680171053305465.04FA_hydroxylase, Glycos_transf_2, adh_short---

NC_037254 - Cluster 1 - Cyclopeptide

Gene cluster description

NC_037254 - Gene Cluster 1. Type = cyclopeptide. Location: 2633323 - 3283673 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112278897
Repeat occurs 5 times in a sequence of 659 amino acids
Location between 3047166 and 3049355
Coverage of 7.59 %
Instances:
EPEADPEPEA | EPEAEAEPEP | EPEPEPEPVV | EPEPEPVVEP | EPEPVVEPVP

pattern: EPE[PA][DEV][PVA][VE][PV][VE][PVA]
MEIQDSVAAACHADRQRLLEVFRVAIAKPLPASQTAVATMVEAVKPQKQTVLVQDENQLLENG
LRTLLEELVGAAVANGKSVMQYGRAEGPEDDGNGLLTRLLDIVLYLCEKGNVEGGMIFQLLEDL
TEVSTIKDCKEVFGYIEGKQETLGKSELFSRGKLVMLRTCNQLLRRLSKTNDVVFCGRILMFLA
HFFPLSERSAVNIKGVFNTSNETKYEKEPPGDSPAIDFTFYKTFWSLQEYFSNPTTLTHPSKWQ
TFSSNLGIVLETFEMQPIGEEDESLSLDGDDDSTFNIKYLTSSNLMSLELKDPGFRRHVLVQCL
IIFDYMKTPGKAEKDGPREGVREELKVHEDRVKKLLRTTPPKGREFLASIEHILERDKNWVWWK
RDGCIPFERFVQEKKIPSDPPKRRPRWRLGNRELSRLWKWSDDNPNAITHAKRVKMPPLAEYFK
PLAEDMDPEAGIEEEYHHKNNKVYCWKGLRFAARQDLDALSRYSDMKIMGVVPPDLLTPEVRSS
LYKPTDRPKAKGVKKDEEYTPNLTQTEDASGSAPTSVSLMELDATTLARDPEEATAMDADGPAV
ADVEPEADPEPEAEAEPEPEPEPVVEPVPLTDLSDAKPETVQTPVEAEPSSSKVVSEPPPVGNQ
SPSPLKSEARSGKKRAHPDD

Similar gene clusters

NC_037254 - Cluster 2 - Cyclopeptide

Gene cluster description

NC_037254 - Gene Cluster 2. Type = cyclopeptide. Location: 11365660 - 11778048 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112276432
Repeat occurs 4 times in a sequence of 358 amino acids
Location between 11724549 and 11727456
Coverage of 8.94 %
Instances:
AAAASPPK | AAASPPKP | AAAGGAKK | AAAGGAKK |
pattern: AAA[GAS][PSG][PA][PK][KP]
MACTGHCLGAAMATVSRLPHFVPCVNAQKCHGSAGGHLVVSLQQVAWVPSTFGGKGIAGLGNV
LKFRAGESSTYGRMASQRMIVMASQEAGTSTLVVQEVEFIWKGPGSKVLLSGDFVNWESQVPLE
KSGSEGWFVVKQNIAPGTYKYKFIVDGEWQHSPDYPTVSDNSGGFNNEITVSEEKSTPKPTRKE
DNAGSSKGEVKGSTEEVAKVDTPKPAAAASPPKPKVAAAGGAKKAAAGGAKKAVKAVAKPLPET
MLEDVIPRLTAYFEKEQGVSAVDIQFDDNQLQGTFIKNEVQYNFWAYFPDGKLEGSRGFSLTSH
GCSPSTVEPFLIDESKITGESVVQQVIKRLFAQKLLATN

Similar gene clusters

NC_037254 - Cluster 3 - Saccharide-fatty_acid

Gene cluster description

NC_037254 - Gene Cluster 3. Type = saccharide-fatty_acid. Location: 19592248 - 19720551 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

No significant ClusterBlast hits found.

NC_037254 - Cluster 4 - Polyketide

Gene cluster description

NC_037254 - Gene Cluster 4. Type = polyketide. Location: 20279066 - 20555128 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037254 - Cluster 5 - Saccharide

Gene cluster description

NC_037254 - Gene Cluster 5. Type = saccharide. Location: 22758296 - 23091821 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

No significant ClusterBlast hits found.

NC_037255 - Cluster 6 - Cyclopeptide

Gene cluster description

NC_037255 - Gene Cluster 6. Type = cyclopeptide. Location: 13407688 - 13940620 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112279812
Repeat occurs 3 times in a sequence of 290 amino acids
Location between 13657793 and 13658843
Coverage of 6.21 %
Instances:
EKSKDR | EKSGRS | EKSGRK |
pattern: EKS[KG][DR][KSR]
MGGHGGLNILPQKRWNVYNFDNREKVKRDEEKAAREEAIERQQARQREAELRLEKLREVAQAK
RNGSCEFRVADGEGTAEQPLLGNEQENEVAALAEAEGAKHFNLFEIAGQPEQGTAKAKGNNDWI
NDLKVGTRGGLDKQDIRELKKMEKSKDRAPEDEGYEFGCGLVGKSGKKPWYATKRFMVERTSDG
GVGSRISRSRSPEERPSKRSRRSEEKSGRSENRAKKEKREKSGRKSMEELRAERVRREQQERDR
AQKLLLANRKVNGDGVYGINRSGRPYYHQSFGNAR

Similar gene clusters

NC_037256 - Cluster 7 - Alkaloid

Gene cluster description

NC_037256 - Gene Cluster 7. Type = alkaloid. Location: 12171007 - 12312669 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_037256 - Cluster 8 - Cyclopeptide

Gene cluster description

NC_037256 - Gene Cluster 8. Type = cyclopeptide. Location: 13835300 - 15107786 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_037261 - Cluster 9 - Saccharide

Gene cluster description

NC_037261 - Gene Cluster 9. Type = saccharide. Location: 842924 - 913131 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037263 - Cluster 10 - Polyketide

Gene cluster description

NC_037263 - Gene Cluster 10. Type = polyketide. Location: 1708295 - 1779577 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037264 - Cluster 11 - Cyclopeptide

Gene cluster description

NC_037264 - Gene Cluster 11. Type = cyclopeptide. Location: 4250793 - 4968104 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_037265 - Cluster 12 - Cyclopeptide

Gene cluster description

NC_037265 - Gene Cluster 12. Type = cyclopeptide. Location: 11212273 - 12051162 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112290436
Repeat occurs 9 times in a sequence of 732 amino acids
Location between 11393637 and 11395836
Coverage of 8.61 %
Instances:
EDEEQDQ | EDELEQH | EDEHEKQ | EDEHEKQ | EDEHEKQ
EDEHEKQ | EDEHEKQ | EDEHEKQ | EDELEQH |
pattern: EDE[HLE][EQ][KDQ][HQ]
MYSLLSRKYSKQHHMRKPRCKFMHVCCSVTVLLILITVMCRPMPSAPSTPESIASMAVGESLQ
LNTQSIPAVEVAQETKTEADQSKLLENEMKEQVQGDSEATRSERREVEQEEGALRSGQEDEEQD
Q
HAEDKHEQRAENRHAQQAEDELEQHLEDVHGRHAEVKNKQRVEDEHEKQVEDEHEKQVEDEHE
KQ
VEDEHEKQMEDEHEKQMEDEHEKQVDDEHEQRVEDGQGQQAEDELEQHLEDVQDQHAEVKHQ
QRVDDEHEQQVKLDSRSAEHKQAGFPEEHQEGSKPKIKYSKELGAEHGHGIFQSKQSEQEQKVE
PEPQLKKHGREVTISCDGRRVYTYDLPPSMNIDILKNCSGKLVPWLNFCAHHQNYGFGIAVNTT
NNNFRKDWYGTDAYMLEVIFYERMRTYTCRTSNPGEADLFFIPFFSGLEALPYLYTDGKRRLQQ
GRELVEWLEANATQTWRRHGGHDHFLIAGRTAWDFCRPLTAVTWWGTSLFSNPEMENTTAMLLE
RRSWRGDEMAVPYPVGFHPSTSASLQSWIKLVRSSTRKYLFSFSGALRPQLVFSIREILSQQCT
QAGSACSRLDCGKIKCSHEPQPIYTSLLQAKFCLQPRGDTATRRSVIDSIVSGCIPVFFHKDTA
FTQYRWHLPNDYDNFSVFIDEEDIKNGKADVKKILEGYSAKQVEQMRERLIGIIPNVLYRHPKS
KDLSESMRDAFDLTIEGMAQKAIQFKSST
Repeat found in LOC112290116
Repeat occurs 3 times in a sequence of 529 amino acids
Location between 12001500 and 12005035
Coverage of 3.4 %
Instances:
KGGLSI | KGGIRM | KGGIKI |
pattern: KGG[LI][KSR][MI]
MAMFMLKGGLSIVAPSARLCAVAGSSSRFSTAVEVSATTTTAAKTIAVPTGHGTAVEQGDPTF
LESVDIYFDKAATIASATPDILAHIKAYNNILKVQFPLKCANGTVELIEGYRAQHSHHRLPVKG
GIRM
APNVDAEETMALAALMTFKCAVVDVPFGGAKGGIKIDPTKYSANEKESIIRRYTSELVRK
SFIGPSIDVPAPDYGTGPQEMAWIKDTYEHLKPNDINGAACVTGKPLEEGGIDGRQEATGLGVF
FCLRDFLDDEALVDKLGLTTGIKGKTFIVQGFGNVGRHTIDYIYDAGGLITAIAEVDGGLVAEC
ENGIDIPALKAYHKKMGTITSFPGAKTVKFAPGILELPCDVLIPAALETQIHSGNAGNVKAKIV
AEAANGPVTPLAETILEGKGVVILPDLLLNAGGVTVSYFEWLKNLNHIHFGRMSRRMEEKGQRM
LINALEAELGNGHKISHEVRSQVVKGNREVDFVWSGLEETMLLSWENVKRISQERNCNYRTAAY
LIAIERIATSYRVSGIFP
Repeat found in LOC112290116
Repeat occurs 3 times in a sequence of 529 amino acids
Location between 12001500 and 12005035
Coverage of 3.4 %
Instances:
KGGLSI | KGGIRM | KGGIKI |
pattern: KGG[LI][KSR][MI]
MAMFMLKGGLSIVAPSARLCAVAGSSSRFSTAVEVSATTTTAAKTIAVPTGHGTAVEQGDPTF
LESVDIYFDKAATIASATPDILAHIKAYNNILKVQFPLKCANGTVELIEGYRAQHSHHRLPVKG
GIRM
APNVDAEETMALAALMTFKCAVVDVPFGGAKGGIKIDPTKYSANEKESIIRRYTSELVRK
SFIGPSIDVPAPDYGTGPQEMAWIKDTYEHLKPNDINGAACVTGKPLEEGGIDGRQEATGLGVF
FCLRDFLDDEALVDKLGLTTGIKGKTFIVQGFGNVGRHTIDYIYDAGGLITAIAEVDGGLVAEC
ENGIDIPALKAYHKKMGTITSFPGAKTVKFAPGILELPCDVLIPAALETQIHSGNAGNVKAKIV
AEAANGPVTPLAETILEGKGVVILPDLLLNAGGVTVSYFEWLKNLNHIHFGRMSRRMEEKGQRM
LINALEAELGNGHKISHEVRSQVVKGNREVDFVWSGLEETMLLSWENVKRISQERNCNYRTAAY
LIAIERIATSYRVSGIFP
Repeat found in LOC112290116
Repeat occurs 3 times in a sequence of 529 amino acids
Location between 12001500 and 12005035
Coverage of 3.4 %
Instances:
KGGLSI | KGGIRM | KGGIKI |
pattern: KGG[LI][KSR][MI]
MAMFMLKGGLSIVAPSARLCAVAGSSSRFSTAVEVSATTTTAAKTIAVPTGHGTAVEQGDPTF
LESVDIYFDKAATIASATPDILAHIKAYNNILKVQFPLKCANGTVELIEGYRAQHSHHRLPVKG
GIRM
APNVDAEETMALAALMTFKCAVVDVPFGGAKGGIKIDPTKYSANEKESIIRRYTSELVRK
SFIGPSIDVPAPDYGTGPQEMAWIKDTYEHLKPNDINGAACVTGKPLEEGGIDGRQEATGLGVF
FCLRDFLDDEALVDKLGLTTGIKGKTFIVQGFGNVGRHTIDYIYDAGGLITAIAEVDGGLVAEC
ENGIDIPALKAYHKKMGTITSFPGAKTVKFAPGILELPCDVLIPAALETQIHSGNAGNVKAKIV
AEAANGPVTPLAETILEGKGVVILPDLLLNAGGVTVSYFEWLKNLNHIHFGRMSRRMEEKGQRM
LINALEAELGNGHKISHEVRSQVVKGNREVDFVWSGLEETMLLSWENVKRISQERNCNYRTAAY
LIAIERIATSYRVSGIFP
Repeat found in LOC112290116
Repeat occurs 3 times in a sequence of 529 amino acids
Location between 12001500 and 12005035
Coverage of 3.4 %
Instances:
KGGLSI | KGGIRM | KGGIKI |
pattern: KGG[LI][KSR][MI]
MAMFMLKGGLSIVAPSARLCAVAGSSSRFSTAVEVSATTTTAAKTIAVPTGHGTAVEQGDPTF
LESVDIYFDKAATIASATPDILAHIKAYNNILKVQFPLKCANGTVELIEGYRAQHSHHRLPVKG
GIRM
APNVDAEETMALAALMTFKCAVVDVPFGGAKGGIKIDPTKYSANEKESIIRRYTSELVRK
SFIGPSIDVPAPDYGTGPQEMAWIKDTYEHLKPNDINGAACVTGKPLEEGGIDGRQEATGLGVF
FCLRDFLDDEALVDKLGLTTGIKGKTFIVQGFGNVGRHTIDYIYDAGGLITAIAEVDGGLVAEC
ENGIDIPALKAYHKKMGTITSFPGAKTVKFAPGILELPCDVLIPAALETQIHSGNAGNVKAKIV
AEAANGPVTPLAETILEGKGVVILPDLLLNAGGVTVSYFEWLKNLNHIHFGRMSRRMEEKGQRM
LINALEAELGNGHKISHEVRSQVVKGNREVDFVWSGLEETMLLSWENVKRISQERNCNYRTAAY
LIAIERIATSYRVSGIFP
Repeat found in LOC112290116
Repeat occurs 3 times in a sequence of 516 amino acids
Location between 12001500 and 12005035
Coverage of 3.49 %
Instances:
KGGLSI | KGGIRM | KGGIKI |
pattern: KGG[LI][KSR][MI]
MAMFMLKGGLSIVAPSARLCAVAGSSSRFSTAVEVSATTTTAAKTIAVPTGHGTAVEQGDPTF
LESVDIYFDKAATIASATPDILAHIKCANGTVELIEGYRAQHSHHRLPVKGGIRMAPNVDAEET
MALAALMTFKCAVVDVPFGGAKGGIKIDPTKYSANEKESIIRRYTSELVRKSFIGPSIDVPAPD
YGTGPQEMAWIKDTYEHLKPNDINGAACVTGKPLEEGGIDGRQEATGLGVFFCLRDFLDDEALV
DKLGLTTGIKGKTFIVQGFGNVGRHTIDYIYDAGGLITAIAEVDGGLVAECENGIDIPALKAYH
KKMGTITSFPGAKTVKFAPGILELPCDVLIPAALETQIHSGNAGNVKAKIVAEAANGPVTPLAE
TILEGKGVVILPDLLLNAGGVTVSYFEWLKNLNHIHFGRMSRRMEEKGQRMLINALEAELGNGH
KISHEVRSQVVKGNREVDFVWSGLEETMLLSWENVKRISQERNCNYRTAAYLIAIERIATSYRV
SGIFP

Similar gene clusters

NC_037265 - Cluster 13 - Saccharide

Gene cluster description

NC_037265 - Gene Cluster 13. Type = saccharide. Location: 15021490 - 15076601 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037266 - Cluster 14 - Terpene

Gene cluster description

NC_037266 - Gene Cluster 14. Type = terpene. Location: 2544114 - 2653501 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

No significant ClusterBlast hits found.

NC_037268 - Cluster 15 - Cyclopeptide

Gene cluster description

NC_037268 - Gene Cluster 15. Type = cyclopeptide. Location: 1197835 - 1822552 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_037268 - Cluster 16 - Cyclopeptide

Gene cluster description

NC_037268 - Gene Cluster 16. Type = cyclopeptide. Location: 11908777 - 12778598 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 1010 amino acids
Location between 12113828 and 12119939
Coverage of 16.34 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[DSE][NKSR][KDSER][KSETAR][KDER][DRESTW][KDEIR][DSRETW][KDSER][LKDETR][DSEYTR][KDRE]
MPSGKKWRQSKEDSAARDEKGNLAPKIQTIPRPNPSPPPALRSSLYGKLARPQPAPSPPCAFI
ALVRLQGSYNRLLSLHRCCRYSWRCEKSISLVKYIETAQSFVTKSFEAKREAISYKMSNPSFGS
AGGQPEPGGGSIASTQPVMRPGYGMPIQQQPRPYAPPPAGFVPSQGPAPGQSAAPVMPPGGAPR
YSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPPGMLLNDLAVKGLAQVA
APPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSPKGFGFCEFESAEGVLR
ALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAANREEEAAPGVEVVPAPK
KAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERAKTRPLPPPPPLPSTSE
TNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSADVEASSADRGRRSERVS
ERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERLYEEREKEWESREWEKERQRL
HDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQREYEEDLADRIREQEE
RASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDPSAHVVSMFDSDSEEVA
GPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRRSAVPSVFTQEDEDPTP
DANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFAKSLSLSSAKVENDKDR
SRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERERDRDRSSRTESKTSETK
NVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITEFLGEEETTLVDFIVSS
TRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 1016 amino acids
Location between 12113828 and 12119939
Coverage of 16.24 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[DSE][NKSR][KDSER][KSETAR][KDER][DRESTW][KDEIR][DSRETW][KDSER][LKDETR][DSEYTR][KDRE]
MPSGKKWRQSKEDSAARDEKGNLAPKIQTIPRPNPSPPPALRSSLYGKLARPQPAPSPPCAFI
ALVRLQGSYNRLLSLHRCCRYSWRCEKSISLVKYIETAQSFVTKSFEAKREAISYKMSNPSFGS
AGGQPEPGGGSIASTQPVMRPGYGMPIQQQPRPYAPPPAGFVPSQGPAPGQSAAPVMPPGVPVS
GSAGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPPGMLLNDLAV
KGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSPKGFGFCEFE
SAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAANREEEAAPGV
EVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERAKTRPLPPPP
PLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSADVEASSADRG
RRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERLYEEREKEWESREW
EKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQREYEEDLAD
RIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDPSAHVVSMFD
SDSEEVAGPKEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRRSAVPSVFTQE
DEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFAKSLSLSSAKV
ENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERERDRDRSSRTES
KTSE
TKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITEFLGEEETTLV
DFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 1017 amino acids
Location between 12113828 and 12119939
Coverage of 16.22 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[DSE][NKSR][KDSER][KSETAR][KDER][DRESTW][KDEIR][DSRETW][KDSER][LKDETR][DSEYTR][KDRE]
MPSGKKWRQSKEDSAARDEKGNLAPKIQTIPRPNPSPPPALRSSLYGKLARPQPAPSPPCAFI
ALVRLQGSYNRLLSLHRCCRYSWRCEKSISLVKYIETAQSFVTKSFEAKREAISYKMSNPSFGS
AGGQPEPGGGSIASTQPVMRPGYGMPIQQQPRPYAPPPAGFVPSQGPAPGQSAAPVMPPGVPVS
GSAGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPPGMLLNDLAV
KGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSPKGFGFCEFE
SAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAANREEEAAPGV
EVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERAKTRPLPPPP
PLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSADVEASSADRG
RRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERLYEEREKEWESREW
EKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQREYEEDLAD
RIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDPSAHVVSMFD
SDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRRSAVPSVFTQ
EDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFAKSLSLSSAK
VENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERERDRDRSSRTE
SKTSE
TKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITEFLGEEETTL
VDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 898 amino acids
Location between 12113828 and 12119138
Coverage of 18.37 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[DSE][NKSR][KDSER][KSETAR][KDER][DRESTW][KDEIR][DSRETW][KDSER][LKDETR][DSEYTR][KDRE]
MSNPSFGSAGGQPEPGGGSIASTQPVMRPGYGMPIQQQPRPYAPPPAGFVPSQGPAPGQSAAP
VMPPGVPVSGSAGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPP
GMLLNDLAVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSP
KGFGFCEFESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAAN
REEEAAPGVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERA
KTRPLPPPPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSAD
VEASSADRGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERLYEER
EKEWESREWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQ
REYEEDLADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDP
SAHVVSMFDSDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRR
SAVPSVFTQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFA
KSLSLSSAKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERER
DRDRSSRTESKTSE
TKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITE
FLGEEETTLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGT
NKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 898 amino acids
Location between 12113828 and 12119138
Coverage of 18.37 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[DSE][NKSR][KDSER][KSETAR][KDER][DRESTW][KDEIR][DSRETW][KDSER][LKDETR][DSEYTR][KDRE]
MSNPSFGSAGGQPEPGGGSIASTQPVMRPGYGMPIQQQPRPYAPPPAGFVPSQGPAPGQSAAP
VMPPGVPVSGSAGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPP
GMLLNDLAVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSP
KGFGFCEFESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAAN
REEEAAPGVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERA
KTRPLPPPPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSAD
VEASSADRGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERLYEER
EKEWESREWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQ
REYEEDLADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDP
SAHVVSMFDSDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRR
SAVPSVFTQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFA
KSLSLSSAKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERER
DRDRSSRTESKTSE
TKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITE
FLGEEETTLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGT
NKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 838 amino acids
Location between 12113828 and 12118391
Coverage of 19.69 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[DSE][NKSR][KDSER][KSETAR][KDER][DRESTW][KDEIR][DSRETW][KDSER][LKDETR][DSEYTR][KDRE]
MLSGVMYIAGMKPGSGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVL
KPPPGMLLNDLAVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPT
TGSPKGFGFCEFESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEE
EAANREEEAAPGVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAML
AERAKTRPLPPPPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQ
TSADVEASSADRGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERL
YE
EREKEWESREWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKR
NRRQREYEEDLADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVI
LPDPSAHVVSMFDSDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIG
SGRRSAVPSVFTQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAA
AEFAKSLSLSSAKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDW
ERERDRDRSSRTESKTSETKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISK
KITEFLGEEETTLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGL
VTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 845 amino acids
Location between 12113828 and 12118391
Coverage of 19.53 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[DSE][NKSR][KDSER][KSETAR][KDER][DRESTW][KDEIR][DSRETW][KDSER][LKDETR][DSEYTR][KDRE]
MLSGVMYIAGMKPGSVPVSGSAGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGI
RPMPPVLKPPPGMLLNDLAVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSW
KRAQDPTTGSPKGFGFCEFESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQ
ERIALEEEAANREEEAAPGVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAV
QKLNAMLAERAKTRPLPPPPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDD
TISEQKQTSADVEASSADRGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENR
AREIERLYE
EREKEWESREWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRS
AAYDEKRNRRQREYEEDLADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTN
HVEEPVILPDPSAHVVSMFDSDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRK
LGFGLIGSGRRSAVPSVFTQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGL
AHNLAAAAEFAKSLSLSSAKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRD
REK
DRDWERERDRDRSSRTESKTSETKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEK
MRPWISKKITEFLGEEETTLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEV
RRVETGLVTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 811 amino acids
Location between 12113828 and 12118174
Coverage of 20.35 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[DSE][NKSR][KDSER][KSETAR][KDER][DRESTW][KDEIR][DSRETW][KDSER][LKDETR][DSEYTR][KDRE]
MVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPPGMLLNDLAVKGLAQVAAPPEKPH
TTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSPKGFGFCEFESAEGVLRALRLLNK
YSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAANREEEAAPGVEVVPAPKKAMNGVY
SDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERAKTRPLPPPPPLPSTSETNVKSSS
VADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSADVEASSADRGRRSERVSERERERD
LELDKEREVERYERERERKRLRRERDRENRAREIERLYEEREKEWESREWEKERQRLHDKEWEK
EREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQREYEEDLADRIREQEERASAELY
KRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDPSAHVVSMFDSDSEEVAGPIEEDA
GNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRRSAVPSVFTQEDEDPTPDANKLRT
LVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFAKSLSLSSAKVENDKDRSRRSRDK
ANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERERDRDRSSRTESKTSETKNVVNAKQ
LIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITEFLGEEETTLVDFIVSSTRKHVSA
ATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 811 amino acids
Location between 12113828 and 12118174
Coverage of 20.35 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[DSE][NKSR][KDSER][KSETAR][KDER][DRESTW][KDEIR][DSRETW][KDSER][LKDETR][DSEYTR][KDRE]
MVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPPGMLLNDLAVKGLAQVAAPPEKPH
TTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSPKGFGFCEFESAEGVLRALRLLNK
YSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAANREEEAAPGVEVVPAPKKAMNGVY
SDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERAKTRPLPPPPPLPSTSETNVKSSS
VADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSADVEASSADRGRRSERVSERERERD
LELDKEREVERYERERERKRLRRERDRENRAREIERLYEEREKEWESREWEKERQRLHDKEWEK
EREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQREYEEDLADRIREQEERASAELY
KRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDPSAHVVSMFDSDSEEVAGPIEEDA
GNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRRSAVPSVFTQEDEDPTPDANKLRT
LVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFAKSLSLSSAKVENDKDRSRRSRDK
ANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERERDRDRSSRTESKTSETKNVVNAKQ
LIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITEFLGEEETTLVDFIVSSTRKHVSA
ATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGTNKP

Similar gene clusters

NC_037269 - Cluster 17 - Cyclopeptide

Gene cluster description

NC_037269 - Gene Cluster 17. Type = cyclopeptide. Location: 4773059 - 5222143 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112294411
Repeat occurs 3 times in a sequence of 122 amino acids
Location between 5006881 and 5007702
Coverage of 14.75 %
Instances:
LELQLQ | LELELR | LELRVE |
pattern: LEL[ERQ][LV][ERQ]
MPQQPVRILFWLFLSTVRDCCLFGECAPECNSSICYKGIAPDCYLELQLQTHLVLELELRVEA
ECNQILVWSVVGDEILSFPYSRFDSAFPQFRSRCSTYGKGLSSCGHGVELEGAKERNVR

Similar gene clusters

NC_037270 - Cluster 18 - Saccharide-fatty_acid

Gene cluster description

NC_037270 - Gene Cluster 18. Type = saccharide-fatty_acid. Location: 12622112 - 13006844 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037270 - Cluster 19 - Terpene

Gene cluster description

NC_037270 - Gene Cluster 19. Type = terpene. Location: 14247923 - 14328287 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037273 - Cluster 20 - Polyketide

Gene cluster description

NC_037273 - Gene Cluster 20. Type = polyketide. Location: 4917138 - 5028041 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037273 - Cluster 21 - Saccharide-fatty_acid

Gene cluster description

NC_037273 - Gene Cluster 21. Type = saccharide-fatty_acid. Location: 10468017 - 10533054 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters