Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record CM038907.1:
Cluster 1Cyclopeptide69386647612373673.71BURP---
Cluster 2Fatty_acid1159331111796720203.41ADH_N, ADH_zinc_N, Aminotran_1_2, FA_desaturase---
The following clusters are from record CM038908.1:
Cluster 3Cyclopeptide3589493036273875378.94BURP---
The following clusters are from record CM038909.1:
Cluster 4Polyketide-Alkaloid139709261404884077.91Chal_sti_synt_C, Chal_sti_synt_N, His_biosynth, Lyase_aromatic, Str_synth, Trp_syntA---
Cluster 5Polyketide1411455614229257114.70Acetyltransf_1, Chal_sti_synt_C, Chal_sti_synt_N, Epimerase, Lyase_aromatic, p450---
Cluster 6Saccharide179256551802051694.86Glycos_transf_1, Lyase_aromatic, adh_short, p450---
The following clusters are from record CM038910.1:
Cluster 7Cyclopeptide1535767716353559995.88BURP---
The following clusters are from record CM038911.1:
Cluster 8Fatty_acid-Alkaloid10125221163759151.24AMP-binding, FA_desaturase_2, Str_synth---
Cluster 9Saccharide26885812822046133.47Amino_oxidase, Cellulose_synt, Dimerisation, Methyltransf_2---
Cluster 10Fatty_acid53386895481412142.72FA_desaturase, NAD_binding_1, adh_short---
The following clusters are from record CM038912.1:
Cluster 11Cyclopeptide45894618592572.70BURP, Glycos_transf_1, Methyltransf_11, NAD_binding_1---
Cluster 12Saccharide39151048771396.20Glycos_transf_1, Methyltransf_11, NAD_binding_1---
Cluster 13Saccharide14237091657677233.97COesterase, Cellulose_synt, Glycos_transf_2, adh_short_C2, p450---
Cluster 14Saccharide1938059919502744122.14Glyco_hydro_1, p450---
The following clusters are from record CM038913.1:
Cluster 15Cyclopeptide384219748464571004.26BURP---
Cluster 16Transporter_associated66286146849920221.31ABC2_membrane, ABC_tran, ADH_N, ADH_zinc_N, Acetyltransf_1, Chalcone_2, Methyltransf_11, p450---
Cluster 17Saccharide1915097319339495188.52Glycos_transf_1, Methyltransf_11, Oxidored_FMN---
The following clusters are from record CM038914.1:
Cluster 18Alkaloid223633372244993886.60ECH_2, Prenyltransf, Str_synth---
The following clusters are from record CM038915.1:
Cluster 19Saccharide397922532235134.31Cellulose_synt, Methyltransf_11, p450---
Cluster 20Putative1878337219063336279.96Dimerisation, Methyltransf_2, adh_short, adh_short_C2, p450---
The following clusters are from record CM038916.1:
Cluster 21Polyketide76606987881218220.52AMP-binding, Chal_sti_synt_C, Chal_sti_synt_N, Methyltransf_7, p450---
Cluster 22Cyclopeptide1552815316187053658.90BURP---
The following clusters are from record CM038917.1:
Cluster 23Putative26786312789195110.56Aminotran_1_2, Transferase, adh_short, p450---
Cluster 24Saccharide88487628983268134.51Glycos_transf_1, UbiA, adh_short---
Cluster 25Polyketide100941081013210037.99Acetyltransf_1, Chal_sti_synt_C, Chal_sti_synt_N---
Cluster 26Cyclopeptide1603461216334180299.57BURP---
The following clusters are from record CM038919.1:
Cluster 27Alkaloid71418679213377.95AMP-binding, Cu_amine_oxid, Methyltransf_11---
The following clusters are from record CM038920.1:
Cluster 28Alkaloid17336081860541126.93BBE, FAD_binding_4, NAD_binding_1, p450---
Cluster 29Cyclopeptide199614132896641293.52BURP---
The following clusters are from record CM038921.1:
Cluster 30Cyclopeptide189469638618449.15BURP---
Cluster 31Cyclopeptide84492119806031135.68BURP---
Cluster 32Fatty_acid25967092744882148.17Amino_oxidase, ECH_2, Transferase---
Cluster 33Saccharide3952526403453982.01AMP-binding, Glycos_transf_1, adh_short---
The following clusters are from record CM038922.1:
Cluster 34Saccharide1080897117942098.52Cellulose_synt, Epimerase, Methyltransf_11, adh_short---
The following clusters are from record CM038924.1:
Cluster 35Putative133656279516145.86Amino_oxidase, Aminotran_3, Lipoxygenase, oMT, p450---
Cluster 36Saccharide5945525599798752.46Glyco_hydro_1, p450---
The following clusters are from record CM038925.1:
Cluster 37Cyclopeptide30003383535741535.40BURP---
Cluster 38Polyketide-Saccharide1008057910186109105.53Chal_sti_synt_C, Epimerase, FAE1_CUT1_RppA, Glyco_hydro_1, Methyltransf_11---

CM038907 - Cluster 1 - Cyclopeptide

Gene cluster description

CM038907 - Gene Cluster 1. Type = cyclopeptide. Location: 6938664 - 7612373 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in CY35_01G043200
Repeat occurs 3 times in a sequence of 325 amino acids
Location between 7007193 and 7009510
Coverage of 7.38 %
Instances:
ATSTPGGD | ATSTPGGT | ATSTPGGD |
pattern: ATSTPGG[DT]
MATSTPGGDGASDTKRGMATSTPGGTSDTKRSIATSTPGGDEGASETKRSRLAALHRKHAAAV
AATTTPSPSKQPPPSSSSPASFRIFKTDPNDLNYAKLCPSVFNGPLVSQLQLRKDDSNVAVDKV
LRELLGSNAKMSGKIGGTIETRVQDRVLLLDNPATQGGAVDRAKNKAHRSRSKRSSKHLSLRQH
RHLGSFNLPIEYQKYELYLPMHEMWKEYARKLVHNCNDAMMQARLLTADLHGAMIAVVETKSTS
YMGTNGIMVRETENTFGVMTIKDKLRVVPKAGTVFTLQLDTLRVTLFGNNLFLRGLHPSKRQQI
KPTIEL
Repeat found in CY35_01G044300
Repeat occurs 29 times in a sequence of 1318 amino acids
Location between 7078729 and 7082686
Coverage of 30.8 %
Instances:
KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLLECIEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK
KRLVKLPKCMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK
KRLVKLPKCMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK
KRLVKLPECMEEMK | KRLVKLPKCMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK
KRLMKLLECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLDVGGCDLDCLP
KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLMKLPECMEEMK | KRLVKLPECIEEMT |
pattern: KRL[MDV][VK][LG][PGL][ECK][CD][LMI][ED][EC][LM][TPK]
MKSLTWLNVRGCDLDCLPQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTRLDVGGCDLDC
LPQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTSLDVSECDLDCLPQGMGRLEKLSDLNWS
NNKRLVKLLECIEEMKSLTRLDVSGCDLDCLPQGMGRLEELSNLNLSNNKRLVKLPECMEEMKS
LTSLDVGGCDLDCLPQGMGRLEELSNLNLSNNKRLVKLPECMEEMKSLKKLDVGGCDLDCLPQG
MGRLEKLSDLNLSNNKRLVKLPKCMEEMKSLTWLDVSGCDLDCLPQGMGRLEKLSDLNLSNNKR
LVKLPECMEEMK
SLTRLDVGGCDLDCLPRGMGRLEKLYGLILKDNKRLVKLPECMEEMKSLKKL
DVGGCDLDCLLRGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTRLDVGGCDLDCLPRGMGRL
EKLYGLILKDNKRLVKLPECMEEMKSLKKLDVGGCDLDCLPQGMGRLEKLSDLNLSNNKRLVKL
PKCMEEMK
SLTWFDVGGCDLDYLPQKMGRLEKLYGLILKDNKRLVKLPECMEEMKSLTWLDVSG
CDLDCLPQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTRLDVGGCDLDCLPRGMGRLEKLY
GLILKDNKRLVKLPECMEEMKSLKKLDVGGCDLDCLLRGMGRLEKLSDLNLSNNKRLVKLPECM
EEMK
SLTRLDVGGCDLDCLPRGMGRLEKLYGLILKDNKRLVKLPECMEEMKSLKKLDVGGCDLD
CLPQGMGRLEKLSDLNLSNNKRLVKLPKCMEEMKSLTWFDVGGCDLDYLPQKMGRLEKLYGLIL
KDNKRLVKLPECMEEMKSLTWLDVSGCDLDCLPQGMGRLEKLSDLNLNNNKRLVKLPECMEEMK
SLTWLDVSGCDLDCLPQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLKTLDVGGCDLDCLPQ
GMGRLEKLYELILKDNKRLMKLLECMEEMKSLTRLDVGGCDLDCLPQGMGRLEELSNLNLSNNK
RLVKLPECMEEMK
SLTWLDVSGCDLDCLPQGMGRLEKLSDLNLNNNKRLVKLPECMEEMKSLTW
LDVGGCDLDCLPQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLKRLDVGGCDLDCLPQGMGR
LEELSTLNLSNNKRLVKLPECMEEMKSLTWLDVGGCDLDCLLQGMGRLEKLSNLNLSNNKRLVK
LPECMEEMK
SLTRLDVGGCDLDCLPQGMGQLEKLSDLNLSNNKRLMKLPECMEEMKSLTRLDVG
GCDLDCLPQGMGRLEKLYGLILKDNKRLVKLPECIEEMT
Repeat found in CY35_01G044400
Repeat occurs 5 times in a sequence of 984 amino acids
Location between 7083149 and 7090399
Coverage of 4.07 %
Instances:
LNLKPISQ | LNLQERLQ | LNLSNNKR | LNLSNNKR | LNLSNNKR

pattern: LNL[SQK][EPN][RNI][LSK][RQ]
MENSMMNKCTSTIKNIHQRILESEEVLNQNQCEHLLMEFKRTLDIIKENVSKLTTCNEIGELL
TYDLHQVVIKVNDMVEACCSKKWWVEAIFQLHNEDAFMDILQDLKLCIDAMSTIVVDNNIHHVD
GESLNLKPISQDKLEDDQNQLSKRPSFQAICERLKKLKIELISSHLHTLKPTFGDKGLRTYASL
SSEIVDINKEMYQSHVDISKEEGEISKLPKTMEASLVTQMDNLQQSIESHKKNILNMPKAMEGQ
ILQSIPRELVGIEKIVLNLQERLQVQPTIGIVGMGGIGKTTMAKALYDHIYHNFEAHCFMPNIK
ANKDNFQLLIDILKELGHDGKITNIVKGEEVLRHLFCTKKMLIILDDVRCQKQLDDILPIDLDF
TNGSRIIMTSRSWIDLRNNVKEEGKFDMPYLDNNNAMELFKKYVANNQSERKEEFGLITSQIVK
ACGGLPLSLKVLGSYLRKETDLKIWQQALKKLQQAHSLDGRQNDEQLWGILRISFDELAEEERY
MFLDIVCFFCTSNNHSNMMTKATALRIWDDEKCSPELTLSTLVNMSLVQIASNGLFVVHDQLRD
MGRMISKKEYNGSRWNVEAKELTPQFLKGLEHTQGLLIEGGEILNYDQNVMVNMLDLRFLKVTS
WDCKHVNILIDIVKHSPNLKWLHLEFDHEFNGQKSLCQISPLFDLLELRVLNIMVGTYWNMYSS
TPTIFKKTICNFSKLKKLQEFGILGIFNLEFDENFGELSALKVVNLHVTSWKKLPKTFERLKNL
EELYLQQNANLIELPQSLGQLTNLKTINVSNCDLDYVPEGLGQLKNLSYLNLSNNKRLVKLLEC
MEEMKSLTSLNVGGCDLDCLPQGMGQLEKLSNLNLSNNKRLVKLPECMEEMKSLTWLDVGGCDL
DCLLQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTRLDVGGCDLDCLPQGMRRLEKLSNLI
LKNNKRLMKLPENGAIGKVVRLEFE
Repeat found in CY35_01G_324
Repeat occurs 5 times in a sequence of 1005 amino acids
Location between 7083149 and 7090399
Coverage of 3.98 %
Instances:
LNLKPISQ | LNLQERLQ | LNLSNNKR | LNLSNNKR | LNLSNNKR

pattern: LNL[SQK][EPN][RNI][LSK][RQ]
MENSMMNKCTSTIKNIHQRILESEEVLNQNQCEHLLMEFKRTLDIIKENVSKLTTCNEIGELL
TYDLHQVVIKVNDMVEACCSKKWWVEAIFQLHNEDAFMDILQDLKLCIDAMSTIVVDNNIHHVD
GESLNLKPISQDKLEDDQNQLSKRPSFQAICERLKKLKIELISSHLHTLKPTFGDKGLRTYASL
SSEIVDINKEMYQSHVDISKEEGEISKLPKTMEASLVTQMDNLQQSIESHKKNILNMPKAMEIS
SITQMQNFHFVEPHKEDNQGQILQSIPRELVGIEKIVLNLQERLQVQPTIGIVGMGGIGKTTMA
KALYDHIYHNFEAHCFMPNIKANKDNFQLLIDILKELGHDGKITNIVKGEEVLRHLFCTKKMLI
ILDDVRCQKQLDDILPIDLDFTNGSRIIMTSRSWIDLRNNVKEEGKFDMPYLDNNNAMELFKKY
VANNQSERKEEFGLITSQIVKACGGLPLSLKVLGSYLRKETDLKIWQQALKKLQQAHSLDGRQN
DEQLWGILRISFDELAEEERYMFLDIVCFFCTSNNHSNMMTKATALRIWDDEKCSPELTLSTLV
NMSLVQIASNGLFVVHDQLRDMGRMISKKEYNGSRWNVEAKELTPQFLKGLEHTQGLLIEGGEI
LNYDQNVMVNMLDLRFLKVTSWDCKHVNILIDIVKHSPNLKWLHLEFDHEFNGQKSLCQISPLF
DLLELRVLNIMVGTYWNMYSSTPTIFKKTICNFSKLKKLQEFGILGIFNLEFDENFGELSALKV
VNLHVTSWKKLPKTFERLKNLEELYLQQNANLIELPQSLGQLTNLKTINVSNCDLDYVPEGLGQ
LKNLSYLNLSNNKRLVKLLECMEEMKSLTSLNVGGCDLDCLPQGMGQLEKLSNLNLSNNKRLVK
LPECMEEMKSLTWLDVGGCDLDCLLQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTRLDVG
GCDLDCLPQGMRRLEKLSNLILKNNKRLMKLPENGAIGKVVRLEFE
Repeat found in CY35_01G047500
Repeat occurs 5 times in a sequence of 512 amino acids
Location between 7563737 and 7567965
Coverage of 5.86 %
Instances:
VGSTMD | VGSGVK | VGSGLG | VGSGVG | VGSGIG

pattern: VGS[GT][LMIV][GDK]
MVFVGLVFGFVVGVGLMTGLHYCMLHRSRKRIQKIAAIRLLNSIQQDELRKLCGSSFPTWVSF
PTFEKVNWLNHNLAKVWPSVVMATEQLVKEALQPILEQYRPPGIQALKLDKFNIGTVPPKFDGI
RVQSLHKSQVIMDMEFRWGGDASIILGINPVIGPKLPVQLKNFSLFTTVRVIFQLTEEMPCISA
VVVALLSKPKPQIKYTLKVIGGSTGAIPGLSEMIDEMIESAVADQVQWPHRIVVPIGNAPPDVL
SNLGLKLQGKLTVQVLKATNLKNLEMVGKSDPYVRLYVRVLFKEKTRVIDNNLNPVWNEQFEFD
VEDQETQSLILDVKDEDNIGTDKKLGVTSIPLASLKPDVEEEITKNLAVSLDRDRVKDKGDRGS
ITIKVLYHPYTKEEQDAAMEAEKKKLEEKERLKNAGIVGSTMDAVGSGVKLVGTGVGMVGSGLG
AGASVVGSGVGIVGSGIGKAGKRLSRVVTRHASSNKLTSPATASPVSASPMHQQNGSFRASITQ
E
Repeat found in CY35_01G_349
Repeat occurs 5 times in a sequence of 512 amino acids
Location between 7563737 and 7567965
Coverage of 5.86 %
Instances:
VGSTMD | VGSGVK | VGSGLG | VGSGVG | VGSGIG

pattern: VGS[GT][LMIV][GDK]
MVFVGLVFGFVVGVGLMTGLHYCMLHRSRKRIQKIAAIRLLNSIQQDELRKLCGSSFPTWVSF
PTFEKVNWLNHNLAKVWPSVVMATEQLVKEALQPILEQYRPPGIQALKLDKFNIGTVPPKFDGI
RVQSLHKSQVIMDMEFRWGGDASIILGINPVIGPKLPVQLKNFSLFTTVRVIFQLTEEMPCISA
VVVALLSKPKPQIKYTLKVIGGSTGAIPGLSEMIDEMIESAVADQVQWPHRIVVPIGNAPPDVL
SNLGLKLQGKLTVQVLKATNLKNLEMVGKSDPYVRLYVRVLFKEKTRVIDNNLNPVWNEQFEFD
VEDQETQSLILDVKDEDNIGTDKKLGVTSIPLASLKPDVEEEITKNLAVSLDRDRVKDKGDRGS
ITIKVLYHPYTKEEQDAAMEAEKKKLEEKERLKNAGIVGSTMDAVGSGVKLVGTGVGMVGSGLG
AGASVVGSGVGIVGSGIGKAGKRLSRVVTRHASSNKLTSPATASPVSASPMHQQNGSFRASITQ
E

Similar gene clusters

CM038907 - Cluster 2 - Fatty_acid

Gene cluster description

CM038907 - Gene Cluster 2. Type = fatty_acid. Location: 11593311 - 11796720 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038908 - Cluster 3 - Cyclopeptide

Gene cluster description

CM038908 - Gene Cluster 3. Type = cyclopeptide. Location: 35894930 - 36273875 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in CY35_02G208200
Repeat occurs 3 times in a sequence of 346 amino acids
Location between 36196274 and 36197698
Coverage of 6.94 %
Instances:
ATSTPKPD | ATSTPKSD | ATSIPKPD |
pattern: ATS[IT]PK[PS]D
MAFTNDALAGAHSCSLLILVMCLVVVAAVTTPSDYSPSVNPRATSTPKPDDQTRVDSRDAATS
TPKSD
VVGQIKYQYAFNNFYAATSIPKPDEDGQKKNRNADMHPKEDLENVPDFQMLGFFLESKL
VKGSTLELGANFLNDAANSRRDFLPGAVADTLPALKASNLPKLLQAFNIREGTEMASSMETTIS
YCEGIANLTPEEKPALCPTSEKAMAKFVSSLLGQNVELLTSTVISAKASAVSGPVTVVDYHTRS
ADIEDTPVVCHSLSFPSQVYYCHKLSKTRVLQATLEAAEGGRINAVAICHLDTSWWSSKHPAFA
ALNVQPGTEVCHWTVESTLVWVRNSNT
Repeat found in CY35_02G208400
Repeat occurs 6 times in a sequence of 649 amino acids
Location between 36214301 and 36216749
Coverage of 6.47 %
Instances:
ALESRCS | ALEEGRR | ALELYKR | ALEEGRR | ALEEGRR
ALEEGRR |
pattern: ALE[ELS][RGY][RCK][RS]
MAMRQRLALESRCSGPIASCGHIALVSSWSGHDGKVASGHTEKVWREFFSDPSQWWDCRPGKV
NARYPDFKHKKTQDALWLGDRRNPPWVAAGVAAMAPGTVQMDSFSWKRRLAKYVKAGEYEKTME
LFKQMQKRGMRPDRFTFVPVLNACAGLRALEEGRRAHQLIMQTGCEADVFVGSSVVDMYAKCGS
MEDAWRVFCKMPSQDVVTWTAMILGYVRCGQGQKALELYKRMQQEGVQPDAVTFVVVLNACASV
SALEEGRRTHERIIQSRCESNVFVRSSLIDMYAKCGSMEEASRVFNKLPSRAAVCWNAMIFGHV
KCGEGHKALQLFQKMQHEGVQPDPATYVGVLNACANVVALEEGRRTHERIIQSRCESNVFVKSS
LIDMYAKCGSMEEASRVFNKLPSRVVCWNAMIFGHVKCGEGHKALQLFQKMQHEGVQPDPATYV
GVLNACANVVALEEGRRTHERIIQSRCESNVFVKSSLVDMYAKCGSMEDACRVFNTMPSHDVVS
WNALLGGFAMHGQVYRIPAKLEHYTCMVDLLGRAGHLQEAENMIQGYVPNTKSVLHDVEEEEKV
IHLCHHSEKLAIAFGLMSTDPGTPLRIVKNLRVCEDCHTSTKFISKIVRRKIIVRDANRFHHFE
NGVCSCRDYW
Repeat found in CY35_02G_1714
Repeat occurs 6 times in a sequence of 697 amino acids
Location between 36214301 and 36216749
Coverage of 6.03 %
Instances:
ALESRCS | ALEEGRR | ALELYKR | ALEEGRR | ALEEGRR
ALEEGRR |
pattern: ALE[ELS][RGY][RCK][RS]
MAMRQRLALESRCSGPIASCGHIALVSSWSGHDGKVASGHTEKVWREFFSDPSQWWDCRPGKV
NARYPDFKHKKTQDALWLGDRRNPPWVAAGVAAMAPGTVQMDSFSWKRRLAKYVKAGEYEKTME
LFKQMQKRGMRPDRFTFVPVLNACAGLRALEEGRRAHQLIMQTGCEADVFVGSSVVDMYAKCGS
MEDAWRVFCKMPSQDVVTWTAMILGYVRCGQGQKALELYKRMQQEGVQPDAVTFVVVLNACASV
SALEEGRRTHERIIQSRCESNVFVRSSLIDMYAKCGSMEEASRVFNKLPSRAAVCWNAMIFGHV
KCGEGHKALQLFQKMQHEGVQPDPATYVGVLNACANVVALEEGRRTHERIIQSRCESNVFVKSS
LIDMYAKCGSMEEASRVFNKLPSRVVCWNAMIFGHVKCGEGHKALQLFQKMQHEGVQPDPATYV
GVLNACANVVALEEGRRTHERIIQSRCESNVFVKSSLVDMYAKCGSMEDACRVFNTMPSHDVVS
WNALLGGFAMHGQGKEALVHFERMCEEGVHPDDITFVCLLSACSHAGFVDEGLRFNALMTTVYR
IPAKLEHYTCMVDLLGRAGHLQEAENMIQGYVPNTKSVLHDVEEEEKVIHLCHHSEKLAIAFGL
MSTDPGTPLRIVKNLRVCEDCHTSTKFISKIVRRKIIVRDANRFHHFENGVCSCRDYW

Similar gene clusters

CM038909 - Cluster 4 - Polyketide-alkaloid

Gene cluster description

CM038909 - Gene Cluster 4. Type = polyketide-alkaloid. Location: 13970926 - 14048840 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038909 - Cluster 5 - Polyketide

Gene cluster description

CM038909 - Gene Cluster 5. Type = polyketide. Location: 14114556 - 14229257 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038909 - Cluster 6 - Saccharide

Gene cluster description

CM038909 - Gene Cluster 6. Type = saccharide. Location: 17925655 - 18020516 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038910 - Cluster 7 - Cyclopeptide

Gene cluster description

CM038910 - Gene Cluster 7. Type = cyclopeptide. Location: 15357677 - 16353559 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in CY35_04G087200
Repeat occurs 3 times in a sequence of 484 amino acids
Location between 15426660 and 15435744
Coverage of 4.34 %
Instances:
PIPPEIG | PIPKELG | PIPPEIS |
pattern: PIP[PK]E[LI][GS]
MPDPHVLQGPAMKNVHVCFLLSVCLVAASLQLLIYTESAVCEAVSAEGDALQTILREMNYSYT
YNTTDPCAWNETIVQCNSSTGSIIGLNFSNMGLSGPIPPEIGLFPSLQTLDFSSYPTNSQSCTG
PCNSVLGPIPKELGILTNLQILLLSRSTLSQPFPPPVLNLVKLTELAEFRHWCVNFWEQHCPIP
PEIS
NLSKLQILFLGNNSLVGPLPELGSLINLQQLTLWGNGIRDNIPAQWGKLGNLTYLNLHDN
LLWGPLPPELGNLSKLTRMLIYGNDLTGSIPDSWKGLTSVSELNIRQNYLSDPFPAWIVSLPSL
KTLDISHNQFTGTIPGFNESASLLNAYHVNTQNVDFGCNYFNGSQPTNLPPNVNITQNCFTGFT
KASDSCLRQFSCLTIFKNFEGNCPPCPGNFVLYNSSACLCQPSDYIASSSKSTLPIGAVVGGLI
GGLALILAMVALYLFKPIIFNPYVKRKNDCSLPSSPL
Repeat found in CY35_04G087400
Repeat occurs 5 times in a sequence of 938 amino acids
Location between 15469323 and 15475503
Coverage of 4.8 %
Instances:
EEEAEAAAK | EEEEEEDHE | EEEEEDHEE | EEEEDHEED | EEEDHEEDV

pattern: EEE[EAD][EDH][EADH][EADH][EADH][EVDK]
MTSRICVKNLPSYVTEQKLREHFSGKGEVTDAKIIRTRDGKTRQFGFVGFRTEEEAEAAAKYF
HQSFFDTSRLTCELAQPIGASNLGRPWSRHSKGSSAFDKSHTPIISEDLTISGRVDKERSKHKE
TKSSDNIENDPQLNEFLEVMQPRNKAKLWANDTTISNAAKSETGSKLGRLEKQRGEVSGKKPAL
EVKTVEGVLRRVPIGKGKGGDQLLQTHLRYEGTDDDESDEDDDELYEEAPVLDLEVEGTTAHSG
ADPKDLMTLDKDPIVENAAVSDLDYMKARVKGGDWSSDDEGNADADETNFRKESSKSESETSCS
EGEEEEEEDHEEDVHMTDAPTKVVIAKEDGELKDDAGIRNHESQIRVDTGAPDGQETIVEVEAE
EQESVSETGRLFVRNLPYTASEDELAAHFGRFGELSQVHVVLDKTTKRSKGLAYILYMFPEAAV
RAMEELDKSIFQGRLLHILPAKRPPAAPEPKQAIHAGPGTTKYKQEREEQRKAAEASGHTQAWN
PLFMRPDTIAENVARQYGMSKSEFLDPGAEDLAVRMALGETHIIAETKRSLSDEGVDVEVLEEV
ASGRMGKVNRSKSVILVKNLPFTTSEADLVSMFGVFGTLGRVILPPTKTLALVEFLEAGEAHRA
FKTLAYKRFKHVPLYLEWAPENLLSGNGKSKVKTIEKGLGVGSNRAIVEKELVSAQDDNDVDQA
RSIFVKNLNFSTTESSLKKHFQKHVTEGTIRSVTIKKKLSKAGKSLSMGFGFVELDSAEAAKMF
CKSLQGTVLDGHALMLQLSHNTKKSGDSDSRASAVKKVDKKESSTKIIVRNVAFEATRKDLQQL
FNPFGQIKSVRLPKKFDGNHRGFAFVEFITRQEAQNAFEALQSTHLYGRHLVLERAKEGESLED
LRARTASQFFEDNGVDLDNRRPSKKRKQAALDDSHISFGQFAD
Repeat found in CY35_04G_794
Repeat occurs 5 times in a sequence of 975 amino acids
Location between 15469775 and 15475503
Coverage of 4.62 %
Instances:
EEEAEAAAK | EEEEEEDHE | EEEEEDHEE | EEEEDHEED | EEEDHEEDV

pattern: EEE[EAD][EDH][EADH][EADH][EADH][EVDK]
MSPSRSYGSTSQGREKSRTQRLFAPGVFLIFTRVYGFLGFTVCCGLKLQAHLKQPTAWCRGSC
FDQQERKFCSSVDGKTRQFGFVGFRTEEEAEAAAKYFHQSFFDTSRLTCELAQPIGASNLGRPW
SRHSKGSSAFDKSHTPIISEDLTISGRVDKERSKHKETKSSDNIENDPQLNEFLEVMQPRNKAK
LWANDTTISNAAKSETGSKLGRLEKQRGEVSGKKPALEVKTVEGVLRRVPIGKGKGGDQLLQTH
LRYEGTDDDESDEDDDELYEEAPVLDLEVEGTTAHSGADPKDLMTLDKDPIVENAAVSDLDYMK
ARVKGGDWSSDDEGNADADETNFRKESSKSESETSCSEGEEEEEEDHEEDVHMTDAPTKVVIAK
EDGELKDDAGIRNHESQIRVDTGAPDGQETIVEVEAEEQESVSETGRLFVRNLPYTASEDELAA
HFGRFGELSQVHVVLDKTTKRSKGLAYILYMFPEAAVRAMEELDKSIFQGRLLHILPAKRPPAA
PEPKQAIHAGPGTTKYKQEREEQRKAAEASGHTQAWNPLFMRPDTIAENVARQYGMSKSEFLDP
GAEDLAVRMALGETHIIAETKRSLSDEGVDVEVLEEVASGRMGKVNRSKSVILVKNLPFTTSEA
DLVSMFGVFGTLGRVILPPTKTLALVEFLEAGEAHRAFKTLAYKRFKHVPLYLEWAPENLLSGN
GKSKVKTIEKGLGVGSNRAIVEKELVSAQDDNDVDQARSIFVKNLNFSTTESSLKKHFQKHVTE
GTIRSVTIKKKLSKAGKSLSMGFGFVELDSAEAAKMFCKSLQGTVLDGHALMLQLSHNTKKSGD
SDSRASAVKKVDKKESSTKIIVRNVAFEATRKDLQQLFNPFGQIKSVRLPKKFDGNHRGFAFVE
FITRQEAQNAFEALQSTHLYGRHLVLERAKEGESLEDLRARTASQFFEDNGVDLDNRRPSKKRK
QAALDDSHISFGQFAD
Repeat found in CY35_04G088700
Repeat occurs 3 times in a sequence of 163 amino acids
Location between 15704988 and 15705480
Coverage of 12.88 %
Instances:
SSSDEMS | SSSSEME | SSSEMES |
pattern: SSS[ESD][EM][EM][ES]
MTTAKGGRVSEKMQRAVRKLQHRIANSNQPYQNLLSSSDEMSAKSAVRRRASLSRSNRSSSSE
ME
SDEDGRESPPYDVPEGCLAVYVGREKQRFVISADYLNHAVFRILLEKSAEEFGFEHKGGLPI
ACDVAFFEHLLWLMESKDPALRKLDINELCGLCAYA
Repeat found in CY35_04G_809
Repeat occurs 3 times in a sequence of 163 amino acids
Location between 15704988 and 15705480
Coverage of 12.88 %
Instances:
SSSDEMS | SSSSEME | SSSEMES |
pattern: SSS[ESD][EM][EM][ES]
MTTAKGGRVSEKMQRAVRKLQHRIANSNQPYQNLLSSSDEMSAKSAVRRRASLSRSNRSSSSE
ME
SDEDGRESPPYDVPEGCLAVYVGREKQRFVISADYLNHAVFRILLEKSAEEFGFEHKGGLPI
ACDVAFFEHLLWLMESKDPALRKLDINELCGLCAYA
Repeat found in CY35_04G_810
Repeat occurs 3 times in a sequence of 163 amino acids
Location between 15704988 and 15705480
Coverage of 12.88 %
Instances:
SSSDEMS | SSSSEME | SSSEMES |
pattern: SSS[ESD][EM][EM][ES]
MTTAKGGRVSEKMQRAVRKLQHRIANSNQPYQNLLSSSDEMSAKSAVRRRASLSRSNRSSSSE
ME
SDEDGRESPPYDVPEGCLAVYVGREKQRFVISADYLNHAVFRILLEKSAEEFGFEHKGGLPI
ACDVAFFEHLLWLMESKDPALRKLDINELCGLCAYA

Similar gene clusters

CM038911 - Cluster 8 - Fatty_acid-alkaloid

Gene cluster description

CM038911 - Gene Cluster 8. Type = fatty_acid-alkaloid. Location: 1012522 - 1163759 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038911 - Cluster 9 - Saccharide

Gene cluster description

CM038911 - Gene Cluster 9. Type = saccharide. Location: 2688581 - 2822046 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038911 - Cluster 10 - Fatty_acid

Gene cluster description

CM038911 - Gene Cluster 10. Type = fatty_acid. Location: 5338689 - 5481412 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038912 - Cluster 11 - Cyclopeptide

Gene cluster description

CM038912 - Gene Cluster 11. Type = cyclopeptide. Location: 45894 - 618592 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in CY35_06G001100
Repeat occurs 3 times in a sequence of 552 amino acids
Location between 150202 and 152021
Coverage of 3.26 %
Instances:
RSEEWI | RSEDYI | RSEQSI |
pattern: RSE[EQD][WSY]I
MDEDSECNAIRQVLEERCPGAKFRGDDIKKLVDAGYDTETALAAASKDSLDRILPTRPGVVDV
LVKLFSQPGTEETPARAILTPQEHGYKDRSEEWIKHMLMPGLTTLPQHDYDLANVLFPEKHGKL
PVNTIMYHERLFKKPNLTERFLIEADESVVPTMLTWALSLPLGPAVSEMSTSARIVGFLLDLFE
LCRSYSPYKFTYQFFFNCNETTAGSRSSSRSRPDTLLTASGCTFLIGEDKLSSLFDAEGDLTAK
VRPLSTTFYGELQFILGYIAAGSAFQWLSIGKSGLVRRVGPRLDLSTDVGGCKFLLSIGYAYQL
IQKMIDSVPEVPGQHAMFTVDIVGDRKIYFFPDRVKKSIKNFNSFCRELMTGLDVILRAYKAGE
RCPSLPQLMGKRSEDYISRTGTYTVEISPLGHAPILRSEQSIRMMTKNVCEALSVLHEAGLVHR
DVRLPNVVQVSLEQFMLIDLETVAASPFRLPEGFQYFRGWSIEMLEGSFYTPMSDMYQLGRLLK
KDVHWMTEISESALQFIRKLTSKKCTAAMALSDPWLSDPIL
Repeat found in CY35_06G_1
Repeat occurs 3 times in a sequence of 536 amino acids
Location between 150202 and 152021
Coverage of 3.36 %
Instances:
RSEEWI | RSEDYI | RSEQSI |
pattern: RSE[EQD][WSY]I
MDEDSECNAIRQVLEERCPGAKFRGDDIKKLVDAGYDTETALAAASKDSLDRILPTRPGVVDV
LVKLFSQPGYKDRSEEWIKHMLMPGLTTLPQHDYDLANVLFPEKHGKLPVNTIMYHERLFKKPN
LTERFLIEADESVVPTMLTWALSLPLGPAVSEMSTSARIVGFLLDLFELCRSYSPYKFTYQFFF
NCNETTAGSRSSSRSRPDTLLTASGCTFLIGEDKLSSLFDAEGDLTAKVRPLSTTFYGELQFIL
GYIAAGSAFQWLSIGKSGLVRRVGPRLDLSTDVGGCKFLLSIGYAYQLIQKMIDSVPEVPGQHA
MFTVDIVGDRKIYFFPDRVKKSIKNFNSFCRELMTGLDVILRAYKAGERCPSLPQLMGKRSEDY
I
SRTGTYTVEISPLGHAPILRSEQSIRMMTKNVCEALSVLHEAGLVHRDVRLPNVVQVSLEQFM
LIDLETVAASPFRLPEGFQYFRGWSIEMLEGSFYTPMSDMYQLGRLLKKDVHWMTEISESALQF
IRKLTSKKCTAAMALSDPWLSDPIL

Similar gene clusters

CM038912 - Cluster 12 - Saccharide

Gene cluster description

CM038912 - Gene Cluster 12. Type = saccharide. Location: 391510 - 487713 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038912 - Cluster 13 - Saccharide

Gene cluster description

CM038912 - Gene Cluster 13. Type = saccharide. Location: 1423709 - 1657677 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038912 - Cluster 14 - Saccharide

Gene cluster description

CM038912 - Gene Cluster 14. Type = saccharide. Location: 19380599 - 19502744 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038913 - Cluster 15 - Cyclopeptide

Gene cluster description

CM038913 - Gene Cluster 15. Type = cyclopeptide. Location: 3842197 - 4846457 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in CY35_07G029900
Repeat occurs 4 times in a sequence of 118 amino acids
Location between 4009018 and 4009375
Coverage of 20.34 %
Instances:
GWMVGW | GWMNSS | GWMDGW | GWMDFL |
pattern: GWM[NDV][FGS][LWS]
MQQLEMKVSWVPMNSGLFLVLLFFFLFFFLLPSFLPSFPSADSLLLSAWERERKGSRKSHQQV
VVVLTVATWMDGWMVGWMNSSNGSPPGWMDGWMDFLQMGFHSKNSRRKENKRKEI

Similar gene clusters

CM038913 - Cluster 16 - Transporter_associated

Gene cluster description

CM038913 - Gene Cluster 16. Type = transporter_associated. Location: 6628614 - 6849920 nt. Click on genes for more information.
Show pHMM detection rules used
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038913 - Cluster 17 - Saccharide

Gene cluster description

CM038913 - Gene Cluster 17. Type = saccharide. Location: 19150973 - 19339495 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038914 - Cluster 18 - Alkaloid

Gene cluster description

CM038914 - Gene Cluster 18. Type = alkaloid. Location: 22363337 - 22449938 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038915 - Cluster 19 - Saccharide

Gene cluster description

CM038915 - Gene Cluster 19. Type = saccharide. Location: 397922 - 532235 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038915 - Cluster 20 - Putative

Gene cluster description

CM038915 - Gene Cluster 20. Type = putative. Location: 18783372 - 19063336 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038916 - Cluster 21 - Polyketide

Gene cluster description

CM038916 - Gene Cluster 21. Type = polyketide. Location: 7660698 - 7881218 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038916 - Cluster 22 - Cyclopeptide

Gene cluster description

CM038916 - Gene Cluster 22. Type = cyclopeptide. Location: 15528153 - 16187053 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

CM038917 - Cluster 23 - Putative

Gene cluster description

CM038917 - Gene Cluster 23. Type = putative. Location: 2678631 - 2789195 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038917 - Cluster 24 - Saccharide

Gene cluster description

CM038917 - Gene Cluster 24. Type = saccharide. Location: 8848762 - 8983268 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038917 - Cluster 25 - Polyketide

Gene cluster description

CM038917 - Gene Cluster 25. Type = polyketide. Location: 10094108 - 10132100 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038917 - Cluster 26 - Cyclopeptide

Gene cluster description

CM038917 - Gene Cluster 26. Type = cyclopeptide. Location: 16034612 - 16334180 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

CM038919 - Cluster 27 - Alkaloid

Gene cluster description

CM038919 - Gene Cluster 27. Type = alkaloid. Location: 714186 - 792133 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038920 - Cluster 28 - Alkaloid

Gene cluster description

CM038920 - Gene Cluster 28. Type = alkaloid. Location: 1733608 - 1860541 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038920 - Cluster 29 - Cyclopeptide

Gene cluster description

CM038920 - Gene Cluster 29. Type = cyclopeptide. Location: 1996141 - 3289664 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in CY35_14G013100
Repeat occurs 3 times in a sequence of 401 amino acids
Location between 2272337 and 2273543
Coverage of 4.49 %
Instances:
PGGAST | PGGGGH | PGGGST |
pattern: PGG[GA][GS][HT]
MSTVAANAGMSHGRRGNTPPGGASTFDLFSWSEGPDVTRSPPPVRHTATNDSNAFVSASDQSS
SSDSPVGRPTIRMHQPAGGISTITFGEQLSAEEADALLKRRPGSDLKKREMYGSGIFPGGGGHG
AAAASSDDQDQVGGPNGIAAAPVVDTRTSIRCYQSVGGAGVSQISFGGENCASPKKAVTVPEVA
KQKELSGTVETTDDLAARRVCSNAKSKELIGSKIFAAPTAADSLHIPYSRAQELLMRREEYYNS
SKQLQGISADQPDQPAAPRNIHTSVKVSNPAGGRCQISFGPEDGDQSSNDHLDVNSRKLHDQKK
AELSGHGIFTEYIRSAQSGDSTSAAYYTGSSSDQKNLHHQLSKAKLKEISGSDIFSDDKPADRD
FIGGIRKPPGGGSTISLQ
Repeat found in CY35_14G014100
Repeat occurs 4 times in a sequence of 378 amino acids
Location between 2536421 and 2537938
Coverage of 9.52 %
Instances:
TSTPKPDDQ | TSTPKPDEG | TSTPKPDEG | TSTPKPDED |
pattern: TSTPKPD[ED][GQD]
MAVTNDALAGAHSRSLLILVMCLVVVAAVTTPSDYSPSVNPRATSTPKPDDQTRVDSRDAATS
TPKPDEG
FKVVDEHVTTSTPKPDEGIKVVHEHVTRFRWPRPGNGGIKIFYVYAATSTPKPDEDG
QEKNRNADMHPKEDLENVPDFQMLGFFLESKLVKGSTMDLGANFLYDAANSRRDFLPGAVADTL
PALKASNLPKLLQAFNIREGTEMARSMETTISYCEAIANLTPDEKPASCPTSEKAMAKFVSSLL
GQNVELLTSKVISAKASAVSGPVTVVDYHTRSADIEDTPVVCHSLSFPSQVYYCHKLSKTRVLQ
ATLEAAEGGRINAVAICHLDTSWWSSKHPAFAALNVKPGTEVCHWTVESTLVWVRNSNA
Repeat found in CY35_14G014300
Repeat occurs 7 times in a sequence of 394 amino acids
Location between 2574870 and 2576440
Coverage of 10.66 %
Instances:
STPKPD | STPKPD | STPKPD | STPKPD | STPKPD
STPKPE | STPKPD |
pattern: STPKP[ED]
MAVTNDALAGAHSRSLLILVMCLVVVAAVTTPSDYSPSVNPPATSTPKPDDQTRVDSRDVATS
TPKPD
GDTFTNYGSTPKPDEADEFTTYGSTPKPDGDTFTNYGSTPKPDEADEFTTYGSTPKPEG
DTFTNYGSTPKPDEDGQEKNRNADMHPKEDLENVPDFQMLGVFLESKLVKGSTMDLGANFLYDA
ANSRRDFLPGAVADTLPALKASNLPKLLQAFNIREGTEMARSMERTISSCEGIANLTPEEKPAS
CPTSEKAMGKFVTSLLGQNVELLTSKVISAKASAVSGPVTVVDYHTRSADIKDTPVVCHSLSFP
SQVYYCHKLSKTRVLQATLEAAEGGRINAVAICHLDTSFWSSKHPAFAALNVKPGTEVCHWTVE
STLVWVRNSNT
Repeat found in CY35_14G014700
Repeat occurs 3 times in a sequence of 343 amino acids
Location between 2635949 and 2637381
Coverage of 7.87 %
Instances:
ATSTPKPDD | ATSTPKPDD | ATSTPKPDE |
pattern: ATSTPKPD[ED]
MAFTNDALAGAHSCSLLILVMCLVVVAAAVTTPSDYSPSVNPRATSTPKPDDQTRVDSRDAAT
STPKPDD
CMFLCIRYRDAATSTPKPDEDGQEKNRNADMHPKEDLENVPDFQMLGFFLESKLVKG
STMDFGANFLYDAANSRRDFLPGSVADTLPALIASNLPKLLQAFNIREGTEMARRMQTTISYCE
GIANLTPEEKPASCPTSEKAMAKFVSSLLGQNVELLTSKVISAKASAVSGPVTVVDYHTRSADI
EDTPVVCHSLSFPSQVYYCHKLSKTRVLQATLEAAERGRINAVAICHLDTSWWSSKHPAFAALN
VKPGTEVCHWTVESTLVWVRNSNT
Repeat found in CY35_14G014800
Repeat occurs 3 times in a sequence of 340 amino acids
Location between 2665341 and 2666752
Coverage of 8.82 %
Instances:
ATSTPKPDDQ | ATSTPKPDEE | ATSTPKPDED |
pattern: ATSTPKPD[ED][EQD]
MAFTNDALAGAHSCSLLILVMCLVVVAAVTTPSVNPRATSTPKPDDQTRVDSRDAATSTPKPD
EE
DPICDTCINYRDAATSTPKPDEDGQEKNRNADMHPKEDLENVPDFQMLGFFLESKLVKGSTM
DFGANFLYDVANSRRDFLPGAVADTLPALKASNLPKLLQAFNIREGTEIARSMETTISYCEGIA
NLTPEEKPASCPTSEKAMAKFVSSLLGQNVELLTSKVTSAKASAVSGPVTVVDYHTRSADNEDT
PVVCHSLSFPSQVYYCHKLSETRVLQATLEAAEGGRINAVAICHLDTSWWSSKHPAFAALNVKP
GTEVCHWTVESTLVWVRNSNT
Repeat found in CY35_14G014900
Repeat occurs 4 times in a sequence of 375 amino acids
Location between 2679014 and 2680532
Coverage of 12.8 %
Instances:
TSTPQPDDQTRV | TSTPKPDEEDPI | TSTPKPDEEDPI | TSTPKPDEAEPI |
pattern: TSTP[QK]PD[ED][EAQ][EDT][RP][IV]
MAFTNDALAGAHSCTLLILVMCLVVVAAAVTTPSDYSPSVNPRTTSTPQPDDQTRVDSRDAAT
STPKPDEEDPI
GDKYIKYRDAATSTPKPDEEDPIEDKYIKYRDAATSTPKPDEAEPIFVGYGYA
PTSPPKPDEDGQEKYRSADMLGFFLESKLVKGSTMDLVANFLYDAANSRRDFLPGAVADTLPAL
KASNLPKLLQAFNIREGTEMARRMQITISYCEGIANLTPEEKPASCPTSEKAMAKFVSSLLGQN
VELLTSKVISAKASAVRGPVTVVDYHTRSADIEDTPVVCHSFPFPSQVYYCHKLSKTRVLQATL
EAAEGGRINAVAICHLDTSWWSSKHPAFAALNVQPGTEVCHWTVESTLVWVRNSNT
Repeat found in CY35_14G015200
Repeat occurs 4 times in a sequence of 380 amino acids
Location between 2747860 and 2749383
Coverage of 12.63 %
Instances:
ATSTPKPDDQTR | ATSTPKSDEESR | ATSTPKSDEESR | ATSTPKPDEDGQ |
pattern: ATSTPK[PS]D[ED][EQD][GST][RQ]
MAFTNDALAGAHSCSLLILVMCLVVVAAVTTPSDYSPSVNPRATSTPKPDDQTRVDSRDAATS
TPKSDEESR
IGYGRIGYKSQVFYRDAATSTPKSDEESRIGYGRIEYKSQVYYRDAATSTPKPDE
DGQ
EKNKNADMHPKEDLENVPDFQMLGFFLESKLVKGSTMDLGANFLYDAANSRRDFLPGAVAE
TLPALKASNLPKLLQAFNIREGTEMARRMQTTISYCEGIANLTPEEKPASCPTSEKAMAKFVSS
LLGQNVELLTSKVISAKASAVSGPVTVVDYHTRSADIEDTPVVCHSLSFPSQVYYCHKLSKTRV
LQATLEAAEGGRINAVAVCHLDTSWWSSKHPAFAALNVKPGTEVCHWTVESTLVWVRNSNT
Repeat found in CY35_14G017400
Repeat occurs 6 times in a sequence of 358 amino acids
Location between 3108969 and 3110046
Coverage of 10.06 %
Instances:
LTGPLG | LTGPIS | LTGSIP | LTGPIP | LTGLAA
LTGPIP |
pattern: LTG[LPS][LAI][GASP]
MIIMGKFSFAFVLGFVLMISMQNIAVVVAQSCPAAERDALLAFKAGITEDPSGQLTTWVATTD
CCTWFGIFCKNGHVIDLDLRPDDASDDSGIYLTGPLGSSLFTLLELEVLDLSEMQNLTGPISPT
IGNLTNLVHLDFRTNKLTGSIPSSVGNLKKLNWLSLGGNPISGSIPSSIGTIGASLNYIYLGES
SLSGSLPDSLFQLSNLTSLFLGVSDLSGSLSPLVGNLGPTLTVLDLSSNSFSGALPSTLGQLTK
VFLLDLSFNKFTGQIPASIGNIGATLFNLELANNELTGPIPPTFAKQLTGLAALGLENNQLTGP
IP
QGAPFSNFSVDSFRPGNPGLCGHPLPPCTGGNRKKLL
Repeat found in CY35_14G017500
Repeat occurs 6 times in a sequence of 358 amino acids
Location between 3150703 and 3151780
Coverage of 10.06 %
Instances:
LTGPLG | LTGPIS | LTGSIP | LTGPIP | LTGLAA
LTGPIP |
pattern: LTG[LPS][LAI][GASP]
MIIMGKFSFAFVLGFVLMISMQNIAVVVAQSCPAAERDALLAFKAGITEDPSGQLTTWVATTD
CCTWFGISCRNGHVIDLDLRPDDASDNSGIYLTGPLGSSLFTLLELEVLDLSEMQNLTGPISPT
IGNLTNLVHLDFRTNKLTGSIPSSVGNLKKLNWLSLGGNPISGSVPSSIGTIGASLNYIYLGES
SLSGSLPDSLFQLSNLTSLFLGVSDLSGSLSPLVGNLGPTLTVLDLSSNSFSGALPSTLGQLTK
VSLLDLSFNKFTGQIPASIGNIGATLFNLELANNELTGPIPPTFAKQLTGLAALGLENNELTGP
IP
QGAPFSNFSVDSFRPGNPGLCGHPLPPCTGGNRKKLL
Repeat found in CY35_14G017900
Repeat occurs 6 times in a sequence of 358 amino acids
Location between 3196640 and 3197717
Coverage of 10.06 %
Instances:
LTGPLG | LTGPIS | LTGSIP | LTGPIP | LTGLAV
LTGPIP |
pattern: LTG[LPS][LAI][VGSP]
MIIMGKFSFVFVLGFVLMISMQNIAVVVAQSCPAAERDALLAFKAGITEDPSGQLTTWVATTD
CCTWSSIFCKNGHVIELDLRPDDAFDNSGIYLTGPLGSSLFTLLELEVLDLSEMQNLTGPISPT
IGNLKNLVHLDFRTNKLTGSIPSSVGNLKKLNFLSLGGNPISGSIPSSIGTIGASLNYIYLGQS
SLSGSLPDSLFQLSNLTSLFLEISDLSGSLSPLVGNLGPTLTVLDLSSNSFSGALPSTLGKLTK
VFLLDLSFNKFTGQIPASIGNIGATLFNLELANNELTGPIPPLFAKQLTGLAVLGLENNKLTGP
IP
QGAPFSNFSVDSFRPGNPGLCGHPLPPCTGGNRKKLL

Similar gene clusters

CM038921 - Cluster 30 - Cyclopeptide

Gene cluster description

CM038921 - Gene Cluster 30. Type = cyclopeptide. Location: 189469 - 638618 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in CY35_15G002100
Repeat occurs 17 times in a sequence of 738 amino acids
Location between 412838 and 415248
Coverage of 34.55 %
Instances:
AASSAVNPAHRFDNN | AASSAVNPAHRFDNN | AASSAVNPAHRFDNN | AASSAVNPAHRFDNN | AASSAVNPAHRFDNN
AASSAVNPAHRFDNN | AASSAVNPAHRFDNN | AASSAVNPAHRFDNN | AASSAVNPAHRFDNN | AASSAVNPAHRFDNN
AASSAVNPAHRFDNN | AASSAVNPAHRFDNN | AASSAVNPAHRFDNN | AASSAVNPAHRFDNN | AASSAVNAAHRFDNN
AASSAVNPAHRFDNN | AASSAVNPAHRQSEH |
pattern: AASSAVN[PA]AHR[FQ][SD][EN][NH]
MAFGNALRGAHTCRLLILFMCLVVIAAARKHSDHAASSAVNPAHRFDNNYYYKNVDAPKDEQA
ASSAVNPAHRFDNN
YYYKNVDAPKDEQAASSAVNPAHRFDNNYYYKNVDAPKDEQAASSAVNPA
HRFDNN
YYYKNVDAPKDEQAASSAVNPAHRFDNNYYYKNVDAPKEEQAASSAVNPAHRFDNNYY
YKNVDGPKDEQAASSAVNPAHRFDNNYYYKNVDAPKDEQAASSAVNPAHRFDNNYYYKNVDEPK
DEQAASSAVNPAHRFDNNYYYKNVDEPKDEQAASSAVNPAHRFDNNYYYKNVDAPKDEQAASSA
VNPAHRFDNN
YYYKNVDAPEDEQAASSAVNPAHRFDNNYYYKNVDAPEDEQAASSAVNPAHRFD
NN
YYYKNVDAPEDEQAASSAVNPAHRFDNNYYYKNVDAPEDEQAASSAVNAAHRFDNNYYYKNV
DAPEDEQAASSAVNPAHRFDNNYYYKNVDGPQDEQAASSAVNPAHRQSEHIHHESEHVHDFMLP
SYFLKSGLVKGSIIDLGVNFVHDTANPKRDFLPETVADTLPALKASNLPKLLQAFNISEGTETA
TDVERAIFYCEGIANLPAEEKDASCPTSEKAMAEFVSSQLGENAKLFHTTVVPAKALTVGANLT
IVEYNTRSADTEDNAVVCHNLPFPSQVYYCHKVSKTQVSQAILEAAEGLRINAVTLCHLDTSWW
SVNHLAFKTLNVPRGTEVCHWAFVSTLIWVPRSNI

Similar gene clusters

CM038921 - Cluster 31 - Cyclopeptide

Gene cluster description

CM038921 - Gene Cluster 31. Type = cyclopeptide. Location: 844921 - 1980603 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

CM038921 - Cluster 32 - Fatty_acid

Gene cluster description

CM038921 - Gene Cluster 32. Type = fatty_acid. Location: 2596709 - 2744882 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038921 - Cluster 33 - Saccharide

Gene cluster description

CM038921 - Gene Cluster 33. Type = saccharide. Location: 3952526 - 4034539 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038922 - Cluster 34 - Saccharide

Gene cluster description

CM038922 - Gene Cluster 34. Type = saccharide. Location: 1080897 - 1179420 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038924 - Cluster 35 - Putative

Gene cluster description

CM038924 - Gene Cluster 35. Type = putative. Location: 133656 - 279516 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038924 - Cluster 36 - Saccharide

Gene cluster description

CM038924 - Gene Cluster 36. Type = saccharide. Location: 5945525 - 5997987 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM038925 - Cluster 37 - Cyclopeptide

Gene cluster description

CM038925 - Gene Cluster 37. Type = cyclopeptide. Location: 3000338 - 3535741 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in CY35_19G017500
Repeat occurs 3 times in a sequence of 314 amino acids
Location between 3272428 and 3275205
Coverage of 7.64 %
Instances:
KEEEEEAM | KEEKRAAK | KEEKRAAK |
pattern: KEE[EK][ER][EA]A[MK]
MYHPSRGGVRGGRDQFNWDDVKADKHRENYLGHSIKAPVGRWQKGKDLMWYARDKQTAAEAAQ
AAREEIEKVKEEEEEAMRELLGLAPRREVRPRGSHLDKREQEELFKRGKAAEEVDPTYAQGERV
QGLGYAPAPKNEVPNKKDLLQMEGSPAGSPGDPLVGQSNGREMDREIRSSLAASREVHSPSVSD
DEDAKKHKRKEEKRAAKDKRKEEKRAAKQLRREKGEMASRGRDRSRDNKQRREDSDSSNEKEDR
RDLMRPTSHWDEKNWRHSVGHRGEKESEKRLKDSHNPDEDDLRVRERESHGSRTHKRKR
Repeat found in CY35_19G_160
Repeat occurs 3 times in a sequence of 314 amino acids
Location between 3272428 and 3275205
Coverage of 7.64 %
Instances:
KEEEEEAM | KEEKRAAK | KEEKRAAK |
pattern: KEE[EK][ER][EA]A[MK]
MYHPSRGGVRGGRDQFNWDDVKADKHRENYLGHSIKAPVGRWQKGKDLMWYARDKQTAAEAAQ
AAREEIEKVKEEEEEAMRELLGLAPRREVRPRGSHLDKREQEELFKRGKAAEEVDPTYAQGERV
QGLGYAPAPKNEVPNKKDLLQMEGSPAGSPGDPLVGQSNGREMDREIRSSLAASREVHSPSVSD
DEDAKKHKRKEEKRAAKDKRKEEKRAAKQLRREKGEMASRGRDRSRDNKQRREDSDSSNEKEDR
RDLMRPTSHWDEKNWRHSVGHRGEKESEKRLKDSHNPDEDDLRVRERESHGSRTHKRKR
The following known motifs were found in CDS CY35_19G018800
Location between 3485029 and 3486669
.L.Y..Y was found 2 times in this sequence
Sequence:
MEGHQNLIMPTEGYGYPDSLRTLTVWGILKESARVIQKHVKLLLPFVLTFNVPLSIYVAISWG
SKFPFDIGPAIHAPSLMQTSFTVTAIQQLVTNANPPNIAVTLVTGLFQALFVWLQIASIGYCVD
FVYKGNDNPEAEPVQEIFKRLPGAVFRIFVTSLWVFLLILCIDVLVMADYILCVQFFPSYMIYI
LLFHSTLFLGVSLIFSILFMFSNIISTLELGCYGRPALKQSFILVTQNKCRVLGIFVVQFLIGC
IVLAVGFFVSRLDPLHWTKPLGYICLFLVYLIYTVYSLVLQYVLYFSCKHLENATTNNYMALDA
HAP

Similar gene clusters

CM038925 - Cluster 38 - Polyketide-saccharide

Gene cluster description

CM038925 - Gene Cluster 38. Type = polyketide-saccharide. Location: 10080579 - 10186109 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters