Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record CM107116.1:
Cluster 1Putative1074340371118076314373.59ADH_N_2, ADH_zinc_N, Abhydrolase_3, NAD_binding_1, p450---
Cluster 2Fatty_acid-Alkaloid4419715604439528921981.33BBE, FAD_binding_4, FA_hydroxylase---
Cluster 3Alkaloid4989158415008552651939.42Bet_v_1, E1_dh---
Cluster 4Saccharide111069537111155456644850.29ADH_N, ADH_zinc_N, Epimerase, Methyltransf_7, UDPGT_2flavonoid, oleananes--
Cluster 5Cyclopeptide1258983472127206285913079.39BURP---
Cluster 6Transporter_associated-Terpene145461468914633488928734.20ABC_tran, AMP-binding, Terpene_synth, Terpene_synth_C, p450---
Cluster 7Saccharide151750770615211729773665.27Transferase, UDPGT_2flavonoid--
Cluster 8Saccharide177277287717788873446114.47Epimerase, Peptidase_S10, UDPGT_2, p450---
Cluster 9Saccharide180217034818092731807102.83ABC2_membrane, ABC_tran, Prenyltransf, UDPGT_2---
The following clusters are from record CM107117.1:
Cluster 10Putative1666524311681612611508.832OG-FeII_Oxy, DIOX_N-alpha-chaconine/alpha-solanine (9% of genes show similarity)BGC0002722.2_c1
Cluster 11Putative5534488335553224801873.65AMP-binding, Aminotran_1_2, Methyltransf_11---
Cluster 12Saccharide99824375210014346783190.93Dimerisation, Glyco_hydro_1, Methyltransf_2, UDPGT_2flavonoid--
Cluster 13Polyketide-Saccharide-Alkaloid102621402010285239512309.93Chal_sti_synt_C, FAE1_CUT1_RppA, Glycos_transf_1, Pyridoxal_deC---
Cluster 14Putative103025369910327906502536.95Aldo_ket_red, Chalcone, p450, polyprenyl_synt---
Cluster 15Saccharide125101206612533918972379.83ADH_N, Cellulose_synt, p450---
Cluster 16Saccharide144509268214470549181962.24Aldo_ket_red, Glyco_hydro_1---
The following clusters are from record CM107118.1:
Cluster 17Cyclopeptide90852782921528341300.05BURP---
Cluster 18Cyclopeptide9161504992100246485.20BURP---
Cluster 19Lignan976760061008654153189.41Dirigent, Methyltransf_11---
Cluster 20Cyclopeptide35793164037012223312190.59BURP---
Cluster 21Cyclopeptide3654938003700837894589.99BURP---
Cluster 22Cyclopeptide3673213743700559682734.59BURP---
Cluster 23Cyclopeptide368060841368681878621.04BURP---
Cluster 24Lignan-Polyketide6051442436082077453063.50Chal_sti_synt_C, Dirigent, Epimerase, FAE1_CUT1_RppA---
Cluster 25Saccharide6459496656501942924244.63Aminotran_1_2, Glycos_transf_2, SE---
Cluster 26Saccharide11251819241125741212559.29Acetyltransf_1, Aldo_ket_red, UDPGT_2flavonoid--
Cluster 27Saccharide116399701111685425824545.57Lipoxygenase, UDPGT_2cyanogenic glucoside-7, monoterpenoid-7--
Cluster 28Cyclopeptide118192962511886148916685.27BURP---
The following clusters are from record CM107119.1:
Cluster 29Fatty_acid9308065449338327223026.18ADH_N_2, ADH_zinc_N, FA_desaturase, Methyltransf_11---
Cluster 30Alkaloid9956910519982575042566.45Chalcone, Cu_amine_oxid---
Cluster 31Saccharide-Fatty_acid118477694211881833163406.37ADH_N, Cellulose_synt, FA_desaturase---
Cluster 32Cyclopeptide1349556425136902426319467.84BURP---
Cluster 33Cyclopeptide1353351149137063805817286.91BURP---
Cluster 34Cyclopeptide1504782181151668750011905.32BURP---
The following clusters are from record CM107120.1:
Cluster 35Saccharide61314156337698206.28UDPGT_2, p450small phenolic-3--
Cluster 36Cyclopeptide12109548113372135212625.87BURP---
Cluster 37Saccharide4734171344753991671982.032OG-FeII_Oxy, ADH_N, DIOX_N, PALP, UDPGT_2small phenolic-3--
Cluster 38Cyclopeptide49907425851176972912695.47BURP---
Cluster 39Alkaloid6168192796192430722423.79AMP-binding, Str_synth---
Cluster 40Saccharide7568896447594080912518.45UDPGT_2, p450, polyprenyl_synt*saccharide-2--
Cluster 41Putative9403392659438858743546.61AMP-binding, p450---
Cluster 42Saccharide9568721439602250863352.94Aminotran_1_2, UDPGT_2, adh_shortflavonoid, oleananes--
Cluster 43Lignan126422987312678363013606.43Dirigent, ECH_2, adh_short---
Cluster 44Saccharide12832870981284138509851.412OG-FeII_Oxy, DIOX_N, Glyco_hydro_1, Peptidase_S10---
Cluster 45Saccharide148511605314879753002859.25Glyco_hydro_1, p450---
The following clusters are from record CM107121.1:
Cluster 46Saccharide-Fatty_acid3355617683384732322911.46FA_desaturase, Glyco_hydro_1, UbiA---
Cluster 47Cyclopeptide3619746533653152723340.62BURP---
Cluster 48Fatty_acid6555636986591735213609.82CER1-like_C, FA_hydroxylase, Lycopene_cycl, Methyltransf_11---
Cluster 49Saccharide-Transporter_associated8516465768569535765307.00ABC2_membrane, ABC_tran, ADH_N, ADH_zinc_N, Cellulose_synt, Methyltransf_11, Pyridoxal_deC---
Cluster 50Saccharide109419264810954037271211.08Cellulose_synt, Lyase_aromatic---
The following clusters are from record CM107122.1:
Cluster 51Saccharide2231629682266125613449.59Aminotran_3, UDPGT_2flavonoid-2, oleananes-2--
Cluster 52Polyketide-Terpene4825305624835576471027.09Chal_sti_synt_C, Chal_sti_synt_N, Methyltransf_11, Prenyltrans---
Cluster 53Saccharide5189878245209465391958.71Aldo_ket_red, UDPGT_2flavonoid-7, oleananes-7--
Cluster 54Saccharide5850895645898642714774.71Cellulose_synt, Dimerisation, Methyltransf_2, p450---
Cluster 55Terpene6568512166581433561292.14Terpene_synth, Terpene_synth_C---
Cluster 56Fatty_acid-Alkaloid7230981157246976131599.50BBE, FAD_binding_4, FA_hydroxylase---
Cluster 57Saccharide-Terpene8720675068761066964039.19Epimerase, Glyco_hydro_1, Terpene_synth, Terpene_synth_C---
Cluster 58Saccharide8961364528973516041215.15Glyco_hydro_1, Methyltransf_11---
Cluster 59Cyclopeptide95890754597170244812794.90BURP---
Cluster 60Cyclopeptide97102617998809791517071.74BURP---
Cluster 61Cyclopeptide97578356398840789512624.33BURP---
Cluster 62Saccharide106129232810651335493841.22AMP-binding, Amino_oxidase, Glyco_hydro_1, Lipoxygenase---
Cluster 63Lignan110147358011033076521834.07Dirigent, p450---
Cluster 64Polyketide116789188411716362333744.35Chal_sti_synt_C, Epimerase, FAE1_CUT1_RppA, NAD_binding_4, p450---

CM107116 - Cluster 1 - Putative

Gene cluster description

CM107116 - Gene Cluster 1. Type = putative. Location: 107434037 - 111807631 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107116 - Cluster 2 - Fatty_acid-alkaloid

Gene cluster description

CM107116 - Gene Cluster 2. Type = fatty_acid-alkaloid. Location: 441971560 - 443952892 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107116 - Cluster 3 - Alkaloid

Gene cluster description

CM107116 - Gene Cluster 3. Type = alkaloid. Location: 498915841 - 500855265 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107116 - Cluster 4 - Saccharide

Gene cluster description

CM107116 - Gene Cluster 4. Type = saccharide. Location: 1110695371 - 1115545664 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107116 - Cluster 5 - Cyclopeptide

Gene cluster description

CM107116 - Gene Cluster 5. Type = cyclopeptide. Location: 1258983472 - 1272062859 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

CM107116 - Cluster 6 - Transporter_associated-terpene

Gene cluster description

CM107116 - Gene Cluster 6. Type = transporter_associated-terpene. Location: 1454614689 - 1463348892 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107116 - Cluster 7 - Saccharide

Gene cluster description

CM107116 - Gene Cluster 7. Type = saccharide. Location: 1517507706 - 1521172977 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107116 - Cluster 8 - Saccharide

Gene cluster description

CM107116 - Gene Cluster 8. Type = saccharide. Location: 1772772877 - 1778887344 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107116 - Cluster 9 - Saccharide

Gene cluster description

CM107116 - Gene Cluster 9. Type = saccharide. Location: 1802170348 - 1809273180 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107117 - Cluster 10 - Putative

Gene cluster description

CM107117 - Gene Cluster 10. Type = putative. Location: 166652431 - 168161261 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

CM107117 - Cluster 11 - Putative

Gene cluster description

CM107117 - Gene Cluster 11. Type = putative. Location: 553448833 - 555322480 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107117 - Cluster 12 - Saccharide

Gene cluster description

CM107117 - Gene Cluster 12. Type = saccharide. Location: 998243752 - 1001434678 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107117 - Cluster 13 - Polyketide-saccharide-alkaloid

Gene cluster description

CM107117 - Gene Cluster 13. Type = polyketide-saccharide-alkaloid. Location: 1026214020 - 1028523951 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107117 - Cluster 14 - Putative

Gene cluster description

CM107117 - Gene Cluster 14. Type = putative. Location: 1030253699 - 1032790650 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107117 - Cluster 15 - Saccharide

Gene cluster description

CM107117 - Gene Cluster 15. Type = saccharide. Location: 1251012066 - 1253391897 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107117 - Cluster 16 - Saccharide

Gene cluster description

CM107117 - Gene Cluster 16. Type = saccharide. Location: 1445092682 - 1447054918 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107118 - Cluster 17 - Cyclopeptide

Gene cluster description

CM107118 - Gene Cluster 17. Type = cyclopeptide. Location: 90852782 - 92152834 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_011716
Repeat occurs 4 times in a sequence of 242 amino acids
Location between 91740648 and 91756852
Coverage of 24.79 %
Instances:
CCYMKILGFGPGKIE | CCYMKILGFGPGKIE | CCYMKILGFGPGKIE | CCYMKEKSKFVTGST |
pattern: CCYMK[EI][LK][GS][KF][GF][PV][TG][GK][SI][ET]
MTWQYVILTLLHGVHITLLFRSLVLDQEKSKFVTGSTRKIMRGLLLHEDPWFWTRKNRSLSLV
LRERSCVDCCYMKILGFGPGKIEVCHWFYEKDHAWTCCYMKILGFGPGKIEVCHWFYEKDHAWT
CCYMKILGFGPGKIE
VCHWFYEKDHAWTCCYMKEKSKFVTGSTRKIMRGLLLHEGEGVVWNEVI
VRNVIHCYLEISYCVIYCYLVLSNYILGFGPGKIEVCHWFYEKDHAWIVAT

Similar gene clusters

No significant ClusterBlast hits found.

CM107118 - Cluster 18 - Cyclopeptide

Gene cluster description

CM107118 - Gene Cluster 18. Type = cyclopeptide. Location: 91615049 - 92100246 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_011716
Repeat occurs 4 times in a sequence of 242 amino acids
Location between 91740648 and 91756852
Coverage of 24.79 %
Instances:
CCYMKILGFGPGKIE | CCYMKILGFGPGKIE | CCYMKILGFGPGKIE | CCYMKEKSKFVTGST |
pattern: CCYMK[EI][LK][GS][KF][GF][PV][TG][GK][SI][ET]
MTWQYVILTLLHGVHITLLFRSLVLDQEKSKFVTGSTRKIMRGLLLHEDPWFWTRKNRSLSLV
LRERSCVDCCYMKILGFGPGKIEVCHWFYEKDHAWTCCYMKILGFGPGKIEVCHWFYEKDHAWT
CCYMKILGFGPGKIE
VCHWFYEKDHAWTCCYMKEKSKFVTGSTRKIMRGLLLHEGEGVVWNEVI
VRNVIHCYLEISYCVIYCYLVLSNYILGFGPGKIEVCHWFYEKDHAWIVAT

Similar gene clusters

No significant ClusterBlast hits found.

CM107118 - Cluster 19 - Lignan

Gene cluster description

CM107118 - Gene Cluster 19. Type = lignan. Location: 97676006 - 100865415 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107118 - Cluster 20 - Cyclopeptide

Gene cluster description

CM107118 - Gene Cluster 20. Type = cyclopeptide. Location: 357931640 - 370122233 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_012662
Repeat occurs 13 times in a sequence of 398 amino acids
Location between 362870641 and 362871838
Coverage of 19.6 %
Instances:
PNISAY | PNISAY | PNISAY | PNISAY | PNISAY
PNISAY | PNISAY | PNISAY | PNISAY | PNISAY
PNISAY | PNISAY | PNISAY |
pattern: PNISAY
The following known motifs were found:
FEPR was found 6 times in this sequence
MWFLCFTIKIVAVAESRKDLGEYWKLIMKDHDMLEEIQGLLDANTKNNFKTLKQSFDDKETKN
TVKDFEPRPNISAYENKDIDDKENKKGVEDFEPRPNISAYEDKDIDDKENKKGVEDFKPRPNIS
AY
ENKDIDDKENKKGVEDFKPRPNISAYENKDIGDKENKKGVEDFQPRPNISAYKNKDIDDKEN
KKGVEDFQPRPNISAYENKYIDSEENKKGVEDFEPRPNISAYTNKDIDVGEKKKGVEEFEPRPN
ISAY
KNKDIDDKENKKGVEDFQPRPNISAYENKYIDSEENKKGVEDFEPRPNISAYTNKDTDVG
EKKKGVEEFEPRPNISAYGDNEIDAGKKKKGVEDFKPRPNISAYGNNEINNKKKEKVVEDCEPK
PNISAY
GNNEIDAEI
Repeat found in RYX36_012668
Repeat occurs 4 times in a sequence of 158 amino acids
Location between 363511148 and 363511720
Coverage of 30.38 %
Instances:
FEPRPNVSSYED | FEPRPNVSSYED | FEPRPNVSSYED | FEPRPNLSAYGV |
pattern: FEPRPN[VL]S[AS]Y[EG][VD]
The following known motifs were found:
FEPR was found 4 times in this sequence
MRTTFALWPLLLLLFVAAIESRKDLGEHWKMVMKDKDMPEYILGMLDANTDKNLNIMKQSFKD
SKENFEPRPNVSSYEDPNVSSYEDDDIGVIEKKKFVKDFEPRPNVSSYEDPNVSSYEDDDIGVI
EKKKFVKDFEPRPNVSSYEDPNVSSYEDNNANNGKENKKAIKDFEPRPNLSAYGV
Repeat found in RYX36_012672
Repeat occurs 16 times in a sequence of 371 amino acids
Location between 365519860 and 365521015
Coverage of 38.81 %
Instances:
YLDGWLENT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWFKDA
YLDGWWKDT | YLDGWLKDT | YLDGWLKDI | YLDGWLKDT | YLDGWLKDT
YLDGWLKDI | YLDGWLKDI | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT
YLDGWLKQN |
pattern: YLDGW[WLF][EK][DNQ][ATNI]
MNQVDRPYLDGWLENTPLKNQKPSPNSNQVYLDGWLKDTRGEKTNVNLDFNQIYLDGWLKDTR
VKKAKVNHDSNQIYLDGWLKDTRAEKDKSNLDSNQKEKSNPDSNQVYLDGWFKDARGEKKKFNP
NSNQVYLDGWWKDTRVEKAKANHDSNQVYLDGWLKDTRVEKVKANPDSNHVYLDGWLKDIRTEK
SKVNPYSNQVYLDGWLKDTRIEKSKSTSNSNQVYLDGWLKDTHAEKAKSNPDSNQVYLDGWLKD
I
RAKKEKSTSEFNQVYLDGWLKDIRSEKEKSTPNSNQVYLDGWLKDTQTKNVKVNPNSNQVYLD
GWLKDT
RTKNAKSNHNSNQVYLDGWLKDTQTKNAKSNLDSKQIYLDGWLKQN
Repeat found in RYX36_012674
Repeat occurs 5 times in a sequence of 118 amino acids
Location between 365994786 and 365995807
Coverage of 33.9 %
Instances:
LKDTRVEK | LKDTRAEK | LKDIKTEK | LKDTRTEK | LKDTHTEK

pattern: LKD[TI][RKH][AVT]EK
MARRVILSLIPFLPLLLILKDTRVEKAKANHDSNQVYLDGLLKDTRAEKVKANHDSNQVYLDG
WLKDIKTEKSKVNPESKKVYLDGWLKDTRTEKAKSTSNSNQVYLDGWLKDTHTEK
Repeat found in RYX36_012675
Repeat occurs 13 times in a sequence of 304 amino acids
Location between 366067471 and 366068884
Coverage of 38.49 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDI
YLDGWLKDT | YLDGWLKDT | YLDGWLKDI | YLDGWLKDI | YLDGWLKDT
YLDGWLKDT | YLDGWLKDT | YLDGWLKQN |
pattern: YLDGWLK[DNQ][TNI]
MLINGHGSFARYMNQVDQTYLDGWLKNTPLKNQKLSPNSNQVYLDGWLKDTRGEKTKVNPNSN
QVYLDGWLKDTRVEKAKANHDTNQVYLDGWLKDTRAEKVKANPDYNKVYLDGWLKDIRTEKSKV
NPDSNQVYLDGWLKDTRAEKAKSTSNSNQVYLDGWLKDTHTEKAKSNPDSNQVYLDGWLKDIRA
EKEKSTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKNIKVNPNSNQVYLDGWL
KDT
RTENAKSNHNSNQVYLDGWLKDTRTKNAKSNLDSNQIYLDGWLKQN
Repeat found in RYX36_012676
Repeat occurs 4 times in a sequence of 73 amino acids
Location between 366192343 and 366195137
Coverage of 43.84 %
Instances:
QQQQQQND | QQQQQNDQ | QQQQNDQE | QQQNDQEN |
pattern: QQQ[NQ][DNQ][DNQ][EDNQ][EDNQ]
MEQPQQQQQQNDQENGGGGSSGKGSGFRQSSTRWTPTTDQIRILKVLYYNNGIRSPSAEQIHR
ISARLRQYVI
Repeat found in RYX36_012679
Repeat occurs 13 times in a sequence of 292 amino acids
Location between 366430533 and 366431412
Coverage of 40.07 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDI
YLDGWLKDT | YLDGWLKDT | YLDGWLKDI | YLDGWLKDI | YLDGWLKDT
YLDGWLKDT | YLDGWLKDT | YLDGWLKQN |
pattern: YLDGWLK[DNQ][TNI]
MNQVDQTYLDGWLKNTPLKNQKLSPNSNQVYLDGWLKDTRGEKTKVNPNSNQVYLDGWLKDTR
VEKAKANHDTNQVYLDGWLKDTRAEKVKANPDYNKVYLDGWLKDIRIEKSNVNPDSNQVYLDGW
LKDT
RAEKAKSTSNSNQVYLDGWLKDTHTEKEKSNPDSNQVYLDGWLKDIRDDKDKSTSESNQV
YLDGWLKDI
RAEKEKSTPDSNQVYLDGWLKDTRTKNVKVNPNSNQVYLDGWLKDTRTENAKSNR
NSNQVYLDGWLKDTRTKNAKSNLDSNQIYLDGWLKQN
Repeat found in RYX36_012682
Repeat occurs 4 times in a sequence of 115 amino acids
Location between 366722748 and 366723128
Coverage of 48.7 %
Instances:
VYLDGLLKDTRADK | VYLDGWLKDIKTEK | VYLDGWLKDTRTEK | VYLDGWLKDTHTEK |
pattern: VYLDG[WL]LKD[TI][RKH][AT][ED]K
MDCLKTPEARKQKSTLTPTKVEKAKANHDSNQVYLDGLLKDTRADKDKANPDSNQVYLDGWLK
DIKTEK
SKVNPISKKVYLDGWLKDTRTEKAKSTSNSNQVYLDGWLKDTHTEK
Repeat found in RYX36_012684
Repeat occurs 5 times in a sequence of 156 amino acids
Location between 366912748 and 366913243
Coverage of 32.05 %
Instances:
YLDGWLKNTP | YLDGLLEDSR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTH

pattern: YLDG[WL]L[EK][DN][TS][PRH]
MKQVDQHYLDGWLKNTPLKNPKSFPNPNQGYLDGLLEDSRCEKTKVNPNSNQFYLDGWLKDTR
FEKAKSNQVYLDGWLKDTRVEKAKANPCPNQDIRTEKSKVNPESNQVYLNGWLKDTRTEKAKST
SNSNQVYLDGWLKDTHTEKEKSNPDSNQV
Repeat found in RYX36_012685
Repeat occurs 13 times in a sequence of 301 amino acids
Location between 366930948 and >366931854
Coverage of 38.87 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDI
YLDGWVKDT | YLDGWLKDT | YLDGWLKDI | YLDGWLKDI | YLDGWLKDT
YLDGWLKDT | YLDGWLKDT | YLDGWLKQN |
pattern: YLDGW[VL]K[DNQ][TNI]
NGHGSFARYMNQVDQTYLDGWLKNTPLKNQKLSPNSNQVYLDGWLKDTKGEKTKVNPNSNQVY
LDGWLKDT
RVEKAKANHDSNQVYLDGWLKDTRAEKVKANPDYNQVYLDGWLKDIRTEKSKVNPD
SNQVYLDGWVKDTRDEKAESTSNSNQVYLDGWLKDTHTEKAKSNPDSNQIYLDGWLKDIQAEKE
KSTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKNVKVNPNSNRVYLDGWLKDT
RTENAKSNHNSNQIYLDGWLKDTRTKNAKSNLDSNQIYLDGWLKQN
Repeat found in RYX36_012686
Repeat occurs 4 times in a sequence of 104 amino acids
Location between 367062316 and 367062631
Coverage of 34.62 %
Instances:
YLDGWLKNT | YLDGWLEDT | YLDGWLKDT | YLDGWLKDT |
pattern: YLDGWL[EK][DN]T
MKQVYQPYLDGWLKNTPLKNQKSFPNHKQGYLDGWLEDTIGEKTKVNPNSNQVYLDGWLKDTR
VEKAKANHDFNQVYLDGWLKDTRAEKVKANPYPTKFSLMDD
Repeat found in RYX36_012687
Repeat occurs 16 times in a sequence of 406 amino acids
Location between 367327729 and 367391701
Coverage of 27.59 %
Instances:
NQVDQPY | NQVYLDG | NQVYLDG | NQVYLDG | NQVYLDG
NQVYLDG | NQVYLDG | NQVYLDG | NQVYLDG | NQVYLDG
NQVYLDG | NQVYLDG | NQVCLDG | NQVYLDG | NQVYLDG
NQVYLDG |
pattern: NQV[YDC][LQ][PD][YG]
MAHKVILSLIPFLLLLLINGHGSFARYMNQVDQPYLDGWLKNTPLKNQKLSPNTNQVYLDGWL
KDTKGEKIKVNPNSNQVYLDGWLKDTRVEKAKANHDSNQVYLDGWLKDTRAENAKANPDSNKVY
LDGWLKDIRTEKSKVNPDSNQVYLDGWLKDTRGEKAKSTSTSNQVYLDGWLKDTHTEKARSNPD
SNQVYLDGWLKDIRAEKDKSTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKNV
KVNPNSNQVYLDGWLKDTRTENAKSNHNSNQVYLDGWLKDTRTEKAKSTSNSNQVYLDGWLKDT
HTEKEKSNPDSNQVCLDGWFKDIRAEKEKSTSESNQVYLDGWLKDIRAETEKSTPDSNQVYLDG
WLKGTRTKNVKVNHNSNQVYLDG
Repeat found in RYX36_012688
Repeat occurs 12 times in a sequence of 296 amino acids
Location between 367445335 and 367446723
Coverage of 36.49 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDEWLKDT | YLDGWLKDT | YLDGWLKDI
YLDGWLKDT | YLDGWLKDI | YLDGWLKDI | YLDGWLKDI | YLDGWLKDT
YLDGWFKDT | YLDGWLKQN |
pattern: YLD[EG]W[LF]K[DNQ][TNI]
MARRVILSLIPFLLLLLINGHGSFARYMNQVDQPYLDGWLKNTPLKNQKLSPNSNQVYLDGWL
KDT
KGEKTKVNPNSNQVYLDEWLKDTRVEKARANHDSNQVYLDGWLKDTRAENAKANPDSNQVY
LDGWLKDI
RTENSKVNPDSNQVYLDGWLKDTRGEKAKSTSTSNQVYLDGWLKDIHTEKAKSNPD
SNQVYLDGWLKDIRAEKDKSTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKNV
KVNPNSNQVYLDGWFKDTRTENAKSNHNSNHVYLDGWLKQN
Repeat found in RYX36_012689
Repeat occurs 13 times in a sequence of 319 amino acids
Location between 367494242 and 367495699
Coverage of 32.6 %
Instances:
LDGWIKNT | LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKDI
LDGWLKDT | LDGWLKDT | LDGWLKDI | LDGWLKDI | LDGWLKDT
LDGWLKDT | LDGWLKDT | LDGWLKQN |
pattern: LDGW[LI]K[DNQ][TNI]
MARRVILSLIPFLLLLLINGHGSFARYMNQVDQPNLDGWIKNTPLKNQKLSPNSNQVYLDGWL
KDT
KGEKTKVNPNSNQVYLDGWLKDTRVEKAKANHDSNQVYLDGWLKDTRAENAKANPDSNQVY
LDGWLKDI
STEKSKVNPDSNQVYLDGWLKDTRGEKAKSTSTSNQVYLDGWLKDTHTEKAKSNPD
SNQVYLDGWLKDIRAKKDKYTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKTV
KVNPNSNQVYLDGWLKDTRTENAKSNHNSNQVYLDGWLKDTRTKNAKSNLDSNQIYLDGWLKQN
Repeat found in RYX36_012690
Repeat occurs 10 times in a sequence of 240 amino acids
Location between 367524592 and 367525471
Coverage of 33.33 %
Instances:
LDGWLKNT | LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKDI
LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKQN

pattern: LDGWLK[DNQ][TNI]
MNQVDQPYLDGWLKNTPLKNQKLSPNSNQVYLDGWLKDTRGEKKKVNPNSNQVYLDGWLKDTR
VEKAKANHDSNQVYLDGWLKDTRSENAKANPDSNQVYLDGWLKDIKTEKSKVNPDSNQVYLDGW
LKDT
RGEKAKSTSTSNQVNLDGWLKDTHTEKAKSNPNSNQLKDTRTKNVKVNPNSNQVYLDGWL
KDT
RTENAKSNHNSNQVYLDGWLKDTRTKNAKSNIDSNQIYLDGWLKQN
Repeat found in RYX36_012692
Repeat occurs 15 times in a sequence of 567 amino acids
Location between 367672607 and 367674311
Coverage of 21.16 %
Instances:
YLDGWLKN | YLDGWLKN | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDIGSKI

pattern: YLD[GI][WG][LS]K[DNI]
MNQVDQPYLDGWLKNTPLKNQKLSSDSNQVYLDGWLKNTRGEKVKANPNSNQIYLDGWLKDIR
TEKAKGNHDSNQVYLDGWLKDTRVEKEKANSDSNQVYLDGWLKDTRVEKAKVNPESNQVYLDGW
LKD
NRDEKANVNPDSNQVYLDGWLKDARVEKEKSTPANRVYLDGWLKDARTEKEKSTHDSNQVY
LDGWLKD
PQNEKIETNTKSNHVYLDGWLKDNRDENAKVSADSNQVYLDGWLKDTRDLKEKLTID
SNQVYLDGWLKDTRDEKENSYPNSNQVYLDGWLKDSHVENSKSIQNSKQAYLDGWLKDSHAENH
MKIGQDFVESNEKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPKKVADDIPVS
KSQIPSLLDLFSLTKDSPQGEDMIDIINQCEFPPNKGETKACPTSLESMLEFVHSVIGAETKYN
IHSTSYPTTSGARLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGED
GDIMNALGICHLDTSDMNPSHFIFDLLGMKPGDAPLCHFFPVKHVLWVPSPPVVTK
Repeat found in RYX36_012693
Repeat occurs 12 times in a sequence of 506 amino acids
Location between 367708009 and 367709616
Coverage of 18.97 %
Instances:
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKY | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDIGSKI |
pattern: YLD[GI][WG][LS]K[YDI]
MDAEKAKGNHDSNQVYLDGWLKDTRVEEEKANSDSNQVYLDGWLKDTIVEKAKVNPESNQVYL
DGWLKD
NRDEKANVNPDSNQVYLDGWLKDARTEKEKSTPVNRVYLDGWLKDARTEKEKSTPDSN
QVYLDGWLKYPQNEKIETNTKSNHVYLDGWLKDNRDENAKASADSNQVYLDGWLKDTRDLKEKL
TLDSNQVYLDGWLKDTRDEKENSYPNSNQVYLDGWLKDSHVENSKSIQNSKQAYLDGWLKDSHA
ENHMKIGQDFVESNEKLSSKVDHTEAFKVAFFSIEDLYVGSVMTLQFPIREYAKFLPKKVADDI
PVSKSQIPSLLDLFSLTKDSPQGEDMIDIINQCEFPPNKGETKACPTSLESMLEFIHSVIGAET
KYNIHSTSYPTTSGARLQNYTVLDISNDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLK
GEDGDIMNALGICHLDTSDMNPSHFIFDLLGMKPGDAPLCHFFPVKHVLWVPSPPVVTK
Repeat found in RYX36_012694
Repeat occurs 14 times in a sequence of 567 amino acids
Location between 367730226 and 367731930
Coverage of 19.75 %
Instances:
YLDGWLKN | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKY
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDIGSKI |
pattern: YLD[GI][WG][LS]K[YDNI]
MNQVDQPYLDGWLKNTPLKNQKLSSDSNQVYLDGWLKDTRGEKVKVNPNSNQIYLDGWLKDIR
AEKAKGNHDSNQVYLDGWLKDTRVEKEKDNSDSNQVYLDGWLKDTRVEKAKVNPESNQVYLDGW
LKD
NRDEKANVNPDSNQVYLDGWLKDARTEKEKSTPVNRVYLDGWLKDARTEKEKFTPDSNQVY
LDGWLKD
PQNEKIETNTKSNHVYLDGWLKYNRDENAKASADSNQVYLDGWLKDTRDLKEKLTLD
SNQVYLDGWLKDTRDEKENSYPNSNQVYVDGWLKDSHVENSKSIQNSKQAYLDGWLKDSYAENH
MKIGQDFVESNEKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPKKVADDIPVS
KSQIPSLLDLFSLTKDSPQGEDMIDIINQCEFPPNKGETKACPTSLESMLEFVHSVIGAETKYN
IHSTSYPTTSGARLQNYTVLDISNDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGED
GDIMNALGICHLDTSDMNPSHFIFDLLGMKPGDAPLCHFFPVKHVLWVPSPPVVTK
Repeat found in RYX36_012695
Repeat occurs 13 times in a sequence of 438 amino acids
Location between 367828899 and 367830216
Coverage of 17.81 %
Instances:
DGWVKN | DGWLKD | DGWLKD | DGWLKD | DGWLKD
DGWLKD | DGWLKD | DGWLKD | DGWLKD | DGWLKD
DGWLKD | DGWLKD | DGWLKD |
pattern: DGW[VL]K[DN]
MNQVDQPYLDGWVKNTLLKNQKLSSDSNQVYLDGWLKDNRGEKVKVNPNSNQIYLDGWLKDIR
AEKAKGNHDSNQVYFDGWLKDTRVEKEKANSDSNQVYLDGWLKDTIVEKAKVNPKSNQVYLDGW
LKD
NRDEKAKVNHDSNQVYLDGWLKDAQTEKEKSTPANQVYLDGWLKDVRTEKEKSTPDSNQVY
LDGWLKDTHDEKIETNTNSNHVYLDGWLKDNRDENAKGSADSNQVYLDGWLKDTRDEKENSYPN
SNQVYLDGWLKDSHVENSKSMQNSKQAYLDGWLKDSHVVNHMKIGQDFEESNKKLSSKVDHTEA
FKVAFFDIEDLYVGSVMTLQFPIREYAKLLPKKVADDIPVSKSQISSLLDLFSLTKDSPRGEDM
IDIINQCEFPPNKGDTKAYLTSLESMLEFVHSVIGAETKYNIHSTSYPTTSGARL
Repeat found in RYX36_012702
Repeat occurs 5 times in a sequence of 108 amino acids
Location between 368394557 and 368394884
Coverage of 41.67 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDN

pattern: YLDGWLK[DN][TN]
MNQVDQPYLDGWLKNTPLKNQKPFSDSNQIYLDGWLKDTRGEKVKSNLDSNQVYLDGWLKDTR
NEKEKANPDSNQFYLDGWLKDTRVEKVKGNPDSNQVYLDGWLKDN
Repeat found in RYX36_012703
Repeat occurs 7 times in a sequence of 278 amino acids
Location between <368509853 and 368512007
Coverage of 32.73 %
Instances:
NQVDQPYLDGWLK | NQVYVNGWLKDTR | NQVYLDGWLKDTR | NQVYLDGCLKDTR | NQVYLDGWLKDNR
NQVYVDGWVKDTR | NQVDLYGWLKDNS |
pattern: NQV[YD][VLQ][PYDN][YG][WLC][LVD][GK][WD][LTN][RKS]
MFLLLLLINDHGSFARYMNQVDQPYLDGWLKNTPLKNQKSFHDSNQVYVNGWLKDTRSEKEKA
NPDSNQVYLDGWLKDTRDEKAKANLDSNQTEKAKANPEFNQVYLDGCLKDTRVKKEKVDPDSNQ
VYLDGWLKDNR
AEKANSNPNSNQVYVDGWVKDTRAKNAKFNPNSNQVDLYGWLKDNSGSVMTFQ
LPLREYANFLPRKVADHIPFSKSQVPSLLDLFSLTKDSTQDIYSPKWVACHPRPYLYGLYYFHY
LDIRSKIFKVLLKGEDGDIMNAL
Repeat found in RYX36_012705
Repeat occurs 9 times in a sequence of 229 amino acids
Location between 368517053 and 368518226
Coverage of 39.3 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTG
YLDGWLTDTR | YLDGWLKDTR | YLDGWLKDTR | YLDRWLKDTR |
pattern: YLD[RG]WL[TK][DN]T[PRG]
MFLLLLLINGHGSFARDMNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKV
NPDSNQVYLDGWLKDTRTEKAKANLDSNQVYLDGWLKDTRVEKEKSNPDSNQVYLDGWLKDTGA
KNAKLNPESNLVYLDGWLTDTRTEKEKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWL
KDTR
AEKEKSNPDFNQVYLDRWLKDTRVEKEKSNPDSN
Repeat found in RYX36_012706
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368521893 and 368523957
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAH
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAKSNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNPESNQVYLDGWLKDDKAEKANANPNSNQVYVDGWVKDTRSVK
VKSNPNSNQVYLDGWLKDAHAENEKSNPKSNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSPNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVIIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPLSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGFETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPIKHILWVPRPPAATPVATM
Repeat found in RYX36_012707
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368528159 and 368530223
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAH
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAKSNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNPESNQVYLDGWLKDDKAEKANANPNSNQVYVDGWVKDTRSVK
VKSNPNSNQVYLDGWLKDAHAENEKSNPKSNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSPNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPLSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGFETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPIKHILWVPRPPAATPVATM
Repeat found in RYX36_012708
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368534425 and 368536489
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTQAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNMDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPRM][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYPDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTQAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNMDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012709
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368540688 and 368542752
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNRDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYPDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012710
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368546951 and 368549015
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAQ
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGVKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGFNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012711
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368553215 and 368555279
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNRDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSPNYNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012712
Repeat occurs 20 times in a sequence of 714 amino acids
Location between 368806723 and 368809551
Coverage of 28.01 %
Instances:
YLDGWLKNTQ | YLDGWLKDAR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKNTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDSR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTQ | YLDGWLKDSH | YLDGWLKDSH | YLDIGSKIFK

pattern: YLD[GI][WG][LS]K[DNI][AFITS][RKHQ]
MAHIVVLSLLSFLLLMLINGHESFARDMSQVDQPYLDGWLKNTQLKNQKHFPDSNQVYLDGWL
KDAR
VEKEKSNPDSNHVYLDGWLKDTRDEKTKVNPESNQVYLDGWLKDTRAEKAKSNIDSNQVY
LDGWLKDIR
ADKAKANPDSNQVYLDGWLKDTRAEKAKANLESNQIYLDGWLKDTRVEKEKFNPD
SNQVYLDGWLKDTRAEKSKTNPDSNQVYLDGWLKNTRVEKAKFKPDSNQVYLDGWLKDIRDEKE
KSNHDSNQVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDIRVEKAKSTSDSNQVYLDGWLKDT
R
AENAKSNPDFNQVYLDGWLKDSRVEKAKANPDTNQVYLDGWLKDTRAENAKFNPESNQVYLDG
WLKDTR
GENAKSNPDSNQVYLDGWLKDTQVEKDNSSPNFNRVYLDGWLKDSHVEISKSIPNSKQ
AYLDGWLKDSHAVNHMKIRQDLAESNGKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREY
AKFLPRKFADDIPFSKSQVPSLLDLFSLTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESM
LEFVHSVIGSETNYSTHSTSHPTPSGARLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYL
DIGSKIFK
VLLKGEDGDIMNALGICHLDTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVP
RPPVATPVATM
Repeat found in RYX36_012713
Repeat occurs 19 times in a sequence of 714 amino acids
Location between 368831101 and 368833915
Coverage of 26.61 %
Instances:
YLDGWLKNTQ | YLDGWLKDAR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIQ | YLDGWLKDTR
YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDSR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTQ | YLDGWLKDSH | YLDGWLKDSH | YLDIGSKIFK |
pattern: YLD[GI][WG][LS]K[DNI][AFITS][RKHQ]
MAYRVVLSLLPFLLLLLINGHESFARDMSQVDQPYLDGWLKNTQLKNQKHFPDSNQVYLDGWL
KDAR
VEKEKSNPDSNHVYLDGWLKDTRDEKTKVNPESNQVYLDGWLKDTRAEKAKSNIDSNQVY
LDGWLKDIR
ADKAKANPDSNQVYLDGWLKDTRAEKAKANLESNQIYLDGWLKDTRAEKEKVNPD
SNQVYLDGWLKDTRAEKSKTNPDSNQVYIDGWLKNTRVEKAKFKPDSNQVYLDGWLKDIQDDKE
KSNHDSNQVYLDGWLKDTRSEKAKSNSDSNQVYLDGWLKDIRVEKAKSTSDSNQVYLDGWLKDT
R
AEKAKSNPDFNQVYLDGWLKDSRVEKAKANPDTNQIYLDGWLKDTRAENAKFNPESNQVYLDG
WLKDTR
DESAKSNPDSNQVYLDGWLKDTQVEKDNFSPNFNRIYLDGWLKDSHVEITKSIPNSKQ
AYLDGWLKDSHAVNHMKIRQDLAESNGKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREY
AKFLPRKVADDIPFSKSQVPSLLDLFSLTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESM
LEFVHSVIGSETNYSTHSTSHPTPSGARLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYL
DIGSKIFK
VLLKGEDGDIMNALGICHLDTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVP
RPPVATPVATM
Repeat found in RYX36_012715
Repeat occurs 19 times in a sequence of 528 amino acids
Location between 368867545 and 368869811
Coverage of 25.19 %
Instances:
GWLKNTQ | GWLKDAR | GWLKDTR | GWLKDTR | GWLKDIR
GWLKDTR | GWLKDTR | GWLKDTR | GWLKNTR | GWLKDIQ
GWLKDTR | GWLKDIR | GWLTDTR | GWLKDSR | GWLKDTR
GWLKDTR | GWLKDTQ | GWLKDSH | GWLKDSH |
pattern: GWL[TK][DN][ASTI][RHQ]
MAYRVVLSLLPFLLLLLINGHESFARDMSQVDQPYLDGWLKNTQLKNQKHFPDSNQVYLDGWL
KDAR
VEKEKSNPDSNHVYLDGWLKDTRDEKTKVNPESNQVYLDGWLKDTRAEKAKSNIDSNQVY
LDGWLKDIRADKAKANPDSNQVYLDGWLKDTRAEKAKANLESNQIYLDGWLKDTRAEKEKFNPD
SNQVYLDGWLKDTRAEKSETNPDSNQVYIDGWLKNTRVEKAKFKPDSNQVYLDGWLKDIQDEKE
KSNHDSNQVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDIRVEKAKSTSDSNQVYLDGWLTDT
R
AEKAKSNPDFNQVYLDGWLKDSRVEKAKANPDTNQIYLDGWLKDTRAENEKFNPESNQVYLDG
WLKDTR
GENTKSNPDSNQVYLDGWLKDTQVEKDNSSPNFNRIYLDGWLKDSHVEISKSIPNSKQ
AYLNGWLKDSHAVNHMKIRQDLAESNGKLSSKVDHTKAFKVAFFGIEDLYVGSVMTLQFPIREY
AKFLPRKVADDIPFSKS

Similar gene clusters

CM107118 - Cluster 21 - Cyclopeptide

Gene cluster description

CM107118 - Gene Cluster 21. Type = cyclopeptide. Location: 365493800 - 370083789 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_012672
Repeat occurs 16 times in a sequence of 371 amino acids
Location between 365519860 and 365521015
Coverage of 38.81 %
Instances:
YLDGWLENT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWFKDA
YLDGWWKDT | YLDGWLKDT | YLDGWLKDI | YLDGWLKDT | YLDGWLKDT
YLDGWLKDI | YLDGWLKDI | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT
YLDGWLKQN |
pattern: YLDGW[WLF][EK][DNQ][ATNI]
MNQVDRPYLDGWLENTPLKNQKPSPNSNQVYLDGWLKDTRGEKTNVNLDFNQIYLDGWLKDTR
VKKAKVNHDSNQIYLDGWLKDTRAEKDKSNLDSNQKEKSNPDSNQVYLDGWFKDARGEKKKFNP
NSNQVYLDGWWKDTRVEKAKANHDSNQVYLDGWLKDTRVEKVKANPDSNHVYLDGWLKDIRTEK
SKVNPYSNQVYLDGWLKDTRIEKSKSTSNSNQVYLDGWLKDTHAEKAKSNPDSNQVYLDGWLKD
I
RAKKEKSTSEFNQVYLDGWLKDIRSEKEKSTPNSNQVYLDGWLKDTQTKNVKVNPNSNQVYLD
GWLKDT
RTKNAKSNHNSNQVYLDGWLKDTQTKNAKSNLDSKQIYLDGWLKQN
Repeat found in RYX36_012674
Repeat occurs 5 times in a sequence of 118 amino acids
Location between 365994786 and 365995807
Coverage of 33.9 %
Instances:
LKDTRVEK | LKDTRAEK | LKDIKTEK | LKDTRTEK | LKDTHTEK

pattern: LKD[TI][RKH][AVT]EK
MARRVILSLIPFLPLLLILKDTRVEKAKANHDSNQVYLDGLLKDTRAEKVKANHDSNQVYLDG
WLKDIKTEKSKVNPESKKVYLDGWLKDTRTEKAKSTSNSNQVYLDGWLKDTHTEK
Repeat found in RYX36_012675
Repeat occurs 13 times in a sequence of 304 amino acids
Location between 366067471 and 366068884
Coverage of 38.49 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDI
YLDGWLKDT | YLDGWLKDT | YLDGWLKDI | YLDGWLKDI | YLDGWLKDT
YLDGWLKDT | YLDGWLKDT | YLDGWLKQN |
pattern: YLDGWLK[DNQ][TNI]
MLINGHGSFARYMNQVDQTYLDGWLKNTPLKNQKLSPNSNQVYLDGWLKDTRGEKTKVNPNSN
QVYLDGWLKDTRVEKAKANHDTNQVYLDGWLKDTRAEKVKANPDYNKVYLDGWLKDIRTEKSKV
NPDSNQVYLDGWLKDTRAEKAKSTSNSNQVYLDGWLKDTHTEKAKSNPDSNQVYLDGWLKDIRA
EKEKSTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKNIKVNPNSNQVYLDGWL
KDT
RTENAKSNHNSNQVYLDGWLKDTRTKNAKSNLDSNQIYLDGWLKQN
Repeat found in RYX36_012676
Repeat occurs 4 times in a sequence of 73 amino acids
Location between 366192343 and 366195137
Coverage of 43.84 %
Instances:
QQQQQQND | QQQQQNDQ | QQQQNDQE | QQQNDQEN |
pattern: QQQ[NQ][DNQ][DNQ][EDNQ][EDNQ]
MEQPQQQQQQNDQENGGGGSSGKGSGFRQSSTRWTPTTDQIRILKVLYYNNGIRSPSAEQIHR
ISARLRQYVI
Repeat found in RYX36_012679
Repeat occurs 13 times in a sequence of 292 amino acids
Location between 366430533 and 366431412
Coverage of 40.07 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDI
YLDGWLKDT | YLDGWLKDT | YLDGWLKDI | YLDGWLKDI | YLDGWLKDT
YLDGWLKDT | YLDGWLKDT | YLDGWLKQN |
pattern: YLDGWLK[DNQ][TNI]
MNQVDQTYLDGWLKNTPLKNQKLSPNSNQVYLDGWLKDTRGEKTKVNPNSNQVYLDGWLKDTR
VEKAKANHDTNQVYLDGWLKDTRAEKVKANPDYNKVYLDGWLKDIRIEKSNVNPDSNQVYLDGW
LKDT
RAEKAKSTSNSNQVYLDGWLKDTHTEKEKSNPDSNQVYLDGWLKDIRDDKDKSTSESNQV
YLDGWLKDI
RAEKEKSTPDSNQVYLDGWLKDTRTKNVKVNPNSNQVYLDGWLKDTRTENAKSNR
NSNQVYLDGWLKDTRTKNAKSNLDSNQIYLDGWLKQN
Repeat found in RYX36_012682
Repeat occurs 4 times in a sequence of 115 amino acids
Location between 366722748 and 366723128
Coverage of 48.7 %
Instances:
VYLDGLLKDTRADK | VYLDGWLKDIKTEK | VYLDGWLKDTRTEK | VYLDGWLKDTHTEK |
pattern: VYLDG[WL]LKD[TI][RKH][AT][ED]K
MDCLKTPEARKQKSTLTPTKVEKAKANHDSNQVYLDGLLKDTRADKDKANPDSNQVYLDGWLK
DIKTEK
SKVNPISKKVYLDGWLKDTRTEKAKSTSNSNQVYLDGWLKDTHTEK
Repeat found in RYX36_012684
Repeat occurs 5 times in a sequence of 156 amino acids
Location between 366912748 and 366913243
Coverage of 32.05 %
Instances:
YLDGWLKNTP | YLDGLLEDSR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTH

pattern: YLDG[WL]L[EK][DN][TS][PRH]
MKQVDQHYLDGWLKNTPLKNPKSFPNPNQGYLDGLLEDSRCEKTKVNPNSNQFYLDGWLKDTR
FEKAKSNQVYLDGWLKDTRVEKAKANPCPNQDIRTEKSKVNPESNQVYLNGWLKDTRTEKAKST
SNSNQVYLDGWLKDTHTEKEKSNPDSNQV
Repeat found in RYX36_012685
Repeat occurs 13 times in a sequence of 301 amino acids
Location between 366930948 and >366931854
Coverage of 38.87 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDI
YLDGWVKDT | YLDGWLKDT | YLDGWLKDI | YLDGWLKDI | YLDGWLKDT
YLDGWLKDT | YLDGWLKDT | YLDGWLKQN |
pattern: YLDGW[VL]K[DNQ][TNI]
NGHGSFARYMNQVDQTYLDGWLKNTPLKNQKLSPNSNQVYLDGWLKDTKGEKTKVNPNSNQVY
LDGWLKDT
RVEKAKANHDSNQVYLDGWLKDTRAEKVKANPDYNQVYLDGWLKDIRTEKSKVNPD
SNQVYLDGWVKDTRDEKAESTSNSNQVYLDGWLKDTHTEKAKSNPDSNQIYLDGWLKDIQAEKE
KSTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKNVKVNPNSNRVYLDGWLKDT
RTENAKSNHNSNQIYLDGWLKDTRTKNAKSNLDSNQIYLDGWLKQN
Repeat found in RYX36_012686
Repeat occurs 4 times in a sequence of 104 amino acids
Location between 367062316 and 367062631
Coverage of 34.62 %
Instances:
YLDGWLKNT | YLDGWLEDT | YLDGWLKDT | YLDGWLKDT |
pattern: YLDGWL[EK][DN]T
MKQVYQPYLDGWLKNTPLKNQKSFPNHKQGYLDGWLEDTIGEKTKVNPNSNQVYLDGWLKDTR
VEKAKANHDFNQVYLDGWLKDTRAEKVKANPYPTKFSLMDD
Repeat found in RYX36_012687
Repeat occurs 16 times in a sequence of 406 amino acids
Location between 367327729 and 367391701
Coverage of 27.59 %
Instances:
NQVDQPY | NQVYLDG | NQVYLDG | NQVYLDG | NQVYLDG
NQVYLDG | NQVYLDG | NQVYLDG | NQVYLDG | NQVYLDG
NQVYLDG | NQVYLDG | NQVCLDG | NQVYLDG | NQVYLDG
NQVYLDG |
pattern: NQV[YDC][LQ][PD][YG]
MAHKVILSLIPFLLLLLINGHGSFARYMNQVDQPYLDGWLKNTPLKNQKLSPNTNQVYLDGWL
KDTKGEKIKVNPNSNQVYLDGWLKDTRVEKAKANHDSNQVYLDGWLKDTRAENAKANPDSNKVY
LDGWLKDIRTEKSKVNPDSNQVYLDGWLKDTRGEKAKSTSTSNQVYLDGWLKDTHTEKARSNPD
SNQVYLDGWLKDIRAEKDKSTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKNV
KVNPNSNQVYLDGWLKDTRTENAKSNHNSNQVYLDGWLKDTRTEKAKSTSNSNQVYLDGWLKDT
HTEKEKSNPDSNQVCLDGWFKDIRAEKEKSTSESNQVYLDGWLKDIRAETEKSTPDSNQVYLDG
WLKGTRTKNVKVNHNSNQVYLDG
Repeat found in RYX36_012688
Repeat occurs 12 times in a sequence of 296 amino acids
Location between 367445335 and 367446723
Coverage of 36.49 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDEWLKDT | YLDGWLKDT | YLDGWLKDI
YLDGWLKDT | YLDGWLKDI | YLDGWLKDI | YLDGWLKDI | YLDGWLKDT
YLDGWFKDT | YLDGWLKQN |
pattern: YLD[EG]W[LF]K[DNQ][TNI]
MARRVILSLIPFLLLLLINGHGSFARYMNQVDQPYLDGWLKNTPLKNQKLSPNSNQVYLDGWL
KDT
KGEKTKVNPNSNQVYLDEWLKDTRVEKARANHDSNQVYLDGWLKDTRAENAKANPDSNQVY
LDGWLKDI
RTENSKVNPDSNQVYLDGWLKDTRGEKAKSTSTSNQVYLDGWLKDIHTEKAKSNPD
SNQVYLDGWLKDIRAEKDKSTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKNV
KVNPNSNQVYLDGWFKDTRTENAKSNHNSNHVYLDGWLKQN
Repeat found in RYX36_012689
Repeat occurs 13 times in a sequence of 319 amino acids
Location between 367494242 and 367495699
Coverage of 32.6 %
Instances:
LDGWIKNT | LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKDI
LDGWLKDT | LDGWLKDT | LDGWLKDI | LDGWLKDI | LDGWLKDT
LDGWLKDT | LDGWLKDT | LDGWLKQN |
pattern: LDGW[LI]K[DNQ][TNI]
MARRVILSLIPFLLLLLINGHGSFARYMNQVDQPNLDGWIKNTPLKNQKLSPNSNQVYLDGWL
KDT
KGEKTKVNPNSNQVYLDGWLKDTRVEKAKANHDSNQVYLDGWLKDTRAENAKANPDSNQVY
LDGWLKDI
STEKSKVNPDSNQVYLDGWLKDTRGEKAKSTSTSNQVYLDGWLKDTHTEKAKSNPD
SNQVYLDGWLKDIRAKKDKYTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKTV
KVNPNSNQVYLDGWLKDTRTENAKSNHNSNQVYLDGWLKDTRTKNAKSNLDSNQIYLDGWLKQN
Repeat found in RYX36_012690
Repeat occurs 10 times in a sequence of 240 amino acids
Location between 367524592 and 367525471
Coverage of 33.33 %
Instances:
LDGWLKNT | LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKDI
LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKQN

pattern: LDGWLK[DNQ][TNI]
MNQVDQPYLDGWLKNTPLKNQKLSPNSNQVYLDGWLKDTRGEKKKVNPNSNQVYLDGWLKDTR
VEKAKANHDSNQVYLDGWLKDTRSENAKANPDSNQVYLDGWLKDIKTEKSKVNPDSNQVYLDGW
LKDT
RGEKAKSTSTSNQVNLDGWLKDTHTEKAKSNPNSNQLKDTRTKNVKVNPNSNQVYLDGWL
KDT
RTENAKSNHNSNQVYLDGWLKDTRTKNAKSNIDSNQIYLDGWLKQN
Repeat found in RYX36_012692
Repeat occurs 15 times in a sequence of 567 amino acids
Location between 367672607 and 367674311
Coverage of 21.16 %
Instances:
YLDGWLKN | YLDGWLKN | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDIGSKI

pattern: YLD[GI][WG][LS]K[DNI]
MNQVDQPYLDGWLKNTPLKNQKLSSDSNQVYLDGWLKNTRGEKVKANPNSNQIYLDGWLKDIR
TEKAKGNHDSNQVYLDGWLKDTRVEKEKANSDSNQVYLDGWLKDTRVEKAKVNPESNQVYLDGW
LKD
NRDEKANVNPDSNQVYLDGWLKDARVEKEKSTPANRVYLDGWLKDARTEKEKSTHDSNQVY
LDGWLKD
PQNEKIETNTKSNHVYLDGWLKDNRDENAKVSADSNQVYLDGWLKDTRDLKEKLTID
SNQVYLDGWLKDTRDEKENSYPNSNQVYLDGWLKDSHVENSKSIQNSKQAYLDGWLKDSHAENH
MKIGQDFVESNEKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPKKVADDIPVS
KSQIPSLLDLFSLTKDSPQGEDMIDIINQCEFPPNKGETKACPTSLESMLEFVHSVIGAETKYN
IHSTSYPTTSGARLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGED
GDIMNALGICHLDTSDMNPSHFIFDLLGMKPGDAPLCHFFPVKHVLWVPSPPVVTK
Repeat found in RYX36_012693
Repeat occurs 12 times in a sequence of 506 amino acids
Location between 367708009 and 367709616
Coverage of 18.97 %
Instances:
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKY | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDIGSKI |
pattern: YLD[GI][WG][LS]K[YDI]
MDAEKAKGNHDSNQVYLDGWLKDTRVEEEKANSDSNQVYLDGWLKDTIVEKAKVNPESNQVYL
DGWLKD
NRDEKANVNPDSNQVYLDGWLKDARTEKEKSTPVNRVYLDGWLKDARTEKEKSTPDSN
QVYLDGWLKYPQNEKIETNTKSNHVYLDGWLKDNRDENAKASADSNQVYLDGWLKDTRDLKEKL
TLDSNQVYLDGWLKDTRDEKENSYPNSNQVYLDGWLKDSHVENSKSIQNSKQAYLDGWLKDSHA
ENHMKIGQDFVESNEKLSSKVDHTEAFKVAFFSIEDLYVGSVMTLQFPIREYAKFLPKKVADDI
PVSKSQIPSLLDLFSLTKDSPQGEDMIDIINQCEFPPNKGETKACPTSLESMLEFIHSVIGAET
KYNIHSTSYPTTSGARLQNYTVLDISNDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLK
GEDGDIMNALGICHLDTSDMNPSHFIFDLLGMKPGDAPLCHFFPVKHVLWVPSPPVVTK
Repeat found in RYX36_012694
Repeat occurs 14 times in a sequence of 567 amino acids
Location between 367730226 and 367731930
Coverage of 19.75 %
Instances:
YLDGWLKN | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKY
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDIGSKI |
pattern: YLD[GI][WG][LS]K[YDNI]
MNQVDQPYLDGWLKNTPLKNQKLSSDSNQVYLDGWLKDTRGEKVKVNPNSNQIYLDGWLKDIR
AEKAKGNHDSNQVYLDGWLKDTRVEKEKDNSDSNQVYLDGWLKDTRVEKAKVNPESNQVYLDGW
LKD
NRDEKANVNPDSNQVYLDGWLKDARTEKEKSTPVNRVYLDGWLKDARTEKEKFTPDSNQVY
LDGWLKD
PQNEKIETNTKSNHVYLDGWLKYNRDENAKASADSNQVYLDGWLKDTRDLKEKLTLD
SNQVYLDGWLKDTRDEKENSYPNSNQVYVDGWLKDSHVENSKSIQNSKQAYLDGWLKDSYAENH
MKIGQDFVESNEKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPKKVADDIPVS
KSQIPSLLDLFSLTKDSPQGEDMIDIINQCEFPPNKGETKACPTSLESMLEFVHSVIGAETKYN
IHSTSYPTTSGARLQNYTVLDISNDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGED
GDIMNALGICHLDTSDMNPSHFIFDLLGMKPGDAPLCHFFPVKHVLWVPSPPVVTK
Repeat found in RYX36_012695
Repeat occurs 13 times in a sequence of 438 amino acids
Location between 367828899 and 367830216
Coverage of 17.81 %
Instances:
DGWVKN | DGWLKD | DGWLKD | DGWLKD | DGWLKD
DGWLKD | DGWLKD | DGWLKD | DGWLKD | DGWLKD
DGWLKD | DGWLKD | DGWLKD |
pattern: DGW[VL]K[DN]
MNQVDQPYLDGWVKNTLLKNQKLSSDSNQVYLDGWLKDNRGEKVKVNPNSNQIYLDGWLKDIR
AEKAKGNHDSNQVYFDGWLKDTRVEKEKANSDSNQVYLDGWLKDTIVEKAKVNPKSNQVYLDGW
LKD
NRDEKAKVNHDSNQVYLDGWLKDAQTEKEKSTPANQVYLDGWLKDVRTEKEKSTPDSNQVY
LDGWLKDTHDEKIETNTNSNHVYLDGWLKDNRDENAKGSADSNQVYLDGWLKDTRDEKENSYPN
SNQVYLDGWLKDSHVENSKSMQNSKQAYLDGWLKDSHVVNHMKIGQDFEESNKKLSSKVDHTEA
FKVAFFDIEDLYVGSVMTLQFPIREYAKLLPKKVADDIPVSKSQISSLLDLFSLTKDSPRGEDM
IDIINQCEFPPNKGDTKAYLTSLESMLEFVHSVIGAETKYNIHSTSYPTTSGARL
Repeat found in RYX36_012702
Repeat occurs 5 times in a sequence of 108 amino acids
Location between 368394557 and 368394884
Coverage of 41.67 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDN

pattern: YLDGWLK[DN][TN]
MNQVDQPYLDGWLKNTPLKNQKPFSDSNQIYLDGWLKDTRGEKVKSNLDSNQVYLDGWLKDTR
NEKEKANPDSNQFYLDGWLKDTRVEKVKGNPDSNQVYLDGWLKDN
Repeat found in RYX36_012703
Repeat occurs 7 times in a sequence of 278 amino acids
Location between <368509853 and 368512007
Coverage of 32.73 %
Instances:
NQVDQPYLDGWLK | NQVYVNGWLKDTR | NQVYLDGWLKDTR | NQVYLDGCLKDTR | NQVYLDGWLKDNR
NQVYVDGWVKDTR | NQVDLYGWLKDNS |
pattern: NQV[YD][VLQ][PYDN][YG][WLC][LVD][GK][WD][LTN][RKS]
MFLLLLLINDHGSFARYMNQVDQPYLDGWLKNTPLKNQKSFHDSNQVYVNGWLKDTRSEKEKA
NPDSNQVYLDGWLKDTRDEKAKANLDSNQTEKAKANPEFNQVYLDGCLKDTRVKKEKVDPDSNQ
VYLDGWLKDNR
AEKANSNPNSNQVYVDGWVKDTRAKNAKFNPNSNQVDLYGWLKDNSGSVMTFQ
LPLREYANFLPRKVADHIPFSKSQVPSLLDLFSLTKDSTQDIYSPKWVACHPRPYLYGLYYFHY
LDIRSKIFKVLLKGEDGDIMNAL
Repeat found in RYX36_012705
Repeat occurs 9 times in a sequence of 229 amino acids
Location between 368517053 and 368518226
Coverage of 39.3 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTG
YLDGWLTDTR | YLDGWLKDTR | YLDGWLKDTR | YLDRWLKDTR |
pattern: YLD[RG]WL[TK][DN]T[PRG]
MFLLLLLINGHGSFARDMNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKV
NPDSNQVYLDGWLKDTRTEKAKANLDSNQVYLDGWLKDTRVEKEKSNPDSNQVYLDGWLKDTGA
KNAKLNPESNLVYLDGWLTDTRTEKEKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWL
KDTR
AEKEKSNPDFNQVYLDRWLKDTRVEKEKSNPDSN
Repeat found in RYX36_012706
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368521893 and 368523957
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAH
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAKSNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNPESNQVYLDGWLKDDKAEKANANPNSNQVYVDGWVKDTRSVK
VKSNPNSNQVYLDGWLKDAHAENEKSNPKSNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSPNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVIIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPLSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGFETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPIKHILWVPRPPAATPVATM
Repeat found in RYX36_012707
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368528159 and 368530223
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAH
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAKSNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNPESNQVYLDGWLKDDKAEKANANPNSNQVYVDGWVKDTRSVK
VKSNPNSNQVYLDGWLKDAHAENEKSNPKSNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSPNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPLSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGFETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPIKHILWVPRPPAATPVATM
Repeat found in RYX36_012708
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368534425 and 368536489
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTQAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNMDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPRM][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYPDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTQAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNMDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012709
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368540688 and 368542752
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNRDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYPDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012710
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368546951 and 368549015
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAQ
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGVKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGFNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012711
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368553215 and 368555279
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNRDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSPNYNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012712
Repeat occurs 20 times in a sequence of 714 amino acids
Location between 368806723 and 368809551
Coverage of 28.01 %
Instances:
YLDGWLKNTQ | YLDGWLKDAR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKNTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDSR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTQ | YLDGWLKDSH | YLDGWLKDSH | YLDIGSKIFK

pattern: YLD[GI][WG][LS]K[DNI][AFITS][RKHQ]
MAHIVVLSLLSFLLLMLINGHESFARDMSQVDQPYLDGWLKNTQLKNQKHFPDSNQVYLDGWL
KDAR
VEKEKSNPDSNHVYLDGWLKDTRDEKTKVNPESNQVYLDGWLKDTRAEKAKSNIDSNQVY
LDGWLKDIR
ADKAKANPDSNQVYLDGWLKDTRAEKAKANLESNQIYLDGWLKDTRVEKEKFNPD
SNQVYLDGWLKDTRAEKSKTNPDSNQVYLDGWLKNTRVEKAKFKPDSNQVYLDGWLKDIRDEKE
KSNHDSNQVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDIRVEKAKSTSDSNQVYLDGWLKDT
R
AENAKSNPDFNQVYLDGWLKDSRVEKAKANPDTNQVYLDGWLKDTRAENAKFNPESNQVYLDG
WLKDTR
GENAKSNPDSNQVYLDGWLKDTQVEKDNSSPNFNRVYLDGWLKDSHVEISKSIPNSKQ
AYLDGWLKDSHAVNHMKIRQDLAESNGKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREY
AKFLPRKFADDIPFSKSQVPSLLDLFSLTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESM
LEFVHSVIGSETNYSTHSTSHPTPSGARLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYL
DIGSKIFK
VLLKGEDGDIMNALGICHLDTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVP
RPPVATPVATM
Repeat found in RYX36_012713
Repeat occurs 19 times in a sequence of 714 amino acids
Location between 368831101 and 368833915
Coverage of 26.61 %
Instances:
YLDGWLKNTQ | YLDGWLKDAR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIQ | YLDGWLKDTR
YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDSR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTQ | YLDGWLKDSH | YLDGWLKDSH | YLDIGSKIFK |
pattern: YLD[GI][WG][LS]K[DNI][AFITS][RKHQ]
MAYRVVLSLLPFLLLLLINGHESFARDMSQVDQPYLDGWLKNTQLKNQKHFPDSNQVYLDGWL
KDAR
VEKEKSNPDSNHVYLDGWLKDTRDEKTKVNPESNQVYLDGWLKDTRAEKAKSNIDSNQVY
LDGWLKDIR
ADKAKANPDSNQVYLDGWLKDTRAEKAKANLESNQIYLDGWLKDTRAEKEKVNPD
SNQVYLDGWLKDTRAEKSKTNPDSNQVYIDGWLKNTRVEKAKFKPDSNQVYLDGWLKDIQDDKE
KSNHDSNQVYLDGWLKDTRSEKAKSNSDSNQVYLDGWLKDIRVEKAKSTSDSNQVYLDGWLKDT
R
AEKAKSNPDFNQVYLDGWLKDSRVEKAKANPDTNQIYLDGWLKDTRAENAKFNPESNQVYLDG
WLKDTR
DESAKSNPDSNQVYLDGWLKDTQVEKDNFSPNFNRIYLDGWLKDSHVEITKSIPNSKQ
AYLDGWLKDSHAVNHMKIRQDLAESNGKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREY
AKFLPRKVADDIPFSKSQVPSLLDLFSLTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESM
LEFVHSVIGSETNYSTHSTSHPTPSGARLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYL
DIGSKIFK
VLLKGEDGDIMNALGICHLDTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVP
RPPVATPVATM
Repeat found in RYX36_012715
Repeat occurs 19 times in a sequence of 528 amino acids
Location between 368867545 and 368869811
Coverage of 25.19 %
Instances:
GWLKNTQ | GWLKDAR | GWLKDTR | GWLKDTR | GWLKDIR
GWLKDTR | GWLKDTR | GWLKDTR | GWLKNTR | GWLKDIQ
GWLKDTR | GWLKDIR | GWLTDTR | GWLKDSR | GWLKDTR
GWLKDTR | GWLKDTQ | GWLKDSH | GWLKDSH |
pattern: GWL[TK][DN][ASTI][RHQ]
MAYRVVLSLLPFLLLLLINGHESFARDMSQVDQPYLDGWLKNTQLKNQKHFPDSNQVYLDGWL
KDAR
VEKEKSNPDSNHVYLDGWLKDTRDEKTKVNPESNQVYLDGWLKDTRAEKAKSNIDSNQVY
LDGWLKDIRADKAKANPDSNQVYLDGWLKDTRAEKAKANLESNQIYLDGWLKDTRAEKEKFNPD
SNQVYLDGWLKDTRAEKSETNPDSNQVYIDGWLKNTRVEKAKFKPDSNQVYLDGWLKDIQDEKE
KSNHDSNQVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDIRVEKAKSTSDSNQVYLDGWLTDT
R
AEKAKSNPDFNQVYLDGWLKDSRVEKAKANPDTNQIYLDGWLKDTRAENEKFNPESNQVYLDG
WLKDTR
GENTKSNPDSNQVYLDGWLKDTQVEKDNSSPNFNRIYLDGWLKDSHVEISKSIPNSKQ
AYLNGWLKDSHAVNHMKIRQDLAESNGKLSSKVDHTKAFKVAFFGIEDLYVGSVMTLQFPIREY
AKFLPRKVADDIPFSKS

Similar gene clusters

CM107118 - Cluster 22 - Cyclopeptide

Gene cluster description

CM107118 - Gene Cluster 22. Type = cyclopeptide. Location: 367321374 - 370055968 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_012687
Repeat occurs 16 times in a sequence of 406 amino acids
Location between 367327729 and 367391701
Coverage of 27.59 %
Instances:
NQVDQPY | NQVYLDG | NQVYLDG | NQVYLDG | NQVYLDG
NQVYLDG | NQVYLDG | NQVYLDG | NQVYLDG | NQVYLDG
NQVYLDG | NQVYLDG | NQVCLDG | NQVYLDG | NQVYLDG
NQVYLDG |
pattern: NQV[YDC][LQ][PD][YG]
MAHKVILSLIPFLLLLLINGHGSFARYMNQVDQPYLDGWLKNTPLKNQKLSPNTNQVYLDGWL
KDTKGEKIKVNPNSNQVYLDGWLKDTRVEKAKANHDSNQVYLDGWLKDTRAENAKANPDSNKVY
LDGWLKDIRTEKSKVNPDSNQVYLDGWLKDTRGEKAKSTSTSNQVYLDGWLKDTHTEKARSNPD
SNQVYLDGWLKDIRAEKDKSTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKNV
KVNPNSNQVYLDGWLKDTRTENAKSNHNSNQVYLDGWLKDTRTEKAKSTSNSNQVYLDGWLKDT
HTEKEKSNPDSNQVCLDGWFKDIRAEKEKSTSESNQVYLDGWLKDIRAETEKSTPDSNQVYLDG
WLKGTRTKNVKVNHNSNQVYLDG
Repeat found in RYX36_012688
Repeat occurs 12 times in a sequence of 296 amino acids
Location between 367445335 and 367446723
Coverage of 36.49 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDEWLKDT | YLDGWLKDT | YLDGWLKDI
YLDGWLKDT | YLDGWLKDI | YLDGWLKDI | YLDGWLKDI | YLDGWLKDT
YLDGWFKDT | YLDGWLKQN |
pattern: YLD[EG]W[LF]K[DNQ][TNI]
MARRVILSLIPFLLLLLINGHGSFARYMNQVDQPYLDGWLKNTPLKNQKLSPNSNQVYLDGWL
KDT
KGEKTKVNPNSNQVYLDEWLKDTRVEKARANHDSNQVYLDGWLKDTRAENAKANPDSNQVY
LDGWLKDI
RTENSKVNPDSNQVYLDGWLKDTRGEKAKSTSTSNQVYLDGWLKDIHTEKAKSNPD
SNQVYLDGWLKDIRAEKDKSTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKNV
KVNPNSNQVYLDGWFKDTRTENAKSNHNSNHVYLDGWLKQN
Repeat found in RYX36_012689
Repeat occurs 13 times in a sequence of 319 amino acids
Location between 367494242 and 367495699
Coverage of 32.6 %
Instances:
LDGWIKNT | LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKDI
LDGWLKDT | LDGWLKDT | LDGWLKDI | LDGWLKDI | LDGWLKDT
LDGWLKDT | LDGWLKDT | LDGWLKQN |
pattern: LDGW[LI]K[DNQ][TNI]
MARRVILSLIPFLLLLLINGHGSFARYMNQVDQPNLDGWIKNTPLKNQKLSPNSNQVYLDGWL
KDT
KGEKTKVNPNSNQVYLDGWLKDTRVEKAKANHDSNQVYLDGWLKDTRAENAKANPDSNQVY
LDGWLKDI
STEKSKVNPDSNQVYLDGWLKDTRGEKAKSTSTSNQVYLDGWLKDTHTEKAKSNPD
SNQVYLDGWLKDIRAKKDKYTSESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRTKTV
KVNPNSNQVYLDGWLKDTRTENAKSNHNSNQVYLDGWLKDTRTKNAKSNLDSNQIYLDGWLKQN
Repeat found in RYX36_012690
Repeat occurs 10 times in a sequence of 240 amino acids
Location between 367524592 and 367525471
Coverage of 33.33 %
Instances:
LDGWLKNT | LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKDI
LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKDT | LDGWLKQN

pattern: LDGWLK[DNQ][TNI]
MNQVDQPYLDGWLKNTPLKNQKLSPNSNQVYLDGWLKDTRGEKKKVNPNSNQVYLDGWLKDTR
VEKAKANHDSNQVYLDGWLKDTRSENAKANPDSNQVYLDGWLKDIKTEKSKVNPDSNQVYLDGW
LKDT
RGEKAKSTSTSNQVNLDGWLKDTHTEKAKSNPNSNQLKDTRTKNVKVNPNSNQVYLDGWL
KDT
RTENAKSNHNSNQVYLDGWLKDTRTKNAKSNIDSNQIYLDGWLKQN
Repeat found in RYX36_012692
Repeat occurs 15 times in a sequence of 567 amino acids
Location between 367672607 and 367674311
Coverage of 21.16 %
Instances:
YLDGWLKN | YLDGWLKN | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDIGSKI

pattern: YLD[GI][WG][LS]K[DNI]
MNQVDQPYLDGWLKNTPLKNQKLSSDSNQVYLDGWLKNTRGEKVKANPNSNQIYLDGWLKDIR
TEKAKGNHDSNQVYLDGWLKDTRVEKEKANSDSNQVYLDGWLKDTRVEKAKVNPESNQVYLDGW
LKD
NRDEKANVNPDSNQVYLDGWLKDARVEKEKSTPANRVYLDGWLKDARTEKEKSTHDSNQVY
LDGWLKD
PQNEKIETNTKSNHVYLDGWLKDNRDENAKVSADSNQVYLDGWLKDTRDLKEKLTID
SNQVYLDGWLKDTRDEKENSYPNSNQVYLDGWLKDSHVENSKSIQNSKQAYLDGWLKDSHAENH
MKIGQDFVESNEKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPKKVADDIPVS
KSQIPSLLDLFSLTKDSPQGEDMIDIINQCEFPPNKGETKACPTSLESMLEFVHSVIGAETKYN
IHSTSYPTTSGARLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGED
GDIMNALGICHLDTSDMNPSHFIFDLLGMKPGDAPLCHFFPVKHVLWVPSPPVVTK
Repeat found in RYX36_012693
Repeat occurs 12 times in a sequence of 506 amino acids
Location between 367708009 and 367709616
Coverage of 18.97 %
Instances:
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKY | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDIGSKI |
pattern: YLD[GI][WG][LS]K[YDI]
MDAEKAKGNHDSNQVYLDGWLKDTRVEEEKANSDSNQVYLDGWLKDTIVEKAKVNPESNQVYL
DGWLKD
NRDEKANVNPDSNQVYLDGWLKDARTEKEKSTPVNRVYLDGWLKDARTEKEKSTPDSN
QVYLDGWLKYPQNEKIETNTKSNHVYLDGWLKDNRDENAKASADSNQVYLDGWLKDTRDLKEKL
TLDSNQVYLDGWLKDTRDEKENSYPNSNQVYLDGWLKDSHVENSKSIQNSKQAYLDGWLKDSHA
ENHMKIGQDFVESNEKLSSKVDHTEAFKVAFFSIEDLYVGSVMTLQFPIREYAKFLPKKVADDI
PVSKSQIPSLLDLFSLTKDSPQGEDMIDIINQCEFPPNKGETKACPTSLESMLEFIHSVIGAET
KYNIHSTSYPTTSGARLQNYTVLDISNDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLK
GEDGDIMNALGICHLDTSDMNPSHFIFDLLGMKPGDAPLCHFFPVKHVLWVPSPPVVTK
Repeat found in RYX36_012694
Repeat occurs 14 times in a sequence of 567 amino acids
Location between 367730226 and 367731930
Coverage of 19.75 %
Instances:
YLDGWLKN | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDGWLKY
YLDGWLKD | YLDGWLKD | YLDGWLKD | YLDIGSKI |
pattern: YLD[GI][WG][LS]K[YDNI]
MNQVDQPYLDGWLKNTPLKNQKLSSDSNQVYLDGWLKDTRGEKVKVNPNSNQIYLDGWLKDIR
AEKAKGNHDSNQVYLDGWLKDTRVEKEKDNSDSNQVYLDGWLKDTRVEKAKVNPESNQVYLDGW
LKD
NRDEKANVNPDSNQVYLDGWLKDARTEKEKSTPVNRVYLDGWLKDARTEKEKFTPDSNQVY
LDGWLKD
PQNEKIETNTKSNHVYLDGWLKYNRDENAKASADSNQVYLDGWLKDTRDLKEKLTLD
SNQVYLDGWLKDTRDEKENSYPNSNQVYVDGWLKDSHVENSKSIQNSKQAYLDGWLKDSYAENH
MKIGQDFVESNEKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPKKVADDIPVS
KSQIPSLLDLFSLTKDSPQGEDMIDIINQCEFPPNKGETKACPTSLESMLEFVHSVIGAETKYN
IHSTSYPTTSGARLQNYTVLDISNDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGED
GDIMNALGICHLDTSDMNPSHFIFDLLGMKPGDAPLCHFFPVKHVLWVPSPPVVTK
Repeat found in RYX36_012695
Repeat occurs 13 times in a sequence of 438 amino acids
Location between 367828899 and 367830216
Coverage of 17.81 %
Instances:
DGWVKN | DGWLKD | DGWLKD | DGWLKD | DGWLKD
DGWLKD | DGWLKD | DGWLKD | DGWLKD | DGWLKD
DGWLKD | DGWLKD | DGWLKD |
pattern: DGW[VL]K[DN]
MNQVDQPYLDGWVKNTLLKNQKLSSDSNQVYLDGWLKDNRGEKVKVNPNSNQIYLDGWLKDIR
AEKAKGNHDSNQVYFDGWLKDTRVEKEKANSDSNQVYLDGWLKDTIVEKAKVNPKSNQVYLDGW
LKD
NRDEKAKVNHDSNQVYLDGWLKDAQTEKEKSTPANQVYLDGWLKDVRTEKEKSTPDSNQVY
LDGWLKDTHDEKIETNTNSNHVYLDGWLKDNRDENAKGSADSNQVYLDGWLKDTRDEKENSYPN
SNQVYLDGWLKDSHVENSKSMQNSKQAYLDGWLKDSHVVNHMKIGQDFEESNKKLSSKVDHTEA
FKVAFFDIEDLYVGSVMTLQFPIREYAKLLPKKVADDIPVSKSQISSLLDLFSLTKDSPRGEDM
IDIINQCEFPPNKGDTKAYLTSLESMLEFVHSVIGAETKYNIHSTSYPTTSGARL
Repeat found in RYX36_012702
Repeat occurs 5 times in a sequence of 108 amino acids
Location between 368394557 and 368394884
Coverage of 41.67 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDN

pattern: YLDGWLK[DN][TN]
MNQVDQPYLDGWLKNTPLKNQKPFSDSNQIYLDGWLKDTRGEKVKSNLDSNQVYLDGWLKDTR
NEKEKANPDSNQFYLDGWLKDTRVEKVKGNPDSNQVYLDGWLKDN
Repeat found in RYX36_012703
Repeat occurs 7 times in a sequence of 278 amino acids
Location between <368509853 and 368512007
Coverage of 32.73 %
Instances:
NQVDQPYLDGWLK | NQVYVNGWLKDTR | NQVYLDGWLKDTR | NQVYLDGCLKDTR | NQVYLDGWLKDNR
NQVYVDGWVKDTR | NQVDLYGWLKDNS |
pattern: NQV[YD][VLQ][PYDN][YG][WLC][LVD][GK][WD][LTN][RKS]
MFLLLLLINDHGSFARYMNQVDQPYLDGWLKNTPLKNQKSFHDSNQVYVNGWLKDTRSEKEKA
NPDSNQVYLDGWLKDTRDEKAKANLDSNQTEKAKANPEFNQVYLDGCLKDTRVKKEKVDPDSNQ
VYLDGWLKDNR
AEKANSNPNSNQVYVDGWVKDTRAKNAKFNPNSNQVDLYGWLKDNSGSVMTFQ
LPLREYANFLPRKVADHIPFSKSQVPSLLDLFSLTKDSTQDIYSPKWVACHPRPYLYGLYYFHY
LDIRSKIFKVLLKGEDGDIMNAL
Repeat found in RYX36_012705
Repeat occurs 9 times in a sequence of 229 amino acids
Location between 368517053 and 368518226
Coverage of 39.3 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTG
YLDGWLTDTR | YLDGWLKDTR | YLDGWLKDTR | YLDRWLKDTR |
pattern: YLD[RG]WL[TK][DN]T[PRG]
MFLLLLLINGHGSFARDMNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKV
NPDSNQVYLDGWLKDTRTEKAKANLDSNQVYLDGWLKDTRVEKEKSNPDSNQVYLDGWLKDTGA
KNAKLNPESNLVYLDGWLTDTRTEKEKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWL
KDTR
AEKEKSNPDFNQVYLDRWLKDTRVEKEKSNPDSN
Repeat found in RYX36_012706
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368521893 and 368523957
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAH
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAKSNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNPESNQVYLDGWLKDDKAEKANANPNSNQVYVDGWVKDTRSVK
VKSNPNSNQVYLDGWLKDAHAENEKSNPKSNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSPNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVIIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPLSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGFETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPIKHILWVPRPPAATPVATM
Repeat found in RYX36_012707
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368528159 and 368530223
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAH
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAKSNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNPESNQVYLDGWLKDDKAEKANANPNSNQVYVDGWVKDTRSVK
VKSNPNSNQVYLDGWLKDAHAENEKSNPKSNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSPNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPLSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGFETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPIKHILWVPRPPAATPVATM
Repeat found in RYX36_012708
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368534425 and 368536489
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTQAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNMDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPRM][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYPDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTQAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNMDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012709
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368540688 and 368542752
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNRDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYPDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012710
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368546951 and 368549015
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAQ
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGVKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGFNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012711
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368553215 and 368555279
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNRDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSPNYNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012712
Repeat occurs 20 times in a sequence of 714 amino acids
Location between 368806723 and 368809551
Coverage of 28.01 %
Instances:
YLDGWLKNTQ | YLDGWLKDAR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKNTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDSR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTQ | YLDGWLKDSH | YLDGWLKDSH | YLDIGSKIFK

pattern: YLD[GI][WG][LS]K[DNI][AFITS][RKHQ]
MAHIVVLSLLSFLLLMLINGHESFARDMSQVDQPYLDGWLKNTQLKNQKHFPDSNQVYLDGWL
KDAR
VEKEKSNPDSNHVYLDGWLKDTRDEKTKVNPESNQVYLDGWLKDTRAEKAKSNIDSNQVY
LDGWLKDIR
ADKAKANPDSNQVYLDGWLKDTRAEKAKANLESNQIYLDGWLKDTRVEKEKFNPD
SNQVYLDGWLKDTRAEKSKTNPDSNQVYLDGWLKNTRVEKAKFKPDSNQVYLDGWLKDIRDEKE
KSNHDSNQVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDIRVEKAKSTSDSNQVYLDGWLKDT
R
AENAKSNPDFNQVYLDGWLKDSRVEKAKANPDTNQVYLDGWLKDTRAENAKFNPESNQVYLDG
WLKDTR
GENAKSNPDSNQVYLDGWLKDTQVEKDNSSPNFNRVYLDGWLKDSHVEISKSIPNSKQ
AYLDGWLKDSHAVNHMKIRQDLAESNGKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREY
AKFLPRKFADDIPFSKSQVPSLLDLFSLTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESM
LEFVHSVIGSETNYSTHSTSHPTPSGARLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYL
DIGSKIFK
VLLKGEDGDIMNALGICHLDTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVP
RPPVATPVATM
Repeat found in RYX36_012713
Repeat occurs 19 times in a sequence of 714 amino acids
Location between 368831101 and 368833915
Coverage of 26.61 %
Instances:
YLDGWLKNTQ | YLDGWLKDAR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIQ | YLDGWLKDTR
YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDSR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTQ | YLDGWLKDSH | YLDGWLKDSH | YLDIGSKIFK |
pattern: YLD[GI][WG][LS]K[DNI][AFITS][RKHQ]
MAYRVVLSLLPFLLLLLINGHESFARDMSQVDQPYLDGWLKNTQLKNQKHFPDSNQVYLDGWL
KDAR
VEKEKSNPDSNHVYLDGWLKDTRDEKTKVNPESNQVYLDGWLKDTRAEKAKSNIDSNQVY
LDGWLKDIR
ADKAKANPDSNQVYLDGWLKDTRAEKAKANLESNQIYLDGWLKDTRAEKEKVNPD
SNQVYLDGWLKDTRAEKSKTNPDSNQVYIDGWLKNTRVEKAKFKPDSNQVYLDGWLKDIQDDKE
KSNHDSNQVYLDGWLKDTRSEKAKSNSDSNQVYLDGWLKDIRVEKAKSTSDSNQVYLDGWLKDT
R
AEKAKSNPDFNQVYLDGWLKDSRVEKAKANPDTNQIYLDGWLKDTRAENAKFNPESNQVYLDG
WLKDTR
DESAKSNPDSNQVYLDGWLKDTQVEKDNFSPNFNRIYLDGWLKDSHVEITKSIPNSKQ
AYLDGWLKDSHAVNHMKIRQDLAESNGKLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREY
AKFLPRKVADDIPFSKSQVPSLLDLFSLTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESM
LEFVHSVIGSETNYSTHSTSHPTPSGARLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYL
DIGSKIFK
VLLKGEDGDIMNALGICHLDTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVP
RPPVATPVATM
Repeat found in RYX36_012715
Repeat occurs 19 times in a sequence of 528 amino acids
Location between 368867545 and 368869811
Coverage of 25.19 %
Instances:
GWLKNTQ | GWLKDAR | GWLKDTR | GWLKDTR | GWLKDIR
GWLKDTR | GWLKDTR | GWLKDTR | GWLKNTR | GWLKDIQ
GWLKDTR | GWLKDIR | GWLTDTR | GWLKDSR | GWLKDTR
GWLKDTR | GWLKDTQ | GWLKDSH | GWLKDSH |
pattern: GWL[TK][DN][ASTI][RHQ]
MAYRVVLSLLPFLLLLLINGHESFARDMSQVDQPYLDGWLKNTQLKNQKHFPDSNQVYLDGWL
KDAR
VEKEKSNPDSNHVYLDGWLKDTRDEKTKVNPESNQVYLDGWLKDTRAEKAKSNIDSNQVY
LDGWLKDIRADKAKANPDSNQVYLDGWLKDTRAEKAKANLESNQIYLDGWLKDTRAEKEKFNPD
SNQVYLDGWLKDTRAEKSETNPDSNQVYIDGWLKNTRVEKAKFKPDSNQVYLDGWLKDIQDEKE
KSNHDSNQVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDIRVEKAKSTSDSNQVYLDGWLTDT
R
AEKAKSNPDFNQVYLDGWLKDSRVEKAKANPDTNQIYLDGWLKDTRAENEKFNPESNQVYLDG
WLKDTR
GENTKSNPDSNQVYLDGWLKDTQVEKDNSSPNFNRIYLDGWLKDSHVEISKSIPNSKQ
AYLNGWLKDSHAVNHMKIRQDLAESNGKLSSKVDHTKAFKVAFFGIEDLYVGSVMTLQFPIREY
AKFLPRKVADDIPFSKS

Similar gene clusters

CM107118 - Cluster 23 - Cyclopeptide

Gene cluster description

CM107118 - Gene Cluster 23. Type = cyclopeptide. Location: 368060841 - 368681878 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_012702
Repeat occurs 5 times in a sequence of 108 amino acids
Location between 368394557 and 368394884
Coverage of 41.67 %
Instances:
YLDGWLKNT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDT | YLDGWLKDN

pattern: YLDGWLK[DN][TN]
MNQVDQPYLDGWLKNTPLKNQKPFSDSNQIYLDGWLKDTRGEKVKSNLDSNQVYLDGWLKDTR
NEKEKANPDSNQFYLDGWLKDTRVEKVKGNPDSNQVYLDGWLKDN
Repeat found in RYX36_012703
Repeat occurs 7 times in a sequence of 278 amino acids
Location between <368509853 and 368512007
Coverage of 32.73 %
Instances:
NQVDQPYLDGWLK | NQVYVNGWLKDTR | NQVYLDGWLKDTR | NQVYLDGCLKDTR | NQVYLDGWLKDNR
NQVYVDGWVKDTR | NQVDLYGWLKDNS |
pattern: NQV[YD][VLQ][PYDN][YG][WLC][LVD][GK][WD][LTN][RKS]
MFLLLLLINDHGSFARYMNQVDQPYLDGWLKNTPLKNQKSFHDSNQVYVNGWLKDTRSEKEKA
NPDSNQVYLDGWLKDTRDEKAKANLDSNQTEKAKANPEFNQVYLDGCLKDTRVKKEKVDPDSNQ
VYLDGWLKDNR
AEKANSNPNSNQVYVDGWVKDTRAKNAKFNPNSNQVDLYGWLKDNSGSVMTFQ
LPLREYANFLPRKVADHIPFSKSQVPSLLDLFSLTKDSTQDIYSPKWVACHPRPYLYGLYYFHY
LDIRSKIFKVLLKGEDGDIMNAL
Repeat found in RYX36_012705
Repeat occurs 9 times in a sequence of 229 amino acids
Location between 368517053 and 368518226
Coverage of 39.3 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTG
YLDGWLTDTR | YLDGWLKDTR | YLDGWLKDTR | YLDRWLKDTR |
pattern: YLD[RG]WL[TK][DN]T[PRG]
MFLLLLLINGHGSFARDMNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKV
NPDSNQVYLDGWLKDTRTEKAKANLDSNQVYLDGWLKDTRVEKEKSNPDSNQVYLDGWLKDTGA
KNAKLNPESNLVYLDGWLTDTRTEKEKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWL
KDTR
AEKEKSNPDFNQVYLDRWLKDTRVEKEKSNPDSN
Repeat found in RYX36_012706
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368521893 and 368523957
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAH
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAKSNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNPESNQVYLDGWLKDDKAEKANANPNSNQVYVDGWVKDTRSVK
VKSNPNSNQVYLDGWLKDAHAENEKSNPKSNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSPNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVIIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPLSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGFETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPIKHILWVPRPPAATPVATM
Repeat found in RYX36_012707
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368528159 and 368530223
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAH
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAKSNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNPESNQVYLDGWLKDDKAEKANANPNSNQVYVDGWVKDTRSVK
VKSNPNSNQVYLDGWLKDAHAENEKSNPKSNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSPNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPLSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGFETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPIKHILWVPRPPAATPVATM
Repeat found in RYX36_012708
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368534425 and 368536489
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTQAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNMDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPRM][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYPDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTQAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNMDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012709
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368540688 and 368542752
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNRDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYPDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012710
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368546951 and 368549015
Coverage of 22.13 %
Instances:
DGWLKNTP | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTG
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDDK | DGWVKDTR | DGWLKDAQ
DGWLKDNR | DGWLKDTQ | DGWLKDNH | DGWLKDSH |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEKAKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGVKNAKLNPESNLVYLDGW
LKDTR
TEKVKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGFNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQ
AEKENSSTNSNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM
Repeat found in RYX36_012711
Repeat occurs 19 times in a sequence of 687 amino acids
Location between 368553215 and 368555279
Coverage of 30.42 %
Instances:
DGWLKNTPLKN | DGWLKDTRIEK | DGWLKDTRTEK | DGWLKDTRVEK | DGWLKDTGAKN
DGWLKDTRTEK | DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRVEK | DGWLKDTRAQN
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDDKAEK | DGWVKDTRAVK | DGWLKDAQAEN
DGWLKDNRDLK | DGWLKDTQAEK | DGWLKDNHVDN | DGWLKDSHAEN |
pattern: DGW[VL]K[DN][ADNTS][GHQKPR][ADVLIT][EQDKVL][KN]
MNQVDQSYLDGWLKNTPLKNQKSFHDSNQVYLDGWLKDTRIEKAKVNPDSNQVYLDGWLKDTR
TEK
AKANLDSNQVYLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTGAKNAKLNPESNLVYLDGW
LKDTRTEK
VKANPDSNQVYLDGWLKDTRAEKAKANPDSNQVYLDGWLKDTRAEKAKSNPDFNQV
YLDGWLKDTRVEKEKANPDSNQVYLDGWLKDTRAQNAISNLGSNQVYLDGWLKDTRAEKEKVNT
DSNQIYIDGWLKDTRAEKEKSNSESNQVYLDGWLKDDKAEKANANPNSNQVYIDGWVKDTRAVK
VKSNPNSNQVYLDGWLKDAQAENEKSNPKTNQVYLDGWLKDNRDLKEKSTLDSNHVYLDGWLKD
TQAEK
ENSSPNYNQVYLDGWLKDNHVDNAKSIPNFKQAYLDGWLKDSHAENQVKIGQDLGESNG
KLSSKVDHTEAFKVAFFGIEDLYVGSVMTLQFPIREYAKFLPRKVADDIPFSKSQVPSLLDLFS
LTKDSPQGQDMIDVINQCEFPPNKGETKACPTSLESMLEFVHSVIGSETNYSTHSTSHPTPSGA
RLQNYTVLDISKDIYAPKWVACHPRPYPYALYYCHYLDIGSKIFKVLLKGEDGDIMNALGICHL
DTSDMNPNHFIFDLLGMKPGDAPLCHFFPVKHILWVPRPPAATPVATM

Similar gene clusters

CM107118 - Cluster 24 - Lignan-polyketide

Gene cluster description

CM107118 - Gene Cluster 24. Type = lignan-polyketide. Location: 605144243 - 608207745 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107118 - Cluster 25 - Saccharide

Gene cluster description

CM107118 - Gene Cluster 25. Type = saccharide. Location: 645949665 - 650194292 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107118 - Cluster 26 - Saccharide

Gene cluster description

CM107118 - Gene Cluster 26. Type = saccharide. Location: 1125181924 - 1125741212 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107118 - Cluster 27 - Saccharide

Gene cluster description

CM107118 - Gene Cluster 27. Type = saccharide. Location: 1163997011 - 1168542582 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107118 - Cluster 28 - Cyclopeptide

Gene cluster description

CM107118 - Gene Cluster 28. Type = cyclopeptide. Location: 1181929625 - 1188614891 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_015356
Repeat occurs 4 times in a sequence of 634 amino acids
Location between 1185271232 and 1185273283
Coverage of 4.42 %
Instances:
DSFKNYS | DSFKKYS | DSFTSYS | DSFQSYG |
pattern: DSF[TKQ][KNS]Y[GS]
MKIHFIFLFFSLSFFHGFIATIGSKEITETQEHSFKLKTNPFSPKASLIRHWNTRISNNLPNS
IPNFFLSKASPLTPQHYANLVNLLKQKPLSANFHNSLCSTPYLFCSFDHPNQYYQSKKNTKDDA
NFAVYSNKKFAAYGSSRLGGVDSFKNYSNGLNTNNDSFKKYSTTSSRHTGQFNSYAENGNVANT
NFTSYGSGSNSGTGEFKSYDKLVNDPNLGFTTYDSSASNHKLSFSSYGNETNSGSESFNSYGKH
VRSGNSDFTNYAVSSNILQSSFTGYGELGTGAANDSFTSYSFNGNNPRSTFKTYGAGSISGSDT
FVSYRNRANVGDDSFQSYGSKSKSGSASFTNYGQSFNEGNDTFTEYGKGSTGKTAFGFKIYGLG
RAFKGYNKNGVSFSSYNNFSLSSGKIVNKFVEPGKFFRESMLKEGNAMVMPDIRDKMPKRSFLP
LSISSKLPFSSLVLEDIKKTFHARKGSATERVITNALGECERDPSVGETKRCVGSAEAMIDFAV
SVMGPNVVVKTTENVNGSKKSVMIGKVYGINGGKVTKSVSCHQTLYPYLLYYCHSVPKVRVYEA
EILDVETKLKINHGVAICHMDTSAWGPQHGAFTALGSEPGKIEVCHWIFENDMTWTIAS

Similar gene clusters

CM107119 - Cluster 29 - Fatty_acid

Gene cluster description

CM107119 - Gene Cluster 29. Type = fatty_acid. Location: 930806544 - 933832722 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107119 - Cluster 30 - Alkaloid

Gene cluster description

CM107119 - Gene Cluster 30. Type = alkaloid. Location: 995691051 - 998257504 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107119 - Cluster 31 - Saccharide-fatty_acid

Gene cluster description

CM107119 - Gene Cluster 31. Type = saccharide-fatty_acid. Location: 1184776942 - 1188183316 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107119 - Cluster 32 - Cyclopeptide

Gene cluster description

CM107119 - Gene Cluster 32. Type = cyclopeptide. Location: 1349556425 - 1369024263 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_022193
Repeat occurs 13 times in a sequence of 476 amino acids
Location between <1361993759 and 1361995447
Coverage of 27.31 %
Instances:
ASEHDVNSRE | ASEPDVNSRE | ASEPDVNSRE | ASEPDVNSRE | ASEPDVNSGG
ASEPDVNSRE | ASEPDVNSRE | ASEPDVNSRE | ASEPDVNSGG | ASEPDVNSEG
ASEPDVNSRE | ASESDVNSGG | ASEPDASSET |
pattern: ASE[PHS]D[AV][NS]S[ERG][TEG]
MSFSPSIHSQYYIKAALPTFFFILSIIEFHLHSYNMVFHFFHIITFLMLVIVATNAETLPPQL
YWKSVLPNSPMPKAITNLLPNSPFPKPIINLLLPGEDLGVKKDYSDGGEADVKSVGLSNSQFYI
HAASEHDVNSRESPNVFRYQHDASEPDVNSRESPNLFRYQHAASEPDVNSRESPNAFSYKHVAS
EPDVNSRE
SPNGFGYKHAASEPDVNSGGLSNVFRYQHDASEPDVNSRESPNLFRYQHAASEPDV
NSRE
SPNAFSYKHVASEPDVNSRESPNGFGYKHAASEPDVNSGGLSNGFSYKHAASEPDVNSEG
LPNGFSYKHATSEPDVNSRESPNAFSYKHAASEPDVNSRESPNAFNYKHAASESDVNSGGLPNG
FSYHRAASEPDASSETQLHDKPKALLFFFEKNLHDGTKSNLQFMKTSSSNVAKFLPKEVANSIP
FSSNKVDYILNKMNIKKGSKGARIVKNTM

Similar gene clusters

CM107119 - Cluster 33 - Cyclopeptide

Gene cluster description

CM107119 - Gene Cluster 33. Type = cyclopeptide. Location: 1353351149 - 1370638058 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_022193
Repeat occurs 13 times in a sequence of 476 amino acids
Location between <1361993759 and 1361995447
Coverage of 27.31 %
Instances:
ASEHDVNSRE | ASEPDVNSRE | ASEPDVNSRE | ASEPDVNSRE | ASEPDVNSGG
ASEPDVNSRE | ASEPDVNSRE | ASEPDVNSRE | ASEPDVNSGG | ASEPDVNSEG
ASEPDVNSRE | ASESDVNSGG | ASEPDASSET |
pattern: ASE[PHS]D[AV][NS]S[ERG][TEG]
MSFSPSIHSQYYIKAALPTFFFILSIIEFHLHSYNMVFHFFHIITFLMLVIVATNAETLPPQL
YWKSVLPNSPMPKAITNLLPNSPFPKPIINLLLPGEDLGVKKDYSDGGEADVKSVGLSNSQFYI
HAASEHDVNSRESPNVFRYQHDASEPDVNSRESPNLFRYQHAASEPDVNSRESPNAFSYKHVAS
EPDVNSRE
SPNGFGYKHAASEPDVNSGGLSNVFRYQHDASEPDVNSRESPNLFRYQHAASEPDV
NSRE
SPNAFSYKHVASEPDVNSRESPNGFGYKHAASEPDVNSGGLSNGFSYKHAASEPDVNSEG
LPNGFSYKHATSEPDVNSRESPNAFSYKHAASEPDVNSRESPNAFNYKHAASESDVNSGGLPNG
FSYHRAASEPDASSETQLHDKPKALLFFFEKNLHDGTKSNLQFMKTSSSNVAKFLPKEVANSIP
FSSNKVDYILNKMNIKKGSKGARIVKNTM

Similar gene clusters

CM107119 - Cluster 34 - Cyclopeptide

Gene cluster description

CM107119 - Gene Cluster 34. Type = cyclopeptide. Location: 1504782181 - 1516687500 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

CM107120 - Cluster 35 - Saccharide

Gene cluster description

CM107120 - Gene Cluster 35. Type = saccharide. Location: 6131415 - 6337698 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107120 - Cluster 36 - Cyclopeptide

Gene cluster description

CM107120 - Gene Cluster 36. Type = cyclopeptide. Location: 121095481 - 133721352 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_023592
Repeat occurs 6 times in a sequence of 367 amino acids
Location between 127378456 and 127380081
Coverage of 9.81 %
Instances:
ENQYWT | ENQPFG | ENQPFG | ENQPFG | ENQPFA
ENQHSV |
pattern: ENQ[PYH][WFS][AGTV]
MEFKNLSVLALFLLVLVGIHGSNSGEEYWKSVWPNTPIPKALSDLLLSGTDMPIKIQEENQYW
T
IFFEHDLYPGKTMNLGIQKHSDIQSSKSSTYLPIRKTCQTLRTHKWFEETPEKENQPFGLCIW
FKKETTTENQPFGFWAWSKKETTKENQPFGFWIWAKKETIKESQPFGFWIWSKKETTKENQPFA
RDTQNENQHSVAYTSDEKEAHIIDNYCRTPSAIGEEKHCALSLESMMDFAISKLGKNIKVMSSS
LVQNQDKYEVEEVNKIGDKVVMCHRLNFKKVVFYCHAVNASTTYMVPLTASDGTKSKALTICHH
DTRGMNPNVLHEVLNVKPGTVPVCHFIGNKAIAWVPDMSESGGHPCVV
Repeat found in RYX36_023593
Repeat occurs 6 times in a sequence of 367 amino acids
Location between 127436781 and 127438376
Coverage of 9.81 %
Instances:
ENQYWT | ENQPFG | ENQPFG | ENQPFG | ENQPFA
ENQHSV |
pattern: ENQ[PYH][WFS][AGTV]
MEFKNLSVLALFLLVLVGIHGSNSGEEYWKSVWPNTPIPKALSDLLLSGTDMPIKIQEENQYW
T
IFFEHDLYPGKTMNLGIQKHSDIQSSKSSTYLPIRKTCQTLRTHKWFKETPEKENQPFGLCIW
FKKETTTENQPFGFWAWSKKETTKENQPFGFWIWAKKETIKESQPFGFWIWSKKETTKENQPFA
RDTQKENQHSVAYTSDEKEAHIIDNYCRTPSAIGEEKHCALSLESMMDFAISKLGKNIKVMSSS
LVQNQDKYEVEEVNKIGDKVVMCHRLNFKKVVFYCHAVNASTTYMVPLTASDGTKSKALTICHH
DTRGMNPNVLHEVLNVKPGTVPVCHFIGNKAIAWVPDMSESGGHPCVV

Similar gene clusters

CM107120 - Cluster 37 - Saccharide

Gene cluster description

CM107120 - Gene Cluster 37. Type = saccharide. Location: 473417134 - 475399167 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107120 - Cluster 38 - Cyclopeptide

Gene cluster description

CM107120 - Gene Cluster 38. Type = cyclopeptide. Location: 499074258 - 511769729 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_024917
Repeat occurs 6 times in a sequence of 505 amino acids
Location between 500505261 and 500507803
Coverage of 14.26 %
Instances:
QQQQQQQQKHHH | QQQQQQQKHHHR | QQQQQQKHHHRH | QQQQQKHHHRHQ | QQQQKHHHRHQY
QQQKHHHRHQYQ |
pattern: QQQ[KQ][KHQ][KHQ][KHQ][RKHQ][RKH][RHQ][YRHQ][YRHQ]
MTKQFLLLSLILHLLSVSAAPSTSSPAKLLSGFLSNAVPAFTKWVWSLKSTTKTGVLSKSMMK
FESGYNVETVFDGSKLGIEPYAVEVLHNGELLILDSANSNLYRISSSLSLYSRPKLVAGSAEGY
SGHVDGKLREARMNHPKGITIDDRGNIYVADTTNMAIRKISDSGVTTIAGGKWSRGGGHVDGPS
EEAKFSDDFDVVYVGSSCSLLVVDRGNQAIREIQLHFDDCAYRYGSDFPLGIAMLVGAGFFGYM
LALLQRRLGTIVESQDAQVPVTVMSYVSPSTYQKPLKSVRPPLIPSEYEPEKQEEGFFGSLGKL
LSNAGVSMVEIMGGLFPVFRRRPQSSQYHRETLIQQPQKLVNDWPPQESFVIPREDEPPSIDTR
AATPQKTYPFMLKDAEKIQQLRQSKAFYSGGWDGDQHQPQQQQQQQQKHHHRHQYQSSAPHTFY
EQTNETTNEVVFGAVQEQEGKKESVVITPVEYGGSLYEHRSIRPRMSSMGYNTYTHKY
Repeat found in RYX36_024934
Repeat occurs 4 times in a sequence of 642 amino acids
Location between 505067488 and 505069616
Coverage of 3.74 %
Instances:
DSFKTY | DSFTSY | DSFTSY | DSFTGY |
pattern: DSF[TK][GTS]Y
MQKQFNHIIHTLFILLSIITIGFAINHEEEKNQNPFTPKAFVIRYWDRVIKNKLPKPSFILSK
TSPLTATEAAAFAKHAAANTLSTKLPEFCSAAHLFCLPDINPSLKKQYPFDPTHFAVYNVDQNF
TTYTFGENFTTYGTHKTDGIDAFKNYSIGLFSAEHNEFKRYSKSSANHNDSFKTYAINTNAAVE
DFNNYGTSSVAGSGEFKTYNTPDSNVPNLKFNIYTADTVGREQTFKTYSDSSNAGNQSFTNYGK
NSLHAVNDFTSYATDSNVFKSEFSAYSEKGTSNEDSFTSYGKRLNNPDNNFKNYGKGSTSGTEK
FTNYRDQANAGGDSFTSYGESSRGGIHVGFNSYGRRPGFPKFPGFDSFTGYAKGADLDHKVTFI
TYGVNNTFKDYDKKGISFAKYNNTNSASVSGSLVKNLVQPGKFFREMMLKEGTIMPMPDIRDKL
PARSFLPRSILPKLNEMKQVFQVSENSPMKKIIVDALNECERAPSMGETKRCVGSLEDMIDFAT
SVLGRDVTVRSTENVNGWGKNVMVGKVKGINDGKVTESVSCHQSLFPSLLYYCHSVPKVRVYEA
DLLDSEFKVKINHGVAICHLDTTAWSPSHGAFMALGSGPGRIEVCHWIFENDMTWTTADSSSNN
LYH

Similar gene clusters

CM107120 - Cluster 39 - Alkaloid

Gene cluster description

CM107120 - Gene Cluster 39. Type = alkaloid. Location: 616819279 - 619243072 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107120 - Cluster 40 - Saccharide

Gene cluster description

CM107120 - Gene Cluster 40. Type = saccharide. Location: 756889644 - 759408091 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107120 - Cluster 41 - Putative

Gene cluster description

CM107120 - Gene Cluster 41. Type = putative. Location: 940339265 - 943885874 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107120 - Cluster 42 - Saccharide

Gene cluster description

CM107120 - Gene Cluster 42. Type = saccharide. Location: 956872143 - 960225086 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107120 - Cluster 43 - Lignan

Gene cluster description

CM107120 - Gene Cluster 43. Type = lignan. Location: 1264229873 - 1267836301 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107120 - Cluster 44 - Saccharide

Gene cluster description

CM107120 - Gene Cluster 44. Type = saccharide. Location: 1283287098 - 1284138509 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107120 - Cluster 45 - Saccharide

Gene cluster description

CM107120 - Gene Cluster 45. Type = saccharide. Location: 1485116053 - 1487975300 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107121 - Cluster 46 - Saccharide-fatty_acid

Gene cluster description

CM107121 - Gene Cluster 46. Type = saccharide-fatty_acid. Location: 335561768 - 338473232 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107121 - Cluster 47 - Cyclopeptide

Gene cluster description

CM107121 - Gene Cluster 47. Type = cyclopeptide. Location: 361974653 - 365315272 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_029858
Repeat occurs 3 times in a sequence of 338 amino acids
Location between 363644422 and 363645502
Coverage of 5.33 %
Instances:
SKETTL | SKEVGP | SKEVGP |
pattern: SKE[VT][GT][PL]
MRTFLIFIFSVIFLFTTLFAQTTSVDRAQKLSFNFKISNLSKETTLNSPELCFKAHLFCSSYE
EVGPLGGYQRFRQYRSEDKSKEVGPLGGYQRFRQYRSEEKSKEVGPLGGYQRFRQYRSEDKFVE
PGVFFREKSLKRGTVMLIPDITDKLPPREFLPSSIFSPAASSKVQQVFNVFKNNPKMKKMMRMA
MNDCESPASEGEVKKCVVTVEDMVEFVKSVLGENISVKTTANVNGSGKNILLDRVELITSGSVV
SCHQSLFPSIMYQCHHVPTVRLYKSQMLDLASKTVINTGIAICHVNTTAWAPTHSAFHLLGPGP
GKIEVCHWIYENDLAWIVG

Similar gene clusters

CM107121 - Cluster 48 - Fatty_acid

Gene cluster description

CM107121 - Gene Cluster 48. Type = fatty_acid. Location: 655563698 - 659173521 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107121 - Cluster 49 - Saccharide-transporter_associated

Gene cluster description

CM107121 - Gene Cluster 49. Type = saccharide-transporter_associated. Location: 851646576 - 856953576 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107121 - Cluster 50 - Saccharide

Gene cluster description

CM107121 - Gene Cluster 50. Type = saccharide. Location: 1094192648 - 1095403727 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107122 - Cluster 51 - Saccharide

Gene cluster description

CM107122 - Gene Cluster 51. Type = saccharide. Location: 223162968 - 226612561 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107122 - Cluster 52 - Polyketide-terpene

Gene cluster description

CM107122 - Gene Cluster 52. Type = polyketide-terpene. Location: 482530562 - 483557647 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107122 - Cluster 53 - Saccharide

Gene cluster description

CM107122 - Gene Cluster 53. Type = saccharide. Location: 518987824 - 520946539 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107122 - Cluster 54 - Saccharide

Gene cluster description

CM107122 - Gene Cluster 54. Type = saccharide. Location: 585089564 - 589864271 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107122 - Cluster 55 - Terpene

Gene cluster description

CM107122 - Gene Cluster 55. Type = terpene. Location: 656851216 - 658143356 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107122 - Cluster 56 - Fatty_acid-alkaloid

Gene cluster description

CM107122 - Gene Cluster 56. Type = fatty_acid-alkaloid. Location: 723098115 - 724697613 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107122 - Cluster 57 - Saccharide-terpene

Gene cluster description

CM107122 - Gene Cluster 57. Type = saccharide-terpene. Location: 872067506 - 876106696 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107122 - Cluster 58 - Saccharide

Gene cluster description

CM107122 - Gene Cluster 58. Type = saccharide. Location: 896136452 - 897351604 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107122 - Cluster 59 - Cyclopeptide

Gene cluster description

CM107122 - Gene Cluster 59. Type = cyclopeptide. Location: 958907545 - 971702448 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

CM107122 - Cluster 60 - Cyclopeptide

Gene cluster description

CM107122 - Gene Cluster 60. Type = cyclopeptide. Location: 971026179 - 988097915 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in RYX36_036187
Repeat occurs 3 times in a sequence of 311 amino acids
Location between 973846985 and 973848533
Coverage of 5.79 %
Instances:
GIGLSV | GIGIGV | GIGVGI |
pattern: GIG[VLI][GS][VI]
MENSYKVNGNGSAENGYSIARYTHSYQPSLKGSLPWLDIRVFYVRVCKCEFDSTTPEVLTLNH
VPLNPDTLLEVNGVRSSIYSDGMSTLLKRDRVDRKSEEVTFVSTDSIRTSGSVKFEVFDKDNLL
LFGALELCNSNGVVRESNSSGQSWSMKCESNIIPGTRFFKEKQLLLPETTLPTIEVYIAGSFSG
TPIILTKTLHLSSQKRHSRKGVLNAIPENDANENGKGPSSALALQAPDYMHDKLEDEDYQSLYT
RTAYADGEDGELSWFNAGVRVGVGIGLSVCLGIGIGVGIMVKTYQGATGQFRRRLL
The following known motifs were found in CDS RYX36_036201
Location between 978331198 and 978332110
FEPR was found 5 times in this sequence
Sequence:
MKDEEMPEGIQGLLQLKSEIAPLTNSKAQEQHAKGKCDQHPIPKILNFKEKKVVTEEFEPIPS
VTKYDGDGGYKNMKLPVNGEFEPIPSLTKYDGDENYKTMKLPVNGEFEPIPSLTKYDGDESYKT
MKLPVNNEFESIPSVTKYDDGEGYKSMKLPVNDEFEPIPSFLIYGEDDGYKSVKLPLNDEFEPR
PSATKYDGDEGYKSVKLPVNDEFEPRPSATKYDGDKGYKNMKLPVNGEFEPRPTATKYDGDEGY
KNMKLPVNGEFEPRPSATKYDGDEGHKNMKSSVNDEFEPRPSATKYND

Similar gene clusters

CM107122 - Cluster 61 - Cyclopeptide

Gene cluster description

CM107122 - Gene Cluster 61. Type = cyclopeptide. Location: 975783563 - 988407895 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


The following known motifs were found in CDS RYX36_036201
Location between 978331198 and 978332110
FEPR was found 5 times in this sequence
Sequence:
MKDEEMPEGIQGLLQLKSEIAPLTNSKAQEQHAKGKCDQHPIPKILNFKEKKVVTEEFEPIPS
VTKYDGDGGYKNMKLPVNGEFEPIPSLTKYDGDENYKTMKLPVNGEFEPIPSLTKYDGDESYKT
MKLPVNNEFESIPSVTKYDDGEGYKSMKLPVNDEFEPIPSFLIYGEDDGYKSVKLPLNDEFEPR
PSATKYDGDEGYKSVKLPVNDEFEPRPSATKYDGDKGYKNMKLPVNGEFEPRPTATKYDGDEGY
KNMKLPVNGEFEPRPSATKYDGDEGHKNMKSSVNDEFEPRPSATKYND

Similar gene clusters

CM107122 - Cluster 62 - Saccharide

Gene cluster description

CM107122 - Gene Cluster 62. Type = saccharide. Location: 1061292328 - 1065133549 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107122 - Cluster 63 - Lignan

Gene cluster description

CM107122 - Gene Cluster 63. Type = lignan. Location: 1101473580 - 1103307652 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

CM107122 - Cluster 64 - Polyketide

Gene cluster description

CM107122 - Gene Cluster 64. Type = polyketide. Location: 1167891884 - 1171636233 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters