Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_081180.1:
Cluster 1Saccharide53463725502901156.53Glyco_hydro_1, adh_short_C2, p450---
Cluster 2Saccharide200617572013957877.822OG-FeII_Oxy, DIOX_N, Glyco_hydro_1, Peptidase_S10---
Cluster 3Saccharide505232315057911355.88Epimerase, UDPGT_2, p450flavonoid--
Cluster 4Saccharide6002300960201524178.51UDPGT_2, p450, polyprenyl_synt*saccharide--
The following clusters are from record NC_081181.1:
Cluster 5Cyclopeptide56567306588905002323.19BURP---
Cluster 6Cyclopeptide75593616783962452802.63BURP---
Cluster 7Fatty_acid-Saccharide9189393592303762409.83FA_hydroxylase, Methyltransf_11, UDPGT_2flavonoid, oleananes--
Cluster 8Saccharide106323187106763597440.41ADH_N, ADH_zinc_N, Epimerase, Methyltransf_7, UDPGT_2flavonoid, oleananes--
Cluster 9Saccharide196945090197089493144.40Cellulose_synt, p450---
Cluster 10Cyclopeptide204957152205770761813.61BURP---
The following clusters are from record NC_081182.1:
Cluster 11Cyclopeptide12838717143798831541.17BURP---
Cluster 12Cyclopeptide13423719144711711047.45BURP---
Cluster 13Cyclopeptide1399821114967403969.19BURP---
Cluster 14Fatty_acid4874037549028434288.06ADH_N, ADH_zinc_N, FA_desaturase_2, Peptidase_S10---
Cluster 15Fatty_acid-Saccharide-Transporter_associated6507289365704315631.42ADH_N, ADH_zinc_N, FA_desaturase_2, LTP_2, NAD_binding_1, Peptidase_S10, UDPGT_2cyanogenic glucoside-4, monoterpenoid-4--
Cluster 16Cyclopeptide6658296867432532849.56BURP---
Cluster 17Saccharide211940764212187324246.56Aminotran_1_2, Cellulose_synt, Epimerase---
The following clusters are from record NC_081183.1:
Cluster 18Saccharide1823692518580598343.672OG-FeII_Oxy, DIOX_N, Glycos_transf_1---
Cluster 19Cyclopeptide1202113411214594371248.10BURP---
Cluster 20Fatty_acid-Alkaloid-Terpene140501508140942755441.25BBE, Epimerase, FAD_binding_4, FA_hydroxylase, Terpene_synth, Terpene_synth_C---
Cluster 21Cyclopeptide178275720179082296806.58BURP---
Cluster 22Cyclopeptide1783240111797787101454.70BURP---
Cluster 23Cyclopeptide189111539189952100840.56BURP---
Cluster 24Cyclopeptide189372371190349773977.40BURP---
Cluster 25Polyketide197100185197239334139.153Beta_HSD, AMP-binding, Chal_sti_synt_C, FAE1_CUT1_RppA, Oxidored_FMN---
The following clusters are from record NC_081184.1:
Cluster 26Cyclopeptide75899235792332493334.01BURP---
Cluster 27Saccharide-Lignan107003866107293244289.38Dirigent, UDPGT_2, p450*saccharide--
Cluster 28Saccharide112839245113146412307.17AMP-binding, Amino_oxidase, Glyco_hydro_1, Lipoxygenase---
Cluster 29Terpene139867633140142542274.91Epimerase, Terpene_synth, Terpene_synth_C---
Cluster 30Fatty_acid-Alkaloid14752420114761549791.30BBE, FAD_binding_4, FA_hydroxylase---
Cluster 31Alkaloid147796592148042492245.90BBE, FAD_binding_4, Methyltransf_11---
Cluster 32Polyketide-Terpene157593453157715705122.25Chal_sti_synt_C, Chal_sti_synt_N, Methyltransf_11, Prenyltrans, Transferase---
Cluster 33Cyclopeptide159082940159926238843.30BURP---
The following clusters are from record NC_081185.1:
Cluster 34Phenolamide-Alkaloid-Saccharide2302861823315537286.92Glyco_hydro_1, Pyridoxal_deC, Transferase---
Cluster 35Fatty_acid7485400874968967114.96AMP-binding, Transferase, p450---
Cluster 36Saccharide-Lignan101845139102109089263.95Aldo_ket_red, Dirigent, Methyltransf_11, UDPGT_2flavonoid-4, oleananes-4--
Cluster 37Saccharide102177382102593696416.31Aldo_ket_red, Epimerase, Methyltransf_11, UDPGT_2, adh_short_C2flavonoid-5, oleananes-5--
Cluster 38Cyclopeptide1152798041164744291194.62BURP---
Cluster 39Fatty_acid130728644130841739113.09CER1-like_C, FA_hydroxylase, Lycopene_cycl, Methyltransf_11---
Cluster 40Saccharide15708284415716451281.67Acetyltransf_1, Epimerase, UDPGT_2small phenolic--
The following clusters are from record NC_081186.1:
Cluster 41Cyclopeptide132695933269.59BURP---
Cluster 42Fatty_acid3783359238035634202.04FA_desaturase, Methyltransf_11, Transferase---
Cluster 43Cyclopeptide47940489498705931930.10BURP---
Cluster 44Cyclopeptide50474192520649761590.78BURP---
Cluster 45Cyclopeptide5126964351945586675.94BURP---
Cluster 46Cyclopeptide7498954875629424639.88BURP---
Cluster 47Saccharide880782908812544147.15Acetyltransf_1, Lyase_aromatic, UDPGT_2---
Cluster 48Terpene8997536990375791400.42Terpene_synth, Terpene_synth_C, p450---
Cluster 49Saccharide984093349846500255.67UDPGT_2, p450small phenolic-4--
Cluster 50Fatty_acid-Saccharide100070768100246005175.24CER1-like_C, FA_hydroxylase, Methyltransf_11, UDPGT_2small phenolic--
Cluster 51Cyclopeptide103769367104272802503.44BURP---
Cluster 52Putative117897938118218863320.93ADH_N, ADH_zinc_N, AMP-binding, Abhydrolase_3, p450---
Cluster 53Alkaloid127926819128392737465.92Bet_v_1, NAD_binding_1, adh_short---
The following clusters are from record NW_026704941.1:
Cluster 54Saccharide10601321270765210.63AMP-binding, Amino_oxidase, Glyco_hydro_1, UDPGT_2---
The following clusters are from record NW_026704977.1:
Cluster 55Fatty_acid12143471345981131.63CER1-like_C, FA_hydroxylase, Lycopene_cycl, Methyltransf_11---
The following clusters are from record NW_026704983.1:
Cluster 56Cyclopeptide1338293338.29BURP---
The following clusters are from record NW_026704997.1:
Cluster 57Cyclopeptide23769117991121561.42BURP---
The following clusters are from record NW_026705087.1:
Cluster 58Saccharide205247463519258.27Cellulose_synt, Transferase---
The following clusters are from record NW_026705146.1:
Cluster 59Cyclopeptide111704911170.49BURP---
The following clusters are from record NW_026705156.1:
Cluster 60Polyketide7432071117375374.17Chal_sti_synt_C, Chal_sti_synt_N, adh_short---
The following clusters are from record NW_026705200.1:
Cluster 61Cyclopeptide1699046699.04BURP---
The following clusters are from record NW_026705238.1:
Cluster 62Fatty_acid-Saccharide-Transporter_associated260194915933655.74ADH_N, ADH_zinc_N, FA_desaturase_2, LTP_2, Lipoxygenase, NAD_binding_1, Peptidase_S10, UDPGT_2, adh_shortcyanogenic glucoside-8, monoterpenoid-8--
The following clusters are from record NW_026705502.1:
Cluster 63Cyclopeptide1437028437.03BURP---
The following clusters are from record NW_026705654.1:
Cluster 64Saccharide219592319723100.13Cellulose_synt, Transferase---
The following clusters are from record NW_026705869.1:
Cluster 65Fatty_acid-Alkaloid163682342477178.79BBE, FAD_binding_4, FA_hydroxylase---
The following clusters are from record NW_026705877.1:
Cluster 66Cyclopeptide1385462385.46BURP---
The following clusters are from record NW_026706016.1:
Cluster 67Polyketide-Alkaloid1303682303.68Chal_sti_synt_C, FAE1_CUT1_RppA, Pyridoxal_deC, p450---

NC_081180 - Cluster 1 - Saccharide

Gene cluster description

NC_081180 - Gene Cluster 1. Type = saccharide. Location: 5346372 - 5502901 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081180 - Cluster 2 - Saccharide

Gene cluster description

NC_081180 - Gene Cluster 2. Type = saccharide. Location: 20061757 - 20139578 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081180 - Cluster 3 - Saccharide

Gene cluster description

NC_081180 - Gene Cluster 3. Type = saccharide. Location: 50523231 - 50579113 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081180 - Cluster 4 - Saccharide

Gene cluster description

NC_081180 - Gene Cluster 4. Type = saccharide. Location: 60023009 - 60201524 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081181 - Cluster 5 - Cyclopeptide

Gene cluster description

NC_081181 - Gene Cluster 5. Type = cyclopeptide. Location: 56567306 - 58890500 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131649101
Repeat occurs 6 times in a sequence of 512 amino acids
Location between 56633269 and 56634808
Coverage of 7.03 %
Instances:
RSPPRR | RSPPRK | RSPSSS | RSPPKR | RSPLRK
RSPLSE |
pattern: RSP[SLP][SKR][SKRE]
MAGNNNNTVLVPTPDPFRLTEMIEESNPALHRRGGRRRSAEGNGRSGHETPQSIRRPPPQHVE
QAQNDEAEQLQPQNNGEETARNDRNAELIQTQPEQPLTRVQHETDHRDGQTASSSTRSLERPRT
HHRQTRDATDARGETDPSTLVILKELQRTNRLIRLQGERIEKLEKRHRHRSPPRRHRRSRSSSS
RSPPRK
TRRRSPSSSRSPPKRSRRQRSYSRSPLRKSKKNRRPEPTEEESPSPDREERRLARKGR
KPRKEEPARRNTRSISPGASDEEEFRSPLSESIRRARLPRGMEKPPTLDAYDGTTDPDDHIRNL
EAVMEYHAVHGSIKCRIFPTTLRKGAMTWYRNLPPNSITSWTELKELFLSHFTASRRQPKSEAN
LEAVVQGTNEPLRDYLDRFNKEAVQVETTDYMKRYLLERGLLPDSEFRKAIKIEKMRSMNAILK
RAQAFISFEEGEAAAVKASRGNGVTRSSSQDPSAAPRANERKRDDRSRDTKERRGPAGRFNDYT
P

Similar gene clusters

NC_081181 - Cluster 6 - Cyclopeptide

Gene cluster description

NC_081181 - Gene Cluster 6. Type = cyclopeptide. Location: 75593616 - 78396245 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_081181 - Cluster 7 - Fatty_acid-saccharide

Gene cluster description

NC_081181 - Gene Cluster 7. Type = fatty_acid-saccharide. Location: 91893935 - 92303762 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081181 - Cluster 8 - Saccharide

Gene cluster description

NC_081181 - Gene Cluster 8. Type = saccharide. Location: 106323187 - 106763597 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081181 - Cluster 9 - Saccharide

Gene cluster description

NC_081181 - Gene Cluster 9. Type = saccharide. Location: 196945090 - 197089493 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081181 - Cluster 10 - Cyclopeptide

Gene cluster description

NC_081181 - Gene Cluster 10. Type = cyclopeptide. Location: 204957152 - 205770761 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131651136
Repeat occurs 6 times in a sequence of 358 amino acids
Location between 205229592 and 205235018
Coverage of 11.73 %
Instances:
RSPPRRH | RSPLRRH | RSPSSSR | RSPPRRH | RSPPRKS
RSPLSED |
pattern: RSP[SLP][SR][SKRE][SDRH]
MANNIPSSDPFLLEHLIEDSTREVTNRRRQEEPQHALAGQGRSRQETSQPRRQRVQNVQQLQG
LQQVQNVEQNPPEGNVQNGEDQQARTHNGPERPQNENETDARDGHAASSFTHTSDRRRTRHVEE
EEPLNIPEDADPVTVLLLKELQKTNSLLRLQDDRIHELERKRRYRSPPRRHYQSRSYSSSRSPL
RRH
RRRSPSSSRSPPRRHRRRRSYSRSPPRKSRKNQRPEATEARSLSPEQGHQGPSKAVLKPRE
RPSPRDNRAGPNNDQERPRRGRHSTSPRPSDEEDFRSPLSEDIRRARLPRGMEKPPALDQYDGT
TDPDDHIRSIEAKIPGIRGQTRDEAALHFGRAPPNQRTG

Similar gene clusters

NC_081182 - Cluster 11 - Cyclopeptide

Gene cluster description

NC_081182 - Gene Cluster 11. Type = cyclopeptide. Location: 12838717 - 14379883 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131660339
Repeat occurs 7 times in a sequence of 1198 amino acids
Location between 12863697 and 12874373
Coverage of 3.51 %
Instances:
PPQINL | PPQRQN | PPQSQG | PPQSQG | PPQSQG
PPQSQG | PPQSMP |
pattern: PPQ[SIR][NQM][NGLP]
MAAPNMDQFEAFFRRADLDGDGRISGAEAVGFFQGSNLPKHVLAQVWMHADQSKTGFLGRNDF
YNALRLVTVAQSKRDLTPDIVKAALFGPAAAKIPPPQINLAALPPQRQNPGVASSVGQMGVNAG
PTSTQNLAYRGQGLTGSAVNPQYLPSQQNANMRPPQSQGLAGSAANQQYLPSQQNANTRPPQSQ
G
LAGSVANQQYLPSQQSPNMRPPQSQGFSGFVPSPQYLPSQQSPNMRPPQSQGLPGSVANPQYF
PSQQNPTMRPTQSLPAGSAPGPQQFMPAGSAPGTQQFMPAGNTRPPQSMPAGAAPRPQQSFAGN
NVSNASISNDWNGGRTAMAPSRPAGVTQSVALPTPTSPSPVSPMSQPPAITSKALTVSGNGYPS
NSALDDDFFSSAASTPKQDPVRQNFPVSSAPASSSIVPVSSSANPASRQSSLDSLQSAFSMALT
NSQIPRTQSLPNTAQQISPSASSPLTTSGRSVGLGNTTSDNSQPPWPKMKPSDVQKYAKVFMEV
DTDRDGKITGEQARSLFLSWRLPIDVLKKVWDLSDQDSDSMLSLKEFCFALYLMERYREGRPLP
QSLPSNVIFDETLMSMTGHPKIPQGNAAWGVGPGFQQQQGMPGTRPVAPAAGLRPPVQGTPGWA
DGTVLPDQKKSGTSVLDDSFLNDTDNSNQNIETAEKKAEEMQNMILDSKEKIELYRNKMQELVL
YKSRCDNRLNEITERASADKREAESLGKKYEEKYKQVAEIASKLTIEEAKLRDIQERKTELQQA
IVKMEQGGSADGILQVRADRIQSDLEQLFKALAERCKKHEIDVKSIAMVQLPDGWQPGNPEGAA
VWDEDWDKFEDEGFANDLTFDTKSASSEPKPSFIPGEQNSFDDNPVHGSPVNANGRQENFANGD
YTAEEESSYAHSEDDLARSPRDSPFVRNTVESPSKEFSTSPFAKSSEADAETHRSFDESTWGAF
DNNDDVDSVWGFNPKDSDLEKQGDFFQSGDFGLNPVRTGTTFTDGASQAKSPFAFDDSVPGTPF
SKFGNSPRYSEAGDHFDRSMFDSSFSMHESGNSPQAERFTRFDSISSSRDFGNNFGNNFGSDHE
KFSRFDSISSSKDFGYNPDKFSRFDSMSSSKDFGYKPETLTRFDSMSSSKDFGFGGGQGHARFD
SISSSKDFGYSAPFSFDDSDPFGSSGPFKVSSENQSPKKDSDKWSAF
Repeat found in LOC131657576
Repeat occurs 4 times in a sequence of 616 amino acids
Location between 13441381 and 13443924
Coverage of 9.09 %
Instances:
MQGVRDMQGVRDMQ | MQGVRDMQGVRDMQ | MQGVRDMQGVHENT | MQGVHENTQNNVME |
pattern: MQGV[RH][DE][NM][TQ][GQ][VN][NRH][VDE][NM][TEQ]
MQGVRDMQGVRDMQGVRDMQGVHENTQNNVMEQEPPAATAPAATTPIEVCTNQPPIGRKRRRA
NANAIRKRSEVWDHFNLIPDSDPATAACKYCHSKYMCDSKKHGTSNLKSHMKTCPKYPLNLSTD
PTQTVLTYSTIPGVGLVPTSSRFDPVGCRKGLAYFIILDEKPFRTVEGEGFKYFCYQMQPQFTI
PSRRTIARDCFQIYLDEKVRLKALFGSDCSRVAITTDCWTSVQNLNYLTLTAHFIDRDWQYQKR
IISFAIIPNHRGKTVGKKVEDVLKEWGLRNVSTITVDNAASNDVAVKYLEQKIRNMNGLFMDGF
GFHMRCCAHILNLVVRDGLKVASTSISSVRNAIRYVRSSPHRALKFNECVGYANITCKKSVCLD
VSTRWNSTYLMLDAAEKYETAFDKLEDEVEDYRDFFEGDSPPSSEDWENVRVFINFLKNYYEAT
TVFSVSTKASLHTAFPFLAGIYVELKSLNLDLNGLFASVAKEMLDKFKKYWIDITKMNQLLYLG
VIFDPRYKLRFVDWCFTDMYGPLSETKKSLLEEINNNLSKIFMVYKQAYDNVEGPVPVMAATVS
NGETAANIEVPAHVARQNAFQQHLKSMDSVEERIPSSVPNC
Repeat found in LOC131660368
Repeat occurs 22 times in a sequence of 742 amino acids
Location between 13454674 and 13457362
Coverage of 29.65 %
Instances:
YLDGWLKNTP | YLDGWLKNTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDNR
YLDGWLKDTQ | YLDGWLKDTR | YLDGWLKDTQ | YLDGWLKDIR | YLDGWLKDSH
YLDGWLKDSH | YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[DIN][TSFIN][RHKPQ]
MAHKVVMSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKNQISSTDSNQVYLDGWL
KNTR
DEKAKVNPDSNQVYLDGWLKDTRDEKVKVNPDSNQVYLDGWLKDTRDEKAKANPNLEQVY
LDGWLKDTR
AEKAQANPDSNQVYLDGWLKDTRAKKEKVNPDSHQVYLDGWLKDTRADKTKVNPN
SNQVYLDGWLKDTRAENAKSNSDSNQVYLDGWLKDTRVENVKSNPDSNQVYLDGWLKDTRAEKE
KVNPDSNQVYLDGWLKDTRDENAKSNPDSNQVYLDGWLKDTRAENANSNQVYLDGWLKDTRAEK
VKANLDSNQVYLDGWLKDTRAEKEKFNPDSNQVYLDGWLKDNRDENAKSNPDSNQVYLDGWLKD
TQ
AKNADSNQVYLDGWLKDTRAENAKSNLDSNQVYLDGWLKDTQALKEKLTLDSNQLYLDGWLK
DIR
VGKEMSSPNSNQVYLDGWLKDSHAENSKQAYLDGWLKDSHAKNHIKIGQDLAESNRKLPSK
VDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYAKFLPKKVADDIPVSKSQIPSLLDLFSLTKDS
PQGEDMIGIINQCEYEPQKGETKACPTSLESMVEFVHSVIGADTKYDIHSTSYPTTSAVPLQNY
TVLDISEDIYAPKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKGQYGDTMDALGICHLDTSAM
NPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131660367
Repeat occurs 23 times in a sequence of 761 amino acids
Location between 13471307 and 13474113
Coverage of 30.22 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTQ | YLDGWLKDTR | YLDGWLKDTL | YLDGWLKDTQ | YLDGWLKDIR
YLDGWLKDSH | YLDGWLKVSH | YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[DINV][TISF][RHKPLQ]
MAHRVVLSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKDQISSPDSNQVYLDGWL
KDTR
DEKAKVNPDSNQVYLDGWLKDTRDEKVKVNPDSNQVYLDGWLKDTRDEKAKVNSNLEQVY
LDGWLKDTR
AEKAQANPDSNQVYLDGWLKDTRAEKEKVNPDSHQVYLDGWLKDTRAEKTKANLN
SNQLYLDGWLKDTRAENAKSNSDSNQVYLDGWLKDTRVENAKYNPDSNQVYLDGWLKDTRAEKE
KVNPDSNQVYLDGWLKDTRDENAKSNPDSNQVYLDGWLKDTRAENANSNQIYLDGWLKDTRAEK
VKANLDSNQVYLDGWLKDTRAEKEKVNPDSNQVYLDGWLKDTRNENAKSNPDSNQVYLDGWLKD
TQ
AKNANSNQVYLDGWLKDTRAENAKSNQVYLDGWLKDTLAENAKSNLDSNQVYLDGWLKDTQA
LKEKSTLDSNQLYLDGWLKDIRVGKEMSSPNSNQVYLDGWLKDSHAENSKQAYLDGWLKVSHAK
NHIKIGQDLAESNRKLPSKVDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYAKFLPKKVADDIP
VSKSQIPSLLDLFSLTKDSPQGEDMIGIINQCEYEPQKGETKACSTSLESMVEFVHSVIGADTK
YDIHSTSYPTTSAVPLQNYTVLDISEDIYAPKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKG
QYGDTMDALGICHLDTSAMNPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131660373
Repeat occurs 16 times in a sequence of 604 amino acids
Location between 13525543 and 13527875
Coverage of 26.49 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTG | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKNTQ | YLDGWLKDIR | YLDGWLKDSH | YLDGWLKDSH
YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[DIN][TISF][GRHKPQ]
MTHRVVLSLLPFLLLWLINNHGSFARDMNQVDQPYLDGWLKNTPLKNQIPSPDSNQVYLDGWL
KDTR
GEKEKVNFDSNQVYLDGWLKDTGSEKAKANPDSNQVYLDGWLKDTRVENAKSNPDSNQVY
LDGWLKDTR
DENAKSNPDSNQVYLDGWLKDTRAENANSNHVYLDGWLKDTRDEKVIANLDSNQV
YLDGWLKDTR
AEKEKVNPNSNQVYLDGWLKDTRNEKAKANPESNQVYLDGWLKDTRAENTDSNQ
VYLDGWLKDTRAENEKSNLVSNQVYLDGWLKNTQALKEKPTLDSNQLYLDGWLKDIRVGKEISS
PNSNQVYLDGWLKDSHVENSKQAYLDGWLKDSHANNHIKTGQAYAESDRKLPSKVDHTEAFKVF
FFGIEDLYVGSVMTLQFPIVEYANFLPKKVADDIPVSKSQIPSLLDLFSLTKDSPKGEDMIGII
NQCEYEPQKGETKACPTSLESMVEFVHSVIGADTKYDIHSTSYPTTSAVPLQNYTVLDISEDIY
APKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKGQYGDIMNALGICHLDTSAMNPTHFYFDLL
GMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131660374
Repeat occurs 18 times in a sequence of 470 amino acids
Location between 13532827 and 13534870
Coverage of 38.3 %
Instances:
YLDGWLKNTQ | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWFKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDMR | YLDGWLKDTR |
pattern: YLDGW[LF]K[DN][TIM][RQ]
MAHRVVLSLLPFLMLSLINDHGSIARDLNQVDQPYLDGWLKNTQHQQHSPDSNQVYLDGWLKD
TR
VEKVKSNPHSNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRAEKAKLNPDSKQVYLD
GWLKDTR
SEKAKVNPNSNQVYLNGWLKDTRAEREKFNSDSNQVYLDGWLKDTRAEKEKSNSNSN
QVYLDGWLKDTRGEKAKADPGSNQVYLDGWFKDTRSEQVKVNANSNQVYLDGWLKDTRSKKVNP
YSKQVYLDGWLKDTRAEKEKSNSDSNQVYLDGWLKDTRDEKANANPGSTQVYLDGWLKDIRSKQ
VKINPNSNQVYLDGWLKDTRAKNVNPDSKPVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDIR
AEKEKSTPDSNQVYLDGWLKDTRAEKEKANSDSNQVYLDGWLKDMRAEKAKVNPNSNQVYLDGW
LKDTR
VKIKNQTLTPTKFSLMDG
Repeat found in LOC131660374
Repeat occurs 19 times in a sequence of 493 amino acids
Location between 13532827 and 13534870
Coverage of 38.54 %
Instances:
YLDGWLKNTQ | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWFKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDMR | YLDGWLKDTR |
pattern: YLDGW[LF]K[DN][TIM][RQ]
MAHRVVLSLLPFLMLSLINDHGSIARDLNQVDQPYLDGWLKNTQHQQHSPDSNQVYLDGWLKD
TR
VEKVKSNPHSNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRAEKAKLNPDSKQVYLD
GWLKDTR
SEKAKVNPNSNQVYLNGWLKDTRAEREKFNSDSNQVYLDGWLKDTRAEKEKSNSNSN
QVYLDGWLKDTRGEKAKADPGSNQVYLDGWFKDTRSEQVKVNANSNQVYLDGWLKDTRSKKVNP
YSKQVYLDGWLKDTRAEKEKSNSDSNQVYLDGWLKDTRDEKANANPGSTQVYLDGWLKDIRSKQ
VKINPNSNQVYLDGWLKDTRAKNVNPDSKPVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDTR
AEKVKSAPESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRAEKEKANSDSNQVYLDGW
LKDMR
AEKAKVNPNSNQVYLDGWLKDTRVKIKNQTLTPTKFSLMDG
Repeat found in LOC131654935
Repeat occurs 20 times in a sequence of 483 amino acids
Location between 13578340 and 13580345
Coverage of 33.13 %
Instances:
DGWLKNIQ | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDIR | DGWLKDTR | DGWLKDTR | DGWLKDMR

pattern: DGWLK[DN][TIM][RQ]
MTHRVVMSLLPFLLLLLINGHGIFGRDLNQVDQPYLDGWLKNIQLKKQKPSPDSNQVYLDGWL
KDTR
AEKVKSNSDSNQVYLDGWLKDTRTEKAKVNLDSNQVYLDGWLKDTRVEKAKSNPEPNQVY
LDGWLKDTRTEKAKVNLDSNQVYLDGWLKDTRAEEAKANSDSKQVYLDGWLKDTRSEQVKVNPN
SNQVYLDGWLKDTRATKSNPDSKQVYLDGWLKDTRAEKTKSNSNSNQVYLDGWLKDTRAEKEKS
NSDSNQVYIDGWLKDTRGDKAKANPDSTQVYLDGWLKDTRSEQVKVNPNSNQVYLDGWLKDTRA
KKAKANPNSKQVYLDGWLKDTRDEKEKSTPNSNQVYLDGWLKDTRAEKAKSNSESNQVYIDGWL
KDTR
AEKVKSNPESNQVYLDGWLKDIRTEKEKSTPDSNQVYLDGWLKDTRDEKAKVNLDSNQVY
LDGWLKDTRAEKAKANSDSNQVYLDGWLKDMRAEKA
Repeat found in LOC131654936
Repeat occurs 14 times in a sequence of 581 amino acids
Location between 13619611 and 13621851
Coverage of 24.1 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTQ
YLDGWLKVRQ | YLDGWLKDSH | YLDGWLKNSN | YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[DINV][TRSFI][RHKPNQ]
MAHKVILSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKNQIPSPDSNQVYLDGWL
KDTR
GEKAKVNPDSNQVYLDGWLKDTRDEKAKANPDSNQVYLDGWLKDTRDEKAKANPDSNQVY
LDGWLKDIR
AEKVKANPDSNHVFLDGWLKDTKAEKVKDNLDSNQVYLDGWLKDTRAKKEKVNPN
SNQVYLDGWLKDTRDENAKFNSDSTQVYLDGWLKDTRAENADSNQVYLDGWLKDTRAGNANSNQ
VYLDGWLKDTQALKEKSTLDSNQLYLDGWLKVRQVGKEISSPNSNQVYLDGWLKDSHAENSKQA
YLDGWLKNSN
AKNHIKIGQDLVESNRKQPSKVDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYA
NFLPKKVADDIPVSKSQIPSLLDLFSLTKDSPQGEDMIGIINQCEYEPQKGETKACPTSLESMV
EFVHSVIGADTKYDIHSTSYPTTSATPLQNYTVLEISEDIYAPKWVACHPRPYPYLLHYCHYLD
IGSKIFK
VLLKGEYGDTMDALGICHLDTSAMNPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPR
PPVATK
Repeat found in LOC131654938
Repeat occurs 18 times in a sequence of 435 amino acids
Location between 13654539 and 13656416
Coverage of 49.66 %
Instances:
YLDGWLKNTQLK | YLDGWLKDTRVE | YLDGWLKDTRGE | YLDGWLKDTRVE | YLDGWLKDTRSE
YLDGWLKDTRAE | YLDGWLKDTRAE | YLDGWLRDTRGE | YLDGWLKDTRAK | YLDGWLKDTRVE
YLDGWLKDTRAE | YLDGWLKDIRDE | YLDGWLKDTRAE | YLDGWLKDTRAE | YLDGWVKDMRAE
YLDGWLKDTRAG | YLDGWLKDIRAE | YLDGWLKDTRAE |
pattern: YLDGW[VL][KR][DN][TIM][RQ][GDSAVL][GKE]
MTHRVVLSLLPFLMLSLINDHGSFARDLNQADQLYLDGWLKNTQLKNQQHSPGSNQVYLDGWL
KDTRVE
KVKSNPDFNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRVEKAKVNHDSKQVY
LDGWLKDTRSE
KAKVNPNSNQVYLDGWLKDTRAEKPKFNSDSNQVYLDGWLKDTRAEKEKSNSD
SNQVYLDGWLRDTRGEKEKSNPNSNQVYLDGWLKDTRAKKANIDSKQVYLDGWLKDTRVEKAES
NSDSSQVYLDGWLKDTRAEKVKSVPESNQVYLDGWLKDIRDEKEKSTPNSNQVYLDGWLKDTRA
E
KVKANSELNQVYLDGWLKDTRAEKEKANSNSNHVYLDGWVKDMRAEKEKVNPNSNQVYLDGWL
KDTRAG
NTKSNTGSNQVYLDGWLKDIRAEKEKSAFDSKQVYLDGWLKDTRAE
Repeat found in LOC131654939
Repeat occurs 23 times in a sequence of 554 amino acids
Location between 13690775 and 13693039
Coverage of 45.67 %
Instances:
DGWLKNPQLRN | DGWLKDTRVEK | DGWLKDTRAEK | DGWLKDTRGEK | DGWLKDTRVEK
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRANP | DGWLKDTRAEK
DGWLKDTRAEK | DGWLKDTRGEK | DGWLKDTRSDQ | DGWLKDSRAKK | DGWLKDTRAEK
DGWLKDTRAKK | DGWLKDTRAEK | DGWLKDIRSEK | DGWLKDTRAEK | DGWLKDTRAKK
DGWLKDMRAEK | DGWLKDTRDLK | DGWLKDTRVEK |
pattern: DGWLK[DN][TSMPI][RQ][GDSAVL][DRKLNE][QNKP]
MAYRVVLSLLPFLMLSLINDHGSFARDLNQVDQPYLDGWLKNPQLRNQQPSPDSNQVYLDGWL
KDTRVEK
AKSNPNSNQVYLDGWLKDTRAEKAKVNLDSNQVYLDGWLKDTRGEKEKANTDSTQVY
LDGWLKDTRVEKAKINPDSTQVYLDGWLKDTRAEKSKVIPDSKQVYIDGWLKDTRAEKEIANPN
SNQIYLDGWLKDTRAEKVNAHPKSNQVYLDGWLKDTRANPDSKQVYLDGWLKDTRAEKAKSNSD
SNQVYLDGWLKDTRAEKEKSNSDSNIVYLDGWLKDTRGEKAKTNPDSTQVYLDGWLKDTRSDQV
KVNPNSNQVYLDGWLKDSRAKKANPDSKQVYLDGWLKDTRAEKENSTPDSNQVYLDGWLKDTRA
KK
AKSNSDSNQVYLDGWLKDTRAEKVKSTQESNQVYLDGWLKDIRSEKEKSTPDSNQVYLDGWL
KDTRAEK
AKANSESNQVYLDGWLKDTRAKKAESNFDSNQVYLDGWLKDMRAEKAKVNPNSNQVY
LDGWLKDTRDLKEKSTLDSKQVYLDGWLKDTRVEKENLSPNSN
Repeat found in LOC131654943
Repeat occurs 14 times in a sequence of 420 amino acids
Location between 13814614 and 13817152
Coverage of 40.0 %
Instances:
FEPRPNLSSYGE | FEPKPNVSSYGE | FEPRPNLSSYGE | FEPRPNLSSYGE | FEPRPNLSLCGE
FEPRPNLSSYGE | FEPRPNLSLCGE | FEPRPNLSSYGE | FEPRPNVSSYGE | FEPRPNLSSYGE
FEPRPNLSSYAE | FEPKPVVSAYGD | FEPRPNVSAYEN | FEPRPSLTKDDA |
pattern: FEP[KR]P[VSN][VL][TS][SKLA][DYC][DGEA][DEAN]
The following known motifs were found:
VS[AI]Y was found 2 times in this sequence
FEPR was found 12 times in this sequence
MMRPALALLPLLLLLIVGIVESRIDLGEYWKVVVKKQNMVEEIQELLDANTKHNFKTLKQSFD
GKEKKKALKDFEPRPNLSSYGEPNLSSYGENDIDDKENKGSIEDFEPKPNVSSYGENDIDAKEK
KGAIENFEPRPNLSSYGEPNLSSYGENDIDDKENKGPIENFEPRPNLSSYGEPNLSSYGENKID
AKEKKGAIEDFEPRPNLSLCGEPNLSLCGENNIDDKENKGPIEDFEPRPNLSSYGEPNLSSYGE
NDIDDKEKKGVIEDFEPRPNLSLCGEPNLSLCGENNIDDKENKGPIEDFEPRPNLSSYGEPNLS
SYGENDIDDKEKKGVIEDFEPRPNVSSYGEPNVSSYGENDIDDKENKGPIEDFEPRPNLSSYGE
PNLSSYGENDINAKEKKRDAIEDFEPRPNLSSYAEPNLSSYAENEIHAKKRKGAIEDFEPKPVV
SAY
GDGDNNIDAEEKNKAFKDFEPRPNVSAYENPNVSAYENNDIDVEYTKDFEPRPSLTKDDA
Repeat found in LOC131654944
Repeat occurs 6 times in a sequence of 210 amino acids
Location between 13829284 and 13830780
Coverage of 37.14 %
Instances:
DFEPRPNISAYED | DFEPRPNVSAYGD | DFEPRPNISAYGD | DFEPRPNISSYGN | DFEPRPNISAYGN
DFEPRPSLTKYDA |
pattern: DFEPRP[SN][IVL][TS][SKA]Y[DGE][DAN]
The following known motifs were found:
FEPR was found 6 times in this sequence
MRPALALLPLFLLLIVAVVESRKDVGEYWKLVMKEQNMPEEIQGLLDANNKNNLKTLKQSFDD
KEKKNTVKDFEPRPNISAYEDPNISAYEDEDVDVEEKKIGVEDFEPRPNVSAYGDPNVSAYGDK
SIDAEEKKKGEEDFEPRPNISAYGDPNISAYGDNGIEDSKKNNVDQDFEPRPNISSYGNPNISS
YGNNDIGNKEKEKAVEDFEPRPNISAYGNPNISAYGNNDIHAEFKEDFEPRPSLTKYDA
Repeat found in LOC131654957
Repeat occurs 22 times in a sequence of 531 amino acids
Location between 14321941 and 14324104
Coverage of 24.86 %
Instances:
WLKNPE | WLKDTR | WLKDTR | WLKDTR | WLKDSR
WLKDTR | WLKDTR | WLKDTR | WLKDTR | WLKDTQ
WLKDTR | WLKDTR | WLKDSR | WLKDTR | WLKDTR
WLKDTR | WLKDIL | WLKDTR | WLKDTR | WLKDMR
WLKDTR | WLKDTL |
pattern: WLK[DN][TSMPI][REQL]
MAYRVVLSLLPFLMLSLINDHGSFARHLNQVDQPYLDGWLKNPELRNQQPSPDSNQVYLDGWL
KDTR
VEKAKSNPNSNQVYLDGWLKDTRAEKAKVNLDSNQVYLDGWLKDTRAEKAKINLDSTQVY
LDGWLKDSRAEKSKVIPDSKQVYIDGWLKDTRAEKEIANPNSNQVYLDGWLKDTRAEKVNANPK
SNQVYLDGWLKDTRANPDSKQVYLDGWLKDTRDEKAKSNSDSNQVYLDGWLKDTQTEKEKSNSD
SNIVYLDGWLKDTRGEKAKTNPDSTQVYLDGWLKDTRSDQVKVNPNSNQVYLDGWLKDSRAKKA
NPDTKQGYLDGWLKDTRAKKENSIPDSNQVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDTRD
EKVKSTHESNQVYLDGWLKDILSEKEKSTPDSNQVYLDGWLKDTRVEKAKANSESNQVYLDGWL
KDTR
AKKVESNFDSNQVYLDGWLKDMRAEKAKVNPNSNQVYLDGWLKDTRDLKEKSTLDSKQVY
LDRWLKDTLAVKENSSPNSN

Similar gene clusters

NC_081182 - Cluster 12 - Cyclopeptide

Gene cluster description

NC_081182 - Gene Cluster 12. Type = cyclopeptide. Location: 13423719 - 14471171 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131657576
Repeat occurs 4 times in a sequence of 616 amino acids
Location between 13441381 and 13443924
Coverage of 9.09 %
Instances:
MQGVRDMQGVRDMQ | MQGVRDMQGVRDMQ | MQGVRDMQGVHENT | MQGVHENTQNNVME |
pattern: MQGV[RH][DE][NM][TQ][GQ][VN][NRH][VDE][NM][TEQ]
MQGVRDMQGVRDMQGVRDMQGVHENTQNNVMEQEPPAATAPAATTPIEVCTNQPPIGRKRRRA
NANAIRKRSEVWDHFNLIPDSDPATAACKYCHSKYMCDSKKHGTSNLKSHMKTCPKYPLNLSTD
PTQTVLTYSTIPGVGLVPTSSRFDPVGCRKGLAYFIILDEKPFRTVEGEGFKYFCYQMQPQFTI
PSRRTIARDCFQIYLDEKVRLKALFGSDCSRVAITTDCWTSVQNLNYLTLTAHFIDRDWQYQKR
IISFAIIPNHRGKTVGKKVEDVLKEWGLRNVSTITVDNAASNDVAVKYLEQKIRNMNGLFMDGF
GFHMRCCAHILNLVVRDGLKVASTSISSVRNAIRYVRSSPHRALKFNECVGYANITCKKSVCLD
VSTRWNSTYLMLDAAEKYETAFDKLEDEVEDYRDFFEGDSPPSSEDWENVRVFINFLKNYYEAT
TVFSVSTKASLHTAFPFLAGIYVELKSLNLDLNGLFASVAKEMLDKFKKYWIDITKMNQLLYLG
VIFDPRYKLRFVDWCFTDMYGPLSETKKSLLEEINNNLSKIFMVYKQAYDNVEGPVPVMAATVS
NGETAANIEVPAHVARQNAFQQHLKSMDSVEERIPSSVPNC
Repeat found in LOC131660368
Repeat occurs 22 times in a sequence of 742 amino acids
Location between 13454674 and 13457362
Coverage of 29.65 %
Instances:
YLDGWLKNTP | YLDGWLKNTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDNR
YLDGWLKDTQ | YLDGWLKDTR | YLDGWLKDTQ | YLDGWLKDIR | YLDGWLKDSH
YLDGWLKDSH | YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[DIN][TSFIN][RHKPQ]
MAHKVVMSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKNQISSTDSNQVYLDGWL
KNTR
DEKAKVNPDSNQVYLDGWLKDTRDEKVKVNPDSNQVYLDGWLKDTRDEKAKANPNLEQVY
LDGWLKDTR
AEKAQANPDSNQVYLDGWLKDTRAKKEKVNPDSHQVYLDGWLKDTRADKTKVNPN
SNQVYLDGWLKDTRAENAKSNSDSNQVYLDGWLKDTRVENVKSNPDSNQVYLDGWLKDTRAEKE
KVNPDSNQVYLDGWLKDTRDENAKSNPDSNQVYLDGWLKDTRAENANSNQVYLDGWLKDTRAEK
VKANLDSNQVYLDGWLKDTRAEKEKFNPDSNQVYLDGWLKDNRDENAKSNPDSNQVYLDGWLKD
TQ
AKNADSNQVYLDGWLKDTRAENAKSNLDSNQVYLDGWLKDTQALKEKLTLDSNQLYLDGWLK
DIR
VGKEMSSPNSNQVYLDGWLKDSHAENSKQAYLDGWLKDSHAKNHIKIGQDLAESNRKLPSK
VDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYAKFLPKKVADDIPVSKSQIPSLLDLFSLTKDS
PQGEDMIGIINQCEYEPQKGETKACPTSLESMVEFVHSVIGADTKYDIHSTSYPTTSAVPLQNY
TVLDISEDIYAPKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKGQYGDTMDALGICHLDTSAM
NPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131660367
Repeat occurs 23 times in a sequence of 761 amino acids
Location between 13471307 and 13474113
Coverage of 30.22 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTQ | YLDGWLKDTR | YLDGWLKDTL | YLDGWLKDTQ | YLDGWLKDIR
YLDGWLKDSH | YLDGWLKVSH | YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[DINV][TISF][RHKPLQ]
MAHRVVLSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKDQISSPDSNQVYLDGWL
KDTR
DEKAKVNPDSNQVYLDGWLKDTRDEKVKVNPDSNQVYLDGWLKDTRDEKAKVNSNLEQVY
LDGWLKDTR
AEKAQANPDSNQVYLDGWLKDTRAEKEKVNPDSHQVYLDGWLKDTRAEKTKANLN
SNQLYLDGWLKDTRAENAKSNSDSNQVYLDGWLKDTRVENAKYNPDSNQVYLDGWLKDTRAEKE
KVNPDSNQVYLDGWLKDTRDENAKSNPDSNQVYLDGWLKDTRAENANSNQIYLDGWLKDTRAEK
VKANLDSNQVYLDGWLKDTRAEKEKVNPDSNQVYLDGWLKDTRNENAKSNPDSNQVYLDGWLKD
TQ
AKNANSNQVYLDGWLKDTRAENAKSNQVYLDGWLKDTLAENAKSNLDSNQVYLDGWLKDTQA
LKEKSTLDSNQLYLDGWLKDIRVGKEMSSPNSNQVYLDGWLKDSHAENSKQAYLDGWLKVSHAK
NHIKIGQDLAESNRKLPSKVDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYAKFLPKKVADDIP
VSKSQIPSLLDLFSLTKDSPQGEDMIGIINQCEYEPQKGETKACSTSLESMVEFVHSVIGADTK
YDIHSTSYPTTSAVPLQNYTVLDISEDIYAPKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKG
QYGDTMDALGICHLDTSAMNPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131660373
Repeat occurs 16 times in a sequence of 604 amino acids
Location between 13525543 and 13527875
Coverage of 26.49 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTG | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKNTQ | YLDGWLKDIR | YLDGWLKDSH | YLDGWLKDSH
YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[DIN][TISF][GRHKPQ]
MTHRVVLSLLPFLLLWLINNHGSFARDMNQVDQPYLDGWLKNTPLKNQIPSPDSNQVYLDGWL
KDTR
GEKEKVNFDSNQVYLDGWLKDTGSEKAKANPDSNQVYLDGWLKDTRVENAKSNPDSNQVY
LDGWLKDTR
DENAKSNPDSNQVYLDGWLKDTRAENANSNHVYLDGWLKDTRDEKVIANLDSNQV
YLDGWLKDTR
AEKEKVNPNSNQVYLDGWLKDTRNEKAKANPESNQVYLDGWLKDTRAENTDSNQ
VYLDGWLKDTRAENEKSNLVSNQVYLDGWLKNTQALKEKPTLDSNQLYLDGWLKDIRVGKEISS
PNSNQVYLDGWLKDSHVENSKQAYLDGWLKDSHANNHIKTGQAYAESDRKLPSKVDHTEAFKVF
FFGIEDLYVGSVMTLQFPIVEYANFLPKKVADDIPVSKSQIPSLLDLFSLTKDSPKGEDMIGII
NQCEYEPQKGETKACPTSLESMVEFVHSVIGADTKYDIHSTSYPTTSAVPLQNYTVLDISEDIY
APKWVACHPRPYPYLLHYCHYLDIGSKIFKVLLKGQYGDIMNALGICHLDTSAMNPTHFYFDLL
GMKPGEGPLCHFFPVKHLLWVPRPPAATK
Repeat found in LOC131660374
Repeat occurs 18 times in a sequence of 470 amino acids
Location between 13532827 and 13534870
Coverage of 38.3 %
Instances:
YLDGWLKNTQ | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWFKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDMR | YLDGWLKDTR |
pattern: YLDGW[LF]K[DN][TIM][RQ]
MAHRVVLSLLPFLMLSLINDHGSIARDLNQVDQPYLDGWLKNTQHQQHSPDSNQVYLDGWLKD
TR
VEKVKSNPHSNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRAEKAKLNPDSKQVYLD
GWLKDTR
SEKAKVNPNSNQVYLNGWLKDTRAEREKFNSDSNQVYLDGWLKDTRAEKEKSNSNSN
QVYLDGWLKDTRGEKAKADPGSNQVYLDGWFKDTRSEQVKVNANSNQVYLDGWLKDTRSKKVNP
YSKQVYLDGWLKDTRAEKEKSNSDSNQVYLDGWLKDTRDEKANANPGSTQVYLDGWLKDIRSKQ
VKINPNSNQVYLDGWLKDTRAKNVNPDSKPVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDIR
AEKEKSTPDSNQVYLDGWLKDTRAEKEKANSDSNQVYLDGWLKDMRAEKAKVNPNSNQVYLDGW
LKDTR
VKIKNQTLTPTKFSLMDG
Repeat found in LOC131660374
Repeat occurs 19 times in a sequence of 493 amino acids
Location between 13532827 and 13534870
Coverage of 38.54 %
Instances:
YLDGWLKNTQ | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDTR | YLDGWFKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDTR | YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR
YLDGWLKDIR | YLDGWLKDTR | YLDGWLKDMR | YLDGWLKDTR |
pattern: YLDGW[LF]K[DN][TIM][RQ]
MAHRVVLSLLPFLMLSLINDHGSIARDLNQVDQPYLDGWLKNTQHQQHSPDSNQVYLDGWLKD
TR
VEKVKSNPHSNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRAEKAKLNPDSKQVYLD
GWLKDTR
SEKAKVNPNSNQVYLNGWLKDTRAEREKFNSDSNQVYLDGWLKDTRAEKEKSNSNSN
QVYLDGWLKDTRGEKAKADPGSNQVYLDGWFKDTRSEQVKVNANSNQVYLDGWLKDTRSKKVNP
YSKQVYLDGWLKDTRAEKEKSNSDSNQVYLDGWLKDTRDEKANANPGSTQVYLDGWLKDIRSKQ
VKINPNSNQVYLDGWLKDTRAKNVNPDSKPVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDTR
AEKVKSAPESNQVYLDGWLKDIRAEKEKSTPDSNQVYLDGWLKDTRAEKEKANSDSNQVYLDGW
LKDMR
AEKAKVNPNSNQVYLDGWLKDTRVKIKNQTLTPTKFSLMDG
Repeat found in LOC131654935
Repeat occurs 20 times in a sequence of 483 amino acids
Location between 13578340 and 13580345
Coverage of 33.13 %
Instances:
DGWLKNIQ | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR | DGWLKDTR
DGWLKDTR | DGWLKDIR | DGWLKDTR | DGWLKDTR | DGWLKDMR

pattern: DGWLK[DN][TIM][RQ]
MTHRVVMSLLPFLLLLLINGHGIFGRDLNQVDQPYLDGWLKNIQLKKQKPSPDSNQVYLDGWL
KDTR
AEKVKSNSDSNQVYLDGWLKDTRTEKAKVNLDSNQVYLDGWLKDTRVEKAKSNPEPNQVY
LDGWLKDTRTEKAKVNLDSNQVYLDGWLKDTRAEEAKANSDSKQVYLDGWLKDTRSEQVKVNPN
SNQVYLDGWLKDTRATKSNPDSKQVYLDGWLKDTRAEKTKSNSNSNQVYLDGWLKDTRAEKEKS
NSDSNQVYIDGWLKDTRGDKAKANPDSTQVYLDGWLKDTRSEQVKVNPNSNQVYLDGWLKDTRA
KKAKANPNSKQVYLDGWLKDTRDEKEKSTPNSNQVYLDGWLKDTRAEKAKSNSESNQVYIDGWL
KDTR
AEKVKSNPESNQVYLDGWLKDIRTEKEKSTPDSNQVYLDGWLKDTRDEKAKVNLDSNQVY
LDGWLKDTRAEKAKANSDSNQVYLDGWLKDMRAEKA
Repeat found in LOC131654936
Repeat occurs 14 times in a sequence of 581 amino acids
Location between 13619611 and 13621851
Coverage of 24.1 %
Instances:
YLDGWLKNTP | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDIR
YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTR | YLDGWLKDTQ
YLDGWLKVRQ | YLDGWLKDSH | YLDGWLKNSN | YLDIGSKIFK |
pattern: YLD[IG][GW][SL]K[DINV][TRSFI][RHKPNQ]
MAHKVILSLLPFLLLLLINDNGSFARDMNQVDQPYLDGWLKNTPLKNQIPSPDSNQVYLDGWL
KDTR
GEKAKVNPDSNQVYLDGWLKDTRDEKAKANPDSNQVYLDGWLKDTRDEKAKANPDSNQVY
LDGWLKDIR
AEKVKANPDSNHVFLDGWLKDTKAEKVKDNLDSNQVYLDGWLKDTRAKKEKVNPN
SNQVYLDGWLKDTRDENAKFNSDSTQVYLDGWLKDTRAENADSNQVYLDGWLKDTRAGNANSNQ
VYLDGWLKDTQALKEKSTLDSNQLYLDGWLKVRQVGKEISSPNSNQVYLDGWLKDSHAENSKQA
YLDGWLKNSN
AKNHIKIGQDLVESNRKQPSKVDHTEAFKVFFFGIEDLYVGSVMTLQFPIVEYA
NFLPKKVADDIPVSKSQIPSLLDLFSLTKDSPQGEDMIGIINQCEYEPQKGETKACPTSLESMV
EFVHSVIGADTKYDIHSTSYPTTSATPLQNYTVLEISEDIYAPKWVACHPRPYPYLLHYCHYLD
IGSKIFK
VLLKGEYGDTMDALGICHLDTSAMNPTHFYFDLLGMKPGEGPLCHFFPVKHLLWVPR
PPVATK
Repeat found in LOC131654938
Repeat occurs 18 times in a sequence of 435 amino acids
Location between 13654539 and 13656416
Coverage of 49.66 %
Instances:
YLDGWLKNTQLK | YLDGWLKDTRVE | YLDGWLKDTRGE | YLDGWLKDTRVE | YLDGWLKDTRSE
YLDGWLKDTRAE | YLDGWLKDTRAE | YLDGWLRDTRGE | YLDGWLKDTRAK | YLDGWLKDTRVE
YLDGWLKDTRAE | YLDGWLKDIRDE | YLDGWLKDTRAE | YLDGWLKDTRAE | YLDGWVKDMRAE
YLDGWLKDTRAG | YLDGWLKDIRAE | YLDGWLKDTRAE |
pattern: YLDGW[VL][KR][DN][TIM][RQ][GDSAVL][GKE]
MTHRVVLSLLPFLMLSLINDHGSFARDLNQADQLYLDGWLKNTQLKNQQHSPGSNQVYLDGWL
KDTRVE
KVKSNPDFNQVYLDGWLKDTRGEKAKANPDSTQVYLDGWLKDTRVEKAKVNHDSKQVY
LDGWLKDTRSE
KAKVNPNSNQVYLDGWLKDTRAEKPKFNSDSNQVYLDGWLKDTRAEKEKSNSD
SNQVYLDGWLRDTRGEKEKSNPNSNQVYLDGWLKDTRAKKANIDSKQVYLDGWLKDTRVEKAES
NSDSSQVYLDGWLKDTRAEKVKSVPESNQVYLDGWLKDIRDEKEKSTPNSNQVYLDGWLKDTRA
E
KVKANSELNQVYLDGWLKDTRAEKEKANSNSNHVYLDGWVKDMRAEKEKVNPNSNQVYLDGWL
KDTRAG
NTKSNTGSNQVYLDGWLKDIRAEKEKSAFDSKQVYLDGWLKDTRAE
Repeat found in LOC131654939
Repeat occurs 23 times in a sequence of 554 amino acids
Location between 13690775 and 13693039
Coverage of 45.67 %
Instances:
DGWLKNPQLRN | DGWLKDTRVEK | DGWLKDTRAEK | DGWLKDTRGEK | DGWLKDTRVEK
DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRAEK | DGWLKDTRANP | DGWLKDTRAEK
DGWLKDTRAEK | DGWLKDTRGEK | DGWLKDTRSDQ | DGWLKDSRAKK | DGWLKDTRAEK
DGWLKDTRAKK | DGWLKDTRAEK | DGWLKDIRSEK | DGWLKDTRAEK | DGWLKDTRAKK
DGWLKDMRAEK | DGWLKDTRDLK | DGWLKDTRVEK |
pattern: DGWLK[DN][TSMPI][RQ][GDSAVL][DRKLNE][QNKP]
MAYRVVLSLLPFLMLSLINDHGSFARDLNQVDQPYLDGWLKNPQLRNQQPSPDSNQVYLDGWL
KDTRVEK
AKSNPNSNQVYLDGWLKDTRAEKAKVNLDSNQVYLDGWLKDTRGEKEKANTDSTQVY
LDGWLKDTRVEKAKINPDSTQVYLDGWLKDTRAEKSKVIPDSKQVYIDGWLKDTRAEKEIANPN
SNQIYLDGWLKDTRAEKVNAHPKSNQVYLDGWLKDTRANPDSKQVYLDGWLKDTRAEKAKSNSD
SNQVYLDGWLKDTRAEKEKSNSDSNIVYLDGWLKDTRGEKAKTNPDSTQVYLDGWLKDTRSDQV
KVNPNSNQVYLDGWLKDSRAKKANPDSKQVYLDGWLKDTRAEKENSTPDSNQVYLDGWLKDTRA
KK
AKSNSDSNQVYLDGWLKDTRAEKVKSTQESNQVYLDGWLKDIRSEKEKSTPDSNQVYLDGWL
KDTRAEK
AKANSESNQVYLDGWLKDTRAKKAESNFDSNQVYLDGWLKDMRAEKAKVNPNSNQVY
LDGWLKDTRDLKEKSTLDSKQVYLDGWLKDTRVEKENLSPNSN
Repeat found in LOC131654943
Repeat occurs 14 times in a sequence of 420 amino acids
Location between 13814614 and 13817152
Coverage of 40.0 %
Instances:
FEPRPNLSSYGE | FEPKPNVSSYGE | FEPRPNLSSYGE | FEPRPNLSSYGE | FEPRPNLSLCGE
FEPRPNLSSYGE | FEPRPNLSLCGE | FEPRPNLSSYGE | FEPRPNVSSYGE | FEPRPNLSSYGE
FEPRPNLSSYAE | FEPKPVVSAYGD | FEPRPNVSAYEN | FEPRPSLTKDDA |
pattern: FEP[KR]P[VSN][VL][TS][SKLA][DYC][DGEA][DEAN]
The following known motifs were found:
VS[AI]Y was found 2 times in this sequence
FEPR was found 12 times in this sequence
MMRPALALLPLLLLLIVGIVESRIDLGEYWKVVVKKQNMVEEIQELLDANTKHNFKTLKQSFD
GKEKKKALKDFEPRPNLSSYGEPNLSSYGENDIDDKENKGSIEDFEPKPNVSSYGENDIDAKEK
KGAIENFEPRPNLSSYGEPNLSSYGENDIDDKENKGPIENFEPRPNLSSYGEPNLSSYGENKID
AKEKKGAIEDFEPRPNLSLCGEPNLSLCGENNIDDKENKGPIEDFEPRPNLSSYGEPNLSSYGE
NDIDDKEKKGVIEDFEPRPNLSLCGEPNLSLCGENNIDDKENKGPIEDFEPRPNLSSYGEPNLS
SYGENDIDDKEKKGVIEDFEPRPNVSSYGEPNVSSYGENDIDDKENKGPIEDFEPRPNLSSYGE
PNLSSYGENDINAKEKKRDAIEDFEPRPNLSSYAEPNLSSYAENEIHAKKRKGAIEDFEPKPVV
SAY
GDGDNNIDAEEKNKAFKDFEPRPNVSAYENPNVSAYENNDIDVEYTKDFEPRPSLTKDDA
Repeat found in LOC131654944
Repeat occurs 6 times in a sequence of 210 amino acids
Location between 13829284 and 13830780
Coverage of 37.14 %
Instances:
DFEPRPNISAYED | DFEPRPNVSAYGD | DFEPRPNISAYGD | DFEPRPNISSYGN | DFEPRPNISAYGN
DFEPRPSLTKYDA |
pattern: DFEPRP[SN][IVL][TS][SKA]Y[DGE][DAN]
The following known motifs were found:
FEPR was found 6 times in this sequence
MRPALALLPLFLLLIVAVVESRKDVGEYWKLVMKEQNMPEEIQGLLDANNKNNLKTLKQSFDD
KEKKNTVKDFEPRPNISAYEDPNISAYEDEDVDVEEKKIGVEDFEPRPNVSAYGDPNVSAYGDK
SIDAEEKKKGEEDFEPRPNISAYGDPNISAYGDNGIEDSKKNNVDQDFEPRPNISSYGNPNISS
YGNNDIGNKEKEKAVEDFEPRPNISAYGNPNISAYGNNDIHAEFKEDFEPRPSLTKYDA
Repeat found in LOC131654957
Repeat occurs 22 times in a sequence of 531 amino acids
Location between 14321941 and 14324104
Coverage of 24.86 %
Instances:
WLKNPE | WLKDTR | WLKDTR | WLKDTR | WLKDSR
WLKDTR | WLKDTR | WLKDTR | WLKDTR | WLKDTQ
WLKDTR | WLKDTR | WLKDSR | WLKDTR | WLKDTR
WLKDTR | WLKDIL | WLKDTR | WLKDTR | WLKDMR
WLKDTR | WLKDTL |
pattern: WLK[DN][TSMPI][REQL]
MAYRVVLSLLPFLMLSLINDHGSFARHLNQVDQPYLDGWLKNPELRNQQPSPDSNQVYLDGWL
KDTR
VEKAKSNPNSNQVYLDGWLKDTRAEKAKVNLDSNQVYLDGWLKDTRAEKAKINLDSTQVY
LDGWLKDSRAEKSKVIPDSKQVYIDGWLKDTRAEKEIANPNSNQVYLDGWLKDTRAEKVNANPK
SNQVYLDGWLKDTRANPDSKQVYLDGWLKDTRDEKAKSNSDSNQVYLDGWLKDTQTEKEKSNSD
SNIVYLDGWLKDTRGEKAKTNPDSTQVYLDGWLKDTRSDQVKVNPNSNQVYLDGWLKDSRAKKA
NPDTKQGYLDGWLKDTRAKKENSIPDSNQVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDTRD
EKVKSTHESNQVYLDGWLKDILSEKEKSTPDSNQVYLDGWLKDTRVEKAKANSESNQVYLDGWL
KDTR
AKKVESNFDSNQVYLDGWLKDMRAEKAKVNPNSNQVYLDGWLKDTRDLKEKSTLDSKQVY
LDRWLKDTLAVKENSSPNSN

Similar gene clusters

NC_081182 - Cluster 13 - Cyclopeptide

Gene cluster description

NC_081182 - Gene Cluster 13. Type = cyclopeptide. Location: 13998211 - 14967403 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131654957
Repeat occurs 22 times in a sequence of 531 amino acids
Location between 14321941 and 14324104
Coverage of 24.86 %
Instances:
WLKNPE | WLKDTR | WLKDTR | WLKDTR | WLKDSR
WLKDTR | WLKDTR | WLKDTR | WLKDTR | WLKDTQ
WLKDTR | WLKDTR | WLKDSR | WLKDTR | WLKDTR
WLKDTR | WLKDIL | WLKDTR | WLKDTR | WLKDMR
WLKDTR | WLKDTL |
pattern: WLK[DN][TSMPI][REQL]
MAYRVVLSLLPFLMLSLINDHGSFARHLNQVDQPYLDGWLKNPELRNQQPSPDSNQVYLDGWL
KDTR
VEKAKSNPNSNQVYLDGWLKDTRAEKAKVNLDSNQVYLDGWLKDTRAEKAKINLDSTQVY
LDGWLKDSRAEKSKVIPDSKQVYIDGWLKDTRAEKEIANPNSNQVYLDGWLKDTRAEKVNANPK
SNQVYLDGWLKDTRANPDSKQVYLDGWLKDTRDEKAKSNSDSNQVYLDGWLKDTQTEKEKSNSD
SNIVYLDGWLKDTRGEKAKTNPDSTQVYLDGWLKDTRSDQVKVNPNSNQVYLDGWLKDSRAKKA
NPDTKQGYLDGWLKDTRAKKENSIPDSNQVYLDGWLKDTRAEKAKSNSDSNQVYLDGWLKDTRD
EKVKSTHESNQVYLDGWLKDILSEKEKSTPDSNQVYLDGWLKDTRVEKAKANSESNQVYLDGWL
KDTR
AKKVESNFDSNQVYLDGWLKDMRAEKAKVNPNSNQVYLDGWLKDTRDLKEKSTLDSKQVY
LDRWLKDTLAVKENSSPNSN
Repeat found in LOC131654960
Repeat occurs 12 times in a sequence of 368 amino acids
Location between 14474335 and 14477134
Coverage of 39.13 %
Instances:
FEPRPNLSSYGE | FEPKPNVSSYGE | FEPRPNLSSYGE | FEPRPNLSSYGE | FEPRPNLSSYGE
FEPRPNLSSYGE | FEPRPNVSSYGE | FEPRPNLSSYGE | FEPRPNLSSYAE | FEPKPVVSAYGD
FEPRPNVSAYEN | FEPRPSLTKDDA |
pattern: FEP[KR]P[VSN][VL][TS][SKA][DY][DGEA][DEAN]
The following known motifs were found:
VS[AI]Y was found 2 times in this sequence
FEPR was found 10 times in this sequence
MMRPALALLPLLLLLIVGIVESRIDLGEYRKVVVKKQNMVEEIQELLDANTKHNFKTLKQSFD
GKEKKKALKDFEPRPNLSSYGEPNLSSYGENDIDDKENKGSIEDFEPKPNVSSYGENDIDAKEK
KGAIENFEPRPNLSSYGEPNLSSYGENDIDDKENKGPIEDFEPRPNLSSYGEPNLSSYGENKID
AKEKKGAIEDFEPRPNLSSYGEPNLSSYGENNIDDKENKGPIEDFEPRPNLSSYGEPNLSSYGE
NDIDDKEKKGVIEDFEPRPNVSSYGEPNVSSYGENDIDDKENKGPIEDFEPRPNLSSYGEPNLS
SYGENDIDAKEKKRDAIEDFEPRPNLSSYAEPNLSSYAENEIHAKKRKGAIEDFEPKPVVSAYG
D
GDNNIDAEEKNKAFKDFEPRPNVSAYENPNVSAYENNDIDVEYTKDFEPRPSLTKDDA
Repeat found in LOC131657597
Repeat occurs 5 times in a sequence of 531 amino acids
Location between 14487813 and 14492642
Coverage of 14.12 %
Instances:
PSYPFQPSGGPSYPF | PSYPFQQSGGPSYPF | PSYPFQQSGGPSYSY | PSYSYQQSAGPSYSY | PSYSYQQSARPSFPP

pattern: PSY[SP][YF]Q[QP]S[GA][GR]PS[YF][SP][PYF]
MSLASTQPLMPLLSSPSFISPPSYPFQPSGGPSYPFQQSGGPSYPFQQSGGPSYSYQQSAGPS
YSYQQSARPSFPP
QVPPGFQQTGGSHTSHLTQVPLSFQQTGGSHTSPTTHIPPGGDDHVEADQD
DIEQAEDDVVLDGDNLRGDDNPDEIQIVGGRYYITPVGNNFTPSRTSAKCVSYVIQQMYKDAWT
RFGEVPNRDDWFNKFKEKCTWDILSEKVVKKVFFIRASKRLSDTLRRVRKRWERDGSIPPWMGK
ETLDKLLVYWDTPEFKAKSESAKKMRASEKGGHVNAGGSISTYEHMRRMEKRLGRKPLMVELVK
ETRTKKSGEYVDERTRQALEDYQTRLVQFLVTNPQFTPSEGQPLHPDVGFYIWNEVIGGKGPNG
CFFGGGSMAGCLRSGDRNLFERVRDGEGTSRPAQLSPQMMETIRQLAISEARRESEQREMALKA
QLEEQQRQIEAMTKRQAEMEGQQSLIERMAKRQAKMEEQMRLFMQFQGSDNQLFGGNVGVGGAP
ANQQDDEDLNLDDFPDPGDT
Repeat found in LOC131654965
Repeat occurs 4 times in a sequence of 120 amino acids
Location between 14552268 and 14553284
Coverage of 43.33 %
Instances:
DFEPRPNVSAFGN | DFEPRLSITKYGN | DFEPRPSITKYGN | DFEPRPSLTKYGA |
pattern: DFEPR[LP][SN][IVL][TS][KA][YF]G[NA]
The following known motifs were found:
FEPR was found 4 times in this sequence
MMRPALALLHLLLLLIVAVVESRKDLRENNIDVEEMKKDLKDFEPRPNVSAFGNPNVSAFGNN
VIGAEFTKDFEPRLSITKYGNLSITKYGNNEIGAEFTEDFEPRPSITKYGNPSITKYGNNEIGA
EFTEDFEPRPSLTKYGA

Similar gene clusters

NC_081182 - Cluster 14 - Fatty_acid

Gene cluster description

NC_081182 - Gene Cluster 14. Type = fatty_acid. Location: 48740375 - 49028434 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081182 - Cluster 15 - Fatty_acid-saccharide-transporter_associated

Gene cluster description

NC_081182 - Gene Cluster 15. Type = fatty_acid-saccharide-transporter_associated. Location: 65072893 - 65704315 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081182 - Cluster 16 - Cyclopeptide

Gene cluster description

NC_081182 - Gene Cluster 16. Type = cyclopeptide. Location: 66582968 - 67432532 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131656015
Repeat occurs 3 times in a sequence of 152 amino acids
Location between 66851385 and 66851844
Coverage of 15.79 %
Instances:
TTTTTRRS | TTTTRRSS | TTTRRSSC |
pattern: TTT[TR][TR][SR][SR][SC]
MELDSSSIGEETTTTTRRSSCFPCCFTYHCSTTDDLPWWERVRNTSLSRGFMKIREWSEIVAG
PRWKTFIRRFNRNKTGAGGYRQAGKYQYDPLSYALNFDEGHNGDLEDECPDEFPKFSDRYATAP
LYKPGSTDSEQNVVVSSRNEEQLQV
Repeat found in LOC131661755
Repeat occurs 4 times in a sequence of 635 amino acids
Location between 67006750 and 67008749
Coverage of 4.41 %
Instances:
DSFKNYS | DSFKKYS | DSFTSYS | DSFQSYG |
pattern: DSF[TKQ][SNK]Y[SG]
MKIHFIFLLFSLSFFHGSVVTIGNEGITETQEYSFKQKTNPFSPKASLIRYWNTKISNKLPNP
IPNFFLSKASPLTPQHYANLVNLLKQEPLSANFHNSLCSTPYLFCSFDHPSQYYQSKKSTTKND
ANFALYSNKKFATYGSSRLGGVDSFKNYSNGLNTNNDSFKKYSTTSTRHSGQFNSYAENGNVAN
TNFTSYSSGSSSGTGEFKSYDKLVNDPNLGFTTYDSSASNHKLSFASYGNETNSGSESFNSYGK
RVRSGNSDFTNYAVSSNILQSSFTGYGELGTGSANDSFTSYSFNGNNPRSTFKTYGAGSISGSD
TFVSYRNRANVGDDSFQSYGSKSKSGAASFTNYGQSFNEGNDTFTEYGKGATGKTAIGFKIYGL
GRAFKGYNKNGVSFSSYNNFTTSSGKTVNKFVERGKFFRESMLKEGNVMVMPDIRDKMPERSFL
PLSISSKLPFSSLMLEDIKKTFHAREGSATARVIKNALGECERAPSVGETKRCVGSAEAMVDFA
VSVLGPNVVVKTTENVNGSKNSVMIGKVYGINGGKVTKSVSCHQTLYPYLLYYCHSVPKVRVYE
AEILDLETKLKINHGVAICHIDTSSWGPEHGAFIALGSEPGKIEVCHWIFENDMTWTIAS

Similar gene clusters

NC_081182 - Cluster 17 - Saccharide

Gene cluster description

NC_081182 - Gene Cluster 17. Type = saccharide. Location: 211940764 - 212187324 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081183 - Cluster 18 - Saccharide

Gene cluster description

NC_081183 - Gene Cluster 18. Type = saccharide. Location: 18236925 - 18580598 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081183 - Cluster 19 - Cyclopeptide

Gene cluster description

NC_081183 - Gene Cluster 19. Type = cyclopeptide. Location: 120211341 - 121459437 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131599566
Repeat occurs 3 times in a sequence of 430 amino acids
Location between 120638901 and 120640299
Coverage of 5.58 %
Instances:
STSDSTSD | STSDSTSG | STSGGGGD |
pattern: STS[DG][SG][TG][SG][DG]
MSTSDSTSDSTSGGGGDVGIIVVIALVVVAVKVAIFVCVCRRITEARGGSKSTTSITPDSQFI
TLTMDKFLNDMEREKPIRFTGQQLRIATDNYSNFLGSGGFGTVYKGIFNNGSMVAVKVLRGSSN
KKIDEQFMAEVGTVGRIHHFNLVRLYGFCFERNLIALVYEYMGNGSLDRYLFHEKKVLEFEKLH
EIAIGTARGIAYLHEECQHRIIHYDIKPGNILLDKNFYPKVADFGLAKLCNRENTHITMTGGRG
TPGYAAPELWMPFPITHKCDVYSFGMLLFEIVGRRRNLEIENSDSQEWFPIWVWKKFDDGLLEE
AMIVCGIEEKNREIAERMIKVALMCVQYRQELRPIMSVVVKMLEGSLEIPKTLNPFQHLIDGAN
FATQSVQVSNTYTTTVTSSSVKVSDCSIVCATPIMKKYEIELASSIV
Repeat found in LOC131599572
Repeat occurs 3 times in a sequence of 304 amino acids
Location between 120834520 and 120836257
Coverage of 7.89 %
Instances:
ENQYWTIF | ENQPFGFW | ENQPFGFW |
pattern: ENQ[YP][WF][TG][IF][WF]
MEFKNLSVLALFFLAFVGINGSKSGEEYWKSVWPNTTIPKALSDMLQSDIETSMPIKSHEENQ
YWTIF
FEHDLYPGKTMNLGVQKHYESSEYDKKETKKENQPFGFWAWARKETEKENQPFGFWAWS
KKETRKEIQPFVALTSDENEAHILNNYCRLPAATGEDKHCALSLESMIDFVISKLGKNIKVMSS
SFVQNQDKYVVEEVNKIGDKAVMCHRLNLKKVVLYCHTVNATTTYIVPLVASDGTKSKALTSCH
HDTRGMDPSMLKEVLNVEPGTVPVCHFIDNKAIAWVPDMSESGGHSCVI

Similar gene clusters

NC_081183 - Cluster 20 - Fatty_acid-alkaloid-terpene

Gene cluster description

NC_081183 - Gene Cluster 20. Type = fatty_acid-alkaloid-terpene. Location: 140501508 - 140942755 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081183 - Cluster 21 - Cyclopeptide

Gene cluster description

NC_081183 - Gene Cluster 21. Type = cyclopeptide. Location: 178275720 - 179082296 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131596166
Repeat occurs 33 times in a sequence of 776 amino acids
Location between 178698592 and 178701415
Coverage of 59.54 %
Instances:
NAFSYKHDASEPDI | NAFSYKHAASEPDV | NAFSYKHGASEPDV | NAFSYKHGASKSGV | NAFSYKHGASKSGV
NAFSYKHGASEPVV | NAFSYKHGASETGV | NAFSYKHGSSEPVV | NAFSYKHVAGNSDV | NAFSYKHAAGESYV
NAFSYKHAAQEPGV | NAFSYKHAASKPDV | NAFSYKHAKSEPDV | NAFSYKHAASEPDV | NAFSYKHAASEHEV
NAFSYKRAASERDV | NAFYYKHDASHLDL | NAFSYKHGASEPNV | NAFSYKHGASVPAV | NAFSYKHAAGNSGV
NAFSYKHAAGESYV | NAFSYKHAAGEPSV | NAFFCGHVAGEPGV | NAFSYKHAASKPDV | NAFSYKHAASESDV
NAFSYKHAKSEPGV | NAFSYKHAASVPDV | NAFSYKHAASERDV | NAFYYKHGASQLDV | NAFYYKHGASESDV
NAFSYKHATRATDV | NAFSYKHATRAADV | NAFSYKHAASEPDV |
pattern: NAF[SYF][YC][GK][RH][VDGA][TSKA][SGRQ][HKAVEQN][TRHSPAL][DGSYAVEN][IVL]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGV
NSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
V
NSGELPNAFSYKHVAGNSDVNSKELPNAFSYKHAAGESYVNFGELSNAFSYKHAAQEPGVNYE
ELPNAFSYKHAASKPDVNFGESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPN
AFSYKHAASEHEV
TSGELPNAFSYKRAASERDVNSGELPNAFYYKHDASHLDLNSGELPNAFSY
KHGASEPNV
NSAELPNAFSYKHGASVPAVNSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAA
GESYV
NFRELSNAFSYKHAAGEPSVNYEELPNAFFCGHVAGEPGVNYEELPNAFSYKHAASKPD
V
NFGELPNAFSYKHAASESDVNFRESPNAFSYKHAKSEPGVNFGDSPNAFSYKHAASVPDVNSE
ELPNVFSYKYAASEREVNSGELPNAFSYKHAASERDVNSGELPNAFYYKHGASQLDVNSGELPN
AFYYKHGASESDV
NYGELPNAFSYKHATRATDVNHGELPNAFSYKHATRAADVNYGELPNAFSY
KHAASEPDV

Repeat found in LOC131596166
Repeat occurs 31 times in a sequence of 736 amino acids
Location between 178698592 and 178701415
Coverage of 58.97 %
Instances:
NAFSYKHDASEPDI | NAFSYKHAASEPDV | NAFSYKHGASEPDV | NAFSYKHGASKSGV | NAFSYKHGASKSGV
NAFSYKHGASEPVV | NAFSYKHGASETGV | NAFSYKHGSSEPVV | NAFSYKHVAGNSDV | NAFSYKHAAGESYV
NAFSYKHAAQEPGV | NAFSYKHAASKPDV | NAFSYKHAKSEPDV | NAFSYKHAASEPDV | NAFSYKHAASEHEV
NAFSYKRAASERDV | NAFYYKHDASHLDL | NAFSYKHGASEPNV | NAFSYKHGASVPAV | NAFSYKHAAGNSGV
NAFSYKHAAGEPGV | NAFSYKHAASKPDV | NAFSYKHAASESDV | NAFSYKHAKSEPGV | NAFSYKHAASVPDV
NAFSYKHAASERDV | NAFYYKHGASQLDV | NAFYYKHGASESDV | NAFSYKHATRATDV | NAFSYKHATRAADV
NAFSYKHAASEPDV |
pattern: NAF[SY]YK[RH][VDGA][TSKA][SGRQ][HKAVEQN][TRHSPAL][DGYAVEN][IVL]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGV
NSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
V
NSGELPNAFSYKHVAGNSDVNSKELPNAFSYKHAAGESYVNFGELSNAFSYKHAAQEPGVNYE
ELPNAFSYKHAASKPDVNFGESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPN
AFSYKHAASEHEV
TSGELPNAFSYKRAASERDVNSGELPNAFYYKHDASHLDLNSGELPNAFSY
KHGASEPNV
NSAELPNAFSYKHGASVPAVNSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAA
GEPGV
NYEELPNAFSYKHAASKPDVNFGELPNAFSYKHAASESDVNFRESPNAFSYKHAKSEPG
V
NFGDSPNAFSYKHAASVPDVNSEELPNVFSYKYAASEREVNSGELPNAFSYKHAASERDVNSG
ELPNAFYYKHGASQLDVNSGELPNAFYYKHGASESDVNYGELPNAFSYKHATRATDVNHGELPN
AFSYKHATRAADV
NYGELPNAFSYKHAASEPDV
Repeat found in LOC131596166
Repeat occurs 32 times in a sequence of 756 amino acids
Location between 178698592 and 178701415
Coverage of 59.26 %
Instances:
NAFSYKHDASEPDI | NAFSYKHAASEPDV | NAFSYKHGASEPDV | NAFSYKHGASKSGV | NAFSYKHGASKSGV
NAFSYKHGASEPVV | NAFSYKHGASETGV | NAFSYKHGSSEPVV | NAFSYKHVAGESYV | NAFSYKHAAQEPGV
NAFSYKHAASKPDV | NAFSYKHAKSEPDV | NAFSYKHAASEPDV | NAFSYKHAASEHEV | NAFSYKRAASERDV
NAFYYKHDASHLDL | NAFSYKHGASEPNV | NAFSYKHGASVPAV | NAFSYKHAAGNSGV | NAFSYKHAAGESYV
NAFSYKHAAGEPSV | NAFFCGHVAGEPGV | NAFSYKHAASKPDV | NAFSYKHAASESDV | NAFSYKHAKSEPGV
NAFSYKHAASVPDV | NAFSYKHAASERDV | NAFYYKHGASQLDV | NAFYYKHGASESDV | NAFSYKHATRATDV
NAFSYKHATRAADV | NAFSYKHAASEPDV |
pattern: NAF[SYF][YC][GK][RH][VDGA][TSKA][SGRQ][HKAVEQN][TRHSPAL][DGSYAVEN][IVL]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGV
NSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
V
NSGELPNAFSYKHVAGESYVNFGELSNAFSYKHAAQEPGVNYEELPNAFSYKHAASKPDVNFG
ESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPNAFSYKHAASEHEVTSGELPN
AFSYKRAASERDV
NSGELPNAFYYKHDASHLDLNSGELPNAFSYKHGASEPNVNSAELPNAFSY
KHGASVPAV
NSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAAGESYVNFRELSNAFSYKHAA
GEPSV
NYEELPNAFFCGHVAGEPGVNYEELPNAFSYKHAASKPDVNFGELPNAFSYKHAASESD
V
NFRESPNAFSYKHAKSEPGVNFGDSPNAFSYKHAASVPDVNSEELPNVFSYKYAASEREVNSG
ELPNAFSYKHAASERDVNSGELPNAFYYKHGASQLDVNSGELPNAFYYKHGASESDVNYGELPN
AFSYKHATRATDV
NHGELPNAFSYKHATRAADVNYGELPNAFSYKHAASEPDV
Repeat found in LOC131596171
Repeat occurs 11 times in a sequence of 383 amino acids
Location between 178795056 and 178796447
Coverage of 31.59 %
Instances:
NVFQYPHAASE | NVFQYPNEASE | NVFQYPHAASE | NVFQYPHAASE | NVFRYNHAANV
NVFQYPHAANK | NVFRYDHAANE | NVFQYPHATSK | NVFRYDHAANE | NVFQYPHATSK
NVFRYDHAANE |
pattern: NVF[RQ]Y[DNP][NH][EA][TA][SN][VKE]
MEFHFFHFITFLMLVTVATNAARLPPQLYWKSVLPNTPMPKAISNLLPPSIDVGVKKSYSHGG
KANENSEGLANVFQYPHAASEVEVNSGGLPNVFQYPNEASEVDVKSGELPNVFQYPHAASEIDV
KSGEFPNVFQYPHAASEADVSSKELPNVFRYNHAANVPGINFGELPNGFGYHHSARETAVNSGE
LSNGFGYHHDASEPSVNSREFSNVFQYPHAANKADVSSKKLPNVFRYDHAANEPGINSGELSNG
FGYHHDASEPGVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINSGELSNGFGYH
HDASEPSVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINFGELSNGFGYHHAAK
Repeat found in LOC131596171
Repeat occurs 11 times in a sequence of 383 amino acids
Location between 178795056 and 178796447
Coverage of 31.59 %
Instances:
NVFQYPHAASE | NVFQYPNEASE | NVFQYPHAASE | NVFQYPHAASE | NVFRYNHAANV
NVFQYPHAANK | NVFRYDHAANE | NVFQYPHATSK | NVFRYDHAANE | NVFQYPHATSK
NVFRYDHAANE |
pattern: NVF[RQ]Y[DNP][NH][EA][TA][SN][VKE]
MEFHFFHFITFLMLVTVATNAARLPPQLYWKSVLPNTPMPKAISNLLPPSIDVGVKKSYSHGG
KANENSEGLANVFQYPHAASEVEVNSGGLPNVFQYPNEASEVDVKSGELPNVFQYPHAASEIDV
KSGEFPNVFQYPHAASEADVSSKELPNVFRYNHAANVPGINFGELPNGFGYHHSARETAVNSGE
LSNGFGYHHDASEPSVNSREFSNVFQYPHAANKADVSSKKLPNVFRYDHAANEPGINSGELSNG
FGYHHDASEPGVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINSGELSNGFGYH
HDASEPSVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINFGELSNGFGYHHAAK
Repeat found in LOC131596177
Repeat occurs 10 times in a sequence of 589 amino acids
Location between 178941060 and 178943303
Coverage of 11.88 %
Instances:
YDHTGVD | YDHAVSG | YDHAVSG | YDHAASE | YDHAARK
YDHAASE | YDHVASE | YDHAASE | YDHAASE | YDHSASE

pattern: YDH[TSVA][VGA][SVR][DGEK]
MEFHFFRIITFLMLLLVATNAARLPPQHYWKSVIPNSPMPKAITNLLLPSETDVSSKELSNWF
RYDHTGVDVNSKKLPNWFRYDRAASGSDVNSEELPNWFRYDHAVSGADVSSEELPNWFRYDHAV
SG
ADVSSEELPNWFRYNHAASEPDVNSEELSNVFKYDHAASEPDVSFKKFPNVFKYDHAARKPD
VSFKKLPNVFKYDHAASEPDVSFRKLPNVFKYDHVASEHDVSSRYFPNVFKYDHAASESDVSSK
YFSNVFKYDHAASEPEVNFRESPNGFSYHHAASEPGLNSRESLNGFFYRNAASEPNVNFRKSPN
GFSYHQAASVPDVNSGETSILSYNPPASEFNVNSGKLSNAFSYDHSASEKPKSTIFFFEKDLHH
GTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARIVNNTITDCEVE
SIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTEMYKESDFQQYKVITQGVKRLGEKNKAVVCH
KQNYPYAVFFCHTTDTTKVYSAPLKGVDGSIVKAIVVCHTDTSQWNPKHFAFLELKVKVGSGPI
CHLLPQDHVAWISK
Repeat found in LOC131596178
Repeat occurs 10 times in a sequence of 589 amino acids
Location between 178972989 and 178975235
Coverage of 11.88 %
Instances:
YDHTGVD | YDHAVSG | YDHAVSG | YDHAASE | YDHAARK
YDHAASE | YDHVASE | YDHAASE | YDHAASE | YDHSASE

pattern: YDH[TSVA][VGA][SVR][DGEK]
MEFHFFRIITFLMLLLVATNAARLPPQRYWKSVIPNSPMPKAITNLLLPSETDVSSKELSNWF
RYDHTGVDVNSKKLPNWFRYDRAASGSDVNSEELPNWFRYDHAVSGADVSSEELPNWFRYDHAV
SG
ADVSSEELPNWFRYNHAASEPDVNSEELSNVFKYDHAASEPDVSFKKFPNVFKYDHAARKPD
VSFKKLPNVFKYDHAASEPDVSFRKLPNVFKYDHVASEHDVSSRYFPNVFKYDHAASESDVSSK
YFSNVFKYDHAASEPEVNFRESPNGFSYHHAASEPGLNSRESLNGFFYRNAASEPNVNFRKSPN
GFSYHQAASVPDVNSGETSILSYNPPASEFNVNSGKLSNAFSYDHSASEKPKSTIFFFEKDLHH
GTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARIVNNTITDCEVE
SIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTEMYKESDFQQYKVITQGVKRLGEKNKAVVCH
KQNYPYAVFFCHTTDTTKVYSAPLKGVDGSIVKAIVVCHTDTSQWNPKHFAFLELKVKVGSGPI
CHLLPQDHVAWISK
Repeat found in LOC131596180
Repeat occurs 21 times in a sequence of 728 amino acids
Location between 179037704 and 179040213
Coverage of 31.73 %
Instances:
NGFGYHDAVSE | NGFGYHHAASE | NGFGYHHAASE | NGFGYHLAASE | NGFGYHHAASE
NGFGYHHAASD | NGFRYHLAASE | NGFVYHHAASE | NGFGYHHAGSE | NGFGYHHVASE
NGFGYHHGASE | NGFGYHHAASE | NGFGYNHAISG | NGFGYHHGASE | NGFGYHHATSE
NGFGYHHAASE | NGFGYNHAISG | NGFRYHHAASK | NGFRYHHAASK | NGFRYHHAASK
NGFRYHHIASE |
pattern: NGF[VGR]Y[NH][DLH][IVGA][TGAIV]S[DGEK]
MKFHFFHIITFLMLLLVATNAETLPPQLYWKSILPNSPMPKVITNLLLSDSPMAKTITNLLLP
GLDDGVKKSYSDGGEVEGKTDVIPKKLSNGFGYHDAVSEPNVNFKELSNGFGYHHAASEADGNS
EELPNGFGYHHAASEADVNSKKLPNGFGYHLAASEPNVNSKELSNGFGYHHAASEADGNSRELP
NGFGYHHAASD
ADVNSKILPNGFRYHLAASEADENSKELSNGFVYHHAASEADANSRELPNGFG
YHHAGSE
SDINSREISNGFGYHHVASETDVNSRELPNGFGYHHGASEAYVNSRKLSNGFGYHHA
ASE
SGVTSGELPNGFGYNHAISGVDGNSKELPNGFGYHHGASEPDVNFRELPNGFGYHHATSEA
GVKSEELPNGFGYHHAASEPDVNSGELPNGFGYNHAISGVDGNSKELPNGFRYHHAASKPDVNS
GDLPNGFRYHHAASKSDVNSGDLSNGFRYHHAASKPNVNSGDSHNGFRYHHIASEAQLHDKPKA
TIFFFEKELHHGTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARI
VNNTITDCEVESIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTETYKESDFQQYKVITQGVKR
LGEKNKAVVCHKQNYPYAVFFCHTTDTTKVYSVPLEAVDGSRVKAIVVCHTDTSQWNPKHFAFL
ELKVKVGTAPICHLLPQDHVAWISK

Similar gene clusters

NC_081183 - Cluster 22 - Cyclopeptide

Gene cluster description

NC_081183 - Gene Cluster 22. Type = cyclopeptide. Location: 178324011 - 179778710 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131596166
Repeat occurs 33 times in a sequence of 776 amino acids
Location between 178698592 and 178701415
Coverage of 59.54 %
Instances:
NAFSYKHDASEPDI | NAFSYKHAASEPDV | NAFSYKHGASEPDV | NAFSYKHGASKSGV | NAFSYKHGASKSGV
NAFSYKHGASEPVV | NAFSYKHGASETGV | NAFSYKHGSSEPVV | NAFSYKHVAGNSDV | NAFSYKHAAGESYV
NAFSYKHAAQEPGV | NAFSYKHAASKPDV | NAFSYKHAKSEPDV | NAFSYKHAASEPDV | NAFSYKHAASEHEV
NAFSYKRAASERDV | NAFYYKHDASHLDL | NAFSYKHGASEPNV | NAFSYKHGASVPAV | NAFSYKHAAGNSGV
NAFSYKHAAGESYV | NAFSYKHAAGEPSV | NAFFCGHVAGEPGV | NAFSYKHAASKPDV | NAFSYKHAASESDV
NAFSYKHAKSEPGV | NAFSYKHAASVPDV | NAFSYKHAASERDV | NAFYYKHGASQLDV | NAFYYKHGASESDV
NAFSYKHATRATDV | NAFSYKHATRAADV | NAFSYKHAASEPDV |
pattern: NAF[SYF][YC][GK][RH][VDGA][TSKA][SGRQ][HKAVEQN][TRHSPAL][DGSYAVEN][IVL]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGV
NSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
V
NSGELPNAFSYKHVAGNSDVNSKELPNAFSYKHAAGESYVNFGELSNAFSYKHAAQEPGVNYE
ELPNAFSYKHAASKPDVNFGESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPN
AFSYKHAASEHEV
TSGELPNAFSYKRAASERDVNSGELPNAFYYKHDASHLDLNSGELPNAFSY
KHGASEPNV
NSAELPNAFSYKHGASVPAVNSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAA
GESYV
NFRELSNAFSYKHAAGEPSVNYEELPNAFFCGHVAGEPGVNYEELPNAFSYKHAASKPD
V
NFGELPNAFSYKHAASESDVNFRESPNAFSYKHAKSEPGVNFGDSPNAFSYKHAASVPDVNSE
ELPNVFSYKYAASEREVNSGELPNAFSYKHAASERDVNSGELPNAFYYKHGASQLDVNSGELPN
AFYYKHGASESDV
NYGELPNAFSYKHATRATDVNHGELPNAFSYKHATRAADVNYGELPNAFSY
KHAASEPDV

Repeat found in LOC131596166
Repeat occurs 31 times in a sequence of 736 amino acids
Location between 178698592 and 178701415
Coverage of 58.97 %
Instances:
NAFSYKHDASEPDI | NAFSYKHAASEPDV | NAFSYKHGASEPDV | NAFSYKHGASKSGV | NAFSYKHGASKSGV
NAFSYKHGASEPVV | NAFSYKHGASETGV | NAFSYKHGSSEPVV | NAFSYKHVAGNSDV | NAFSYKHAAGESYV
NAFSYKHAAQEPGV | NAFSYKHAASKPDV | NAFSYKHAKSEPDV | NAFSYKHAASEPDV | NAFSYKHAASEHEV
NAFSYKRAASERDV | NAFYYKHDASHLDL | NAFSYKHGASEPNV | NAFSYKHGASVPAV | NAFSYKHAAGNSGV
NAFSYKHAAGEPGV | NAFSYKHAASKPDV | NAFSYKHAASESDV | NAFSYKHAKSEPGV | NAFSYKHAASVPDV
NAFSYKHAASERDV | NAFYYKHGASQLDV | NAFYYKHGASESDV | NAFSYKHATRATDV | NAFSYKHATRAADV
NAFSYKHAASEPDV |
pattern: NAF[SY]YK[RH][VDGA][TSKA][SGRQ][HKAVEQN][TRHSPAL][DGYAVEN][IVL]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGV
NSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
V
NSGELPNAFSYKHVAGNSDVNSKELPNAFSYKHAAGESYVNFGELSNAFSYKHAAQEPGVNYE
ELPNAFSYKHAASKPDVNFGESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPN
AFSYKHAASEHEV
TSGELPNAFSYKRAASERDVNSGELPNAFYYKHDASHLDLNSGELPNAFSY
KHGASEPNV
NSAELPNAFSYKHGASVPAVNSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAA
GEPGV
NYEELPNAFSYKHAASKPDVNFGELPNAFSYKHAASESDVNFRESPNAFSYKHAKSEPG
V
NFGDSPNAFSYKHAASVPDVNSEELPNVFSYKYAASEREVNSGELPNAFSYKHAASERDVNSG
ELPNAFYYKHGASQLDVNSGELPNAFYYKHGASESDVNYGELPNAFSYKHATRATDVNHGELPN
AFSYKHATRAADV
NYGELPNAFSYKHAASEPDV
Repeat found in LOC131596166
Repeat occurs 32 times in a sequence of 756 amino acids
Location between 178698592 and 178701415
Coverage of 59.26 %
Instances:
NAFSYKHDASEPDI | NAFSYKHAASEPDV | NAFSYKHGASEPDV | NAFSYKHGASKSGV | NAFSYKHGASKSGV
NAFSYKHGASEPVV | NAFSYKHGASETGV | NAFSYKHGSSEPVV | NAFSYKHVAGESYV | NAFSYKHAAQEPGV
NAFSYKHAASKPDV | NAFSYKHAKSEPDV | NAFSYKHAASEPDV | NAFSYKHAASEHEV | NAFSYKRAASERDV
NAFYYKHDASHLDL | NAFSYKHGASEPNV | NAFSYKHGASVPAV | NAFSYKHAAGNSGV | NAFSYKHAAGESYV
NAFSYKHAAGEPSV | NAFFCGHVAGEPGV | NAFSYKHAASKPDV | NAFSYKHAASESDV | NAFSYKHAKSEPGV
NAFSYKHAASVPDV | NAFSYKHAASERDV | NAFYYKHGASQLDV | NAFYYKHGASESDV | NAFSYKHATRATDV
NAFSYKHATRAADV | NAFSYKHAASEPDV |
pattern: NAF[SYF][YC][GK][RH][VDGA][TSKA][SGRQ][HKAVEQN][TRHSPAL][DGSYAVEN][IVL]
MKFHFFHITTFLTLLLVATNAARLPPQLYWKSVIPNSPMPKVIANLLPNFPMPKPITNLLLPG
VKKSYSDGGEVGVSSGELPNAFSYKHDASEPDINSGELPNAFSYKHAASEPDVNSGESPDAFSY
KHAASESDVNSEELPNAFSYKHGASEPDVNFGELPNAFSYKHGASKSGVNSRELPNAFSYKHGA
SKSGV
NSGELPNAFSYKHGASEPVVNSGELPNAFSYKHGASETGVNSGELPNAFSYKHGSSEPV
V
NSGELPNAFSYKHVAGESYVNFGELSNAFSYKHAAQEPGVNYEELPNAFSYKHAASKPDVNFG
ESPNAFSYKHAKSEPDVNFRDSPNAFSYKHAASEPDVNSEELPNAFSYKHAASEHEVTSGELPN
AFSYKRAASERDV
NSGELPNAFYYKHDASHLDLNSGELPNAFSYKHGASEPNVNSAELPNAFSY
KHGASVPAV
NSGELPNAFSYKHAAGNSGVNSEELPNAFSYKHAAGESYVNFRELSNAFSYKHAA
GEPSV
NYEELPNAFFCGHVAGEPGVNYEELPNAFSYKHAASKPDVNFGELPNAFSYKHAASESD
V
NFRESPNAFSYKHAKSEPGVNFGDSPNAFSYKHAASVPDVNSEELPNVFSYKYAASEREVNSG
ELPNAFSYKHAASERDVNSGELPNAFYYKHGASQLDVNSGELPNAFYYKHGASESDVNYGELPN
AFSYKHATRATDV
NHGELPNAFSYKHATRAADVNYGELPNAFSYKHAASEPDV
Repeat found in LOC131596171
Repeat occurs 11 times in a sequence of 383 amino acids
Location between 178795056 and 178796447
Coverage of 31.59 %
Instances:
NVFQYPHAASE | NVFQYPNEASE | NVFQYPHAASE | NVFQYPHAASE | NVFRYNHAANV
NVFQYPHAANK | NVFRYDHAANE | NVFQYPHATSK | NVFRYDHAANE | NVFQYPHATSK
NVFRYDHAANE |
pattern: NVF[RQ]Y[DNP][NH][EA][TA][SN][VKE]
MEFHFFHFITFLMLVTVATNAARLPPQLYWKSVLPNTPMPKAISNLLPPSIDVGVKKSYSHGG
KANENSEGLANVFQYPHAASEVEVNSGGLPNVFQYPNEASEVDVKSGELPNVFQYPHAASEIDV
KSGEFPNVFQYPHAASEADVSSKELPNVFRYNHAANVPGINFGELPNGFGYHHSARETAVNSGE
LSNGFGYHHDASEPSVNSREFSNVFQYPHAANKADVSSKKLPNVFRYDHAANEPGINSGELSNG
FGYHHDASEPGVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINSGELSNGFGYH
HDASEPSVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINFGELSNGFGYHHAAK
Repeat found in LOC131596171
Repeat occurs 11 times in a sequence of 383 amino acids
Location between 178795056 and 178796447
Coverage of 31.59 %
Instances:
NVFQYPHAASE | NVFQYPNEASE | NVFQYPHAASE | NVFQYPHAASE | NVFRYNHAANV
NVFQYPHAANK | NVFRYDHAANE | NVFQYPHATSK | NVFRYDHAANE | NVFQYPHATSK
NVFRYDHAANE |
pattern: NVF[RQ]Y[DNP][NH][EA][TA][SN][VKE]
MEFHFFHFITFLMLVTVATNAARLPPQLYWKSVLPNTPMPKAISNLLPPSIDVGVKKSYSHGG
KANENSEGLANVFQYPHAASEVEVNSGGLPNVFQYPNEASEVDVKSGELPNVFQYPHAASEIDV
KSGEFPNVFQYPHAASEADVSSKELPNVFRYNHAANVPGINFGELPNGFGYHHSARETAVNSGE
LSNGFGYHHDASEPSVNSREFSNVFQYPHAANKADVSSKKLPNVFRYDHAANEPGINSGELSNG
FGYHHDASEPGVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINSGELSNGFGYH
HDASEPSVNSREFSNVFQYPHATSKADVSSKKLPNVFRYDHAANEPGINFGELSNGFGYHHAAK
Repeat found in LOC131596177
Repeat occurs 10 times in a sequence of 589 amino acids
Location between 178941060 and 178943303
Coverage of 11.88 %
Instances:
YDHTGVD | YDHAVSG | YDHAVSG | YDHAASE | YDHAARK
YDHAASE | YDHVASE | YDHAASE | YDHAASE | YDHSASE

pattern: YDH[TSVA][VGA][SVR][DGEK]
MEFHFFRIITFLMLLLVATNAARLPPQHYWKSVIPNSPMPKAITNLLLPSETDVSSKELSNWF
RYDHTGVDVNSKKLPNWFRYDRAASGSDVNSEELPNWFRYDHAVSGADVSSEELPNWFRYDHAV
SG
ADVSSEELPNWFRYNHAASEPDVNSEELSNVFKYDHAASEPDVSFKKFPNVFKYDHAARKPD
VSFKKLPNVFKYDHAASEPDVSFRKLPNVFKYDHVASEHDVSSRYFPNVFKYDHAASESDVSSK
YFSNVFKYDHAASEPEVNFRESPNGFSYHHAASEPGLNSRESLNGFFYRNAASEPNVNFRKSPN
GFSYHQAASVPDVNSGETSILSYNPPASEFNVNSGKLSNAFSYDHSASEKPKSTIFFFEKDLHH
GTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARIVNNTITDCEVE
SIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTEMYKESDFQQYKVITQGVKRLGEKNKAVVCH
KQNYPYAVFFCHTTDTTKVYSAPLKGVDGSIVKAIVVCHTDTSQWNPKHFAFLELKVKVGSGPI
CHLLPQDHVAWISK
Repeat found in LOC131596178
Repeat occurs 10 times in a sequence of 589 amino acids
Location between 178972989 and 178975235
Coverage of 11.88 %
Instances:
YDHTGVD | YDHAVSG | YDHAVSG | YDHAASE | YDHAARK
YDHAASE | YDHVASE | YDHAASE | YDHAASE | YDHSASE

pattern: YDH[TSVA][VGA][SVR][DGEK]
MEFHFFRIITFLMLLLVATNAARLPPQRYWKSVIPNSPMPKAITNLLLPSETDVSSKELSNWF
RYDHTGVDVNSKKLPNWFRYDRAASGSDVNSEELPNWFRYDHAVSGADVSSEELPNWFRYDHAV
SG
ADVSSEELPNWFRYNHAASEPDVNSEELSNVFKYDHAASEPDVSFKKFPNVFKYDHAARKPD
VSFKKLPNVFKYDHAASEPDVSFRKLPNVFKYDHVASEHDVSSRYFPNVFKYDHAASESDVSSK
YFSNVFKYDHAASEPEVNFRESPNGFSYHHAASEPGLNSRESLNGFFYRNAASEPNVNFRKSPN
GFSYHQAASVPDVNSGETSILSYNPPASEFNVNSGKLSNAFSYDHSASEKPKSTIFFFEKDLHH
GTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARIVNNTITDCEVE
SIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTEMYKESDFQQYKVITQGVKRLGEKNKAVVCH
KQNYPYAVFFCHTTDTTKVYSAPLKGVDGSIVKAIVVCHTDTSQWNPKHFAFLELKVKVGSGPI
CHLLPQDHVAWISK
Repeat found in LOC131596180
Repeat occurs 21 times in a sequence of 728 amino acids
Location between 179037704 and 179040213
Coverage of 31.73 %
Instances:
NGFGYHDAVSE | NGFGYHHAASE | NGFGYHHAASE | NGFGYHLAASE | NGFGYHHAASE
NGFGYHHAASD | NGFRYHLAASE | NGFVYHHAASE | NGFGYHHAGSE | NGFGYHHVASE
NGFGYHHGASE | NGFGYHHAASE | NGFGYNHAISG | NGFGYHHGASE | NGFGYHHATSE
NGFGYHHAASE | NGFGYNHAISG | NGFRYHHAASK | NGFRYHHAASK | NGFRYHHAASK
NGFRYHHIASE |
pattern: NGF[VGR]Y[NH][DLH][IVGA][TGAIV]S[DGEK]
MKFHFFHIITFLMLLLVATNAETLPPQLYWKSILPNSPMPKVITNLLLSDSPMAKTITNLLLP
GLDDGVKKSYSDGGEVEGKTDVIPKKLSNGFGYHDAVSEPNVNFKELSNGFGYHHAASEADGNS
EELPNGFGYHHAASEADVNSKKLPNGFGYHLAASEPNVNSKELSNGFGYHHAASEADGNSRELP
NGFGYHHAASD
ADVNSKILPNGFRYHLAASEADENSKELSNGFVYHHAASEADANSRELPNGFG
YHHAGSE
SDINSREISNGFGYHHVASETDVNSRELPNGFGYHHGASEAYVNSRKLSNGFGYHHA
ASE
SGVTSGELPNGFGYNHAISGVDGNSKELPNGFGYHHGASEPDVNFRELPNGFGYHHATSEA
GVKSEELPNGFGYHHAASEPDVNSGELPNGFGYNHAISGVDGNSKELPNGFRYHHAASKPDVNS
GDLPNGFRYHHAASKSDVNSGDLSNGFRYHHAASKPNVNSGDSHNGFRYHHIASEAQLHDKPKA
TIFFFEKELHHGTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGSKGARI
VNNTITDCEVESIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTETYKESDFQQYKVITQGVKR
LGEKNKAVVCHKQNYPYAVFFCHTTDTTKVYSVPLEAVDGSRVKAIVVCHTDTSQWNPKHFAFL
ELKVKVGTAPICHLLPQDHVAWISK
Repeat found in LOC131596181
Repeat occurs 10 times in a sequence of 495 amino acids
Location between 179063717 and 179106096
Coverage of 30.3 %
Instances:
YHHAASELDVNSRDL | YHHAASKPDVNSRDL | YHHTASESGVNSGDL | YHHAASQPDVNSGDL | YHHAASEPDVNFGDL
YHHAGSKPDVNSGDL | YHHAASKPDVNSGDL | YHHAASKPDINSGDL | YHHAASKPNVNSRDL | YHHASSKPNVNSRDL

pattern: YHH[TA][SGA]S[KEQ][SLP][DGN][IV]N[SF][GR]DL
MTFRFFHIITFLMLLLVATNAETLPMTKVITNLLLPSSDGGVKKDYYDGGEARDKTDEISKKL
PNGFRYHHAASELDVNSRDLLNGFRYHHAASKPDVNSRDLPNGFRYHHTASESGVNSGDLLNGF
RYHHAASQPDVNSGDLPNGFRYHHAASEPDVNFGDLSNGFRYHHAGSKPDVNSGDLSNGFPYHH
AASKPDVNSGDL
SNGFPYHHAASKPDINSGDLPNGFRYHHAASKPNVNSRDLPNEFRYHHASSK
PNVNSRDL
HNTASETQLHDKPKATIFFFEKDLHHGTKSYFRFAKTNPNNEAKFLPREFANSIPF
SSKKLEYILNKLNIRKGSKAARIVNNTITDCEVESIKGEEKLCVTSLESMIDFSISKIGKNLEA
VSTEMYKESDFQQYTVITQGVKRLGENNKAVVCHKQNYPYAVFFCHTTDTTKIYSVPLEGVDGS
RVKAIAVCHGDTSQWNPKHFAFLELKVKVGTAPICHLLPQDHVAWIAK
Repeat found in LOC131596181
Repeat occurs 10 times in a sequence of 495 amino acids
Location between 179104348 and 179106096
Coverage of 30.3 %
Instances:
YHHAASELDVNSRDL | YHHAASKPDVNSRDL | YHHTASESGVNSGDL | YHHAASQPDVNSGDL | YHHAASEPDVNFGDL
YHHAGSKPDVNSGDL | YHHAASKPDVNSGDL | YHHAASKPDINSGDL | YHHAASKPNVNSRDL | YHHASSKPNVNSRDL

pattern: YHH[TA][SGA]S[KEQ][SLP][DGN][IV]N[SF][GR]DL
MKFHFFHIITFLMLLLVATNAETLPMTKVITNLLLPSSDGGVKKDYYDGGEARDKTDEISKKL
PNGFRYHHAASELDVNSRDLLNGFRYHHAASKPDVNSRDLPNGFRYHHTASESGVNSGDLLNGF
RYHHAASQPDVNSGDLPNGFRYHHAASEPDVNFGDLSNGFRYHHAGSKPDVNSGDLSNGFPYHH
AASKPDVNSGDL
SNGFPYHHAASKPDINSGDLPNGFRYHHAASKPNVNSRDLPNEFRYHHASSK
PNVNSRDL
HNTASETQLHDKPKATIFFFEKDLHHGTKSYFRFAKTNPNNEAKFLPREFANSIPF
SSKKLEYILNKLNIRKGSKAARIVNNTITDCEVESIKGEEKLCVTSLESMIDFSISKIGKNLEA
VSTEMYKESDFQQYTVITQGVKRLGENNKAVVCHKQNYPYAVFFCHTTDTTKIYSVPLEGVDGS
RVKAIAVCHGDTSQWNPKHFAFLELKVKVGTAPICHLLPQDHVAWIAK
Repeat found in LOC131598408
Repeat occurs 12 times in a sequence of 597 amino acids
Location between 179158108 and 179161660
Coverage of 26.13 %
Instances:
NAFSYHHATNEVA | NAFSYYHAASKTN | NAFSYYAASEPDV | NAFSYHHAASKTN | NAFSYYAASEPDV
NAFSYHHDASKPD | NAFSYHHAASEPD | NAFSYHHDASKPD | NAFSYNHAASESD | NAFSYHHAANEAG
NAFSFHHTASKPD | NAFSYHHAASKPD |
pattern: NAFS[YF][YHN][HA][TDA][TSA][SNE][KEP][TDSPAV][DGAVN]
MDWQATNAETLPPQFYWKSVLPNSPIPKAITNLLPNSPMSKAITNILLPGVDVMSVELPNAFS
YHHATNEVA
VKSAESSNAFSYYHAASKTNLNSRETPNAFSYYAASEPDVNSGELSNAFSYHHAA
SKTN
LNSREPPNAFSYYAASEPDVNSGELSNAFSYHHDASKPDVNSEELSNAFSYHHAASEPDV
NSGELSNAFSYHHDASKPDVNSEELSNAFSYNHAASESDVNSRELSNAFSYHHAANEAGVKSAE
LPNAFSFHHTASKPDVSSEQLPNAFSYHHAASKPDASSGELSNNYQMHSYHHAASKPDVNSKEL
PNGFGYHHAASETGVKSEESPNGFGYHHSASEANLNSRKLPNGFGYHHVAGETQLHHKPHATIF
FFEKELHHGTKSYIRFAKTNPNNEAKFLPREVASSIPFSSKKLEYILNKLNIKKGTKGAHIVNN
TITDCEVDSIKGEEKLCVTSLESMIDFSISKIGKNLEAVSTEMYKESDFQQYKIITQGVKRLGE
KNKVVVCHKQNYPYAVFFCHTTDTTKVYSVPLEGVDGSRVKAIVVCHTDTSQWNPKHFAFLELK
VKVGTAPICHLLPQDHVAWIAK

Similar gene clusters

NC_081183 - Cluster 23 - Cyclopeptide

Gene cluster description

NC_081183 - Gene Cluster 23. Type = cyclopeptide. Location: 189111539 - 189952100 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_081183 - Cluster 24 - Cyclopeptide

Gene cluster description

NC_081183 - Gene Cluster 24. Type = cyclopeptide. Location: 189372371 - 190349773 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131596477
Repeat occurs 6 times in a sequence of 806 amino acids
Location between 190261927 and 190264524
Coverage of 11.17 %
Instances:
EDEAQPEPENDMVFD | EDEEDEEDEECEEDE | EDEEDEECEEDEEDE | EDEECEEDEEDEEDA | EDEEDEEDASEEGSD
EDEEDASEEGSDDED |
pattern: EDE[EA][DQC][EPA][SE][DEPC][EA][SNGE][SDEC][DEM][DVGE][SDEF][DEA]
METTPPTTQPSSSDVPSVGSSAANIHIPSGDIGWKFNHLKDLNNKRKVTCNFCKETSTGGISR
AKQHQLGIKGNVKACTQTPEDVKLILQEHEDKKLAAKKSMSGDVHEDDDEASHLLEISRIRAGK
RPAEEGSMPTAKKNVKGPLDVIYYRKPEETLKKGKQTSLNDACDKKARSSCCKYIARFFYRNGI
AFNVANSKSFKLMVEAIGMYGAHLKPPSYYELRVPMLQEELKLTHEMLSSNKKEQEKYGCSIMS
DGWTDTKGRTLINFLVNSPAGTMFVKSVDASANMKTGKKLFELLDNFVEEIGESNVVQLVTDNG
SNYVLAAKHLQISRPKIFWTPCAAHCLDLMLEDIGKIARVKKVIQKGMTLVGFIYNHSLVLNLM
REKLESELVRTGVTRFATNFLTLHRLHSLKSKIRPMFTSEEWLNLNASKEVKGKKAAAIVLQVS
FWEDIVFALKAMGPLVKVLRLVDNEKKPAMGSIYHAMLEAKELIKKNFNNEAKSKEVIDIVDRR
WSIQLHHPLHAAGYYLNPKYFYSNPMIENDDKLLDGLYACINKLSASEKVIDDIHGELAKYKMC
AGHFGLNEAVRQRANVAPAEWWRRYGAKTPNLQLLAIKILSLTCSSSGCERNWSAFEHIHSKKR
NRLEHQMLQDLVFIKYNQNLKERFDSDDLIDPVVLEDDFDTHNLWLLGGEDEAQPEPENDMVFD
GEDLSWLDVEIASGAAEPAINTRSQATLQKNAAAPPPPQPPSSSRSKPKAKEVVVVDDEFGDGE
YIGEDEEDEEDEECEEDEEDEEDASEEGSDDEDLDFNDS

Similar gene clusters

NC_081183 - Cluster 25 - Polyketide

Gene cluster description

NC_081183 - Gene Cluster 25. Type = polyketide. Location: 197100185 - 197239334 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 26 - Cyclopeptide

Gene cluster description

NC_081184 - Gene Cluster 26. Type = cyclopeptide. Location: 75899235 - 79233249 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_081184 - Cluster 27 - Saccharide-lignan

Gene cluster description

NC_081184 - Gene Cluster 27. Type = saccharide-lignan. Location: 107003866 - 107293244 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 28 - Saccharide

Gene cluster description

NC_081184 - Gene Cluster 28. Type = saccharide. Location: 112839245 - 113146412 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 29 - Terpene

Gene cluster description

NC_081184 - Gene Cluster 29. Type = terpene. Location: 139867633 - 140142542 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 30 - Fatty_acid-alkaloid

Gene cluster description

NC_081184 - Gene Cluster 30. Type = fatty_acid-alkaloid. Location: 147524201 - 147615497 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 31 - Alkaloid

Gene cluster description

NC_081184 - Gene Cluster 31. Type = alkaloid. Location: 147796592 - 148042492 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 32 - Polyketide-terpene

Gene cluster description

NC_081184 - Gene Cluster 32. Type = polyketide-terpene. Location: 157593453 - 157715705 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081184 - Cluster 33 - Cyclopeptide

Gene cluster description

NC_081184 - Gene Cluster 33. Type = cyclopeptide. Location: 159082940 - 159926238 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131603518
Repeat occurs 6 times in a sequence of 523 amino acids
Location between 159302796 and 159306670
Coverage of 14.91 %
Instances:
GGGGGGGGNGGDG | GGGGGGGNGGDGF | GGGGGGNGGDGFG | GGGGGNGGDGFGN | GGGGNGGDGFGNG
GGGNGGDGFGNGW |
pattern: GGG[GN][GN][GN][DGN][DGN][DNGF][DGF][DGFN][DGFN][GWFN]
MALNFSHRPVFTGGLAEDNMRMGNGYRVEGINEMSGGGGGGGGNGGDGFGNGWQDVEDCFDYG
NDRCDRGGVSGTQDSVSNDIVDRLPTDPFGMGIGTTFTAITGWLEDLDFDYGDYGGYGDYGGDE
MGTSGENYPLFAGLNFIWNNAVRFHTFPQGGNVGMEENRFQGVSEVGEASCSFSGESSRDMDGV
MGASVSGDVNMNDVKCHAIDEYTHSAIMYALPHLGLSDLFAVERVCKYLYSIVRNDALLWTSIH
IDQPLSERLTDDVLLELTSRAQGKLECLSLVECSRITDDGLRRVLEANPDIIKLSVPGCTRLSI
DGIVGMLKAYKNSPGTRGVKHLHIGGLYGVTPTHFEEIKLLLSTDRKKQKQSHTDSQLQEQSHT
DSQQQEHSHTDNQQQEHSGADSQLKKHSNKPHFYCRWNLYIPRDDDRSLDIEVCPRCENLRLVY
DCPAEGCHGVVGHASKVCRACTLCIPRCSQCGRCISDGEYEETFCLELLCSQCSHHLSKLIGKT
NGKVGLIRQSSP
Repeat found in LOC131607969
Repeat occurs 7 times in a sequence of 318 amino acids
Location between 159504110 and 159505067
Coverage of 33.02 %
Instances:
KETTKENHPFGFWAW | KETEKENQPFGFWAW | KETEKENQPFGFWAW | KETEKENQPFGFWAW | KETEKENQPFGFWAW
KETEKENQPFGFWAW | KETQNFLALTSDEKE |
pattern: KET[TEQ][NK][EF][NL][QHA][LP][TF][SG][DF][EW][KA][EW]
MSLGIQKHSESSESDKKETTKENHPFGFWAWSRKEIEKENQPFGFWAWSRKETEKENQPFGFW
AW
SRKETEKENQPFGFWAWSRKETEKENQPFGFWAWSRKETEKENQPFGFWAWSRKETEKENQP
FGFWAW
SRKANRPFLRDTKKETQNFLALTSDEKEAHIINNYCRTPSAIGEDKHCALSLESMMDF
AISKLGKNIKVMSSSLVQNQDKYVVEEVNKIGDKVVMCHRLNFKKVVFYCHAVNATTTYMVPFV
ASDGTKSKALTICHHDTRGMNPNVLSEVLNVKPGTVPVCHFIGNKAIAWVPDMSESGGHACVI

Similar gene clusters

NC_081185 - Cluster 34 - Phenolamide-alkaloid-saccharide

Gene cluster description

NC_081185 - Gene Cluster 34. Type = phenolamide-alkaloid-saccharide. Location: 23028618 - 23315537 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081185 - Cluster 35 - Fatty_acid

Gene cluster description

NC_081185 - Gene Cluster 35. Type = fatty_acid. Location: 74854008 - 74968967 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081185 - Cluster 36 - Saccharide-lignan

Gene cluster description

NC_081185 - Gene Cluster 36. Type = saccharide-lignan. Location: 101845139 - 102109089 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081185 - Cluster 37 - Saccharide

Gene cluster description

NC_081185 - Gene Cluster 37. Type = saccharide. Location: 102177382 - 102593696 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081185 - Cluster 38 - Cyclopeptide

Gene cluster description

NC_081185 - Gene Cluster 38. Type = cyclopeptide. Location: 115279804 - 116474429 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131612148
Repeat occurs 8 times in a sequence of 413 amino acids
Location between 115876046 and 115878186
Coverage of 29.06 %
Instances:
PFGFWAWSRKETEKE | PFGFWAWSRKETEKE | PFGFWIWQRKESEKE | PFGFWIWQRKETEKE | PFGFWIWQRKETEKK
PFGFWAWSKKESEEQ | PFGFWAWSRKESEKE | PFGFWAWSRKANQPF |
pattern: PFGFW[IA]W[SQ][KR]K[EA][TSN][EQ][KEP][FKEQ]
MEFQNLSVLALFFLALVGINGSKSGEEYWKSVWPNTTMPKALSDLLPSDIETSMPIKSHEENQ
YWTIFFEHDLYPGKKMNLGIQKPSESSEYDKKETSKASHPFGFWAWSRKETEKENQPFGFWAWS
RKETEKE
NQPFGFWIWQRKESEKENQPFGFWIWQRKETEKENQPFGFWIWQRKETEKKNQPFGF
WAWSKKESEEQ
NQPFGFWAWSRKESEKENQPFGFWAWSRKANQPFVRDTQKESQHSVALTSDEK
EAHMINEYCRNPSAIGEDKYCALSLESMVDFVISKLGKNIKVMSSSLGQNQENYVVEEVNKIGD
KAVMCHRLNFKKVVFYCHEVNATTTYMVPLVAPDGTKSKALTICHHDTRGMNANMLKEVLNVKP
GTVPVCHFIGNKAIAWVPDMSEPSDHPCAI

Similar gene clusters

NC_081185 - Cluster 39 - Fatty_acid

Gene cluster description

NC_081185 - Gene Cluster 39. Type = fatty_acid. Location: 130728644 - 130841739 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081185 - Cluster 40 - Saccharide

Gene cluster description

NC_081185 - Gene Cluster 40. Type = saccharide. Location: 157082844 - 157164512 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 41 - Cyclopeptide

Gene cluster description

NC_081186 - Gene Cluster 41. Type = cyclopeptide. Location: 1 - 3269593 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131616860
Repeat occurs 11 times in a sequence of 609 amino acids
Location between 153761 and 156568
Coverage of 10.84 %
Instances:
KVSKHF | KVSKYF | KVSKYF | KVSKYF | KVSKYF
KVSKYF | KVSKYF | KVSESC | KVSKYF | KVSDSC
KVSDSS |
pattern: KVS[DKE][SYH][SCF]
MENKNVDNLFNEFAYKGSSEGEHPKKTVFLSPYFPYKGNTRNTFYRSPYSCSSRKVSKHFREV
GESKVGDSCPSRKVSKYFVKDGESKVGDSCPSRKVSKYFVKDGESKVGDSCPSRKVSKYFVKDG
ESKVGDSCPSRKVSKYFVKDGESKVGDSCPSRKVSKYFAKDGESKVGDSCSRKVPKHFREVGES
KEDDSCPSRKVSKYFREVGEPEVGNSCPSRRTSKYFGEVGESKVSESCTSSKVSKYFVKVDDSK
VSDSC
RSRNLLEHFWEVEELELDYQKRVSPCSSRKVKNNRKSMTGFRECVKKRKCKGIGVIENL
KVEDSSCLAKIYEYLAKVEEIIIKGSLLLQKVSDSSRGGYQVEENCMKEINDFLNQYVYHGRSV
PISRKCVKKSENLEEGGDSEYRAKVKEAKIKFESILGDLSNSFQEGPKVEEGSSLVPPKMKSKS
RGKKTQPFHKAERYKEAYKRKSAENNWLPPRSHWNLIQEDHFHDPWRVLVICMLLNRTTGAQTK
KVLDEFFKLCPNAESCMRVSREEIQEVIKSLGLQGKRSAMLQRLSCEYLSDSWTHVTELHSVGK
YAADAYAIFCTGKWDEVVPNDHMLNKYWIFLRNL
Repeat found in LOC131616860
Repeat occurs 11 times in a sequence of 609 amino acids
Location between 153761 and 156568
Coverage of 10.84 %
Instances:
KVSKHF | KVSKYF | KVSKYF | KVSKYF | KVSKYF
KVSKYF | KVSKYF | KVSESC | KVSKYF | KVSDSC
KVSDSS |
pattern: KVS[DKE][SYH][SCF]
MENKNVDNLFNEFAYKGSSEGEHPKKTVFLSPYFPYKGNTRNTFYRSPYSCSSRKVSKHFREV
GESKVGDSCPSRKVSKYFVKDGESKVGDSCPSRKVSKYFVKDGESKVGDSCPSRKVSKYFVKDG
ESKVGDSCPSRKVSKYFVKDGESKVGDSCPSRKVSKYFAKDGESKVGDSCSRKVPKHFREVGES
KEDDSCPSRKVSKYFREVGEPEVGNSCPSRRTSKYFGEVGESKVSESCTSSKVSKYFVKVDDSK
VSDSC
RSRNLLEHFWEVEELELDYQKRVSPCSSRKVKNNRKSMTGFRECVKKRKCKGIGVIENL
KVEDSSCLAKIYEYLAKVEEIIIKGSLLLQKVSDSSRGGYQVEENCMKEINDFLNQYVYHGRSV
PISRKCVKKSENLEEGGDSEYRAKVKEAKIKFESILGDLSNSFQEGPKVEEGSSLVPPKMKSKS
RGKKTQPFHKAERYKEAYKRKSAENNWLPPRSHWNLIQEDHFHDPWRVLVICMLLNRTTGAQTK
KVLDEFFKLCPNAESCMRVSREEIQEVIKSLGLQGKRSAMLQRLSCEYLSDSWTHVTELHSVGK
YAADAYAIFCTGKWDEVVPNDHMLNKYWIFLRNL
Repeat found in LOC131616860
Repeat occurs 11 times in a sequence of 524 amino acids
Location between 153761 and 156510
Coverage of 12.6 %
Instances:
KVSKHF | KVSKYF | KVSKYF | KVSKYF | KVSKYF
KVSKYF | KVSKYF | KVSESC | KVSKYF | KVSDSC
KVSDSS |
pattern: KVS[DKE][SYH][SCF]
MENKNVDNLFNEFAYKGSSEGEHPKKTVFLSPYFPYKGNTRNTFYRSPYSCSSRKVSKHFREV
GESKVGDSCPSRKVSKYFVKDGESKVGDSCPSRKVSKYFVKDGESKVGDSCPSRKVSKYFVKDG
ESKVGDSCPSRKVSKYFVKDGESKVGDSCPSRKVSKYFAKDGESKVGDSCSRKVPKHFREVGES
KEDDSCPSRKVSKYFREVGEPEVGNSCPSRRTSKYFGEVGESKVSESCTSSKVSKYFVKVDDSK
VSDSC
RSRNLLEHFWEVEELELDYQKRVSPCSSRKVKNNRKSMTGFRECVKKRKCKGIGVIENL
KVEDSSCLAKIYEYLAKVEEIIIKGSLLLQKVSDSSRGGYQVEENCMKEINDFLNQYVYHGRSV
PISRKCVKKSENLEEGGDSEYRAKVKEAKIKFESILGDLSNSFQEGPKVEEGSSLVPPKMKSKS
RGKKTQPFHKAERYKEAYKRKSAENNWLPPRSHWNLIQEDHFHDPWRVLVICMLLNRTTGAQVC
GGCLCYILYWKVG
Repeat found in LOC131616862
Repeat occurs 5 times in a sequence of 938 amino acids
Location between 161720 and 166207
Coverage of 6.93 %
Instances:
DVCIPDSDLEDMS | DVCVPDSDLEDMS | DVCVPESDLDDMS | DVCVPDSDLDDMS | DVCVPDSVLDESQ

pattern: DVC[IV]P[DE]S[VD]L[DE][DE][SM][SQ]
MVRSRTAAKPSAPAPENEYNLEVISIGSLYNGQWHKKYWSTSRGKDRHPYPVGYQAVRAHNGV
TCKMEILEGVNGPTFLISSDDGNSASGKTPDIAWAGFQRKGFTRTTKVLRRKRHSPKMDGVELF
GFKNQFILRLLRGLVADTNRIAEPSGLVADTNGIAEPSFRNEVSKTEHDDSLIPLGRSRSKESQ
GCIDVDLYAPDTPDEEVLKYDSNISPKNDYYNSTDKDVAKSMMSLLLPQAVPLLRNNSTDEKFT
LIPSDTLPFKEEQNEVGCVSDVPSSDKNRPHDYDVCIPDSDLEDMSSPDVCVPDSDLEDMSSPD
VCVPESDLDDMS
PPDVCVPDSDLDDMSPPDVCVPDSVLDESQAHDSFKSVQNDLPIAQDFTGEK
TQDEAGGKIGNAKALAISSSQSPNFINTIRKRRNFVSPRTRTARNSVPPSKTIQAKNDKLRKPD
DSAGLILEKPEVHSDVPALQNNLAELNLSSQNPNPLSCENKCSGGKEAQFISEPMVPRNQELKD
NPKDNVKFVGRYMLPMRVSSLLLKTVGDEIHICVLCGLLTDQNRTLFAYKVAIKEPNFGCPSAM
VHTPIMLPDPKYNFMTETMVERTGMEFTPDGQFIVLIGSIKTPNCREGKIDCCCSTCTLICSKK
NGLKIVHVESGYASVVATLEAVEDLHCISVCEPNRLVSVGGSGNIQVWVMNSTWSEKIEHFIIP
SAGSTSPGIVELKRVPKCAHLVVGRNIYGEFSLWDITKLNCVSSFSASKYPINDFFPISLFHLQ
TKDLRLSYASKEEKAEKLLEASNFWLSKQRETYKYLPTEDVAMWFLVSTPSVIDCCQDHVSTSI
HHDVHTARSWTLALLTKNSISFGSPLDPRTTAVGVCGGYGIMSTSDGVVYMWDLSRGSKLDILH
HFKDDTITSIVSDDLDSNSSVVGVAGEEGHLLLYLHRDQDSTD
Repeat found in LOC131620259
Repeat occurs 4 times in a sequence of 339 amino acids
Location between 248064 and 249084
Coverage of 11.8 %
Instances:
SESESWSESE | SESWSESESE | SESESETDSD | SESETDSDSE |
pattern: SES[EW][TS][EDW][TS][DE]S[DE]
MAKTKQLLSPLLEEPLSASSSEEDSSEDILPKKKLILQSKNPPPLASKNPKPPSSSESESWSE
SE
SETDSDSEPEPTPPVKPLASPLKILTTPSPAKAGTKRAIEFQTTGTKRAADNNGTDSDSSKR
LKKKENKTERRGSNDEKEAARDSVKKVSAQRLFSEEDELAILKGLADFISTKNDPLENTQAFYN
FVKNSIRADANKRQLRRKVLALKKKFERNDENFTKEHDQKAFELFKKIGFKKQRTPKKGLGSDS
LSAKNEGAVKNVKNSLAFKEMSKFGNCFGLSLDAMEKGMELMGESKNAMLEEKWKKVQIVEMKL
LMTRAGLVKEQASMILESLV
Repeat found in LOC131620259
Repeat occurs 4 times in a sequence of 339 amino acids
Location between 248064 and 249084
Coverage of 11.8 %
Instances:
SESESWSESE | SESWSESESE | SESESETDSD | SESETDSDSE |
pattern: SES[EW][TS][EDW][TS][DE]S[DE]
MAKTKQLLSPLLEEPLSASSSEEDSSEDILPKKKLILQSKNPPPLASKNPKPPSSSESESWSE
SE
SETDSDSEPEPTPPVKPLASPLKILTTPSPAKAGTKRAIEFQTTGTKRAADNNGTDSDSSKR
LKKKENKTERRGSNDEKEAARDSVKKVSAQRLFSEEDELAILKGLADFISTKNDPLENTQAFYN
FVKNSIRADANKRQLRRKVLALKKKFERNDENFTKEHDQKAFELFKKIGFKKQRTPKKGLGSDS
LSAKNEGAVKNVKNSLAFKEMSKFGNCFGLSLDAMEKGMELMGESKNAMLEEKWKKVQIVEMKL
LMTRAGLVKEQASMILESLV
Repeat found in LOC131620259
Repeat occurs 4 times in a sequence of 339 amino acids
Location between 248064 and 249084
Coverage of 11.8 %
Instances:
SESESWSESE | SESWSESESE | SESESETDSD | SESETDSDSE |
pattern: SES[EW][TS][EDW][TS][DE]S[DE]
MAKTKQLLSPLLEEPLSASSSEEDSSEDILPKKKLILQSKNPPPLASKNPKPPSSSESESWSE
SE
SETDSDSEPEPTPPVKPLASPLKILTTPSPAKAGTKRAIEFQTTGTKRAADNNGTDSDSSKR
LKKKENKTERRGSNDEKEAARDSVKKVSAQRLFSEEDELAILKGLADFISTKNDPLENTQAFYN
FVKNSIRADANKRQLRRKVLALKKKFERNDENFTKEHDQKAFELFKKIGFKKQRTPKKGLGSDS
LSAKNEGAVKNVKNSLAFKEMSKFGNCFGLSLDAMEKGMELMGESKNAMLEEKWKKVQIVEMKL
LMTRAGLVKEQASMILESLV
Repeat found in LOC131620259
Repeat occurs 4 times in a sequence of 339 amino acids
Location between 248064 and 249084
Coverage of 11.8 %
Instances:
SESESWSESE | SESWSESESE | SESESETDSD | SESETDSDSE |
pattern: SES[EW][TS][EDW][TS][DE]S[DE]
MAKTKQLLSPLLEEPLSASSSEEDSSEDILPKKKLILQSKNPPPLASKNPKPPSSSESESWSE
SE
SETDSDSEPEPTPPVKPLASPLKILTTPSPAKAGTKRAIEFQTTGTKRAADNNGTDSDSSKR
LKKKENKTERRGSNDEKEAARDSVKKVSAQRLFSEEDELAILKGLADFISTKNDPLENTQAFYN
FVKNSIRADANKRQLRRKVLALKKKFERNDENFTKEHDQKAFELFKKIGFKKQRTPKKGLGSDS
LSAKNEGAVKNVKNSLAFKEMSKFGNCFGLSLDAMEKGMELMGESKNAMLEEKWKKVQIVEMKL
LMTRAGLVKEQASMILESLV
Repeat found in LOC131620259
Repeat occurs 4 times in a sequence of 339 amino acids
Location between 248064 and 249084
Coverage of 11.8 %
Instances:
SESESWSESE | SESWSESESE | SESESETDSD | SESETDSDSE |
pattern: SES[EW][TS][EDW][TS][DE]S[DE]
MAKTKQLLSPLLEEPLSASSSEEDSSEDILPKKKLILQSKNPPPLASKNPKPPSSSESESWSE
SE
SETDSDSEPEPTPPVKPLASPLKILTTPSPAKAGTKRAIEFQTTGTKRAADNNGTDSDSSKR
LKKKENKTERRGSNDEKEAARDSVKKVSAQRLFSEEDELAILKGLADFISTKNDPLENTQAFYN
FVKNSIRADANKRQLRRKVLALKKKFERNDENFTKEHDQKAFELFKKIGFKKQRTPKKGLGSDS
LSAKNEGAVKNVKNSLAFKEMSKFGNCFGLSLDAMEKGMELMGESKNAMLEEKWKKVQIVEMKL
LMTRAGLVKEQASMILESLV
Repeat found in LOC131620259
Repeat occurs 4 times in a sequence of 339 amino acids
Location between 248064 and 249084
Coverage of 11.8 %
Instances:
SESESWSESE | SESWSESESE | SESESETDSD | SESETDSDSE |
pattern: SES[EW][TS][EDW][TS][DE]S[DE]
MAKTKQLLSPLLEEPLSASSSEEDSSEDILPKKKLILQSKNPPPLASKNPKPPSSSESESWSE
SE
SETDSDSEPEPTPPVKPLASPLKILTTPSPAKAGTKRAIEFQTTGTKRAADNNGTDSDSSKR
LKKKENKTERRGSNDEKEAARDSVKKVSAQRLFSEEDELAILKGLADFISTKNDPLENTQAFYN
FVKNSIRADANKRQLRRKVLALKKKFERNDENFTKEHDQKAFELFKKIGFKKQRTPKKGLGSDS
LSAKNEGAVKNVKNSLAFKEMSKFGNCFGLSLDAMEKGMELMGESKNAMLEEKWKKVQIVEMKL
LMTRAGLVKEQASMILESLV
Repeat found in LOC131620259
Repeat occurs 4 times in a sequence of 339 amino acids
Location between 248064 and 249084
Coverage of 11.8 %
Instances:
SESESWSESE | SESWSESESE | SESESETDSD | SESETDSDSE |
pattern: SES[EW][TS][EDW][TS][DE]S[DE]
MAKTKQLLSPLLEEPLSASSSEEDSSEDILPKKKLILQSKNPPPLASKNPKPPSSSESESWSE
SE
SETDSDSEPEPTPPVKPLASPLKILTTPSPAKAGTKRAIEFQTTGTKRAADNNGTDSDSSKR
LKKKENKTERRGSNDEKEAARDSVKKVSAQRLFSEEDELAILKGLADFISTKNDPLENTQAFYN
FVKNSIRADANKRQLRRKVLALKKKFERNDENFTKEHDQKAFELFKKIGFKKQRTPKKGLGSDS
LSAKNEGAVKNVKNSLAFKEMSKFGNCFGLSLDAMEKGMELMGESKNAMLEEKWKKVQIVEMKL
LMTRAGLVKEQASMILESLV
Repeat found in LOC131620259
Repeat occurs 4 times in a sequence of 339 amino acids
Location between 248064 and 249084
Coverage of 11.8 %
Instances:
SESESWSESE | SESWSESESE | SESESETDSD | SESETDSDSE |
pattern: SES[EW][TS][EDW][TS][DE]S[DE]
MAKTKQLLSPLLEEPLSASSSEEDSSEDILPKKKLILQSKNPPPLASKNPKPPSSSESESWSE
SE
SETDSDSEPEPTPPVKPLASPLKILTTPSPAKAGTKRAIEFQTTGTKRAADNNGTDSDSSKR
LKKKENKTERRGSNDEKEAARDSVKKVSAQRLFSEEDELAILKGLADFISTKNDPLENTQAFYN
FVKNSIRADANKRQLRRKVLALKKKFERNDENFTKEHDQKAFELFKKIGFKKQRTPKKGLGSDS
LSAKNEGAVKNVKNSLAFKEMSKFGNCFGLSLDAMEKGMELMGESKNAMLEEKWKKVQIVEMKL
LMTRAGLVKEQASMILESLV
Repeat found in LOC131620259
Repeat occurs 4 times in a sequence of 339 amino acids
Location between 248064 and 249084
Coverage of 11.8 %
Instances:
SESESWSESE | SESWSESESE | SESESETDSD | SESETDSDSE |
pattern: SES[EW][TS][EDW][TS][DE]S[DE]
MAKTKQLLSPLLEEPLSASSSEEDSSEDILPKKKLILQSKNPPPLASKNPKPPSSSESESWSE
SE
SETDSDSEPEPTPPVKPLASPLKILTTPSPAKAGTKRAIEFQTTGTKRAADNNGTDSDSSKR
LKKKENKTERRGSNDEKEAARDSVKKVSAQRLFSEEDELAILKGLADFISTKNDPLENTQAFYN
FVKNSIRADANKRQLRRKVLALKKKFERNDENFTKEHDQKAFELFKKIGFKKQRTPKKGLGSDS
LSAKNEGAVKNVKNSLAFKEMSKFGNCFGLSLDAMEKGMELMGESKNAMLEEKWKKVQIVEMKL
LMTRAGLVKEQASMILESLV
Repeat found in LOC131616853
Repeat occurs 3 times in a sequence of 215 amino acids
Location between 249103 and 254162
Coverage of 8.37 %
Instances:
SSLPRK | SSLMGA | SSLMGA |
pattern: SSL[PM][GR][KA]
MELFDESHKENNCGDEISSPEADESRLPLELAIVGRHNVGKSTLPNTLLQEDRVLVGPEAVVS
KAFNAFKAAKRARELVISNSVLRSSSLPRKITDCSSSNSENCGLGEMMPLQLWQTTMDPEQQLL
NKLTAEDAAEANIVFSSLMGARGLGEMMPLQLWQTTTDPEQQLLNKLTAEDADEANIVFSSLMG
A
QVDVRKELIRNAANMIDLDQLDI
Repeat found in LOC131616867
Repeat occurs 5 times in a sequence of 935 amino acids
Location between 280597 and 285080
Coverage of 6.95 %
Instances:
DVCIPDSDLEDMS | DVCVPDSDLEDMS | DVCVPESDLDDMS | DVCVPDSDLDDMS | DVCVPDSLLDESQ

pattern: DVC[IV]P[DE]S[DL]L[DE][DE][SM][SQ]
MVRSRTAAKPSAPAPENEYNLEVISIGSLYNGQWHKKYWSTSRGKDRHPYPVGYQAVRAHNGV
TCKMEILEGVNGPTFLISSDDGNSASGKTPDIAWAGFQRKGFTRTTKVLRRKRHSPKMDGVELF
GFKNQFILRLLRRLVADTNGIAEPSGLVADTNGIAEPSFRNEVSKTEHDDSLIPLGRSRSKESQ
GCIDVDLYAPDTPDEEVLKYDSNISPKNDYYNSTDKDVAKSMMSLLLPQAVPLLRNNSTDEKFT
LIPSDTLPFKEEQNEVGCVSDVPSSDKNRPHDYDVCIPDSDLEDMSSPDVCVPDSDLEDMSSPD
VCVPESDLDDMS
PPDVCVPDSDLDDMSPPDVCVPDSLLDESQAHDSFKSVQNDLPIAQDFTGEK
TQDEAGGKIGNAKTLAISSSQSPNFINTIRKRRNFVSPRTRNNVPPSKTIQAKNDKLRKPDDSA
GLILEKPEVHSDVPDLQNNLAELNLSSQNPNPLSCENKCSGGKEAQFISEPMVPRNQELKDNPK
DNVKFVGRYMLPMRVSSLLLKTVGDEIHICVLCGLLTDQNRTLFAYKVAIKEPNFGCPSAMVHT
PIMLPDPKYNFMTETMVERTGMEFTPDGQFIVLIGSIKTPSCRESKIDCCCSTCTLICSKKNGL
KIVHVESGYASVVATLEAVEDLHCISVCEPNRLVSVGGSGNIQVWVMNSTWSEKIEHFIIPSAG
STSPGIVELKRVPKCAHLVVGRNIYGEFSLWDITKLNCVSSFSASKYPINDFFPISLFHLQTKD
LRLSYASKEEKAEKLLEASNFWLSKQRETYNYLPTEDVAMWFLVSTPSVIDCCQDHVSTSIHHD
VHTARSWTLALLTKNSISFGSPLDPRTTAVGVCGGYGIMSTSDGVVYMWDLSRGSKLDILHHFK
DDTITSIVSDDLDSNSSVVGVAGEEGHLLLYLHRDQDSTD
Repeat found in LOC131616886
Repeat occurs 5 times in a sequence of 480 amino acids
Location between 1816079 and 1817937
Coverage of 10.42 %
Instances:
ERTKTQIDEK | ERTTTQLDVK | ERTTTQLDVK | ERTKTQIDEK | ERTKTQIDEK

pattern: ERT[TK]TQ[IL]D[VE]K
MALNSQRTFRAQSLRYPLSNRRVYETVEPSSEIKETAEAYFLHVKLPGYTIIRPKIALEDASK
KLRITGERPIADDKWEKFDQTYPVPENSDAEKLEAKFEQEILILKMQKKPISPSQVVAPKNQPD
PLSNEDLDETKVEKVQETMKTDSPQTQSIEKNTQEATHDDTLSQIAKEAISNNEPQRGLQELEQ
KPTFIERTKTQIDEKGQEAFEKKPTLLQRIITQIYEKAQKSREELEEKATSIERTTTQLDVKAQ
KGQEEFIERTTTQLDVKAQKGQEEFIERTKTQIDEKDHKGQAEFETKPTVTERTKTQIDEKVKK
GQEESIEKSQTETNEKPQKVQEEFEPKPIEKLEEKIIRNSVKKERMLEKEESEDKKEKPYYESI
KAMIDVKNQNLKENEIEKEKLPAPKVTENENESCSIITPKQEEKNGVEKLAITSSQLVTRMADG
KWRSNERNLVKNVGVAVLVIAAFGAYISYRFSS

Similar gene clusters

NC_081186 - Cluster 42 - Fatty_acid

Gene cluster description

NC_081186 - Gene Cluster 42. Type = fatty_acid. Location: 37833592 - 38035634 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 43 - Cyclopeptide

Gene cluster description

NC_081186 - Gene Cluster 43. Type = cyclopeptide. Location: 47940489 - 49870593 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_081186 - Cluster 44 - Cyclopeptide

Gene cluster description

NC_081186 - Gene Cluster 44. Type = cyclopeptide. Location: 50474192 - 52064976 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131619040
Repeat occurs 4 times in a sequence of 348 amino acids
Location between 51604173 and 51605272
Coverage of 17.24 %
Instances:
QEEREFRLAKPQVSY | QEEKEAIFAEPQVGY | QEEKEAILAKPQAGY | QEEKEAILAKPPLTY |
pattern: QEE[KR]E[FA][IR][LF]A[KE]P[PQ][VLA][TSG]Y
MEPMYNEQSTDSTISSRKDTSSGYQEEREFRLAKPQVSYQLHYNKNHPPTQQEEKEAIFAEPQ
VGY
QLHSNNKNHPPSQQEEKEAILAKPQAGYQLQYNNNNHPPSQQEEKEAILAKPPLTYHTYKK
QNKGFKGVKSVEEGLFFRQSMMTVGSIIRMQNIKDYIPRQSFFTRSILKKLPSASSEMSQLLKL
FNNLSKTMMESVKVCHNPANTGEERSCEVSLKGMLDFAKFILGRNITAHTTESVNVSSQNIIVV
GVNVIGEKSVTCHRSWFPFLMCYCHSVPKVCIYQVDIWEVKSKQKINNGVAVCHLDTSSWSPTH
VAFKVLGYGPGLIEACHWVFENDMVWIAA
Repeat found in LOC131619044
Repeat occurs 4 times in a sequence of 397 amino acids
Location between 51615159 and 51616353
Coverage of 15.11 %
Instances:
QEEREFRLAKPQVSY | QEEREAILAEPQVGY | QEEKEAILAKPQAGY | QEEKEAILAKPPLTY |
pattern: QEE[KR]E[FA][IR]LA[KE]P[PQ][VLA][TSG]Y
MRVILIFILFSALVSLTHKKFTSTPRELAIEYWNQSSTSKVPIPLFLIEKILPPVTKLEKHLF
FQDANTLCFSQEEREFRLAKPQVSYQLRYNKNHPPSHQEEREAILAEPQVGYQLHYNNKNHPPS
QQEEKEAILAKPQAGYQLQYNNNNHPPSQQEEKEAILAKPPLTYHTYKKQNKGFKWVKSVEEGL
FFRQSMMTVGSIIRMQNIKDYIPRQSFFTRSLLKKLPSASSEMSQLLNVFNNLSKTMMESVKAC
HNPANTGEERNCEVSLKGMLDFAKFILSRNITAHTTESVNGSSQNIIVAGVNVIGEKSVTCHRS
WFPFLMYYCHNVPRVCIYQVDIWEVKSKQKINNGVAVCHLDTSSWSPTHVDFKVLGYGPGLIEA
CHWVFENDMVWIAA

Similar gene clusters

NC_081186 - Cluster 45 - Cyclopeptide

Gene cluster description

NC_081186 - Gene Cluster 45. Type = cyclopeptide. Location: 51269643 - 51945586 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131619040
Repeat occurs 4 times in a sequence of 348 amino acids
Location between 51604173 and 51605272
Coverage of 17.24 %
Instances:
QEEREFRLAKPQVSY | QEEKEAIFAEPQVGY | QEEKEAILAKPQAGY | QEEKEAILAKPPLTY |
pattern: QEE[KR]E[FA][IR][LF]A[KE]P[PQ][VLA][TSG]Y
MEPMYNEQSTDSTISSRKDTSSGYQEEREFRLAKPQVSYQLHYNKNHPPTQQEEKEAIFAEPQ
VGY
QLHSNNKNHPPSQQEEKEAILAKPQAGYQLQYNNNNHPPSQQEEKEAILAKPPLTYHTYKK
QNKGFKGVKSVEEGLFFRQSMMTVGSIIRMQNIKDYIPRQSFFTRSILKKLPSASSEMSQLLKL
FNNLSKTMMESVKVCHNPANTGEERSCEVSLKGMLDFAKFILGRNITAHTTESVNVSSQNIIVV
GVNVIGEKSVTCHRSWFPFLMCYCHSVPKVCIYQVDIWEVKSKQKINNGVAVCHLDTSSWSPTH
VAFKVLGYGPGLIEACHWVFENDMVWIAA
Repeat found in LOC131619044
Repeat occurs 4 times in a sequence of 397 amino acids
Location between 51615159 and 51616353
Coverage of 15.11 %
Instances:
QEEREFRLAKPQVSY | QEEREAILAEPQVGY | QEEKEAILAKPQAGY | QEEKEAILAKPPLTY |
pattern: QEE[KR]E[FA][IR]LA[KE]P[PQ][VLA][TSG]Y
MRVILIFILFSALVSLTHKKFTSTPRELAIEYWNQSSTSKVPIPLFLIEKILPPVTKLEKHLF
FQDANTLCFSQEEREFRLAKPQVSYQLRYNKNHPPSHQEEREAILAEPQVGYQLHYNNKNHPPS
QQEEKEAILAKPQAGYQLQYNNNNHPPSQQEEKEAILAKPPLTYHTYKKQNKGFKWVKSVEEGL
FFRQSMMTVGSIIRMQNIKDYIPRQSFFTRSLLKKLPSASSEMSQLLNVFNNLSKTMMESVKAC
HNPANTGEERNCEVSLKGMLDFAKFILSRNITAHTTESVNGSSQNIIVAGVNVIGEKSVTCHRS
WFPFLMYYCHNVPRVCIYQVDIWEVKSKQKINNGVAVCHLDTSSWSPTHVDFKVLGYGPGLIEA
CHWVFENDMVWIAA

Similar gene clusters

NC_081186 - Cluster 46 - Cyclopeptide

Gene cluster description

NC_081186 - Gene Cluster 46. Type = cyclopeptide. Location: 74989548 - 75629424 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131619369
Repeat occurs 3 times in a sequence of 362 amino acids
Location between 75422379 and 75423658
Coverage of 9.94 %
Instances:
GFLDAQSPKEDE | GFLKAQSPKEDE | GFLEAPEHERKR |
pattern: GFL[DKE]A[PQ][SE][HP][KE][RE][DK][RE]
MALFSTNIKVLAIHHDAILLDRIQQMCDRCHYLLTKCTSAYHALNLLAQRNGYFDMMLIDVCM
PDMDSYEFLRYVTLQHRIPVIVISSDGSKSSVMNSVIYGACDYWVQPLHEKQFKTMWQHVARNN
FNENKAEHHVPSDQQHHVLGFLDAQSPKEDEVLGFLKAQSPKEDEVVGFLEAPEHERKREREDD
KPSKEKSGKKSRLSWTPQLHQQFLNAVNQVGFDEAKPRNILKRMDVCGLTTAQVASHLQKYRNY
LKGPSCGKKSNKSQRIDTPTKTASETEDANSVLHHENQSCQLNSTLQSYNISETQQDSNHDIGY
NYEAQQHSIHVTDNQVSDIINKDFPGASDLLLDLDDELLSLLF

Similar gene clusters

NC_081186 - Cluster 47 - Saccharide

Gene cluster description

NC_081186 - Gene Cluster 47. Type = saccharide. Location: 88078290 - 88125441 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 48 - Terpene

Gene cluster description

NC_081186 - Gene Cluster 48. Type = terpene. Location: 89975369 - 90375791 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 49 - Saccharide

Gene cluster description

NC_081186 - Gene Cluster 49. Type = saccharide. Location: 98409334 - 98465002 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 50 - Fatty_acid-saccharide

Gene cluster description

NC_081186 - Gene Cluster 50. Type = fatty_acid-saccharide. Location: 100070768 - 100246005 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 51 - Cyclopeptide

Gene cluster description

NC_081186 - Gene Cluster 51. Type = cyclopeptide. Location: 103769367 - 104272802 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131618126
Repeat occurs 6 times in a sequence of 430 amino acids
Location between 104019406 and 104022762
Coverage of 16.74 %
Instances:
IQPFGAWRNEAA | IQPFGAWRNEKE | IQPFGAWRNEKE | IQPFGAWRNEKE | IQPFGAWRNEKE
IQPFEAWRNEEI |
pattern: IQPF[GE]AWRNE[KEA][IEA]
MEFTSLSILALLCVALMGTNASKSGEEYWKSIWPDTPVPKILSDLVLMDNTELIRGQEMKQYW
TVFFNHDLYPGKQISLGIQKQSYVQPSRSNAQILIEKASPHFSTKEEVEKSMKPLKDLAWHGEA
TKEDIKEQIQPFGAWRNEAAKKENEEPIQPFGAWRNEKEIEEPIQPFGAWRNEKEIDEPIQPFG
AWRNEKE
IEEPIQPFGAWRNEKEIKEPIQPFEAWRNEEINKEIERPNKHFEAVVWPHKTTIKKI
EKVSQTSITRPSGEKEAHVIHDYCEKPSAIGEDRHCVTSLESMMDFAISKLGKNIKVMSSSFAQ
NQNQYVMEEVRKIGDKAVMCHKMNLKNVVFYCHQVNATTIYKVPLVASDGTKSNALTICHHDTR
GMNANALYKVLKVRPGTVPICHFIGNKAIAWVPNDNVSEYDDCPRVI

Similar gene clusters

NC_081186 - Cluster 52 - Putative

Gene cluster description

NC_081186 - Gene Cluster 52. Type = putative. Location: 117897938 - 118218863 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_081186 - Cluster 53 - Alkaloid

Gene cluster description

NC_081186 - Gene Cluster 53. Type = alkaloid. Location: 127926819 - 128392737 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026704941 - Cluster 54 - Saccharide

Gene cluster description

NW_026704941 - Gene Cluster 54. Type = saccharide. Location: 1060132 - 1270765 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026704977 - Cluster 55 - Fatty_acid

Gene cluster description

NW_026704977 - Gene Cluster 55. Type = fatty_acid. Location: 1214347 - 1345981 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026704983 - Cluster 56 - Cyclopeptide

Gene cluster description

NW_026704983 - Gene Cluster 56. Type = cyclopeptide. Location: 1 - 338293 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NW_026704997 - Cluster 57 - Cyclopeptide

Gene cluster description

NW_026704997 - Gene Cluster 57. Type = cyclopeptide. Location: 237691 - 1799112 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131623552
Repeat occurs 6 times in a sequence of 169 amino acids
Location between 756731 and 757241
Coverage of 53.25 %
Instances:
NVEDANGEDENVEEA | NVEEANGEDENVEEA | NVEEANGKDENVEEA | NVEEANGEDENVEEA | NVEEANGEDENVEGV
NVEGVNSEDSETDPD |
pattern: NVE[DGE][VA]N[SG][KE]D[SE][NE][TV][DE][GEP][DVA]
MKDIVIRCLWYWNPKFAFSRGLWPLNDDKDVLRFMEDIRGYKLIDIYVEHKVEEVNIEDVRFN
DATDEEVNVEDANGEDENVEEANGEDENVEEANGKDENVEEANGEDENVEEANGEDENVEGVNS
EDSETDPDYNMSNEEYEGDEDDEDDEGSEDDELNLGHDIGVD
Repeat found in LOC131623527
Repeat occurs 3 times in a sequence of 311 amino acids
Location between 764025 and 765666
Coverage of 5.79 %
Instances:
GIGLSV | GIGIGV | GIGVGI |
pattern: GIG[IVL][SG][IV]
MENSYKVNGNGSAENGYSIARHTHSYQPSLKGSLPWLDIRVFYVRVCKCEFDNTTPEVLTLNH
VPLNPDTLLEVNGVRSSIYSDGMSTLLKRDRVDRKSEEVTFVSTDSIRMSGSVKFEVFDKDNLL
LFGALELCNSNGVVRESNSNGQSWSMKCESNIIPGTRFFKEKQLLLPETTLPTVEVYIAGSFSG
TPIILTKTLHLSSQKRHSRKGVLKAIPENDANENGKDPSSALALQGPDYMLDKHEDEDYHSLYT
RTAYADGEDGELSWFNAGVRVGVGIGLSVCLGIGIGVGIMVKTYQGATGQFRRRLL
Repeat found in LOC131623527
Repeat occurs 3 times in a sequence of 311 amino acids
Location between 764025 and 765666
Coverage of 5.79 %
Instances:
GIGLSV | GIGIGV | GIGVGI |
pattern: GIG[IVL][SG][IV]
MENSYKVNGNGSAENGYSIARHTHSYQPSLKGSLPWLDIRVFYVRVCKCEFDNTTPEVLTLNH
VPLNPDTLLEVNGVRSSIYSDGMSTLLKRDRVDRKSEEVTFVSTDSIRMSGSVKFEVFDKDNLL
LFGALELCNSNGVVRESNSNGQSWSMKCESNIIPGTRFFKEKQLLLPETTLPTVEVYIAGSFSG
TPIILTKTLHLSSQKRHSRKGVLKAIPENDANENGKDPSSALALQGPDYMLDKHEDEDYHSLYT
RTAYADGEDGELSWFNAGVRVGVGIGLSVCLGIGIGVGIMVKTYQGATGQFRRRLL
The following known motifs were found in CDS LOC131623490
Location between 1037244 and 1038700
FEPR was found 5 times in this sequence
Sequence:
MKPALALLPFLFLFLFVVTTESRKDLKEYWQTVMKDEEMPEGIKGLLQLKSETEPLKNSKVQE
QLVKGKCDQHLVTNTLDVKEKKVVTGEFEPRPSATKYDGNGGYKNMKLSVNGEFEPIPSVTKYD
GDEGYKNMKPSVNGEFKPIPSVTKYDGDEGYKNIKLPVNGEFEPIPSVTKYDGDESYKNMKLSV
NSEFEPIPSVTKYDGDESYKNMKLPVNGEFEPIPSVTKYDGSYKNTKLPVNGEFEPIPSVTRYD
GNDGYKSMKLPVNDEFEPRPSATKYDGDEGYKTMKVNVNDEFEPRPSATKYDGDEGYKNIKLQV
NGEFEPIPSVTRYDGDDGYKNIKLPMNNEFEPRPSATKYDGDEGYKTMKVNVNDEFEPRPSATK
YND
Repeat found in LOC131623496
Repeat occurs 9 times in a sequence of 591 amino acids
Location between 1300324 and 1302399
Coverage of 22.84 %
Instances:
SYGNHEAEKPDYNTG | SYGNHEAEKPDYNTG | SYGNREAEKPDYNTG | SYGNREAEKPDYNTG | SYGNHEAEKPNYNTG
SYGNHEAEKPDYDTG | SYGNHEAEKPDYDTG | SYGNHEAEKPDYNTG | SYGKHEPNHPYITQY |
pattern: SYG[NK][RH]E[PA][NE][KH]P[YDN][IY][DNT][TQ][YG]
MAQTLALQFLTLLLFFFMNGQGITARDLKTELRDVDSNEQPYITQYGHTKPDYAASYNTHDSN
GPYKTSYGNHEAEKPDYNTGYRTHTHHSNELYKTSYGNHEAEKPDYNTGYKARTHDSNEPYKTS
YGNREAEKPDYNTG
YKARTHDSNEPYKTSYGNREAEKPDYNTGYRAHTHDSNEPYRTSYGNHEA
EKPNYNTG
YRAHSHTHDSNKPYKTSYGNHEAEKPDYDTGYRAHTHDSNEPYKTSYGNHEAEKPD
YDTG
YRAHTHDSNEPYKTSYGNHEAEKPDYNTGYRTHTQNSNKPYLTYANHETDESNGPYIASY
GKHEPNHPYITQY
GPTSLDLKDLTSPNSKNLKGSASSNLDRTEAFKTGFFNLDDLYVGHVMTLQ
FPVQEVSHYLPKKVADSIPLSKSQLPSVLQLFSISEDTSQAKSMRGTLEECEGETITGETKVCA
NSLESMLEFVDTIIGSDTKHSILSTSQPSPTAIPLQKYTILEVSHDIHAPKWVACHPLPYPYAI
HYCHYIATGSKVFKVTLVGDENGDKMEALGICHLDTSDWNPDHIVFKQLGIKAGKNTPVCHFFP
VNHLLWVPVEPSKATM

Similar gene clusters

NW_026705087 - Cluster 58 - Saccharide

Gene cluster description

NW_026705087 - Gene Cluster 58. Type = saccharide. Location: 205247 - 463519 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026705146 - Cluster 59 - Cyclopeptide

Gene cluster description

NW_026705146 - Gene Cluster 59. Type = cyclopeptide. Location: 1 - 1170491 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131626855
Repeat occurs 23 times in a sequence of 347 amino acids
Location between 532362 and 538144
Coverage of 99.42 %
Instances:
AGTRNPAAGTRLRVP | AGTRLRVPGCGYPAA | AGTRLRVPGCGYPGC | AGTRLRVPGCGYPAA | AGTRLRVPGCGYPAA
AGTRLRVPGCGYPAA | AGTRLRVPGCGYPAA | AGTPAAGTRLRVPGC | AGTRLRVPGCGYPAA | AGTRLRVPGCGYPAA
AGTRLRVPGCGYPAA | AGTRLRVPGCGYPAA | AGTRLRVPGCGYPAA | AGTRLRVPGCGYPAA | AGTRLRVPGCGYPAA
AGTRLRVPGCGYPAA | AGTRLRVPGCGYPAA | AGTRLRVPGCGYPAA | AGTRLRVPRLRVPGC | AGTRLRVPGCGYPAA
AGTRLRVPGCGYPAA | AGTRLRVPGCGYPAA | AGTRLRVPGCGGLTI |
pattern: AGT[RP][NLA][RPA][VGA][TPA][GR][TCL][GR][VYGL][RPL][TVGA][ICPA]
MTLPICKYDSELHSEYPNHNFSVAGTRNPAAGTRLRVPGCGYPAAGTRLRVPGCGYPGCGYPA
AGTRLRVPGCGYPAA
GTRLRVPGCGYPAAGTRLRVPGCGYPAAGYPAAGTRLRVPGCGYPAAGT
PAAGTRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLR
VPGCGYPAA
GTRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLRVPGCGYPA
A
GTRLRVPGCGYPAAGTRLRVPRLRVPGCGYPAAGTRLRVPGCGYPAAGTRLRVPGCGYPAAGT
RLRVPGCGYPAA
GTRLRVPGCGGLTICY

Similar gene clusters

NW_026705156 - Cluster 60 - Polyketide

Gene cluster description

NW_026705156 - Gene Cluster 60. Type = polyketide. Location: 743207 - 1117375 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026705200 - Cluster 61 - Cyclopeptide

Gene cluster description

NW_026705200 - Gene Cluster 61. Type = cyclopeptide. Location: 1 - 699046 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131628110
Repeat occurs 8 times in a sequence of 413 amino acids
Location between 312549 and 314700
Coverage of 29.06 %
Instances:
PFGFWAWSRKETEKE | PFGFWAWSRKETEKE | PFGFWIWQRKESEKE | PFGFWIWQRKETEKE | PFGFWIWQRKETEKK
PFGFWAWSRKESEEQ | PFGFWAWSRKESEKE | PFGFWAWSRKANQPF |
pattern: PFGFW[IA]W[SQ]RK[EA][TSN][EQ][KEP][FKEQ]
MEFQNLSVLALFFLALVGINGSKSGEEYWKSVWPNTTMPKALSDLLPSDIETSMPIKSHEENQ
YWTIFFEHDLYPGKKMNLGIQKPSESSEYDKKETSKASHPFGFWAWSRKETEKENQPFGFWAWS
RKETEKE
NQPFGFWIWQRKESEKENQPFGFWIWQRKETEKENQPFGFWIWQRKETEKKNQPFGF
WAWSRKESEEQ
NQPFGFWAWSRKESEKENQPFGFWAWSRKANQPFVRDTQKESQHSVALTSDEK
EAHMINEYCRKPSAIGEDKYCALSLESMVDFVISKLGKNIKVMSSSLGQNQENYVVEEVNKIGD
KAVMCHRLNFKKVVFYCHEVNATTTYMVPLVAPDGTKSKALTICHHDTRGMNANMLKEVLNVKP
GTVPVCHFIGNKAIAWVPDMSEPSDHPCAI
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1127 amino acids
Location between 604998 and 612530
Coverage of 9.32 %
Instances:
APTQDAPTQEAPTQD | APTQEAPTQDAPTQE | APTQDAPTQEAPTQE | APTQEAPTQEAPTQE | APTQEAPTQEAPTQA
APTQEAPTQASQTSK | APTQASQTSKKRGRP |
pattern: APTQ[DEA][SA][QP]T[SQ][DKEA][SKA][QRP][TG][SRQ][DKPAE]
MLQEMSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERV
KMWWKHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGK
ACASEKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVP
NETDNNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEF
RSLKQFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSK
HKCGRQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIV
EGDSSQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFI
GLDGCHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISD
QQKGLVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKE
ANVDAYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMG
RFATLREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTC
SCNYWELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPII
MPPLFKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAART
SQEAPTQDAPTQEAPTQDAPTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKT
KKSKTVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDD
VQDAVPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVK
TFVGKAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIP
GPGKKPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1127 amino acids
Location between 604998 and 612530
Coverage of 9.32 %
Instances:
APTQDAPTQEAPTQD | APTQEAPTQDAPTQE | APTQDAPTQEAPTQE | APTQEAPTQEAPTQE | APTQEAPTQEAPTQA
APTQEAPTQASQTSK | APTQASQTSKKRGRP |
pattern: APTQ[DEA][SA][QP]T[SQ][DKEA][SKA][QRP][TG][SRQ][DKPAE]
MLQEMSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERV
KMWWKHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGK
ACASEKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVP
NETDNNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEF
RSLKQFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSK
HKCGRQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIV
EGDSSQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFI
GLDGCHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISD
QQKGLVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKE
ANVDAYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMG
RFATLREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTC
SCNYWELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPII
MPPLFKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAART
SQEAPTQDAPTQEAPTQDAPTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKT
KKSKTVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDD
VQDAVPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVK
TFVGKAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIP
GPGKKPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1123 amino acids
Location between 608914 and 612530
Coverage of 9.35 %
Instances:
APTQDAPTQEAPTQD | APTQEAPTQDAPTQE | APTQDAPTQEAPTQE | APTQEAPTQEAPTQE | APTQEAPTQEAPTQA
APTQEAPTQASQTSK | APTQASQTSKKRGRP |
pattern: APTQ[DEA][SA][QP]T[SQ][DKEA][SKA][QRP][TG][SRQ][DKPAE]
MSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERVKMWW
KHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGKACAS
EKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVPNETD
NNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEFRSLK
QFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSKHKCG
RQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIVEGDS
SQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFIGLDG
CHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISDQQKG
LVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKEANVD
AYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMGRFAT
LREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTCSCNY
WELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPIIMPPL
FKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAARTSQEA
PTQDAPTQEAPTQD
APTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKTKKSK
TVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDDVQDA
VPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVKTFVG
KAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIPGPGK
KPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1123 amino acids
Location between 608914 and 612530
Coverage of 9.35 %
Instances:
APTQDAPTQEAPTQD | APTQEAPTQDAPTQE | APTQDAPTQEAPTQE | APTQEAPTQEAPTQE | APTQEAPTQEAPTQA
APTQEAPTQASQTSK | APTQASQTSKKRGRP |
pattern: APTQ[DEA][SA][QP]T[SQ][DKEA][SKA][QRP][TG][SRQ][DKPAE]
MSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERVKMWW
KHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGKACAS
EKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVPNETD
NNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEFRSLK
QFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSKHKCG
RQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIVEGDS
SQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFIGLDG
CHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISDQQKG
LVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKEANVD
AYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMGRFAT
LREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTCSCNY
WELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPIIMPPL
FKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAARTSQEA
PTQDAPTQEAPTQD
APTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKTKKSK
TVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDDVQDA
VPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVKTFVG
KAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIPGPGK
KPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1123 amino acids
Location between 608914 and 612530
Coverage of 9.35 %
Instances:
APTQDAPTQEAPTQD | APTQEAPTQDAPTQE | APTQDAPTQEAPTQE | APTQEAPTQEAPTQE | APTQEAPTQEAPTQA
APTQEAPTQASQTSK | APTQASQTSKKRGRP |
pattern: APTQ[DEA][SA][QP]T[SQ][DKEA][SKA][QRP][TG][SRQ][DKPAE]
MSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERVKMWW
KHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGKACAS
EKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVPNETD
NNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEFRSLK
QFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSKHKCG
RQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIVEGDS
SQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFIGLDG
CHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISDQQKG
LVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKEANVD
AYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMGRFAT
LREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTCSCNY
WELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPIIMPPL
FKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAARTSQEA
PTQDAPTQEAPTQD
APTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKTKKSK
TVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDDVQDA
VPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVKTFVG
KAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIPGPGK
KPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1123 amino acids
Location between 608914 and 612530
Coverage of 9.35 %
Instances:
APTQDAPTQEAPTQD | APTQEAPTQDAPTQE | APTQDAPTQEAPTQE | APTQEAPTQEAPTQE | APTQEAPTQEAPTQA
APTQEAPTQASQTSK | APTQASQTSKKRGRP |
pattern: APTQ[DEA][SA][QP]T[SQ][DKEA][SKA][QRP][TG][SRQ][DKPAE]
MSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERVKMWW
KHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGKACAS
EKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVPNETD
NNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEFRSLK
QFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSKHKCG
RQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIVEGDS
SQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFIGLDG
CHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISDQQKG
LVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKEANVD
AYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMGRFAT
LREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTCSCNY
WELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPIIMPPL
FKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAARTSQEA
PTQDAPTQEAPTQD
APTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKTKKSK
TVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDDVQDA
VPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVKTFVG
KAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIPGPGK
KPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ
Repeat found in LOC131628119
Repeat occurs 7 times in a sequence of 1123 amino acids
Location between 608914 and 612530
Coverage of 9.35 %
Instances:
APTQDAPTQEAPTQD | APTQEAPTQDAPTQE | APTQDAPTQEAPTQE | APTQEAPTQEAPTQE | APTQEAPTQEAPTQA
APTQEAPTQASQTSK | APTQASQTSKKRGRP |
pattern: APTQ[DEA][SA][QP]T[SQ][DKEA][SKA][QRP][TG][SRQ][DKPAE]
MSKFKVIFYTRGKFVKDPELKYNGGDMYAFSGQDSDYWSFFEACDLVKSIDPEFDFERVKMWW
KHEAGSFEADLNPFRDDGDSAELAAYAVGHGAEVEIYCEPKPDEGELTFMESVRQKGKGKACAS
EKEPLHECSGDSSDESIKDVHFDDSEEERMNGFEDDFEELVGVGKTSEVRSPPVTNNAVPNETD
NNIFITDEMGKAHVIEEEYMTDELDSGADDDSGDDRPCVIRFNAEESFTKDFVFKVGMEFRSLK
QFKDAILEHNVLNGREVKFVKNDANRCRVVCKDKEKCDYTVLCSRVLTSTTFRIKTLYSKHKCG
RQFFNRCAKADWVAKVIVDGLKNNTKMKLNDVVADVRLRYATEIPGCRAFKARQIARQIVEGDS
SQQFNLLWSYGAELRRASPGNTFKLNTTCAGEGLNPRFEKCYMCFDGTKMALKKACRPFIGLDG
CHLKHKYGGILLIAVGRDPNDQYLPIAFAVVENESKDTWSWFMKLLLEDIGDGRWCFISDQQKG
LVNVFDEEYPSFEHRFCLRHLYANFKKKFGGGTLFRDLMMAAAKATYYEAHEAKMLMIKEANVD
AYEWLQAIPKHKWCKHAFPFYSKCDVLMNNLSESFNATILLQRDKPIITMFEWIRNYLMGRFAT
LREKVNAYKGYVMTKPLRRLDREIEKSASWTATYAGRLTFQVTHVLFTDSFVVDLEKHTCSCNY
WELIGIPCRHAVAAIHRKVDDPVKYVHKCYLRTTYEHCYSEVITPLNGQNKWPKTAHPIIMPPL
FKRGPGRPKKLRRREPDEANQTKWQRTNTSHRCKICFVLGHNKRTCKKNKQIVVVSAARTSQEA
PTQDAPTQEAPTQD
APTQEAPTQEAPTQEAPTQASQTSKKRGRPTGSINKKKKDKVAPKTKKSK
TVEPECDVAANAVPDAVPEKDDVQADIPMQDDVPVPDSVPEKADVQADIPVQDADPVQDDVQDA
VPVQDDVQDAPNTDINDSQSSLKKYCGIDAETLAKILDDDEILDVQPLNVDTSPVKRTVKTFVG
KAPKVVKKTSKSTASSSSRPTCSEPVKKPIQKHKPKKIVDMGVQKRQSDRLRTIKTKNIPGPGK
KPDDPLVIYEDEATSDQSKGGKQSEDWEAICKRMTQ

Similar gene clusters

NW_026705238 - Cluster 62 - Fatty_acid-saccharide-transporter_associated

Gene cluster description

NW_026705238 - Gene Cluster 62. Type = fatty_acid-saccharide-transporter_associated. Location: 260194 - 915933 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026705502 - Cluster 63 - Cyclopeptide

Gene cluster description

NW_026705502 - Gene Cluster 63. Type = cyclopeptide. Location: 1 - 437028 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC131633585
Repeat occurs 10 times in a sequence of 647 amino acids
Location between 92680 and 94923
Coverage of 23.18 %
Instances:
DSNEQPYITQYGHTK | DSNGPYKTSYGNHEA | DSNEPYKTSYGNREA | DSNEPYRTSYGNHEA | DSNEPYKTSYGNHEA
DSNKPYKTSYGNHEA | DSNEPYKTSYGNREA | DSNEPYRTSYGNHEA | DSNKPYKTSYGNHEA | DSNEPYKTSYGNHEA

pattern: DSN[GKE][PQ][YP][YKR][TI][TS][YQ][YG][GN][RH][TE][KA]
MAQTLAPKFLTLLLFFFMNGQGITARDLKTELRDVDSNEQPYITQYGHTKPDYAASYNTHDSN
GPYKTSYGNHEA
EKPDYNTGYRTHTHHSNELYKTAYGNHEAEKPDYNTGYKARTHDSNEPYKTS
YGNREA
EKPDYNTGYRAHTHDSNEPYRTSYGNHEAEKPNYNTGYRAHTHDSNEPYKTSYGNHEA
EKPDYDTGYRAHTHDSNKPYKTSYGNHEAEKPDYDTGYRAHTHDSNEPYKTSYGNREAEKPDYN
TGYRAHTHDSNEPYRTSYGNHEAEKPDYDTGYRAHTHDSNKPYKTSYGNHEAEKPDYDTGYRAH
THDSNEPYKTSYGNHEAEKPDYDTGYRTHTHNSNKPYLTYAKHETDESNGPYIASYGKHEPNHP
YITQYGPTSLDLKDLTSPNSKDLEGSASSNLDRTEAFKTGFFNLDDLYVGHEMTLQFPVQEVSH
YLPKKAADSIPLSKSQLPSVLQLFSISEDSTQAKAMRGTLEECEGETITGETKICANSLESMLE
FVDTIIGTDTKHSILSTSNLSPTATPLQKYTILEVSHDIHAPKWVACHPLPYPYAIHYCHYIAT
GSKVFKVTLVGDENGDKMEALGICHLDTSDWNPDHIVFKQLGIKAGMNTPVCHFFPVNHLLWVP
EEPSKATM

Similar gene clusters

NW_026705654 - Cluster 64 - Saccharide

Gene cluster description

NW_026705654 - Gene Cluster 64. Type = saccharide. Location: 219592 - 319723 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026705869 - Cluster 65 - Fatty_acid-alkaloid

Gene cluster description

NW_026705869 - Gene Cluster 65. Type = fatty_acid-alkaloid. Location: 163682 - 342477 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026705877 - Cluster 66 - Cyclopeptide

Gene cluster description

NW_026705877 - Gene Cluster 66. Type = cyclopeptide. Location: 1 - 385462 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NW_026706016 - Cluster 67 - Polyketide-alkaloid

Gene cluster description

NW_026706016 - Gene Cluster 67. Type = polyketide-alkaloid. Location: 1 - 303682 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters