Identified secondary metabolite clusters

Cluster | Type | From | To | Size (kb) | Core domains | Product/substrate predicted by subgroup | Most similar known cluster | MIBiG BGC-ID
The following clusters are from record NC_080047.1:
Cluster 1 | Lignan | 2958849 | 3128493 | 169.64 | Dirigent, Methyltransf_11, p450 | - | - | -
Cluster 2 | Cyclopeptide | 7355435 | 7828264 | 472.83 | BURP | - | - | -
Cluster 3 | Cyclopeptide | 7426161 | 7916798 | 490.64 | BURP | - | - | -
Cluster 4 | Saccharide | 9659197 | 9781682 | 122.48 | ADH_N, ADH_zinc_N, UDPGT_2 | hydroxycinnamate | - | -
Cluster 5 | Transporter_associated | 42261614 | 42644789 | 383.18 | ABC2_membrane, ABC_tran, Lycopene_cycl, Methyltransf_11, Methyltransf_7, oMT | - | - | -
The following clusters are from record NC_080048.1:
Cluster 6 | Saccharide | 2678249 | 2889925 | 211.68 | AMP-binding, Cellulose_synt, p450 | - | - | -
Cluster 7 | Saccharide | 3023724 | 3128925 | 105.20 | Acetyltransf_1, Glyco_hydro_1, adh_short | - | - | -
Cluster 8 | Terpene-Saccharide | 4621979 | 4739624 | 117.64 | Amino_oxidase, Terpene_synth, Terpene_synth_C, UDPGT_2 | alkaloid, cytokinin | - | -
Cluster 9 | Alkaloid | 31683136 | 31807546 | 124.41 | Cu_amine_oxid, Dimerisation, Methyltransf_2, polyprenyl_synt | - | - | -
Cluster 10 | Phenolamide-Alkaloid-Transporter_associated | 34627809 | 34872101 | 244.29 | 2OG-FeII_Oxy, ABC2_membrane, ABC_tran, DIOX_N, Epimerase, Orn_Arg_deC_N, Orn_DAP_Arg_deC, Peptidase_S10, Transferase | - | - | -
Cluster 11 | Saccharide | 40682173 | 40752510 | 70.34 | Aldo_ket_red, Amino_oxidase, Glycos_transf_1 | - | - | -
The following clusters are from record NC_080049.1:
Cluster 12 | Phenolamide | 2022006 | 2128017 | 106.01 | Pyridoxal_deC, Transferase | - | - | -
Cluster 13 | Putative | 24728208 | 25052655 | 324.45 | Amino_oxidase, Methyltransf_11, p450 | - | - | -
Cluster 14 | Cyclopeptide | 36070570 | 36489565 | 419.00 | BURP | - | - | -
The following clusters are from record NC_080050.1:
Cluster 15 | Putative | 28798648 | 28936835 | 138.19 | 3Beta_HSD, Epimerase, Oxidored_FMN, adh_short | - | - | -
Cluster 16 | Putative | 30420211 | 30525920 | 105.71 | ADH_N, ADH_zinc_N, Peptidase_S10, p450 | - | - | -
The following clusters are from record NC_080051.1:
Cluster 17 | Terpene | 23582755 | 23704483 | 121.73 | Epimerase, SQHop_cyclase_C, SQHop_cyclase_N, p450 | beta-amyrin-2, triterpene-2 | yossoside I/yossoside II/yossoside III/yossoside IV/yossos... (60% of genes show similarity) | BGC0002402.2_c1
Cluster 18 | Saccharide | 24264873 | 24522446 | 257.57 | Peptidase_S10, UDPGT_2, p450 | flavonoid, oleananes | - | -
Cluster 19 | Alkaloid | 26310822 | 26435683 | 124.86 | ADH_N, ADH_zinc_N, Epimerase, Pyridoxal_deC | - | - | -
The following clusters are from record NC_080052.1:
Cluster 20 | Terpene-Lignan | 30959023 | 31071846 | 112.82 | Dirigent, Epimerase, Terpene_synth, Terpene_synth_C | - | - | -
The following clusters are from record NC_080054.1:
Cluster 21 | Polyketide-Transporter_associated | 27100083 | 27283548 | 183.47 | AMP-binding, Acetyltransf_1, Chal_sti_synt_C, Chal_sti_synt_N, LTP_2, Methyltransf_7 | - | - | -
Cluster 22 | Cyclopeptide | 28439267 | 28869696 | 430.43 | BURP | - | - | -
Cluster 23 | Alkaloid | 29040116 | 29128421 | 88.31 | Aminotran_1_2, Cu_amine_oxid, p450 | - | - | -
The following clusters are from record NC_080055.1:
Cluster 24 | Cyclopeptide | 8362507 | 9111747 | 749.24 | BURP | - | - | -
Cluster 25 | Fatty_acid | 25590027 | 25750196 | 160.17 | ABC_tran, FA_hydroxylase, Methyltransf_11 | - | - | -
The following clusters are from record NC_080056.1:
Cluster 26 | Alkaloid | 5284891 | 5461530 | 176.64 | 3Beta_HSD, Cu_amine_oxid, Lipoxygenase | - | - | -
Cluster 27 | Saccharide | 5959438 | 6129503 | 170.06 | HMGL-like, PALP, UDPGT_2 | carboxyl-2 | - | -
Cluster 28 | Cyclopeptide | 7860427 | 8400765 | 540.34 | BURP | - | - | -
The following clusters are from record NC_080058.1:
Cluster 29 | Fatty_acid-Saccharide | 1059118 | 1165943 | 106.83 | FA_desaturase_2, Glyco_hydro_1, Transferase | - | - | -
Cluster 30 | Saccharide | 5833150 | 5905383 | 72.23 | AMP-binding, Glycos_transf_1, SQS_PSY | - | - | -
Cluster 31 | Cyclopeptide | 15066258 | 16338881 | 1272.62 | BURP | - | - | -
Cluster 32 | Cyclopeptide | 16405130 | 18079259 | 1674.13 | BURP | - | - | -
Cluster 33 | Cyclopeptide | 17565958 | 19396780 | 1830.82 | BURP | - | - | -
The following clusters are from record NC_080059.1:
Cluster 34 | Cyclopeptide | 1 | 289362 | 289.36 | BURP | - | - | -
Cluster 35 | Polyketide-Saccharide-Transporter_associated | 11332135 | 11725359 | 393.22 | Aminotran_1_2, Chal_sti_synt_C, FAE1_CUT1_RppA, Glycos_transf_1, MatE, Peptidase_S10, p450 | - | - | -
The following clusters are from record NC_080060.1:
Cluster 36 | Putative | 1859910 | 1925385 | 65.47 | 2OG-FeII_Oxy, DIOX_N, Prenyltransf | - | - | -
Cluster 37 | Fatty_acid | 7020404 | 7065947 | 45.54 | FA_desaturase, p450 | - | - | -
Cluster 38 | Transporter_associated | 7290914 | 7496923 | 206.01 | Epimerase, FA_desaturase, GMC_oxred_C, GMC_oxred_N, MatE | - | - | -
Cluster 39 | Alkaloid | 8877280 | 8966960 | 89.68 | Cu_amine_oxid, LTP_2, Methyltransf_11, p450 | - | - | -
The following clusters are from record NC_080061.1:
Cluster 40 | Putative | 4728933 | 4829249 | 100.32 | 2OG-FeII_Oxy, Methyltransf_11, Transferase | - | - | -
Cluster 41 | Saccharide | 5001870 | 5122885 | 121.02 | Aminotran_1_2, Cellulose_synt, Epimerase | - | - | -
Cluster 42 | Alkaloid | 5239698 | 5339337 | 99.64 | AMP-binding, Methyltransf_11, Pyridoxal_deC | - | - | -
Cluster 43 | Terpene-Saccharide | 5733397 | 5856465 | 123.07 | Glycos_transf_1, SQHop_cyclase_C, SQHop_cyclase_N, p450 | Cycloartenol-3 | - | -
The following clusters are from record NC_080062.1:
Cluster 44 | Polyketide | 2119190 | 2241392 | 122.20 | Abhydrolase_3, Chal_sti_synt_C, FAE1_CUT1_RppA, p450 | - | - | -
Cluster 45 | Cyclopeptide | 6293856 | 6690347 | 396.49 | BURP | - | - | -
Cluster 46 | Saccharide | 7190393 | 7332065 | 141.67 | Aminotran_1_2, Glycos_transf_1 | - | - | -
Cluster 47 | Cyclopeptide | 7894593 | 8538640 | 644.05 | Aminotran_1_2, BURP, Bet_v_1 | - | - | -
Cluster 48 | Alkaloid | 8478179 | 8553816 | 75.64 | Aminotran_1_2, Bet_v_1 | - | - | -
The following clusters are from record NC_080063.1:
Cluster 49 | Saccharide | 5066930 | 5206190 | 139.26 | ABC2_membrane, ABC_tran, Prenyltransf, UDPGT_2, p450 | *saccharide | - | -
Cluster 50 | Terpene-Transporter_associated | 7048728 | 7283282 | 234.55 | ABC2_membrane, ABC_tran, Abhydrolase_3, Amino_oxidase, ECH_2, Prenyltrans | - | - | -
Cluster 51 | Cyclopeptide | 15379095 | 15779384 | 400.29 | BURP | - | - | -
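
The Size (kb) values in the table appear to equal (To - From) / 1000. The short Python sketch below checks that relation for the coordinates reported for Clusters 1 to 3; the helper name `size_kb` is hypothetical and is not part of the tool that produced this report.

```python
def size_kb(start: int, end: int) -> float:
    """Return the cluster length in kilobases from its From/To coordinates (nt)."""
    return (end - start) / 1000


# Coordinates as reported for the first three clusters of NC_080047.
clusters = {
    "Cluster 1": (2958849, 3128493),
    "Cluster 2": (7355435, 7828264),
    "Cluster 3": (7426161, 7916798),
}

for name, (start, end) in clusters.items():
    print(f"{name}: {size_kb(start, end):.2f} kb")
```

The same check is a convenient way to confirm that From and To have been read correctly from a row before using them downstream.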

NC_080047 - Cluster 1 - Lignan

Gene cluster description

NC_080047 - Gene Cluster 1. Type = lignan. Location: 2958849 - 3128493 nt.
pHMM detection rules used:
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
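
One way to read a rule of the form `minimum(N,[allowed domains],[core domains])` is sketched below in Python. This is an assumed interpretation of the syntax shown above, not the detection code of the tool itself. It assumes that at least N domain hits from the first list must be present, that every comma-separated entry of the second list is required, and that a slash separates interchangeable alternatives. The function names `parse_core_list` and `rule_matches` are hypothetical, and the `allowed` set is a small subset of the full list, used only for brevity.

```python
from typing import Iterable, List, Set


def parse_core_list(text: str) -> List[Set[str]]:
    """Parse the second list of a rule, e.g. "[A/B,C]" -> [{"A", "B"}, {"C"}].

    Assumption: commas separate requirements, slashes separate alternatives.
    """
    inner = text.strip().strip("[]").strip()
    if not inner:
        return []
    return [{name.strip() for name in group.split("/")} for group in inner.split(",")]


def rule_matches(gene_domains: Iterable[str], minimum: int,
                 allowed: Set[str], core_groups: List[Set[str]]) -> bool:
    """Return True if the domain hits satisfy the assumed rule semantics."""
    hits = [d for d in gene_domains if d in allowed]   # one entry per domain hit
    found = set(hits)
    enough_hits = len(hits) >= minimum
    core_present = all(found & group for group in core_groups)
    return enough_hits and core_present


# Illustrative subset of the allowed-domain list (the real list is much longer).
allowed = {"Dirigent", "Methyltransf_11", "p450", "Epimerase"}
lignan_core = parse_core_list("[Dirigent]")

print(rule_matches(["Dirigent", "Methyltransf_11", "p450"], 3, allowed, lignan_core))
print(rule_matches(["Methyltransf_11", "p450", "p450"], 3, allowed, lignan_core))
```

Under these assumptions the first call satisfies the plants/lignan rule and the second does not, because no Dirigent hit is present. A rule with an empty core list, such as plants/plant, reduces to the hit-count check alone.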

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_080047 - Cluster 2 - Cyclopeptide

Gene cluster description

NC_080047 - Gene Cluster 2. Type = cyclopeptide. Location: 7355435 - 7828264 nt.
pHMM detection rules used:
plants/cyclopeptide: (BURP)

Repeatfinder output


Repeat found in LOC130828702
Repeat occurs 14 times in a sequence of 549 amino acids
Location between 7650796 and 7655345
Coverage of 25.5 %
Instances:
GEQSPSIGAI | GEQSPSIGAI | GEQSLFINAI | GEQSPSIGAI | GEQSPSIGAV
GEQSPFIDAI | GEQSPSIGAI | GEQSLFIDAI | GEQSPSIGAI | GEQSLFIDAI
GEQSPSIGAI | GEQSPSIGAI | GEQSPSVGAI | GEQSPSIAAI |
pattern: GEQS[PL][FS][IV][GADN]A[IV]
The following known motifs were found:
.L.Y..Y was found 13 times in this sequence
MNKALLLLDMEGYIRILFTILFSVMLLASQIDGRKDLKAYWDSMMDGQDMPQAIKNYINEKSY
EGHQKGDYHSYTINPKQSPSRGAISKEFNNEFDPKPNRLKYSTYTVNGEQSPSIGAIRKDFNNE
FDAKPNRLKYGTYTINGEQSPSIGAIRKDFNNEFDPKPNRLKYGTYTINGEQSLFINAISKELN
KDFDPKPNRLKYGTYTMKGEQSPSIGAIRKDFNDEFDPKPNRLKYGTYTMNGEQSPSIGAVRKD
FNNEFDPKPNRLKYGTYTINGEQSPFIDAISKELNKDFDPKPNRLKYGTYTMNGEQSPSIGAIR
KDFNNEFDPKPNRLKYGTYTINGEQSLFIDAISKELNKDFDPKPNRLKYGTYTMNGEQSPSIGA
IRKDFNNEFDPKPNRLKYGTYTINGEQSLFIDAISKELNKDFDPKPNRLKYSTYTMNGEQSPSI
GAIRKDFNNEFDPKPNRLKYGTYTMNGEQSPSIGAIRKDFNNEFDPKPNRLKYGTYTFNGEQSP
SVGAIRKDFNNEFDSKPSPFLYDSYAMNGEQSPSIAAI
Repeat found in LOC130828720
Repeat occurs 4 times in a sequence of 606 amino acids
Location between 7680439 and 7682421
Coverage of 3.96 %
Instances:
RSSSRG | RSSSRG | RSSGRS | RSSGRE |
pattern: RSS[GS]R[GES]
MIVGSSRKPGERPWTVRVCPGVKGKHNHPFLVYRDGHVRARINVDIREHIRQLSATGMQPAFI
MNSIRDNFPGFYASMNQIYNIRQSIRRDEMEGRTLLQHCLHMATEHNYVVWTELDNDGHLSRLL
IANPTSIQMIRTWPYVVLIDTTYKTNKSKWPLCEVIGMTPTNHNFLVAFCLMRDEAAVSYSWVL
QGLRDIFGSAQTPSVIVTDRDEGLSAAIRDVFPDVRHLLCTWHIGNDVENMVDKLCGGKKNQQG
QLFRKSRWNPLVESATIREYEKRWEGIVSTWSVRNRRVVRYLTGTWIPLREKFVRAWTNDGLHF
GNHTTSRVESQHSSFKYYLGSGNSSFDTLFKRAHAQITNQQAKIRQSLQESMTSVPRMLRQYFF
RPLYRHVSLYALEQIQNEFNRMLELGDFALNKCGCVLLKTHGLPCACYLQIKIGSHGALYLDDI
HEFWSTLRYTEVGDEPNEEVRNANANDKEYFQSLVDEVLKSDPAFVRRMAEVLEYELHPDGADI
PEPYASPPRKGRPSTSKTMRRRKSSFEYSRSSSRGRGSRSSSRGRSSGRSSGRETQSSVGIKFS
FNLSDDPGGHDVSQFPWPDYIPFMLPPYLFD
Repeat found in LOC130828723
Repeat occurs 11 times in a sequence of 383 amino acids
Location between 7689900 and 7694839
Coverage of 25.85 %
Instances:
QSPSIAAVS | QSPSVAAIR | QSPSVAVIR | QSPSVTAIR | QSPSVAAIR
QSPSVAKTT | QSPSVAAIR | QSPSVAAIR | QSPSVAKTT | QSPSVAAIR
QSPSVAAKK |
pattern: QSPS[IV][AT][KAV][KTIV][KTRS]
MNKALLMLDMEGYIRILFTILLSVMLLASQIDGRKDPKAYWEAMMEGQAMPQAIKNYINEKGD
YYTDKINLKQSPSIAAVSKEFNNDFDLKPNLLLYPKYGEQSPSVAAIRKDFNNDFDPKSNWFYY
PKYGEQSPSVAVIRKDFNNDFDPKSNWFYYPKYGEQSPSVTAIRKDFNNDFDPKSNWFYYPKYG
EQSPSVAAIRKDFNNDFDPKSNWFYYPKHGEQSPSVAKTTRKDFNNDFDPKSNWFYYPKYGEQS
PSVAAIRKDFNNDFDPKSNWFYYPKYGEQSPSVAAIRKDFNNDFDPKSNWFYYPKHGEQSPSVA
KTTRKDFNNDFDPKSNWFYYPKYGEQSPSVAAIRKDFNNDFDPKSNWFYYPKYGEQSPSVAAKK
Repeat found in LOC130828734
Repeat occurs 4 times in a sequence of 850 amino acids
Location between 7758213 and 7766148
Coverage of 2.82 %
Instances:
EKPVEE | EKPDKE | EKPDEK | EKPDAV |
pattern: EKP[DV][KAE][KEV]
MKKFHANPSTNIELKDLSELSVGDPDARQEIMEFLDHWGLINYHPLPQSDPAKDNADPNTDAE
MAENTDSLLEKLYRFENEQSSLQLVPRAKMSTATVPSGLFPESIAEELVKQEGPAVEYHCNSCS
ADCSRKRYHCQKQADFDLCTECYNNGKFGSGMCPSDFILMEPAEASGTTGGKWTDQETLLLLEA
LELFKENWNEIAEHVATKTKAQCILHFLQMPIEDSFLDYDDKINGVQENGEPTSNIESPVPNDI
SEPSEGKASKDNSGAAEGVSGNDDSEPSAGKINKDKPELEMKTDANAAQAVSVEAETLNFEEAP
DGNVVHDKGDDIVIKALKEAFHIAGCPLTPEDKLSFAEAGNSVMTLAAFLSQLVEPGLATASAH
DSLKTMSHSSPGLQLAERHCFVLEDPPDVEKELIASESMTTENIDQNASEMAKENLKNDKTSIR
NEAKDNPLSVEKHSTESQIDKKKASKSINDPTLPPRELDACADNSRLPEKPVEETLLQVTEKPD
KESTQITEKPDEKIPKVSEKTDIETQLKGKPDKETPQVTEKSEEKSQVTEKSDEEKSEVAEKPD
AVSKVEHADTKLVDELKSSDVQNEQLDTTKEQVSSATEVRPPSVCEPENLKTTTESVQCTDSQK
DKDMVLSSVVSDKTGQRDVPAAVPMDAFVADAGEDKMEVDKSEDTKSMKKIRTKEEDNVAKLKR
AALATLSAAAVKAKLLANQEEGEIRQLAIVMIEKQMRKLEIKLSYFTEMESAIMRVREHLERSR
QKLFSERAQIIASRLGTLPPSRMLPSSYPMNKMPPGFVNSMMRPPLSMGAQRPFISRPVATSAP
PTMNTSAPPEKTGPEGKPT
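
The reported patterns read as regular-expression character classes, and the coverage figures above are consistent with (number of instances x instance length) / sequence length. The Python sketch below illustrates both on a short fragment of LOC130828720. It is an illustration under those assumptions, not the Repeatfinder implementation, and the function names are hypothetical. A lookahead is used so that overlapping instances such as RSSGRS and RSSGRE are both counted, which matches the four instances listed for that gene.

```python
import re
from typing import List


def find_instances(sequence: str, pattern: str) -> List[str]:
    """Return all instances of pattern in sequence, including overlapping ones."""
    # The zero-width lookahead lets the scan restart at every position.
    return [m.group(1) for m in re.finditer(f"(?=({pattern}))", sequence)]


def coverage_percent(n_instances: int, instance_len: int, seq_len: int) -> float:
    """Assumed coverage formula: residues in instances as a percentage of the sequence."""
    return 100 * n_instances * instance_len / seq_len


# Fragment of the LOC130828720 sequence that contains the repeat region.
fragment = "EYSRSSSRGRGSRSSSRGRSSGRSSGRETQ"
instances = find_instances(fragment, "RSS[GS]R[GES]")
print(instances)

# The full protein is reported as 606 amino acids long.
print(round(coverage_percent(len(instances), 6, 606), 2))
```

The same formula gives 25.5 for 14 instances of length 10 in 549 amino acids, in line with the value reported for LOC130828702.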

Similar gene clusters

NC_080047 - Cluster 3 - Cyclopeptide

Gene cluster description

NC_080047 - Gene Cluster 3. Type = cyclopeptide. Location: 7426161 - 7916798 nt.
pHMM detection rules used:
plants/cyclopeptide: (BURP)

Repeatfinder output


Repeat found in LOC130828702
Repeat occurs 14 times in a sequence of 549 amino acids
Location between 7650796 and 7655345
Coverage of 25.5 %
Instances:
GEQSPSIGAI | GEQSPSIGAI | GEQSLFINAI | GEQSPSIGAI | GEQSPSIGAV
GEQSPFIDAI | GEQSPSIGAI | GEQSLFIDAI | GEQSPSIGAI | GEQSLFIDAI
GEQSPSIGAI | GEQSPSIGAI | GEQSPSVGAI | GEQSPSIAAI |
pattern: GEQS[PL][FS][IV][GADN]A[IV]
The following known motifs were found:
.L.Y..Y was found 13 times in this sequence
MNKALLLLDMEGYIRILFTILFSVMLLASQIDGRKDLKAYWDSMMDGQDMPQAIKNYINEKSY
EGHQKGDYHSYTINPKQSPSRGAISKEFNNEFDPKPNRLKYSTYTVNGEQSPSIGAIRKDFNNE
FDAKPNRLKYGTYTINGEQSPSIGAIRKDFNNEFDPKPNRLKYGTYTINGEQSLFINAISKELN
KDFDPKPNRLKYGTYTMKGEQSPSIGAIRKDFNDEFDPKPNRLKYGTYTMNGEQSPSIGAVRKD
FNNEFDPKPNRLKYGTYTINGEQSPFIDAISKELNKDFDPKPNRLKYGTYTMNGEQSPSIGAIR
KDFNNEFDPKPNRLKYGTYTINGEQSLFIDAISKELNKDFDPKPNRLKYGTYTMNGEQSPSIGA
IRKDFNNEFDPKPNRLKYGTYTINGEQSLFIDAISKELNKDFDPKPNRLKYSTYTMNGEQSPSI
GAIRKDFNNEFDPKPNRLKYGTYTMNGEQSPSIGAIRKDFNNEFDPKPNRLKYGTYTFNGEQSP
SVGAIRKDFNNEFDSKPSPFLYDSYAMNGEQSPSIAAI
Repeat found in LOC130828720
Repeat occurs 4 times in a sequence of 606 amino acids
Location between 7680439 and 7682421
Coverage of 3.96 %
Instances:
RSSSRG | RSSSRG | RSSGRS | RSSGRE |
pattern: RSS[GS]R[GES]
MIVGSSRKPGERPWTVRVCPGVKGKHNHPFLVYRDGHVRARINVDIREHIRQLSATGMQPAFI
MNSIRDNFPGFYASMNQIYNIRQSIRRDEMEGRTLLQHCLHMATEHNYVVWTELDNDGHLSRLL
IANPTSIQMIRTWPYVVLIDTTYKTNKSKWPLCEVIGMTPTNHNFLVAFCLMRDEAAVSYSWVL
QGLRDIFGSAQTPSVIVTDRDEGLSAAIRDVFPDVRHLLCTWHIGNDVENMVDKLCGGKKNQQG
QLFRKSRWNPLVESATIREYEKRWEGIVSTWSVRNRRVVRYLTGTWIPLREKFVRAWTNDGLHF
GNHTTSRVESQHSSFKYYLGSGNSSFDTLFKRAHAQITNQQAKIRQSLQESMTSVPRMLRQYFF
RPLYRHVSLYALEQIQNEFNRMLELGDFALNKCGCVLLKTHGLPCACYLQIKIGSHGALYLDDI
HEFWSTLRYTEVGDEPNEEVRNANANDKEYFQSLVDEVLKSDPAFVRRMAEVLEYELHPDGADI
PEPYASPPRKGRPSTSKTMRRRKSSFEYSRSSSRGRGSRSSSRGRSSGRSSGRETQSSVGIKFS
FNLSDDPGGHDVSQFPWPDYIPFMLPPYLFD
Repeat found in LOC130828723
Repeat occurs 11 times in a sequence of 383 amino acids
Location between 7689900 and 7694839
Coverage of 25.85 %
Instances:
QSPSIAAVS | QSPSVAAIR | QSPSVAVIR | QSPSVTAIR | QSPSVAAIR
QSPSVAKTT | QSPSVAAIR | QSPSVAAIR | QSPSVAKTT | QSPSVAAIR
QSPSVAAKK |
pattern: QSPS[IV][AT][KAV][KTIV][KTRS]
MNKALLMLDMEGYIRILFTILLSVMLLASQIDGRKDPKAYWEAMMEGQAMPQAIKNYINEKGD
YYTDKINLKQSPSIAAVSKEFNNDFDLKPNLLLYPKYGEQSPSVAAIRKDFNNDFDPKSNWFYY
PKYGEQSPSVAVIRKDFNNDFDPKSNWFYYPKYGEQSPSVTAIRKDFNNDFDPKSNWFYYPKYG
EQSPSVAAIRKDFNNDFDPKSNWFYYPKHGEQSPSVAKTTRKDFNNDFDPKSNWFYYPKYGEQS
PSVAAIRKDFNNDFDPKSNWFYYPKYGEQSPSVAAIRKDFNNDFDPKSNWFYYPKHGEQSPSVA
KTTRKDFNNDFDPKSNWFYYPKYGEQSPSVAAIRKDFNNDFDPKSNWFYYPKYGEQSPSVAAKK
Repeat found in LOC130828734
Repeat occurs 4 times in a sequence of 850 amino acids
Location between 7758213 and 7766148
Coverage of 2.82 %
Instances:
EKPVEE | EKPDKE | EKPDEK | EKPDAV |
pattern: EKP[DV][KAE][KEV]
MKKFHANPSTNIELKDLSELSVGDPDARQEIMEFLDHWGLINYHPLPQSDPAKDNADPNTDAE
MAENTDSLLEKLYRFENEQSSLQLVPRAKMSTATVPSGLFPESIAEELVKQEGPAVEYHCNSCS
ADCSRKRYHCQKQADFDLCTECYNNGKFGSGMCPSDFILMEPAEASGTTGGKWTDQETLLLLEA
LELFKENWNEIAEHVATKTKAQCILHFLQMPIEDSFLDYDDKINGVQENGEPTSNIESPVPNDI
SEPSEGKASKDNSGAAEGVSGNDDSEPSAGKINKDKPELEMKTDANAAQAVSVEAETLNFEEAP
DGNVVHDKGDDIVIKALKEAFHIAGCPLTPEDKLSFAEAGNSVMTLAAFLSQLVEPGLATASAH
DSLKTMSHSSPGLQLAERHCFVLEDPPDVEKELIASESMTTENIDQNASEMAKENLKNDKTSIR
NEAKDNPLSVEKHSTESQIDKKKASKSINDPTLPPRELDACADNSRLPEKPVEETLLQVTEKPD
KESTQITEKPDEKIPKVSEKTDIETQLKGKPDKETPQVTEKSEEKSQVTEKSDEEKSEVAEKPD
AVSKVEHADTKLVDELKSSDVQNEQLDTTKEQVSSATEVRPPSVCEPENLKTTTESVQCTDSQK
DKDMVLSSVVSDKTGQRDVPAAVPMDAFVADAGEDKMEVDKSEDTKSMKKIRTKEEDNVAKLKR
AALATLSAAAVKAKLLANQEEGEIRQLAIVMIEKQMRKLEIKLSYFTEMESAIMRVREHLERSR
QKLFSERAQIIASRLGTLPPSRMLPSSYPMNKMPPGFVNSMMRPPLSMGAQRPFISRPVATSAP
PTMNTSAPPEKTGPEGKPT
Repeat found in LOC130828829
Repeat occurs 3 times in a sequence of 301 amino acids
Location between 7884613 and 7887115
Coverage of 5.98 %
Instances:
GIGLSI | GIGIGV | GIGVGL |
pattern: GIG[VIL][GS][LIV]
MMDSSYGAVNGNGYLGNNGHLPSRLASPRLSWLDLRVFYVRISKCEIDDSTPEHLTLNHIPLD
PNTLLEVNGVRASIYSDGVSTLLRRDRLDKKSEEATFVSTDSIRITGSVKFEVFDRGVLVLSGS
LNKCDSNGFIDGDDSDHASRIWSINCETDATMGTGFLKGKQLIGSDLVSPCIEVYVAGSFLGSP
TILTKSLNLNSRRKHMRMAMLNSIPEHDTTEGEKESPNQLPVQVEHEGYNYHHPGSEYFEGEDG
ELSFFNAGVRVGVGIGLSICLGIGIGVGLLVKTYQGTTRTFRRRFF

Similar gene clusters

NC_080047 - Cluster 4 - Saccharide

Gene cluster description

NC_080047 - Gene Cluster 4. Type = saccharide. Location: 9659197 - 9781682 nt.
pHMM detection rules used:
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Similar gene clusters

NC_080047 - Cluster 5 - Transporter_associated

Gene cluster description

NC_080047 - Gene Cluster 5. Type = transporter_associated. Location: 42261614 - 42644789 nt.
pHMM detection rules used:
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Similar gene clusters

NC_080048 - Cluster 6 - Saccharide

Gene cluster description

NC_080048 - Gene Cluster 6. Type = saccharide. Location: 2678249 - 2889925 nt.
pHMM detection rules used:
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Similar gene clusters

NC_080048 - Cluster 7 - Saccharide

Gene cluster description

NC_080048 - Gene Cluster 7. Type = saccharide. Location: 3023724 - 3128925 nt.
pHMM detection rules used:
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Similar gene clusters

NC_080048 - Cluster 8 - Terpene-saccharide

Gene cluster description

NC_080048 - Gene Cluster 8. Type = terpene-saccharide. Location: 4621979 - 4739624 nt.
pHMM detection rules used:
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Similar gene clusters

NC_080048 - Cluster 9 - Alkaloid

Gene cluster description

NC_080048 - Gene Cluster 9. Type = alkaloid. Location: 31683136 - 31807546 nt.
pHMM detection rules used:
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Similar gene clusters

NC_080048 - Cluster 10 - Phenolamide-alkaloid-transporter_associated

Gene cluster description

NC_080048 - Gene Cluster 10. Type = phenolamide-alkaloid-transporter_associated. Location: 34627809 - 34872101 nt.
pHMM detection rules used:
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or 
minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Similar gene clusters

NC_080048 - Cluster 11 - Saccharide

Gene cluster description

NC_080048 - Gene Cluster 11. Type = saccharide. Location: 40682173 - 40752510 nt.
pHMM detection rules used:
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
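The rule above can be read as a threshold test on the domains found in the cluster. The sketch below is a minimal illustration in Python, not plantiSMASH's own evaluator. It assumes that minimum(n, [allowed], [core]) means at least n distinct domains from the allowed list, that "/" in the core list separates alternatives while "," separates requirements that must all be met, and that an empty core list adds no requirement. The names parse_core and rule_matches are hypothetical, and the allowed list is abbreviated.

```python
def parse_core(core_text):
    """Split a core-domain string into groups of alternatives.

    Assumption: ',' separates groups that must all be present,
    '/' separates alternatives within one group.
    """
    groups = []
    for part in core_text.split(","):
        names = {name.strip() for name in part.split("/") if name.strip()}
        if names:
            groups.append(names)
    return groups


def rule_matches(found, minimum, allowed, core_text):
    """Return True if the cluster's domains satisfy the (assumed) rule."""
    found = set(found)
    # Count distinct domain types that appear in the allowed list.
    if len(found & set(allowed)) < minimum:
        return False
    # Every core group needs at least one representative in the cluster.
    return all(found & group for group in parse_core(core_text))


# Abbreviated allowed list; the full rule lists many more domains.
ALLOWED = ["Aldo_ket_red", "Amino_oxidase", "Glycos_transf_1", "p450", "UDPGT"]
SACCHARIDE_CORE = ("Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/"
                   "UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt")

# Core domains listed for Cluster 11 in the summary table.
cluster_11 = {"Aldo_ket_red", "Amino_oxidase", "Glycos_transf_1"}
cluster_11_ok = rule_matches(cluster_11, 3, ALLOWED, SACCHARIDE_CORE)
```

Under these assumptions Cluster 11 passes: three distinct allowed domains are present, and Glycos_transf_1 satisfies the core requirement.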

Similar gene clusters

NC_080049 - Cluster 12 - Phenolamide

Gene cluster description

NC_080049 - Gene Cluster 12. Type = phenolamide. Location: 2022006 - 2128017 nt.
pHMM detection rules used:
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or 
minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))

Similar gene clusters

NC_080049 - Cluster 13 - Putative

Gene cluster description

NC_080049 - Gene Cluster 13. Type = putative. Location: 24728208 - 25052655 nt.
pHMM detection rules used:
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Similar gene clusters

NC_080049 - Cluster 14 - Cyclopeptide

Gene cluster description

NC_080049 - Gene Cluster 14. Type = cyclopeptide. Location: 36070570 - 36489565 nt.
pHMM detection rules used:
plants/cyclopeptide: (BURP)

Repeatfinder output


The following known motifs were found in CDS LOC130807548
Location between 36344044 and 36347316
.L.Y..Y was found 2 times in this sequence
Sequence:
MISKLEQEKHKNPTKMEGNEEETVTISLLQNSTNPTIKNNIPRDSYNLAYIIYFILGSGYLLP
WNAFITAVDYFQYLYPDCSVDRIFSVVSQLIMLTTVLFLVFCFSKSHAYVRINLGLGLFLLSLL
VVPFMDVVYIKGQSGLYLGFYITVGAVAVSAMANGLVQASLIGSAGELPDRYIQALFCGTAGSG
VLVSLLRIFTKALYPQGTRSLRNSAFLYFIVTNIFMIICIILYNTVHKLPVIKHYTELKTQAAA
ANIDKQEQLSLPGRILEIIGKVKWYGIGVMLIYIVTLSIFPGFITEDVHSEFLGNWYGILLITC
YNVFDLIGKSLTAVYLLDDANAAIAACFMRFLFYPLYLGCLHGPRVFQTEIPVMVLTCLLGLTN
GYFTSVLMILVQKPVRLQYAETAGILTALFLIVGLAIGSIVSWFWVI
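A motif such as .L.Y..Y can be read as a pattern in which "." stands for any residue. The sketch below is an illustration only: it assumes that reading and uses Python's re module with a lookahead so that overlapping hits are reported as well. Whether Repeatfinder counts overlapping hits is not stated in this output. find_motif is a hypothetical helper, and the fragment is taken from the first line of the sequence above.

```python
import re


def find_motif(sequence, motif):
    """Return (0-based start, matched text) for every hit, overlaps included."""
    pattern = re.compile("(?=(" + motif + "))")
    return [(m.start(), m.group(1)) for m in pattern.finditer(sequence)]


# Fragment of the LOC130807548 sequence (end of the first line of the listing).
fragment = "PRDSYNLAYIIYFILGSGYLLP"
hits = find_motif(fragment, ".L.Y..Y")
```

In this fragment the only hit is NLAYIIY, starting at offset 5 of the fragment.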

Similar gene clusters

NC_080050 - Cluster 15 - Putative

Gene cluster description

NC_080050 - Gene Cluster 15. Type = putative. Location: 28798648 - 28936835 nt.
pHMM detection rules used:
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Similar gene clusters

NC_080050 - Cluster 16 - Putative

Gene cluster description

NC_080050 - Gene Cluster 16. Type = putative. Location: 30420211 - 30525920 nt.
pHMM detection rules used:
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Similar gene clusters

NC_080051 - Cluster 17 - Terpene

Gene cluster description

NC_080051 - Gene Cluster 17. Type = terpene. Location: 23582755 - 23704483 nt.
pHMM detection rules used:
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Similar gene clusters

Similar known gene clusters

NC_080051 - Cluster 18 - Saccharide

Gene cluster description

NC_080051 - Gene Cluster 18. Type = saccharide. Location: 24264873 - 24522446 nt.
pHMM detection rules used:
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Similar gene clusters

NC_080051 - Cluster 19 - Alkaloid

Gene cluster description

NC_080051 - Gene Cluster 19. Type = alkaloid. Location: 26310822 - 26435683 nt.
pHMM detection rules used:
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Similar gene clusters

NC_080052 - Cluster 20 - Terpene-lignan

Gene cluster description

NC_080052 - Gene Cluster 20. Type = terpene-lignan. Location: 30959023 - 31071846 nt.
pHMM detection rules used:
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Similar gene clusters

NC_080054 - Cluster 21 - Polyketide-transporter_associated

Gene cluster description

NC_080054 - Gene Cluster 21. Type = polyketide-transporter_associated. Location: 27100083 - 27283548 nt.
pHMM detection rules used:
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or 
minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Similar gene clusters

NC_080054 - Cluster 22 - Cyclopeptide

Gene cluster description

NC_080054 - Gene Cluster 22. Type = cyclopeptide. Location: 28439267 - 28869696 nt.
pHMM detection rules used:
plants/cyclopeptide: (BURP)

Repeatfinder output

Similar gene clusters

NC_080054 - Cluster 23 - Alkaloid

Gene cluster description

NC_080054 - Gene Cluster 23. Type = alkaloid. Location: 29040116 - 29128421 nt.
pHMM detection rules used:
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Similar gene clusters

NC_080055 - Cluster 24 - Cyclopeptide

Gene cluster description

NC_080055 - Gene Cluster 24. Type = cyclopeptide. Location: 8362507 - 9111747 nt.
pHMM detection rules used:
plants/cyclopeptide: (BURP)

Repeatfinder output


Repeat found in LOC130823210
Repeat occurs 9 times in a sequence of 295 amino acids
Location between 8797760 and 8798722
Coverage of 39.66 %
Instances:
DDDDDDDDDDDEN | DDDDDDDDDDENE | DDDDDDDDDENEG | DDDDDDDDENEGN | DDDDDDDENEGND
DDDDDDENEGNDQ | DDDDDENEGNDQD | DDDDENEGNDQDD | DDDENEGNDQDDQ |
pattern: DDD[ED][NED][NED][NEDG][NEDG][NEDG][EQNDG][EQNDG][EQNDG][EQNDG]
MFHHHHQSHNVNSNIDIYSTPIRGESLDENEMEYDDDDDDDDDDDENEGNDQDDQYGEDGGNS
VHHVDASLYTQPSFQDLVSQFGSPPSFTQLVQPSQQLPNNNDEAPIPSKKSWDLIEDIALICSV
MNTSTDPIVSTNQKIRVRWQKVKEAYEAARMERPHLIPRRTADMLKCRWGRVALACLKWSGSYD
EALRRKKSGTTNEDVLKEAHLIHQRKHGNFNLIEQWKILTNSKSSGKRSRTEEDSETPTSEPQG
GSSTRPEGVKKAKARMTGKMVADQSIQALSAFGESLRLNS
Repeat found in LOC130823652
Repeat occurs 5 times in a sequence of 289 amino acids
Location between 9000587 and 9005003
Coverage of 15.57 %
Instances:
QSTSGQSTS | QSTSGQSTS | QSTSGQSTS | QSTSKQSTS | QSTSRQLTD

pattern: QSTS[KGR]Q[LS]T[DS]
MISTQLNTVACPKGCLISIHNKRGLHSLPLVPATFVFCPPLKFPGPIKVTSNAFKWKQSVPVC
SSGGQARAGNGNQEPLWKSFGKAFENFGKKSAVDVLKRQIEKREYYDEGGIPPGGKGGGGSGGS
GGFGGLGGSEEEGSSGVLDETIQVVMATLGLIFLYVYIIRGAEMRLLAKDFIKYIFGGKPSIRL
SRVLKKWERIFKSSRGKEIMVDRYWLERAIILTPTWWDHPEKYKRILIQSIQSTSGQSTSGQST
SGQSTSKQSTSRQLTDPEYDERGYDLGYMSEEEY
Repeat found in LOC130823657
Repeat occurs 3 times in a sequence of 226 amino acids
Location between 9069686 and 9075245
Coverage of 7.96 %
Instances:
EDEPVS | EDEDEP | EDEPEP |
pattern: EDE[PD][EV][PS]
MEEWEDEPVSLILKKDQPRSNWDDEDVDDDGVKESWEDEDEPEPAPAPVKALEKPAKKTGSKE
TGKKGKTEVVEDIVLDPVAEKLRQQRLVEEADYKSTTELFAGTSDEKSNKIKSLDNIIPKSEND
FLEYAELVSQRLRLHEKSYHYIGLLKAVMRLSMTSLKASDAKEVASSITAIANEKLKAEKEANA
GKKKTGLKKKQLHVDKPDDDLVVNAYDDIDDYDFM
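The reported patterns use bracketed residue classes, which map directly onto regular-expression character classes, and the coverage figures above are consistent with occurrences x pattern length / sequence length (for example 5 x 9 / 289 = 15.57 %). The sketch below illustrates both points under those assumptions; it is not Repeatfinder's own code. find_instances and coverage_percent are hypothetical names, and the fragment is the C-terminal end of the LOC130823652 sequence with its wrapped lines joined.

```python
import re


def find_instances(sequence, pattern):
    """Return every match of the pattern, including overlapping ones."""
    lookahead = re.compile("(?=(" + pattern + "))")
    return [m.group(1) for m in lookahead.finditer(sequence)]


def coverage_percent(occurrences, repeat_length, sequence_length):
    """Coverage as occurrences x repeat length / sequence length, in percent."""
    return round(100.0 * occurrences * repeat_length / sequence_length, 2)


# C-terminal fragment of LOC130823652 (wrapped lines joined).
fragment = "IQSTSGQSTSGQSTSGQSTSKQSTSRQLTDPEYDERGYDLGYMSEEEY"
instances = find_instances(fragment, "QSTS[KGR]Q[LS]T[DS]")

# 289 is the full sequence length reported above, not the fragment length.
coverage = coverage_percent(len(instances), len(instances[0]), 289)
```

The same formula reproduces the other reported values, for instance 9 x 13 / 295 for LOC130823210 and 3 x 6 / 226 for LOC130823657.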

Similar gene clusters

NC_080055 - Cluster 25 - Fatty_acid

Gene cluster description

NC_080055 - Gene Cluster 25. Type = fatty_acid. Location: 25590027 - 25750196 nt.
pHMM detection rules used:
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or 
minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Similar gene clusters

NC_080056 - Cluster 26 - Alkaloid

Gene cluster description

NC_080056 - Gene Cluster 26. Type = alkaloid. Location: 5284891 - 5461530 nt.
pHMM detection rules used:
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Similar gene clusters

NC_080056 - Cluster 27 - Saccharide

Gene cluster description

NC_080056 - Gene Cluster 27. Type = saccharide. Location: 5959438 - 6129503 nt.
pHMM detection rules used:
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Similar gene clusters

NC_080056 - Cluster 28 - Cyclopeptide

Gene cluster description

NC_080056 - Gene Cluster 28. Type = cyclopeptide. Location: 7860427 - 8400765 nt.
pHMM detection rules used:
plants/cyclopeptide: (BURP)

Repeatfinder output


Repeat found in LOC130825202
Repeat occurs 3 times in a sequence of 909 amino acids
Location between 8065009 and 8090965
Coverage of 2.64 %
Instances:
RDRDRDRE | RDRDRERE | RDRERELG |
pattern: RDR[ED]R[ED][LR][GE]
MDATLDLKNCIERRRMRDRDRDRERELGLAASGRSREHQRVKVYRLNEDGKWEDLGTGHVTVD
YVEKSEELGLFVIDEEDNETLLLHRITLDDIYQKQEDTIISWRDSEYSTELALSFQEPAGCSYI
WDHICNVQRNLHFSNLNHDTFHSVNSELRELPHVELSTLPLILKVIFVASFHIYVLQIVLESGV
TDQMRVSELILHDQEFFRKLMNLFRISEDLENLDSLHIIYKIVKGILMLNNPQIFEKVFGDEFI
MDVIGCLEYDPDVSHVLHRDFLKEHVFFKEAIPIKDPMVLSKIHQTYRISYLKDMVLPRVLDDT
TVASLNSIVHTNNGVVVSLLKEDSTFIQELFSRLRSPSTSVESKKNLVYFLNEFCTLSKSLQIV
QQLRLFRDLVNEGLFDIISDILQSEDKRLVLTGTDILVFLLNQDPNLLRGYVVRPEGTPLLGLL
VQGMLTDFGDDMHCQFLEVLRTLLDSYTLPGSQRDSIIEIFYERHFGQLIDVITSSCSGVGKWI
ATAPMENFVKPSYTKPEILLSICELLCFCVVHHPHRIKCNFLLNNSMDKVLYLTRRREKYLVAA
AVRFFRTVVARNDENINNYIITQNLLKPIVDAFVANGIRYNLLNSAVLELFEYIHKEANMNLFK
YLVVSFWSQLSKFENLASIHSLKVRYAQYLETVGTKSVVTDLKKRVEERVMEKEKEDYFNDDSD
EEDTASASMLHNQRAQVHPVSSNGSAPSTTLSSKPPGVVDYDDDEDDEDCKPPPRRQTGASDGD
VSTLELSSVKRKVAAEEEPGLIKKQRLDKNSKLKENIFAAPCSTISQAVLPNKDAASIVHNSSH
SASTKTTNCNHQQDNEADLARDHIVQNSSDDENRSEQNIVPQNNSDTLQKRQENGQLKGEEHPH
VPANTSSEMAVNGS
Repeat found in LOC130825202
Repeat occurs 3 times in a sequence of 897 amino acids
Location between 8065009 and 8090965
Coverage of 2.68 %
Instances:
RDRDRDRE | RDRDRERE | RDRERELG |
pattern: RDR[ED]R[ED][LR][GE]
MDATLDLKNCIERRRMRDRDRDRERELGLAASGRSREHQRVKVYRLNEDGKWEDLGTGHVTVD
YVEKSEELGLFVIDEEDNETLLLHRITLDDIYQKQEDTIISWRDSEYSTELALSFQEPAGCSYI
WDHICNVQRNLHFSNLNHDTFHSVNSELRELPHVELSTLPLILKQIVLESGVTDQMRVSELILH
DQEFFRKLMNLFRISEDLENLDSLHIIYKIVKGILMLNNPQIFEKVFGDEFIMDVIGCLEYDPD
VSHVLHRDFLKEHVFFKEAIPIKDPMVLSKIHQTYRISYLKDMVLPRVLDDTTVASLNSIVHTN
NGVVVSLLKEDSTFIQELFSRLRSPSTSVESKKNLVYFLNEFCTLSKSLQIVQQLRLFRDLVNE
GLFDIISDILQSEDKRLVLTGTDILVFLLNQDPNLLRGYVVRPEGTPLLGLLVQGMLTDFGDDM
HCQFLEVLRTLLDSYTLPGSQRDSIIEIFYERHFGQLIDVITSSCSGVGKWIATAPMENFVKPS
YTKPEILLSICELLCFCVVHHPHRIKCNFLLNNSMDKVLYLTRRREKYLVAAAVRFFRTVVARN
DENINNYIITQNLLKPIVDAFVANGIRYNLLNSAVLELFEYIHKEANMNLFKYLVVSFWSQLSK
FENLASIHSLKVRYAQYLETVGTKSVVTDLKKRVEERVMEKEKEDYFNDDSDEEDTASASMLHN
QRAQVHPVSSNGSAPSTTLSSKPPGVVDYDDDEDDEDCKPPPRRQTGASDGDVSTLELSSVKRK
VAAEEEPGLIKKQRLDKNSKLKENIFAAPCSTISQAVLPNKDAASIVHNSSHSASTKTTNCNHQ
QDNEADLARDHIVQNSSDDENRSEQNIVPQNNSDTLQKRQENGQLKGEEHPHVPANTSSEMAVN
GS
Repeat found in LOC130825202
Repeat occurs 3 times in a sequence of 896 amino acids
Location between 8065009 and 8090965
Coverage of 2.68 %
Instances:
RDRDRDRE | RDRDRERE | RDRERELG |
pattern: RDR[ED]R[ED][LR][GE]
MDATLDLKNCIERRRMRDRDRDRERELGLAASGRSREHQRVKVYRLNEDGKWEDLGTGHVTVD
YVEKSEELGLFVIDEEDNETLLLHRITLDDIYQKQEDTIISWRDSEYSTELALSFQEPAGCSYI
WDHICNVQRNLHFSNLNHDTFHSVNSELRELPHVELSTLPLILKIVLESGVTDQMRVSELILHD
QEFFRKLMNLFRISEDLENLDSLHIIYKIVKGILMLNNPQIFEKVFGDEFIMDVIGCLEYDPDV
SHVLHRDFLKEHVFFKEAIPIKDPMVLSKIHQTYRISYLKDMVLPRVLDDTTVASLNSIVHTNN
GVVVSLLKEDSTFIQELFSRLRSPSTSVESKKNLVYFLNEFCTLSKSLQIVQQLRLFRDLVNEG
LFDIISDILQSEDKRLVLTGTDILVFLLNQDPNLLRGYVVRPEGTPLLGLLVQGMLTDFGDDMH
CQFLEVLRTLLDSYTLPGSQRDSIIEIFYERHFGQLIDVITSSCSGVGKWIATAPMENFVKPSY
TKPEILLSICELLCFCVVHHPHRIKCNFLLNNSMDKVLYLTRRREKYLVAAAVRFFRTVVARND
ENINNYIITQNLLKPIVDAFVANGIRYNLLNSAVLELFEYIHKEANMNLFKYLVVSFWSQLSKF
ENLASIHSLKVRYAQYLETVGTKSVVTDLKKRVEERVMEKEKEDYFNDDSDEEDTASASMLHNQ
RAQVHPVSSNGSAPSTTLSSKPPGVVDYDDDEDDEDCKPPPRRQTGASDGDVSTLELSSVKRKV
AAEEEPGLIKKQRLDKNSKLKENIFAAPCSTISQAVLPNKDAASIVHNSSHSASTKTTNCNHQQ
DNEADLARDHIVQNSSDDENRSEQNIVPQNNSDTLQKRQENGQLKGEEHPHVPANTSSEMAVNG
S
Repeat found in LOC130825202
Repeat occurs 3 times in a sequence of 908 amino acids
Location between 8065009 and 8090965
Coverage of 2.64 %
Instances:
RDRDRDRE | RDRDRERE | RDRERELG |
pattern: RDR[ED]R[ED][LR][GE]
MDATLDLKNCIERRRMRDRDRDRERELGLAASGRSREHQRVKVYRLNEDGKWEDLGTGHVTVD
YVESEELGLFVIDEEDNETLLLHRITLDDIYQKQEDTIISWRDSEYSTELALSFQEPAGCSYIW
DHICNVQRNLHFSNLNHDTFHSVNSELRELPHVELSTLPLILKVIFVASFHIYVLQIVLESGVT
DQMRVSELILHDQEFFRKLMNLFRISEDLENLDSLHIIYKIVKGILMLNNPQIFEKVFGDEFIM
DVIGCLEYDPDVSHVLHRDFLKEHVFFKEAIPIKDPMVLSKIHQTYRISYLKDMVLPRVLDDTT
VASLNSIVHTNNGVVVSLLKEDSTFIQELFSRLRSPSTSVESKKNLVYFLNEFCTLSKSLQIVQ
QLRLFRDLVNEGLFDIISDILQSEDKRLVLTGTDILVFLLNQDPNLLRGYVVRPEGTPLLGLLV
QGMLTDFGDDMHCQFLEVLRTLLDSYTLPGSQRDSIIEIFYERHFGQLIDVITSSCSGVGKWIA
TAPMENFVKPSYTKPEILLSICELLCFCVVHHPHRIKCNFLLNNSMDKVLYLTRRREKYLVAAA
VRFFRTVVARNDENINNYIITQNLLKPIVDAFVANGIRYNLLNSAVLELFEYIHKEANMNLFKY
LVVSFWSQLSKFENLASIHSLKVRYAQYLETVGTKSVVTDLKKRVEERVMEKEKEDYFNDDSDE
EDTASASMLHNQRAQVHPVSSNGSAPSTTLSSKPPGVVDYDDDEDDEDCKPPPRRQTGASDGDV
STLELSSVKRKVAAEEEPGLIKKQRLDKNSKLKENIFAAPCSTISQAVLPNKDAASIVHNSSHS
ASTKTTNCNHQQDNEADLARDHIVQNSSDDENRSEQNIVPQNNSDTLQKRQENGQLKGEEHPHV
PANTSSEMAVNGS
Repeat found in LOC130825202
Repeat occurs 3 times in a sequence of 895 amino acids
Location between 8065009 and 8090965
Coverage of 2.68 %
Instances:
RDRDRDRE | RDRDRERE | RDRERELG |
pattern: RDR[ED]R[ED][LR][GE]
MDATLDLKNCIERRRMRDRDRDRERELGLAASGRSREHQRVKVYRLNEDGKWEDLGTGHVTVD
YVESEELGLFVIDEEDNETLLLHRITLDDIYQKQEDTIISWRDSEYSTELALSFQEPAGCSYIW
DHICNVQRNLHFSNLNHDTFHSVNSELRELPHVELSTLPLILKIVLESGVTDQMRVSELILHDQ
EFFRKLMNLFRISEDLENLDSLHIIYKIVKGILMLNNPQIFEKVFGDEFIMDVIGCLEYDPDVS
HVLHRDFLKEHVFFKEAIPIKDPMVLSKIHQTYRISYLKDMVLPRVLDDTTVASLNSIVHTNNG
VVVSLLKEDSTFIQELFSRLRSPSTSVESKKNLVYFLNEFCTLSKSLQIVQQLRLFRDLVNEGL
FDIISDILQSEDKRLVLTGTDILVFLLNQDPNLLRGYVVRPEGTPLLGLLVQGMLTDFGDDMHC
QFLEVLRTLLDSYTLPGSQRDSIIEIFYERHFGQLIDVITSSCSGVGKWIATAPMENFVKPSYT
KPEILLSICELLCFCVVHHPHRIKCNFLLNNSMDKVLYLTRRREKYLVAAAVRFFRTVVARNDE
NINNYIITQNLLKPIVDAFVANGIRYNLLNSAVLELFEYIHKEANMNLFKYLVVSFWSQLSKFE
NLASIHSLKVRYAQYLETVGTKSVVTDLKKRVEERVMEKEKEDYFNDDSDEEDTASASMLHNQR
AQVHPVSSNGSAPSTTLSSKPPGVVDYDDDEDDEDCKPPPRRQTGASDGDVSTLELSSVKRKVA
AEEEPGLIKKQRLDKNSKLKENIFAAPCSTISQAVLPNKDAASIVHNSSHSASTKTTNCNHQQD
NEADLARDHIVQNSSDDENRSEQNIVPQNNSDTLQKRQENGQLKGEEHPHVPANTSSEMAVNGS
Repeat found in LOC130825208
Repeat occurs 6 times in a sequence of 262 amino acids
Location between 8135393 and 8136986
Coverage of 34.35 %
Instances:
NPIVLMHDGKAISTM | NPIVLTHDDKATSTK | NPIVLTHDGKAISTM | NPIVLTHDGKATSTK | NPIVLTHKRKDISTK | NPIVLTHNRKVIFTK |
pattern: NPIVL[TM]H[KND][GDR]K[ADV][TI][FS]T[KM]
MRLFITLLLAVFLHVSSTNGRKDAGEYWGEKAIEALGMNLELNEEGNPLMKKPISISLVKDFG
PRHNPIVLMHDGKAISTMENNGLLTGQSFVMDFGPRYNPIVLTHDDKATSTKENNGVLTSQSFV
KNFGPRHNPIVLTHDGKAISTMKNNDLLTDQSFVKDFRPRHNPIVLTHDGKATSTKENNGLFTS
QSFVKDFAPHHNPIVLTHKRKDISTKEDNGLSITQSFVKDFKSRYNPIVLTHNRKVIFTKENNG
LSTSQSL
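
The coverage values reported above appear consistent with (number of occurrences x motif length) / protein length, expressed as a percentage. This formula is inferred from the reported numbers, not from the Repeatfinder documentation. The sketch below checks that reading and confirms that the listed instances match the reported pattern. The helper name repeat_coverage is hypothetical.

```python
import re


def repeat_coverage(occurrences: int, motif_length: int, protein_length: int) -> float:
    """Percentage of the protein covered by the repeat (assumed formula)."""
    return round(100.0 * occurrences * motif_length / protein_length, 2)


# Pattern and instances as reported for LOC130825208.
PATTERN = re.compile(r"NPIVL[TM]H[KND][GDR]K[ADV][TI][FS]T[KM]")
INSTANCES = [
    "NPIVLMHDGKAISTM",
    "NPIVLTHDDKATSTK",
    "NPIVLTHDGKAISTM",
    "NPIVLTHDGKATSTK",
    "NPIVLTHKRKDISTK",
    "NPIVLTHNRKVIFTK",
]

# Every listed instance should match the pattern in full.
all_match = all(PATTERN.fullmatch(instance) for instance in INSTANCES)

# Six 15-residue instances in a 262-residue protein.
coverage = repeat_coverage(len(INSTANCES), len(INSTANCES[0]), 262)
```

The same function reproduces the other reported values, for example three 8-residue instances in 909 residues and four 7-residue instances in 634 residues.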

Similar gene clusters

NC_080058 - Cluster 29 - Fatty_acid-saccharide

Gene cluster description

NC_080058 - Gene Cluster 29. Type = fatty_acid-saccharide. Location: 1059118 - 1165943 nt.
pHMM detection rules used:
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or 
minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Similar gene clusters

NC_080058 - Cluster 30 - Saccharide

Gene cluster description

NC_080058 - Gene Cluster 30. Type = saccharide. Location: 5833150 - 5905383 nt.
pHMM detection rules used:
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Similar gene clusters

NC_080058 - Cluster 31 - Cyclopeptide

Gene cluster description

NC_080058 - Gene Cluster 31. Type = cyclopeptide. Location: 15066258 - 16338881 nt.
pHMM detection rules used:
plants/cyclopeptide: (BURP)

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_080058 - Cluster 32 - Cyclopeptide

Gene cluster description

NC_080058 - Gene Cluster 32. Type = cyclopeptide. Location: 16405130 - 18079259 nt.
pHMM detection rules used:
plants/cyclopeptide: (BURP)

Repeatfinder output

Similar gene clusters

NC_080058 - Cluster 33 - Cyclopeptide

Gene cluster description

NC_080058 - Gene Cluster 33. Type = cyclopeptide. Location: 17565958 - 19396780 nt.
pHMM detection rules used:
plants/cyclopeptide: (BURP)
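
The reported locations of Gene Clusters 32 and 33 on NC_080058 overlap. A minimal sketch for parsing the description lines and measuring such an overlap is given below. It assumes that coordinates are 1-based and inclusive, which the report does not state. The names ClusterSpan, parse_description and overlap_nt are hypothetical.

```python
import re
from typing import NamedTuple


class ClusterSpan(NamedTuple):
    record: str
    number: int
    kind: str
    start: int
    end: int


# Matches lines such as
# "NC_080058 - Gene Cluster 32. Type = cyclopeptide. Location: 16405130 - 18079259 nt."
DESCRIPTION = re.compile(
    r"(?P<record>\S+) - Gene Cluster (?P<number>\d+)\. "
    r"Type = (?P<kind>[\w-]+)\. "
    r"Location: (?P<start>\d+) - (?P<end>\d+) nt\."
)


def parse_description(line: str) -> ClusterSpan:
    """Extract record, cluster number, type and coordinates from one line."""
    m = DESCRIPTION.match(line)
    if m is None:
        raise ValueError(f"unrecognised cluster description: {line!r}")
    return ClusterSpan(m["record"], int(m["number"]), m["kind"],
                       int(m["start"]), int(m["end"]))


def overlap_nt(a: ClusterSpan, b: ClusterSpan) -> int:
    """Shared nucleotides between two clusters (assumes inclusive coordinates)."""
    if a.record != b.record:
        return 0
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


c32 = parse_description(
    "NC_080058 - Gene Cluster 32. Type = cyclopeptide. Location: 16405130 - 18079259 nt.")
c33 = parse_description(
    "NC_080058 - Gene Cluster 33. Type = cyclopeptide. Location: 17565958 - 19396780 nt.")
shared = overlap_nt(c32, c33)
```

Under the inclusive-coordinate assumption, the two clusters share the region from 17565958 to 18079259, so genes in that window may be reported in both clusters.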

Repeatfinder output

Similar gene clusters

NC_080059 - Cluster 34 - Cyclopeptide

Gene cluster description

NC_080059 - Gene Cluster 34. Type = cyclopeptide. Location: 1 - 289362 nt.
pHMM detection rules used:
plants/cyclopeptide: (BURP)

Repeatfinder output


Repeat found in LOC130797649
Repeat occurs 4 times in a sequence of 634 amino acids
Location between 44693 and 46704
Coverage of 4.42 %
Instances:
DSFKNYS | DSFTSYG | DSFKSYG | DSFKGKQ |
pattern: DSF[KT][GNS][KY][GQS]
MSKLRILHFLHLLILLLFSLFLHVSIAEENKENPFTARASLIRYWNTHISNNLPKPKFLLSKA
SPLNAVETAVFAKLASTNALHTRLSSFCSAANLFCDFESSLQGQGHHGGREDANFASYNNKQFS
VYGGSRVGGVDSFKNYSDNINMGVNAFTRYSTNSAGHGETFSSYADNGNVANDSFTSYGAGGTG
GLGEFKSYQSNVNVPNLKFTSYSGNANNHKHSFVSYTEDTNSGTATFTSYGKNGNGVPNQFNNY
ATSSNIVASTFASYGQLGNAANDSFKSYGSSANNPHNTFKNYGSKVSSGIERFQNYRDSANVGD
DTFQNYLRVSTSTKATFINYGNSFNQGIDKFKGYGKAGVNRQIDFKTYGVNNSFSGYGDNKKGI
SFAGYSRRINFINTDNKNHHIHNKWRVDRDGDEGKFFRESMLKPGTIMKMPDIKDKMPKRSFLP
RVISSKLPFSSYKLAQLRDSFKGKQNSTLELLILNALGDCERPPSPNETKRCVASLEDMIDFTI
SVLGDNVVVRTTENVNGSTKRVVIGFVKGINGGEVTKSVSCHQSLLPYLLYYCHSVPKVRVYEA
EILDIESKAKINNGVAICHLDTSAWSAGHAAFLALGSGPGLIEVCHWIFENDMTWTVAH

Similar gene clusters

NC_080059 - Cluster 35 - Polyketide-saccharide-transporter_associated

Gene cluster description

NC_080059 - Gene Cluster 35. Type = polyketide-saccharide-transporter_associated. Location: 11332135 - 11725359 nt.
pHMM detection rules used:
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or 
minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Similar gene clusters

NC_080060 - Cluster 36 - Putative

Gene cluster description

NC_080060 - Gene Cluster 36. Type = putative. Location: 1859910 - 1925385 nt.
pHMM detection rules used:
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Similar gene clusters

NC_080060 - Cluster 37 - Fatty_acid

Gene cluster description

NC_080060 - Gene Cluster 37. Type = fatty_acid. Location: 7020404 - 7065947 nt.
pHMM detection rules used:
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or 
minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Similar gene clusters

NC_080060 - Cluster 38 - Transporter_associated

Gene cluster description

NC_080060 - Gene Cluster 38. Type = transporter_associated. Location: 7290914 - 7496923 nt.
pHMM detection rules used:
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Similar gene clusters

NC_080060 - Cluster 39 - Alkaloid

Gene cluster description

NC_080060 - Gene Cluster 39. Type = alkaloid. Location: 8877280 - 8966960 nt.
pHMM detection rules used:
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Similar gene clusters

NC_080061 - Cluster 40 - Putative

Gene cluster description

NC_080061 - Gene Cluster 40. Type = putative. Location: 4728933 - 4829249 nt.
pHMM detection rules used:
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Similar gene clusters

NC_080061 - Cluster 41 - Saccharide

Gene cluster description

NC_080061 - Gene Cluster 41. Type = saccharide. Location: 5001870 - 5122885 nt.
pHMM detection rules used:
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Similar gene clusters

NC_080061 - Cluster 42 - Alkaloid

Gene cluster description

NC_080061 - Gene Cluster 42. Type = alkaloid. Location: 5239698 - 5339337 nt.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Similar gene clusters

NC_080061 - Cluster 43 - Terpene-saccharide

Gene cluster description

NC_080061 - Gene Cluster 43. Type = terpene-saccharide. Location: 5733397 - 5856465 nt.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Similar gene clusters

NC_080062 - Cluster 44 - Polyketide

Gene cluster description

NC_080062 - Gene Cluster 44. Type = polyketide. Location: 2119190 - 2241392 nt.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or 
minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Similar gene clusters

NC_080062 - Cluster 45 - Cyclopeptide

Gene cluster description

NC_080062 - Gene Cluster 45. Type = cyclopeptide. Location: 6293856 - 6690347 nt.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Repeatfinder output


Repeat found in LOC130803225
Repeat occurs 18 times in a sequence of 772 amino acids
Location between 6488190 and 6496012
Coverage of 34.97 %
Instances:
TVGSWKYDVDKNKVV | TVGSWKYDADKNKVK | TVGSWKYDVDKNKVK | TVGSWRYDADKNKVE | TVGSWKYDADKNKVV
TVGSWKYDVDKNKVK | TVGSWKYDVDKNKVK | TVGSWRYDADTNKVK | TVGSWKYDADKNKVV | TVGSWKYDADKNKVK
TVGSWKYDADKNKVK | TVGSWKYDVDKNKVK | TVGSWRYDADTNKVK | TVGSWKYDADKNKVV | TVGSWKYDVDKNKVK
TVGSWKYDADKNKVK | TVGSWKYDVDKNEVK | TVGSWKYNENDESKQ |
pattern: TVGSW[KR]Y[ND][AEV][ND][KDT][EN][KES][KV][KEVQ]
MLSLLAHINRIPNSRSLHYLNSYFLPNMAMDLRLQVPALFLLTFLAIHVSSCKQEDYWKMKLP
KVPMPEAIKHSLLHSGGENKLKDEYTIKQPYTVGSWKYDVDKNKVVDESALKQPFTVGSWKYDA
DKNKVKDDSAFKQPFTVGSWKYDVDKNKVKDDSALKQPYTVGSWRYDADKNKVEDESALKQPYT
VGSWKYDADKNKVVDESALKQPYTVGSWKYDVDKNKVKDDSALKQPYTVGSWKYDVDKNKVKDD
SALKQPYTVGSWRYDADTNKVKDESALKQPFTVGSWKYDADKNKVVDESALKQPYTVGSWKYDA
DKNKVKDDSALKQPYTVGSWKYDADKNKVKDDSALKQPYTVGSWKYDVDKNKVKDDSALKQPYT
VGSWRYDADTNKVKDESALKQPFTVGSWKYDADKNKVVDESALKQPYTVGSWKYDVDKNKVKDD
SALKQPFTVGSWKYDADKNKVKDDSALRQPYTVGSWKYDVDKNEVKDDSALKQPYTVGSWKYNE
NDESKQASPHHLHHQKLMHENVNSNDKEDLTDGSVFFVEKSLHIGSKLKHDFQKTPEMSFLSKQ
EAQSIPFSMEKIGDILNLFSLKSNSAEANAIKGTLDICLYRPKVSKENRTCAQSMEDIVDFVVG
ELGTNEVEIKMMNNNIEVPNGIQDYLLSKVEKLFVPGNTAVACHRMSYPYIVYYCHHQQDIGQY
NVTLVSPSTGAAFQTTAVCHYDTYAWQPDVVALKYLGIRPGDAPVCHFSAINDMFWTLKDEPKS
LDMVQ
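The "Coverage" figure in each Repeatfinder block can be reproduced from the other reported numbers. The sketch below assumes coverage is simply occurrences times instance length divided by sequence length; that formula is inferred from the values in this report, not taken from the tool's source, and the function name `repeat_coverage` is hypothetical.

```python
def repeat_coverage(occurrences, instance_length, sequence_length):
    """Percentage of the sequence length accounted for by the repeat instances."""
    return 100.0 * occurrences * instance_length / sequence_length


# (gene, occurrences, instance length, sequence length, reported coverage in %)
# All values are read from the Repeatfinder blocks in this report.
REPORTED = [
    ("LOC130803225", 18, 15, 772, 34.97),
    ("LOC130802818", 8, 14, 703, 15.93),
    ("LOC130802835", 3, 9, 217, 12.44),
    ("LOC130804619", 10, 13, 467, 27.84),
    ("LOC130804632", 8, 15, 452, 26.55),
]

for gene, count, width, length, reported in REPORTED:
    computed = round(repeat_coverage(count, width, length), 2)
    print(gene, computed, reported)
```

For LOC130802818 and LOC130802835 the listed instances overlap one another, so under this formula the percentage is not necessarily the fraction of distinct residues that fall inside a repeat.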

Similar gene clusters

NC_080062 - Cluster 46 - Saccharide

Gene cluster description

NC_080062 - Gene Cluster 46. Type = saccharide. Location: 7190393 - 7332065 nt.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Similar gene clusters

NC_080062 - Cluster 47 - Cyclopeptide

Gene cluster description

NC_080062 - Gene Cluster 47. Type = cyclopeptide. Location: 7894593 - 8538640 nt.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Repeatfinder output


Repeat found in LOC130802818
Repeat occurs 8 times in a sequence of 703 amino acids
Location between 7983216 and 7985328
Coverage of 15.93 %
Instances:
RERERERERERDRE | RERERERERDRERE | RERERERDRERERD | RERERDRERERDRD | RERDRERERDRDIE
RERERDRDIERRNR | RERDRDIERRNREK | REREKELELIKEQY |
pattern: RER[ED][KR][ED][LIR][ED][LIR][EDRI][KNR][EDR][EQRNI][ERKDY]
MKRSFEEVANNVKPVFLTKAQREQLALQRRQEEVAEQRRRAELLKGHSSDKESLDSSHHRSSR
DGDRDRYRDRDRDRERERERERERDRERERDRDIERRNREKEREEESKARDRARLEKLTERERE
KELELIKEQYLGSKKPKKRVIKPSEKFRFSFDWENTEDTSRDMNALYQNPHEAQLLFGRGFRAG
IDRREQKKLAAKNEVELREEIRRKDGTEENPEEAAALRRREQAAEKYESHDMRVDRHWSEKRLE
EMTDRDWRIFREDFNISYKGSRIPRPMRNWVESKLSSELLKAVERAGYKTPSPIQMAAIPLGLQ
QRDVIGIAETGSGKTAAFVLPMLAYITRLPPITEENEAEGPYAVVMAPTRELAQQIEEETVKFA
HYLGIKVVSIVGGQSIEEQGFKIRQGCEIVIATPGRLLDCLERRYVVLNQCNYVVLDEADRMID
MGFEPQVVGVLEAMPSSNMKPENEEEELDERKIYRTTYMFSATMPPAVERLARKFLRNPVVVTI
GTAGKATDLITQHVIMMKDSEKTFRLQKLLDDLGDKTAIVFVNTKKTADVLAKNLDKAGYRVTT
LHGGKSQEQREISLEGFRTKRYNVMVATDVAGRGIDIPDVAHVINYDMPGNIEMYTHRIGRTGR
AGKTGVATTFLTLHDTDVFFDLKQMLVQSNSPVPPELARHEASKFKPGTVPDRPPRRNETLFAH
Repeat found in LOC130802835
Repeat occurs 3 times in a sequence of 217 amino acids
Location between 8324030 and 8332974
Coverage of 12.44 %
Instances:
SSSSSGSSK | SSSSGSSKS | SSSGSSKSR |
pattern: SSS[GS][GS][GS][KS][KS][KRS]
MSENKPLIGLTWEPKLPSFVPASSSSSGSSKSRFALENSMLYKPASQLIEGLYVPPNDPTKLN
KLLRKQKKDTVGSQWFDMPAPVLTPELKKDLQLLKLRSVIDPKRHYKKSDAKSKTLPKYFQVGT
IVESASDFFSSRLTKKERKSSIADELLSDGSLSHYRKRKVREIEDQHQPGGNGNWKIRGKSTLK
RAKEKRHLPEKRLAPSRNDKKRFKRI

Similar gene clusters

NC_080062 - Cluster 48 - Alkaloid

Gene cluster description

NC_080062 - Gene Cluster 48. Type = alkaloid. Location: 8478179 - 8553816 nt.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Similar gene clusters

NC_080063 - Cluster 49 - Saccharide

Gene cluster description

NC_080063 - Gene Cluster 49. Type = saccharide. Location: 5066930 - 5206190 nt.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Similar gene clusters

NC_080063 - Cluster 50 - Terpene-transporter_associated

Gene cluster description

NC_080063 - Gene Cluster 50. Type = terpene-transporter_associated. Location: 7048728 - 7283282 nt.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Similar gene clusters

NC_080063 - Cluster 51 - Cyclopeptide

Gene cluster description

NC_080063 - Gene Cluster 51. Type = cyclopeptide. Location: 15379095 - 15779384 nt.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Repeatfinder output


Repeat found in LOC130804619
Repeat occurs 10 times in a sequence of 467 amino acids
Location between 15577267 and 15581211
Coverage of 27.84 %
Instances:
GLLSYKNNKGLQS | GLLAYSKKGLQSS | GLLAYSKKGLQSS | GLLAYSKKDLQST | GLLAYSKKGLQST
GLLAYSKKGLQST | GLLAYSKKDLQST | GLLAYSKKGLQST | GLLAYSKKGLQST | GLLLPRQVSNQIP

pattern: GLL[ALS][PY][KRS][KNQ][KNV][KGDS][GNL][LQ][SIQ][PTS]
MTKLLLLFNFLFSLGLMRMGSGAPASISPEAYWKLKLPNTLMPQIIKQTLASAGLPISVPRKA
LQPNTKDSLSTVKGYDGLLSYKNNKGLQSINSYGGLLAYSKKGLQSSYGGLLAYSKKGLQSSYG
GLLAYSKKDLQSTNSYGGLLAYSKKGLQSTNSYGGLLAYSKKGLQSTNSYGGLLAYSKKDLQST
NSYGGLLAYSKKGLQSTTGYGGLLAYSKKGLQSTNSYTFSYKDKHDDVKESVHVDESSSLKKDQ
VFFVEKDLHIGTTMTLHFQKSTRNGLLLPRQVSNQIPFSSEKLQETLQILSIDPKSNEANVLAH
RIELCELPTIEGVEKKCVTSLESMIDYVASMIGTNVEALTTEVIKESKMEFTIKGVKKIAKNDH
EIVICHKMGYPYAVYFCHRTKTLRSYRATLLGKDSTQIEAIVACHKETNDFLDNYAKNVLKIIP
GSTHICHFPPAEDTIFWVPK
Repeat found in LOC130804632
Repeat occurs 8 times in a sequence of 452 amino acids
Location between 15750016 and 15759460
Coverage of 26.55 %
Instances:
QRAQAYNPQGGQVYN | QRASGYDGYRGVYDM | QRAAYDPQRAAYDPQ | QRAAYDPQRAAYDPQ | QRAAYDPQRAAYDPQ
QRAAYDPQRAPYDPQ | QRAPYDPQRAAYEMP | QRAAYEMPRGAIAPL |
pattern: QRA[PSAQ][GAY][YDE][PDNM][PGQ][YRQ][GAR][PGA][YIVQ][EADYV][PYDM][QMLNP]
MGSKGRLPPHHMRHSVHGPGLVHPEPFGAGIRPPHDGFLHNEMLPPLEILEKKLATQHAEMER
LATENQRLAGTHGSLRQQLAAAQEDLRMVDSQIRVSKSEREQQIRALMEKISKMEAELQAGDRL
KLELQHARTEAQNLVEVRQELLSKVQQLNNELKRAHVDVQQIPTMMSELDHLRQEFHKYKATFE
HERKVYRDHLESLQAMGKEYRSMADEVAKLRAELSNPANVDKRPAYSSGAGYRDASTHNPSVHS
SYEESFGVPQTHQTFPSSAAAGGHASAATAGGGVVASSGGTPTYSAPQTGPAYPGYEAQRAQAY
NPQGGQVYNPQRASGYDGYRGVYDMPRPPYDLQRAAYDPQRAAYDPQRAAYDPQRAAYDPQRAP
YDPQRAAYEMPRGAIAPLQGQAPANNVPYGSAAPPPANNVPYGSAAPPAAPQTAAGHETQPQGD
HPTCR
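The "pattern" line in each Repeatfinder block reads like a column-wise summary of the listed instances: fixed residues where every instance agrees, and a bracketed class where they differ. The sketch below checks that reading for LOC130804632, using the instances and pattern reported above. The helper names `column_classes` and `parse_pattern` are hypothetical, and how the tool orders residues inside a bracket is not reconstructed here.

```python
import re

# Instances and pattern copied from the LOC130804632 block.
INSTANCES = [
    "QRAQAYNPQGGQVYN", "QRASGYDGYRGVYDM", "QRAAYDPQRAAYDPQ",
    "QRAAYDPQRAAYDPQ", "QRAAYDPQRAAYDPQ", "QRAAYDPQRAPYDPQ",
    "QRAPYDPQRAAYEMP", "QRAAYEMPRGAIAPL",
]
PATTERN = ("QRA[PSAQ][GAY][YDE][PDNM][PGQ][YRQ][GAR]"
           "[PGA][YIVQ][EADYV][PYDM][QMLNP]")


def column_classes(instances):
    """Collect the set of residues seen at each aligned position."""
    return [set(column) for column in zip(*instances)]


def parse_pattern(pattern):
    """Split a pattern such as 'QRA[PSAQ]' into one residue set per position."""
    tokens = re.findall(r"\[([A-Z]+)\]|([A-Z])", pattern)
    return [set(group or single) for group, single in tokens]


# The pattern is also a valid regular expression, so each instance can be
# checked against it directly.
print(all(re.fullmatch(PATTERN, inst) for inst in INSTANCES))
# Column-wise residue sets derived from the instances, compared with the pattern.
print(column_classes(INSTANCES) == parse_pattern(PATTERN))
```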

Similar gene clusters