Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_035433.2:
Cluster 1Alkaloid7207512727198964.483Beta_HSD, Bet_v_1, Chalcone_2---
Cluster 2Polyketide972927510087682358.41Chal_sti_synt_C, FAE1_CUT1_RppA, His_biosynth, Methyltransf_11, Trp_syntA---
Cluster 3Cyclopeptide1429435801444698191526.24BURP---
Cluster 4Cyclopeptide1430075211460034992995.98BURP---
The following clusters are from record NC_035434.2:
Cluster 5Cyclopeptide1219657061246318772666.17BURP---
Cluster 6Saccharide145425139145546465121.33Transferase, UDPGT_2cytokinin-2--
Cluster 7Saccharide161270579161491393220.81Cellulose_synt, Methyltransf_7, adh_short_C2---
Cluster 8Polyketide170134067170248803114.742OG-FeII_Oxy, AMP-binding, Chal_sti_synt_C, DIOX_N, FAE1_CUT1_RppA, SQS_PSY---
The following clusters are from record NC_035435.2:
Cluster 9Cyclopeptide1091212071113755812254.37BURP---
Cluster 10Cyclopeptide151206423152064773858.35BURP---
Cluster 11Saccharide161063486161307793244.31AMP-binding, Methyltransf_11, UDPGT_2*saccharide--
Cluster 12Saccharide173025516173324210298.692OG-FeII_Oxy, DIOX_N, UDPGT_2---
The following clusters are from record NC_035436.2:
Cluster 13Saccharide-Alkaloid180637540181035289397.75Glycos_transf_1, Pyridoxal_deC, p450---
Cluster 14Cyclopeptide206502905207047230544.33BURP---
The following clusters are from record NC_035437.2:
Cluster 15Cyclopeptide1617637321635760411812.31BURP---
Cluster 16Saccharide173312659173594943282.28ADH_N, ADH_zinc_N, Peptidase_S10, UDPGT_2cyanogenic glucoside, monoterpenoid--
Cluster 17Saccharide176616319176731623115.30DAHP_synth_2, Methyltransf_11, UDPGT_2cyanogenic glucoside-2, monoterpenoid-2--
The following clusters are from record NC_035438.2:
Cluster 18Saccharide142308901427302442.13Epimerase, Transferase, UDPGT_2small phenolic--
Cluster 19Saccharide6489406865225782331.71Abhydrolase_3, Cellulose_synt, Methyltransf_11---
The following clusters are from record NC_035440.2:
Cluster 20Transporter_associated-Saccharide-Terpene8498376785488737504.972OG-FeII_Oxy, ABC2_membrane, ABC_tran, DIOX_N, PRISE, Prenyltransf, UDPGT_2flavonoid--
Cluster 21Cyclopeptide1252486701285108093262.14BURP---
Cluster 22Saccharide-Fatty_acid164741548165068983327.44FA_hydroxylase, UDPGT_2, adh_short_C2cytokinin-2--
The following clusters are from record NC_035441.2:
Cluster 23Saccharide137305588137849473543.88Epimerase, Glycos_transf_1, Methyltransf_7---
Cluster 24Alkaloid178682716178817060134.34Aminotran_1_2, Cu_amine_oxid, adh_short---
Cluster 25Transporter_associated184811803184923615111.812OG-FeII_Oxy, ABC2_membrane, ABC_tran, Aminotran_1_2, DIOX_N---
The following clusters are from record NC_035442.2:
Cluster 26Transporter_associated-Saccharide1194710041215880032117.00ABC2_membrane, ABC_tran, NAD_binding_4, Oxidored_FMN, UDPGT_2, UbiA, adh_short_C2flavonoid--
Cluster 27Putative1359473731370161921068.822OG-FeII_Oxy, DIOX_N---
Cluster 28Saccharide-Terpene160283825160438213154.39Glycos_transf_1, SQHop_cyclase_C, SQHop_cyclase_N, p450Lanosterolyossoside I/yossoside II/yossoside III/yossoside IV/yossos... (40% of genes show similarity)BGC0002402.2_c1
The following clusters are from record NC_035443.2:
Cluster 29Transporter_associated90258599524409498.55AMP-binding, Epimerase, MatE, Methyltransf_11, p450---
Cluster 30Cyclopeptide1328808221361401533259.33BURP---
Cluster 31Cyclopeptide1818234351830501091226.67BURP---
The following clusters are from record NC_035444.2:
Cluster 32Alkaloid2661834126985388367.05Acetyltransf_1, BBE, FAD_binding_4, NAD_binding_1---
Cluster 33Cyclopeptide57516206593920331875.83BURP---
Cluster 34Polyketide83337326844454001108.07ADH_N_2, ADH_zinc_N, Chal_sti_synt_C, FAE1_CUT1_RppA, NAD_binding_1, p450---
The following clusters are from record NC_035445.2:
Cluster 35Saccharide7255584072755251199.41Cellulose_synt, Methyltransf_7, adh_short_C2---
Cluster 36Fatty_acid140442392140659654217.26AMP-binding, Methyltransf_11, Transferase---
Cluster 37Fatty_acid1539900571552170661227.01AMP-binding, HMGL-like, Transferase---
The following clusters are from record NC_035446.2:
Cluster 38Saccharide7968251180100762418.252OG-FeII_Oxy, Glycos_transf_1, Methyltransf_11---
Cluster 39Saccharide8504343385993685950.252OG-FeII_Oxy, DIOX_N, UDPGT_2flavonoid-5--
The following clusters are from record NC_035447.2:
Cluster 40Saccharide9712953978924476.29Aminotran_3, Cellulose_synt, Epimeraseoleananes--
Cluster 41Saccharide-Fatty_acid138927964139094241166.28CER1-like_C, FA_hydroxylase, Glycos_transf_2, Transferase---
Cluster 42Fatty_acid168599547168991008391.46FA_hydroxylase, Peptidase_S10, Transferase, adh_short---
The following clusters are from record NC_035448.2:
Cluster 43Cyclopeptide27321220287752251454.01BURP---
Cluster 44Cyclopeptide2817097028833641662.67BURP---
Cluster 45Terpene173798932174216046417.11ADH_N_2, ADH_zinc_N, Terpene_synth, Terpene_synth_C, p450---
Cluster 46Terpene1745705431756545611084.02Terpene_synth, Terpene_synth_C, p450---
Cluster 47Saccharide185403955185748127344.17Aldo_ket_red, Dimerisation, Methyltransf_2, UDPGT_2, adh_shortalkaloid-3, cytokinin-3--
The following clusters are from record NC_035449.2:
Cluster 48Saccharide2092409921106935182.84Aminotran_1_2, UDPGT_2, p450flavonoid--
Cluster 49Alkaloid54415028559990991584.07Bet_v_1, Methyltransf_7, p450---

NC_035433 - Cluster 1 - Alkaloid

Gene cluster description

NC_035433 - Gene Cluster 1. Type = alkaloid. Location: 7207512 - 7271989 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035433 - Cluster 2 - Polyketide

Gene cluster description

NC_035433 - Gene Cluster 2. Type = polyketide. Location: 9729275 - 10087682 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035433 - Cluster 3 - Cyclopeptide

Gene cluster description

NC_035433 - Gene Cluster 3. Type = cyclopeptide. Location: 142943580 - 144469819 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC110883334
Repeat occurs 3 times in a sequence of 228 amino acids
Location between 143661557 and 143663143
Coverage of 18.42 %
Instances:
SFLNRYTEAASEKA | SFLNRYTEAASEKA | SFLNRYTEATPENQ |
pattern: SFLNRYTEA[AT][PS]E[KN][AQ]
MKFFLLLVFTLFSVALVGSHAVTPEVYWKSVLPNSPMPKSIKDLLYSEWNDEKSTTGKSGVGV
HTYQGKPLYFDVQNVPGFHTSFLNRYTEAASEKAASFLNRYTEAASEKAVSFLNRYTEATPENQ
LKDNPNAALFFFENGLHLGKEMNLYFTKNDQKATFLPREIADSIPFSSNKLPQIYNEFSVNPDS
VEAKIMKQTLEECEKPSNEGEERYCATSLESMVDFSM
Repeat found in LOC110886618
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 143750499 and 143751841
Coverage of 17.11 %
Instances:
SFLNRYTEAASEK | SFLNRYTEAASEK | SFLNRYTEAASEK | SFLNRYTEAASEK | SFLNRYTEATSEN

pattern: SFLNRYTEA[AT]SE[KN]
MKFFLLLFFTLFSVALVGSHAVTPEVYWKSVLPNSPMPKSIKDLLYSEWNDEKSTAGKSGVGV
HTYKGKPLYFDVQNVPAFHTSFLNRYTEAASEKDVSFLNRYTEAASEKTASFLNRYTEAASEKT
ASFLNRYTEAASEKAASVSFLNRYTEATSENQLKDNPNAALFFFENDLHLGKEMNLYFNKNDQK
AKFLPREIADSIPFSSNKLPQIYNEFSVNPGSMEAKVMKQTLEECEKPSNEGEEGYCATSLESM
VDFSTTRLGKKVKVISTEVNTKESILVGMYKIEVAKKFPANTAVVCHKLNYAHAVFYCHKTVST
RAYAVSMISADGKKMNAAAVCHTDTSKWNPKHLAFQVLKVKPGAVPICHFLPEDHVAWVPY
Repeat found in LOC110886604
Repeat occurs 4 times in a sequence of 247 amino acids
Location between 143942511 and 143943676
Coverage of 12.96 %
Instances:
ISLQRLQI | ISLPCNLP | ISLPDNLP | ISLPCNLP |
pattern: ISL[QP][CRD][NL][QL][PI]
MFGYMEEVLEGLEPNPSLETLEMCNYLGKTISPSWMDNLRNLVEIVLSRCKKCEHIPPLGRLP
SLRVIQLSFMYSLKYFHDDDINMSVDNTDLFISLQRLQIFSCGELISLPCNLPNLKVLEVTDCW
RLISLPDNLPKLEDLELRSCSKLISLPCNLPSIRKLRIEDCNGLLSLPDEIQSFKDLNKLEITG
CKHLRKRCEREKGEDWHKISHIPDLSIQAPLWWEVSGHDDDDEDVDEDGDDEINER
Repeat found in LOC110933533
Repeat occurs 6 times in a sequence of 463 amino acids
Location between 144269803 and 144285272
Coverage of 14.25 %
Instances:
DEVEDDLKKVP | DEVKDDLKKVP | DEVKDDLKKVP | DEVKDDLENTA | DEVKDGLENTA
DEVKGHPRKLP |
pattern: DEV[KE][DG][HDG][PL][KRE][KN][VLT][AP]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFF
YVAPKDEVKDDLKKVPSVSSRDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKDEVKDGL
ENTA
FFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQKDNSLAKPFLEKDLHEGKE
FKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEAKMMKQTIDICEHKGVEGE
EIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESVKKLIANKLVVCHRLNYTY
AVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITFQLLNVTPGSATICHFLPE
DHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 8 times in a sequence of 483 amino acids
Location between 144269803 and 144285272
Coverage of 23.19 %
Instances:
TAFFYVAPKDEVKD | TAFFYVAPKNEVKD | TAFFYVAPKDEVKD | TAFFYVAPKDEVKD | TAFFYVAPKDEVKD
TAFFYVAPKDEVKD | TAFFYVAPKDEVKG | TAFLPKNISDSIPF |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDGLENTA
FFYVAPKDEVKD
DLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKD
DLENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNH
QKDNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSM
EAKMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIE
SVKKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHI
TFQLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 6 times in a sequence of 463 amino acids
Location between 144269803 and 144285272
Coverage of 19.44 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKGH
TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQKDNSLAKPFLEKDLHEGKE
FKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEAKMMKQTIDICEHKGVEGE
EIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESVKKLIANKLVVCHRLNYTY
AVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITFQLLNVTPGSATICHFLPE
DHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 7 times in a sequence of 481 amino acids
Location between 144269803 and 144285272
Coverage of 21.83 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG
TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDL
ENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQK
DNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEA
KMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESV
KKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITF
QLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 6 times in a sequence of 463 amino acids
Location between 144269803 and 144285272
Coverage of 19.44 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKGH
TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQKDNSLAKPFLEKDLHEGKE
FKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEAKMMKQTIDICEHKGVEGE
EIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESVKKLIANKLVVCHRLNYTY
AVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITFQLLNVTPGSATICHFLPE
DHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 7 times in a sequence of 481 amino acids
Location between 144269803 and 144285272
Coverage of 21.83 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKDG
TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQK
DNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEA
KMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESV
KKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITF
QLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 7 times in a sequence of 481 amino acids
Location between 144269803 and 144285272
Coverage of 21.83 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKDG
TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQK
DNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEA
KMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESV
KKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITF
QLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 7 times in a sequence of 481 amino acids
Location between 144269803 and 144285272
Coverage of 21.83 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKDD
TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKDDLENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQK
DNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEA
KMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESV
KKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITF
QLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 8 times in a sequence of 499 amino acids
Location between 144269803 and 144285272
Coverage of 24.05 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKDD
TAFFYVAPKDEVKDG | TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKDDLENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKGHPRKLPIFNGSC
KDKVKHHHHYDASHNHQKDNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSK
NLPEIYNKFSVKHDSMEAKMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALST
EVYTKEPTPSQNYKIESVKKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKI
VVICHTETSKWDPKHITFQLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 8 times in a sequence of 515 amino acids
Location between 144269803 and 144285272
Coverage of 23.3 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDD
TAFFYVAPKDEVKDG | TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDL
KKVPSVSSRDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKDEVKDGLENTAFFYVAPKD
EVKGH
PRKLPIFNGSCKDKVKHHHHYDASHNHQKDNSLAKPFLEKDLHEGKEFKLYFMKNDQKT
AFLPKNISDSIPFS
SKNLPEIYNKFSVKHDSMEAKMMKQTIDICEHKGVEGEEIFCATSLESMV
DFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESVKKLIANKLVVCHRLNYTYAVFYCHISVGTE
SYVASLEGADGTKVKIVVICHTETSKWDPKHITFQLLNVTPGSATICHFLPEDHVLWVRSSKND
TLYM
Repeat found in LOC110883267
Repeat occurs 11 times in a sequence of 496 amino acids
Location between 144325060 and 144329885
Coverage of 31.05 %
Instances:
YGASKDEVKDQPQQ | YGASKDDVKNQPQK | YGATKDETKDQPYQ | YGASKDEVKDQPQQ | YGASKDDVKNQPQK
YGATKDETKDQPYQ | YGASKDEVKDQPQQ | YGATKDEIKDQPQQ | YGASKDEVKDQPQQ | YGASKDDVKNQPQK
YGASKNQLSKNQQE |
pattern: YGA[ST]K[DN][DQE][VILT][KS][KDN][QN][PQ][YQ][KQE]
MKFTILQVYAFLSLALVGSHAVSPEVYWKSVLPNSPMPKAIKDLLYIKAWDDSKEKASIDSTS
MNSKLKDQPYQFYTYGASKDEVKDQPQQGYIYGASKDDVKNQPQKSYIYGATKDETKDQPYQFY
TYGASKDEVKDQPQQGYIYGASKDDVKNQPQKSYIYGATKDETKDQPYQFYTYGASKDEVKDQP
QQ
GYIYRASKDDVKNQPQKSYIYGATKDEIKDQPQQGYIYGASKDEVKDQPQQGYNYGASKDDV
KNQPQK
SYIYGASKNQLSKNQQEHDQMVNLLFLENNLHQGEELKLHFIKNYHKTGFLPKHISDS
IPFSLKDIPQIYNKFSVKPDSLEAKIMKQTLSLCEVKGIEGEETFCVTSLESMVDLSTTKLGKK
VKALSTEVNAKESTPSQTYKIEFVKKLIVNKALICHRQNYAYAVFYCHTIIGTESYVVSLEGVD
GTKVKAVAICHTDTTKWNPKSLAFEVLKVSPGTTICHFLHEEHVIWVPY

Similar gene clusters

NC_035433 - Cluster 4 - Cyclopeptide

Gene cluster description

NC_035433 - Gene Cluster 4. Type = cyclopeptide. Location: 143007521 - 146003499 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC110883334
Repeat occurs 3 times in a sequence of 228 amino acids
Location between 143661557 and 143663143
Coverage of 18.42 %
Instances:
SFLNRYTEAASEKA | SFLNRYTEAASEKA | SFLNRYTEATPENQ |
pattern: SFLNRYTEA[AT][PS]E[KN][AQ]
MKFFLLLVFTLFSVALVGSHAVTPEVYWKSVLPNSPMPKSIKDLLYSEWNDEKSTTGKSGVGV
HTYQGKPLYFDVQNVPGFHTSFLNRYTEAASEKAASFLNRYTEAASEKAVSFLNRYTEATPENQ
LKDNPNAALFFFENGLHLGKEMNLYFTKNDQKATFLPREIADSIPFSSNKLPQIYNEFSVNPDS
VEAKIMKQTLEECEKPSNEGEERYCATSLESMVDFSM
Repeat found in LOC110886618
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 143750499 and 143751841
Coverage of 17.11 %
Instances:
SFLNRYTEAASEK | SFLNRYTEAASEK | SFLNRYTEAASEK | SFLNRYTEAASEK | SFLNRYTEATSEN

pattern: SFLNRYTEA[AT]SE[KN]
MKFFLLLFFTLFSVALVGSHAVTPEVYWKSVLPNSPMPKSIKDLLYSEWNDEKSTAGKSGVGV
HTYKGKPLYFDVQNVPAFHTSFLNRYTEAASEKDVSFLNRYTEAASEKTASFLNRYTEAASEKT
ASFLNRYTEAASEKAASVSFLNRYTEATSENQLKDNPNAALFFFENDLHLGKEMNLYFNKNDQK
AKFLPREIADSIPFSSNKLPQIYNEFSVNPGSMEAKVMKQTLEECEKPSNEGEEGYCATSLESM
VDFSTTRLGKKVKVISTEVNTKESILVGMYKIEVAKKFPANTAVVCHKLNYAHAVFYCHKTVST
RAYAVSMISADGKKMNAAAVCHTDTSKWNPKHLAFQVLKVKPGAVPICHFLPEDHVAWVPY
Repeat found in LOC110886604
Repeat occurs 4 times in a sequence of 247 amino acids
Location between 143942511 and 143943676
Coverage of 12.96 %
Instances:
ISLQRLQI | ISLPCNLP | ISLPDNLP | ISLPCNLP |
pattern: ISL[QP][CRD][NL][QL][PI]
MFGYMEEVLEGLEPNPSLETLEMCNYLGKTISPSWMDNLRNLVEIVLSRCKKCEHIPPLGRLP
SLRVIQLSFMYSLKYFHDDDINMSVDNTDLFISLQRLQIFSCGELISLPCNLPNLKVLEVTDCW
RLISLPDNLPKLEDLELRSCSKLISLPCNLPSIRKLRIEDCNGLLSLPDEIQSFKDLNKLEITG
CKHLRKRCEREKGEDWHKISHIPDLSIQAPLWWEVSGHDDDDEDVDEDGDDEINER
Repeat found in LOC110933533
Repeat occurs 6 times in a sequence of 463 amino acids
Location between 144269803 and 144285272
Coverage of 14.25 %
Instances:
DEVEDDLKKVP | DEVKDDLKKVP | DEVKDDLKKVP | DEVKDDLENTA | DEVKDGLENTA
DEVKGHPRKLP |
pattern: DEV[KE][DG][HDG][PL][KRE][KN][VLT][AP]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFF
YVAPKDEVKDDLKKVPSVSSRDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKDEVKDGL
ENTA
FFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQKDNSLAKPFLEKDLHEGKE
FKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEAKMMKQTIDICEHKGVEGE
EIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESVKKLIANKLVVCHRLNYTY
AVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITFQLLNVTPGSATICHFLPE
DHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 8 times in a sequence of 483 amino acids
Location between 144269803 and 144285272
Coverage of 23.19 %
Instances:
TAFFYVAPKDEVKD | TAFFYVAPKNEVKD | TAFFYVAPKDEVKD | TAFFYVAPKDEVKD | TAFFYVAPKDEVKD
TAFFYVAPKDEVKD | TAFFYVAPKDEVKG | TAFLPKNISDSIPF |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDGLENTA
FFYVAPKDEVKD
DLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKD
DLENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNH
QKDNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSM
EAKMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIE
SVKKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHI
TFQLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 6 times in a sequence of 463 amino acids
Location between 144269803 and 144285272
Coverage of 19.44 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKGH
TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQKDNSLAKPFLEKDLHEGKE
FKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEAKMMKQTIDICEHKGVEGE
EIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESVKKLIANKLVVCHRLNYTY
AVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITFQLLNVTPGSATICHFLPE
DHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 7 times in a sequence of 481 amino acids
Location between 144269803 and 144285272
Coverage of 21.83 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG
TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDL
ENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQK
DNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEA
KMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESV
KKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITF
QLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 6 times in a sequence of 463 amino acids
Location between 144269803 and 144285272
Coverage of 19.44 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKGH
TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQKDNSLAKPFLEKDLHEGKE
FKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEAKMMKQTIDICEHKGVEGE
EIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESVKKLIANKLVVCHRLNYTY
AVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITFQLLNVTPGSATICHFLPE
DHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 7 times in a sequence of 481 amino acids
Location between 144269803 and 144285272
Coverage of 21.83 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKDG
TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQK
DNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEA
KMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESV
KKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITF
QLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 7 times in a sequence of 481 amino acids
Location between 144269803 and 144285272
Coverage of 21.83 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKDG
TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQK
DNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEA
KMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESV
KKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITF
QLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 7 times in a sequence of 481 amino acids
Location between 144269803 and 144285272
Coverage of 21.83 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKDD
TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKDDLENTAFFYVAPKDEVKGHPRKLPIFNGSCKDKVKHHHHYDASHNHQK
DNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSKNLPEIYNKFSVKHDSMEA
KMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESV
KKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKIVVICHTETSKWDPKHITF
QLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 8 times in a sequence of 499 amino acids
Location between 144269803 and 144285272
Coverage of 24.05 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDG | TAFFYVAPKDEVKDD
TAFFYVAPKDEVKDG | TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDGL
ENTAFFYVAPKDEVKDDLENTAFFYVAPKDEVKDGLENTAFFYVAPKDEVKGHPRKLPIFNGSC
KDKVKHHHHYDASHNHQKDNSLAKPFLEKDLHEGKEFKLYFMKNDQKTAFLPKNISDSIPFSSK
NLPEIYNKFSVKHDSMEAKMMKQTIDICEHKGVEGEEIFCATSLESMVDFTTTKLGKRVKALST
EVYTKEPTPSQNYKIESVKKLIANKLVVCHRLNYTYAVFYCHISVGTESYVASLEGADGTKVKI
VVICHTETSKWDPKHITFQLLNVTPGSATICHFLPEDHVLWVRSSKNDTLYM
Repeat found in LOC110933533
Repeat occurs 8 times in a sequence of 515 amino acids
Location between 144269803 and 144285272
Coverage of 23.3 %
Instances:
TAFFYVAPKDEVKDD | TAFFYVAPKNEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDD | TAFFYVAPKDEVKDD
TAFFYVAPKDEVKDG | TAFFYVAPKDEVKGH | TAFLPKNISDSIPFS |
pattern: TAF[LF][YP][VK][AN][PI][KS][DN][SE][VI][KP][DGF][HDSG]
MEFKLLHVYVLFSLALVGSHAVTPEVYWKSVLPNTPIPKAIKDLLYSKAWDDGLEKVSTSTAS
NDEVEDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKNEVKDDLKKVP
SVLSKDAVEDGLENTAFFYVAPKDEVKDDLKKVPSVSSKDAVEDGLENTAFFYVAPKDEVKDDL
KKVPSVSSRDAVEDGLENTAFFYVAPKDEVKDDLENTAFFYVAPKDEVKDGLENTAFFYVAPKD
EVKGH
PRKLPIFNGSCKDKVKHHHHYDASHNHQKDNSLAKPFLEKDLHEGKEFKLYFMKNDQKT
AFLPKNISDSIPFS
SKNLPEIYNKFSVKHDSMEAKMMKQTIDICEHKGVEGEEIFCATSLESMV
DFTTTKLGKRVKALSTEVYTKEPTPSQNYKIESVKKLIANKLVVCHRLNYTYAVFYCHISVGTE
SYVASLEGADGTKVKIVVICHTETSKWDPKHITFQLLNVTPGSATICHFLPEDHVLWVRSSKND
TLYM
Repeat found in LOC110883267
Repeat occurs 11 times in a sequence of 496 amino acids
Location between 144325060 and 144329885
Coverage of 31.05 %
Instances:
YGASKDEVKDQPQQ | YGASKDDVKNQPQK | YGATKDETKDQPYQ | YGASKDEVKDQPQQ | YGASKDDVKNQPQK
YGATKDETKDQPYQ | YGASKDEVKDQPQQ | YGATKDEIKDQPQQ | YGASKDEVKDQPQQ | YGASKDDVKNQPQK
YGASKNQLSKNQQE |
pattern: YGA[ST]K[DN][DQE][VILT][KS][KDN][QN][PQ][YQ][KQE]
MKFTILQVYAFLSLALVGSHAVSPEVYWKSVLPNSPMPKAIKDLLYIKAWDDSKEKASIDSTS
MNSKLKDQPYQFYTYGASKDEVKDQPQQGYIYGASKDDVKNQPQKSYIYGATKDETKDQPYQFY
TYGASKDEVKDQPQQGYIYGASKDDVKNQPQKSYIYGATKDETKDQPYQFYTYGASKDEVKDQP
QQ
GYIYRASKDDVKNQPQKSYIYGATKDEIKDQPQQGYIYGASKDEVKDQPQQGYNYGASKDDV
KNQPQK
SYIYGASKNQLSKNQQEHDQMVNLLFLENNLHQGEELKLHFIKNYHKTGFLPKHISDS
IPFSLKDIPQIYNKFSVKPDSLEAKIMKQTLSLCEVKGIEGEETFCVTSLESMVDLSTTKLGKK
VKALSTEVNAKESTPSQTYKIEFVKKLIVNKALICHRQNYAYAVFYCHTIIGTESYVVSLEGVD
GTKVKAVAICHTDTTKWNPKSLAFEVLKVSPGTTICHFLHEEHVIWVPY
Repeat found in LOC110883299
Repeat occurs 7 times in a sequence of 377 amino acids
Location between 144644100 and 144646022
Coverage of 13.0 %
Instances:
VGVGKGG | VGVGKGG | VGVHTGK | VGVGKGG | VGVHTGK
VGVGKGG | VGVHKGP |
pattern: VGV[HG][KT]G[KPG]
MKFFLVHVCTLLSVSLVGSHAATPEVYWKSVLPNSPMPKSIKDLLYTEWNDEKSTAVGVGKGG
VSVHTGKPGKRTDVGVGKGGVGVHTGKPGKGTDVGVGKGGVGVHTGKPGKRTDVGVGKGGVEVH
TGHKGKPVYVGVHKGPDPFVYLYAASEDQLKDNPNIAMFFLENDLHQGKEMNLHFTKNDQKATF
LPRKIADSIPFSSNKLPQIYDEFSVKTDSMEAEIMKQTLNECESTGIEDEEKYCATSLESMVDF
STTKLGKNVKAISTEVSAKESTPLQKYKIKVAKKLAANKAVVCHKQNYAYAVFYCHKTISTQAY
AVSMVGGDGTKVNAVAVCHTDTAKWNPKHLAFQVLKVKPGTVPICHFLPEDHVVWVPY
Repeat found in LOC110883321
Repeat occurs 9 times in a sequence of 432 amino acids
Location between 144709581 and 144711303
Coverage of 14.58 %
Instances:
VGVGKGG | VGVGKGG | VGVHTGK | VGVGKGG | VGVHTGK
VGVGKGG | VGVHTGK | VGVGKGG | VGVRKGP |
pattern: VGV[HRG][KT]G[KPG]
MNISHFINSTSTTHQTSSKLTPSISCSFGFSFSPMRFFLLHVCTLFSVALLGSHAVTPEVYWK
SVLPNSPMPKSIKDLLYTEWSDEKSTAVGVGKGGVSVHTGKPGKRTDVGVGKGGVGVHTGKPGK
RTDVGVGKGGVGVHTGKPGKRTDVGVGKGGVGVHTGKPGKGTHVGVGKGGVNVHTRHKGKPVYV
GVRKGP
NPFVYKYAASENQLKDNPNVALFFKENDLHQGKEMNLHFTKNDEKATFLPRQIADSIP
FSSNKLPQIYEEFSVKPDSMEAEIMKQTLSECESTGIQGEEKYCATSLESMVDFSTTKLGKNVK
AISTEVSAKESTPLQKYKIEVAKKLVANKAVVCHKQNYAYAVFYCHKTASTQAYAVSMVGADGT
KVNAVAVCHTDTAKWNPKHLAFQVLKVKPGTVPICHFLPEDHVVWVPYK
Repeat found in LOC110936586
Repeat occurs 7 times in a sequence of 500 amino acids
Location between 145863681 and 145866110
Coverage of 21.0 %
Instances:
LIFFNFSDLFNPFKM | LIFAGKQLEDGRTLA | LIFAGKQLEDGRTLA | LIFAGKQLEDGRTLA | LIFAGKQLEDGRTLA
LIFAGKQLEDGRTLA | LIFAGKQLEDGRTLA |
pattern: LIF[AF][GN][KF][QS][DL][EL][DF][GN][RP][TF][KL][MA]
MLYCSINSPPIYPLHSSTNSLIFFNFSDLFNPFKMQIFVKTLTGKTITLEVESSDTIDNVKAK
IQDKEGIPPDQQRLIFAGKQLEDGRTLADYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEV
ESSDTIDNVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLADYNIQKESTLHLVLRLRGGMQIFV
KTLTGKTITLEVESSDTIDNVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLADYNIQKESTLHL
VLRLRGGMQIFVKTLTGKTITLEVESSDTIDNVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLA
DYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVESSDTIDNVKAKIQDKEGIPPDQQRLIF
AGKQLEDGRTLA
DYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVESSDTIDNVKAKIQDK
EGIPPDQQRLIFAGKQLEDGRTLADYNIQKXVHSAPCSSSAWWVLDVCGGCDV

Similar gene clusters

NC_035434 - Cluster 5 - Cyclopeptide

Gene cluster description

NC_035434 - Gene Cluster 5. Type = cyclopeptide. Location: 121965706 - 124631877 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC110895150
Repeat occurs 3 times in a sequence of 250 amino acids
Location between 121987742 and 121988495
Coverage of 8.4 %
Instances:
APVLPDD | APVVPEP | APVVAPP |
pattern: APV[VL][AP][DEP][DP]
MELEQPAEAPVLPDDQIPVIPADHQPAPVVPEPIVALDPVPVTDAPVVAPPAVEIPDIAPIPD
PMAIFDDLAPFATHIDLRYANTSDGWIEEDDYPLYVVPVTPPTIPITAPLDIPVFPPPTSDAHR
TDLPITFLRDIPPPQPGEGSSRQPFGHTPFVTGDNQLIPPTPYHSFVPPVSSTAPFMSQFPHTT
SQIAFTPHTTHPALAPMGEPFQWSSPHVMPVSDPYHSFHVGYSIEDMLTSLQLQQDALR
Repeat found in LOC110921428
Repeat occurs 3 times in a sequence of 611 amino acids
Location between 123453061 and 123467261
Coverage of 4.42 %
Instances:
DSKSKHEDK | DSKSKHEDK | DSKMGNNVN |
pattern: DSK[MS][KG][HN][NE][VD][KN]
MEKVDENARPNFEWLKERGVRQNNGVQEYESFKYYGVKYMLYDCVYMCNGGRGEPYIGKLTKI
WECPDKKKKVEVHWFFRPKQIPRWLGETKTLESEILIATGEGLGLANINPLEAIDGPCKVVCIS
KDNRNPQPSDEQLKAADFVFYRTANVQTSAISDNIGDKVGGLEIKYVFNREKTQKTSHFQESIS
DTKEMSTKKSMVDSKSKHEDKSPADVKNLIASGTSDTTEDRKKCKISAKDTSEDLDGQILKKRR
DHNMKTTVDSKSKHEDKSTADVNNLNASGTFGTMEDRKKRKILEKETSDDREGNKIKRTRDGGS
FKVPDSKMGNNVNVRDSVAISKGKSTSGIASDIIRNNKNNTYNKEGKEGQSGENNRELSMTKSL
KVSTLSTNKVNKNAYRYWEFVVNPKPKAEKHCWFTRAHWEDRLKSAYNQGTAILLHNVNPDYNS
GDIEDIIWHAFKVNCDAEIHCDAKILQRTSVSSPHHAQALIVLKTKEDAQRVLKKLDEECLMLS
DGKNGFMPLVGTLCPTISTGKNSSFYGNLAGKARFRNQREDEAVSTSHFSQPNTIEYDMAMEWR
LLQLQSVKCWEKLHEQQGLELENLKNGLKENVHEDS

Similar gene clusters

NC_035434 - Cluster 6 - Saccharide

Gene cluster description

NC_035434 - Gene Cluster 6. Type = saccharide. Location: 145425139 - 145546465 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035434 - Cluster 7 - Saccharide

Gene cluster description

NC_035434 - Gene Cluster 7. Type = saccharide. Location: 161270579 - 161491393 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035434 - Cluster 8 - Polyketide

Gene cluster description

NC_035434 - Gene Cluster 8. Type = polyketide. Location: 170134067 - 170248803 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035435 - Cluster 9 - Cyclopeptide

Gene cluster description

NC_035435 - Gene Cluster 9. Type = cyclopeptide. Location: 109121207 - 111375581 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_035435 - Cluster 10 - Cyclopeptide

Gene cluster description

NC_035435 - Gene Cluster 10. Type = cyclopeptide. Location: 151206423 - 152064773 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_035435 - Cluster 11 - Saccharide

Gene cluster description

NC_035435 - Gene Cluster 11. Type = saccharide. Location: 161063486 - 161307793 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035435 - Cluster 12 - Saccharide

Gene cluster description

NC_035435 - Gene Cluster 12. Type = saccharide. Location: 173025516 - 173324210 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035436 - Cluster 13 - Saccharide-alkaloid

Gene cluster description

NC_035436 - Gene Cluster 13. Type = saccharide-alkaloid. Location: 180637540 - 181035289 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035436 - Cluster 14 - Cyclopeptide

Gene cluster description

NC_035436 - Gene Cluster 14. Type = cyclopeptide. Location: 206502905 - 207047230 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC118491249
Repeat occurs 13 times in a sequence of 383 amino acids
Location between 206727480 and 206728762
Coverage of 33.94 %
Instances:
PRPSATSYVT | PRPSATSYVT | PRPSATSYVT | PRPSATSYVT | PRPSATSYVT
PRPSATSYVT | PRPSATSYVA | PRPSATSYVA | PRPSATSYVT | PRPSATSYVA
PRPSATSYVA | PRPSATVYDN | PRPSATSYDI |
pattern: PRPSAT[VS]Y[VD][AINT]
The following known motifs were found:
FEPR was found 10 times in this sequence
MESLHSFLVFFSLTMVVFIIEARPDSGEYWQNLIFQKSDSVSPFEGNKAHCHTLKETTLKASS
FEDESPYGNIKDKDHEDITHLNEKKAFNTDFEPRPSATSYVTGVSDKKSYDTKFEPRPSATSYV
T
GLNGKKTFNIDFEPRPSATSYVTGLNGKKTFDTGFEPRPSATSYVTGLNGKKTFNIEFEPRPS
ATSYVT
GENGKKTFDTGFEPRPSATSYVTGLNGKKTFDTGFEPRPSATSYVAGLNGKKTFNIDF
KPRPSATSYVAGQNGMKTSDTNFEPRPSATSYVTGLNGKKTFDTGFEPRPSATSYVAGQNGKKT
FNIDFIPRPSATSYVAGQNGMKTSDTNFEPRPSATVYDNRVNTKDNAPFDKDLEPRPSATSYDI
Repeat found in LOC110938217
Repeat occurs 13 times in a sequence of 359 amino acids
Location between 206732337 and 206733575
Coverage of 36.21 %
Instances:
PRPSAISYVT | PRPSATSYVT | PRPSATSYVT | PRPSATNYVT | PRPSATSYVT
PRPSATSYVT | PRPSATSYVA | PRPSATSYVS | PRPSATSYVA | PRPSATSYVT
PRPSATSYVA | PRPNATSYVA | PRPSAISYDI |
pattern: PRP[NS]A[IT][NS]Y[VD][AIST]
The following known motifs were found:
FEPR was found 12 times in this sequence
MESLPSFLVFFSLAMVVFIIEARPDSGECWQNLIVQKSEYRRSATDFEPRPSAISYVTGVSDK
KSYDTKFEPRPSATSYVTGLNGKKTLNIDFEPRPSATSYVTSLNGKKTFDTGFEPRPSATNYVT
GLKGKKTFNIDFEPRPSATSYVTGQNGKKTFDTGFEPRPSATSYVTGLNGKKPFGTEFEPRPSA
TSYVA
DLNGKKTFDTGFEPRPSATSYVSGLNGKKPFGTEFEPRPSATSYVAGLNGKKTFDTGFE
PR
PSATSYVTGLNGKKPFDTEFEPRPSATSYVAGINDKKTFNIDFKPRPNATSYVAGQNGMKTF
DTNFEPRSSATVYDNRVNTKDNAPFDKDLKPRPSAISYDI

Similar gene clusters

NC_035437 - Cluster 15 - Cyclopeptide

Gene cluster description

NC_035437 - Gene Cluster 15. Type = cyclopeptide. Location: 161763732 - 163576041 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC110941822
Repeat occurs 23 times in a sequence of 1234 amino acids
Location between 162125153 and 162128858
Coverage of 27.96 %
Instances:
GLQAHPGSTMGLQAH | GLQAHPGSTMGLQAH | GLQAHPSSNMGLQAH | GLQAHPGSTMGLQAH | GLQAHPGSTMGLQSH
GLQSHPGTTMGLQAH | GLQAHPGSTMGLQAH | GLQAHPGSTMGLQAH | GLQAHPGSTMGLQAH | GLQAHPGSTMGLQAH
GLQAHPGTTMGFQAH | GLQAHPGTTMGLQAN | GLQANPSSTMGLQTL | GLQTLPGSTMGSQAN | GLQAHPVSTMGLQAH
GLQAHPGTTMGLQAN | GLQANPGSTMGLQAN | GLQANPVSTMGLQAH | GLQAHPGTTMGLQAN | GLQANPGSNMGLQAN
GLQANPGSIMHLPAH | GLQAHPGSTMDLQAN | GLQNDKPFGPVLVNY |
pattern: GLQ[ANST][HLDN][KP][VPSG][FST][GINT][MP][HVDG][SLF][VQP][ANST][HLNY]
MLNGVCMETRKISETLIEGLVSSKQNDDVDQFGGKFPEVSQCLDLGKVKCEESAFAPVKESRC
VDSCSVKGSDRVGSSGRGDVSIKGISLFVELTGGMSKKPNGESFRDDKSRLMYGNQVKVDVGQQ
ESEVNVGDFVWAIIKKQSWWPGIVCDAANAPKEAANGPTREDDILVKCFGNDNFIWCSPYEVRP
FIGYFDRLPNQSKAKKFLVALEKAVTEFGHRVKAEFTCRCFSKMKTEKTGNVGDFSVTRFEPAR
FLDYIKDLGKDVSMTHEIDNVVKQSCLSAFYRSLGHLQIPMHQLMPANGTPLKIKTEEENGYFG
DDTEKTNETREKRKSRFLSYPGECGIDDSNVDDYEGVDLNKTSGQGQSQPVKKPRKKWTRKNKE
VRANVCSTEVLSQLQFSAQDCLFPCESKNFDSVKRFIYGFRKSAFKDSSSEIPVDVSNSNHQIS
QETLPKKVKKKKDKSVISPTIGNNSNHASQFMNFQNVGSFNTESMANQRTEWPANQVPIFGFGY
PPPTQPTQPPLRKLAPKRPNKVAENVTPMYNWNADISTLPNMNNMEGLYQAPVFFTNRFGPIPP
YYTGNYGQPQVVPVPNGLTSVPKKRGRKRKKVDLQAHLDPTMGLQAHPGSTMGLQAHPGSTMGL
QAHPSSNMGLQAH
PGSTMGLQAHPGSTMGLQSHPGTTMGLQAHPGSTMGLQAHPGSTMGLQAHP
GSTMGLQAH
PGSTMGLQAHPGTTMGFQAHPGTTMGLQAHPGTTMGLQANPSSTMGLQTLPGSTM
GSQAN
PVSTMGLQAHPVSTMGLQAHPGTTMGLQANPGSTMGLQANPVSTMGLQAHPGTTMGLQA
N
PGSNMGLQANPGSIMHLPAHPGSLMGLQAHPGSTMDLQANPGSTVIPNLNGSITGQKKQKRAN
KNNEVGVPCIDLSYNKVQQDNLELKGTAFLLKFSSDYPLPSIQDLNSVFSKFGALIESETQVLN
ETLSGQVVFLDSSSAGGAFWGLQNDKPFGPVLVNYKIQHLAGSESTVPFKTPIKSPSGPKPMDL
NTNSIVSPAMIPDLNTNSTEVQIIKRSKTIEEIRLPSNHNHLGNNKVHQGNEVTGTTGTALLLT
FSPNFPLPSIQDLNSVFCKYGELKERETRVFGQSFTGQVVFVKPSTAADAIHRLHKDQPFGVAL
ISYQLHHLHNVQPAIRVKSPVNALKPPQDLGVIKKNLERMNSMLEKSGDSLSPEMRRKLESEIK
GLMNKVSAVDGSSSSSSCL
Repeat found in LOC110943886
Repeat occurs 3 times in a sequence of 336 amino acids
Location between 163048031 and 163052871
Coverage of 5.36 %
Instances:
YGGYRD | YGGYRE | YGGYGD |
pattern: YGGY[RG][DE]
MDYGGYRDKPVVEPYYPAQEWYPDYVGYGGYREYGGYGDEPPNTPLYDDYQPSNPGNRFKISY
SGVLQDPEYEASASSKLPVNLKLVFSNEVASPVFTKQKIIGKVIGDNGNTSIQVILVDRNTHQR
ITSGAEASAKVKMVLLRGDCGGSTTRGFEENIVVDWEKKKNILQGDMYVYLQRGCGTVGEIRIK
HDKNSLKNVQFRLGAMIVDTSYEVEEAITNPFEVKDHRNLPKCLRPLSLEDRVGKMMNISNKCT
GKIRTRLESKNIRTVRDFLGMHSLNPQELQEICGIKGKRWEMTVKHAKTSLVGNMGTKFSSQHS
NTLTSDDSDMLKQWYDG

Similar gene clusters

NC_035437 - Cluster 16 - Saccharide

Gene cluster description

NC_035437 - Gene Cluster 16. Type = saccharide. Location: 173312659 - 173594943 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035437 - Cluster 17 - Saccharide

Gene cluster description

NC_035437 - Gene Cluster 17. Type = saccharide. Location: 176616319 - 176731623 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035438 - Cluster 18 - Saccharide

Gene cluster description

NC_035438 - Gene Cluster 18. Type = saccharide. Location: 14230890 - 14273024 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035438 - Cluster 19 - Saccharide

Gene cluster description

NC_035438 - Gene Cluster 19. Type = saccharide. Location: 64894068 - 65225782 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035440 - Cluster 20 - Transporter_associated-saccharide-terpene

Gene cluster description

NC_035440 - Gene Cluster 20. Type = transporter_associated-saccharide-terpene. Location: 84983767 - 85488737 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035440 - Cluster 21 - Cyclopeptide

Gene cluster description

NC_035440 - Gene Cluster 21. Type = cyclopeptide. Location: 125248670 - 128510809 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC110917162
Repeat occurs 3 times in a sequence of 300 amino acids
Location between 126090527 and 126092392
Coverage of 10.0 %
Instances:
NPNPNPNHNH | NPNPNHNHNH | NPNHNHNHQL |
pattern: NPN[HP]N[HP]NH[QN][HL]
MGLTSLQVCMDSPDWFQGTNPEELAIDTSSPSGLTDSMLSCSRPLMDRRLRPPQDHPLKCPRC
DSTHTKFCYYNNYSLTQPRYFCKTCRRYWTKGGTLRNIPVGGGCRKNKKVVSSSKKPIIQHHQK
NSHLVDIYQNPNPNPNHNHNHQLVMNCVDLQLSYPESTMQYSGSNLLGIGNSSNATSFMFENMN
SSYGGVNVMGQSYQGGSLCSPFGGMTLDSINSFGERGLIAYNREGNGVNDDNIGGFVDVKPNVL
ALEWQDDGHASATSGGGDGGYLTGLGSSWGGLVHGYGPSTTNPLV
Repeat found in LOC118480868
Repeat occurs 3 times in a sequence of 191 amino acids
Location between 126268645 and 126271355
Coverage of 15.71 %
Instances:
EREREREGTR | EREREGTRER | EREGTRERTA |
pattern: ERE[RG][ET][RG][ET][RG][ET][AR]
MEREREREGTRERTAGGSGGRRSELQPSENIGRQHCNYGGEYVLIQFWFGSHLGSDLRGARVG
FGGMLGSGLRCVRFGFELCTVSGSHQRAWVTFQRLESTVNRSQSWSTSSHGSGLVNSGQIRKRF
GQTQVNISQLSRSTQLTRSARLSTREDGCISKLWHDWNKRTMWSRVLSGVRFLEVLAIFASAVQ
Repeat found in LOC118480868
Repeat occurs 3 times in a sequence of 157 amino acids
Location between 126268645 and 126270284
Coverage of 19.11 %
Instances:
EREREREGTR | EREREGTRER | EREGTRERTA |
pattern: ERE[RG][ET][RG][ET][RG][ET][AR]
MEREREREGTRERTAGGSGGRRSELQPSENIGRQHCNYGGEYVLIQFWFGSHLGSDLRGARVG
FGGMLGSGLRCVRFGFELCTVSGSHQRAWVTFQRLESTVNRSQSWSTSSHGSGLVNSGQIRKRF
GQTQVNISQLSRSTQLTRSARLSTREDGIL
Repeat found in LOC110921466
Repeat occurs 3 times in a sequence of 190 amino acids
Location between 127546145 and 127549218
Coverage of 9.47 %
Instances:
KACEKT | KACFRC | KACFKC |
pattern: KAC[EF][KR][CT]
MASYGGTTQKCKACEKTVYFVDELTVDNKVYHKACFRCHHCKGTLKLSNYSSFEGVLYCQPHF
DQLFKMTGSLDKSFEGTPRTARSYDQGQSNSKVSSMFGGTQDKCVTCKKTVYPLEKVGVDGNAY
HKACFKCSYGGCHISSSNYVTHEHQLYCKHHHTQLFMAKGDFSQFDKHREPENGVTAENTTEA
Repeat found in LOC110907004
Repeat occurs 7 times in a sequence of 543 amino acids
Location between 128349759 and 128351391
Coverage of 7.73 %
Instances:
TPPPEP | TPPRQP | TPPQQP | TPPRQP | TPPRQT
TPPRQP | TPPLTQ |
pattern: TPP[RPQL][QET][PQT]
MGYAGALNVGNYLKSKFQKPYKFIVHCVLMSLSHTKGGYDAMRDYQMNMVTALILNKKYNFSH
IVFHYMVENILTKSKSWMYPRFVQMLIDHAYPEIERDVKDDLLVQSRMSNDSLKQLARYHPNHP
EPKIVAKFFGFIKDANYVDPDPVDHQNWRNEEEIKEAAYADELKTLEEFKTTRNDWFVKEPRRR
GKKATLKSQEGEGSSSQPKKKQKKVTKTLLVDEPEVEEPVVTAEDDPYADIDKVMLNVDDLVSE
QAVNVEAEKEKVLDDVEGDDINKSTTSSSSSSDDEIDEAERLRRIQEATEKEKQLRKRKRQEKD
DAAYIPSLEHVSESQSPSSGKKKAGAKKRIISPKIKKVTTKITKPKIVLKKKPAKESSKPSTPP
PEP
IPIQSPPHQTPPRQPTPPQQPTPPRQPSPPKQPTPPRQTSPPKQPTPPRQPSSLHLSPLHL
SPPQQQTLLTSQEIFQTPPLTQIQLTPGSSGHRGLHNPPDNLEDIVDFGFANDEQVKKLEKKMD
DVLNENKVLAAESKKVADQEKDSRNACKEARI

Similar gene clusters

NC_035440 - Cluster 22 - Saccharide-fatty_acid

Gene cluster description

NC_035440 - Gene Cluster 22. Type = saccharide-fatty_acid. Location: 164741548 - 165068983 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035441 - Cluster 23 - Saccharide

Gene cluster description

NC_035441 - Gene Cluster 23. Type = saccharide. Location: 137305588 - 137849473 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035441 - Cluster 24 - Alkaloid

Gene cluster description

NC_035441 - Gene Cluster 24. Type = alkaloid. Location: 178682716 - 178817060 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035441 - Cluster 25 - Transporter_associated

Gene cluster description

NC_035441 - Gene Cluster 25. Type = transporter_associated. Location: 184811803 - 184923615 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035442 - Cluster 26 - Transporter_associated-saccharide

Gene cluster description

NC_035442 - Gene Cluster 26. Type = transporter_associated-saccharide. Location: 119471004 - 121588003 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035442 - Cluster 27 - Putative

Gene cluster description

NC_035442 - Gene Cluster 27. Type = putative. Location: 135947373 - 137016192 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035442 - Cluster 28 - Saccharide-terpene

Gene cluster description

NC_035442 - Gene Cluster 28. Type = saccharide-terpene. Location: 160283825 - 160438213 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_035443 - Cluster 29 - Transporter_associated

Gene cluster description

NC_035443 - Gene Cluster 29. Type = transporter_associated. Location: 9025859 - 9524409 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035443 - Cluster 30 - Cyclopeptide

Gene cluster description

NC_035443 - Gene Cluster 30. Type = cyclopeptide. Location: 132880822 - 136140153 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_035443 - Cluster 31 - Cyclopeptide

Gene cluster description

NC_035443 - Gene Cluster 31. Type = cyclopeptide. Location: 181823435 - 183050109 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC110891364
Repeat occurs 6 times in a sequence of 747 amino acids
Location between 181943841 and 181946670
Coverage of 4.82 %
Instances:
VNSEVN | VNSEVN | VNSEVN | VNSEVN | VNSQLI
VNSLTS |
pattern: VNS[LQE][VLT][SIN]
MGSLYGTHEIVELKDEPNSSDFSNDKKKKPQGCSRYLEDDINKLFEGINIRTSFNGSSFSDET
GARTPRKNPSKKPMRMSLSSSSGIGYSEPVSLKQALRGLCISQASEMAALKRSSMPYSNSAQST
PRFHLEKSKKSVALLKEMLLVPESDSQSSNEPAEEKKVISVSPEASLESVNSEVNSEVNSEVNS
EVNPVNNREKNNCTSSKPTPKPRHKGKSIRTSPRAVKTVIRTKTVVKKKSKPELTSVTPASNAV
NSQLI
CQKCNCAIKDEKDEPNKDSQRKVNSLTSYGKTISLLNNKNSKYGDKGEFTQSSNSSICE
YSSSTSLSEERNLSGCILINRPHMSKDVRWQAINHTMKRYGFLGLRHFNLLKKLGSGDIGTVYL
AELCGTNYPFAIKVMDNEFLERRQKMPRAHMEREILSLLDHPFLPTLYAHFVSENLSCLVMEYC
PGGDLHVLRQKQPARYFHEQAARFYVAEVLLALEYLHMLGIVYRDLKPENILVREDGHIMVTDF
DLSLRCPVNPTLIQSPSSGPMEPPRVSGPCAGSGCIDPFCIKPTCQQVSCFKSKIRKTKPDLAA
LHRSLPVLVAEPTEARSNSFVGTHEYLAPEIIKGEGHGSAVDWWTFGIFLYELLYGKTPFKGPG
NDETLTNVILESLRFPETPLVSFHARDLIKGLLVKEPENRLGSQRGAAEIKKHPFFDGLNWALI
RCASPPEVPEPYDVTVSKAASLERARKYLDYNGSSDGNLEFELF
Repeat found in LOC110891392
Repeat occurs 15 times in a sequence of 685 amino acids
Location between 182687461 and 182695857
Coverage of 32.85 %
Instances:
GQQIYPQQMGQQMYP | GQQMYPQQMGQQMYP | GQQMYPQQLGQQMYT | GQQMYTNQMGQQVYG | GQQVYGNQVGQQMYG
GQQMYGNQMGQQAYG | GQQAYGNQMGQQVYG | GQQVYGNQMGQQMYG | GQQMYGNQMGQQMYG | GQQMYGNQMGQQMYG
GQQMYGNQMGQQMYG | GQQMYGNQMGQQQMY | GQQQMYNSQMGQQMY | GQQMYNNQMGQAYGY | GQQQNGQYIGKSMPG

pattern: GQQ[IAMQV][MYN][YNTPG][QN][YQS][IMQVL][MG][KQG][AQS][YMAQV][MYPG][YPGT]
MVNSMVARATNNMLMGPDWAMNLEICDVCNRDPVQAKDVVKGIKKRLGSRNPKSQLLALTLLE
SVVKNCGDVVHLNVAERNLLHDMIKIVKKKKPDYHVKEKILILIDTWQEAFGGERGRYPQYYVA
YLELLRLGVTFPEKSAREGPVLTPPQTHPITSVPYNPQHPENENETRVHTAEADFPTLSLTEMQ
NARGVMDVLAEMLTAVDPTKKEGLRQEVVVDLVEQCRTYKRRLVHLVNSTSDESLLSLGLALND
DLQRVLAKHEALLTTTHVVSSEKSNAEKSTPETSRAIVPVTTPLGATDTKQVGSTSGAADQPDL
ISLAPATTNGQSTSSAIVNFDLLSGDDFDTSTASNALAIVPVGETQPATPVSQQNALALVSVLS
PQTQQPRNMSARQTSFYQNRNVAQNQSWAPNNQGNNPFGVTMNNGLPPAPWEAPHADNQLVVAG
SMQYSQPHGIQTPGNINMNINNQMGQQIYPQQMGQQMYPQQMGQQMYPQQLGQQMYTNQMGQQV
YGNQVGQQMYG
NQMGQQAYGNQMGQQVYGNQMGQQMYGNQMGQQMYGNQMGQQMYGNQMGQQMY
G
NQMGQQQMYNSQMGQQMYNNQMGQAYGYGPISVYGQQQNGQYIGKSMPGMPVRDDNFLTSTVY
TGPSMANASYVHVSKTQKSEDKLFGDLVDFSKVKPKKMTPGIGERV

Similar gene clusters

NC_035444 - Cluster 32 - Alkaloid

Gene cluster description

NC_035444 - Gene Cluster 32. Type = alkaloid. Location: 26618341 - 26985388 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035444 - Cluster 33 - Cyclopeptide

Gene cluster description

NC_035444 - Gene Cluster 33. Type = cyclopeptide. Location: 57516206 - 59392033 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_035444 - Cluster 34 - Polyketide

Gene cluster description

NC_035444 - Gene Cluster 34. Type = polyketide. Location: 83337326 - 84445400 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035445 - Cluster 35 - Saccharide

Gene cluster description

NC_035445 - Gene Cluster 35. Type = saccharide. Location: 72555840 - 72755251 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035445 - Cluster 36 - Fatty_acid

Gene cluster description

NC_035445 - Gene Cluster 36. Type = fatty_acid. Location: 140442392 - 140659654 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035445 - Cluster 37 - Fatty_acid

Gene cluster description

NC_035445 - Gene Cluster 37. Type = fatty_acid. Location: 153990057 - 155217066 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035446 - Cluster 38 - Saccharide

Gene cluster description

NC_035446 - Gene Cluster 38. Type = saccharide. Location: 79682511 - 80100762 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035446 - Cluster 39 - Saccharide

Gene cluster description

NC_035446 - Gene Cluster 39. Type = saccharide. Location: 85043433 - 85993685 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035447 - Cluster 40 - Saccharide

Gene cluster description

NC_035447 - Gene Cluster 40. Type = saccharide. Location: 9712953 - 9789244 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035447 - Cluster 41 - Saccharide-fatty_acid

Gene cluster description

NC_035447 - Gene Cluster 41. Type = saccharide-fatty_acid. Location: 138927964 - 139094241 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035447 - Cluster 42 - Fatty_acid

Gene cluster description

NC_035447 - Gene Cluster 42. Type = fatty_acid. Location: 168599547 - 168991008 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035448 - Cluster 43 - Cyclopeptide

Gene cluster description

NC_035448 - Gene Cluster 43. Type = cyclopeptide. Location: 27321220 - 28775225 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_035448 - Cluster 44 - Cyclopeptide

Gene cluster description

NC_035448 - Gene Cluster 44. Type = cyclopeptide. Location: 28170970 - 28833641 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_035448 - Cluster 45 - Terpene

Gene cluster description

NC_035448 - Gene Cluster 45. Type = terpene. Location: 173798932 - 174216046 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035448 - Cluster 46 - Terpene

Gene cluster description

NC_035448 - Gene Cluster 46. Type = terpene. Location: 174570543 - 175654561 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035448 - Cluster 47 - Saccharide

Gene cluster description

NC_035448 - Gene Cluster 47. Type = saccharide. Location: 185403955 - 185748127 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035449 - Cluster 48 - Saccharide

Gene cluster description

NC_035449 - Gene Cluster 48. Type = saccharide. Location: 20924099 - 21106935 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_035449 - Cluster 49 - Alkaloid

Gene cluster description

NC_035449 - Gene Cluster 49. Type = alkaloid. Location: 54415028 - 55999099 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters