Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_065534.1:
Cluster 1Saccharide678642281319411345.52AMP-binding, Amino_oxidase, Glyco_hydro_1, Lipoxygenase, UDPGT_2---
Cluster 2Cyclopeptide1815903918882722723.68BURP---
Cluster 3Fatty_acid2904284129608647565.81FA_hydroxylase, adh_short, adh_short_C2---
The following clusters are from record NC_065535.1:
Cluster 4Saccharide1486219215054933192.74Glycos_transf_2, Transferase, p450---
Cluster 5Saccharide-Fatty_acid2612126726669693548.43ADH_N, ADH_zinc_N, FA_desaturase_2, Peptidase_S10, UDPGT_2---
Cluster 6Lignan3282356133032434208.87Aminotran_1_2, Dirigent, Epimerase---
Cluster 7Fatty_acid3683009937080945250.852OG-FeII_Oxy, DIOX_N, FA_hydroxylase, p450---
Cluster 8Saccharide6034065060615683275.03AMP-binding, Amino_oxidase, Glycos_transf_1, SQS_PSY, UDPGT_2---
Cluster 9Fatty_acid8086502581053783188.76DIOX_N, ECH_2, Transferase---
The following clusters are from record NC_065536.1:
Cluster 10Putative5145691552198624741.71Transferase, p450-arabidiol-baruol (6% of genes show similarity)BGC0001313.3_c1
Cluster 11Alkaloid5486317855195437332.26Aminotran_1_2, Cu_amine_oxid, MatE, UbiA---
Cluster 12Cyclopeptide56116042577850521669.01BURP-momilactone B (28% of genes show similarity)BGC0000671.3_c1
The following clusters are from record NC_065537.1:
Cluster 13Saccharide8409454284431528336.99Glyco_hydro_1, UDPGT_2*saccharide, small phenolic-2--
The following clusters are from record NC_065538.1:
Cluster 14Terpene37713003941185169.88Lipoxygenase, Prenyltrans, p450---
Cluster 15Transporter_associated2452953725095794566.262OG-FeII_Oxy, ABC2_membrane, ABC_tran, Aminotran_1_2, DIOX_N, p450---
Cluster 16Phenolamide6251368562637370123.69Aminotran_1_2, Pyridoxal_deC, Transferase---
Cluster 17Alkaloid6662248967359978737.49BBE, FAD_binding_4, Methyltransf_11---
Cluster 18Terpene7934454379895040550.50Epimerase, Terpene_synth, Terpene_synth_C---
Cluster 19Saccharide8470704184861302154.26GMC_oxred_C, GMC_oxred_N, Glycos_transf_2, UDPGT_2---
Cluster 20Terpene-Saccharide8666931687536970867.65Epimerase, Glyco_hydro_1, Prenyltrans, SQHop_cyclase_C, SQHop_cyclase_N, Transferase, p450beta-amyrin-5, triterpene-5, Cycloartenol-2yossoside I/yossoside II/yossoside III/yossoside IV/yossos... (100% of genes show similarity)BGC0002402.2_c1
The following clusters are from record NC_065539.1:
Cluster 21Saccharide1862588718767581141.69Glycos_transf_1, UDPGT_2flavonoid, small phenolic--
Cluster 22Saccharide4262473442751251126.522OG-FeII_Oxy, DIOX_N, Glyco_hydro_1, Peptidase_S10---
Cluster 23Transporter_associated5268135752925846244.49Acetyltransf_1, LTP_2, p450---
The following clusters are from record NC_065540.1:
Cluster 24Putative73604987527730167.232OG-FeII_Oxy, DIOX_N, Dimerisation, Methyltransf_2, Peptidase_S10, p450---
Cluster 25Saccharide3472776135293114565.35UDPGT_2, p450-cucurbitacin C (33% of genes show similarity)BGC0001315.3_c1
Cluster 26Alkaloid-Phenolamide3581279235953745140.95Methyltransf_11, Methyltransf_7, Pyridoxal_deC, Transferase---
Cluster 27Putative3787010138335652465.55HMGL-like, Peptidase_S10, Transferase, adh_short---
The following clusters are from record NC_065541.1:
Cluster 28Terpene-Saccharide1515038415711951561.57Epimerase, Glyco_hydro_1, Methyltransf_11, Terpene_synth, Terpene_synth_C---
Cluster 29Terpene-Alkaloid1951837219995729477.36Amino_oxidase, Bet_v_1, Methyltransf_11, Prenyltrans, SQHop_cyclase_C, SQHop_cyclase_N, Terpene_synth, Terpene_synth_Clupeol--
Cluster 30Saccharide2030443320558024253.59Glyco_hydro_1, adh_short, adh_short_C2---
Cluster 31Lignan-Saccharide3442503435100289675.25Cellulose_synt, Dirigent, p450---
Cluster 32Putative5410864554573919465.27ADH_N, Abhydrolase_3, p450---
Cluster 33Putative5629502256908444613.42Amino_oxidase, Peptidase_S10, Transferase, UbiA---
Cluster 34Fatty_acid6548448165619458134.982OG-FeII_Oxy, DIOX_N, FA_desaturase---
The following clusters are from record NC_065542.1:
Cluster 35Putative1591012716059704149.58Methyltransf_11, p450-casbene/5a-hydroxy-casbene/5-keto-casbene/5-keto-7,8-epoxy... (11% of genes show similarity)BGC0002393.2_c1
Cluster 36Saccharide1732179617478480156.68Aminotran_1_2, Glycos_transf_2, SE---
Cluster 37Alkaloid-Saccharide248194362488739867.96AMP-binding, BBE, Cellulose_synt, FAD_binding_4---
The following clusters are from record NC_065543.1:
Cluster 38Fatty_acid4094351341138250194.742OG-FeII_Oxy, DIOX_N, FA_desaturase_2, Methyltransf_11, p450---
Cluster 39Cyclopeptide4516703345809661642.63BURP---
Cluster 40Saccharide5134109751517369176.27AMP-binding, Glyco_hydro_1---
The following clusters are from record NC_065544.1:
Cluster 41Saccharide-Polyketide79406298331729391.10Chal_sti_synt_C, FAE1_CUT1_RppA, Glycos_transf_1, p450-QS-21 (11% of genes show similarity)BGCMANUAL2.1_c1
Cluster 42Cyclopeptide9781704111744901392.79BURP---
Cluster 43Cyclopeptide18110417193714841261.07BURP---
Cluster 44Cyclopeptide1872047419235053514.58BURP---
Cluster 45Transporter_associated-Saccharide21788195230002621212.07ABC2_membrane, ABC_tran, DAHP_synth_2, UDPGT_2, UbiA, p450cyanogenic glucoside-8, monoterpenoid-8--
Cluster 46Cyclopeptide4406447844857511793.03BURP---
The following clusters are from record NC_065545.1:
Cluster 47Cyclopeptide1557949416395498816.00BURP---
Cluster 48Saccharide1662006516734829114.76Epimerase, Glycos_transf_2, UDPGT_2---
Cluster 49Transporter_associated-Alkaloid3071122431262929551.71Cu_amine_oxid, Epimerase, LTP_2, p450---
Cluster 50Saccharide4281021343250549440.34AMP-binding, Epimerase, Methyltransf_11, UDPGT_2cyanogenic glucoside, monoterpenoid--

NC_065534 - Cluster 1 - Saccharide

Gene cluster description

NC_065534 - Gene Cluster 1. Type = saccharide. Location: 6786422 - 8131941 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065534 - Cluster 2 - Cyclopeptide

Gene cluster description

NC_065534 - Gene Cluster 2. Type = cyclopeptide. Location: 18159039 - 18882722 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_065534 - Cluster 3 - Fatty_acid

Gene cluster description

NC_065534 - Gene Cluster 3. Type = fatty_acid. Location: 29042841 - 29608647 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065535 - Cluster 4 - Saccharide

Gene cluster description

NC_065535 - Gene Cluster 4. Type = saccharide. Location: 14862192 - 15054933 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065535 - Cluster 5 - Saccharide-fatty_acid

Gene cluster description

NC_065535 - Gene Cluster 5. Type = saccharide-fatty_acid. Location: 26121267 - 26669693 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065535 - Cluster 6 - Lignan

Gene cluster description

NC_065535 - Gene Cluster 6. Type = lignan. Location: 32823561 - 33032434 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065535 - Cluster 7 - Fatty_acid

Gene cluster description

NC_065535 - Gene Cluster 7. Type = fatty_acid. Location: 36830099 - 37080945 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065535 - Cluster 8 - Saccharide

Gene cluster description

NC_065535 - Gene Cluster 8. Type = saccharide. Location: 60340650 - 60615683 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065535 - Cluster 9 - Fatty_acid

Gene cluster description

NC_065535 - Gene Cluster 9. Type = fatty_acid. Location: 80865025 - 81053783 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065536 - Cluster 10 - Putative

Gene cluster description

NC_065536 - Gene Cluster 10. Type = putative. Location: 51456915 - 52198624 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_065536 - Cluster 11 - Alkaloid

Gene cluster description

NC_065536 - Gene Cluster 11. Type = alkaloid. Location: 54863178 - 55195437 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065536 - Cluster 12 - Cyclopeptide

Gene cluster description

NC_065536 - Gene Cluster 12. Type = cyclopeptide. Location: 56116042 - 57785052 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126718480
Repeat occurs 5 times in a sequence of 547 amino acids
Location between 56333494 and 56338892
Coverage of 6.4 %
Instances:
KPKSPPH | KPKSPPH | KPKSPPH | KPKAPPL | KPKEEFP

pattern: KPK[ASE][PE][PF][LHP]
MMRGLSLRSLTFMLFIAFILCSSNFETCNARRGKHWRHSRGISASLFKKKGKSQGSSNNHHSG
GSKPKSPPHKASPLPTPKPKSPPHKASPLPTPKPKSPPHSHKPKAPPLPTPKPKEEFPSNPPAG
HSTTFNVLDFGAKGDGSTDDTKAFQAAWAAACKIEASMIVVPAEYEFLVGPISFSGPYCQANIV
FQLDGTIIAPTNSNAWGKGLLQWLEFTKLRGITIQGNGIIDGRGSVWWQDAPFDDPIDDEIKLL
VPLNSSVEENPPMPVRSNLGRIMPSIKPTALRFYGSFNVTVTGITIQSSPQCHLKFDYCTGVLV
HDISVSSPGDSPNTDGIHLQGSKDVLIYSCNLACGDDCVSIQTGCSNVYIHNVNCGPGHGISIG
SLGKDNTKACVSNITVRDVIMHNTMNGVRIKTWQGGSGSVQGVLFSNIQVSEVEFPIVIDQYYC
DKSKCSNQTSAVALSGINYERIRGTYTVKPVHFACSDSLPCVDVSLTAIQLKPMQEKYHMYDPF
CWQTFGELRTPTMPPIGCLQIGKPSSNRIQSDHDLC
Repeat found in LOC126718527
Repeat occurs 4 times in a sequence of 593 amino acids
Location between 57415404 and 57429473
Coverage of 5.4 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DAP][DAP][AEP][LPE]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTATMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGVSNGFREESTSSQANSGSTFASRLCQKMEENIICGETEAFKEELSVLSRCDRRKMTSK
EELSQSRRHRGRSSSKQWSSQRHNKLSKHALLNANKKRRASPTSSLHQNGEHWSSSFAKKLKVV
WLPLNLKDAFQVHPNGSRHKKEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMKDAKITY
PSRDDPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYFYNKLKE
AVSYKGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPIILHLDS
LRLHSSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAVPQQKNDSDCG
LFVLFFMERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFKKASDVNCDSESS
PFTKGDDPANIDHQTDSD
Repeat found in LOC126718527
Repeat occurs 4 times in a sequence of 594 amino acids
Location between 57415404 and 57429473
Coverage of 5.39 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DAP][DAP][AEP][LPE]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTATMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGVSNGFREESTSSQANSGSTFASRLCQKMEENQIICGETEAFKEELSVLSRCDRRKMTS
KEELSQSRRHRGRSSSKQWSSQRHNKLSKHALLNANKKRRASPTSSLHQNGEHWSSSFAKKLKV
VWLPLNLKDAFQVHPNGSRHKKEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMKDAKIT
YPSRDDPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYFYNKLK
EAVSYKGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPIILHLD
SLRLHSSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAVPQQKNDSDC
GLFVLFFMERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFKKASDVNCDSES
SPFTKGDDPANIDHQTDSD
Repeat found in LOC126718527
Repeat occurs 4 times in a sequence of 586 amino acids
Location between 57415404 and 57429473
Coverage of 5.46 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DAP][DAP][AEP][LPE]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTATMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENIICGETEAFKEELSVLSRCDRRKM
TSKEELSQSRRHRGRSSSKQWSSQRHNKLSKHALLNANKKRRASPTSSLHQNGEHWSSSFAKKK
DAFQVHPNGSRHKKEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMKDAKITYPSRDDPE
SIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYFYNKLKEAVSYKGS
DKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPIILHLDSLRLHSSR
SIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAVPQQKNDSDCGLFVLFFM
ERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFKKASDVNCDSESSPFTKGDD
PANIDHQTDSD
Repeat found in LOC126718527
Repeat occurs 4 times in a sequence of 596 amino acids
Location between 57415404 and 57429473
Coverage of 5.37 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DAP][DAP][AEP][LPE]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTATMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENIICGETEAFKEELSVLSRCDRRKM
TSKEELSQSRRHRGRSSSKQWSSQRHNKLSKHALLNANKKRRASPTSSLHQNGEHWSSSFAKKL
KVVWLPLNLKDAFQVHPNGSRHKKEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMKDAK
ITYPSRDDPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYFYNK
LKEAVSYKGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPIILH
LDSLRLHSSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAVPQQKNDS
DCGLFVLFFMERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFKKASDVNCDS
ESSPFTKGDDPANIDHQTDSD
Repeat found in LOC126718527
Repeat occurs 4 times in a sequence of 587 amino acids
Location between 57415404 and 57429473
Coverage of 5.45 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DAP][DAP][AEP][LPE]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTATMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENQIICGETEAFKEELSVLSRCDRRK
MTSKEELSQSRRHRGRSSSKQWSSQRHNKLSKHALLNANKKRRASPTSSLHQNGEHWSSSFAKK
KDAFQVHPNGSRHKKEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMKDAKITYPSRDDP
ESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYFYNKLKEAVSYKG
SDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPIILHLDSLRLHSS
RSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAVPQQKNDSDCGLFVLFF
MERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFKKASDVNCDSESSPFTKGD
DPANIDHQTDSD
Repeat found in LOC126718527
Repeat occurs 4 times in a sequence of 597 amino acids
Location between 57415404 and 57429473
Coverage of 5.36 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DAP][DAP][AEP][LPE]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTATMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENQIICGETEAFKEELSVLSRCDRRK
MTSKEELSQSRRHRGRSSSKQWSSQRHNKLSKHALLNANKKRRASPTSSLHQNGEHWSSSFAKK
LKVVWLPLNLKDAFQVHPNGSRHKKEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMKDA
KITYPSRDDPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYFYN
KLKEAVSYKGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPIIL
HLDSLRLHSSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAVPQQKND
SDCGLFVLFFMERFIEEAPERLKKKDLAMFGRQWFKPEEASGLRVKIRKILMLEFKKASDVNCD
SESSPFTKGDDPANIDHQTDSD
Repeat found in LOC126718527
Repeat occurs 4 times in a sequence of 512 amino acids
Location between 57417182 and 57429473
Coverage of 6.25 %
Instances:
DDDDDDAP | DDDDDAPP | DDDDAPPE | DDDAPPEL |
pattern: DDD[DA][DAP][DAP][AEP][LPE]
MEEDRAKKRPLDLDWEKLLPNSSQNDDDDDDAPPELIVKSSTTTATMGSGDERHEDFEKWSDR
MLEEQIERKRRTCETMGPRMPDKGEKLRLTLKRIEDERERRKLARPQKLNSSYAYEGDGCEKPT
QSRSSGTVGVSNGFREESTSSQANSGSTFASRLCQKMEENQIICGETEAFKEELSVLSRCDRRK
MTSKEELSQSRRHRGRSSSKQWSSQRHNKLSKHALLNANKKRRASPTSSLHQNGEHWSSSFAKK
LKVVWLPLNLKDAFQVHPNGSRHKKEQTVVLIDEDESQSLIEEDESESLEITERVDNAECMKDA
KITYPSRDDPESIEICYVDINCLAPEGYLTSTIMNFYIRYIQLQASPTNRAMCDYHFFNTYFYN
KLKEAVSYKGSDKDTYFAKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICIPDKEDETGPIIL
HLDSLRLHSSRSIFENIKSFMKAEWTYLASEVAPLNLPIADKIWKHLPRRIEEKAIAEWHLCPA
I

Similar gene clusters

Similar known gene clusters

NC_065537 - Cluster 13 - Saccharide

Gene cluster description

NC_065537 - Gene Cluster 13. Type = saccharide. Location: 84094542 - 84431528 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065538 - Cluster 14 - Terpene

Gene cluster description

NC_065538 - Gene Cluster 14. Type = terpene. Location: 3771300 - 3941185 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065538 - Cluster 15 - Transporter_associated

Gene cluster description

NC_065538 - Gene Cluster 15. Type = transporter_associated. Location: 24529537 - 25095794 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065538 - Cluster 16 - Phenolamide

Gene cluster description

NC_065538 - Gene Cluster 16. Type = phenolamide. Location: 62513685 - 62637370 nt. Click on genes for more information.
Show pHMM detection rules used
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065538 - Cluster 17 - Alkaloid

Gene cluster description

NC_065538 - Gene Cluster 17. Type = alkaloid. Location: 66622489 - 67359978 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065538 - Cluster 18 - Terpene

Gene cluster description

NC_065538 - Gene Cluster 18. Type = terpene. Location: 79344543 - 79895040 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065538 - Cluster 19 - Saccharide

Gene cluster description

NC_065538 - Gene Cluster 19. Type = saccharide. Location: 84707041 - 84861302 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065538 - Cluster 20 - Terpene-saccharide

Gene cluster description

NC_065538 - Gene Cluster 20. Type = terpene-saccharide. Location: 86669316 - 87536970 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_065539 - Cluster 21 - Saccharide

Gene cluster description

NC_065539 - Gene Cluster 21. Type = saccharide. Location: 18625887 - 18767581 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065539 - Cluster 22 - Saccharide

Gene cluster description

NC_065539 - Gene Cluster 22. Type = saccharide. Location: 42624734 - 42751251 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065539 - Cluster 23 - Transporter_associated

Gene cluster description

NC_065539 - Gene Cluster 23. Type = transporter_associated. Location: 52681357 - 52925846 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065540 - Cluster 24 - Putative

Gene cluster description

NC_065540 - Gene Cluster 24. Type = putative. Location: 7360498 - 7527730 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065540 - Cluster 25 - Saccharide

Gene cluster description

NC_065540 - Gene Cluster 25. Type = saccharide. Location: 34727761 - 35293114 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_065540 - Cluster 26 - Alkaloid-phenolamide

Gene cluster description

NC_065540 - Gene Cluster 26. Type = alkaloid-phenolamide. Location: 35812792 - 35953745 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065540 - Cluster 27 - Putative

Gene cluster description

NC_065540 - Gene Cluster 27. Type = putative. Location: 37870101 - 38335652 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065541 - Cluster 28 - Terpene-saccharide

Gene cluster description

NC_065541 - Gene Cluster 28. Type = terpene-saccharide. Location: 15150384 - 15711951 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065541 - Cluster 29 - Terpene-alkaloid

Gene cluster description

NC_065541 - Gene Cluster 29. Type = terpene-alkaloid. Location: 19518372 - 19995729 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065541 - Cluster 30 - Saccharide

Gene cluster description

NC_065541 - Gene Cluster 30. Type = saccharide. Location: 20304433 - 20558024 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065541 - Cluster 31 - Lignan-saccharide

Gene cluster description

NC_065541 - Gene Cluster 31. Type = lignan-saccharide. Location: 34425034 - 35100289 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065541 - Cluster 32 - Putative

Gene cluster description

NC_065541 - Gene Cluster 32. Type = putative. Location: 54108645 - 54573919 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065541 - Cluster 33 - Putative

Gene cluster description

NC_065541 - Gene Cluster 33. Type = putative. Location: 56295022 - 56908444 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065541 - Cluster 34 - Fatty_acid

Gene cluster description

NC_065541 - Gene Cluster 34. Type = fatty_acid. Location: 65484481 - 65619458 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065542 - Cluster 35 - Putative

Gene cluster description

NC_065542 - Gene Cluster 35. Type = putative. Location: 15910127 - 16059704 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_065542 - Cluster 36 - Saccharide

Gene cluster description

NC_065542 - Gene Cluster 36. Type = saccharide. Location: 17321796 - 17478480 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065542 - Cluster 37 - Alkaloid-saccharide

Gene cluster description

NC_065542 - Gene Cluster 37. Type = alkaloid-saccharide. Location: 24819436 - 24887398 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065543 - Cluster 38 - Fatty_acid

Gene cluster description

NC_065543 - Gene Cluster 38. Type = fatty_acid. Location: 40943513 - 41138250 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065543 - Cluster 39 - Cyclopeptide

Gene cluster description

NC_065543 - Gene Cluster 39. Type = cyclopeptide. Location: 45167033 - 45809661 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126704228
Repeat occurs 3 times in a sequence of 347 amino acids
Location between 45171748 and 45173516
Coverage of 11.24 %
Instances:
AGLKIINPFSHGF | AGLKMITPRQLNS | AGLKMINPFSHGF |
pattern: AGLK[MI]I[NT]P[FR][SQ][LH][GN][FS]
MDSVVVRKVWKENLVKEFELLKVALRSHRMVSIDTEFPGVVYRPANVDGHDLGKLPPNWNYQV
ISYVARLGLALCDDKGCLPDFGTKSQYVWEFNFKDFDVYNDLQNPESIELLERQGIDFDKNLKE
GIDSADFAALMLKSGLLGNHSAFTWVTFHGAYDIAHLMKILIRRPLPYDLTGFMIMVQSIFGKR
LFDLKHMMKFCDGLYGGLEKVANALGVQRLAGRSHQAGSDTLLTLQTFKKFLDVYFKEQNNGGL
GVRHNGHLLTRMQCVLHGLELNNGFHQSNNAGLKIINPFSHGFRRLNLNSAGLKMITPRQLNSA
GLKMINPFSHGF
HVINLNLCRQTSAICN
Repeat found in LOC126703264
Repeat occurs 4 times in a sequence of 842 amino acids
Location between 45565976 and 45568505
Coverage of 3.8 %
Instances:
KCGAVEDA | KCGDVRTA | KCGDPETG | KCGDIDSS |
pattern: KCG[AD][VPI][RED][DST][GAS]
MAILTPLGSLNGHCTEQSYTLRPFKVTHYSLSHALYSKRASSLCYSTKFQNCSTSIQFQENPS
QKSIVKTHLLHNQPSNSVSSASKLIKECVDDGLFENAIRVYLKMFECDGFQAEKFRFFPCLIKA
FGGLGDVDKVKQVHGHVLNLGVLNDVSVGNSLLSVYLKCGAVEDAVQMFEEMGERDPVSWNSMI
SGFYQSGDYMGALITFSMMIWEYGVYPNRVACLSALSSCSSIKSLIRGREIHGFVVKSGLNVEE
FLVSGLIEMYMKCGDVRTAEHVFKGILDNESSRGNAVIWNVMILGYVSNGCLLQALDLFLDMLA
MGTKPDSSTLVAVLVLCSQLSDLAVGKQMHGCIVSFGLDYNVRVETALIDMYFKCGDPETGLKI
FRSSQNRNLVMWGAVISNCAQGGCAAEALELFYTFLSKYGFADSVMLLAVLRACSFLTLNSRGM
EIHGLAIKTGFDDDMFIGGALVDMYAKCQDIESAQKVFQRLPARDLVLWNSLISGYAQNECADK
ALKAFCEMQSEQIAPNSVTASCILSVCARFSVMILCKEIHCYLLRRGIESNVSVSNSLIATYAK
CGDIDSS
TTLFEKMLERDKVSWNSIILGFGMHGHTDEMFVLFKKMTETGMKPDHATFTALLSAC
SHTGRVDMGLKCFKSMVEDHKLEPRVEQYTCIVDLLGRAGHLKQAYDLIMTMPCVPDDLIWGSL
LGSCKTHGNEKLAELVSKHIFELDPTCISYRVLLSNIYEGFGKQNEVTRVRSEIKDLRLKKQPG
CSWIEVDHKIHSFISGDHSHHQSKEIYGAIESLTTEMKRVGYNPRTQNVIGSDESNVEGINSSK
DIDLLYNKSVC
Repeat found in LOC126704231
Repeat occurs 4 times in a sequence of 581 amino acids
Location between 45589276 and 45591261
Coverage of 6.2 %
Instances:
AAATQADGA | AAASQADGA | AAAATQAEA | AAATQAEAI |
pattern: AAA[AST][QT][AQ][DAE][GAE][AI]
MDSSISTQADGAAATQADGAAASQADGATATATATQEAAAATQAEAIDGELPLVPPSVVSKTG
TGSGRKKSLAWNHFEKVKVDDGVTMAVCNYCKKSYLADSKSCGTSNLLAHVTICPKNPNREDKG
QKTLAFEPKNDGDEGFKLVSTTFSVEASRKALAEMIIIDELPFRCVEGYGFKKYVTTLQPKLRV
KDIPSRQTVARDVIGIYNSEREKLRKSLMGCRVCLTMDTWTSLQNLNYMCLTCHFIDDTWKLHK
RILNFCQVEDHKGETIGRKIEMSLREWGIDGIFTLTVDNASSNLTTVKFLQRVTKDWNGTVLGN
ELMHMRCCAHILNLIVGEGLKEIDASVGRVREAVRYVKSSPNRNQTFRNFMERLGMESKSLLCL
DVPTRWNSTYLMLETAKKFEKVFLRMDFEDDGYSSYFRSKEDSGGLGSPCMSDFQNCRAFVTFL
RLFYNATKKISGSLYVTSNAFFDEIFVIQEMGSVMVDKVKALLMKLYNFFCSVNSPNVEEPSGG
ERTPMVVGDASDPYVMVHSRYELFLEAEQSIGCSNEVDKYLAENCDGRRDGNFEVLGWWKDNSN
SFNCCI

Similar gene clusters

NC_065543 - Cluster 40 - Saccharide

Gene cluster description

NC_065543 - Gene Cluster 40. Type = saccharide. Location: 51341097 - 51517369 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065544 - Cluster 41 - Saccharide-polyketide

Gene cluster description

NC_065544 - Gene Cluster 41. Type = saccharide-polyketide. Location: 7940629 - 8331729 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_065544 - Cluster 42 - Cyclopeptide

Gene cluster description

NC_065544 - Gene Cluster 42. Type = cyclopeptide. Location: 9781704 - 11174490 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126704583
Repeat occurs 4 times in a sequence of 381 amino acids
Location between 10454009 and 10455671
Coverage of 6.3 %
Instances:
DSAIDS | DSAVDS | DSAIDS | DSATTK |
pattern: DSA[VIT][DT][KS]
MMHTKLSPMTSLLLFISLSSLTVGSTGENPFTPKTYLIRYWKKEIQNNQAKPIFLLSKASPLS
ATESASFSKLAAQNALSSRFSAFCASANLLCFPDSLPSTFGQKVSTEVVLQKNYGDSAIDSMDV
VLQKNYGDSAVDSMDVVLQKNYGDSAIDSATTKGGSGRFFRESMLKKGKVIPMPDISDKFPERS
FLPRSISSKQPFSTLKVSELKHIFHAGDNSTMEIMIVKSLSECKRAPNPGETKRCVSSAEDMID
FAVSVLGRNIDILTMKNVNGSKQSIMIGSIEGINGGKVTDSVTCHQLLFPYLLYYCHSVPKVRV
YQVDILDPNSKAKLNNGVVICHLNTSGWNPTHEAFLALGSAPGRIEVCHWVYQNDVTWTVVD
Repeat found in LOC126704583
Repeat occurs 4 times in a sequence of 381 amino acids
Location between 10454009 and 10455671
Coverage of 6.3 %
Instances:
DSAIDS | DSAVDS | DSAIDS | DSATTK |
pattern: DSA[VIT][DT][KS]
MMHTKLSPMTSLLLFISLSSLTVGSTGENPFTPKTYLIRYWKKEIQNNQAKPIFLLSKASPLS
ATESASFSKLAAQNALSSRFSAFCASANLLCFPDSLPSTFGQKVSTEVVLQKNYGDSAIDSMDV
VLQKNYGDSAVDSMDVVLQKNYGDSAIDSATTKGGSGRFFRESMLKKGKVIPMPDISDKFPERS
FLPRSISSKQPFSTLKVSELKHIFHAGDNSTMEIMIVKSLSECKRAPNPGETKRCVSSAEDMID
FAVSVLGRNIDILTMKNVNGSKQSIMIGSIEGINGGKVTDSVTCHQLLFPYLLYYCHSVPKVRV
YQVDILDPNSKAKLNNGVVICHLNTSGWNPTHEAFLALGSAPGRIEVCHWVYQNDVTWTVVD
Repeat found in LOC126704582
Repeat occurs 4 times in a sequence of 381 amino acids
Location between 10508538 and 10510200
Coverage of 6.3 %
Instances:
DSAIDS | DSAVDS | DSAIDS | DSATTK |
pattern: DSA[VIT][DT][KS]
MMHTKLSPMTSLLLFISLSSLTVGSTGENPFTPKTYLIRYWKKEIQNNQAKPIFLLSKASPLS
ATESASFSKLAAQNALSSRFSAFCASANLLCFPDSLPSTFGQKVSTEVVLQKNYGDSAIDSMDV
VLQKNYGDSAVDSMDVVLQKNYGDSAIDSATTKGGSGRFFRESMLKKGKVIPMPDISDKFPERS
FLPRSISSKQPFSTLKVSELKHIFHAGDNSTMEIMIVKSLSECKRAPNPGETKRCVSSAEDMID
FAVSVLGRNIDILTMKNVNGSKQSIMIGSIEGINGGKVTDSVTCHQLLFPYLLYYCHSVPKVRV
YQVDILDPNSKAKLNNGVVICHLNTSGWNPTHEAFLALGSAPGRIEVCHWVYQNDVTWTVVD
Repeat found in LOC126704582
Repeat occurs 4 times in a sequence of 381 amino acids
Location between 10508538 and 10510200
Coverage of 6.3 %
Instances:
DSAIDS | DSAVDS | DSAIDS | DSATTK |
pattern: DSA[VIT][DT][KS]
MMHTKLSPMTSLLLFISLSSLTVGSTGENPFTPKTYLIRYWKKEIQNNQAKPIFLLSKASPLS
ATESASFSKLAAQNALSSRFSAFCASANLLCFPDSLPSTFGQKVSTEVVLQKNYGDSAIDSMDV
VLQKNYGDSAVDSMDVVLQKNYGDSAIDSATTKGGSGRFFRESMLKKGKVIPMPDISDKFPERS
FLPRSISSKQPFSTLKVSELKHIFHAGDNSTMEIMIVKSLSECKRAPNPGETKRCVSSAEDMID
FAVSVLGRNIDILTMKNVNGSKQSIMIGSIEGINGGKVTDSVTCHQLLFPYLLYYCHSVPKVRV
YQVDILDPNSKAKLNNGVVICHLNTSGWNPTHEAFLALGSAPGRIEVCHWVYQNDVTWTVVD
Repeat found in LOC126704582
Repeat occurs 4 times in a sequence of 381 amino acids
Location between 10508538 and 10510200
Coverage of 6.3 %
Instances:
DSAIDS | DSAVDS | DSAIDS | DSATTK |
pattern: DSA[VIT][DT][KS]
MMHTKLSPMTSLLLFISLSSLTVGSTGENPFTPKTYLIRYWKKEIQNNQAKPIFLLSKASPLS
ATESASFSKLAAQNALSSRFSAFCASANLLCFPDSLPSTFGQKVSTEVVLQKNYGDSAIDSMDV
VLQKNYGDSAVDSMDVVLQKNYGDSAIDSATTKGGSGRFFRESMLKKGKVIPMPDISDKFPERS
FLPRSISSKQPFSTLKVSELKHIFHAGDNSTMEIMIVKSLSECKRAPNPGETKRCVSSAEDMID
FAVSVLGRNIDILTMKNVNGSKQSIMIGSIEGINGGKVTDSVTCHQLLFPYLLYYCHSVPKVRV
YQVDILDPNSKAKLNNGVVICHLNTSGWNPTHEAFLALGSAPGRIEVCHWVYQNDVTWTVVD
Repeat found in LOC126704582
Repeat occurs 4 times in a sequence of 381 amino acids
Location between 10508538 and 10510200
Coverage of 6.3 %
Instances:
DSAIDS | DSAVDS | DSAIDS | DSATTK |
pattern: DSA[VIT][DT][KS]
MMHTKLSPMTSLLLFISLSSLTVGSTGENPFTPKTYLIRYWKKEIQNNQAKPIFLLSKASPLS
ATESASFSKLAAQNALSSRFSAFCASANLLCFPDSLPSTFGQKVSTEVVLQKNYGDSAIDSMDV
VLQKNYGDSAVDSMDVVLQKNYGDSAIDSATTKGGSGRFFRESMLKKGKVIPMPDISDKFPERS
FLPRSISSKQPFSTLKVSELKHIFHAGDNSTMEIMIVKSLSECKRAPNPGETKRCVSSAEDMID
FAVSVLGRNIDILTMKNVNGSKQSIMIGSIEGINGGKVTDSVTCHQLLFPYLLYYCHSVPKVRV
YQVDILDPNSKAKLNNGVVICHLNTSGWNPTHEAFLALGSAPGRIEVCHWVYQNDVTWTVVD
Repeat found in LOC126704582
Repeat occurs 4 times in a sequence of 381 amino acids
Location between 10508538 and 10510200
Coverage of 6.3 %
Instances:
DSAIDS | DSAVDS | DSAIDS | DSATTK |
pattern: DSA[VIT][DT][KS]
MMHTKLSPMTSLLLFISLSSLTVGSTGENPFTPKTYLIRYWKKEIQNNQAKPIFLLSKASPLS
ATESASFSKLAAQNALSSRFSAFCASANLLCFPDSLPSTFGQKVSTEVVLQKNYGDSAIDSMDV
VLQKNYGDSAVDSMDVVLQKNYGDSAIDSATTKGGSGRFFRESMLKKGKVIPMPDISDKFPERS
FLPRSISSKQPFSTLKVSELKHIFHAGDNSTMEIMIVKSLSECKRAPNPGETKRCVSSAEDMID
FAVSVLGRNIDILTMKNVNGSKQSIMIGSIEGINGGKVTDSVTCHQLLFPYLLYYCHSVPKVRV
YQVDILDPNSKAKLNNGVVICHLNTSGWNPTHEAFLALGSAPGRIEVCHWVYQNDVTWTVVD

Similar gene clusters

NC_065544 - Cluster 43 - Cyclopeptide

Gene cluster description

NC_065544 - Gene Cluster 43. Type = cyclopeptide. Location: 18110417 - 19371484 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126706475
Repeat occurs 4 times in a sequence of 857 amino acids
Location between 18653279 and 18655853
Coverage of 4.2 %
Instances:
LIDFYAKNG | LIDFYAKCG | LIDMYAKCN | LIDVYSKCS |
pattern: LID[MFV]Y[AS]K[NC][GNS]
MSMSMSMRKHWQFTYQHYHLTLRHCSHKHHAFHFHSSTTLSPPPHLLQPLPRRAKSHQFAHFL
QLPIPPSHSHNPILYYQTLHAQLVVSGLHNQVFLANILLHSYSKCSRLRDARLLFDVMSQRNLI
SWSSLVSMYTRHGFNQQAFIVFTEFLTSCDQNPNEYILASVFRACSQLGGVYEGAQLHGFVVKT
GFEQDVYVGTSLIDFYAKNGDLDEARLVFDVLAEKTAVTWTTIITGYTKSGRGEVSLQLFDQMR
ETDLVPDKYVLSSVLSACSMLEFLDGGKQIHAYVLRRGAEMDVSVTNVLIDFYAKCGLVYTARN
LFDQMVVKNMIAWTTMIAGYMKNSFNLEAMKLFAEMAGLGWKPDGFACTSVLTSCGSLEALEHG
RQVHGYTIKANLEYDNFVKNGLIDMYAKCNSLTDARRVFEGMADHNVVSYNAMIEGYSSEEKLH
EALDLFREMRLRLLPPSLLTFVSLLGASAALLTLELSRQIHSLIIKYGVSLDVFAGSALIDVYS
KCS
CVRDARLVFNEINEKDIVVWNAMFFGYTQQLENEEALKLYSELQLTRKTPNEFTFAALITA
ASNLASLQHGQQLHNQVIKLGADSDPFVTNSLVDMYAKCGCIKEAHKTFTSTFWRDIVCWNSMI
STYAQHGEAEDALQMFKGMIDEGVKPNYVTFVGVLSACSHAGLVEDGLHHFESMAHFGIEPGTD
HYTCIVSLLGRAGRLTEAREFIDKMPIKPAAVVWRSLLSACRVAGHVDIGRYAAEMAISVDPTD
SGSYILLSNIFASKGMWEDVKRLRKRMDFNGVVKEPGSSWIELNKDVHSFIARDRAHSMANLIY
SVLDNLILQIKGVGYVPDTFMLLMTD
Repeat found in LOC126706475
Repeat occurs 4 times in a sequence of 857 amino acids
Location between 18653279 and 18655853
Coverage of 4.2 %
Instances:
LIDFYAKNG | LIDFYAKCG | LIDMYAKCN | LIDVYSKCS |
pattern: LID[MFV]Y[AS]K[NC][GNS]
MSMSMSMRKHWQFTYQHYHLTLRHCSHKHHAFHFHSSTTLSPPPHLLQPLPRRAKSHQFAHFL
QLPIPPSHSHNPILYYQTLHAQLVVSGLHNQVFLANILLHSYSKCSRLRDARLLFDVMSQRNLI
SWSSLVSMYTRHGFNQQAFIVFTEFLTSCDQNPNEYILASVFRACSQLGGVYEGAQLHGFVVKT
GFEQDVYVGTSLIDFYAKNGDLDEARLVFDVLAEKTAVTWTTIITGYTKSGRGEVSLQLFDQMR
ETDLVPDKYVLSSVLSACSMLEFLDGGKQIHAYVLRRGAEMDVSVTNVLIDFYAKCGLVYTARN
LFDQMVVKNMIAWTTMIAGYMKNSFNLEAMKLFAEMAGLGWKPDGFACTSVLTSCGSLEALEHG
RQVHGYTIKANLEYDNFVKNGLIDMYAKCNSLTDARRVFEGMADHNVVSYNAMIEGYSSEEKLH
EALDLFREMRLRLLPPSLLTFVSLLGASAALLTLELSRQIHSLIIKYGVSLDVFAGSALIDVYS
KCS
CVRDARLVFNEINEKDIVVWNAMFFGYTQQLENEEALKLYSELQLTRKTPNEFTFAALITA
ASNLASLQHGQQLHNQVIKLGADSDPFVTNSLVDMYAKCGCIKEAHKTFTSTFWRDIVCWNSMI
STYAQHGEAEDALQMFKGMIDEGVKPNYVTFVGVLSACSHAGLVEDGLHHFESMAHFGIEPGTD
HYTCIVSLLGRAGRLTEAREFIDKMPIKPAAVVWRSLLSACRVAGHVDIGRYAAEMAISVDPTD
SGSYILLSNIFASKGMWEDVKRLRKRMDFNGVVKEPGSSWIELNKDVHSFIARDRAHSMANLIY
SVLDNLILQIKGVGYVPDTFMLLMTD
Repeat found in LOC126706475
Repeat occurs 4 times in a sequence of 857 amino acids
Location between 18653279 and 18655853
Coverage of 4.2 %
Instances:
LIDFYAKNG | LIDFYAKCG | LIDMYAKCN | LIDVYSKCS |
pattern: LID[MFV]Y[AS]K[NC][GNS]
MSMSMSMRKHWQFTYQHYHLTLRHCSHKHHAFHFHSSTTLSPPPHLLQPLPRRAKSHQFAHFL
QLPIPPSHSHNPILYYQTLHAQLVVSGLHNQVFLANILLHSYSKCSRLRDARLLFDVMSQRNLI
SWSSLVSMYTRHGFNQQAFIVFTEFLTSCDQNPNEYILASVFRACSQLGGVYEGAQLHGFVVKT
GFEQDVYVGTSLIDFYAKNGDLDEARLVFDVLAEKTAVTWTTIITGYTKSGRGEVSLQLFDQMR
ETDLVPDKYVLSSVLSACSMLEFLDGGKQIHAYVLRRGAEMDVSVTNVLIDFYAKCGLVYTARN
LFDQMVVKNMIAWTTMIAGYMKNSFNLEAMKLFAEMAGLGWKPDGFACTSVLTSCGSLEALEHG
RQVHGYTIKANLEYDNFVKNGLIDMYAKCNSLTDARRVFEGMADHNVVSYNAMIEGYSSEEKLH
EALDLFREMRLRLLPPSLLTFVSLLGASAALLTLELSRQIHSLIIKYGVSLDVFAGSALIDVYS
KCS
CVRDARLVFNEINEKDIVVWNAMFFGYTQQLENEEALKLYSELQLTRKTPNEFTFAALITA
ASNLASLQHGQQLHNQVIKLGADSDPFVTNSLVDMYAKCGCIKEAHKTFTSTFWRDIVCWNSMI
STYAQHGEAEDALQMFKGMIDEGVKPNYVTFVGVLSACSHAGLVEDGLHHFESMAHFGIEPGTD
HYTCIVSLLGRAGRLTEAREFIDKMPIKPAAVVWRSLLSACRVAGHVDIGRYAAEMAISVDPTD
SGSYILLSNIFASKGMWEDVKRLRKRMDFNGVVKEPGSSWIELNKDVHSFIARDRAHSMANLIY
SVLDNLILQIKGVGYVPDTFMLLMTD
Repeat found in LOC126706475
Repeat occurs 4 times in a sequence of 857 amino acids
Location between 18653279 and 18655853
Coverage of 4.2 %
Instances:
LIDFYAKNG | LIDFYAKCG | LIDMYAKCN | LIDVYSKCS |
pattern: LID[MFV]Y[AS]K[NC][GNS]
MSMSMSMRKHWQFTYQHYHLTLRHCSHKHHAFHFHSSTTLSPPPHLLQPLPRRAKSHQFAHFL
QLPIPPSHSHNPILYYQTLHAQLVVSGLHNQVFLANILLHSYSKCSRLRDARLLFDVMSQRNLI
SWSSLVSMYTRHGFNQQAFIVFTEFLTSCDQNPNEYILASVFRACSQLGGVYEGAQLHGFVVKT
GFEQDVYVGTSLIDFYAKNGDLDEARLVFDVLAEKTAVTWTTIITGYTKSGRGEVSLQLFDQMR
ETDLVPDKYVLSSVLSACSMLEFLDGGKQIHAYVLRRGAEMDVSVTNVLIDFYAKCGLVYTARN
LFDQMVVKNMIAWTTMIAGYMKNSFNLEAMKLFAEMAGLGWKPDGFACTSVLTSCGSLEALEHG
RQVHGYTIKANLEYDNFVKNGLIDMYAKCNSLTDARRVFEGMADHNVVSYNAMIEGYSSEEKLH
EALDLFREMRLRLLPPSLLTFVSLLGASAALLTLELSRQIHSLIIKYGVSLDVFAGSALIDVYS
KCS
CVRDARLVFNEINEKDIVVWNAMFFGYTQQLENEEALKLYSELQLTRKTPNEFTFAALITA
ASNLASLQHGQQLHNQVIKLGADSDPFVTNSLVDMYAKCGCIKEAHKTFTSTFWRDIVCWNSMI
STYAQHGEAEDALQMFKGMIDEGVKPNYVTFVGVLSACSHAGLVEDGLHHFESMAHFGIEPGTD
HYTCIVSLLGRAGRLTEAREFIDKMPIKPAAVVWRSLLSACRVAGHVDIGRYAAEMAISVDPTD
SGSYILLSNIFASKGMWEDVKRLRKRMDFNGVVKEPGSSWIELNKDVHSFIARDRAHSMANLIY
SVLDNLILQIKGVGYVPDTFMLLMTD
Repeat found in LOC126706119
Repeat occurs 6 times in a sequence of 276 amino acids
Location between 19007579 and 19016942
Coverage of 17.39 %
Instances:
SRSRSPSY | SRSPSYRR | SRSISPRH | SRSPLRRH | SRSPTPRH
SRSSSLSP |
pattern: SRS[RSIP][LST][YPLR][RS][YPHR]
MPRDLSRSRSPSYRRRYSPSPVGHRSSRRSRRNRSQSPYSYSRRKSRSISPRHRRSRSPLRRH
KSRSPTPRHYKRQRSRSSSLSPIRKSSSSSLGSIERKNASEKLRIEEEEKKRRQQEAEQKLLEE
ETAKRVEEAIRRKVEESLLSKEIKLEIQRRLEEGRKRLHDEVAAQLEKEKEAAIIEARQREEQA
RKEKEELERLLEENRKRVEEAQRREALEQQRREEERYRELEELQRQKEEALRRKKQQEEEERSN
QLKLLGKNKSRPKLSFALGSK
Repeat found in LOC126706119
Repeat occurs 6 times in a sequence of 276 amino acids
Location between 19007579 and 19016942
Coverage of 17.39 %
Instances:
SRSRSPSY | SRSPSYRR | SRSISPRH | SRSPLRRH | SRSPTPRH
SRSSSLSP |
pattern: SRS[RSIP][LST][YPLR][RS][YPHR]
MPRDLSRSRSPSYRRRYSPSPVGHRSSRRSRRNRSQSPYSYSRRKSRSISPRHRRSRSPLRRH
KSRSPTPRHYKRQRSRSSSLSPIRKSSSSSLGSIERKNASEKLRIEEEEKKRRQQEAEQKLLEE
ETAKRVEEAIRRKVEESLLSKEIKLEIQRRLEEGRKRLHDEVAAQLEKEKEAAIIEARQREEQA
RKEKEELERLLEENRKRVEEAQRREALEQQRREEERYRELEELQRQKEEALRRKKQQEEEERSN
QLKLLGKNKSRPKLSFALGSK
Repeat found in LOC126706678
Repeat occurs 4 times in a sequence of 472 amino acids
Location between 19313557 and 19315139
Coverage of 12.71 %
Instances:
DQFDQDQFHQDQFHQ | DQFHQDQFHQDQFHQ | DQFHQDQFHQPREMV | DQFHQPREMVTALLR |
pattern: DQF[DH]Q[DP][RQ][FE][MH][VQ][DPT][RAQ][LFE][LMH][VRQ]
MNQERVIIFCGHMRDEPFVFSEDDGCCKTTMSWFSILAPEIRSPSSGTAPSSSPLRTLKSTLS
PFSVLKLPRRTFFTAVGSNIFIFHDSVNKILTFNNSSGDWKLAPPMNYPRTSPHVIVLDGKLYV
IGGLIHPIGGLKFPLPANYCFMEFFDPDIGTWESLPNPPFMDQFDQDQFHQDQFHQDQFHQPRE
MVTALLR
DRKILVTSYTESVLYVYYIDYRVWTLFSDLSYLSKRNLPHFPCLSNLSKGNFLLSPD
MVKSVGVGDMLYRVSFDHVDHPKHLVIQAFNLALDQWFEGCLNVGSEIFGEFNEVLEDGYPCCG
LIHLSDQKFCLLLQSSYTEDGNGNGNGDFDRRRSAYFKPHSTSSFYLYSVVLEVSMILDKPRDL
TGFMGLNIVVLSTEKYRLVSPLLIQDAMLVDITFHPSNMLALRCNESLLNEFGMARPSKAVCEE
LLQQHLVIAKRIWDGPTHEELLFNL

Similar gene clusters

NC_065544 - Cluster 44 - Cyclopeptide

Gene cluster description

NC_065544 - Gene Cluster 44. Type = cyclopeptide. Location: 18720474 - 19235053 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126706119
Repeat occurs 6 times in a sequence of 276 amino acids
Location between 19007579 and 19016942
Coverage of 17.39 %
Instances:
SRSRSPSY | SRSPSYRR | SRSISPRH | SRSPLRRH | SRSPTPRH
SRSSSLSP |
pattern: SRS[RSIP][LST][YPLR][RS][YPHR]
MPRDLSRSRSPSYRRRYSPSPVGHRSSRRSRRNRSQSPYSYSRRKSRSISPRHRRSRSPLRRH
KSRSPTPRHYKRQRSRSSSLSPIRKSSSSSLGSIERKNASEKLRIEEEEKKRRQQEAEQKLLEE
ETAKRVEEAIRRKVEESLLSKEIKLEIQRRLEEGRKRLHDEVAAQLEKEKEAAIIEARQREEQA
RKEKEELERLLEENRKRVEEAQRREALEQQRREEERYRELEELQRQKEEALRRKKQQEEEERSN
QLKLLGKNKSRPKLSFALGSK
Repeat found in LOC126706119
Repeat occurs 6 times in a sequence of 276 amino acids
Location between 19007579 and 19016942
Coverage of 17.39 %
Instances:
SRSRSPSY | SRSPSYRR | SRSISPRH | SRSPLRRH | SRSPTPRH
SRSSSLSP |
pattern: SRS[RSIP][LST][YPLR][RS][YPHR]
MPRDLSRSRSPSYRRRYSPSPVGHRSSRRSRRNRSQSPYSYSRRKSRSISPRHRRSRSPLRRH
KSRSPTPRHYKRQRSRSSSLSPIRKSSSSSLGSIERKNASEKLRIEEEEKKRRQQEAEQKLLEE
ETAKRVEEAIRRKVEESLLSKEIKLEIQRRLEEGRKRLHDEVAAQLEKEKEAAIIEARQREEQA
RKEKEELERLLEENRKRVEEAQRREALEQQRREEERYRELEELQRQKEEALRRKKQQEEEERSN
QLKLLGKNKSRPKLSFALGSK

Similar gene clusters

NC_065544 - Cluster 45 - Transporter_associated-saccharide

Gene cluster description

NC_065544 - Gene Cluster 45. Type = transporter_associated-saccharide. Location: 21788195 - 23000262 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065544 - Cluster 46 - Cyclopeptide

Gene cluster description

NC_065544 - Gene Cluster 46. Type = cyclopeptide. Location: 44064478 - 44857511 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126706450
Repeat occurs 8 times in a sequence of 663 amino acids
Location between 44422103 and 44424934
Coverage of 7.24 %
Instances:
DAKKSG | DAKDSY | DAKNLG | DAKDSY | DAKDNY
DAKDGY | DAKDNY | DAKDNY |
pattern: DAK[KDN][GLNS][GY]
MALRLTYWILFFHLLFLMCDHGNGARDIVAKQDWKQVEMTDQPDSASHETTSGTKESDYTTAY
RTTDAKKSGYVAQGGYAAAYSYSSKQDAKDSYKTAYGTKEASKDSEMEHKKHCDHANGRKHVPK
QHLKDAKNLGYVGGYAAAYSYGSKQDAKDSYITAYETRKDAKDNYITAYTKRDAKDGYVTQYGT
KDASKNSRMKHEKHGEVEEVANHLKQGKVENQKEMDRPYIAGYNKNREANYITSYNSQATKDSY
ISKYRTKEASKNSGMEHKKHDGVEDVTNQKEMDQPYIAGYNRNKEANYLTSYGTKEDAKDNYKL
NSNITQYGDASKNSRMDHQEHDGVEDVTNQKEMDQPYLAGYRKDAKDNYMAQYRTRQEAKESPI
QESDQPYIAGYRNKQEVKESNYITSFGDREDSKSSVIEHENNMKPKPSSHMDHLGAFKSGFFTM
DDLFVGNIMPLYFPIQELSEFLPREEADSIPFSMPQLPNVLQLFSIPIDSPNARAMEDTLQQCE
ATPITGETKLCATSLESMLDFVHSITGLWANLNVLTTKHPTMSTSLTQNYDVLRVSKELYAPKW
VACHPLPYPYKIFFCHFIENTKIFKVLLGGENGHKVEAVAVCHMDTSDWDPNHILFRQLGVKPG
SSPVCHFLPKYHLVWVPSTTTASI
Repeat found in LOC126706446
Repeat occurs 12 times in a sequence of 386 amino acids
Location between 44521955 and 44523465
Coverage of 40.41 %
Instances:
DFEPRPSVTVYVN | DFEPRPSVSAYNN | DFEPRPSITAYVN | DFEPRPSLTAYVN | DFEPRPSVTAYIN
DFEPRPSVTTYVN | DFEPRPSVTAYVN | DFEPRPSVTTYIN | DFEPRPSVTTYVN | DFEPRPSVTAYVN
DFEQRPNISACND | DFEPRPIASVYSE |
pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC][VNIS][DNE]
The following known motifs were found:
FEPR was found 11 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYVNPSVTVYVNDVGLETK
NFVKDFEPRPSVSAYNNPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYVNPSITAYVNDVGLET
KNNFAKDFEPRPSLTAYVNPSLTAYVNDVGLETKNNFVKDFEPRPSVTAYINPSVTAYINDVGL
KNQNKFVKDFEPRPSVTTYVNPSVTTYVNDVGLEIKKKFVNDFEPRPSVTAYVNPSVTAYVNDV
GLETKKNFVKDFEPRPSVTTYINPSVTTYINDVSPETNNFVKDFEPRPSVTTYVNPSVTTYVND
VGLETKKKFVNDFEPRPSVTAYVNPSVTAYVNVVDPETKKKFVNDFEQRPNISACNDDDVGSKD
EKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 11 times in a sequence of 362 amino acids
Location between 44521955 and 44523465
Coverage of 39.5 %
Instances:
DFEPRPSVTVYVN | DFEPRPSVSAYNN | DFEPRPSITAYVN | DFEPRPSLTAYVN | DFEPRPSVTAYIN
DFEPRPSVTTYVN | DFEPRPSVTAYVN | DFEPRPSVTTYVN | DFEPRPSVTAYVN | DFEQRPNISACND
DFEPRPIASVYSE |
pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC][VNIS][DNE]
The following known motifs were found:
FEPR was found 10 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYVNPSVTVYVNDVGLETK
NFVKDFEPRPSVSAYNNPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYVNPSITAYVNDVGLET
KNNFAKDFEPRPSLTAYVNPSLTAYVNDVGLETKNNFVKDFEPRPSVTAYINPSVTAYINDVGL
KNQNKFVKDFEPRPSVTTYVNPSVTTYVNDVGLEIKKKFVNDFEPRPSVTAYVNPSVTAYVNDV
GLETKKNFVKDFEPRPSVTTYVNPSVTTYVNDVGLETKKKFVNDFEPRPSVTAYVNPSVTAYVN
VVDPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 10 times in a sequence of 337 amino acids
Location between 44521955 and 44523465
Coverage of 32.64 %
Instances:
DFEPRPSVTVY | DFEPRPSVSAY | DFEPRPSITAY | DFEPRPSLTAY | DFEPRPSVTAY
DFEPRPSVTTY | DFEPRPSVTTY | DFEPRPSVTAY | DFEQRPNISAC | DFEPRPIASVY

pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC]
The following known motifs were found:
FEPR was found 9 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYPSVTVYVNDVGLETKNF
VKDFEPRPSVSAYPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYPSITAYVNDVGLETKNNFAK
D
FEPRPSLTAYPSLTAYVNDVGLETKNNFVKDFEPRPSVTAYPSVTAYINDVGLKNQNKFVKDF
EPR
PSVTTYPSVTTYVNDVGLEIKKKFVNDFEPRPSVTTYPSVTTYVNDVGLETKKKFVNDFEP
R
PSVTAYPSVTAYVNVVDPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVY
SE
Repeat found in LOC126706446
Repeat occurs 11 times in a sequence of 361 amino acids
Location between 44521955 and 44523465
Coverage of 33.52 %
Instances:
DFEPRPSVTVY | DFEPRPSVSAY | DFEPRPSITAY | DFEPRPSLTAY | DFEPRPSVTAY
DFEPRPSVTAY | DFEPRPSVTTY | DFEPRPSVTTY | DFEPRPSVTAY | DFEQRPNISAC
DFEPRPIASVY |
pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC]
The following known motifs were found:
FEPR was found 10 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYPSVTVYVNDVGLETKNF
VKDFEPRPSVSAYPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYPSITAYVNDVGLETKNNFAK
D
FEPRPSLTAYPSLTAYVNDVGLETKNNFVKDFEPRPSVTAYPSVTAYINDVGLKNQNKFVKDF
EPR
PSVTAYPSVTAYVNDVGLETKKNFVKDFEPRPSVTTYPSVTTYINDVSPETNNFVKDFEPR
PSVTTY
PSVTTYVNDVGLETKKKFVNDFEPRPSVTAYPSVTAYVNVVDPETKKKFVNDFEQRPN
ISAC
NDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 10 times in a sequence of 337 amino acids
Location between 44521955 and 44523465
Coverage of 32.64 %
Instances:
DFEPRPSVTVY | DFEPRPSVSAY | DFEPRPSITAY | DFEPRPSLTAY | DFEPRPSVTAY
DFEPRPSVTAY | DFEPRPSVTTY | DFEPRPSVTAY | DFEQRPNISAC | DFEPRPIASVY

pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC]
The following known motifs were found:
FEPR was found 9 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYPSVTVYVNDVGLETKNF
VKDFEPRPSVSAYPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYPSITAYVNDVGLETKNNFAK
D
FEPRPSLTAYPSLTAYVNDVGLETKNNFVKDFEPRPSVTAYPSVTAYINDVGLKNQNKFVKDF
EPR
PSVTAYPSVTAYVNDVGLETKKNFVKDFEPRPSVTTYPSVTTYVNDVGLETKKKFVNDFEP
R
PSVTAYPSVTAYVNVVDPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVY
SE
Repeat found in LOC126706446
Repeat occurs 9 times in a sequence of 312 amino acids
Location between 44521955 and 44523465
Coverage of 31.73 %
Instances:
DFEPRPSVTVY | DFEPRPSVSAY | DFEPRPSITAY | DFEPRPSLTAY | DFEPRPSVTAY
DFEPRPSVTTY | DFEPRPSVTAY | DFEQRPNISAC | DFEPRPIASVY |
pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC]
The following known motifs were found:
FEPR was found 8 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYPSVTVYVNDVGLETKNF
VKDFEPRPSVSAYPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYPSITAYVNDVGLETKNNFAK
D
FEPRPSLTAYPSLTAYVNDVGLETKNNFVKDFEPRPSVTAYPSVTAYINDVGLKNQNKFVKDF
EPR
PSVTTYPSVTTYVNDVGLETKKKFVNDFEPRPSVTAYPSVTAYVNVVDPETKKKFVNDFEQ
RPNISAC
NDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 11 times in a sequence of 361 amino acids
Location between 44521955 and 44523465
Coverage of 39.61 %
Instances:
DFEPRPSVTVYVN | DFEPRPSVSAYNN | DFEPRPSITAYVN | DFEPRPSLTAYVN | DFEPRPSVTTYVN
DFEPRPSVTAYVN | DFEPRPSVTTYIN | DFEPRPSVTTYVN | DFEPRPSVTAYVN | DFEQRPNISACND
DFEPRPIASVYSE |
pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC][VNIS][DNE]
The following known motifs were found:
FEPR was found 10 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYVNPSVTVYVNDVGLETK
NFVKDFEPRPSVSAYNNPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYVNPSITAYVNDVGLET
KNNFAKDFEPRPSLTAYVNPSLTAYVNDVGLETKNNFVKDFEPRPSVTTYVNPSVTTYVNDVGL
EIKKKFVNDFEPRPSVTAYVNPSVTAYVNDVGLETKKNFVKDFEPRPSVTTYINPSVTTYINDV
SPETNNFVKDFEPRPSVTTYVNPSVTTYVNDVGLETKKKFVNDFEPRPSVTAYVNPSVTAYVNV
VDPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 9 times in a sequence of 312 amino acids
Location between 44521955 and 44523465
Coverage of 31.73 %
Instances:
DFEPRPSVTVY | DFEPRPSVSAY | DFEPRPSITAY | DFEPRPSLTAY | DFEPRPSVTTY
DFEPRPSVTTY | DFEPRPSVTAY | DFEQRPNISAC | DFEPRPIASVY |
pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC]
The following known motifs were found:
FEPR was found 8 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYPSVTVYVNDVGLETKNF
VKDFEPRPSVSAYPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYPSITAYVNDVGLETKNNFAK
D
FEPRPSLTAYPSLTAYVNDVGLETKNNFVKDFEPRPSVTTYPSVTTYVNDVGLEIKKKFVNDF
EPR
PSVTTYPSVTTYVNDVGLETKKKFVNDFEPRPSVTAYPSVTAYVNVVDPETKKKFVNDFEQ
RPNISAC
NDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 10 times in a sequence of 336 amino acids
Location between 44521955 and 44523465
Coverage of 32.74 %
Instances:
DFEPRPSVTVY | DFEPRPSVSAY | DFEPRPSITAY | DFEPRPSLTAY | DFEPRPSVTAY
DFEPRPSVTTY | DFEPRPSVTTY | DFEPRPSVTAY | DFEQRPNISAC | DFEPRPIASVY

pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC]
The following known motifs were found:
FEPR was found 9 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYPSVTVYVNDVGLETKNF
VKDFEPRPSVSAYPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYPSITAYVNDVGLETKNNFAK
D
FEPRPSLTAYPSLTAYVNDVGLETKNNFVKDFEPRPSVTAYPSVTAYVNDVGLETKKNFVKDF
EPR
PSVTTYPSVTTYINDVSPETNNFVKDFEPRPSVTTYPSVTTYVNDVGLETKKKFVNDFEPR
PSVTAY
PSVTAYVNVVDPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVYS
E
Repeat found in LOC126706446
Repeat occurs 9 times in a sequence of 312 amino acids
Location between 44521955 and 44523465
Coverage of 31.73 %
Instances:
DFEPRPSVTVY | DFEPRPSVSAY | DFEPRPSITAY | DFEPRPSLTAY | DFEPRPSVTAY
DFEPRPSVTTY | DFEPRPSVTAY | DFEQRPNISAC | DFEPRPIASVY |
pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC]
The following known motifs were found:
FEPR was found 8 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYPSVTVYVNDVGLETKNF
VKDFEPRPSVSAYPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYPSITAYVNDVGLETKNNFAK
D
FEPRPSLTAYPSLTAYVNDVGLETKNNFVKDFEPRPSVTAYPSVTAYVNDVGLETKKNFVKDF
EPR
PSVTTYPSVTTYVNDVGLETKKKFVNDFEPRPSVTAYPSVTAYVNVVDPETKKKFVNDFEQ
RPNISAC
NDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 11 times in a sequence of 361 amino acids
Location between 44521955 and 44523465
Coverage of 33.52 %
Instances:
DFEPRPSVTVY | DFEPRPSVSAY | DFEPRPSITAY | DFEPRPSVTAY | DFEPRPSVTTY
DFEPRPSVTAY | DFEPRPSVTTY | DFEPRPSVTTY | DFEPRPSVTAY | DFEQRPNISAC
DFEPRPIASVY |
pattern: DFE[PQ]RP[NSI][VAI][ST][VAT][YC]
The following known motifs were found:
FEPR was found 10 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYPSVTVYVNDVGLETKNF
VKDFEPRPSVSAYPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYPSITAYVNDVGLETKNNFAK
D
FEPRPSVTAYPSVTAYINDVGLKNQNKFVKDFEPRPSVTTYPSVTTYVNDVGLEIKKKFVNDF
EPR
PSVTAYPSVTAYVNDVGLETKKNFVKDFEPRPSVTTYPSVTTYINDVSPETNNFVKDFEPR
PSVTTY
PSVTTYVNDVGLETKKKFVNDFEPRPSVTAYPSVTAYVNVVDPETKKKFVNDFEQRPN
ISAC
NDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 10 times in a sequence of 336 amino acids
Location between 44521955 and 44523465
Coverage of 32.74 %
Instances:
DFEPRPSVTVY | DFEPRPSVSAY | DFEPRPSITAY | DFEPRPSVTAY | DFEPRPSVTAY
DFEPRPSVTTY | DFEPRPSVTTY | DFEPRPSVTAY | DFEQRPNISAC | DFEPRPIASVY

pattern: DFE[PQ]RP[NSI][VAI][ST][VAT][YC]
The following known motifs were found:
FEPR was found 9 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYPSVTVYVNDVGLETKNF
VKDFEPRPSVSAYPSVSAYNNDVGLKTQKKFVKDFEPRPSITAYPSITAYVNDVGLETKNNFAK
D
FEPRPSVTAYPSVTAYINDVGLKNQNKFVKDFEPRPSVTAYPSVTAYVNDVGLETKKNFVKDF
EPR
PSVTTYPSVTTYINDVSPETNNFVKDFEPRPSVTTYPSVTTYVNDVGLETKKKFVNDFEPR
PSVTAY
PSVTAYVNVVDPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVYS
E
Repeat found in LOC126706446
Repeat occurs 10 times in a sequence of 336 amino acids
Location between 44521955 and 44523465
Coverage of 26.79 %
Instances:
DFEPRPSVT | DFEPRPSVS | DFEPRPSIT | DFEPRPSVT | DFEPRPSVT
DFEPRPSVT | DFEPRPSVT | DFEPRPSVT | DFEQRPNIS | DFEPRPIAS

pattern: DFE[PQ]RP[NSI][VAI][ST]
The following known motifs were found:
FEPR was found 9 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTPSVTVYVNDVGLETKNFVK
D
FEPRPSVSPSVSAYNNDVGLKTQKKFVKDFEPRPSITPSITAYVNDVGLETKNNFAKDFEPRP
SVT
PSVTTYVNDVGLEIKKKFVNDFEPRPSVTPSVTAYVNDVGLETKKNFVKDFEPRPSVTPSV
TTYINDVSPETNNFVKDFEPRPSVTPSVTTYVNDVGLETKKKFVNDFEPRPSVTPSVTAYVNVV
DPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 11 times in a sequence of 361 amino acids
Location between 44521955 and 44523465
Coverage of 33.52 %
Instances:
DFEPRPSVTVY | DFEPRPSVSAY | DFEPRPSLTAY | DFEPRPSVTAY | DFEPRPSVTTY
DFEPRPSVTAY | DFEPRPSVTTY | DFEPRPSVTTY | DFEPRPSVTAY | DFEQRPNISAC
DFEPRPIASVY |
pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC]
The following known motifs were found:
FEPR was found 10 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYPSVTVYVNDVGLETKNF
VKDFEPRPSVSAYPSVSAYNNDVGLKTQKKFVKDFEPRPSLTAYPSLTAYVNDVGLETKNNFVK
D
FEPRPSVTAYPSVTAYINDVGLKNQNKFVKDFEPRPSVTTYPSVTTYVNDVGLEIKKKFVNDF
EPR
PSVTAYPSVTAYVNDVGLETKKNFVKDFEPRPSVTTYPSVTTYINDVSPETNNFVKDFEPR
PSVTTY
PSVTTYVNDVGLETKKKFVNDFEPRPSVTAYPSVTAYVNVVDPETKKKFVNDFEQRPN
ISAC
NDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 10 times in a sequence of 336 amino acids
Location between 44521955 and 44523465
Coverage of 26.79 %
Instances:
DFEPRPSVT | DFEPRPSVS | DFEPRPSLT | DFEPRPSVT | DFEPRPSVT
DFEPRPSVT | DFEPRPSVT | DFEPRPSVT | DFEQRPNIS | DFEPRPIAS

pattern: DFE[PQ]RP[NSI][VLAI][ST]
The following known motifs were found:
FEPR was found 9 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTPSVTVYVNDVGLETKNFVK
D
FEPRPSVSPSVSAYNNDVGLKTQKKFVKDFEPRPSLTPSLTAYVNDVGLETKNNFVKDFEPRP
SVT
PSVTTYVNDVGLEIKKKFVNDFEPRPSVTPSVTAYVNDVGLETKKNFVKDFEPRPSVTPSV
TTYINDVSPETNNFVKDFEPRPSVTPSVTTYVNDVGLETKKKFVNDFEPRPSVTPSVTAYVNVV
DPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 9 times in a sequence of 311 amino acids
Location between 44521955 and 44523465
Coverage of 31.83 %
Instances:
DFEPRPSVTVY | DFEPRPSVSAY | DFEPRPSLTAY | DFEPRPSVTAY | DFEPRPSVTTY
DFEPRPSVTTY | DFEPRPSVTAY | DFEQRPNISAC | DFEPRPIASVY |
pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC]
The following known motifs were found:
FEPR was found 8 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYPSVTVYVNDVGLETKNF
VKDFEPRPSVSAYPSVSAYNNDVGLKTQKKFVKDFEPRPSLTAYPSLTAYVNDVGLETKNNFVK
D
FEPRPSVTAYPSVTAYVNDVGLETKKNFVKDFEPRPSVTTYPSVTTYINDVSPETNNFVKDFE
PR
PSVTTYPSVTTYVNDVGLETKKKFVNDFEPRPSVTAYPSVTAYVNVVDPETKKKFVNDFEQR
PNISAC
NDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 10 times in a sequence of 336 amino acids
Location between 44521955 and 44523465
Coverage of 26.79 %
Instances:
DFEPRPSVT | DFEPRPSVS | DFEPRPSVT | DFEPRPSVT | DFEPRPSVT
DFEPRPSVT | DFEPRPSVT | DFEPRPSVT | DFEQRPNIS | DFEPRPIAS

pattern: DFE[PQ]RP[NSI][VAI][ST]
The following known motifs were found:
FEPR was found 9 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTPSVTVYVNDVGLETKNFVK
D
FEPRPSVSPSVSAYNNDVGLKTQKKFVKDFEPRPSVTPSVTAYINDVGLKNQNKFVKDFEPRP
SVT
PSVTTYVNDVGLEIKKKFVNDFEPRPSVTPSVTAYVNDVGLETKKNFVKDFEPRPSVTPSV
TTYINDVSPETNNFVKDFEPRPSVTPSVTTYVNDVGLETKKKFVNDFEPRPSVTPSVTAYVNVV
DPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 9 times in a sequence of 311 amino acids
Location between 44521955 and 44523465
Coverage of 26.05 %
Instances:
DFEPRPSVT | DFEPRPSVS | DFEPRPSVT | DFEPRPSVT | DFEPRPSVT
DFEPRPSVT | DFEPRPSVT | DFEQRPNIS | DFEPRPIAS |
pattern: DFE[PQ]RP[NSI][VAI][ST]
The following known motifs were found:
FEPR was found 8 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTPSVTVYVNDVGLETKNFVK
D
FEPRPSVSPSVSAYNNDVGLKTQKKFVKDFEPRPSVTPSVTTYVNDVGLEIKKKFVNDFEPRP
SVT
PSVTAYVNDVGLETKKNFVKDFEPRPSVTPSVTTYINDVSPETNNFVKDFEPRPSVTPSVT
TYVNDVGLETKKKFVNDFEPRPSVTPSVTAYVNVVDPETKKKFVNDFEQRPNISACNDDDVGSK
DEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 11 times in a sequence of 361 amino acids
Location between 44521955 and 44523465
Coverage of 39.61 %
Instances:
DFEPRPSVTVYVN | DFEPRPSITAYVN | DFEPRPSLTAYVN | DFEPRPSVTAYIN | DFEPRPSVTTYVN
DFEPRPSVTAYVN | DFEPRPSVTTYIN | DFEPRPSVTTYVN | DFEPRPSVTAYVN | DFEQRPNISACND
DFEPRPIASVYSE |
pattern: DFE[PQ]RP[NSI][VLAI][ST][VAT][YC][VNIS][DNE]
The following known motifs were found:
FEPR was found 10 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTVYVNPSVTVYVNDVGLETK
NFVKDFEPRPSITAYVNPSITAYVNDVGLETKNNFAKDFEPRPSLTAYVNPSLTAYVNDVGLET
KNNFVKDFEPRPSVTAYINPSVTAYINDVGLKNQNKFVKDFEPRPSVTTYVNPSVTTYVNDVGL
EIKKKFVNDFEPRPSVTAYVNPSVTAYVNDVGLETKKNFVKDFEPRPSVTTYINPSVTTYINDV
SPETNNFVKDFEPRPSVTTYVNPSVTTYVNDVGLETKKKFVNDFEPRPSVTAYVNPSVTAYVNV
VDPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 10 times in a sequence of 336 amino acids
Location between 44521955 and 44523465
Coverage of 26.79 %
Instances:
DFEPRPSVT | DFEPRPSIT | DFEPRPSVT | DFEPRPSVT | DFEPRPSVT
DFEPRPSVT | DFEPRPSVT | DFEPRPSVT | DFEQRPNIS | DFEPRPIAS

pattern: DFE[PQ]RP[NSI][VAI][ST]
The following known motifs were found:
FEPR was found 9 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTPSVTVYVNDVGLETKNFVK
D
FEPRPSITPSITAYVNDVGLETKNNFAKDFEPRPSVTPSVTAYINDVGLKNQNKFVKDFEPRP
SVT
PSVTTYVNDVGLEIKKKFVNDFEPRPSVTPSVTAYVNDVGLETKKNFVKDFEPRPSVTPSV
TTYINDVSPETNNFVKDFEPRPSVTPSVTTYVNDVGLETKKKFVNDFEPRPSVTPSVTAYVNVV
DPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 10 times in a sequence of 336 amino acids
Location between 44521955 and 44523465
Coverage of 26.79 %
Instances:
DFEPRPSVT | DFEPRPSLT | DFEPRPSVT | DFEPRPSVT | DFEPRPSVT
DFEPRPSVT | DFEPRPSVT | DFEPRPSVT | DFEQRPNIS | DFEPRPIAS

pattern: DFE[PQ]RP[NSI][VLAI][ST]
The following known motifs were found:
FEPR was found 9 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTPSVTVYVNDVGLETKNFVK
D
FEPRPSLTPSLTAYVNDVGLETKNNFVKDFEPRPSVTPSVTAYINDVGLKNQNKFVKDFEPRP
SVT
PSVTTYVNDVGLEIKKKFVNDFEPRPSVTPSVTAYVNDVGLETKKNFVKDFEPRPSVTPSV
TTYINDVSPETNNFVKDFEPRPSVTPSVTTYVNDVGLETKKKFVNDFEPRPSVTPSVTAYVNVV
DPETKKKFVNDFEQRPNISACNDDDVGSKDEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 9 times in a sequence of 311 amino acids
Location between 44521955 and 44523465
Coverage of 26.05 %
Instances:
DFEPRPSVT | DFEPRPSLT | DFEPRPSVT | DFEPRPSVT | DFEPRPSVT
DFEPRPSVT | DFEPRPSVT | DFEQRPNIS | DFEPRPIAS |
pattern: DFE[PQ]RP[NSI][VLAI][ST]
The following known motifs were found:
FEPR was found 8 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTPSVTVYVNDVGLETKNFVK
D
FEPRPSLTPSLTAYVNDVGLETKNNFVKDFEPRPSVTPSVTTYVNDVGLEIKKKFVNDFEPRP
SVT
PSVTAYVNDVGLETKKNFVKDFEPRPSVTPSVTTYINDVSPETNNFVKDFEPRPSVTPSVT
TYVNDVGLETKKKFVNDFEPRPSVTPSVTAYVNVVDPETKKKFVNDFEQRPNISACNDDDVGSK
DEKPFVEDFEPRPIASVYSE
Repeat found in LOC126706446
Repeat occurs 9 times in a sequence of 311 amino acids
Location between 44521955 and 44523465
Coverage of 26.05 %
Instances:
DFEPRPSVT | DFEPRPSVT | DFEPRPSVT | DFEPRPSVT | DFEPRPSVT
DFEPRPSVT | DFEPRPSVT | DFEQRPNIS | DFEPRPIAS |
pattern: DFE[PQ]RP[NSI][VAI][ST]
The following known motifs were found:
FEPR was found 8 times in this sequence
MNLLCRLKRSNYFELMTVDMESRFAWLTLFLLLLFANTIESRKDPGQYWTSVMKDQPMPKAIQ
GLVHLDSSPSKLSKNDNCHTSEGAIGKDQGEKPYVKDFEPRPSVTPSVTVYVNDVGLETKNFVK
D
FEPRPSVTPSVTAYINDVGLKNQNKFVKDFEPRPSVTPSVTTYVNDVGLEIKKKFVNDFEPRP
SVT
PSVTAYVNDVGLETKKNFVKDFEPRPSVTPSVTTYINDVSPETNNFVKDFEPRPSVTPSVT
TYVNDVGLETKKKFVNDFEPRPSVTPSVTAYVNVVDPETKKKFVNDFEQRPNISACNDDDVGSK
DEKPFVEDFEPRPIASVYSE

Similar gene clusters

NC_065545 - Cluster 47 - Cyclopeptide

Gene cluster description

NC_065545 - Gene Cluster 47. Type = cyclopeptide. Location: 15579494 - 16395498 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC126710478
Repeat occurs 3 times in a sequence of 403 amino acids
Location between 16108677 and 16113110
Coverage of 4.47 %
Instances:
VSSNEV | VSSDYV | VSSDEM |
pattern: VSS[DN][YE][VM]
MVSLEVAQVMENRTKKRTRSAWDITPAQPQAKRGEVVSSNEVISRRRQASPPRRDDDREGHYV
FNLGENLTPRYKILSKMGEGTFGRVLECWDRQTREYVAIKVVRSIRKYRNAAMIEVDVLQHLAK
NDKGDSHCVQIRSWFDYRNHICIVFEKLGPSLFDFLKRNKYCPFPVDLVREFGRQLLESVAYMH
DLRLIHTDLKPENILLVSSDYVRLPGYKKVSSDEMLSRCLPKSSAIKLIDFGSAAFDNQNHSSI
VSTRHYRAPEVILGLGWSYPCDLWSVGCILVELCSGEALFQTHENLEHLAMMERVFGPLPEHMI
RRADKGAEKYFRRGLRLNWPEGAVSRESIRAVNKLHRLQDLVSQHVESSRFSLTDLLNGLLMYE
PSERLTARQALNHPFFNNLT
Repeat found in LOC126710478
Repeat occurs 3 times in a sequence of 402 amino acids
Location between 16108677 and 16113110
Coverage of 4.48 %
Instances:
VSSNEV | VSSDYV | VSSDEM |
pattern: VSS[DN][YE][VM]
MVSLEVAQVMENRTKKRTRSAWDITPAQPQAKRGEVVSSNEVISRRRQASPPRRDDDREGHYV
FNLGENLTPRYKILSKMGEGTFGRVLECWDRQTREYVAIKVVRSIRKYRNAAMIEVDVLQHLAK
NDKGDSHCVQIRSWFDYRNHICIVFEKLGPSLFDFLKRNKYCPFPVDLVREFGRQLLESVAYMH
DLRLIHTDLKPENILLVSSDYVRLPGYKVSSDEMLSRCLPKSSAIKLIDFGSAAFDNQNHSSIV
STRHYRAPEVILGLGWSYPCDLWSVGCILVELCSGEALFQTHENLEHLAMMERVFGPLPEHMIR
RADKGAEKYFRRGLRLNWPEGAVSRESIRAVNKLHRLQDLVSQHVESSRFSLTDLLNGLLMYEP
SERLTARQALNHPFFNNLT

Similar gene clusters

NC_065545 - Cluster 48 - Saccharide

Gene cluster description

NC_065545 - Gene Cluster 48. Type = saccharide. Location: 16620065 - 16734829 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065545 - Cluster 49 - Transporter_associated-alkaloid

Gene cluster description

NC_065545 - Gene Cluster 49. Type = transporter_associated-alkaloid. Location: 30711224 - 31262929 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_065545 - Cluster 50 - Saccharide

Gene cluster description

NC_065545 - Gene Cluster 50. Type = saccharide. Location: 42810213 - 43250549 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters