Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_056557.1:
Cluster 1Saccharide-Polyketide31368703578906442.042OG-FeII_Oxy, Chal_sti_synt_C, DIOX_N, FAE1_CUT1_RppA, Glyco_hydro_1---
Cluster 2Cyclopeptide425132659887941737.47BURP---
Cluster 3Cyclopeptide513554862034941067.95BURP---
Cluster 4Alkaloid80231398190446167.31Bet_v_1, adh_short---
Cluster 5Alkaloid85936438743651150.01Bet_v_1, Methyltransf_11, p450---
Cluster 6Cyclopeptide943751710214777777.26BURP---
Cluster 7Cyclopeptide1447879615150464671.67BURP---
Cluster 8Saccharide248193342490174682.41ADH_N, UDPGT_2, UbiAflavonoid--
Cluster 9Saccharide2685027827083885233.61Glyco_hydro_1, UDPGT_2, UbiA, oMTflavonoid-3--
Cluster 10Polyketide2992850930278324349.81Chal_sti_synt_C, Chal_sti_synt_N, Methyltransf_11, p450---
The following clusters are from record NC_056558.1:
Cluster 11Terpene94195549609400189.85PRISE, Prenyltransf, p450---
Cluster 12Terpene1323664613467623230.98Abhydrolase_3, Epimerase, Terpene_synth, Terpene_synth_C, adh_short---
Cluster 13Saccharide-Transporter_associated-Alkaloid2748042527727869247.442OG-FeII_Oxy, ABC_tran, Cellulose_synt, DIOX_N, NAD_binding_1, Pyridoxal_deC---
Cluster 14Cyclopeptide30504081319896251485.54BURP-arabidiol/baruol (9% of genes show similarity)BGC0002906.1_c1
Cluster 15Transporter_associated4095772041215780258.062OG-FeII_Oxy, ABC_tran, DIOX_N, Lycopene_cycl, Methyltransf_11---
Cluster 16Saccharide427528584284210389.25Glyco_hydro_1, Lipoxygenase, p450---
The following clusters are from record NC_056559.1:
Cluster 17Cyclopeptide42423194915182672.86BURP---
Cluster 18Saccharide-Terpene56567785839640182.86DIOX_N, SQHop_cyclase_C, SQHop_cyclase_N, UDPGT_2flavonoid--
Cluster 19Cyclopeptide14709718157270481017.33BURP---
Cluster 20Alkaloid2730553527436472130.94Bet_v_1, NAD_binding_4, Oxidored_FMN---
Cluster 21Cyclopeptide2774492928577238832.31BURP---
Cluster 22Saccharide2919132729395074203.75Aminotran_1_2, LTP_2, Transferase, UDPGT_2*saccharide-3--
Cluster 23Cyclopeptide33425384345329041107.52BURP---
The following clusters are from record NC_056560.1:
Cluster 24Cyclopeptide996099010326087365.10BURP---
Cluster 25Fatty_acid149401031503742397.32Acetyltransf_1, FA_hydroxylase, Methyltransf_11---
The following clusters are from record NC_056561.1:
Cluster 26Cyclopeptide35571614064220507.06BURP---
Cluster 27Fatty_acid-Alkaloid86616048866671205.07Acetyltransf_1, FA_desaturase_2, LTP_2, Pyridoxal_deC---
Cluster 28Alkaloid9789812988302693.21Aldo_ket_red, BBE, FAD_binding_4, p450---
Cluster 29Transporter_associated1197025312447568477.31ABC_tran, Abhydrolase_3, Methyltransf_3, Peptidase_S10, p450---
Cluster 30Saccharide2264616722757068110.90Aldo_ket_red, UDPGT_2, p450flavonoid--
Cluster 31Fatty_acid297852192988436899.15FA_hydroxylase, Methyltransf_11, p450---
The following clusters are from record NC_056562.1:
Cluster 32Saccharide1663572016908032272.31Cellulose_synt, UDPGT_2, p450---
Cluster 33Cyclopeptide2077862121582324803.70BURP---
Cluster 34Cyclopeptide2458046625168949588.48BURP---
Cluster 35Cyclopeptide2686736027509910642.55BURP---
Cluster 36Alkaloid2850354128661211157.67Epimerase, Pyridoxal_deC, adh_short, adh_short_C2-QS-21 (11% of genes show similarity)BGCMANUAL2.1_c1
Cluster 37Saccharide340873073415425066.94UDPGT_2, p450hydroxycinnamate--
The following clusters are from record NC_056563.1:
Cluster 38Cyclopeptide44339695366052932.08BURP---
Cluster 39Cyclopeptide1153826112047513509.25BURP---
Cluster 40Cyclopeptide14399246170387122639.47BURP---
Cluster 41Cyclopeptide1738825918315061926.80BURP---
Cluster 42Saccharide-Cyclopeptide2169130422598935907.63ABC_tran, BURP, Glyco_hydro_1, p450---
Cluster 43Terpene3184676331969629122.872OG-FeII_Oxy, DIOX_N, Epimerase, Terpene_synth_C---
The following clusters are from record NC_056564.1:
Cluster 44Cyclopeptide118775823562581168.50BURP---
Cluster 45Saccharide82672168451874184.66Glycos_transf_1, HAD_RAM2_N, Methyltransf_7, Transferase, p450---
Cluster 46Saccharide1468302114903315220.29ADH_N, ADH_zinc_N, Aminotran_1_2, Glyco_hydro_1, Methyltransf_11---
Cluster 47Fatty_acid1612982216274753144.93FA_hydroxylase, Transferase, p450---
The following clusters are from record NC_056565.1:
Cluster 48Cyclopeptide146884729572461488.40BURP---
Cluster 49Saccharide-Transporter_associated-Alkaloid1570517815982781277.603Beta_HSD, ADH_N, ADH_zinc_N, Acetyltransf_1, BBE, FAD_binding_4, LTP_2, UDPGT_2small phenolic-3--
Cluster 50Cyclopeptide1740148818007318605.83BURP---
Cluster 51Cyclopeptide1758800118081731493.73BURP---
Cluster 52Cyclopeptide2250934123184634675.29BURP---
Cluster 53Alkaloid2912223829392236270.00ADH_N, ADH_zinc_N, Epimerase, Pyridoxal_deC---
Cluster 54Saccharide3077521331002316227.10ADH_N, ADH_zinc_N, Glyco_transf_28, UDPGT, p450---
Cluster 55Polyketide338906353398423493.60Chal_sti_synt_C, Chal_sti_synt_N, Methyltransf_3, adh_short---
Cluster 56Saccharide3575552136069554314.03Glycos_transf_1, Methyltransf_11, p450---
The following clusters are from record NC_056566.1:
Cluster 57Putative17265732147314420.74HMGL-like, Methyltransf_11, Transferase, p450---
Cluster 58Cyclopeptide24820162769107287.09BURP---
Cluster 59Cyclopeptide313008052304152100.34BURP---
Cluster 60Cyclopeptide1404184214637964596.12BURP---
Cluster 61Terpene-Alkaloid2123008121356153126.07Acetyltransf_1, Chalcone_2, Lipoxygenase, Str_synth, Terpene_synth, Terpene_synth_C---
Cluster 62Terpene2512062225270390149.772OG-FeII_Oxy, Abhydrolase_3, Terpene_synth_C---
The following clusters are from record NC_056567.1:
Cluster 63Alkaloid42675874420228152.64Aminotran_3, BBE, Epimerase, FAD_binding_4---
Cluster 64Cyclopeptide49760455905856929.81BURP-alpha-chaconine/alpha-solanine (13% of genes show similarity)BGC0002722.2_c1
Cluster 65Cyclopeptide821562198484231632.80BURP---
Cluster 66Putative2390868524145955237.27ADH_N, ADH_zinc_N, Epimerase, GMC_oxred_N, UbiA, p450---
The following clusters are from record NC_056568.1:
Cluster 67Saccharide14723411933240460.902OG-FeII_Oxy, Amino_oxidase, Cellulose_synt, DIOX_N, Dimerisation, Methyltransf_2---
Cluster 68Fatty_acid2772287286552393.24FA_hydroxylase, Methyltransf_11, adh_short, adh_short_C2---
Cluster 69Cyclopeptide14588908167370272148.12BURP---
Cluster 70Cyclopeptide2005182720830064778.24BURP---
Cluster 71Cyclopeptide2294714823400988453.84BURP---
The following clusters are from record NC_056569.1:
Cluster 72Cyclopeptide20149316442121442.72BURP---
Cluster 73Cyclopeptide14194142052731633.32BURP---
Cluster 74Saccharide2212726226097548.25Methyltransf_7, UDPGT_2---
Cluster 75Cyclopeptide59461906557160610.97BURP---
Cluster 76Cyclopeptide2086669021483577616.89BURP---
Cluster 77Cyclopeptide2431437625027571713.20BURP---
The following clusters are from record NC_056570.1:
Cluster 78Lignan-Fatty_acid5617287569428377.00AMP-binding, Dirigent, FA_desaturase_2---
Cluster 79Terpene1106579511331195265.40TPMT, Terpene_synth, Terpene_synth_C, p450---
Cluster 80Putative2144978121833882384.102OG-FeII_Oxy, Aminotran_1_2, DIOX_N, Dimerisation, Lycopene_cycl, Methyltransf_2, adh_short, p450---
Cluster 81Saccharide-Alkaloid2204450822275765231.26Amino_oxidase, Aminotran_1_2, BBE, FAD_binding_4, Glyco_transf_28, Lycopene_cycl, UDPGT---
Cluster 82Saccharide2566724525782663115.42E1_dh, Glycos_transf_1, Peptidase_S10, UDPGT_2*saccharide--
Cluster 83Cyclopeptide2870630429428941722.64BURP, Lipoxygenase, Transferase---
Cluster 84Putative2914312629280706137.58Lipoxygenase, Transferase---
The following clusters are from record NW_024868115.1:
Cluster 85Phenolamide19568859786.64Orn_Arg_deC_N, Orn_DAP_Arg_deC, Transferase---
The following clusters are from record NW_024868124.1:
Cluster 86Cyclopeptide1189126189.12BURP---
The following clusters are from record NW_024868169.1:
Cluster 87Cyclopeptide1173245173.24BURP---
The following clusters are from record NW_024868210.1:
Cluster 88Cyclopeptide1160178160.18BURP---
The following clusters are from record NW_024868316.1:
Cluster 89Transporter_associated78681324179245.502OG-FeII_Oxy, ABC2_membrane, ABC_tran, Aminotran_1_2, DIOX_N, PRISE---
The following clusters are from record NW_024868374.1:
Cluster 90Cyclopeptide1118553118.55BURP---
The following clusters are from record NW_024868400.1:
Cluster 91Cyclopeptide1529537529.54BURP---
The following clusters are from record NW_024868405.1:
Cluster 92Cyclopeptide125395985247.31BURP---
The following clusters are from record NW_024868422.1:
Cluster 93Cyclopeptide1110796110.80BURP---
The following clusters are from record NW_024868658.1:
Cluster 94Cyclopeptide17381173.81BURP---
The following clusters are from record NW_024868771.1:
Cluster 95Cyclopeptide308206281231.99BURP---
The following clusters are from record NW_024868871.1:
Cluster 96Cyclopeptide15364953.65BURP---
The following clusters are from record NW_024868887.1:
Cluster 97Cyclopeptide14315943.16BURP---
The following clusters are from record NW_024868967.1:
Cluster 98Cyclopeptide14639446.39BURP---
The following clusters are from record NW_024868972.1:
Cluster 99Cyclopeptide14596245.96BURP---
The following clusters are from record NW_024869088.1:
Cluster 100Cyclopeptide13731637.31BURP---
The following clusters are from record NW_024869167.1:
Cluster 101Cyclopeptide13308833.09BURP---
The following clusters are from record NW_024869172.1:
Cluster 102Cyclopeptide13290632.91BURP---
The following clusters are from record NW_024869429.1:
Cluster 103Cyclopeptide11950619.50BURP---
The following clusters are from record NW_024869446.1:
Cluster 104Cyclopeptide11840918.41BURP---
The following clusters are from record NW_024869731.1:
Cluster 105Cyclopeptide1401614401.61BURP---
The following clusters are from record NW_024869847.1:
Cluster 106Cyclopeptide1385394385.39BURP---
The following clusters are from record NW_024870494.1:
Cluster 107Lignan28872635073362.01Dirigent, Epimerase---
The following clusters are from record NW_024870495.1:
Cluster 108Cyclopeptide1317436317.44BURP---
The following clusters are from record NW_024870497.1:
Cluster 109Cyclopeptide1315566315.56BURP---
The following clusters are from record NW_024870581.1:
Cluster 110Cyclopeptide1240977240.98BURP---
The following clusters are from record NW_024870586.1:
Cluster 111Cyclopeptide1247233247.23BURP---
The following clusters are from record NW_024870639.1:
Cluster 112Terpene-Alkaloid560333704711144.38Cu_amine_oxid, Lipoxygenase, Terpene_synth, Terpene_synth_C---
The following clusters are from record NW_024870659.1:
Cluster 113Fatty_acid8147814942667.95FA_hydroxylase, Methyltransf_11, p450---
The following clusters are from record NW_024870669.1:
Cluster 114Cyclopeptide1404052404.05BURP---
The following clusters are from record NW_024870673.1:
Cluster 115Cyclopeptide1230815230.81BURP---

NC_056557 - Cluster 1 - Saccharide-polyketide

Gene cluster description

NC_056557 - Gene Cluster 1. Type = saccharide-polyketide. Location: 3136870 - 3578906 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056557 - Cluster 2 - Cyclopeptide

Gene cluster description

NC_056557 - Gene Cluster 2. Type = cyclopeptide. Location: 4251326 - 5988794 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122078207
Repeat occurs 3 times in a sequence of 692 amino acids
Location between 4401737 and 4403816
Coverage of 3.9 %
Instances:
LIDMYCACG | LIDMYCKCG | LIDMYGKCG |
pattern: LIDMY[CG][AK]CG
MACLSPSILSLPPPQRLEDPNSSQKPIEEWLFCSLKSPKTLPQLKTLHAHIYKTGLTGNDLVS
GQLLLCCSLSNSIHYARQIFDAIPRPKPFFYNAMIKGYSETGAHHEALRLYSIMRARSVPCDSF
TFPVVLRSASSLARIDVGEELHGLITKTAFDSRVIVRTALIDMYCACGLSDCGRLVFDRTSDRD
VICWNTMIAGYVKCGEFSRAKQLFDGMPEKNISSWNTLIDMYCKCGDIDTAHRLFDEMPGRDLI
SWNSILSGYSKVGQCEAARRLFDEMPQRNVVSWNVVITCYVHNRLFTEALELFRMMQLSDIKPN
EVTVVAVLPACAHLGALDNGQWIHAYICRLQIKMDIYVNTALIDMYGKCGGLEDAQRVFDEANK
KDTFLCSTMIEVMAMHGKAEEAFRVFTYMTSTGIKPSDVTFVGLLKACSHAGLVDFGRKYFEMM
TAEFGLTPKMEHFGCMVDLLGRAGFLEEAHELVKSMPMEPHPVVWATLLSSCRIHGKVKLAEQV
ALRLIELVPQSGETYVLLSNIYSQAGRWNEAANLRKMMKEKGVIKKPGCSSIEVNNDVYEFFAG
DRDHPHCKEIYEMLEQMASRLNTEGYVPQTSSALHDVNLDEKEQALLYHSEKLAVAFGLLSTQH
RTPIRIVKNLRVCDDCHLFMKVVSKYYKRTIIARDCNRFHHFSDGSCSCSDFW

Similar gene clusters

NC_056557 - Cluster 3 - Cyclopeptide

Gene cluster description

NC_056557 - Gene Cluster 3. Type = cyclopeptide. Location: 5135548 - 6203494 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_056557 - Cluster 4 - Alkaloid

Gene cluster description

NC_056557 - Gene Cluster 4. Type = alkaloid. Location: 8023139 - 8190446 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056557 - Cluster 5 - Alkaloid

Gene cluster description

NC_056557 - Gene Cluster 5. Type = alkaloid. Location: 8593643 - 8743651 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056557 - Cluster 6 - Cyclopeptide

Gene cluster description

NC_056557 - Gene Cluster 6. Type = cyclopeptide. Location: 9437517 - 10214777 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_056557 - Cluster 7 - Cyclopeptide

Gene cluster description

NC_056557 - Gene Cluster 7. Type = cyclopeptide. Location: 14478796 - 15150464 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_056557 - Cluster 8 - Saccharide

Gene cluster description

NC_056557 - Gene Cluster 8. Type = saccharide. Location: 24819334 - 24901746 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056557 - Cluster 9 - Saccharide

Gene cluster description

NC_056557 - Gene Cluster 9. Type = saccharide. Location: 26850278 - 27083885 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056557 - Cluster 10 - Polyketide

Gene cluster description

NC_056557 - Gene Cluster 10. Type = polyketide. Location: 29928509 - 30278324 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056558 - Cluster 11 - Terpene

Gene cluster description

NC_056558 - Gene Cluster 11. Type = terpene. Location: 9419554 - 9609400 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056558 - Cluster 12 - Terpene

Gene cluster description

NC_056558 - Gene Cluster 12. Type = terpene. Location: 13236646 - 13467623 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056558 - Cluster 13 - Saccharide-transporter_associated-alkaloid

Gene cluster description

NC_056558 - Gene Cluster 13. Type = saccharide-transporter_associated-alkaloid. Location: 27480425 - 27727869 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056558 - Cluster 14 - Cyclopeptide

Gene cluster description

NC_056558 - Gene Cluster 14. Type = cyclopeptide. Location: 30504081 - 31989625 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122064987
Repeat occurs 3 times in a sequence of 358 amino acids
Location between 31401147 and 31403074
Coverage of 5.03 %
Instances:
PLHVAI | PLHLAV | PLHCAI |
pattern: PLH[CLV]A[IV]
MDESLINAAKAGDRSHLYELLRQNAFILEVFDQIPFVDTPLHVAILAGQTHFVVEVINLKPSF
TKKLNQDGFSPLHLAVANGHFEIVRELLRIDQNLCFLRGRDDKIPLHCAIENGKMDVLDLLIST
CPNSVTELSIGGETILHRAMKNGRVETFKALVERLPTLKMTNILGWKDQEGNTILHVAISLRQL
EMMEFLLLGPCRKSVRKVANVTNVRGLTAFDIFRELDTSHEVERNDRLRLRKIGDILSSASGPK
RCRCILASACFCCCGGCNLMRCVTKKFKVNKGTPTTSSSGGATATDNISAFIELVKAANYAAFT
LSFILIYNLTNGFPLRIPVRMALIFMLWVISKVESRTRI

Similar gene clusters

Similar known gene clusters

NC_056558 - Cluster 15 - Transporter_associated

Gene cluster description

NC_056558 - Gene Cluster 15. Type = transporter_associated. Location: 40957720 - 41215780 nt. Click on genes for more information.
Show pHMM detection rules used
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056558 - Cluster 16 - Saccharide

Gene cluster description

NC_056558 - Gene Cluster 16. Type = saccharide. Location: 42752858 - 42842103 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056559 - Cluster 17 - Cyclopeptide

Gene cluster description

NC_056559 - Gene Cluster 17. Type = cyclopeptide. Location: 4242319 - 4915182 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_056559 - Cluster 18 - Saccharide-terpene

Gene cluster description

NC_056559 - Gene Cluster 18. Type = saccharide-terpene. Location: 5656778 - 5839640 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056559 - Cluster 19 - Cyclopeptide

Gene cluster description

NC_056559 - Gene Cluster 19. Type = cyclopeptide. Location: 14709718 - 15727048 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122073634
Repeat occurs 3 times in a sequence of 364 amino acids
Location between 15375863 and 15393834
Coverage of 4.95 %
Instances:
SSEPPG | SSEPPY | SSESSG |
pattern: SSE[PS][PS][YG]
MDAREATRLASSEPPGMMAGPNTYGMSLHTTSMLNPSSAGIMQNLRLPYNQVVTPASRPSDPS
NSGGYHTEGLRPSGFTVNEPVKKKRGRPRKYGPDGSMALGLVPLSSVDGGHSNNNDNNNENNDN
NHPNESSTPSSEPPYKRNRGRPLGSGKKQQQDALGAPGVGFTTHVITVKAGEDVASKIVAFSQQ
GSRTVCIISANGAICNVTLRQPATSGGTVTYEGRFEIISLSGSFLFTENGGSRIRTGGLSVALA
GSDGRVLGGGVAGMLMAAGPVQVIVGSFIADGKKSKSERRLSLPAPQTVGLGAPVATGSPPFQG
ASSESSGQPDSPLNHSSGACNNTGQIQGFSSYSSIPWGSHSANPH
Repeat found in LOC122073634
Repeat occurs 3 times in a sequence of 364 amino acids
Location between 15375863 and 15393834
Coverage of 4.95 %
Instances:
SSEPPG | SSEPPY | SSESSG |
pattern: SSE[PS][PS][YG]
MDAREATRLASSEPPGMMAGPNTYGMSLHTTSMLNPSSAGIMQNLRLPYNQVVTPASRPSDPS
NSGGYHTEGLRPSGFTVNEPVKKKRGRPRKYGPDGSMALGLVPLSSVDGGHSNNNDNNNENNDN
NHPNESSTPSSEPPYKRNRGRPLGSGKKQQQDALGAPGVGFTTHVITVKAGEDVASKIVAFSQQ
GSRTVCIISANGAICNVTLRQPATSGGTVTYEGRFEIISLSGSFLFTENGGSRIRTGGLSVALA
GSDGRVLGGGVAGMLMAAGPVQVIVGSFIADGKKSKSERRLSLPAPQTVGLGAPVATGSPPFQG
ASSESSGQPDSPLNHSSGACNNTGQIQGFSSYSSIPWGSHSANPH
Repeat found in LOC122074821
Repeat occurs 9 times in a sequence of 1591 amino acids
Location between 15676839 and 15681615
Coverage of 5.66 %
Instances:
EPSMNSALAE | EPSQENECDE | EPSQGNEPDE | EPSQGNEPDE | EPSQGNEPDE
EPSQGNEPDE | EPSQGNEHHE | EPSQGNEPIE | EPSENLDGEL |
pattern: EPS[EMQ][GEN][SNL][ADE][GCPHL][DEIAH][EL]
MENGVAIDVEPHFVDIVAVEDGTSHPKVEDKGVHGFHVSKDSEGDIFVEASEEIHTQESSVKL
DAEQVINGEVPVDKTGDSVSLGVNGMSNVDHGTEKFEEAIKRPGNELNETVYGVGGADEPANVE
DKAEELVPVESNGLDDAGSVGTAETVVNLGDDRGNEPNETFYEVGVADGPANVKDKAEEVVPVE
SNGLDDASSVGIAEAVVNLGNDYTQEREQQVTVEKMTEDSVSTKPVDDIRLADETGNVECNGTE
VVLVQSNGFDGGALVGNSEVELREFSDIVSDRDIEPSMNSALAEGRSDAVPEKSENGDLVKNNL
GESPFHDKADVHAELVKSDVIEPSQENECDELIVHEDSESLELKEPLPNLDGEHQDHQSVQLQN
SSSEKDHSLKESDVIEPSQGNEPDELIVHQASENAELKGHSANLDGEHQDHQSVQLKNSSSEKD
CSVNESDVIEPSQGNEPDELIVHQASESAELKGPSANLDGEHQHHQSVQLKNSSSEKDCFVNES
DVIEPSQGNEPDELIVHQASESAELKGPSANLDGEHQHHQSVQLKNSSSEKDCFVNESDVIEPS
QGNEPDE
LIVHQASESAELKGPSANLDGEHQVHLSLQLKNSSYEKDRSLKESDVIEPSQGNEHH
E
PIVHQTTENMALNGTLANLDGEHQDHKSMQLKNSSSEHGPSLKESDVIEPSQGNEPIELIVHQ
YSENAKLKEPSENLDGELQDHQNVPLKNSSAEQDPSLKEAGTPEVEFPLSAEESDTVKDDAAEP
QAKDTQDSRPMLADRNGREGSSSHVKVGEPDVQETQKVKDRGSTIQSAQNVFSSSENSTNPPLA
LPSRPAGLGSAPPLGAAHRALQQPRANGSIPQRPTQLVEDPMNGEAEENDDEAREKLQMIRVKF
LRLAHRLGQTPHNVVVAQVLYRLGLAEQLQGGNTTRAGAFSFDRASAMAEQLEAAGQEPLDFSC
TIMVIGKTGVGKSATINSIFDEVKFSTDAFNVGTKKVQDVVGTVQGIKVRVIDTPGLLPSMSDQ
RHNEKILHSVKRFIKKTPPDIILYLDRLDMQSRDFGDMPLLRTITEIFGPSIWFNAIVVLTHAA
SAPPEGPNGTASSYEMFVTQRSHVVQQAIRQAAGDMRLMNPVSLVENHSACRTNRAGQRVLPNG
QVWKPHLLLLSFASKILAEANTLLKLQDGPPGKPFATRSRAPPLPFLLSSLLQSRPQLKLPEEQ
FGSDDDAFDEDLDEASDSDEDSEYDELPPFKRLTKAQLAKLSKAQKKAYFDELEYREKLFMKKQ
LKDERKRRKLLKKMTDLAKDLPSEYSESAEEESGGAASVPVPMPDLSLPASFDSDNPTHRYRFL
DSSNQWLVRPVLDTHGWDHDVGYEGINVERLFVIKEKIPLSFSGQVTKDKKEANVQMELAGSVK
HGEGKATSLGFDMQTVGKDMAYTLRSETRFSDFRHNKMAAGLSVTHLGDAVSAGVKVEDKLIVN
KRFRMVLTGGAMTGRGDVAYGGSLEATLRDKDYPLGRTLTTLGLSIMDWHGDLAIGCNVQSQLP
LGRSSNVIARANLNNRGAGQVSIRLNSSEQLQIALIGLIPLFRKLVGYSQQLPFGQ
Repeat found in LOC122074821
Repeat occurs 9 times in a sequence of 1591 amino acids
Location between 15676839 and 15681615
Coverage of 5.66 %
Instances:
EPSMNSALAE | EPSQENECDE | EPSQGNEPDE | EPSQGNEPDE | EPSQGNEPDE
EPSQGNEPDE | EPSQGNEHHE | EPSQGNEPIE | EPSENLDGEL |
pattern: EPS[EMQ][GEN][SNL][ADE][GCPHL][DEIAH][EL]
MENGVAIDVEPHFVDIVAVEDGTSHPKVEDKGVHGFHVSKDSEGDIFVEASEEIHTQESSVKL
DAEQVINGEVPVDKTGDSVSLGVNGMSNVDHGTEKFEEAIKRPGNELNETVYGVGGADEPANVE
DKAEELVPVESNGLDDAGSVGTAETVVNLGDDRGNEPNETFYEVGVADGPANVKDKAEEVVPVE
SNGLDDASSVGIAEAVVNLGNDYTQEREQQVTVEKMTEDSVSTKPVDDIRLADETGNVECNGTE
VVLVQSNGFDGGALVGNSEVELREFSDIVSDRDIEPSMNSALAEGRSDAVPEKSENGDLVKNNL
GESPFHDKADVHAELVKSDVIEPSQENECDELIVHEDSESLELKEPLPNLDGEHQDHQSVQLQN
SSSEKDHSLKESDVIEPSQGNEPDELIVHQASENAELKGHSANLDGEHQDHQSVQLKNSSSEKD
CSVNESDVIEPSQGNEPDELIVHQASESAELKGPSANLDGEHQHHQSVQLKNSSSEKDCFVNES
DVIEPSQGNEPDELIVHQASESAELKGPSANLDGEHQHHQSVQLKNSSSEKDCFVNESDVIEPS
QGNEPDE
LIVHQASESAELKGPSANLDGEHQVHLSLQLKNSSYEKDRSLKESDVIEPSQGNEHH
E
PIVHQTTENMALNGTLANLDGEHQDHKSMQLKNSSSEHGPSLKESDVIEPSQGNEPIELIVHQ
YSENAKLKEPSENLDGELQDHQNVPLKNSSAEQDPSLKEAGTPEVEFPLSAEESDTVKDDAAEP
QAKDTQDSRPMLADRNGREGSSSHVKVGEPDVQETQKVKDRGSTIQSAQNVFSSSENSTNPPLA
LPSRPAGLGSAPPLGAAHRALQQPRANGSIPQRPTQLVEDPMNGEAEENDDEAREKLQMIRVKF
LRLAHRLGQTPHNVVVAQVLYRLGLAEQLQGGNTTRAGAFSFDRASAMAEQLEAAGQEPLDFSC
TIMVIGKTGVGKSATINSIFDEVKFSTDAFNVGTKKVQDVVGTVQGIKVRVIDTPGLLPSMSDQ
RHNEKILHSVKRFIKKTPPDIILYLDRLDMQSRDFGDMPLLRTITEIFGPSIWFNAIVVLTHAA
SAPPEGPNGTASSYEMFVTQRSHVVQQAIRQAAGDMRLMNPVSLVENHSACRTNRAGQRVLPNG
QVWKPHLLLLSFASKILAEANTLLKLQDGPPGKPFATRSRAPPLPFLLSSLLQSRPQLKLPEEQ
FGSDDDAFDEDLDEASDSDEDSEYDELPPFKRLTKAQLAKLSKAQKKAYFDELEYREKLFMKKQ
LKDERKRRKLLKKMTDLAKDLPSEYSESAEEESGGAASVPVPMPDLSLPASFDSDNPTHRYRFL
DSSNQWLVRPVLDTHGWDHDVGYEGINVERLFVIKEKIPLSFSGQVTKDKKEANVQMELAGSVK
HGEGKATSLGFDMQTVGKDMAYTLRSETRFSDFRHNKMAAGLSVTHLGDAVSAGVKVEDKLIVN
KRFRMVLTGGAMTGRGDVAYGGSLEATLRDKDYPLGRTLTTLGLSIMDWHGDLAIGCNVQSQLP
LGRSSNVIARANLNNRGAGQVSIRLNSSEQLQIALIGLIPLFRKLVGYSQQLPFGQ

Similar gene clusters

NC_056559 - Cluster 20 - Alkaloid

Gene cluster description

NC_056559 - Gene Cluster 20. Type = alkaloid. Location: 27305535 - 27436472 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056559 - Cluster 21 - Cyclopeptide

Gene cluster description

NC_056559 - Gene Cluster 21. Type = cyclopeptide. Location: 27744929 - 28577238 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122074599
Repeat occurs 3 times in a sequence of 490 amino acids
Location between 27756404 and 27758531
Coverage of 3.67 %
Instances:
APTGGS | APTGGS | APTGDT |
pattern: APTG[DG][ST]
MMGSRSRDVVLTCLLIGLLLWRVDARAPGGPKDHKKKPHQSKAKGQEAAEAPTGGSTSSTAPV
ASPAMPGNAAPAGAPTGGSASSTAPVASPATPGAAAPAGAPTGDTEAPQGTLFDVTKHGAVGDG
KTDNNKAFSDAWVAACKSSSTGPSTFLIPRGDFLLSTIVFAGPCTNKTLSPSVIIRGNLIASTT
DSNSKSWITFQSLSNLHISGAGTGSFNGKGETAWSGGCRHTANSHCNVAPANLNVQSVLSGSLN
DITSVNSQGFHILVGNTQDFNIHDVKINAPKFSPNTDGIHVGNSTGITIANAEIGTGDDCVSIG
SGTSKISVTNVKCGPGHGISLGSLGKYPDEKAVSSLLVKNCTLTSTDNGVRIKTWPGSPDNIVN
DVRFEDITMTNVSNPIIINQNYCDKKACDSTPSHVKLSGIHFKNIKGTSFTKTAVALGCSSGAP
CEQVELVDIDLKPATDAGAVDATCANVKGVITSGVLNPKPSCV
Repeat found in LOC122074600
Repeat occurs 3 times in a sequence of 474 amino acids
Location between 27769284 and 27771212
Coverage of 3.8 %
Instances:
TPEASP | TPEASP | TPEASP |
pattern: TPEASP
MMGCRSRDVVLTCLLIGLLLWSADARTPGGSKESKENKKKSTLSKDKAPKAAKAPMESSGSVP
SAATPEASPNTPEASPATPEASPVTPGTADPQGTLFDVTKHGAVGDGKTDNNKAFAAAWAAVCK
SSSTGPSTFLVPRGDFLLSPIIFAGPCSDKALSPSVIIRGKLIASTSDSSASKSWITFQNLGQL
HVSGAGTGSLDGKGETAWANACRHSANKQCSLPPANLKLEGITGGSLNDIISVNSQGFHFHISN
SKDFNIHDFKINAPGFSPNTDGLHISNSASVSITNAEIGTGDDCVSIGAGTSNISVTNVICGPG
HGISVGSLGKYPNEKEVNGLLVKNCTLASTTNGVRIKTWPGSPDNTVTNVRFEDITLTNVSNPI
IINQNYCDKKACESTPSRVKLTDIHFKNIKGTTFGKTGVTLTCSKGVPCEKVELVDIDLKPSTA
AGDLASTCENVNGVTTTGVLNPKPPCA
Repeat found in LOC122073874
Repeat occurs 9 times in a sequence of 937 amino acids
Location between 27966135 and 28002531
Coverage of 12.49 %
Instances:
EEEEDGEEEEEEE | EEEDGEEEEEEEE | EEEEEEEEDNDEE | EEEEEEEDNDEEG | EEEEEEDNDEEGE
EEEEEDNDEEGEE | EEEEDNDEEGEED | EEEDNDEEGEEDI | EEEREDEAILLLS |
pattern: EEE[DER][DEGN][DEGN][DEN][DAEN][EGNID][EGNDL][DEGL][DEGL][SEGID]
MGEEKAETSGLQEAEVSVSASGMNLERDAEVVEEEEDGEEEEEEEEDNDEEGEEDIEDGEYTL
RFEGEMDPLDFTEDDAYGIQPYQQFERLEYEALAEKKRKALAQSQRDGSAKKPRQEMLGASIAE
IMEVMNYGMRRKSRKPRKRGRRKGSKNKLSPEVTRKLGDATLHYAFGRYEEAISVLKEVVRLAP
NLPDSYHTLGLVYNAIGDKKKALNFYMISAHLTPKDPSLWKLLFTWSIEQGNTGQARYCLSKAI
TADPEDINLRFDRALLHIELGDYQKAAESYEQILKRRPENVEALKMAAKMYQKCGQDERAVRIL
EDYLKDHQTQIDLDVANLLVAIHMKNNAHAEALQQIEHARLVYNRGEEFPLYLNVKAGICEVHL
GNLDKAEIIFRDLQKEHTDDHPKLISDVADSFMNLGQYESALKYYFLLEANGGNDHGFLHFKIA
QCYLSLNKRVQAIPFFQRGLHSMEGNIDARLTLASLLLEEEREDEAILLLSPPKNSESIVDKNS
TQSTHWWLNGKIKKLLARIYHARGMLEEFVNAIFSSVRETLFIETINQKVRNRKRLPRSVLLER
LKKGLGDRQADTDTVFHGFRPLAAASDLSKAARAKKSLQKKAALKEEKKAAALAAGLDWQSDDS
DDEPVEQAREPPLPNLLKDEEHYQLILDLSRALSSLRMYWEALEIINHTLRLAYNTLSVEKKEE
LRSLGAQIAYNTTNPTHGYDCARYIVQQHPYSFAAWNCYYKVISRLGYRLSKHSKFLYSMRCTY
ADCVPPMLISGHQFTMISQHQAATREYLQAYKLQPDNPLINLCVGSSLINLALGLRLQNKHHSL
AQGFAFLHNNLRICQNSQEALYNIARACHHVGLVTLAVSYYEKVLAICEKDYPIPKLPNEESSL
TEIQKPGYCDLRREAAYNLHLIYKQSGAFDLARQVLKDHCTP
Repeat found in LOC122073874
Repeat occurs 9 times in a sequence of 814 amino acids
Location between 27966135 and 28002466
Coverage of 14.37 %
Instances:
EEEEDGEEEEEEE | EEEDGEEEEEEEE | EEEEEEEEDNDEE | EEEEEEEDNDEEG | EEEEEEDNDEEGE
EEEEEDNDEEGEE | EEEEDNDEEGEED | EEEDNDEEGEEDI | EEEREDEAILLLS |
pattern: EEE[DER][DEGN][DEGN][DEN][DAEN][EGNID][EGNDL][DEGL][DEGL][SEGID]
MGEEKAETSGLQEAEVSVSASGMNLERDAEVVEEEEDGEEEEEEEEDNDEEGEEDIEDGEYTL
RFEGEMDPLDFTEDDAYGIQPYQQFERLEYEALAEKKRKALAQSQRDGSAKKPRQEMLGASIAE
IMEVMNYGMRRKSRKPRKRGRRKGSKNKLSPEVTRKLGDATLHYAFGRYEEAISVLKEVVRLAP
NLPDSYHTLGLVYNAIGDKKKALNFYMISAHLTPKDPSLWKLLFTWSIEQGNTGQARYCLSKAI
TADPEDINLRFDRALLHIELGDYQKAAESYEQILKRRPENVEALKMAAKMYQKCGQDERAVRIL
EDYLKDHQTQIDLDVANLLVAIHMKNNAHAEALQQIEHARLVYNRGEEFPLYLNVKAGICEVHL
GNLDKAEIIFRDLQKEHTDDHPKLISDVADSFMNLGQYESALKYYFLLEANGGNDHGFLHFKIA
QCYLSLNKRVQAIPFFQRGLHSMEGNIDARLTLASLLLEEEREDEAILLLSPPKNSESIVDKNS
TQSTHWWLNGKIKKLLARIYHARGMLEEFVNAIFSSVRETLFIETINQKVRNRKRLPRSVLLER
LKKGLGDRQADTDTVFHGFRPLAAASDLSKAARAKKSLQKKAALKEEKKAAALAAGLDWQSDDS
DDEPVEQAREPPLPNLLKDEEHYQLILDLSRALSSLRMYWEALEIINHTLRLAYNTLSVEKKEE
LRSLGAQIAYNTTNPTHGYDCARYIVQQHPYSFAAWNCYYKVISRKPCITLLEHVITLDLLHLL
CHIMRKCWQFVRRITLSQSFLMRNQALQKFRSQVIVTFAERLHTICT
Repeat found in LOC122074768
Repeat occurs 6 times in a sequence of 968 amino acids
Location between 28012438 and 28015422
Coverage of 3.72 %
Instances:
LDLSDN | LDLSYN | LDLAYN | LDLSFN | LDLSAN
LDLSHN |
pattern: LDL[AS][ADYHF]N
MGYSIKLAFLCLIFFSSSASGSRVEVVHSQFSSEAEALLKWKATLPSYSLPALSSWKLTPPSP
NTTTTNTTTPSAAAAAISCKWFGITCNRAGSMVEISLPHLELQGTLDNFPFSSFPNLLRLDLSD
N
GITGTIPVHIGSLSKLAHLDLSYNNLSNILPPLSNLSRLRFFFLHHNQINGTIPLEIGNMKNL
VKLALCWNEFTGMIPPVLTNLTNLSLLYLNHNELNGPIPSKIGDMENLVVLDLAYNKLCGLVPQ
SLANSSKLEILYLHENQLSGPLPQDLGGQPSIVSIQLSSNLLSGSLPQQICRGSLQYLSVRNNS
LMGTIPDFRNCTSLRRLRLDNNLLVGNITGSFGVHPHLYYIDVSHNRLYGELSPKWGECKNLSG
IKISGNNISGNIPPEFGHLTELGLLDLSFNKLVGEIPKALGGLSSLLSLYLNDNQISGHVPVEI
GNLINLEDLDLSANRLIGSIPKQLGQCSKLLSLNLSWNSLDGSIPSQIGDLVFLQDQLDLSHNS
ISGLIPPELAKLRMLEILNLSHNMITGSIPLSLEEMVSLVSLDFSYNELEGVVPNNRIFKNASP
QAFRNNKGLCGELQGLLPCPRPQTSNRSGKKGPKFFISIIASLIGIVFLAYAIIGVLILLREKV
KKENIEATRGNHGNIFSIWSYNGNIAYEDMIEATENFDAKYCIGTGTYGSVYKAVLPTGHVVAL
KKFHPLEGEIIVDEESFGNEIRILIDIRHRNIVKLYGFCSHPQCMFLVYEYMEKGSLASILSNQ
AEAMGLDWLKRVNVIKSVANALSYLHHDCVPPIIHRDISSKNILLDVELEARVSDFGIARLLKP
DSSNWISLKGTHGYIAPELAYTMALTEKCDVYSFGVVALETIMGRHPGELISSLSSLAGQKMLL
RDVLDPCLTFPSYQKVAKDMLAVVRVALACLRNHPQSRPTMHQVSKELLVPRSSFVEHFHTITL
GDLDDVEVL

Similar gene clusters

NC_056559 - Cluster 22 - Saccharide

Gene cluster description

NC_056559 - Gene Cluster 22. Type = saccharide. Location: 29191327 - 29395074 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056559 - Cluster 23 - Cyclopeptide

Gene cluster description

NC_056559 - Gene Cluster 23. Type = cyclopeptide. Location: 33425384 - 34532904 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122074532
Repeat occurs 4 times in a sequence of 633 amino acids
Location between 33798937 and 33808945
Coverage of 5.69 %
Instances:
SFKSVKSFK | SFKESKSFK | SFKTSYSFK | SFKSSRCGD |
pattern: SFK[SET][SV][KYR][CS][GF][KD]
MKQPQKKQAELLWPRVILKKWLNLSSNESDFSADPDEGDSSSDSDDEEFFCWGQDSFSKKGAP
ETRPKRRRRNSETIRAQYIKAKELRICVGTWNVGGKLPPDDLDISEWLETDEPADMYVIGLQEI
VPLNAGNIFGSEDSRPVPKWECIIRETLNRIQHEKPKYKCYSDSPSPSRFKASDDVRDIEDEIL
TETDSDGDEEVYPLDEEFSDSNEVKNGTTTSENEFIRSEALNSTENADIVSSEEHDIQRQFSSP
KKLDRLNCFRTEDSMENPEPSVPQQKGILSKTLSRTERIGLSWPEPPMDLLAKHVLERPNSFKS
VKSFK
ESKSFKTSYSFKSSRCGDYRGASETALLADIDLETIIRRKRSPFVKLVSKKMVGIFVSI
WVRRSLRRHIRNLKVSTVGVGVMGYIGNKGSVSVSMSIYQTLFCFISSHLTAGEKEGDEHKRNT
DVHEIHRRTHFNPVSGIGFPKNILDHERVIWLGDLNYRINLSYEKTRELISKKEWSKLIERDQL
IRELRKGRAFDGWSEGTLNFPPTYKYEFNSEKYYGEDPKAGRRNPAWTDRILSFGNGMKLLNYK
RAEHTLSDHRPVTAIYMAEVEEFCHRKLQKALTFTDAELEDEVIADMELDVGMSRLRF
Repeat found in LOC122074532
Repeat occurs 4 times in a sequence of 633 amino acids
Location between 33798937 and 33808945
Coverage of 5.69 %
Instances:
SFKSVKSFK | SFKESKSFK | SFKTSYSFK | SFKSSRCGD |
pattern: SFK[SET][SV][KYR][CS][GF][KD]
MKQPQKKQAELLWPRVILKKWLNLSSNESDFSADPDEGDSSSDSDDEEFFCWGQDSFSKKGAP
ETRPKRRRRNSETIRAQYIKAKELRICVGTWNVGGKLPPDDLDISEWLETDEPADMYVIGLQEI
VPLNAGNIFGSEDSRPVPKWECIIRETLNRIQHEKPKYKCYSDSPSPSRFKASDDVRDIEDEIL
TETDSDGDEEVYPLDEEFSDSNEVKNGTTTSENEFIRSEALNSTENADIVSSEEHDIQRQFSSP
KKLDRLNCFRTEDSMENPEPSVPQQKGILSKTLSRTERIGLSWPEPPMDLLAKHVLERPNSFKS
VKSFK
ESKSFKTSYSFKSSRCGDYRGASETALLADIDLETIIRRKRSPFVKLVSKKMVGIFVSI
WVRRSLRRHIRNLKVSTVGVGVMGYIGNKGSVSVSMSIYQTLFCFISSHLTAGEKEGDEHKRNT
DVHEIHRRTHFNPVSGIGFPKNILDHERVIWLGDLNYRINLSYEKTRELISKKEWSKLIERDQL
IRELRKGRAFDGWSEGTLNFPPTYKYEFNSEKYYGEDPKAGRRNPAWTDRILSFGNGMKLLNYK
RAEHTLSDHRPVTAIYMAEVEEFCHRKLQKALTFTDAELEDEVIADMELDVGMSRLRF
Repeat found in LOC122074532
Repeat occurs 4 times in a sequence of 629 amino acids
Location between 33798937 and 33808945
Coverage of 5.72 %
Instances:
SFKSVKSFK | SFKESKSFK | SFKTSYSFK | SFKSSRCGD |
pattern: SFK[SET][SV][KYR][CS][GF][KD]
MKQPQKKQAELLWPRVILKKWLNLSSNESDFSADPDEGDSSSDSDDEEFFCWGQDSFSKKGAP
ETRPKRRRRNSETIRAQYIKAKELRICVGTWNVGGKLPPDDLDISEWLETDEPADILQEIVPLN
AGNIFGSEDSRPVPKWECIIRETLNRIQHEKPKYKCYSDSPSPSRFKASDDVRDIEDEILTETD
SDGDEEVYPLDEEFSDSNEVKNGTTTSENEFIRSEALNSTENADIVSSEEHDIQRQFSSPKKLD
RLNCFRTEDSMENPEPSVPQQKGILSKTLSRTERIGLSWPEPPMDLLAKHVLERPNSFKSVKSF
K
ESKSFKTSYSFKSSRCGDYRGASETALLADIDLETIIRRKRSPFVKLVSKKMVGIFVSIWVRR
SLRRHIRNLKVSTVGVGVMGYIGNKGSVSVSMSIYQTLFCFISSHLTAGEKEGDEHKRNTDVHE
IHRRTHFNPVSGIGFPKNILDHERVIWLGDLNYRINLSYEKTRELISKKEWSKLIERDQLIREL
RKGRAFDGWSEGTLNFPPTYKYEFNSEKYYGEDPKAGRRNPAWTDRILSFGNGMKLLNYKRAEH
TLSDHRPVTAIYMAEVEEFCHRKLQKALTFTDAELEDEVIADMELDVGMSRLRF
Repeat found in LOC122074558
Repeat occurs 5 times in a sequence of 988 amino acids
Location between 34390002 and 34394939
Coverage of 4.55 %
Instances:
SWNSIISVY | SWNSMISGL | SWNSMIGAF | SWNSMIAGY | SWNSMISGY

pattern: SWNS[IM]I[ASG][AGV][YLF]
MVWFLIMITNLNPWAVMHMVNPTTSWDSCTLLLGVLIPCSHVPRTCRTLQLRAAQRSAVRRSE
LSVSAADQGLADEACRLFRSMLYAGFVPTHYVLGNALRACQDSGPCGLQFGMQIHALVLKSQYA
SDVVVCNGLISMYGSCCLDLAVYAHRVFDGIQLKNSISWNSIISVYSHRGDAFSAFGLFSAMQR
EGLGVSFKPSEYTFGSLITAAYSTPSDAGFFLLEQIVNRVTKSGFLSDLYVGSALVSGFARFGL
LDNAKKIFEQMSERNAVSMNGLMVGLVRQKRGEEAAQVFCETSDLVGINCDSYVILLSACAEFL
VPIEGRRKGREVHAYVLRTGLNEIKVAIANGLVNMYAKCGAIANASRVFKLMGVKDLVSWNSMI
SGL
DQNGCFREAVRSFCEMRRSGLIPSNFTLISTLSSCASLGCIQLGAQIHSEGIKLGLDLDVS
VSNSLLGLYAETGSIIDCQKVFSLMPGYDQVSWNSMIGAFADSDASISGAIEYFLNMMRSGWSP
NRVTFINVLAAVSALSIHELGQQVHALALKYCVADDIAIENAFLSCYGKCGEMDDCEKIFSSMF
ARRDEVSWNSMIAGYIHNGHLPKAMDLVWFMMQRGQRLDCFTFATVLSACASVAALERGKEIHA
CGIRACLETDVVVGSALVDMYAKCGRIDYASRAFRLMPMKNEFSWNSMISGYARHGIGVKALQL
FSEMLQEGQPPDHVTFVGVLSACSHVGLVDEGLEHFESMSNKYGLVPRMEHYSCMVDLLGRAGQ
LTMVEEFVKRMPMKPNILMWRTILGACCRVNGANTELGRHAAEMLLALEPQNAVNYVLVSNMNA
FGGRWDDVAKARAAMREAAVKKEAGCSWVTMKDGVHVFVAGDKSHPDTEKIYEKLLDMNRKMRD
LGYVPQTKFALYDLEVESKEELLSCHSEKLAVAFVLIRTSALPIRIMKNLRVCGDCHSAFGYIS
KIVGRQIVLRDSNRFHHFNDGKCSCGDYW

Similar gene clusters

NC_056560 - Cluster 24 - Cyclopeptide

Gene cluster description

NC_056560 - Gene Cluster 24. Type = cyclopeptide. Location: 9960990 - 10326087 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122075489
Repeat occurs 4 times in a sequence of 742 amino acids
Location between 10315493 and 10317722
Coverage of 3.23 %
Instances:
QIHGHV | QIHGYI | QIHCYL | QIHATL |
pattern: QIH[ACG][YHT][ILV]
MIRDYTANGLFKDAIGVYLKLLNFGILTQQLPFFPSLIKAFAGLLDLRMGQQIHGHVLKFGLL
TDVYVANSLLAMYWKCGALEDALHLFENILNRDSVSWNTMISGFDQSVMPLQALVMFRRMVREL
GAIPNRVACLSALSSCSSCGSLLHGKEIHAFVTKNGLDLDPSLVNEMIGMYMKCGHLRNAEHVF
GILLEREHLRENTLIWNAMISGYVFNGCFWDAFALFQKMLMLGINLDSSTMVTVLVLCSQLLDL
KVGRQIHGYIVCVGLKLNARIETAILDMYCKCGEIQDGLNVFERSSNMNFVMWGAIIAGCAQNG
LSTKALELFSKSRVEGGLVDPITVLAVLRACSSLSLMLNGMETHGLAIKMGFSSDVFIGGALTD
MYAKCGDIESSQRIFLGLSARDVISWNALVSGYAQNDYADKALKAFRNMQYELVRPNTVTFASI
LSVCARLSALILCKQIHCYLIRNRFDANVLVSNSLISAYARCGDISSALIIFNGISERNEISWN
SMIAGLGIHGHMDDMLALFDKMKAADIKPDHVTFTAILSACSHAGRVDEGWRHFRSMVKDYNLK
PELEHYTCMVDLLGRAGHLDQAFNMIMDMPCQPDGHIWGSLLASCKIHCNAEMAERAANHLFKL
DSSSMGYQILLSNIYEDFGRWDNVSRIRTAIKDMGLKKNPGCSWIEVNHKLHVFVAKDRSHYQS
EQIHATLEILTKDITGAGYIPQSYPLSIQHDETSGEGII

Similar gene clusters

NC_056560 - Cluster 25 - Fatty_acid

Gene cluster description

NC_056560 - Gene Cluster 25. Type = fatty_acid. Location: 14940103 - 15037423 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056561 - Cluster 26 - Cyclopeptide

Gene cluster description

NC_056561 - Gene Cluster 26. Type = cyclopeptide. Location: 3557161 - 4064220 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122079721
Repeat occurs 5 times in a sequence of 943 amino acids
Location between 3640829 and 3643661
Coverage of 3.18 %
Instances:
RPRKDV | RPRKNV | RPRKEK | RPRKEK | RPRKKK

pattern: RPRK[DKEN][KV]
MEKGFLIAEESLCKKGKDKAASVGLKRNQGGIKTKTKSTAREGQGLKSFTGDQDVPGVTVSTF
DYSVESHFRAIDMILKLCGEVGEEALGMCDIKQLTAAITFLREWKHFCYKPRVIRFSCETGSPH
GENVINGINLPQFSSVTVPKTTSISEDSRSFQSTSPCDKDFVLYVGGPVWALDWCPRVCQSSDC
NISCEYLAVAAHPPESPYHKVGAPLVGRGLIQIWCLLNVNVEEEGLPPLNKRKGRPRKDVSRES
TPLRKPRGRPRKNVSEESNPLKKPRGRPRKEKLLEETPLKKARGRPRKEKVSEESLPLKILSSR
PRKKK
AIEDSDDINSISQFVPALAVEFPEYSAQFPANSGVGMHTNENAMLKGCETDAVFGTGPQ
ECFNHQLSTSKANVKTPQQKKRTKEKARLGSNSDGGCQSLSNRNEDKELSAAVLPANNRSMQLL
AAPSQNVPYDGSLDAVSTCLPKDVAFPRLVLCLAHNGKVAWDVKWRPSDINDLASKHRMGYLAV
SLGNGSVEVWDVPSLSAIKILYSSCYKEGTDPRFMRLEPVFRCSKLKFGDRQSIPLTLEWSPSF
PHDLILAGCHDGTVALWKFSARHSSEDTRPLLCFSADTVPIRALAWAPNENDPESANVIVTAGN
GGLKFWDIRDPYHPLWDLSYVRRAVCSLDWLANPNCVILSFDDGTLRILSLLRAAYDVPVTGKP
FVGTQQQGLNSYYCSPFSIWSVQASRLTGLVAYCSADGNVLHFQLTAKAVEKDPLRNRAPHFLC
GSLTEENSTLTVHTPLPHVPFPMKKSLNEYADTPRSIRGFVSLANQAKRVNDQRVDYQNPALCY
EDNSIVGVEPENALAGPNSRAAKRKTSKKKKPDPNQESIAEGLENFPRGDNEKGETRSEVEVLP
HKDIAMHRVRWNMNKGSERWLCYGGAAGIVRCQQIDVSCVAKRKDIKK
Repeat found in LOC122079721
Repeat occurs 5 times in a sequence of 943 amino acids
Location between 3640829 and 3643661
Coverage of 3.18 %
Instances:
RPRKDV | RPRKNV | RPRKEK | RPRKEK | RPRKKK

pattern: RPRK[DKEN][KV]
MEKGFLIAEESLCKKGKDKAASVGLKRNQGGIKTKTKSTAREGQGLKSFTGDQDVPGVTVSTF
DYSVESHFRAIDMILKLCGEVGEEALGMCDIKQLTAAITFLREWKHFCYKPRVIRFSCETGSPH
GENVINGINLPQFSSVTVPKTTSISEDSRSFQSTSPCDKDFVLYVGGPVWALDWCPRVCQSSDC
NISCEYLAVAAHPPESPYHKVGAPLVGRGLIQIWCLLNVNVEEEGLPPLNKRKGRPRKDVSRES
TPLRKPRGRPRKNVSEESNPLKKPRGRPRKEKLLEETPLKKARGRPRKEKVSEESLPLKILSSR
PRKKK
AIEDSDDINSISQFVPALAVEFPEYSAQFPANSGVGMHTNENAMLKGCETDAVFGTGPQ
ECFNHQLSTSKANVKTPQQKKRTKEKARLGSNSDGGCQSLSNRNEDKELSAAVLPANNRSMQLL
AAPSQNVPYDGSLDAVSTCLPKDVAFPRLVLCLAHNGKVAWDVKWRPSDINDLASKHRMGYLAV
SLGNGSVEVWDVPSLSAIKILYSSCYKEGTDPRFMRLEPVFRCSKLKFGDRQSIPLTLEWSPSF
PHDLILAGCHDGTVALWKFSARHSSEDTRPLLCFSADTVPIRALAWAPNENDPESANVIVTAGN
GGLKFWDIRDPYHPLWDLSYVRRAVCSLDWLANPNCVILSFDDGTLRILSLLRAAYDVPVTGKP
FVGTQQQGLNSYYCSPFSIWSVQASRLTGLVAYCSADGNVLHFQLTAKAVEKDPLRNRAPHFLC
GSLTEENSTLTVHTPLPHVPFPMKKSLNEYADTPRSIRGFVSLANQAKRVNDQRVDYQNPALCY
EDNSIVGVEPENALAGPNSRAAKRKTSKKKKPDPNQESIAEGLENFPRGDNEKGETRSEVEVLP
HKDIAMHRVRWNMNKGSERWLCYGGAAGIVRCQQIDVSCVAKRKDIKK
Repeat found in LOC122080223
Repeat occurs 7 times in a sequence of 152 amino acids
Location between 3927309 and 3928284
Coverage of 69.08 %
Instances:
KKEEPKKEEPKKEEP | KKEEPKKEEPKKEEP | KKEEPKKEEPKKEEP | KKEEPKKEEPKKEEK | KKEEPKKEEKKEDPI
KKEEKKEDPIAELIQ | KKEDPIAELIQLMKE |
pattern: KKE[DE][KP][KI][KEA][DE][PEL][KPI][KAQ][KEL][DEML][KPEI][EQKPI]
MKKVVVKVEVYDAKSKLKAMRAVSSVSGINSISVDMEKHLMTVIGEVDPVDIVGKLRKFWQHT
VIDFVGPPEEPKKEEPKKEEPKKEEPKKEEPKKEEPKKEEKKEDPIAELIQLMKERHESGHPYN
IPYGTTYYSLKDVSDTNPNTSCVIS
Repeat found in LOC122080223
Repeat occurs 7 times in a sequence of 133 amino acids
Location between 3927791 and 3928284
Coverage of 78.95 %
Instances:
KKEEPKKEEPKKEEP | KKEEPKKEEPKKEEP | KKEEPKKEEPKKEEP | KKEEPKKEEPKKEEK | KKEEPKKEEKKEDPI
KKEEKKEDPIAELIQ | KKEDPIAELIQLMKE |
pattern: KKE[DE][KP][KI][KEA][DE][PEL][KPI][KAQ][KEL][DEML][KPEI][EQKPI]
MRAVSSVSGINSISVDMEKHLMTVIGEVDPVDIVGKLRKFWQHTVIDFVGPPEEPKKEEPKKE
EPKKEEP
KKEEPKKEEPKKEEKKEDPIAELIQLMKERHESGHPYNIPYGTTYYSLKDVSDTNPN
TSCVIS

Similar gene clusters

NC_056561 - Cluster 27 - Fatty_acid-alkaloid

Gene cluster description

NC_056561 - Gene Cluster 27. Type = fatty_acid-alkaloid. Location: 8661604 - 8866671 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056561 - Cluster 28 - Alkaloid

Gene cluster description

NC_056561 - Gene Cluster 28. Type = alkaloid. Location: 9789812 - 9883026 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056561 - Cluster 29 - Transporter_associated

Gene cluster description

NC_056561 - Gene Cluster 29. Type = transporter_associated. Location: 11970253 - 12447568 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056561 - Cluster 30 - Saccharide

Gene cluster description

NC_056561 - Gene Cluster 30. Type = saccharide. Location: 22646167 - 22757068 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056561 - Cluster 31 - Fatty_acid

Gene cluster description

NC_056561 - Gene Cluster 31. Type = fatty_acid. Location: 29785219 - 29884368 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056562 - Cluster 32 - Saccharide

Gene cluster description

NC_056562 - Gene Cluster 32. Type = saccharide. Location: 16635720 - 16908032 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056562 - Cluster 33 - Cyclopeptide

Gene cluster description

NC_056562 - Gene Cluster 33. Type = cyclopeptide. Location: 20778621 - 21582324 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122081347
Repeat occurs 3 times in a sequence of 442 amino acids
Location between 21258399 and 21259805
Coverage of 4.07 %
Instances:
EPGGEE | EPGSDQ | EPGSNK |
pattern: EPG[SG][DEN][KEQ]
MLLKFNFSINSINVVQDHYFADSKYSLRETNMLEDTDGNFQGYTDLACTVQPQEIKANRTEQE
QTRERVSRPIYIDLDEFRNKTMQGKREVQSQVGNITHRLEPGGEEYNYASSSKGAKVLTNNKEA
RGANNILGRDQDKYLRNPCSVEHKFVVIELAEETLVDAFKIANFEHYSSNFKDFELSGSLSFPT
ETWTPLGNFVAANVKHAQRFMLLEPKWVRYLRLSLLSHYGSEFYCTLSLLEVYGVDAIERMLQD
LIVVSEEPGSDQSLNHTSSIGTKSSIKEPGSNKRDVSSQVQIAVDAVGKGLDNIDDGQRPNIEV
TKNTVSNSNIPNPVLEARHQNNGRIPSDTVLKILMQKMKSLELNLSVLEEYIKQLNRSQRDVLP
ELDKEISRCALLMEKVKSEIKGLIEWKEITEKGINELEWWKSVVVYRIDALERENRMLR

Similar gene clusters

NC_056562 - Cluster 34 - Cyclopeptide

Gene cluster description

NC_056562 - Gene Cluster 34. Type = cyclopeptide. Location: 24580466 - 25168949 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


The following known motifs were found in CDS LOC122082170
Location between 24854279 and 24869474
.L.Y..Y was found 2 times in this sequence
Sequence:
MLWISDVVSIIIGSSIYYVGIFGKEREDEDFHLLEPLLKSSSNRNHDVDIQGSSGGGESITSY
ANANLLSFLTFSWMSPLVALGYKKTLDLEDVPQLSNCDSANMVFSCFRNKIESDSKNGGQVSSL
KLMKALILLTWKEILWTGLLSIVCTLASYVGPYFIEAFFQYLKSSRQLKYEGYVLVFIFFLSKL
IRCFSERHLFFQLRKMGNRVRTALFAIIYKKNLSLSSQAKQGHTSGENVNLMTVDVEKTGFFSW
YLHDIWRVPVQFILALLNLYKRLGLASLVAFAVTMILMLANAPLVKLQEKIQGKLMDSKDRRMK
VTSEALRNMRILKLQGWEMKFLSKIIERRKFEMKWLKKLLYTSAMASFVYSSTPLFVSMATFGS
CELMGIQLDAGKILSALATFQMLQGPIYNLPDTISMILQIKVSLDRITTFLCLDDFDKNIVQKL
PRDSTEVAVEIVKGNFSWDIHSPNLTLQDLNFQVKHGMKVVICGSVGSGKSSILSCILGEVPKV
SGAIKLKGTTAYVAQTPWIQSGKIVDNILFGKEMDKERYEMILEACSLKKDLELFAFGDQTIIG
ERGINLSGGQKQRIQIARALYHDADIYLLDDPFSAVDAHTGTHLFKECLLGILGSKTVIYVTHQ
VEFFPSADLILVMKGGKITQAGKYEEILSSGTDFMELVGAHKKALAKLNSIGHGAALDNLINDK
EVDNMLCSENSNQDDEVIESNNYKIEKLIEPKGQLVQEEKREKGGVNPLLYLKYVTTAYKGAFV
PWILLAQILFELLQIVSSYWMVLDNPDSGEDVRPHVMGSTLIFIYVALTFGSSLCTLIRSMLIV
TAGYKTSTLLFNKMHLCIFRAPMSFFDSTPSGRILNRVSIDQSAVDTSIPHKFEELLISIIGFL
GTIAVMSQVAWQMLIVSIPMTVACIWYQILMRKFPRLKRLVPVEKTIGGVATSLSTKGVWFQIV
EVESTKDFVTLKIMASVPNSLGRMVRNSHEEYGKYQGV

Similar gene clusters

NC_056562 - Cluster 35 - Cyclopeptide

Gene cluster description

NC_056562 - Gene Cluster 35. Type = cyclopeptide. Location: 26867360 - 27509910 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122082881
Repeat occurs 3 times in a sequence of 256 amino acids
Location between 27068691 and 27076765
Coverage of 12.89 %
Instances:
FRTVCNRGKCG | FRTTCNMKKCG | FRTVCNRKGCG |
pattern: FRT[TV]CN[MR][KG][KG]CG
MASAKVDNRGSLGSKRSRNDGTRNDGDWTCPQCGNVNFSFRTVCNRGKCGAARPPSTPQRMGS
APVPSPFNNAPPYYLGGVGAPPPMPLGMPSSYGLPFSLSGMRYDYGAHGSATGPYGLMSAYPPG
PLGGVAYGPGSALDGYGFGYRGSPLPMPGSWSGGGLPDNSASRKRRGGPDGLSEGDWVCPKCDN
VNFAFRTTCNMKKCGAPRPSSGPNQSDSGVPEGSWTCSKCENLNYPFRTVCNRKGCGNEKPATA
K
Repeat found in LOC122082881
Repeat occurs 3 times in a sequence of 255 amino acids
Location between 27068691 and 27076765
Coverage of 12.94 %
Instances:
FRTVCNRGKCG | FRTTCNMKKCG | FRTVCNRKGCG |
pattern: FRT[TV]CN[MR][KG][KG]CG
MASAKVDNRGSLGSKRSRNDGTRNDGDWTCPQCGNVNFSFRTVCNRGKCGAARPPSTPRMGSA
PVPSPFNNAPPYYLGGVGAPPPMPLGMPSSYGLPFSLSGMRYDYGAHGSATGPYGLMSAYPPGP
LGGVAYGPGSALDGYGFGYRGSPLPMPGSWSGGGLPDNSASRKRRGGPDGLSEGDWVCPKCDNV
NFAFRTTCNMKKCGAPRPSSGPNQSDSGVPEGSWTCSKCENLNYPFRTVCNRKGCGNEKPATAK

Similar gene clusters

NC_056562 - Cluster 36 - Alkaloid

Gene cluster description

NC_056562 - Gene Cluster 36. Type = alkaloid. Location: 28503541 - 28661211 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_056562 - Cluster 37 - Saccharide

Gene cluster description

NC_056562 - Gene Cluster 37. Type = saccharide. Location: 34087307 - 34154250 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056563 - Cluster 38 - Cyclopeptide

Gene cluster description

NC_056563 - Gene Cluster 38. Type = cyclopeptide. Location: 4433969 - 5366052 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_056563 - Cluster 39 - Cyclopeptide

Gene cluster description

NC_056563 - Gene Cluster 39. Type = cyclopeptide. Location: 11538261 - 12047513 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122084744
Repeat occurs 5 times in a sequence of 640 amino acids
Location between 11930051 and 11935437
Coverage of 5.47 %
Instances:
PTQIGNL | PTQIGNL | PTQIGNL | PTQMGKL | PTQLVAQ

pattern: PTQ[IML][GV][KAN][LQ]
MASFTALLVMEIIVVVVWVAASFTIVRPLATIAVASESAAAAEANALLNWKASGSNFPDSWNA
NDTSPCKWEGIACNTAGRVSEINLHSYSLNADLGILNLKGLQLGSNMISGPIPTQIGNLSNLTE
LDLNNNEFTGSIPSQIWNLRNLERLELGSNLLSGSIPTQIGNLRNLMQFGLGSNMFSGSIPTQI
GNL
RDLNELNLSGNKFTGSIPMEIGYLERLTRLYLAHNNLSGLIPTQMGKLHIDSLDLDMRYND
FEKKIEKKLKLKIPIIISLSSIIILVAFVIVIVWFLFKQKSINSITDATRTQKNGDIFSIWNYD
GTIAYSDIMEATEEFDIKYCIGTGGYGSVYRAMLPTGKVVAVKKLHRIEAEEKAYEKSFTNEIH
VLTRIRHRNIVKLYGFCSHPRCNFLVYEYIERGSLAYVLGNEAEAVELNWRKRINVIKGTAHAL
SYLHHDYIPALIHRDISSNNVLLDANLEPHVADFGTSRFLNPNSSNRTIPAGTYGYIAPELAYT
MVVTEKCDVYSFGVLTLEIIMGKHPAELISSLSSPNAEDIMLNDVLDPRPPPPTQLVAQDLNVS
MMLGIACLHTNPKSCPSMQYVSQKLRSSTDLIAVLLYKFIVAIRHMKIKPNPLSYMEMVHPLLR
Y

Similar gene clusters

NC_056563 - Cluster 40 - Cyclopeptide

Gene cluster description

NC_056563 - Gene Cluster 40. Type = cyclopeptide. Location: 14399246 - 17038712 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_056563 - Cluster 41 - Cyclopeptide

Gene cluster description

NC_056563 - Gene Cluster 41. Type = cyclopeptide. Location: 17388259 - 18315061 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122085213
Repeat occurs 3 times in a sequence of 400 amino acids
Location between 17885382 and 17888387
Coverage of 9.75 %
Instances:
SMKQAKELQKKLV | SMKQAKELRKKLA | SMKQAKELRKKLA |
pattern: SMKQAKEL[RQ]KKL[AV]
MEDPEDQMTIAQFSKTRQQLPPPHTLKRKRNAWRKRNPNLLRFWNSSPPLSSPNHNQVVPDQG
EKNRSRRRMYWSSVGANSCSMKQAKELQKKLVPEKMPTCVKFMCPTYVGSYFWMRLPLDFCKLN
LPKEDCVITLVDENCEEYETKYIPGRAVLSGGWRAFSVAHKLVPKDTVLFQLIKATKLKVYILR
EKVAKKVILNTSSDGSNSHSMKQAKELRKKLALEKMPLSSDAANSHSMKQAKELRKKLALEEMP
TCMKFIRPSHVSGCFWLRLPVDFCKLNLPKEDCIITLVDERGEEYEMKYVAERVGIGGGWRAFS
VAHKLVPKDTVLFQLIKATKFKVYILRENVAKKETLTEEDNVDEGASHHLSDLQKKNQNAIPPG
STYNPGPPAGEQFENES
Repeat found in LOC122085213
Repeat occurs 3 times in a sequence of 399 amino acids
Location between 17885382 and 17888387
Coverage of 9.77 %
Instances:
SMKQAKELQKKLV | SMKQAKELRKKLA | SMKQAKELRKKLA |
pattern: SMKQAKEL[RQ]KKL[AV]
MEDPEDQMTIAQFSKTRQQLPPPHTLKRKRNAWRKRNPNLLRFWNSSPPLSSPNHNQVVPDQG
EKNRRRRMYWSSVGANSCSMKQAKELQKKLVPEKMPTCVKFMCPTYVGSYFWMRLPLDFCKLNL
PKEDCVITLVDENCEEYETKYIPGRAVLSGGWRAFSVAHKLVPKDTVLFQLIKATKLKVYILRE
KVAKKVILNTSSDGSNSHSMKQAKELRKKLALEKMPLSSDAANSHSMKQAKELRKKLALEEMPT
CMKFIRPSHVSGCFWLRLPVDFCKLNLPKEDCIITLVDERGEEYEMKYVAERVGIGGGWRAFSV
AHKLVPKDTVLFQLIKATKFKVYILRENVAKKETLTEEDNVDEGASHHLSDLQKKNQNAIPPGS
TYNPGPPAGEQFENES

Similar gene clusters

NC_056563 - Cluster 42 - Saccharide-cyclopeptide

Gene cluster description

NC_056563 - Gene Cluster 42. Type = saccharide-cyclopeptide. Location: 21691304 - 22598935 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122084405
Repeat occurs 9 times in a sequence of 119 amino acids
Location between 21706362 and 21706827
Coverage of 113.45 %
Instances:
HAHAHDHAHAHAHAH | HAHDHAHAHAHAHAH | HAHAHAHAHAHAHSH | HAHAHAHAHAHSHVH | HAHAHAHAHSHVHAH
HAHAHAHSHVHAHGP | HAHAHSHVHAHGPAT | HAHSHVHAHGPATAV | HAHGPATAVSVPVPA |
pattern: HAH[ASDG][PH][DSAV][HT][ASV][HV][ASGV][PHV][SGVPA][PHTV][SGVPA][TVPAH]
MAKGAAFFVLVLALAFVFLMDSAYSDDSAYEAGRKVKETAHDLKHKTEDFFKGIFGRLAKNHH
HHDHAHAHDHAHAHAHAHAHAHSHVHAHGPATAVSVPVPAPAETPKAAKAEAPSHV
Repeat found in LOC122084114
Repeat occurs 3 times in a sequence of 325 amino acids
Location between 21851396 and 21856523
Coverage of 10.15 %
Instances:
EEEEEKKLKKK | EEEEKKLKKKK | EEEKKLKKKKA |
pattern: EEE[KE][KE][KL][KL][KL]KK[KA]
MVNLVAAQKPLLHGMMKLAGLSPKTVEIEPGTTMNFWVPSETIKKAKADPAHTLEEEEEKKLK
KK
KANKKKPVVVMIHGFAAEGIVTWQFQVGALTKNYAIYIPDLLFFGGSITDETDRSPEFQTEC
LVKALTKLRVDTCTVVGFSYGGMVAFKMAELHPNFVKSLVVSGTNPALTDSISEETMSRLGFSS
SSEMLLPTSVKGLKSLLKVAIHKKLWFPNRLYKDYLEVMFTNRKERGELLETLVNNRKNITIPN
LTQRIYLLWGEEDQIFNMEIAQNLKGQLGDNATLLSIKKGGHLVHLERPCVYNRLLKSILASLH
TDTAQQ

Similar gene clusters

NC_056563 - Cluster 43 - Terpene

Gene cluster description

NC_056563 - Gene Cluster 43. Type = terpene. Location: 31846763 - 31969629 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056564 - Cluster 44 - Cyclopeptide

Gene cluster description

NC_056564 - Gene Cluster 44. Type = cyclopeptide. Location: 1187758 - 2356258 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122087336
Repeat occurs 3 times in a sequence of 168 amino acids
Location between 2297065 and 2297572
Coverage of 16.07 %
Instances:
DDDDDEDGE | DDDDEDGEE | DDDEDGEEG |
pattern: DDD[DE][DE][DEG][DEG][EG][EG]
MAAGKSYLARPSYRFLGGEIEPIRSESVFEFDEADIWSTNHTASSPQEFKKMTIPSSRGSKKS
TKRVDNGGDYRRSAGATGSSLPVNIPDWSKILKEDRRRGIPGDVDDDDDEDGEEGRVPPHEFIA
KQSARNGVSSFSVHEGIGRTLKGRDLSRVRNAIWEKTGFQD

Similar gene clusters

NC_056564 - Cluster 45 - Saccharide

Gene cluster description

NC_056564 - Gene Cluster 45. Type = saccharide. Location: 8267216 - 8451874 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056564 - Cluster 46 - Saccharide

Gene cluster description

NC_056564 - Gene Cluster 46. Type = saccharide. Location: 14683021 - 14903315 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056564 - Cluster 47 - Fatty_acid

Gene cluster description

NC_056564 - Gene Cluster 47. Type = fatty_acid. Location: 16129822 - 16274753 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056565 - Cluster 48 - Cyclopeptide

Gene cluster description

NC_056565 - Gene Cluster 48. Type = cyclopeptide. Location: 1468847 - 2957246 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122088571
Repeat occurs 31 times in a sequence of 1811 amino acids
Location between 2703546 and 2712694
Coverage of 22.25 %
Instances:
YSPTSPGYSPTSP | YSPTSPGYSPTSP | YSPTSPGYSPTSP | YSPTSPTYCPSSP | YSPTSPAYSPTSP
YSPTSPSYSPTSP | YSPTSPSYSPTSP | YSPTSPSYSPTSP | YSPTSPSYSPTSP | YSPTSPSYSPPSP
YSPPSPSYSPTSP | YSPTSPAYSPTSP | YSPTSPAYSPTSP | YSPTSPSYSPTSP | YSPTSPSYSPTSP
YSPTSPSYSPTSP | YSPTSPSYSPTSP | YSPTSPAYSPTSP | YSPTSPGYSPTSP | YSPTSPSYSPTSR
YSPTSRSYSPTSP | YSPTSPSYNPPSA | YSPSLAYSPSSPR | YSPSSPRLSPSSP | YSPTSPSYSPPSP
YSPPSPSYSPSSP | YSPSSPDNSGVSP | YSPSSPQYSPSAG | YSPSAGYSPTAPG | YSPTAPGYSPSST
YSPSSTSQYTPQF |
pattern: YSP[PST][ASL][GTPRA][SDGTQRAY][SNQYL][SNCPY][PSGT][STVPA][APSQ][GTPRAF]
MDMRYQFSPAEVKKVRRVQFGIISPDEIRQMSVVQVEHGETTEKGKPKPGGLSDPRLGTIDRK
LKCETCNATMSECPGHFGHLELAKPMFHIGFLKTVLSIMRCVCFNCSKILVDEDDHKFKQALRI
KNPKHRMKKILDACKNKAKCDGGEDINVEGQNSEQPVKKSQGGCGAQQPKLTVEGMKMVAEYKT
QRKKNDEQDQLPEPVERKQQLSAERVLGVLKRIRDEDSVKLGLDPEYARPDWMILQVLPIPPPP
VRPSVMMDTSSRSEDDLTHQLAMIIRHNENLRRQEKNGAPAHIISEFAQLLQFHIATYFDNELP
GQPRATQRSGRPIKSICSRLKAKEGRIRGNLMGKRVDFSARTVITPDPTINIDELGVPWSIALN
LTYPETVTPYNIERLKELVEFGPHPPPGKTGAKYIIREDGQRLDLRYLKKSSDHHLELGYKVER
HLQDGDYVLFNRQPSLHKMSIMGHRIKIMPYSTFRLNLSVTTPYNADFDGDEMNMHVPQSFETR
AEVLELMMVPKCIVSPQANRPVMGIVQDTLLGCRKITKRDIFIEKDVFMNILMWWEDFDGKIPA
PAILKPKPLWTGKQVFNLIIPKQINLLRNSAWHSESETGFITPADTQVRIEKGELLSGILCKKT
LGTSTGSLIHVIWEEVGPDAARKFLGHTQWLVNYWLLQNAFSIGIGDTIADAATMEKINETISK
AKSEVKELIKASQESQLEAEPGRTMMESFENKVNQVLNKARDDAGSSAQKSLSESNNLKAMVTA
GSKGSFINISQMTACVGQQNVEGKRIPFGFIGRTLPHFTKDDNGPESRGFVENSYLRGLTPQEF
FFHAMGGREGLIDTAVKTSETGYIQRRLVKAMEDIMVKYDGTVRNSLGDVIQFLYGEDGMDAVW
IESQKLDSLKMKKSEFNAVFRYEIDDENWNPNYMLPEHVEDLKTIREFRNVFDAEVQKLEADRY
QLGTEIATTGENSWPMPVNLKRLIWNAQKTFKLDLRRPSDMHPMEIVEAVDKLQERLKVVPGDD
FLSTEAQKNATLLFNILLRSTFASKRVLKEYGLTREAFDWVIGEIESRFLQSLVAPGEMIGCVA
AQSIGEPATQMTLNTFHYAGVSAKNVTLGVPRLREIINVAKKIKTPSLSVYLKPEVNKTKEKAK
NVQCALEYTTLRSVTQATEVWYDPDPMSTIIEEDVEFVKSYYEMPDEEISPEKISPWLLRIELN
REMMVDKKLSMADIAEKINLEFDDDLTCIFNDDNAEKLILRIRIMNDEASKGESQDESAEDDVF
LKRIESNMLTEMALRGIPDINKVFIKSGKVNQFDENEGFKPEVEWMLDTEGVNLLAVMCHEDVD
ATRTTSNHLIEVIEVLGIEAVRRALLDELRVVISFDGSYVNYRHLAILCDTMTYRGHLMAITRH
GINRNDTGPMMRCSFEETVDILLDAAVYAETDFLRGVTENIMLGQLAPIGTGGCALYLNDHMLQ
QAIELQLPSYMEGLGMTPSRSPITGTPYNEGMVSPSYLLSPTIRSSPITDAQFSPYVGGMAFSP
TSSPCYSLSSPGYSPTSPGYSPTSPGYSPTSPGYSPTSPTYCPSSPGYSPTSPAYSPTSPSYSP
TSPSYSPTSP
SYSPTSPSYSPTSPSYSPPSPSYSPTSPAYSPTSPAYSPTSPSYSPTSPSYSPT
SP
SYSPTSPSYSPTSPAYSPTSPGYSPTSPSYSPTSRSYSPTSPSYNPPSAKYSPSLAYSPSSP
R
LSPSSPYSRSSPNYSPTSPSYSPPSPSYSPSSPDNSGVSPDYSPSSPQYSPSAGYSPTAPGYS
PSST
SQYTPQFSNKDEGSTP

Similar gene clusters

NC_056565 - Cluster 49 - Saccharide-transporter_associated-alkaloid

Gene cluster description

NC_056565 - Gene Cluster 49. Type = saccharide-transporter_associated-alkaloid. Location: 15705178 - 15982781 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056565 - Cluster 50 - Cyclopeptide

Gene cluster description

NC_056565 - Gene Cluster 50. Type = cyclopeptide. Location: 17401488 - 18007318 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122089584
Repeat occurs 6 times in a sequence of 168 amino acids
Location between 17823083 and 17823590
Coverage of 39.29 %
Instances:
PSGPSCPPARE | PSGPSDPPARE | PSGISDPPANE | PSGPSDPPARE | PSGPSHPPARE
PSGPSDPPAHD |
pattern: PSG[PI]S[DCH]PPA[HNR][DE]
MAMLTMKVLVLILILSPAINGVVEARAGSDWFTYLPKGAPIPPSGPSCPPARERFHFLPKGAP
IPPSGPSDPPAREGFHYLSKGSPILPSGISDPPANEGFHYLPKGAPIPPSGPSDPPAREGFHYL
PKGSPIPPSGPSHPPAREGFHYLPKGAPIPPSGPSDPPAHD
Repeat found in LOC122087989
Repeat occurs 19 times in a sequence of 467 amino acids
Location between 17824445 and 17831698
Coverage of 61.03 %
Instances:
YLPKGAPIPPSGPSR | YLPKGAPIPPSGPSD | YLPKGSPIPPSGPSR | YLPKGSPIPPSGPSR | YLPKGAPIPPAGPFD
YLPKGSPIPPSGPSR | YLPKGSPIPPSGPSR | YLPKGAPIPPSGPSD | YLPKGSPIPPSGPSR | YLPKGSPIPPSGPSR
YLPKGSPIPPSGPSR | YLPKGAPIPPSGPSD | YLPKGSPIPPSGPSR | YLPKGAPIPPSGPSD | YLPKGSPIPPSGPSR
YLPKGSPIPPSGPSR | YLPKGAPIPPSGPSD | YLPKGSPIPPSGPSR | YLPKGAPIPPYGPSD |
pattern: YLPKG[AS]PIPP[ASY]GP[SF][DR]
MATLTMKFLMLVFLLSPAINGVVEARAGSDWFTYLPKGAPIPPSGPSRPPAREGFHYLPKGAP
IPPSGPSD
PPAYEGFHYLPKGSPIPPSGPSRPPAHEGFHYLPKGSPIPPSGPSRPPAREGFHYL
PKGAPIPPAGPFD
PPAHEGFHYLPKGSPIPPSGPSRPPAREGFHYLPKGSPIPPSGPSRPPARE
GFHYLPKGAPIPPSGPSDLPAHEGFHYLPKGSPIPPSGPSRPPAREGFHYLPKGSPIPPSGPSR
PPAHEGFHYLPKGSPIPPSGPSRPPAREGFHYLPKGAPIPPSGPSDPPAHEGFHYLPKGSPIPP
SGPSR
PSAREGFHYLPKGAPIPPSGPSDPPAHEGFHYLPKGSPIPPSGPSRPPAREGFHYLPKG
SPIPPSGPSR
PPAREGFHYLPKGAPIPPSGPSDPPAHEGFHYLPKGSPIPPSGPSRPPAREGFH
YLPKGAPIPPYGPSD
PPAHE

Similar gene clusters

NC_056565 - Cluster 51 - Cyclopeptide

Gene cluster description

NC_056565 - Gene Cluster 51. Type = cyclopeptide. Location: 17588001 - 18081731 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122089584
Repeat occurs 6 times in a sequence of 168 amino acids
Location between 17823083 and 17823590
Coverage of 39.29 %
Instances:
PSGPSCPPARE | PSGPSDPPARE | PSGISDPPANE | PSGPSDPPARE | PSGPSHPPARE
PSGPSDPPAHD |
pattern: PSG[PI]S[DCH]PPA[HNR][DE]
MAMLTMKVLVLILILSPAINGVVEARAGSDWFTYLPKGAPIPPSGPSCPPARERFHFLPKGAP
IPPSGPSDPPAREGFHYLSKGSPILPSGISDPPANEGFHYLPKGAPIPPSGPSDPPAREGFHYL
PKGSPIPPSGPSHPPAREGFHYLPKGAPIPPSGPSDPPAHD
Repeat found in LOC122087989
Repeat occurs 19 times in a sequence of 467 amino acids
Location between 17824445 and 17831698
Coverage of 61.03 %
Instances:
YLPKGAPIPPSGPSR | YLPKGAPIPPSGPSD | YLPKGSPIPPSGPSR | YLPKGSPIPPSGPSR | YLPKGAPIPPAGPFD
YLPKGSPIPPSGPSR | YLPKGSPIPPSGPSR | YLPKGAPIPPSGPSD | YLPKGSPIPPSGPSR | YLPKGSPIPPSGPSR
YLPKGSPIPPSGPSR | YLPKGAPIPPSGPSD | YLPKGSPIPPSGPSR | YLPKGAPIPPSGPSD | YLPKGSPIPPSGPSR
YLPKGSPIPPSGPSR | YLPKGAPIPPSGPSD | YLPKGSPIPPSGPSR | YLPKGAPIPPYGPSD |
pattern: YLPKG[AS]PIPP[ASY]GP[SF][DR]
MATLTMKFLMLVFLLSPAINGVVEARAGSDWFTYLPKGAPIPPSGPSRPPAREGFHYLPKGAP
IPPSGPSD
PPAYEGFHYLPKGSPIPPSGPSRPPAHEGFHYLPKGSPIPPSGPSRPPAREGFHYL
PKGAPIPPAGPFD
PPAHEGFHYLPKGSPIPPSGPSRPPAREGFHYLPKGSPIPPSGPSRPPARE
GFHYLPKGAPIPPSGPSDLPAHEGFHYLPKGSPIPPSGPSRPPAREGFHYLPKGSPIPPSGPSR
PPAHEGFHYLPKGSPIPPSGPSRPPAREGFHYLPKGAPIPPSGPSDPPAHEGFHYLPKGSPIPP
SGPSR
PSAREGFHYLPKGAPIPPSGPSDPPAHEGFHYLPKGSPIPPSGPSRPPAREGFHYLPKG
SPIPPSGPSR
PPAREGFHYLPKGAPIPPSGPSDPPAHEGFHYLPKGSPIPPSGPSRPPAREGFH
YLPKGAPIPPYGPSD
PPAHE

Similar gene clusters

NC_056565 - Cluster 52 - Cyclopeptide

Gene cluster description

NC_056565 - Gene Cluster 52. Type = cyclopeptide. Location: 22509341 - 23184634 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_056565 - Cluster 53 - Alkaloid

Gene cluster description

NC_056565 - Gene Cluster 53. Type = alkaloid. Location: 29122238 - 29392236 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056565 - Cluster 54 - Saccharide

Gene cluster description

NC_056565 - Gene Cluster 54. Type = saccharide. Location: 30775213 - 31002316 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056565 - Cluster 55 - Polyketide

Gene cluster description

NC_056565 - Gene Cluster 55. Type = polyketide. Location: 33890635 - 33984234 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056565 - Cluster 56 - Saccharide

Gene cluster description

NC_056565 - Gene Cluster 56. Type = saccharide. Location: 35755521 - 36069554 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056566 - Cluster 57 - Putative

Gene cluster description

NC_056566 - Gene Cluster 57. Type = putative. Location: 1726573 - 2147314 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056566 - Cluster 58 - Cyclopeptide

Gene cluster description

NC_056566 - Gene Cluster 58. Type = cyclopeptide. Location: 2482016 - 2769107 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_056566 - Cluster 59 - Cyclopeptide

Gene cluster description

NC_056566 - Gene Cluster 59. Type = cyclopeptide. Location: 3130080 - 5230415 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122091569
Repeat occurs 3 times in a sequence of 365 amino acids
Location between 3296674 and 3305197
Coverage of 5.75 %
Instances:
ERGEDGE | ERGDGWM | ERGEDGE |
pattern: ERG[DE][DG][WG][EM]
MGELFNERGEDGELEMSLQEVPSGHWKSSLIIQVQPTNIQGKPFFYFPLRKKKNLLSEDTRER
SRQTAISKKKKEGEMEQGSAFDLYLLPEDCISNILSLTTPRDVCRSSTVSPVFRSAANSDSVWE
KFLPSDIHQILSSSSVSPSLPTFSSKKQLFLHLCDNPLLIDGGTKTFALEKCSGKKCYMIGARE
LTIIWGSSPEYWKWIPHPESRFSEVAELLCVWWLEIRGKIETRLLSPNTTYGAYLVLKFTNRAY
GFEPLAEVSVKLASGTGGAEEGEVKSVSLNPRSLRSRNASSSEEAVADPPPEGPKERGDGWMEI
EMGEFFNERGEDGEVVMSLMEVNCSICKSGLIIQGIELRPKENQPV
Repeat found in LOC122091773
Repeat occurs 4 times in a sequence of 494 amino acids
Location between 5110451 and 5112373
Coverage of 7.29 %
Instances:
GGGAPGGGG | GGGGGSGGP | GGGGSGGPA | GGGSGGPAP |
pattern: GGG[ASG][PSG][SG][PG][APG][APG]
MEGGQTGGGAPGGGGGSGGPAPFLLKTYDMIDDSSTDKIVSWSDNKNSFVVWNPPEFAARLLP
TYFKHNNFSSFIRQLNTYGFRKIDPEKWEFANEDFVKDQKHLLKNIYRRKPIHSHSHPQGSGGD
AERAALEEEIEKLSREKTTLQGTLWRFRQQQSGTKVQLEDLDRRVEGMEQRQMRMMTFLAKAME
NPIFVEHLVRMAGSSMDLSAINKRRRLPNLDCCQEVLDNSLADNHNSPKPDIGHIFHQDFSSKL
NLELSPAVSDSNLVYPSAQSSNEDGGSPQRRASEGDVRDAHVMMEGLSFTPETFDLSDTGTSFA
LKKDATLSRQVRENDTSRLHSLPHGLTSTEEGDGHISCHLNLTLASFALQVENSHYSSRISQLG
PESGTSTELRSSANGNEADLRVASKNRSTSNDDAPKPSSNDAPTNNKGPSAPQGRVNDVFWEQF
LTERPGSSDTEEASSSLRANPYDDQEEKMSGHGKSWRNRKDMEQLTL

Similar gene clusters

NC_056566 - Cluster 60 - Cyclopeptide

Gene cluster description

NC_056566 - Gene Cluster 60. Type = cyclopeptide. Location: 14041842 - 14637964 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_056566 - Cluster 61 - Terpene-alkaloid

Gene cluster description

NC_056566 - Gene Cluster 61. Type = terpene-alkaloid. Location: 21230081 - 21356153 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056566 - Cluster 62 - Terpene

Gene cluster description

NC_056566 - Gene Cluster 62. Type = terpene. Location: 25120622 - 25270390 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056567 - Cluster 63 - Alkaloid

Gene cluster description

NC_056567 - Gene Cluster 63. Type = alkaloid. Location: 4267587 - 4420228 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056567 - Cluster 64 - Cyclopeptide

Gene cluster description

NC_056567 - Gene Cluster 64. Type = cyclopeptide. Location: 4976045 - 5905856 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122093949
Repeat occurs 3 times in a sequence of 504 amino acids
Location between 5194749 and 5198072
Coverage of 4.76 %
Instances:
ASASASAP | ASASAPAP | ASAPAPTY |
pattern: ASA[PS]A[PS][AT][PY]
MHPAISLFPNIEFYGKQILDAPNVKEKIHEKHFLQGNIYGPYSFINVAYGREENDGHGKKNMV
EVAVVSAIVKSLSEASFASRQDLSVGVISPYKAQVFAIKEKLGNTYNTLKYFSVSIRTVDGFQG
GEEDVIIISTVRSNGRGLVGFLANFQRTNVALTRARYCLWVLGNGPTLINSGSIWRKLVLDAKD
RGCFFNADHEKSLKEAIIAGLVELREFDKLLNTDSLLFAGARWKVLFSNDFWKSIVKIRRVETR
NEVLSLLMKLASGWRNPQKWKNLHLMDGTSSELLNVYKVDEWFNLVWTVDILRENSKYIQVLKV
WDILPLQEIPNLAKRLDVIFGNYKVDEMKRSKFKCSEGNLEVPMSWDTSSAATLMDLPNPKPLQ
YLSSHFTSLHLGKEPNASASASAPAPTYIAKDGKTNDLAETTKMKWRSLPGLLKRGIWKFALPT
WEEDPVLPTVIVPHGQVMWLFLLLDREYMVHMSHMPTLQISKCESIKYPHVYWHARL
Repeat found in LOC122092727
Repeat occurs 7 times in a sequence of 747 amino acids
Location between 5326888 and 5329132
Coverage of 6.56 %
Instances:
VSWNTMI | VSWNSML | VSWNLML | VSWVTML | VSWTTMI
VSWNTMI | VSWNSMI |
pattern: VSW[NTV][SLT]M[IL]
MRSAVKSIGESGSLVFNQNLKIAQLGKCGQIEEAIKIFERMVNRNTVTWNSMISAYAKNGRIT
DARRLFDQMHSKNLVSWNTMIAGYLHNDQIHEAADLFGKMPKRDIFSWTLMITCYTRNGELQKA
RYFFDRLPDKRNLVCWNAMITGYVKNQRLEEARRLFDQMPVRDMVSWNSMLTGYTQNGEIGMGL
KFFDQMEERDAVSWNLMLDGFIQACDLDSAWLFFLRIPSPNIVSWVTMLSGYARNGQIAEAQRI
FEQMPEKNVVAWNAMLAGYVQNGEIDEAIRLFKQMPESNSVSWTTMINGYVRVGKLSEARELLN
SMPYKNVAAKTAMISGYVQSMRMDEARDIFDSVGSRDVVCWNTMIAGYAMCRRMDEAIKLFKQM
HKKDIVSWNTMIAGYAQEGQMDKALHIFDEMVERNTVSWNSMISGFTQNGLCTDALQHFVLMVR
EGKKPDQSTFATSLSACANLAALQVGKQLHHLVMKLGYVHDLFVGNALITMYAKCGRVLSADQV
FNDISSADIISWNSLIAGYALNGYGKEAIQLFQEMEMVGVAPDQVTFVGLLSACSHAGLIDQGL
KLFKSMTDDFSIEPVAEHYACMVDLLGRAGRLEEAYQLVMKMPIRANAGTWGALLGACRMYRNL
ELGKFAAEKLSEFEPHKTSNYVLLSNIHAEAGRWEEVERVRVLMKEQGVEKQPGCSWIEVKNQV
HTFISDDPALPRAADICEMLRNLTEQMKSRGYMLDINFSLLDCG
Repeat found in LOC122092727
Repeat occurs 7 times in a sequence of 747 amino acids
Location between 5326888 and 5329132
Coverage of 6.56 %
Instances:
VSWNTMI | VSWNSML | VSWNLML | VSWVTML | VSWTTMI
VSWNTMI | VSWNSMI |
pattern: VSW[NTV][SLT]M[IL]
MRSAVKSIGESGSLVFNQNLKIAQLGKCGQIEEAIKIFERMVNRNTVTWNSMISAYAKNGRIT
DARRLFDQMHSKNLVSWNTMIAGYLHNDQIHEAADLFGKMPKRDIFSWTLMITCYTRNGELQKA
RYFFDRLPDKRNLVCWNAMITGYVKNQRLEEARRLFDQMPVRDMVSWNSMLTGYTQNGEIGMGL
KFFDQMEERDAVSWNLMLDGFIQACDLDSAWLFFLRIPSPNIVSWVTMLSGYARNGQIAEAQRI
FEQMPEKNVVAWNAMLAGYVQNGEIDEAIRLFKQMPESNSVSWTTMINGYVRVGKLSEARELLN
SMPYKNVAAKTAMISGYVQSMRMDEARDIFDSVGSRDVVCWNTMIAGYAMCRRMDEAIKLFKQM
HKKDIVSWNTMIAGYAQEGQMDKALHIFDEMVERNTVSWNSMISGFTQNGLCTDALQHFVLMVR
EGKKPDQSTFATSLSACANLAALQVGKQLHHLVMKLGYVHDLFVGNALITMYAKCGRVLSADQV
FNDISSADIISWNSLIAGYALNGYGKEAIQLFQEMEMVGVAPDQVTFVGLLSACSHAGLIDQGL
KLFKSMTDDFSIEPVAEHYACMVDLLGRAGRLEEAYQLVMKMPIRANAGTWGALLGACRMYRNL
ELGKFAAEKLSEFEPHKTSNYVLLSNIHAEAGRWEEVERVRVLMKEQGVEKQPGCSWIEVKNQV
HTFISDDPALPRAADICEMLRNLTEQMKSRGYMLDINFSLLDCG
Repeat found in LOC122092727
Repeat occurs 7 times in a sequence of 747 amino acids
Location between 5326888 and 5329132
Coverage of 6.56 %
Instances:
VSWNTMI | VSWNSML | VSWNLML | VSWVTML | VSWTTMI
VSWNTMI | VSWNSMI |
pattern: VSW[NTV][SLT]M[IL]
MRSAVKSIGESGSLVFNQNLKIAQLGKCGQIEEAIKIFERMVNRNTVTWNSMISAYAKNGRIT
DARRLFDQMHSKNLVSWNTMIAGYLHNDQIHEAADLFGKMPKRDIFSWTLMITCYTRNGELQKA
RYFFDRLPDKRNLVCWNAMITGYVKNQRLEEARRLFDQMPVRDMVSWNSMLTGYTQNGEIGMGL
KFFDQMEERDAVSWNLMLDGFIQACDLDSAWLFFLRIPSPNIVSWVTMLSGYARNGQIAEAQRI
FEQMPEKNVVAWNAMLAGYVQNGEIDEAIRLFKQMPESNSVSWTTMINGYVRVGKLSEARELLN
SMPYKNVAAKTAMISGYVQSMRMDEARDIFDSVGSRDVVCWNTMIAGYAMCRRMDEAIKLFKQM
HKKDIVSWNTMIAGYAQEGQMDKALHIFDEMVERNTVSWNSMISGFTQNGLCTDALQHFVLMVR
EGKKPDQSTFATSLSACANLAALQVGKQLHHLVMKLGYVHDLFVGNALITMYAKCGRVLSADQV
FNDISSADIISWNSLIAGYALNGYGKEAIQLFQEMEMVGVAPDQVTFVGLLSACSHAGLIDQGL
KLFKSMTDDFSIEPVAEHYACMVDLLGRAGRLEEAYQLVMKMPIRANAGTWGALLGACRMYRNL
ELGKFAAEKLSEFEPHKTSNYVLLSNIHAEAGRWEEVERVRVLMKEQGVEKQPGCSWIEVKNQV
HTFISDDPALPRAADICEMLRNLTEQMKSRGYMLDINFSLLDCG

Similar gene clusters

Similar known gene clusters

NC_056567 - Cluster 65 - Cyclopeptide

Gene cluster description

NC_056567 - Gene Cluster 65. Type = cyclopeptide. Location: 8215621 - 9848423 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_056567 - Cluster 66 - Putative

Gene cluster description

NC_056567 - Gene Cluster 66. Type = putative. Location: 23908685 - 24145955 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056568 - Cluster 67 - Saccharide

Gene cluster description

NC_056568 - Gene Cluster 67. Type = saccharide. Location: 1472341 - 1933240 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056568 - Cluster 68 - Fatty_acid

Gene cluster description

NC_056568 - Gene Cluster 68. Type = fatty_acid. Location: 2772287 - 2865523 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056568 - Cluster 69 - Cyclopeptide

Gene cluster description

NC_056568 - Gene Cluster 69. Type = cyclopeptide. Location: 14588908 - 16737027 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_056568 - Cluster 70 - Cyclopeptide

Gene cluster description

NC_056568 - Gene Cluster 70. Type = cyclopeptide. Location: 20051827 - 20830064 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_056568 - Cluster 71 - Cyclopeptide

Gene cluster description

NC_056568 - Gene Cluster 71. Type = cyclopeptide. Location: 22947148 - 23400988 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_056569 - Cluster 72 - Cyclopeptide

Gene cluster description

NC_056569 - Gene Cluster 72. Type = cyclopeptide. Location: 201493 - 1644212 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122059846
Repeat occurs 3 times in a sequence of 525 amino acids
Location between 1064732 and 1066977
Coverage of 8.57 %
Instances:
LLDNTKAELAKKNDE | LLDNTKAELVKKNDE | LLDGTKVELAKKDDE |
pattern: LLD[GN]TK[AV]EL[AV]KK[DN]DE
MFEAKEMSDDEGEFFECQYDSSFLERVVPLGPFGFPNLLLLKPQVVNEFQNLLDNTKAELAKK
NDE
LNELRSLLDNTKAELVKKNDELNELRSLLDGTKVELAKKDDEMNELKKQKIDEDWNPEQFR
VSIVAITNSMKRVFVDFGLEPPESGDPKSVLKFLEMLHDLIIYKEFNSTFLVAANGILASIASI
FEKRSGLTIGMEIAISNPARLIENLFTLRGFIDSHFFIDGSVIVEDFHLMNKDDLPIIFRDSAV
DVPELKQKDIGSKSIDNIDRNFHFYGYFNIPMVGMFKGVLTRVHVNLPDEIMVKLQVGKHTLND
KRFWFVNANGASIFQKELLSPKDIKFTPSRILGDRTDFVGFFRRMNRIFVNGKGTYLKFDFIAG
KYSGSGAVFDDTVRYYIKEGVIFYPFKMVQFDVNTPATNIVQREVYLPLSTSKDMGQNDATYIV
DVLGIDCAFTFNDFCFVGKITKATGTFRENALVSVFFEQNEHGENCYVDALATTFFILKTGKVT
SHVYRDKLKDPKFV
Repeat found in LOC122059846
Repeat occurs 3 times in a sequence of 519 amino acids
Location between 1065417 and 1066977
Coverage of 8.67 %
Instances:
LLDNTKAELAKKNDE | LLDNTKAELVKKNDE | LLDGTKVELAKKDDE |
pattern: LLD[GN]TK[AV]EL[AV]KK[DN]DE
MSDDEGEFFECQYDSSFLERVVPLGPFGFPNLLLLKPQVVNEFQNLLDNTKAELAKKNDELNE
LRSLLDNTKAELVKKNDELNELRSLLDGTKVELAKKDDEMNELKKQKIDEDWNPEQFRVSIVAI
TNSMKRVFVDFGLEPPESGDPKSVLKFLEMLHDLIIYKEFNSTFLVAANGILASIASIFEKRSG
LTIGMEIAISNPARLIENLFTLRGFIDSHFFIDGSVIVEDFHLMNKDDLPIIFRDSAVDVPELK
QKDIGSKSIDNIDRNFHFYGYFNIPMVGMFKGVLTRVHVNLPDEIMVKLQVGKHTLNDKRFWFV
NANGASIFQKELLSPKDIKFTPSRILGDRTDFVGFFRRMNRIFVNGKGTYLKFDFIAGKYSGSG
AVFDDTVRYYIKEGVIFYPFKMVQFDVNTPATNIVQREVYLPLSTSKDMGQNDATYIVDVLGID
CAFTFNDFCFVGKITKATGTFRENALVSVFFEQNEHGENCYVDALATTFFILKTGKVTSHVYRD
KLKDPKFV
Repeat found in LOC122059846
Repeat occurs 3 times in a sequence of 519 amino acids
Location between 1065417 and 1066977
Coverage of 8.67 %
Instances:
LLDNTKAELAKKNDE | LLDNTKAELVKKNDE | LLDGTKVELAKKDDE |
pattern: LLD[GN]TK[AV]EL[AV]KK[DN]DE
MSDDEGEFFECQYDSSFLERVVPLGPFGFPNLLLLKPQVVNEFQNLLDNTKAELAKKNDELNE
LRSLLDNTKAELVKKNDELNELRSLLDGTKVELAKKDDEMNELKKQKIDEDWNPEQFRVSIVAI
TNSMKRVFVDFGLEPPESGDPKSVLKFLEMLHDLIIYKEFNSTFLVAANGILASIASIFEKRSG
LTIGMEIAISNPARLIENLFTLRGFIDSHFFIDGSVIVEDFHLMNKDDLPIIFRDSAVDVPELK
QKDIGSKSIDNIDRNFHFYGYFNIPMVGMFKGVLTRVHVNLPDEIMVKLQVGKHTLNDKRFWFV
NANGASIFQKELLSPKDIKFTPSRILGDRTDFVGFFRRMNRIFVNGKGTYLKFDFIAGKYSGSG
AVFDDTVRYYIKEGVIFYPFKMVQFDVNTPATNIVQREVYLPLSTSKDMGQNDATYIVDVLGID
CAFTFNDFCFVGKITKATGTFRENALVSVFFEQNEHGENCYVDALATTFFILKTGKVTSHVYRD
KLKDPKFV
Repeat found in LOC122060157
Repeat occurs 4 times in a sequence of 591 amino acids
Location between 1112362 and 1114138
Coverage of 4.06 %
Instances:
ELAKKN | ELAKKN | ELAKKN | ELAKQK |
pattern: ELAK[KQ][KN]
MSWSSSDNFYVNARLEVFSVDLLSEPLELTRSTGVVYDGPKGDDFLRFLRSKNFRVNRQGKAI
VYVQKRNGDGEFLECLYDTFILETVVPLGPFGFQNVPLPKPQEVIELQSLLDSTKSELAKKNDV
IKELESNLGNNKAELAKKNYALNELQSLLDNTKAELAKKNDALNELAKQKIDQDWTQEQFRLTM
VAITNSMKRVFIGFGLEPPASGGPESVLRFLEMLCDLILHKECGATFLVAANEILASMANIFEK
RTGLSIGMETTNSTPMTPACLIKSLSTLRVFMVDAHFFIDGNVIVRDLHLMNEEDLPVIFRDSK
ADVAELKQNKEMGTKSIDNINKKSIFYGYFNIPTIGIFKGVLTRVHVNLPDEIMVKLRAGGKHT
LNDNRFWFVYANGAGIFESTDNLEIVKFAPSRVLGDRTNFLEFFLRMNRTFVNEKGTYLKFDFI
ASNPSILRDWPDNTVRYYDKDFIMHNPFKLVQFDVNECPTDTLQQEVSFPGASKATSGKYATWI
VNEALGNNCAFQFKNFCFIGKITKEASGIFRKNALVSVFFEQNQHGENCYVDALATTFFVLKTE
QLNSWINDRKKSVEFI

Similar gene clusters

NC_056569 - Cluster 73 - Cyclopeptide

Gene cluster description

NC_056569 - Gene Cluster 73. Type = cyclopeptide. Location: 1419414 - 2052731 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_056569 - Cluster 74 - Saccharide

Gene cluster description

NC_056569 - Gene Cluster 74. Type = saccharide. Location: 2212726 - 2260975 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056569 - Cluster 75 - Cyclopeptide

Gene cluster description

NC_056569 - Gene Cluster 75. Type = cyclopeptide. Location: 5946190 - 6557160 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122059669
Repeat occurs 3 times in a sequence of 230 amino acids
Location between 6469517 and 6470210
Coverage of 7.83 %
Instances:
VIGELT | VIGFVI | VIGYII |
pattern: VIG[YEF][ILV][IT]
MMFKKGNNSKPQYIGDFYYRDSVTIMNKGLEMELVRILTIYTALDLSNNRFQGEIPKVIGELT
SLVVLNMSQNDLVGQIPSSMGNMAELESLDLSQNKLSGQIPRQLANLTFLAFLNLSQNHLEGPI
PRGNQLETFSNTSYNANPGLCGFPLSRKCRATSDDMPPQGPQGERSTNDSKFDWKFMLMGYGCG
MVIGFVIGYIIFKDEWIIRIFRLAPIGRRRKFKGGRLRG
Repeat found in LOC122059873
Repeat occurs 12 times in a sequence of 1285 amino acids
Location between 6486100 and 6494033
Coverage of 5.6 %
Instances:
LDLNNS | LDLSYF | LDLWSN | LDLSNN | LDLSNN
LDLRNN | LDLGKN | LDLSSN | LDLSNN | LDLSQN
LDLSGY | LDLSTF |
pattern: LDL[SGNWR][SNQGKTY][SYNF]
MGIYLPFARLLLFLSFFYFFFLLLLLLPQIQAAQDSHICLHDQKSALLQFRHEFIITDNISFS
SWEPQTDCCSWDGVTCNVTTGYVIGLDLNNSIGFSSDMISYKVSAPIYSNSSLFGLHHLQRLNL
AYNDFNLAPIPSGFDRLPSLTHLNLSYSNFSGQIPWEFSRLTRLVSLDLSYFSDYAKLERPNLK
ELIQNLTSLVELYLDGVDLSSPSSSPTQGSSDHWSQVLPQSLPNLRALSLSYCSLSGPIHPSFS
QLVFLEELRLDGNYNLSLEAPNFLPLNFSHLKTLSLSYCGLYGEFPNSVFQLPNLRILDVSGNS
LLNGQLPDFPLNNSFQFISLYQTNFSGALPDSIGNLRFLTTLRLGNCNFSGSIPPSLTNLTQLT
ELDLWSNSFTGSVPMFNKDNVPNLAFLELSDNLLDGTIHSSLFTLPSLYRLFLSNIQLTGRLDQ
LFHNSSLSSSSSLVEMDLSGNKLNGQIPKSISQFSSLQSLYLSSNDFSGTVKLEDMFGNLENLS
SLHLSNNNLLSLEFGNTSSSFRPQFYELGLGSCHIKEFPNFLRTQEQLEYLDLSNNQIQGQVPN
WLWKGALGDVDLSYNGIVDLEPAFADHSNNDTSNNTTTPSTLYFYSLKLSSCNLRVFPEFLKNL
NVENLDLSNNTISGQVPNWIWKEGLSYLNLSYNSLNTLELPLLLNLSSSPLSTLDLRNNKLQGS
LPIPASSITFFSISNNSFVGGIPESICKLSGLQVFDASHNNLSGSIPECLGNIDTLSVLDVQGN
RFHGMPPNFTNASSLRTLKMNGNILEGQVPRSLANCTSLEVLDLGKNKMFDTFPFLLGKLPMLR
VLVLHSNRFYGPIKHPRTLDEFPMLQILDLSSNTFTGNFPIEYFGSLKAMMFKKGNKSEPQYIG
ESYYRDSVTIMNKGLEMELVRILTIYTALDLSNNRFQGEIPKVIGELTSLVVLNMSQNDLVGQI
PSSIGNMAELESLDLSQNKLSGQIPRQLANLTFLAFLNLSQNHLEGPIPRGNQLETFSNTSYNA
NPGLCGFPLSRKCRATSDDMPPQGPQGERSTNDSKFDWKFMLMGYGCGMQDSFINSTNSWEPEK
DCCSCGGVTCNFTTGYVIDLDLSGYVIGFHYSSYYVIDSKVSGPIYSDSSLFSLHHLQRLNLAF
NDFKLTPFPSGFDQLPILTHLNLSHSNFPGQIPWEFSRLTRLVSLDLSTFNFDENYYFNICSRI
LKLEKPNLKALIQNLTSLVELYLNGVDLSSPSPKHGRSSHWSQVLPQSLPNLRALSLSGCRLSG
PNHTSF

Similar gene clusters

NC_056569 - Cluster 76 - Cyclopeptide

Gene cluster description

NC_056569 - Gene Cluster 76. Type = cyclopeptide. Location: 20866690 - 21483577 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_056569 - Cluster 77 - Cyclopeptide

Gene cluster description

NC_056569 - Gene Cluster 77. Type = cyclopeptide. Location: 24314376 - 25027571 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122060031
Repeat occurs 8 times in a sequence of 1451 amino acids
Location between 24425641 and 24440261
Coverage of 4.96 %
Instances:
IQSYLRGSI | IQSHWRGWS | IQSHIRGWV | IQSHIRQWI | IQSHWRSWL
IQSPELKAL | IQSHVRKWI | IQSFWKGYL |
pattern: IQS[PYHF][EVIWL][KLR][KSGQ][ASYW][ISLV]
MEGEQPSPSPSPHPSTLLRDISNFRTPKHSSQNPTLNSPFPPFFTASKQTPSSSSSAFRRRTS
IAPSSKSKAARKLKAFELEQYQSSRKAQIRKEKSLKYLAKSLTAWLNFLFENPKYCGCDLSALT
GAARLENVGTVFANGKKESWPGVDVGIDGAWRTPKRQKDKSWKPAQFADTKPLLSLSAFSSLEV
SMREVCSFEDLTKRMRAYMSLGSCKEVLSAMIQVAKNIDEGRFKMKAHCPIVTDVGMKKKATRV
LMTYNPIWLRIGLYIIFGGDSLLPTVDVNSDQEVLFLKMVVEKQFFSHVGLAKAYAYNKLVEGL
YRPGYFEALGNIILKRFLFLVLILDRAKSQSSLPINYGIDGTDGGSPLLFISQSDIKSSRQVIH
DFLSSEVMHGEGNVLAHLVIIGYKVHYQQCSLFEYDFSVKELFEDLQDGIRLCRAIQLLQHDAS
ILTKMAVPSDTRKRDLANCQLAMQYIKHAAVPLYDEDGVMILAEDVANGDKELTLSLLWNIFVH
LQLPLLINQTLLLEEISKVKGANVDHSKGKPSNLMEMLLEWVQVICHKYDVKVESFASLVDGKA
MWCLMDHYFHKELHCSHFRKGVQDANYEDSILLSADNTDAIHNFILAQKITSILGNFPEVVQMS
DILEHNGPCNEHSVIVLLVFFSSQLIGRKSMDMLNLHKLLGCNCQSPHKKNLSLEKCFLNSEKL
VKENGSDEHRDAVKTVRAIHAWWEEMAKRNNSRVHKSSASNVQSHVASNQLSDIQRERAAKIIQ
SYLRGSI
ERHKFLKIKSAASLLQTVILAWLMVRQMGTKKASSSITERQSYGWQEHSHTFQKYFK
FLVDRHSFIRLRNSALIIQQAARAWIKQIHQSESILDVESKLTDGKLIFKDLQIDAEDNIQVAW
RELTMHTSFSNKCIAAIKIQSHWRGWSMRKNFLHQRRAAIKIQTMIRYLKCQMVLQQCTLRSAI
IIQSHIRGWVARREANRVRHQIVVIQGHSRSWLVRKDFLNQKNAIIKIQNGLRCVKCWKAFQSY
RHAAIEIQRFIRGQVDRRRLVVLKLQRWWKSILWMKSRTRAVTVIQSHIRQWIAKQAARRERHR
VVVIQSHWRSWLVRKDFLNQKNAIIKIQNCLRCVKCWKNFQSYRHAAIEIQRIVRGRVNRRRLV
GASCLQSVMRYCCMDETSRDFIQSPELKALLYSVLKLQRWWKSVLWMKTRTRAAIIIQSHVRKW
I
AKQVSIRERNRVVVIQSFWKGYLARKESKDRLQDLRIRVQKSAANVDDSMRLINRLVAALSEL
LNIRSVTSILHICSTLDVATTHSQNCCEKLVSAGAIDTLLKLISSVSRSIPDQEVLKYALSALR
NLARYSHLAEVLINTHGSVETFLWELLRNKEEGFFIASELLKKLCSTQKGYDAVHRRPLLLKRL
YSLAEDLKRKARNDKRNGRPLALRDSTERRLREVEELLKLITTG
Repeat found in LOC122060031
Repeat occurs 7 times in a sequence of 1344 amino acids
Location between 24425641 and 24440261
Coverage of 4.69 %
Instances:
IQSYLRGSI | IQSHWRGWS | IQSHIRGWV | IQSHWRSWL | IQSPELKAL
IQSHVRKWI | IQSFWKGYL |
pattern: IQS[PYHF][EVIWL][KLR][KSG][ASYW][ISLV]
MEGEQPSPSPSPHPSTLLRDISNFRTPKHSSQNPTLNSPFPPFFTASKQTPSSSSSAFRRRTS
IAPSSKSKAARKLKAFELEQYQSSRKAQIRKEKSLKYLAKSLTAWLNFLFENPKYCGCDLSALT
GAARLENVGTVFANGKKESWPGVDVGIDGAWRTPKRQKDKSWKPAQFADTKPLLSLSAFSSLEV
SMREVCSFEDLTKRMRAYMSLGSCKEVLSAMIQVAKNIDEGRFKMKAHCPIVTDVGMKKKATRV
LMTYNPIWLRIGLYIIFGGDSLLPTVDVNSDQEVLFLKMVVEKQFFSHVGLAKAYAYNKLVEGL
YRPGYFEALGNIILKRFLFLVLILDRAKSQSSLPINYGIDGTDGGSPLLFISQSDIKSSRQVIH
DFLSSEVMHGEGNVLAHLVIIGYKVHYQQCSLFEYDFSVKELFEDLQDGIRLCRAIQLLQHDAS
ILTKMAVPSDTRKRDLANCQLAMQYIKHAAVPLYDEDGVMILAEDVANGDKELTLSLLWNIFVH
LQLPLLINQTLLLEEISKVKGANVDHSKGKPSNLMEMLLEWVQVICHKYDVKVESFASLVDGKA
MWCLMDHYFHKELHCSHFRKGVQDANYEDSILLSADNTDAIHNFILAQKITSILGNFPEVVQMS
DILEHNGPCNEHSVIVLLVFFSSQLIGRKSMDMLNLHKLLGCNCQSPHKKNLSLEKCFLNSEKL
VKENGSDEHRDAVKTVRAIHAWWEEMAKRNNSRVHKSSASNVQSHVASNQLSDIQRERAAKIIQ
SYLRGSI
ERHKFLKIKSAASLLQTVILAWLMVRQMGTKKASSSITERQSYGWQEHSHTFQKYFK
FLVDRHSFIRLRNSALIIQQAARAWIKQIHQSESILDVESKLTDGKLIFKDLQIDAEDNIQVAW
RELTMHTSFSNKCIAAIKIQSHWRGWSMRKNFLHQRRAAIKIQTMIRYLKCQMVLQQCTLRSAI
IIQSHIRGWVARREANRVRHQIVVIQSHWRSWLVRKDFLNQKNAIIKIQNCLRCVKCWKNFQSY
RHAAIEIQRIVRGRVNRRRLVGASCLQSVMRYCCMDETSRDFIQSPELKALLYSVLKLQRWWKS
VLWMKTRTRAAIIIQSHVRKWIAKQVSIRERNRVVVIQSFWKGYLARKESKDRLQDLRIRVQKS
AANVDDSMRLINRLVAALSELLNIRSVTSILHICSTLDVATTHSQNCCEKLVSAGAIDTLLKLI
SSVSRSIPDQEVLKYALSALRNLARYSHLAEVLINTHGSVETFLWELLRNKEEGFFIASELLKK
LCSTQKGYDAVHRRPLLLKRLYSLAEDLKRKARNDKRNGRPLALRDSTERRLREVEELLKLITT
G
Repeat found in LOC122059325
Repeat occurs 13 times in a sequence of 281 amino acids
Location between 24951097 and 24952161
Coverage of 55.52 %
Instances:
DDDEFAGELDDD | DDDDDDDDSEDE | DDDDDDDSEDED | DDDDDDSEDEDD | DDDDDSEDEDDD
DDDDSEDEDDDD | DDDSEDEDDDDD | DDDDDYSDDDDD | DDDDYSDDDDDE | DDDYSDDDDDES
DDDDDESGENQV | DDDDESGENQVV | DDDESGENQVVP |
pattern: DDD[DSEY][SEDYF][SDEGAY][DSEG][SEGND][SENQDL][ENQVD][DEQV][SEVPD]
MKECELCNYPARMYCDSDQASLCWDCDAKVHCANFLVARHSRSLLCHICQSPTPWKASGSKLG
PTVSVCQRCANNCNGSDERSEIDEEREGANEDDDEFAGELDDDDDDDDSEDEDDDDDYSDDDDD
ESGENQV
VPWSSTPPPPVASSSSSDDESSSNGRKRGIAESTMPFSLKRTRENADLGCHDDLGCS
SSQPNNDTAFAATAARRGSKDEEATSYGSFRPSKCLKRTLANSSVPPAAAVLSSLQRFTSGEDS
TATILEIRKLSKDPGAVDLDHHNRRI

Similar gene clusters

NC_056570 - Cluster 78 - Lignan-fatty_acid

Gene cluster description

NC_056570 - Gene Cluster 78. Type = lignan-fatty_acid. Location: 5617287 - 5694283 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056570 - Cluster 79 - Terpene

Gene cluster description

NC_056570 - Gene Cluster 79. Type = terpene. Location: 11065795 - 11331195 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

No significant ClusterBlast hits found.

NC_056570 - Cluster 80 - Putative

Gene cluster description

NC_056570 - Gene Cluster 80. Type = putative. Location: 21449781 - 21833882 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056570 - Cluster 81 - Saccharide-alkaloid

Gene cluster description

NC_056570 - Gene Cluster 81. Type = saccharide-alkaloid. Location: 22044508 - 22275765 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056570 - Cluster 82 - Saccharide

Gene cluster description

NC_056570 - Gene Cluster 82. Type = saccharide. Location: 25667245 - 25782663 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_056570 - Cluster 83 - Cyclopeptide

Gene cluster description

NC_056570 - Gene Cluster 83. Type = cyclopeptide. Location: 28706304 - 29428941 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_056570 - Cluster 84 - Putative

Gene cluster description

NC_056570 - Gene Cluster 84. Type = putative. Location: 29143126 - 29280706 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_024868115 - Cluster 85 - Phenolamide

Gene cluster description

NW_024868115 - Gene Cluster 85. Type = phenolamide. Location: 1956 - 88597 nt. Click on genes for more information.
Show pHMM detection rules used
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_024868124 - Cluster 86 - Cyclopeptide

Gene cluster description

NW_024868124 - Gene Cluster 86. Type = cyclopeptide. Location: 1 - 189126 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


The following known motifs were found in CDS LOC122063160
Location between 18047 and 18683
VS[AI]Y was found 2 times in this sequence
Sequence:
MADQTCQCNLKGPCQCSTHQHNRSRLSLWTFLGVFIFLAGVTALTVWLVYRPHKPHFAVSSVS
IY
SLNMSTPSFISTNMQFTIITHNPNSRVSIYYDHFSAYVSYHNQAITTEVELQPLFQDPHSTV
VMSPIFGGGDVPVSSDTTNGLVVDENYGVVALRLVLLGELRWKAGVFWSRRYGIYVKCEMLVGF
KEDCAGQVPLLGNPECRVDV

Similar gene clusters

NW_024868169 - Cluster 87 - Cyclopeptide

Gene cluster description

NW_024868169 - Gene Cluster 87. Type = cyclopeptide. Location: 1 - 173245 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NW_024868210 - Cluster 88 - Cyclopeptide

Gene cluster description

NW_024868210 - Gene Cluster 88. Type = cyclopeptide. Location: 1 - 160178 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NW_024868316 - Cluster 89 - Transporter_associated

Gene cluster description

NW_024868316 - Gene Cluster 89. Type = transporter_associated. Location: 78681 - 324179 nt. Click on genes for more information.
Show pHMM detection rules used
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_024868374 - Cluster 90 - Cyclopeptide

Gene cluster description

NW_024868374 - Gene Cluster 90. Type = cyclopeptide. Location: 1 - 118553 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


The following known motifs were found in CDS LOC122064947
Location between 98517 and 116268
.L.Y..Y was found 2 times in this sequence
Sequence:
MVDRVVDKKICTEQSFSPLASFSLSPPSRSRLSPPATVGWGSPPATVGWEREHEDFHHLEPLL
NDSSKRYNSVDVQGSSVGGESITSYANANLLSIITFSWMGPLVALGYKKTLDLEDVPQLSSCDS
ANVVFSSIRNKIESDGNNGGQISSTKLVKALILLTWKEILWTGLLSIVCTLASYVGPYFIESFF
HYLKSSHQLKYKGYVLVFIFFLSKLIRCFSERHLFFQLRKMGNRVRTALFAIIYKKNLVLSSQA
KQGHTSGENVNLMTVDVEKTGFFSWYLHDIWRVPIQFVLALLNLYKRLGLASLVAFAATMILML
ANVPLIKLKEKIQGELMDSKDRRMKVTSEALRNMRILKLQGWEMKFLSKITERRKFETKWLKKL
LYTSAMASFVYSSTPLFVSMATFGFCELMGIQLNSGKILSALATFEMLQEPIHNLPDTISMILQ
IKVSLNRITSFLCLDDLDQNIVQKLPRDSTEVAIEIIKGNFSWDIHSPNLTLKDLNFQVKHGMK
VVVCGSVGSGKSSLISCILGEVPKISGSIKLKGKKAYVAQLPWIQSGKIVDNILFGKEMDKERY
EMILEACSLKKDLELFAFGDQTIVGERGINLSGGQKQRIQIARALYHDADIYLLDDPFSAVDAH
TGTHLFKECLLGILGSKTVIYVTHQVEFFPFADLILVMKDGKITQAGKYEEILSSGTDFVELVG
AHNKALAKLNSIEHGVALNNLINNEEDDNMLCGEKSNQDDEVRESENYKTEELVEPKGQLVQEE
KREKGGVNPLLYLKYVTTAYKGAFVPLILLAQILFELLQIVSSYWMVLDNPDSEDVRPHVMGCN
PIFIYIALTFGSSLCTFTRSMLIVTAGYKTSTLLFNKMHLCIFRAPLLFFDSTPSGRILNRVSI
DQSAADTSIPHKFEQLLISIIGFLGTIAVMSQVAWQMLIVSIPMTVACIWYQQYYISTARELSR
LTGVCQAPITQHFVESSLGSITIRCFGQEERFMDTNLKLVDGYFRPKFHFSGAIEWLCFRMDML
ASITYAIFFVFLISVPHGVLSPGIMGLAVTYGLSFSMHGIIWDLCSLENNIISIERIMQYTCIP
SEPPLVVEENKPDDEWPSQGKVDIVNLQVRYASHLPLVLRDLTCTFPGGIKIGIVGRTGSGKST
LVQALFHMLEPVAGQIWIDGINIFKIGLHDLRSRLSIIPQDPIMFEGTLRSNLDPLEDYTDEQI
WEVLDKCQLGDEIRKKEGKLDTSVNENGENWSIGQRQLVCLGRVLLKRCKVLVLDEATASMDTA
TEYLIQQTLRQHFSGSTVITIAHRITSILDIDMVLLLDNGLILEYDSPAKLLEIKSSSFAKLVK
EYTGRYNC

Similar gene clusters

NW_024868400 - Cluster 91 - Cyclopeptide

Gene cluster description

NW_024868400 - Gene Cluster 91. Type = cyclopeptide. Location: 1 - 529537 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122065109
Repeat occurs 4 times in a sequence of 98 amino acids
Location between 507038 and >507563
Coverage of 57.14 %
Instances:
RHDRHDRHDRHDHH | RHDRHDRHDHHDHH | RHDRHDHHDHHRLR | RHDHHDHHRLRKDP |
pattern: RHD[HR]HD[HR]H[DR][HLR][HR][DKR][DHL][PHR]
MENHQEANGWKSVPQFGGWDQKSVNATNYSVVFNQARANRKQNKDRPFSAGNEEDLIARHDRH
DRHDRHDHH
DHHRLRKDPVVRKKKTSNYFNCCLWA

Similar gene clusters

NW_024868405 - Cluster 92 - Cyclopeptide

Gene cluster description

NW_024868405 - Gene Cluster 92. Type = cyclopeptide. Location: 12539 - 59852 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

No significant ClusterBlast hits found.

NW_024868422 - Cluster 93 - Cyclopeptide

Gene cluster description

NW_024868422 - Gene Cluster 93. Type = cyclopeptide. Location: 1 - 110796 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

No significant ClusterBlast hits found.

NW_024868658 - Cluster 94 - Cyclopeptide

Gene cluster description

NW_024868658 - Gene Cluster 94. Type = cyclopeptide. Location: 1 - 73811 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NW_024868771 - Cluster 95 - Cyclopeptide

Gene cluster description

NW_024868771 - Gene Cluster 95. Type = cyclopeptide. Location: 30820 - 62812 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

No significant ClusterBlast hits found.

NW_024868871 - Cluster 96 - Cyclopeptide

Gene cluster description

NW_024868871 - Gene Cluster 96. Type = cyclopeptide. Location: 1 - 53649 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NW_024868887 - Cluster 97 - Cyclopeptide

Gene cluster description

NW_024868887 - Gene Cluster 97. Type = cyclopeptide. Location: 1 - 43159 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

No significant ClusterBlast hits found.

NW_024868967 - Cluster 98 - Cyclopeptide

Gene cluster description

NW_024868967 - Gene Cluster 98. Type = cyclopeptide. Location: 1 - 46394 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NW_024868972 - Cluster 99 - Cyclopeptide

Gene cluster description

NW_024868972 - Gene Cluster 99. Type = cyclopeptide. Location: 1 - 45962 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NW_024869088 - Cluster 100 - Cyclopeptide

Gene cluster description

NW_024869088 - Gene Cluster 100. Type = cyclopeptide. Location: 1 - 37316 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NW_024869167 - Cluster 101 - Cyclopeptide

Gene cluster description

NW_024869167 - Gene Cluster 101. Type = cyclopeptide. Location: 1 - 33088 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NW_024869172 - Cluster 102 - Cyclopeptide

Gene cluster description

NW_024869172 - Gene Cluster 102. Type = cyclopeptide. Location: 1 - 32906 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NW_024869429 - Cluster 103 - Cyclopeptide

Gene cluster description

NW_024869429 - Gene Cluster 103. Type = cyclopeptide. Location: 1 - 19506 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NW_024869446 - Cluster 104 - Cyclopeptide

Gene cluster description

NW_024869446 - Gene Cluster 104. Type = cyclopeptide. Location: 1 - 18409 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NW_024869731 - Cluster 105 - Cyclopeptide

Gene cluster description

NW_024869731 - Gene Cluster 105. Type = cyclopeptide. Location: 1 - 401614 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

No significant ClusterBlast hits found.

NW_024869847 - Cluster 106 - Cyclopeptide

Gene cluster description

NW_024869847 - Gene Cluster 106. Type = cyclopeptide. Location: 1 - 385394 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC122068388
Repeat occurs 3 times in a sequence of 276 amino acids
Location between 97591 and 98621
Coverage of 6.52 %
Instances:
QSPAPA | QSPSPS | QSPASA |
pattern: QSP[AS][PS][AS]
MLIISSELEKIVHLLQMATFFGSPPFLSHPLTRIHHLTLASSQSPAPASPPPSSQPQSPSPSP
SLRTESLPEQQSPASAKVEQQKPIKPVTTTTKVDSTDWIASNLTRRFGLGAGVAWVGFLAIGVI
SEQIKTRREVSQQEANTRDVEKEEEVVLPNGIRYSEMRVGGGASPRPGDLVVIDLKGAVEGTGE
VFVDTFGRDKKPLALVMGSRPYTRGMCEGIEYALRSMKAGGKRRVVIPPSLGFGENGADLGGLQ
VPPSATVEYIVEVEKVSIAPA

Similar gene clusters

NW_024870494 - Cluster 107 - Lignan

Gene cluster description

NW_024870494 - Gene Cluster 107. Type = lignan. Location: 288726 - 350733 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_024870495 - Cluster 108 - Cyclopeptide

Gene cluster description

NW_024870495 - Gene Cluster 108. Type = cyclopeptide. Location: 1 - 317436 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NW_024870497 - Cluster 109 - Cyclopeptide

Gene cluster description

NW_024870497 - Gene Cluster 109. Type = cyclopeptide. Location: 1 - 315566 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


The following known motifs were found in CDS LOC122069426
Location between 280583 and 288451
.L.Y..Y was found 2 times in this sequence
Sequence:
MIEFLHYFSQMGTSEDLNNDMNIWIFFSQSSATIFFHGFSACSHIILLLVLSIFWACKRCMVP
TLDNSKPMLKNTYFLYYWLTIISCMGLSFFGLLLCILISLSLYRHGWSDLKLVTLLDFAFKTLT
WFGISVYLHIQFSHSSESRFPILLRIWWVFYFLMSCFHVVVHWKNSSLPILLWVSNVVDAIVGL
FFCYAGLFGKGREDAYFQLLEPLFKISNSQSDGDDREGLSVGGESVTPYANASFFSIFTFHWMN
PLLALGYAKILDLEDVPQVSNCDSANVVFACFRNKLESDNNGSSNDGQVSTHKLVNALIFSTWK
EILWTGLLSIVCTLASYVGPYLIVDFIEYLNSSHQLKYNGYVLVFMFFLSMLIKCLLERHLFFI
LRKMVIRVRAALFAIIYKKSLRLSNQSKQGHTCGENTNLLSVDVERIGFFSWYLHDIWRVPMQV
ALGLWILYIRLGLASFVAFVATLILILANVPLGKLQENFQGELMDTKDRRMKVTSEALRNMRIL
KLQGWEMRFLTRITEFRNFEARWLKKLLYASAMASFVYLIAPMFVSTVTFGCCVVMRIPLDSGN
ILSALATFEMLQKPIYNFPDTISMVVQTKVSLDRIASFLCLDDLPPNIIQNFPRDSVKIAIEII
KGNFSWDIHSPNLTLKDLNFQVYRGMRVAVCGFVGSGKSSLLSCILGEVPKLSGAIKLNGTKAY
VAQSPWIQSGTIVDNILFGKEMDKDRYEMILEACSLKKDLELYILGDQTVIGDRGVNLSGGQKQ
RIQIARALYHDADIYLLDDPFSAVDAQTGTHLFKECLLGILDSKTVIYVTHQVEFLHTADLILV
MRDGRITQAGKYEEILSSGTDFIELVGAHKKALEDLISIEHKATLDNSINDEEDGYSLYSDKSI
QDKEESKLKNYKTEKLVGLEGQLVQEEMREKGGVSLLVYWKYVTAAYKGALVPLILLAQILFQL
LIIVSSYWMVLTTPISKDVRPHVRASTLIFVYVALTIGSSLCVLIRSMLILTSGYKTTMLLFNK
MHLCIFRAPMSFIDSTPSGRILNRASIDQSAVDTSIPRQFEELLISIIEFLGIIAVMSQVAWKM
LIVFIPMIVTCIWFQQYYISAARELSRLGRVCQTPIIHHFSESSLGSTTIRCFDQEDRFMDTNL
KLIDGYSRPKFHFSGAMEWLCFRMDFLASITYAIFLVFLILIPKGVLSPGVIGLAVTYGLGFSM
QGVVWDLSNLENKIISIERILQYVCIPCEPPLVIEENMPDHEWPSQGKVDIIDLQVRYAQHLPL
VLRGITCSFPGGMKIGIVGRTGSGKSTLVQTLFRMFEPTAGQIWIDGINISNIGVHDLRSRLSI
IPQDPTMFEGTLRSNLDPLEKYTDEQIWEALDRCQLGDEVRKKEGKLDSIVAENGENWSMGQRQ
LVCLGRVLLKRSKVLVLDEATASVDTATDYLIQQTLRQHFSGSTIITIAHRITSILDNDMVLLL
DNAGLILEYDSPTKLLETKSSSFAKLVKEYTRRCTS
The following known motifs were found in CDS LOC122069426
Location between 280583 and 288451
.L.Y..Y was found 2 times in this sequence
Sequence:
MIEFLHYFSQMGTSEDLNNDMNIWIFFSQSSATIFFHGFSACSHIILLLVLSIFWACKRCMVP
TLDNSKPMLKNTYFLYYWLTIISCMGLSFFGLLLCILISLSLYRHGWSDLKLVTLLDFAFKTLT
WFGISVYLHIQFSHSSESRFPILLRIWWVFYFLMSCFHVVVHWKNSSLPILLWVSNVVDAIVGL
FFCYAGLFGKGREDAYFQLLEPLFKISNSQSDGDDREGLSVGGESVTPYANASFFSIFTFHWMN
PLLALGYAKILDLEDVPQVSNCDSANVVFACFRNKLESDNNGSSNDGQVSTHKLVNALIFSTWK
EILWTGLLSIVCTLASYVGPYLIVDFIEYLNSSHQLKYNGYVLVFMFFLSMLIKCLLERHLFFI
LRKMVIRVRAALFAIIYKKSLRLSNQSKQGHTCGENTNLLSVDVERIGFFSWYLHDIWRVPMQV
ALGLWILYIRLGLASFVAFVATLILILANVPLGKLQENFQGELMDTKDRRMKVTSEALRNMRIL
KLQGWEMRFLTRITEFRNFEARWLKKLLYASAMASFVYLIAPMFVSTVTFGCCVVMRIPLDSGN
ILSALATFEMLQKPIYNFPDTISMVVQTKVSLDRIASFLCLDDLPPNIIQNFPRDSVKIAIEII
KGNFSWDIHSPNLTLKDLNFQVYRGMRVAVCGFVGSGKSSLLSCILGEVPKLSGAIKLNGTKAY
VAQSPWIQSGTIVDNILFGKEMDKDRYEMILEACSLKKDLELYILGDQTVIGDRGVNLSGGQKQ
RIQIARALYHDADIYLLDDPFSAVDAQTGTHLFKECLLGILDSKTVIYVTHQVEFLHTADLILV
MRDGRITQAGKYEEILSSGTDFIELVGAHKKALEDLISIEHKATLDNSINDEEDGYSLYSDKSI
QDKEESKLKNYKTEKLVGLEGQLVQEEMREKGGVSLLVYWKYVTAAYKGALVPLILLAQILFQL
LIIVSSYWMVLTTPISKDVRPHVRASTLIFVYVALTIGSSLCVLIRSMLILTSGYKTTMLLFNK
MHLCIFRAPMSFIDSTPSGRILNRASIDQSAVDTSIPRQFEELLISIIEFLGIIAVMSQVAWKM
LIVFIPMIVTCIWFQQYYISAARELSRLGRVCQTPIIHHFSESSLGSTTIRCFDQEDRFMDTNL
KLIDGYSRPKFHFSGAMEWLCFRMDFLASITYAIFLVFLILIPKGVLSPGVIGLAVTYGLGFSM
QGVVWDLSNLENKIISIERILQYVCIPCEPPLVIEENMPDHEWPSQGKVDIIDLQVRYAQHLPL
VLRGITCSFPGGMKIGIVGRTGSGKSTLVQTLFRMFEPTAGQIWIDGINISNIGVHDLRSRLSI
IPQDPTMFEGTLRSNLDPLEKYTDEQIWEALDRCQLGDEVRKKEGKLDSIVAENGENWSMGQRQ
LVCLGRVLLKRSKVLVLDEATASVDTATDYLIQQTLRQHFSGSTIITIAHRITSILDNDMVLLL
DNGLILEYDSPTKLLETKSSSFAKLVKEYTRRCTS
The following known motifs were found in CDS LOC122069426
Location between 280583 and 286794
.L.Y..Y was found 2 times in this sequence
Sequence:
MIEFLHYFSQMGTSEDLNNDMNIWIFFSQSSATIFFHGFSACSHIILLLVLSIFWACKRCMVP
TLDNSKPMLKNTYFLYYWLTIISCMGLSFFGLLLCILISLSLYRHGWSDLKLVTLLDFAFKTLT
WFGISVYLHIQFSHSSESRFPILLRIWWVFYFLMSCFHVVVHWKNSSLPILLWVSNVVDAIVGL
FFCYAGLFGKGREDAYFQLLEPLFKISNSQSDGDDREGLSVGGESVTPYANASFFSIFTFHWMN
PLLALGYAKILDLEDVPQVSNCDSANVVFACFRNKLESDNNGSSNDGQVSTHKLVNALIFSTWK
EILWTGLLSIVCTLASYVGPYLIVDFIEYLNSSHQLKYNGYVLVFMFFLSMLIKCLLERHLFFI
LRKMVIRVRAALFAIIYKKSLRLSNQSKQGHTCGENTNLLSVDVERIGFFSWYLHDIWRVPMQV
ALGLWILYIRLGLASFVAFVATLILILANVPLGKLQENFQGELMDTKDRRMKVTSEALRNMRIL
KLQGWEMRFLTRITEFRNFEARWLKKLLYASAMASFVYLIAPMFVSTVTFGCCVVMRIPLDSGN
ILSALATFEMLQKPIYNFPDTISMVVQTKVSLDRIASFLCLDDLPPNIIQNFPRDSVKIAIEII
KGNFSWDIHSPNLTLKDLNFQVYRGMRVAVCGFVGSGKSSLLSCILGEVPKLSGAIKLNGTKAY
VAQSPWIQSGTIVDNILFGKEMDKDRYEMILEACSLKKDLELYILGDQTVIGDRGVNLSGGQKQ
RIQIARALYHDADIYLLDDPFSAVDAQTGTHLFKECLLGILDSKTVIYVTHQVEFLHTADLILV
MRDGRITQAGKYEEILSSGTDFIELVGAHKKALEDLISIEHKATLDNSINDEEDGYSLYSDKSI
QDKEESKLKNYKTEKLVGLEGQLVQEEMREKGGVSLLVYWKYVTAAYKGALVPLILLAQILFQL
LIIVSSYWMVLTTPISKDVRPHVRASTLIFVYVALTIGSSLCVLIRSMLILTSGYKTTMLLFNK
MHLCIFRAPMSFIDSTPSGRILNRASIDQSAVDTSIPRQFEELLISIIEFLGIIAVMSQVAWKM
LIVFIPMIVTCIWFQVLLVWQSHMGLVSVCKGLYGILVISRTKSYPLKEYCSTSASLVSLLWL

Similar gene clusters

NW_024870581 - Cluster 110 - Cyclopeptide

Gene cluster description

NW_024870581 - Gene Cluster 110. Type = cyclopeptide. Location: 1 - 240977 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NW_024870586 - Cluster 111 - Cyclopeptide

Gene cluster description

NW_024870586 - Gene Cluster 111. Type = cyclopeptide. Location: 1 - 247233 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NW_024870639 - Cluster 112 - Terpene-alkaloid

Gene cluster description

NW_024870639 - Gene Cluster 112. Type = terpene-alkaloid. Location: 560333 - 704711 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_024870659 - Cluster 113 - Fatty_acid

Gene cluster description

NW_024870659 - Gene Cluster 113. Type = fatty_acid. Location: 81478 - 149426 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_024870669 - Cluster 114 - Cyclopeptide

Gene cluster description

NW_024870669 - Gene Cluster 114. Type = cyclopeptide. Location: 1 - 404052 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NW_024870673 - Cluster 115 - Cyclopeptide

Gene cluster description

NW_024870673 - Gene Cluster 115. Type = cyclopeptide. Location: 1 - 230815 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters