Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_067244.1:
Cluster 1Putative224975255225822818847.562OG-FeII_Oxy, DIOX_N, Dimerisation, Methyltransf_2, p450---
Cluster 2Polyketide253752808253991533238.72Chal_sti_synt_C, Chal_sti_synt_N, Peptidase_S10, polyprenyl_synt---
The following clusters are from record NC_067245.1:
Cluster 3Terpene21096202511217401.60Terpene_synth, Terpene_synth_C, p450---
Cluster 4Saccharide-Alkaloid1797150718324864353.36Bet_v_1, Methyltransf_11, UDPGT_2, adh_short, adh_short_C2, p450---
Cluster 5Saccharide229833202306477381.45Cellulose_synt, NAD_binding_1, UDPGT_2*saccharide--
Cluster 6Lignan-Alkaloid33725362349946471269.29BBE, Dirigent, FAD_binding_4, adh_short_C2---
Cluster 7Terpene3879815139045182247.03Methyltransf_11, Peptidase_S10, Terpene_synth, Terpene_synth_C---
Cluster 8Terpene46980305480452991064.99SQHop_cyclase_C, SQHop_cyclase_N, Transferase, p450triterpene--
Cluster 9Lignan5129269451481939189.25AMP-binding, Amino_oxidase, Dirigent---
Cluster 10Lignan-Saccharide8464570684990173344.47Aminotran_1_2, Dirigent, Glycos_transf_1---
Cluster 11Terpene200915784201237093321.31ADH_N, Aldo_ket_red, Terpene_synth, Terpene_synth_C---
Cluster 12Saccharide-Alkaloid231183795231472462288.67Abhydrolase_3, Pyridoxal_deC, UDPGT_2cyanogenic glucoside, monoterpenoid--
Cluster 13Alkaloid240966709241481170514.46Aldo_ket_red, Lipoxygenase, Peptidase_S10, Pyridoxal_deC---
Cluster 14Cyclopeptide293847334294838102990.77BURP---
Cluster 15Terpene299443095300116401673.31Terpene_synth, Terpene_synth_C, p450-momilactone B (42% of genes show similarity)BGC0000671.3_c1
Cluster 16Terpene325455149326370901915.752OG-FeII_Oxy, 3Beta_HSD, DIOX_N, SQHop_cyclase_C, SQHop_cyclase_N, p450triterpene--
Cluster 17Terpene336458363336689903231.54Terpene_synth, Terpene_synth_C, adh_short, p450---
The following clusters are from record NC_067246.1:
Cluster 18Saccharide-Alkaloid1967614611978314811070.02AMP-binding, Str_synth, UDPGT_2, p450flavonoid, oleananes--
Cluster 19Saccharide206051296206555421504.12UDPGT_2, p450---
Cluster 20Alkaloid230351755230503863152.11Bet_v_1, Epimerase, Methyltransf_11---
Cluster 21Saccharide261997571262249227251.66Glycos_transf_1, UDPGT_2, p450---
The following clusters are from record NC_067247.1:
Cluster 22Saccharide3603256936242455209.89SQS_PSY, UDPGT_2, adh_short*saccharide--
Cluster 23Saccharide5729698458255402958.42Abhydrolase_3, UDPGT_2---
Cluster 24Saccharide8204893982587862538.92Glyco_hydro_1, UDPGT_2*saccharide-6--
Cluster 25Lignan-Terpene119589273120419633830.363Beta_HSD, AMP-binding, Dimerisation, Dirigent, Methyltransf_2, SQHop_cyclase_C, SQHop_cyclase_N, p450triterpene--
Cluster 26Saccharide-Alkaloid169244083169750038505.95AMP-binding, Str_synth, UDPGT_2flavonoid--
Cluster 27Saccharide171132931171553080420.152OG-FeII_Oxy, Aldo_ket_red, DIOX_N, UDPGT_2cytokinin--
The following clusters are from record NC_067248.1:
Cluster 28Saccharide5903861095844505.46Abhydrolase_3, Glycos_transf_1, Peptidase_S10, p450---
Cluster 29Saccharide205899693206192971293.283Beta_HSD, Glycos_transf_2, Transferase---
Cluster 30Terpene-Alkaloid254751026255175568424.54HMGL-like, Str_synth, Terpene_synth, Terpene_synth_C---
Cluster 31Phenolamide259630981260206783575.80Orn_Arg_deC_N, Orn_DAP_Arg_deC, Transferase-hydroxycinnamoylputrescine (100% of genes show similarity)BGC0002622.2_c1
Cluster 32Cyclopeptide2618445182654550483610.53BURP---
Cluster 33Cyclopeptide2629154562655111022595.65BURP---
Cluster 34Cyclopeptide2634243522658487772424.43BURP---
Cluster 35Fatty_acid267891381268111854220.47ADH_N, ADH_zinc_N, AMP-binding, Epimerase, Transferase---
The following clusters are from record NC_067249.1:
Cluster 36Saccharide3489727235469624572.35Amino_oxidase, Methyltransf_11, UDPGT_2cyanogenic glucoside-3, monoterpenoid-3--
Cluster 37Saccharide4805700648593106536.10Glycos_transf_1, Glycos_transf_2, Methyltransf_11---
Cluster 38Saccharide6767144367812054140.613Beta_HSD, UDPGT_2, p450---
Cluster 39Saccharide191320459191523890203.43Aldo_ket_red, UDPGT_2, p450flavonoid-6, oleananes-6--
The following clusters are from record NC_067250.1:
Cluster 40Fatty_acid-Polyketide2016719820959519792.32Chal_sti_synt_C, Chal_sti_synt_N, FA_desaturase, UbiA---
Cluster 41Saccharide3865659638919374262.78Epimerase, Glyco_hydro_1, p450---
Cluster 42Cyclopeptide54628721570935882464.87BURP---
Cluster 43Saccharide7655300177002543449.54Methyltransf_11, UDPGT_2, adh_short---
Cluster 44Saccharide7797323178335041361.81ECH_2, UDPGT_2, adh_short_C2flavonoid-3--
Cluster 45Saccharide-Polyketide9197194892394019422.07Abhydrolase_3, Cellulose_synt, Chal_sti_synt_C, Chal_sti_synt_N-beta-diketones (50% of genes show similarity)BGC0002403.2_c1
Cluster 46Alkaloid1047280691061733621445.29Aminotran_1_2, BBE, FAD_binding_4---
Cluster 47Alkaloid156468058156716456248.40Abhydrolase_3, Cu_amine_oxid, adh_short---
Cluster 48Cyclopeptide1774905761788560291365.45BURP---
Cluster 49Terpene186299622186564179264.56Terpene_synth, Terpene_synth_C, adh_short, p450---
Cluster 50Polyketide189769159189924259155.10Aldo_ket_red, Chal_sti_synt_C, Chal_sti_synt_N---
Cluster 51Saccharide-Terpene191172347192056424884.08SQHop_cyclase_C, SQHop_cyclase_N, Transferase, UDPGT_2, p450flavonoid, triterpene--
Cluster 52Saccharide22254520322261447969.28Abhydrolase_3, UDPGT_2, p450flavonoid--
Cluster 53Putative223800593224135400334.81Transferase, adh_short_C2---
Cluster 54Saccharide-Alkaloid229319276229512076192.80Aminotran_1_2, Pyridoxal_deC, UDPGT_2small phenolic--
Cluster 55Saccharide230140995230612042471.05ADH_N, ADH_zinc_N, Methyltransf_3, UDPGT_2flavonoid, oleananes--
Cluster 56Fatty_acid236250140236354924104.78FA_desaturase_2, Transferase---
Cluster 57Saccharide-Terpene244447655244659362211.712OG-FeII_Oxy, Glycos_transf_1, SQHop_cyclase_C, SQHop_cyclase_Nsterol--
Cluster 58Saccharide251263830251593892330.06Cellulose_synt, p450---
Cluster 59Saccharide254016837254432994416.16Methyltransf_11, SQS_PSY, UDPGT_2flavonoid--
The following clusters are from record NW_026138795.1:
Cluster 60Putative210545464194253.653Beta_HSD, Epimerase, Transferase, p450---
The following clusters are from record NW_026138842.1:
Cluster 61Phenolamide477513773833296.32Pyridoxal_deC, Transferase---
The following clusters are from record NW_026140057.1:
Cluster 62Putative304968813233508.26Abhydrolase_3, adh_short---
The following clusters are from record NW_026140398.1:
Cluster 63Terpene22643692926993662.622OG-FeII_Oxy, DIOX_N, Peptidase_S10, Terpene_synth, Terpene_synth_C---

NC_067244 - Cluster 1 - Putative

Gene cluster description

NC_067244 - Gene Cluster 1. Type = putative. Location: 224975255 - 225822818 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067244 - Cluster 2 - Polyketide

Gene cluster description

NC_067244 - Gene Cluster 2. Type = polyketide. Location: 253752808 - 253991533 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 3 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 3. Type = terpene. Location: 2109620 - 2511217 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 4 - Saccharide-alkaloid

Gene cluster description

NC_067245 - Gene Cluster 4. Type = saccharide-alkaloid. Location: 17971507 - 18324864 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 5 - Saccharide

Gene cluster description

NC_067245 - Gene Cluster 5. Type = saccharide. Location: 22983320 - 23064773 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 6 - Lignan-alkaloid

Gene cluster description

NC_067245 - Gene Cluster 6. Type = lignan-alkaloid. Location: 33725362 - 34994647 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/lignan: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 7 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 7. Type = terpene. Location: 38798151 - 39045182 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 8 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 8. Type = terpene. Location: 46980305 - 48045299 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 9 - Lignan

Gene cluster description

NC_067245 - Gene Cluster 9. Type = lignan. Location: 51292694 - 51481939 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 10 - Lignan-saccharide

Gene cluster description

NC_067245 - Gene Cluster 10. Type = lignan-saccharide. Location: 84645706 - 84990173 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 11 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 11. Type = terpene. Location: 200915784 - 201237093 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 12 - Saccharide-alkaloid

Gene cluster description

NC_067245 - Gene Cluster 12. Type = saccharide-alkaloid. Location: 231183795 - 231472462 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 13 - Alkaloid

Gene cluster description

NC_067245 - Gene Cluster 13. Type = alkaloid. Location: 240966709 - 241481170 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 14 - Cyclopeptide

Gene cluster description

NC_067245 - Gene Cluster 14. Type = cyclopeptide. Location: 293847334 - 294838102 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC127336078
Repeat occurs 9 times in a sequence of 501 amino acids
Location between 294341514 and 294343921
Coverage of 21.56 %
Instances:
SYGSKQEDDPNK | SYGSEQEGDPNK | SYGSEQEDDPHK | SYGSEQEDDPNK | SYGSEQEGDPNK
SYGSKQEDDPHK | SYGSEQEDDPNK | SYGSEQEEGPHK | SYGSEQKEVQKI |
pattern: SYGS[KE]Q[KE][EDG][DGV][QP][HKN][IK]
MQHMVAHDMLVPLIVSLLLIAGGGGWHASFVNAATTAARKLETPMTKAYWETLLPGTSMPPAI
SDLLGQQNVWDKGEEVPSKDIVSHGSEQEDPHKATMSYESEKDNDPHKITISYGSKQEDDPNKI
TMSYGSEQEGDPNKATISYGSEQEDDPHKATMSYGSEQEDDPNKITMSYGSEQEGDPNKATMSY
GSKQEDDPHK
VTMSYGSEQEDDPNKLTMSYGSEQEEGPHKVTMSYGSEQKEVQKIILSQHEARM
NGQGSHHHKAHSHSTKRQQADVFFFHDMLRPGFIITPTIPPTSSLPSLLPRSVAGSIPFSHERL
SDIISMFAPASLSMARKIRWTLDMCEHPRTLPGQSAGCATSLESLAELPPSFLKTRNVRAFSAA
DLPVEARGTRALRGRYNVTAVLKVSGESSEIVTCHDLTYPYAVYYCHTANRTSAYTVTLTSVED
GVVPKTMEALAVCHLDTSKWSPKNPFFELHNLKPGEVTVCHFLTKLSIIWVPGS

Similar gene clusters

NC_067245 - Cluster 15 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 15. Type = terpene. Location: 299443095 - 300116401 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_067245 - Cluster 16 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 16. Type = terpene. Location: 325455149 - 326370901 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 17 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 17. Type = terpene. Location: 336458363 - 336689903 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067246 - Cluster 18 - Saccharide-alkaloid

Gene cluster description

NC_067246 - Gene Cluster 18. Type = saccharide-alkaloid. Location: 196761461 - 197831481 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067246 - Cluster 19 - Saccharide

Gene cluster description

NC_067246 - Gene Cluster 19. Type = saccharide. Location: 206051296 - 206555421 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067246 - Cluster 20 - Alkaloid

Gene cluster description

NC_067246 - Gene Cluster 20. Type = alkaloid. Location: 230351755 - 230503863 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067246 - Cluster 21 - Saccharide

Gene cluster description

NC_067246 - Gene Cluster 21. Type = saccharide. Location: 261997571 - 262249227 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 22 - Saccharide

Gene cluster description

NC_067247 - Gene Cluster 22. Type = saccharide. Location: 36032569 - 36242455 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 23 - Saccharide

Gene cluster description

NC_067247 - Gene Cluster 23. Type = saccharide. Location: 57296984 - 58255402 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 24 - Saccharide

Gene cluster description

NC_067247 - Gene Cluster 24. Type = saccharide. Location: 82048939 - 82587862 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 25 - Lignan-terpene

Gene cluster description

NC_067247 - Gene Cluster 25. Type = lignan-terpene. Location: 119589273 - 120419633 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 26 - Saccharide-alkaloid

Gene cluster description

NC_067247 - Gene Cluster 26. Type = saccharide-alkaloid. Location: 169244083 - 169750038 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 27 - Saccharide

Gene cluster description

NC_067247 - Gene Cluster 27. Type = saccharide. Location: 171132931 - 171553080 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067248 - Cluster 28 - Saccharide

Gene cluster description

NC_067248 - Gene Cluster 28. Type = saccharide. Location: 590386 - 1095844 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067248 - Cluster 29 - Saccharide

Gene cluster description

NC_067248 - Gene Cluster 29. Type = saccharide. Location: 205899693 - 206192971 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067248 - Cluster 30 - Terpene-alkaloid

Gene cluster description

NC_067248 - Gene Cluster 30. Type = terpene-alkaloid. Location: 254751026 - 255175568 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067248 - Cluster 31 - Phenolamide

Gene cluster description

NC_067248 - Gene Cluster 31. Type = phenolamide. Location: 259630981 - 260206783 nt. Click on genes for more information.
Show pHMM detection rules used
plants/phenolamide: (minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_067248 - Cluster 32 - Cyclopeptide

Gene cluster description

NC_067248 - Gene Cluster 32. Type = cyclopeptide. Location: 261844518 - 265455048 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 445 amino acids
Location between 263647703 and 263651862
Coverage of 12.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNP
SPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKN
ANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEG
TMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTS
SRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQ
PKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 11.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 21.46 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKNANPSLSEE | LHWKSANPSSSAS |
pattern: LHW[KG][KSN][AN][PN][PN][PS][SL][PS][ASE][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
ENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSAS
LVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 21.46 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKNANPSLSEE | LHWKSANPSSSAS |
pattern: LHW[KG][KSN][AN][PN][PN][PS][SL][PS][ASE][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
NGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSAS
LVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
W
KNANPSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 17.15 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWKSANPSSSAS

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSE
ENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 16.11 %
Instances:
LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP
LHWKNANPS | LHWKNANPS | LHWKSANPS |
pattern: LHW[KG][KSN][AN][PN][PN][PS]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NP
SPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPS
LSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWKSANPSSSASLV

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE][EL][VN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 17.15 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWKSANPSSSAS

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
NGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 470 amino acids
Location between 263647703 and 263651862
Coverage of 13.4 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
W
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRH
WHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMK
AKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQ
IAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYF
ELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 24.71 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN
LHWKNANPSLSEENG | LHWKSANPSSSASLV |
pattern: LHW[KG][KSN][AN][PN][PN][PS][SL][PS][ASE][SE][ELN][VGN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSANPSSSASLV
PEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 16.14 %
Instances:
LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP
LHWKNANPS | LHWKNANPS | LHWKSANPS |
pattern: LHW[KG][KSN][AN][PN][PN][PS]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NP
SPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPS
LSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 448 amino acids
Location between 263647703 and 263651862
Coverage of 12.5 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
W
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTIL
PEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHV
CTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFY
CHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMF
Y
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 19.4 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKSANPSSSAS |
pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
ENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 403 amino acids
Location between 263647703 and 263651862
Coverage of 22.33 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN
LHWKSANPSSSASLV |
pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE][EL][VN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHI
KVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQ
VAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQ
LSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELL
GGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 19.4 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKSANPSSSAS |
pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
NGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302833
Repeat occurs 9 times in a sequence of 499 amino acids
Location between 262920147 and 262921902
Coverage of 10.82 %
Instances:
PQNFTA | PQNFTA | PQNFTA | PQNFTA | PQNFTA
PQNFTA | PQNFTT | PQNFTA | PQNFTT |
pattern: PQNFT[AT]
MPAGDNPRSISEKKAALRESPKQSKNAAVNQQARAPPFPKDKAAETVGIKRPQPNGPLSATNH
HVPGNPGANGHLVYVRRKVETDQSKGGATSGAESATSLSSKKPGICGPQEQSLKRQSSVPNTQS
APVSASPAASASIPALHSASLPANLSFGKQSPGKVSAQPSVAVTASPPQRNVVSTGMPQNFTAA
NTSSLPQCNIVSTTMPQNFTAANNSSLPQRNVVSTAMPQNFTAVNASSPPQRNGVSTAMPQNFT
A
SNTAGLPQRSVVSTAMPQNFTAANTASPPRRNVVSTAMPQNFTAANTPSPPQRNVLSTAMPQN
FTT
ASTASPPQRNVVSTAMPQNFTAANTASPTHRNVVSAAVPQNFTTVSMAHCNVAATSTASRD
AVVTTTTRSPASLQRSSNQDWKERFIRLQEFLKNNEQSGQEEYIRMLHSLSSVGRSKHAIELEK
RAVNLLIEEGKELQKMKSLNVLGKLPPADHPSVPTQPTFAMRLPFQPFPPRR
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 468 amino acids
Location between 263647703 and 263651862
Coverage of 13.46 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWK
NANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWH
IKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAK
QVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIA
QLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFEL
LGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWKSANPSSSASLV

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE][EL][VN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 16.14 %
Instances:
LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP
LHWKNANPS | LHWKNANPS | LHWKSANPS |
pattern: LHW[KG][KSN][AN][PN][PN][PS]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NP
SPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPS
LSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY

Similar gene clusters

NC_067248 - Cluster 33 - Cyclopeptide

Gene cluster description

NC_067248 - Gene Cluster 33. Type = cyclopeptide. Location: 262915456 - 265511102 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
W
KNANPSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 470 amino acids
Location between 263647703 and 263651862
Coverage of 13.4 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
W
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRH
WHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMK
AKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQ
IAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYF
ELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 21.46 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKNANPSLSEE | LHWKSANPSSSAS |
pattern: LHW[KG][KSN][AN][PN][PN][PS][SL][PS][ASE][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
ENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSAS
LVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWKSANPSSSASLV

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE][EL][VN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 403 amino acids
Location between 263647703 and 263651862
Coverage of 22.33 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN
LHWKSANPSSSASLV |
pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE][EL][VN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHI
KVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQ
VAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQ
LSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELL
GGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWKSANPSSSASLV

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE][EL][VN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 19.4 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKSANPSSSAS |
pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
ENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 448 amino acids
Location between 263647703 and 263651862
Coverage of 12.5 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
W
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTIL
PEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHV
CTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFY
CHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMF
Y
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302833
Repeat occurs 9 times in a sequence of 499 amino acids
Location between 262920147 and 262921902
Coverage of 10.82 %
Instances:
PQNFTA | PQNFTA | PQNFTA | PQNFTA | PQNFTA
PQNFTA | PQNFTT | PQNFTA | PQNFTT |
pattern: PQNFT[AT]
MPAGDNPRSISEKKAALRESPKQSKNAAVNQQARAPPFPKDKAAETVGIKRPQPNGPLSATNH
HVPGNPGANGHLVYVRRKVETDQSKGGATSGAESATSLSSKKPGICGPQEQSLKRQSSVPNTQS
APVSASPAASASIPALHSASLPANLSFGKQSPGKVSAQPSVAVTASPPQRNVVSTGMPQNFTAA
NTSSLPQCNIVSTTMPQNFTAANNSSLPQRNVVSTAMPQNFTAVNASSPPQRNGVSTAMPQNFT
A
SNTAGLPQRSVVSTAMPQNFTAANTASPPRRNVVSTAMPQNFTAANTPSPPQRNVLSTAMPQN
FTT
ASTASPPQRNVVSTAMPQNFTAANTASPTHRNVVSAAVPQNFTTVSMAHCNVAATSTASRD
AVVTTTTRSPASLQRSSNQDWKERFIRLQEFLKNNEQSGQEEYIRMLHSLSSVGRSKHAIELEK
RAVNLLIEEGKELQKMKSLNVLGKLPPADHPSVPTQPTFAMRLPFQPFPPRR
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 16.14 %
Instances:
LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP
LHWKNANPS | LHWKNANPS | LHWKSANPS |
pattern: LHW[KG][KSN][AN][PN][PN][PS]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NP
SPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPS
LSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 16.11 %
Instances:
LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP
LHWKNANPS | LHWKNANPS | LHWKSANPS |
pattern: LHW[KG][KSN][AN][PN][PN][PS]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NP
SPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPS
LSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 17.15 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWKSANPSSSAS

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
NGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 445 amino acids
Location between 263647703 and 263651862
Coverage of 12.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNP
SPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKN
ANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEG
TMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTS
SRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQ
PKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 11.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 17.15 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWKSANPSSSAS

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSE
ENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 468 amino acids
Location between 263647703 and 263651862
Coverage of 13.46 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWK
NANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWH
IKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAK
QVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIA
QLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFEL
LGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 21.46 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKNANPSLSEE | LHWKSANPSSSAS |
pattern: LHW[KG][KSN][AN][PN][PN][PS][SL][PS][ASE][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
NGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSAS
LVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 16.14 %
Instances:
LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP
LHWKNANPS | LHWKNANPS | LHWKSANPS |
pattern: LHW[KG][KSN][AN][PN][PN][PS]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NP
SPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPS
LSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 19.4 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKSANPSSSAS |
pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
NGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 24.71 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN
LHWKNANPSLSEENG | LHWKSANPSSSASLV |
pattern: LHW[KG][KSN][AN][PN][PN][PS][SL][PS][ASE][SE][ELN][VGN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSANPSSSASLV
PEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY

Similar gene clusters

NC_067248 - Cluster 34 - Cyclopeptide

Gene cluster description

NC_067248 - Gene Cluster 34. Type = cyclopeptide. Location: 263424352 - 265848777 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 17.15 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWKSANPSSSAS

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSE
ENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 16.14 %
Instances:
LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP
LHWKNANPS | LHWKNANPS | LHWKSANPS |
pattern: LHW[KG][KSN][AN][PN][PN][PS]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NP
SPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPS
LSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 16.11 %
Instances:
LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP
LHWKNANPS | LHWKNANPS | LHWKSANPS |
pattern: LHW[KG][KSN][AN][PN][PN][PS]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NP
SPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPS
LSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWKSANPSSSASLV

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE][EL][VN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 21.46 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKNANPSLSEE | LHWKSANPSSSAS |
pattern: LHW[KG][KSN][AN][PN][PN][PS][SL][PS][ASE][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
NGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSAS
LVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 468 amino acids
Location between 263647703 and 263651862
Coverage of 13.46 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWK
NANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWH
IKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAK
QVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIA
QLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFEL
LGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 470 amino acids
Location between 263647703 and 263651862
Coverage of 13.4 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
W
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRH
WHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMK
AKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQ
IAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYF
ELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 16.14 %
Instances:
LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP | LHWGKNPNP
LHWKNANPS | LHWKNANPS | LHWKSANPS |
pattern: LHW[KG][KSN][AN][PN][PN][PS]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NP
SPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPS
LSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 19.4 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKSANPSSSAS |
pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
NGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 445 amino acids
Location between 263647703 and 263651862
Coverage of 12.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNP
SPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKN
ANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEG
TMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTS
SRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQ
PKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 24.71 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN
LHWKNANPSLSEENG | LHWKSANPSSSASLV |
pattern: LHW[KG][KSN][AN][PN][PN][PS][SL][PS][ASE][SE][ELN][VGN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSANPSSSASLV
PEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 448 amino acids
Location between 263647703 and 263651862
Coverage of 12.5 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
W
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTIL
PEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHV
CTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFY
CHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMF
Y
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 12.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 21.46 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKNANPSLSEE | LHWKSANPSSSAS |
pattern: LHW[KG][KSN][AN][PN][PN][PS][SL][PS][ASE][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
ENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSAS
LVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWKSANPSSSASLV

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE][EL][VN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 403 amino acids
Location between 263647703 and 263651862
Coverage of 22.33 %
Instances:
LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN | LHWGKNPNPSPSEEN
LHWKSANPSSSASLV |
pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE][EL][VN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEEN
GFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHI
KVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQ
VAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQ
LSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELL
GGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 17.15 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWKSANPSSSAS

pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
NGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 11.53 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 19.4 %
Instances:
LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE | LHWGKNPNPSPSE
LHWKSANPSSSAS |
pattern: LHW[KG][KS][AN][PN][PN][PS]S[PS][AS][SE]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
E
ENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSE
ENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 11.58 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
W
KNANPSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 12.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYSQHW
SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 11.56 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYSQHW | SSYALHW
SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 13.43 %
Instances:
SSYALHW | SSYALHW | SSYALHW | SSYALHW | SSYALHW
SSYSQHW | SSYALHW | SSYALHW | SSYALHW |
pattern: SSY[AS][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY

Similar gene clusters

NC_067248 - Cluster 35 - Fatty_acid

Gene cluster description

NC_067248 - Gene Cluster 35. Type = fatty_acid. Location: 267891381 - 268111854 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067249 - Cluster 36 - Saccharide

Gene cluster description

NC_067249 - Gene Cluster 36. Type = saccharide. Location: 34897272 - 35469624 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067249 - Cluster 37 - Saccharide

Gene cluster description

NC_067249 - Gene Cluster 37. Type = saccharide. Location: 48057006 - 48593106 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067249 - Cluster 38 - Saccharide

Gene cluster description

NC_067249 - Gene Cluster 38. Type = saccharide. Location: 67671443 - 67812054 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067249 - Cluster 39 - Saccharide

Gene cluster description

NC_067249 - Gene Cluster 39. Type = saccharide. Location: 191320459 - 191523890 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 40 - Fatty_acid-polyketide

Gene cluster description

NC_067250 - Gene Cluster 40. Type = fatty_acid-polyketide. Location: 20167198 - 20959519 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/polyketide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 41 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 41. Type = saccharide. Location: 38656596 - 38919374 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 42 - Cyclopeptide

Gene cluster description

NC_067250 - Gene Cluster 42. Type = cyclopeptide. Location: 54628721 - 57093588 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC127312752
Repeat occurs 9 times in a sequence of 435 amino acids
Location between 55857036 and 55865272
Coverage of 31.03 %
Instances:
PAKWSPSTSQKDDFG | PAKWNPSISQEDDFG | PAKWNPSTSRENDFG | PAKWNPSTSHKDDFD | PAKWIPSTSEEDDFG
PAKWNPSTSQEDDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG | PAKWNPSTSHSPIPQ |
pattern: PAKW[ISN]PS[IT]S[QHRE][KES][PDGN][ID][PF][QDG]
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FG
VSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDFG
VSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEGDFG
VSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGM
PKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFST
KVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQ
LKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 10 times in a sequence of 454 amino acids
Location between 55857036 and 55865272
Coverage of 33.04 %
Instances:
PAKWSPSTSQKDDFG | PAKWNPSISQEDDFG | PAKWNPSTSRENDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG
PAKWNPSTSHEDDFD | PAKWIPSTSKEDDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG | PAKWNPSTSHSPIPQ

pattern: PAKW[ISN]PS[IT]S[QHKR][KES][PDGN][ID][PF][QDG]
The following known motifs were found:
VS[AI]Y was found 6 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FG
VSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSQEDDFGASAYP
AKWNPSTSQEGDFG
VSAYPAKWNPSTSHEDDFDASAYPAKWIPSTSKEDDFGVSAYPAKWNPST
SQEDDFG
ASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKK
SLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSD
KEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDF
PYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMP
QNYIMFY
Repeat found in LOC127312752
Repeat occurs 10 times in a sequence of 454 amino acids
Location between 55857036 and 55865272
Coverage of 33.04 %
Instances:
PAKWSPSTSQKDDFG | PAKWNPSISQEDDFG | PAKWNPSTSRENDFG | PAKWNPSTSHKDDFD | PAKWIPSTSEEDDFG
PAKWNPSTSQEDDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG | PAKWNPSTSHSPIPQ

pattern: PAKW[ISN]PS[IT]S[QHRE][KES][PDGN][ID][PF][QDG]
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FG
VSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDFG
VSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEDDFG
ASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKK
SLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSD
KEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDF
PYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMP
QNYIMFY
Repeat found in LOC127312752
Repeat occurs 9 times in a sequence of 435 amino acids
Location between 55857036 and 55865272
Coverage of 31.03 %
Instances:
PAKWSPSTSQKDDFG | PAKWNPSISQEDDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG | PAKWNPSTSHEDDFD
PAKWIPSTSKEDDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG | PAKWNPSTSHSPIPQ |
pattern: PAKW[ISN]PS[IT]S[QHK][KES][PDG][ID][PF][QDG]
The following known motifs were found:
VS[AI]Y was found 6 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FG
VSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYP
AKWNPSTSHEDDFD
ASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEGDFG
VSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGM
PKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFST
KVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQ
LKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 9 times in a sequence of 435 amino acids
Location between 55857036 and 55865272
Coverage of 31.03 %
Instances:
PAKWSPSTSQKDDFG | PAKWNPSISQEDDFG | PAKWNPSTSRENDFG | PAKWNPSTSHKDDFD | PAKWIPSTSEEDDFG
PAKWNPSTSQEDDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG | PAKWNPSTSHSPIPQ |
pattern: PAKW[ISN]PS[IT]S[QHRE][KES][PDGN][ID][PF][QDG]
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FG
VSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDFG
VSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEGDFG
VSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGM
PKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFST
KVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQ
LKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 11 times in a sequence of 473 amino acids
Location between 55857036 and 55865272
Coverage of 20.93 %
Instances:
PAKWSPSTS | PAKWNPSIS | PAKWNPSTS | PAKWNPSTS | PAKWIPSTS
PAKWNPSTS | PAKWNPSTS | PAKWIPSTS | PAKWNPSTS | PAKWNPSTS
PAKWNPSTS |
pattern: PAKW[ISN]PS[IT]S
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FGVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTS
EEDDFGVSSYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHEDDFDASAYPAKWIPST
S
KEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHSPIPQ
AQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILSHFKIPHN
SVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHETPSSRYT
VVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHMNTSDWDK
QYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 8 times in a sequence of 416 amino acids
Location between 55857036 and 55865272
Coverage of 17.31 %
Instances:
PAKWSPSTS | PAKWNPSIS | PAKWNPSTS | PAKWNPSTS | PAKWIPSTS
PAKWNPSTS | PAKWNPSTS | PAKWNPSTS |
pattern: PAKW[ISN]PS[IT]S
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FGVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTS
EEDDFGVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
S
HSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILS
HFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHE
TPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHM
NTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 12 times in a sequence of 492 amino acids
Location between 55857036 and 55865272
Coverage of 21.95 %
Instances:
PAKWSPSTS | PAKWNPSIS | PAKWNPSTS | PAKWNPSTS | PAKWIPSTS
PAKWNPSTS | PAKWNPSTS | PAKWNPSTS | PAKWIPSTS | PAKWNPSTS
PAKWNPSTS | PAKWNPSTS |
pattern: PAKW[ISN]PS[IT]S
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FGVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTS
EEDDFGVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
S
HEDDFDASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFG
VSAY
PAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSA
PLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLR
VRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNG
MPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 8 times in a sequence of 416 amino acids
Location between 55857036 and 55865272
Coverage of 17.31 %
Instances:
PAKWSPSTS | PAKWNPSIS | PAKWNPSTS | PAKWNPSTS | PAKWIPSTS
PAKWNPSTS | PAKWNPSTS | PAKWNPSTS |
pattern: PAKW[ISN]PS[IT]S
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FGVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTS
EEDDFGVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
S
HSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILS
HFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHE
TPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHM
NTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 11 times in a sequence of 473 amino acids
Location between 55857036 and 55865272
Coverage of 34.88 %
Instances:
PAKWSPSTSQKDDFG | PAKWNPSISQEDDFG | PAKWNPSTSHKDDFD | PAKWIPSTSEEDDFG | PAKWNPSTSQEDDFG
PAKWNPSTSQEGDFG | PAKWNPSTSHEDDFD | PAKWIPSTSKEDDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG
PAKWNPSTSHSPIPQ |
pattern: PAKW[ISN]PS[IT]S[QHKE][KES][PDG][ID][PF][QDG]
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FG
VSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSHKDDFDVSAYPAKWIPSTSEEDDFGVSSYP
AKWNPSTSQEDDFG
ASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHEDDFDASAYPAKWIPST
SKEDDFG
VSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHSPIPQ
AQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILSHFKIPHN
SVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHETPSSRYT
VVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHMNTSDWDK
QYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 12 times in a sequence of 492 amino acids
Location between 55857036 and 55865272
Coverage of 36.59 %
Instances:
PAKWSPSTSQKDDFG | PAKWNPSISQEDDFG | PAKWNPSTSHKDDFD | PAKWIPSTSEEDDFG | PAKWNPSTSQEDDFG
PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG | PAKWNPSTSHEDDFD | PAKWIPSTSKEDDFG | PAKWNPSTSQEDDFG
PAKWNPSTSQEGDFG | PAKWNPSTSHSPIPQ |
pattern: PAKW[ISN]PS[IT]S[QHKE][KES][PDG][ID][PF][QDG]
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FG
VSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSHKDDFDVSAYPAKWIPSTSEEDDFGVSSYP
AKWNPSTSQEDDFG
ASAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
SHEDDFD
ASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFG
VSAY
PAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSA
PLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLR
VRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNG
MPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 8 times in a sequence of 416 amino acids
Location between 55857036 and 55865272
Coverage of 17.31 %
Instances:
PAKWSPSTS | PAKWNPSIS | PAKWNPSTS | PAKWNPSTS | PAKWIPSTS
PAKWNPSTS | PAKWNPSTS | PAKWNPSTS |
pattern: PAKW[ISN]PS[IT]S
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FGVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTS
EEDDFGVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
S
HSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILS
HFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHE
TPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHM
NTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 9 times in a sequence of 435 amino acids
Location between 55857036 and 55865272
Coverage of 31.03 %
Instances:
PAKWSPSTSQKDDFG | PAKWNPSISQEDDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG | PAKWNPSTSHEDDFD
PAKWIPSTSKEDDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG | PAKWNPSTSHSPIPQ |
pattern: PAKW[ISN]PS[IT]S[QHK][KES][PDG][ID][PF][QDG]
The following known motifs were found:
VS[AI]Y was found 6 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FG
VSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYP
AKWNPSTSHEDDFD
ASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEGDFG
VSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGM
PKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFST
KVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQ
LKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 13 times in a sequence of 511 amino acids
Location between 55857036 and 55865272
Coverage of 38.16 %
Instances:
PAKWSPSTSQKDDFG | PAKWNPSISQEDDFG | PAKWNPSTSRENDFG | PAKWNPSTSHKDDFD | PAKWIPSTSEEDDFG
PAKWNPSTSQEDDFG | PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG | PAKWNPSTSHEDDFD | PAKWIPSTSKEDDFG
PAKWNPSTSQEDDFG | PAKWNPSTSQEGDFG | PAKWNPSTSHSPIPQ |
pattern: PAKW[ISN]PS[IT]S[QHKRE][KES][PDGN][ID][PF][QDG]
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FG
VSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDFG
VSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEGDFG
VSAYPAKWNPSTSHEDDFDASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFG
ASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTV
LPEGTMFTRVGMPKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMC
FSSRGAMQRFSTKVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYC
HRPKEVQSFRVQLKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 12 times in a sequence of 492 amino acids
Location between 55857036 and 55865272
Coverage of 21.95 %
Instances:
PAKWSPSTS | PAKWNPSIS | PAKWNPSTS | PAKWNPSTS | PAKWIPSTS
PAKWNPSTS | PAKWNPSTS | PAKWNPSTS | PAKWIPSTS | PAKWNPSTS
PAKWNPSTS | PAKWNPSTS |
pattern: PAKW[ISN]PS[IT]S
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FGVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTS
EEDDFGVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
S
HEDDFDASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFG
VSAY
PAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSA
PLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLR
VRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNG
MPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY

Similar gene clusters

NC_067250 - Cluster 43 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 43. Type = saccharide. Location: 76553001 - 77002543 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 44 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 44. Type = saccharide. Location: 77973231 - 78335041 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 45 - Saccharide-polyketide

Gene cluster description

NC_067250 - Gene Cluster 45. Type = saccharide-polyketide. Location: 91971948 - 92394019 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_067250 - Cluster 46 - Alkaloid

Gene cluster description

NC_067250 - Gene Cluster 46. Type = alkaloid. Location: 104728069 - 106173362 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 47 - Alkaloid

Gene cluster description

NC_067250 - Gene Cluster 47. Type = alkaloid. Location: 156468058 - 156716456 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 48 - Cyclopeptide

Gene cluster description

NC_067250 - Gene Cluster 48. Type = cyclopeptide. Location: 177490576 - 178856029 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeat found in LOC127316944
Repeat occurs 4 times in a sequence of 408 amino acids
Location between 178172571 and 178174033
Coverage of 7.84 %
Instances:
AADTSSFV | AADTPYGY | AADTPFGY | AADTPFGY |
pattern: AADT[PS][YSF][GF][YV]
MARFLIALLAAALVAVQAGGQLGHAAPATAEVFWRTVLPHSPLPDAILRLLRPAADTSSFVSK
AEERPPFDYQDYRRSSSGDGSSKRTGAAARAGDFDYDDYRGGGAADTPYGYSYSKQAPSTNQAE
ERRVAAADTPFGYDYKAPSTNQAGERLAGAAADTPFGYDYKAPSTAMGGGQAAEPTKTKVFFQE
ESVRVGERLPFHFPPAGTSAPALGFLPRRVADSVPFTAPALPGVLAAFGVAPGSTTASSMEATL
RACETPTMAGESKFCATSLEALVERAMGVLGTRDVRPVTSTLPRDGAPLQTYTVRAVQRVEGGP
VFVACHPEPFPYTVYRCHTTGPSRAYMLEMEGAVDGVTIATVCHTDTSMWNPEHISFRLLGTKP
GGTPVCHLMPYGHIIWAKNVKRSTA

Similar gene clusters

NC_067250 - Cluster 49 - Terpene

Gene cluster description

NC_067250 - Gene Cluster 49. Type = terpene. Location: 186299622 - 186564179 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 50 - Polyketide

Gene cluster description

NC_067250 - Gene Cluster 50. Type = polyketide. Location: 189769159 - 189924259 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 51 - Saccharide-terpene

Gene cluster description

NC_067250 - Gene Cluster 51. Type = saccharide-terpene. Location: 191172347 - 192056424 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 52 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 52. Type = saccharide. Location: 222545203 - 222614479 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 53 - Putative

Gene cluster description

NC_067250 - Gene Cluster 53. Type = putative. Location: 223800593 - 224135400 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 54 - Saccharide-alkaloid

Gene cluster description

NC_067250 - Gene Cluster 54. Type = saccharide-alkaloid. Location: 229319276 - 229512076 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 55 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 55. Type = saccharide. Location: 230140995 - 230612042 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 56 - Fatty_acid

Gene cluster description

NC_067250 - Gene Cluster 56. Type = fatty_acid. Location: 236250140 - 236354924 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 57 - Saccharide-terpene

Gene cluster description

NC_067250 - Gene Cluster 57. Type = saccharide-terpene. Location: 244447655 - 244659362 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 58 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 58. Type = saccharide. Location: 251263830 - 251593892 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 59 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 59. Type = saccharide. Location: 254016837 - 254432994 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026138795 - Cluster 60 - Putative

Gene cluster description

NW_026138795 - Gene Cluster 60. Type = putative. Location: 210545 - 464194 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026138842 - Cluster 61 - Phenolamide

Gene cluster description

NW_026138842 - Gene Cluster 61. Type = phenolamide. Location: 477513 - 773833 nt. Click on genes for more information.
Show pHMM detection rules used
plants/phenolamide: (minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

No significant ClusterBlast hits found.

NW_026140057 - Cluster 62 - Putative

Gene cluster description

NW_026140057 - Gene Cluster 62. Type = putative. Location: 304968 - 813233 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026140398 - Cluster 63 - Terpene

Gene cluster description

NW_026140398 - Gene Cluster 63. Type = terpene. Location: 2264369 - 2926993 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters