Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_067244.1:
Cluster 1Cyclopeptide2780165728558594756.94BURP---
Cluster 2Putative224975255225822818847.562OG-FeII_Oxy, DIOX_N, Dimerisation, Methyltransf_2, p450---
Cluster 3Polyketide253752808253991533238.72Chal_sti_synt_C, Chal_sti_synt_N, Peptidase_S10, polyprenyl_synt---
The following clusters are from record NC_067245.1:
Cluster 4Terpene21096202511217401.60Terpene_synth, Terpene_synth_C, p450---
Cluster 5Saccharide-Alkaloid1797150718324864353.36Bet_v_1, Methyltransf_11, UDPGT_2, adh_short, adh_short_C2, p450---
Cluster 6Saccharide229833202306477381.45Cellulose_synt, NAD_binding_1, UDPGT_2*saccharide--
Cluster 7Lignan-Alkaloid33725362349946471269.29BBE, Dirigent, FAD_binding_4, adh_short_C2---
Cluster 8Terpene3879815139045182247.03Methyltransf_11, Peptidase_S10, Terpene_synth, Terpene_synth_C---
Cluster 9Terpene46980305480452991064.99SQHop_cyclase_C, SQHop_cyclase_N, Transferase, p450triterpeneavenacin (16% of genes show similarity)BGCMANUAL1.1_c1
Cluster 10Lignan5129269451481939189.25AMP-binding, Amino_oxidase, Dirigent---
Cluster 11Saccharide-Lignan8464570684990173344.47Aminotran_1_2, Dirigent, Glycos_transf_1---
Cluster 12Terpene200915784201237093321.31ADH_N, Aldo_ket_red, Terpene_synth, Terpene_synth_C---
Cluster 13Cyclopeptide2131384342145154471377.01BURP---
Cluster 14Saccharide-Alkaloid231183795231472462288.67Abhydrolase_3, Pyridoxal_deC, UDPGT_2cyanogenic glucoside, monoterpenoid--
Cluster 15Alkaloid240966709241481170514.46Aldo_ket_red, Lipoxygenase, Peptidase_S10, Pyridoxal_deC---
Cluster 16Cyclopeptide293847334294838102990.77BURP---
Cluster 17Terpene299224459300116401891.94LTP_2, Terpene_synth, Terpene_synth_C, p450-momilactone B (42% of genes show similarity)BGC0000671.3_c1
Cluster 18Terpene325455149326370901915.752OG-FeII_Oxy, 3Beta_HSD, DIOX_N, SQHop_cyclase_C, SQHop_cyclase_N, p450triterpeneavenacin (16% of genes show similarity)BGCMANUAL1.1_c1
Cluster 19Terpene336458363336689903231.54Terpene_synth, Terpene_synth_C, adh_short, p450---
The following clusters are from record NC_067246.1:
Cluster 20Saccharide-Alkaloid1967614611978314811070.02AMP-binding, Str_synth, UDPGT_2, p450flavonoid, oleananes--
Cluster 21Saccharide206051296206555421504.12UDPGT_2, p450---
Cluster 22Alkaloid230351755230503863152.11Bet_v_1, Epimerase, Methyltransf_11---
Cluster 23Saccharide261997571262249227251.66Glycos_transf_1, UDPGT_2, p450---
The following clusters are from record NC_067247.1:
Cluster 24Saccharide3603256936242455209.89SQS_PSY, UDPGT_2, adh_short*saccharide--
Cluster 25Saccharide5729698458255402958.42Abhydrolase_3, UDPGT_2---
Cluster 26Saccharide8204893982587862538.92Glyco_hydro_1, UDPGT_2*saccharide-6--
Cluster 27Terpene-Lignan119589273120419633830.363Beta_HSD, AMP-binding, Dimerisation, Dirigent, Methyltransf_2, SQHop_cyclase_C, SQHop_cyclase_N, p450triterpene--
Cluster 28Saccharide-Alkaloid169244083169750038505.95AMP-binding, Str_synth, UDPGT_2flavonoid--
Cluster 29Saccharide171132931171553080420.152OG-FeII_Oxy, Aldo_ket_red, DIOX_N, UDPGT_2cytokinin--
The following clusters are from record NC_067248.1:
Cluster 30Saccharide5903861095844505.46Abhydrolase_3, Glycos_transf_1, Peptidase_S10, p450---
Cluster 31Cyclopeptide1489554441517423182786.87BURP---
Cluster 32Saccharide205899693206192971293.283Beta_HSD, Glycos_transf_2, Transferase---
Cluster 33Cyclopeptide2255872852265931271005.84BURP---
Cluster 34Terpene-Alkaloid254751026255175568424.54HMGL-like, Str_synth, Terpene_synth, Terpene_synth_C---
Cluster 35Phenolamide259630981260206783575.80Orn_Arg_deC_N, Orn_DAP_Arg_deC, Transferase-hydroxycinnamoylputrescine (100% of genes show similarity)BGC0002622.2_c1
Cluster 36Cyclopeptide2618445182654550483610.53BURP---
Cluster 37Cyclopeptide2629154562655111022595.65BURP---
Cluster 38Cyclopeptide2634243522658487772424.43BURP---
Cluster 39Fatty_acid-Transporter_associated267891381268155016263.63ABC2_membrane, ABC_tran, ADH_N, ADH_zinc_N, AMP-binding, Epimerase, NAD_binding_4, Transferase---
The following clusters are from record NC_067249.1:
Cluster 40Saccharide3489727235469624572.35Amino_oxidase, Methyltransf_11, UDPGT_2cyanogenic glucoside-3, monoterpenoid-3--
Cluster 41Saccharide4805700648593106536.10Glycos_transf_1, Glycos_transf_2, Methyltransf_11---
Cluster 42Saccharide6767144367812054140.613Beta_HSD, UDPGT_2, p450---
Cluster 43Saccharide191320459191523890203.43Aldo_ket_red, UDPGT_2, p450flavonoid-6, oleananes-6--
The following clusters are from record NC_067250.1:
Cluster 44Polyketide-Fatty_acid2016719820959519792.32Chal_sti_synt_N, FA_desaturase, UbiA---
Cluster 45Saccharide3865659638919374262.78Epimerase, Glyco_hydro_1, p450---
Cluster 46Cyclopeptide54628721570935882464.87BURP---
Cluster 47Saccharide7638281577002543619.73ABC_tran, Methyltransf_11, UDPGT_2, adh_short---
Cluster 48Saccharide7797323178335041361.81ECH_2, UDPGT_2, adh_short_C2flavonoid-3--
Cluster 49Saccharide-Polyketide9197194892394019422.07Abhydrolase_3, Cellulose_synt, Chal_sti_synt_N-beta-diketones (50% of genes show similarity)BGC0002403.2_c1
Cluster 50Alkaloid1047280691061733621445.29Aminotran_1_2, BBE, FAD_binding_4---
Cluster 51Alkaloid156468058156716456248.40Abhydrolase_3, Cu_amine_oxid, adh_short---
Cluster 52Cyclopeptide1774905761788560291365.45BURP---
Cluster 53Cyclopeptide185876124186455769579.64BURP, Terpene_synth, Terpene_synth_C, p450---
Cluster 54Terpene186299622186564179264.56Terpene_synth, Terpene_synth_C, adh_short, p450---
Cluster 55Polyketide189769159189924259155.10Aldo_ket_red, Chal_sti_synt_C, Chal_sti_synt_N---
Cluster 56Saccharide-Terpene191172347192056424884.08SQHop_cyclase_C, SQHop_cyclase_N, Transferase, UDPGT_2, p450flavonoid, triterpene--
Cluster 57Saccharide22254520322261447969.28Abhydrolase_3, UDPGT_2, p450flavonoid--
Cluster 58Transporter_associated223800593224217096416.50ABC_tran, Transferase, adh_short_C2---
Cluster 59Saccharide-Alkaloid229319276229512076192.80Aminotran_1_2, Pyridoxal_deC, UDPGT_2small phenolic--
Cluster 60Saccharide230140995230612042471.05ADH_N, ADH_zinc_N, Methyltransf_3, UDPGT_2flavonoid, oleananes--
Cluster 61Fatty_acid236250140236354924104.78FA_desaturase_2, Transferase---
Cluster 62Saccharide-Terpene244447655244659362211.712OG-FeII_Oxy, Glycos_transf_1, SQHop_cyclase_C, SQHop_cyclase_Nsterol--
Cluster 63Terpene-Cyclopeptide-Saccharide-Fatty_acid2481093762513989813289.61AMP-binding, BURP, Cellulose_synt, Dimerisation, Methyltransf_2, Terpene_synth, Transferase, p450---
Cluster 64Saccharide251263830251593892330.06Cellulose_synt, p450---
Cluster 65Saccharide254016837254432994416.16Methyltransf_11, SQS_PSY, UDPGT_2flavonoid--
The following clusters are from record NW_026138763.1:
Cluster 66Cyclopeptide1667601667.60BURP---
The following clusters are from record NW_026138795.1:
Cluster 67Putative210545464194253.653Beta_HSD, Epimerase, Transferase, p450---
The following clusters are from record NW_026138842.1:
Cluster 68Phenolamide477513773833296.32Pyridoxal_deC, Transferase---
The following clusters are from record NW_026140057.1:
Cluster 69Putative304968813233508.26Abhydrolase_3, adh_short---
The following clusters are from record NW_026140398.1:
Cluster 70Terpene22643692926993662.622OG-FeII_Oxy, DIOX_N, Peptidase_S10, Terpene_synth, Terpene_synth_C---

NC_067244 - Cluster 1 - Cyclopeptide

Gene cluster description

NC_067244 - Gene Cluster 1. Type = cyclopeptide. Location: 27801657 - 28558594 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

No repeats detected in this cluster.

Similar gene clusters

NC_067244 - Cluster 2 - Putative

Gene cluster description

NC_067244 - Gene Cluster 2. Type = putative. Location: 224975255 - 225822818 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067244 - Cluster 3 - Polyketide

Gene cluster description

NC_067244 - Gene Cluster 3. Type = polyketide. Location: 253752808 - 253991533 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 4 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 4. Type = terpene. Location: 2109620 - 2511217 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 5 - Saccharide-alkaloid

Gene cluster description

NC_067245 - Gene Cluster 5. Type = saccharide-alkaloid. Location: 17971507 - 18324864 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 6 - Saccharide

Gene cluster description

NC_067245 - Gene Cluster 6. Type = saccharide. Location: 22983320 - 23064773 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 7 - Lignan-alkaloid

Gene cluster description

NC_067245 - Gene Cluster 7. Type = lignan-alkaloid. Location: 33725362 - 34994647 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 8 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 8. Type = terpene. Location: 38798151 - 39045182 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 9 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 9. Type = terpene. Location: 46980305 - 48045299 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_067245 - Cluster 10 - Lignan

Gene cluster description

NC_067245 - Gene Cluster 10. Type = lignan. Location: 51292694 - 51481939 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 11 - Saccharide-lignan

Gene cluster description

NC_067245 - Gene Cluster 11. Type = saccharide-lignan. Location: 84645706 - 84990173 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 12 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 12. Type = terpene. Location: 200915784 - 201237093 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 13 - Cyclopeptide

Gene cluster description

NC_067245 - Gene Cluster 13. Type = cyclopeptide. Location: 213138434 - 214515447 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC127334505
Repeat occurs 9 times in a sequence of 1929 amino acids
Location between 213302497 and 213314289
Coverage of 2.8 %
Instances:
PGFHAK | PGFGAD | PGFGAD | PGFGAD | PGFGAD
PGFGAE | PGFGAD | PGFEPQ | PGFCQE |
pattern: PGF[HGCE][QAP][EQKD]
MEADARGSADHVATAAPPLKTDDGAAAMAPAPPLKAEDGAAPMGPAPQMGDELTSYNNPHTEE
DEEKPDLAVKLEHASVEAVSVPFQVKVQIAGEKGSTTDEMPVMTMDDEAVSVPFQVKVEIGDEE
GSRTDEMPMMSKDDEDLLAESDDLNTCHVANGGLLGKARIAPCSQTFSSRDVLCMRATEDRQTG
RSQHEWDGLAGNGGDWGGMAEVMTTEAGLKMVSEGLSDSVKHGAERWPRGVDGAVFMTDDSHVL
RRGDSMPKIEPEVSGGLHEDSVPSVAGRIGVSLNGNTDRAGETSAPGVSQCSKEADLAGEGVVS
CDGALLKTSLDDLQARCREPHCYAVGFSNSVKFDIQQLPHHMEAMLLKKDVKRELEMDGLLPKI
EANVSSPAHQDSVPTTFGRNLAVYLDGEDGEISQHKTGMEKLASGSLGGGMLSGENKFSTMEGC
YANNRGLGHQGFSGIELGCPDHLALEHNACNFVADKPCSTVFAENGNDEELENQNSESLSVSRR
RNPRRSASSRNPTLEKHDQICKGSNSSRKSNKVESSCSLVESTINKFPKKITKLRSGINRPLKS
TAWGSLQKLTCGFGLNGEPSTSNSHLVTLENGRSHKRSGKKEQPSVRNSQSSTCAKNKSPPFSA
MGFASDELNGQPTFSVTTGTYASSEGYIGNFPRLDLRALVNGSGDAHKTAQYMSIKTDLQQLDR
CLESATQETCPAYICGDFAKSTSEPSLNIASVGFSPDSVLEVASATCENNTSDSGKNLASSSTD
LEQWAQTVRGDENTRNEEINPSHAISGFIGKGKVQVLEKSNAVRKSKKVGKQERQKKDGMKGKN
IKNRGSTKSPSKLGAFSDDSYSLVPSVPPKSGSCFELATSVTQCISMREHESVQGRPVIVCALD
TVKSQRRKKTDDSGGKKDTMWDPYVDGKGMNQKIAGDILFDSGLSTSPPPLVTDLAASHTNEQN
GASVSQRPRAAWVCCDDCQKWRCIPADLADTIGTTNSRWTCKDNQDKTFAGCSIPQEKTNAEIN
AELDLSDASADEADNDRSNSKASRAPSWTHVRTNSFLHRNRRNQSIDESMVCNCKPPQDGRMGC
RDGCLNRMLNIECEKRTCPCGEQCSNQQFQRRSYAKISWFHSGKKGYGLELQEEVSEGRFLIEY
VGEVLDITAYEHRQRDYASKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVN
GEVCIGIFATRNIKKGEELTFDYNYVRVSGAAPQKCFCGTAKCRGYIGGDISGSGIISQDDAEA
EHFQPLVTYKDAEELLGNAYSRVADPSIVEHETSIQREDSKDAEEMLGNAYSHVADPNIVEHET
SIKQEDSNNCPPATPDSEPHQQTSPILSDTSEPENSMEAWSPQDTEEVTRTPVHVSRNIESSLQ
FPVYVTQPLEFLEKTPSPIDGLMAPNVMNRSTPSSDMGGSLVPGFHAKKENSLKHHRNVKLPCP
IDTEHTLGVEVRLNSLLDRDGGISRRKDSTNEYLRLLLRTAADGEAGGTSKSIRDLALILDALL
QTKSGAVLLDIINMNGLQMLHNILKQNRDHFLRRPIIRKLLKVLEFLALRGILTAEKINEGPRL
AGMESFRDSMLKLTRHSDKHVHPIARKFCDKWILPYMDGPANCSTHSYSARRKRKSRWDYQPES
HYKKTVKVYSGHRELDLQTSLIGNGSQGNRVTNSNHNDVPVAVDDAPPGFGADDDVPPGFGADV
DVPPGFGADEDVPPGFGADEDVPPGFGAEDDVPPGFGADDDAPPGFEPQQEHQPAQAPLDLGGA
PGFCQE
RYLPNSSTSSEIPVALVQHFGTPEVEGTQCGLKWKVAPGVPFNPRGSPSPSTSSMHMP
HHDGAPAMKQNSSGYRGRGADRGGRIHRNWRNGERTRSPYDQEGRRSHSNHHRLERCEPPRHQE
HHGSGFTGRE
Repeat found in LOC127329066
Repeat occurs 3 times in a sequence of 173 amino acids
Location between 213576806 and 213577922
Coverage of 10.4 %
Instances:
VFDHVD | VFDRTP | VFDYTP |
pattern: VFD[HRY][TV][PD]
MDKDASTGGGGKVFDHVDDSGDAVFSGLGNLSSNMTDDADESADKTIRESLDLLNQDFAAYSA
DSLPRFLDPPVFDRTPAVEKGNMGSPVFDYTPLRQVLESESVEATPVMEETQSQSVAGTQIVTQ
DYMVEEQLAKEKERQAAKLAQHQEKLTTYYRKYPAKAKSAKGNAHT
Repeat found in LOC127334525
Repeat occurs 4 times in a sequence of 335 amino acids
Location between 214257473 and 214260546
Coverage of 11.94 %
Instances:
DDDDDDEETE | DDDDDEETED | DDDDEETEDE | DDDEETEDEE |
pattern: DDD[ED][ED][ETD][ET][ETD][ETD][ED]
MQAAPHRRHRWVKEWIPQDLIIAGGPCALYKWVREDRLAALKSNDKEQVTDSAKPEPTTEILF
LCSYDGCGKTFVDAGALKKHANVHGERQYVCHYEGCGKKFLDSSKLKRHFLIHTGEKNFVCHHE
GCGKAFSLDFNLKAHMKTHFADNYHLCPSPECGRRFTQESKLRAHYRAQHDKNPGGPVMNLHNP
VKPLATPPVPSADRPYVCPYDGCAKAYIHEYKLNLHLRKEHPNHYADAGAEAGPSRGTASKKSR
RRKPNQTPNLPLKIPKRSEYSAPSPAVSIPEEHQWPRKVLYEEDSEETEEEGENLEDGGSRYRA
ASSDDDDDDEETEDEE

Similar gene clusters

NC_067245 - Cluster 14 - Saccharide-alkaloid

Gene cluster description

NC_067245 - Gene Cluster 14. Type = saccharide-alkaloid. Location: 231183795 - 231472462 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 15 - Alkaloid

Gene cluster description

NC_067245 - Gene Cluster 15. Type = alkaloid. Location: 240966709 - 241481170 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067245 - Cluster 16 - Cyclopeptide

Gene cluster description

NC_067245 - Gene Cluster 16. Type = cyclopeptide. Location: 293847334 - 294838102 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC127336078
Repeat occurs 9 times in a sequence of 501 amino acids
Location between 294341514 and 294343921
Coverage of 21.56 %
Instances:
SYGSKQEDDPNK | SYGSEQEGDPNK | SYGSEQEDDPHK | SYGSEQEDDPNK | SYGSEQEGDPNK
SYGSKQEDDPHK | SYGSEQEDDPNK | SYGSEQEEGPHK | SYGSEQKEVQKI |
pattern: SYGS[EK]Q[EK][EGD][GVD][QP][HKN][KI]
MQHMVAHDMLVPLIVSLLLIAGGGGWHASFVNAATTAARKLETPMTKAYWETLLPGTSMPPAI
SDLLGQQNVWDKGEEVPSKDIVSHGSEQEDPHKATMSYESEKDNDPHKITISYGSKQEDDPNKI
TMSYGSEQEGDPNKATISYGSEQEDDPHKATMSYGSEQEDDPNKITMSYGSEQEGDPNKATMSY
GSKQEDDPHK
VTMSYGSEQEDDPNKLTMSYGSEQEEGPHKVTMSYGSEQKEVQKIILSQHEARM
NGQGSHHHKAHSHSTKRQQADVFFFHDMLRPGFIITPTIPPTSSLPSLLPRSVAGSIPFSHERL
SDIISMFAPASLSMARKIRWTLDMCEHPRTLPGQSAGCATSLESLAELPPSFLKTRNVRAFSAA
DLPVEARGTRALRGRYNVTAVLKVSGESSEIVTCHDLTYPYAVYYCHTANRTSAYTVTLTSVED
GVVPKTMEALAVCHLDTSKWSPKNPFFELHNLKPGEVTVCHFLTKLSIIWVPGS

Similar gene clusters

NC_067245 - Cluster 17 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 17. Type = terpene. Location: 299224459 - 300116401 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_067245 - Cluster 18 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 18. Type = terpene. Location: 325455149 - 326370901 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_067245 - Cluster 19 - Terpene

Gene cluster description

NC_067245 - Gene Cluster 19. Type = terpene. Location: 336458363 - 336689903 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067246 - Cluster 20 - Saccharide-alkaloid

Gene cluster description

NC_067246 - Gene Cluster 20. Type = saccharide-alkaloid. Location: 196761461 - 197831481 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067246 - Cluster 21 - Saccharide

Gene cluster description

NC_067246 - Gene Cluster 21. Type = saccharide. Location: 206051296 - 206555421 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067246 - Cluster 22 - Alkaloid

Gene cluster description

NC_067246 - Gene Cluster 22. Type = alkaloid. Location: 230351755 - 230503863 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067246 - Cluster 23 - Saccharide

Gene cluster description

NC_067246 - Gene Cluster 23. Type = saccharide. Location: 261997571 - 262249227 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 24 - Saccharide

Gene cluster description

NC_067247 - Gene Cluster 24. Type = saccharide. Location: 36032569 - 36242455 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 25 - Saccharide

Gene cluster description

NC_067247 - Gene Cluster 25. Type = saccharide. Location: 57296984 - 58255402 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 26 - Saccharide

Gene cluster description

NC_067247 - Gene Cluster 26. Type = saccharide. Location: 82048939 - 82587862 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 27 - Terpene-lignan

Gene cluster description

NC_067247 - Gene Cluster 27. Type = terpene-lignan. Location: 119589273 - 120419633 nt. Click on genes for more information.
Show pHMM detection rules used
plants/lignan: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Dirigent]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 28 - Saccharide-alkaloid

Gene cluster description

NC_067247 - Gene Cluster 28. Type = saccharide-alkaloid. Location: 169244083 - 169750038 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067247 - Cluster 29 - Saccharide

Gene cluster description

NC_067247 - Gene Cluster 29. Type = saccharide. Location: 171132931 - 171553080 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067248 - Cluster 30 - Saccharide

Gene cluster description

NC_067248 - Gene Cluster 30. Type = saccharide. Location: 590386 - 1095844 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067248 - Cluster 31 - Cyclopeptide

Gene cluster description

NC_067248 - Gene Cluster 31. Type = cyclopeptide. Location: 148955444 - 151742318 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC127300864
Repeat occurs 4 times in a sequence of 697 amino acids
Location between 149129868 and 149131962
Coverage of 3.44 %
Instances:
SWNSFL | SWNALI | SWNAML | SWNSML |
pattern: SWN[SA][FLM][IL]
MRHRRHGHHLTIAAAKSHATLLKSRVISPTPWNQLLTAYSHSPLGLAAARRLFDEIPRPDAVS
WNSFL
TAHVSAGAHHAAWRLLRAMHALGLASNTFALGSALRSAAASRCPALGTQLHSLALKCSL
ADNVFSASAMLDMYAKCGRIRDARRVFDGMPERNTVSWNALIAGYVESGKVEQALELFLHMERQ
GFVPDEATFAALLPAVDGSNHLLMRQLHGKIMKFGSMLGLTVLNAAITAYSQCGALAECRRFFD
GIGDSRDLISWNAMLGAYTHHGMACEAMRFFVRMMQESGVQLDMYSFTSIISVCPEHDDQRGIV
VHGMVIKNGLEGVTPVCNALISMYTRFSDKCMMEDAHRCFDSLLLKDTVSWNSMLTGYSQHGMS
ADALRFFTCMHSENIRTDEYAFSAALRSCADLAVLLLGRQIHGSIIRSGFASNSFVSSSLIFMY
SKTGILDDAKMSFEEADKSSSVPWNSMMFGYAQHGHAQTVYNLFNEMVELQVPLDHVTFVALIT
ACSHAGLVDEGSEILNAMETRYGIPLRMEHYACGIDMYGRAGQLDRAKELIDSMPFEPDAMVWM
TLLGACRIHGNMELASEVASHLLVAEPRQHSTYVLLSSMYSGLEMWSDRAIVQRAMRNRGLSKV
PGWSWIEVKNEVHSFNAEDRSHPRMNEIYEVLSLLFQVATMFSSSEDDEAITAISSDA

Similar gene clusters

NC_067248 - Cluster 32 - Saccharide

Gene cluster description

NC_067248 - Gene Cluster 32. Type = saccharide. Location: 205899693 - 206192971 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067248 - Cluster 33 - Cyclopeptide

Gene cluster description

NC_067248 - Gene Cluster 33. Type = cyclopeptide. Location: 225587285 - 226593127 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_067248 - Cluster 34 - Terpene-alkaloid

Gene cluster description

NC_067248 - Gene Cluster 34. Type = terpene-alkaloid. Location: 254751026 - 255175568 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067248 - Cluster 35 - Phenolamide

Gene cluster description

NC_067248 - Gene Cluster 35. Type = phenolamide. Location: 259630981 - 260206783 nt. Click on genes for more information.
Show pHMM detection rules used
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_067248 - Cluster 36 - Cyclopeptide

Gene cluster description

NC_067248 - Gene Cluster 36. Type = cyclopeptide. Location: 261844518 - 265455048 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC127302833
Repeat occurs 9 times in a sequence of 499 amino acids
Location between 262920147 and 262921902
Coverage of 27.05 %
Instances:
PQNFTAANTSSLPQC | PQNFTAANNSSLPQR | PQNFTAVNASSPPQR | PQNFTASNTAGLPQR | PQNFTAANTASPPRR
PQNFTAANTPSPPQR | PQNFTTASTASPPQR | PQNFTAANTASPTHR | PQNFTTVSMAHCNVA |
pattern: PQNFT[TA][SVA][SN][TMAN][SAP][HGS][CPL][TPN][HQRV][CRA]
MPAGDNPRSISEKKAALRESPKQSKNAAVNQQARAPPFPKDKAAETVGIKRPQPNGPLSATNH
HVPGNPGANGHLVYVRRKVETDQSKGGATSGAESATSLSSKKPGICGPQEQSLKRQSSVPNTQS
APVSASPAASASIPALHSASLPANLSFGKQSPGKVSAQPSVAVTASPPQRNVVSTGMPQNFTAA
NTSSLPQC
NIVSTTMPQNFTAANNSSLPQRNVVSTAMPQNFTAVNASSPPQRNGVSTAMPQNFT
ASNTAGLPQR
SVVSTAMPQNFTAANTASPPRRNVVSTAMPQNFTAANTPSPPQRNVLSTAMPQN
FTTASTASPPQR
NVVSTAMPQNFTAANTASPTHRNVVSAAVPQNFTTVSMAHCNVAATSTASRD
AVVTTTTRSPASLQRSSNQDWKERFIRLQEFLKNNEQSGQEEYIRMLHSLSSVGRSKHAIELEK
RAVNLLIEEGKELQKMKSLNVLGKLPPADHPSVPTQPTFAMRLPFQPFPPRR
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 470 amino acids
Location between 263647703 and 263651862
Coverage of 28.72 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
WKNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRH
WHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMK
AKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQ
IAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYF
ELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
WKNAN
PSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 448 amino acids
Location between 263647703 and 263651862
Coverage of 26.79 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
WKNAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTIL
PEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHV
CTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFY
CHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMF
Y
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 26.85 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 26.85 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 26.91 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 24.71 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 403 amino acids
Location between 263647703 and 263651862
Coverage of 22.33 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHI
KVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQ
VAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQ
LSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELL
GGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 22.39 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 26.91 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 22.39 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 24.71 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYSQHWFENP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYSQHWFENP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 468 amino acids
Location between 263647703 and 263651862
Coverage of 28.85 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWK
NAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWH
IKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAK
QVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIA
QLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFEL
LGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 24.82 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
AN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 24.82 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
AN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 19.79 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 24.82 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
AN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 19.79 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 445 amino acids
Location between 263647703 and 263651862
Coverage of 19.78 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNP
SPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKN
ANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEG
TMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTS
SRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQ
PKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 401 amino acids
Location between 263647703 and 263651862
Coverage of 16.46 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW
GFDASSYALHW |
pattern: GFD[SA]SSYALHW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKV
KTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVA
ETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLS
NHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGG
ERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302857
Repeat occurs 3 times in a sequence of 328 amino acids
Location between 264211381 and 264215176
Coverage of 13.72 %
Instances:
NGFDASSYAEHWGRN | NGFDASSYAEHWGRN | NGFDASSYAEHWGRN |
pattern: NGFDASSYAEHWGRN
MGAKVLAIAFVLLLLTFEKNAFTMAADPGTRNDRAEPPPSQENGFDASSYAEHWGRNPSASEK
NGFDASSYAEHWGRN
PSASENGFDASSYAEHWGRNPSASGSLVPGAQTRHWHIKVKTGMLFLKK
ILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSLKAKQVAETLRSCVKP
IDREEPHVCTSSRKAMSRFATTSLGSNLTQAAFTRIHGHESPTTRYVVAQIAQLSNHTVACHPM
DFPYEVFYCHRPKEVQALRVQLNDMKNGMTRVTATVMCHMNTSNWDKEYFELLGGERGEPVCHY
MPQNYIMFY
Repeat found in LOC127302857
Repeat occurs 3 times in a sequence of 329 amino acids
Location between 264211381 and 264215176
Coverage of 13.68 %
Instances:
NGFDASSYAEHWGRN | NGFDASSYAEHWGRN | NGFDASSYAEHWGRN |
pattern: NGFDASSYAEHWGRN
MGAKVLAIAFVLLLLTFEKNAFTMAADPGTRNDRAEPPPSQENGFDASSYAEHWGRNPSASEK
NGFDASSYAEHWGRN
PSASEENGFDASSYAEHWGRNPSASGSLVPGAQTRHWHIKVKTGMLFLK
KILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSLKAKQVAETLRSCVK
PIDREEPHVCTSSRKAMSRFATTSLGSNLTQAAFTRIHGHESPTTRYVVAQIAQLSNHTVACHP
MDFPYEVFYCHRPKEVQALRVQLNDMKNGMTRVTATVMCHMNTSNWDKEYFELLGGERGEPVCH
YMPQNYIMFY

Similar gene clusters

NC_067248 - Cluster 37 - Cyclopeptide

Gene cluster description

NC_067248 - Gene Cluster 37. Type = cyclopeptide. Location: 262915456 - 265511102 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC127302833
Repeat occurs 9 times in a sequence of 499 amino acids
Location between 262920147 and 262921902
Coverage of 27.05 %
Instances:
PQNFTAANTSSLPQC | PQNFTAANNSSLPQR | PQNFTAVNASSPPQR | PQNFTASNTAGLPQR | PQNFTAANTASPPRR
PQNFTAANTPSPPQR | PQNFTTASTASPPQR | PQNFTAANTASPTHR | PQNFTTVSMAHCNVA |
pattern: PQNFT[TA][SVA][SN][TMAN][SAP][HGS][CPL][TPN][HQRV][CRA]
MPAGDNPRSISEKKAALRESPKQSKNAAVNQQARAPPFPKDKAAETVGIKRPQPNGPLSATNH
HVPGNPGANGHLVYVRRKVETDQSKGGATSGAESATSLSSKKPGICGPQEQSLKRQSSVPNTQS
APVSASPAASASIPALHSASLPANLSFGKQSPGKVSAQPSVAVTASPPQRNVVSTGMPQNFTAA
NTSSLPQC
NIVSTTMPQNFTAANNSSLPQRNVVSTAMPQNFTAVNASSPPQRNGVSTAMPQNFT
ASNTAGLPQR
SVVSTAMPQNFTAANTASPPRRNVVSTAMPQNFTAANTPSPPQRNVLSTAMPQN
FTTASTASPPQR
NVVSTAMPQNFTAANTASPTHRNVVSAAVPQNFTTVSMAHCNVAATSTASRD
AVVTTTTRSPASLQRSSNQDWKERFIRLQEFLKNNEQSGQEEYIRMLHSLSSVGRSKHAIELEK
RAVNLLIEEGKELQKMKSLNVLGKLPPADHPSVPTQPTFAMRLPFQPFPPRR
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 470 amino acids
Location between 263647703 and 263651862
Coverage of 28.72 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
WKNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRH
WHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMK
AKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQ
IAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYF
ELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
WKNAN
PSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 448 amino acids
Location between 263647703 and 263651862
Coverage of 26.79 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
WKNAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTIL
PEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHV
CTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFY
CHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMF
Y
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 26.85 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 26.85 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 26.91 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 24.71 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 403 amino acids
Location between 263647703 and 263651862
Coverage of 22.33 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHI
KVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQ
VAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQ
LSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELL
GGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 22.39 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 26.91 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 22.39 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 24.71 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYSQHWFENP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYSQHWFENP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 468 amino acids
Location between 263647703 and 263651862
Coverage of 28.85 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWK
NAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWH
IKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAK
QVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIA
QLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFEL
LGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 24.82 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
AN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 24.82 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
AN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 19.79 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 24.82 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
AN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 19.79 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 445 amino acids
Location between 263647703 and 263651862
Coverage of 19.78 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNP
SPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKN
ANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEG
TMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTS
SRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQ
PKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 401 amino acids
Location between 263647703 and 263651862
Coverage of 16.46 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW
GFDASSYALHW |
pattern: GFD[SA]SSYALHW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKV
KTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVA
ETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLS
NHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGG
ERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302857
Repeat occurs 3 times in a sequence of 328 amino acids
Location between 264211381 and 264215176
Coverage of 13.72 %
Instances:
NGFDASSYAEHWGRN | NGFDASSYAEHWGRN | NGFDASSYAEHWGRN |
pattern: NGFDASSYAEHWGRN
MGAKVLAIAFVLLLLTFEKNAFTMAADPGTRNDRAEPPPSQENGFDASSYAEHWGRNPSASEK
NGFDASSYAEHWGRN
PSASENGFDASSYAEHWGRNPSASGSLVPGAQTRHWHIKVKTGMLFLKK
ILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSLKAKQVAETLRSCVKP
IDREEPHVCTSSRKAMSRFATTSLGSNLTQAAFTRIHGHESPTTRYVVAQIAQLSNHTVACHPM
DFPYEVFYCHRPKEVQALRVQLNDMKNGMTRVTATVMCHMNTSNWDKEYFELLGGERGEPVCHY
MPQNYIMFY
Repeat found in LOC127302857
Repeat occurs 3 times in a sequence of 329 amino acids
Location between 264211381 and 264215176
Coverage of 13.68 %
Instances:
NGFDASSYAEHWGRN | NGFDASSYAEHWGRN | NGFDASSYAEHWGRN |
pattern: NGFDASSYAEHWGRN
MGAKVLAIAFVLLLLTFEKNAFTMAADPGTRNDRAEPPPSQENGFDASSYAEHWGRNPSASEK
NGFDASSYAEHWGRN
PSASEENGFDASSYAEHWGRNPSASGSLVPGAQTRHWHIKVKTGMLFLK
KILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSLKAKQVAETLRSCVK
PIDREEPHVCTSSRKAMSRFATTSLGSNLTQAAFTRIHGHESPTTRYVVAQIAQLSNHTVACHP
MDFPYEVFYCHRPKEVQALRVQLNDMKNGMTRVTATVMCHMNTSNWDKEYFELLGGERGEPVCH
YMPQNYIMFY

Similar gene clusters

NC_067248 - Cluster 38 - Cyclopeptide

Gene cluster description

NC_067248 - Gene Cluster 38. Type = cyclopeptide. Location: 263424352 - 265848777 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 470 amino acids
Location between 263647703 and 263651862
Coverage of 28.72 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
WKNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRH
WHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMK
AKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQ
IAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYF
ELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
WKNAN
PSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 448 amino acids
Location between 263647703 and 263651862
Coverage of 26.79 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALH
WKNAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTIL
PEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHV
CTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFY
CHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMF
Y
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 26.85 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHW
KNAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 26.85 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 26.91 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 24.71 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 403 amino acids
Location between 263647703 and 263651862
Coverage of 22.33 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHI
KVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQ
VAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQ
LSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELL
GGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 22.39 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 26.91 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NAN
PSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 402 amino acids
Location between 263647703 and 263651862
Coverage of 22.39 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIK
VKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQV
AETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQL
SNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLG
GERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 425 amino acids
Location between 263647703 and 263651862
Coverage of 24.71 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYSQHWFENP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHW
KSAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLE
PKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQA
ALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMAR
VTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYSQHWFENP
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 469 amino acids
Location between 263647703 and 263651862
Coverage of 28.78 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHW
KNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHW
HIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKA
KQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQI
AQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFE
LLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 9 times in a sequence of 468 amino acids
Location between 263647703 and 263651862
Coverage of 28.85 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP
GFDASSYSQHWFENP | GFDASSYALHWKNAN | GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSY[SA][QL]HW[GFK][ESKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWK
NAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWH
IKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAK
QVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIA
QLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFEL
LGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 24.82 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
AN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 24.76 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SAN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 24.82 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
AN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 380 amino acids
Location between 263647703 and 263651862
Coverage of 19.74 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGT
MFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSS
RKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQP
KEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 19.79 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 24.82 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKNAN
GFDASSYALHWKNAN | GFDASSYALHWKSAN |
pattern: GFD[SA]SSYALHW[GK][SKN][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
AN
PSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 5 times in a sequence of 379 amino acids
Location between 263647703 and 263651862
Coverage of 19.79 %
Instances:
GFDSSSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWGKNP | GFDASSYALHWKSAN

pattern: GFD[SA]SSYALHW[GK][SK][AN][PN]
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTM
FARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSR
KAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPK
EVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 445 amino acids
Location between 263647703 and 263651862
Coverage of 19.78 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNP
SPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKN
ANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEG
TMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTS
SRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQ
PKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 6 times in a sequence of 401 amino acids
Location between 263647703 and 263651862
Coverage of 16.46 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW
GFDASSYALHW |
pattern: GFD[SA]SSYALHW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWKNAN
PSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKV
KTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVA
ETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLS
NHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGG
ERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 447 amino acids
Location between 263647703 and 263651862
Coverage of 19.69 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNP
NPSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHW
KNANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILP
EGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVC
TSSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYC
HQPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 8 times in a sequence of 446 amino acids
Location between 263647703 and 263651862
Coverage of 19.73 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW
GFDASSYALHW | GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPN
PSPSEENGFDASSYSQHWFENPNRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWK
NANPSLSEENGFDASSYALHWKSANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPE
GTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCT
SSRKAMARFATTALGSNLTQAALTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCH
QPKEVQALRVQLNDMKNGMARVTATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 424 amino acids
Location between 263647703 and 263651862
Coverage of 18.16 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSEENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENP
NRSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWK
SANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEP
KYFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAA
LTKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARV
TATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302847
Repeat occurs 7 times in a sequence of 423 amino acids
Location between 263647703 and 263651862
Coverage of 18.2 %
Instances:
GFDSSSYALHW | GFDASSYALHW | GFDASSYALHW | GFDASSYSQHW | GFDASSYALHW
GFDASSYALHW | GFDASSYALHW |
pattern: GFD[SA]SSY[SA][QL]HW
MGTKVLTIAFVLLLLNFEKNAFTMAADPVTMNDRAELPASQKIGFDSSSYALHWGKNPNPSPS
EENGFDASSYALHWGKNPNPSPSENGFDASSYALHWGKNPNPSPSEENGFDASSYSQHWFENPN
RSPSEENGFDASSYALHWKNANPSPSEENGFDASSYALHWKNANPSLSEENGFDASSYALHWKS
ANPSSSASLVPEAQTRHWHIKVKTGMLFLKKILHVGTILPEGTMFARADMPKPDSSVSTPLEPK
YFATIVSRFKIPYNSMKAKQVAETLHSCVNPIDREEPHVCTSSRKAMARFATTALGSNLTQAAL
TKIHGHESLTTRYVVAQIAQLSNHTVACHPMDFPYEVFYCHQPKEVQALRVQLNDMKNGMARVT
ATVMCHMNTSNWDKEYFELLGGERGEPVCHYMPQNYIMFY
Repeat found in LOC127302857
Repeat occurs 3 times in a sequence of 328 amino acids
Location between 264211381 and 264215176
Coverage of 13.72 %
Instances:
NGFDASSYAEHWGRN | NGFDASSYAEHWGRN | NGFDASSYAEHWGRN |
pattern: NGFDASSYAEHWGRN
MGAKVLAIAFVLLLLTFEKNAFTMAADPGTRNDRAEPPPSQENGFDASSYAEHWGRNPSASEK
NGFDASSYAEHWGRN
PSASENGFDASSYAEHWGRNPSASGSLVPGAQTRHWHIKVKTGMLFLKK
ILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSLKAKQVAETLRSCVKP
IDREEPHVCTSSRKAMSRFATTSLGSNLTQAAFTRIHGHESPTTRYVVAQIAQLSNHTVACHPM
DFPYEVFYCHRPKEVQALRVQLNDMKNGMTRVTATVMCHMNTSNWDKEYFELLGGERGEPVCHY
MPQNYIMFY
Repeat found in LOC127302857
Repeat occurs 3 times in a sequence of 329 amino acids
Location between 264211381 and 264215176
Coverage of 13.68 %
Instances:
NGFDASSYAEHWGRN | NGFDASSYAEHWGRN | NGFDASSYAEHWGRN |
pattern: NGFDASSYAEHWGRN
MGAKVLAIAFVLLLLTFEKNAFTMAADPGTRNDRAEPPPSQENGFDASSYAEHWGRNPSASEK
NGFDASSYAEHWGRN
PSASEENGFDASSYAEHWGRNPSASGSLVPGAQTRHWHIKVKTGMLFLK
KILHVGTILPEGTMFARADMPKPDSSVSTPLEPKYFATIVSRFKIPYNSLKAKQVAETLRSCVK
PIDREEPHVCTSSRKAMSRFATTSLGSNLTQAAFTRIHGHESPTTRYVVAQIAQLSNHTVACHP
MDFPYEVFYCHRPKEVQALRVQLNDMKNGMTRVTATVMCHMNTSNWDKEYFELLGGERGEPVCH
YMPQNYIMFY

Similar gene clusters

NC_067248 - Cluster 39 - Fatty_acid-transporter_associated

Gene cluster description

NC_067248 - Gene Cluster 39. Type = fatty_acid-transporter_associated. Location: 267891381 - 268155016 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067249 - Cluster 40 - Saccharide

Gene cluster description

NC_067249 - Gene Cluster 40. Type = saccharide. Location: 34897272 - 35469624 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067249 - Cluster 41 - Saccharide

Gene cluster description

NC_067249 - Gene Cluster 41. Type = saccharide. Location: 48057006 - 48593106 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067249 - Cluster 42 - Saccharide

Gene cluster description

NC_067249 - Gene Cluster 42. Type = saccharide. Location: 67671443 - 67812054 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067249 - Cluster 43 - Saccharide

Gene cluster description

NC_067249 - Gene Cluster 43. Type = saccharide. Location: 191320459 - 191523890 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 44 - Polyketide-fatty_acid

Gene cluster description

NC_067250 - Gene Cluster 44. Type = polyketide-fatty_acid. Location: 20167198 - 20959519 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 45 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 45. Type = saccharide. Location: 38656596 - 38919374 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 46 - Cyclopeptide

Gene cluster description

NC_067250 - Gene Cluster 46. Type = cyclopeptide. Location: 54628721 - 57093588 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC127312752
Repeat occurs 13 times in a sequence of 511 amino acids
Location between 55857036 and 55865272
Coverage of 38.16 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSRENDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF
YPAKWNPSTSQEDDF | YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF | YPAKWNPSTSHEDDF | YPAKWIPSTSKEDDF
YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF | YPAKWNPSTSHSPIP |
pattern: YPAKW[SIN]PS[TI]S[HEKRQ][ESK][GPDN][ID][FP]
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDF
GVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEGDF
GVSAYPAKWNPSTSHEDDFDASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFG
ASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTV
LPEGTMFTRVGMPKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMC
FSSRGAMQRFSTKVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYC
HRPKEVQSFRVQLKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 10 times in a sequence of 454 amino acids
Location between 55857036 and 55865272
Coverage of 33.04 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSRENDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF
YPAKWNPSTSQEDDF | YPAKWNPSTSQEDDF | YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF | YPAKWNPSTSHSPIP

pattern: YPAKW[SIN]PS[TI]S[HEQR][ESK][GPDN][ID][FP]
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDF
GVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEDDF
GASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKK
SLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSD
KEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDF
PYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMP
QNYIMFY
Repeat found in LOC127312752
Repeat occurs 9 times in a sequence of 435 amino acids
Location between 55857036 and 55865272
Coverage of 31.03 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSRENDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF
YPAKWNPSTSQEDDF | YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF | YPAKWNPSTSHSPIP |
pattern: YPAKW[SIN]PS[TI]S[HEQR][ESK][GPDN][ID][FP]
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDF
GVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEGDF
GVSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGM
PKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFST
KVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQ
LKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 12 times in a sequence of 492 amino acids
Location between 55857036 and 55865272
Coverage of 36.59 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSRENDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF
YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF | YPAKWNPSTSHEDDF | YPAKWIPSTSKEDDF | YPAKWNPSTSQEDDF
YPAKWNPSTSQEGDF | YPAKWNPSTSHSPIP |
pattern: YPAKW[SIN]PS[TI]S[HEKRQ][ESK][GPDN][ID][FP]
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDF
GVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
SHEDDF
DASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFG
VSA
YPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSA
PLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLR
VRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNG
MPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 9 times in a sequence of 435 amino acids
Location between 55857036 and 55865272
Coverage of 31.03 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSRENDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF
YPAKWNPSTSQEDDF | YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF | YPAKWNPSTSHSPIP |
pattern: YPAKW[SIN]PS[TI]S[HEQR][ESK][GPDN][ID][FP]
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDF
GVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEGDF
GVSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGM
PKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFST
KVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQ
LKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 8 times in a sequence of 416 amino acids
Location between 55857036 and 55865272
Coverage of 28.85 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSRENDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF
YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF | YPAKWNPSTSHSPIP |
pattern: YPAKW[SIN]PS[TI]S[HEQR][ESK][GPDN][ID][FP]
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDF
GVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
SHSPIP
QAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILS
HFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHE
TPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHM
NTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 12 times in a sequence of 492 amino acids
Location between 55857036 and 55865272
Coverage of 36.59 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSRENDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF
YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF | YPAKWNPSTSHEDDF | YPAKWIPSTSKEDDF | YPAKWNPSTSQEDDF
YPAKWNPSTSQEGDF | YPAKWNPSTSHSPIP |
pattern: YPAKW[SIN]PS[TI]S[HEKRQ][ESK][GPDN][ID][FP]
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDF
GVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
SHEDDF
DASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFG
VSA
YPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSA
PLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLR
VRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNG
MPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 8 times in a sequence of 416 amino acids
Location between 55857036 and 55865272
Coverage of 28.85 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSRENDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF
YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF | YPAKWNPSTSHSPIP |
pattern: YPAKW[SIN]PS[TI]S[HEQR][ESK][GPDN][ID][FP]
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDF
GVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
SHSPIP
QAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILS
HFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHE
TPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHM
NTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 11 times in a sequence of 473 amino acids
Location between 55857036 and 55865272
Coverage of 34.88 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSRENDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF
YPAKWNPSTSQEGDF | YPAKWNPSTSHEDDF | YPAKWIPSTSKEDDF | YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF
YPAKWNPSTSHSPIP |
pattern: YPAKW[SIN]PS[TI]S[HEKRQ][ESK][GPDN][ID][FP]
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDF
GVSSYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHEDDFDASAYPAKWIPST
SKEDDF
GVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHSPIPQ
AQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILSHFKIPHN
SVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHETPSSRYT
VVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHMNTSDWDK
QYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 8 times in a sequence of 416 amino acids
Location between 55857036 and 55865272
Coverage of 28.85 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSRENDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF
YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF | YPAKWNPSTSHSPIP |
pattern: YPAKW[SIN]PS[TI]S[HEQR][ESK][GPDN][ID][FP]
The following known motifs were found:
VS[AI]Y was found 5 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSHKDDFDVSAYP
AKWIPSTSEEDDF
GVSSYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
SHSPIP
QAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILS
HFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHE
TPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHM
NTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 10 times in a sequence of 454 amino acids
Location between 55857036 and 55865272
Coverage of 33.04 %
Instances:
AYPAKWSPSTSQKDD | AYPAKWNPSISQEDD | AYPAKWNPSTSREND | AYPAKWNPSTSQEDD | AYPAKWNPSTSQEGD
AYPAKWNPSTSHEDD | AYPAKWIPSTSKEDD | AYPAKWNPSTSQEDD | AYPAKWNPSTSQEGD | AYPAKWNPSTSHSPI

pattern: AYPAKW[SIN]PS[TI]S[HQRK][ESK][GPDN][ID]
The following known motifs were found:
VS[AI]Y was found 6 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FGVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSRENDFGVGAYPAKWNPSTSQEDDFGASAYP
AKWNPSTSQEGD
FGVSAYPAKWNPSTSHEDDFDASAYPAKWIPSTSKEDDFGVSAYPAKWNPST
SQEDD
FGASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKK
SLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSD
KEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDF
PYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMP
QNYIMFY
Repeat found in LOC127312752
Repeat occurs 12 times in a sequence of 492 amino acids
Location between 55857036 and 55865272
Coverage of 36.59 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF | YPAKWNPSTSQEDDF
YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF | YPAKWNPSTSHEDDF | YPAKWIPSTSKEDDF | YPAKWNPSTSQEDDF
YPAKWNPSTSQEGDF | YPAKWNPSTSHSPIP |
pattern: YPAKW[SIN]PS[TI]S[HEQK][ESK][GPD][ID][FP]
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSHKDDFDVSAYPAKWIPSTSEEDDFGVSSYP
AKWNPSTSQEDDF
GASAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPST
SHEDDF
DASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFG
VSA
YPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSA
PLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLR
VRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNG
MPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 11 times in a sequence of 473 amino acids
Location between 55857036 and 55865272
Coverage of 34.88 %
Instances:
YPAKWSPSTSQKDDF | YPAKWNPSISQEDDF | YPAKWNPSTSHKDDF | YPAKWIPSTSEEDDF | YPAKWNPSTSQEDDF
YPAKWNPSTSQEGDF | YPAKWNPSTSHEDDF | YPAKWIPSTSKEDDF | YPAKWNPSTSQEDDF | YPAKWNPSTSQEGDF
YPAKWNPSTSHSPIP |
pattern: YPAKW[SIN]PS[TI]S[HEQK][ESK][GPD][ID][FP]
The following known motifs were found:
VS[AI]Y was found 7 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
F
GVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSHKDDFDVSAYPAKWIPSTSEEDDFGVSSYP
AKWNPSTSQEDDF
GASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHEDDFDASAYPAKWIPST
SKEDDF
GVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYPAKWNPSTSHSPIPQ
AQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGMPKSDSSAPLESKYFGIILSHFKIPHN
SVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFSTKVLGVLRVRQTVTRIYGHETPSSRYT
VVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQLKDKKNGMPLVTTTAMCHMNTSDWDK
QYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 9 times in a sequence of 435 amino acids
Location between 55857036 and 55865272
Coverage of 31.03 %
Instances:
SAYPAKWSPSTSQKD | SAYPAKWNPSISQED | SAYPAKWNPSTSQED | SAYPAKWNPSTSQEG | SAYPAKWNPSTSHED
SAYPAKWIPSTSKED | SAYPAKWNPSTSQED | SAYPAKWNPSTSQEG | SAYPAKWNPSTSHSP |
pattern: SAYPAKW[SIN]PS[TI]S[HQK][ESK][GPD]
The following known motifs were found:
VS[AI]Y was found 6 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FGVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYP
AKWNPSTSHED
DFDASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEG
DFGVSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGM
PKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFST
KVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQ
LKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY
Repeat found in LOC127312752
Repeat occurs 9 times in a sequence of 435 amino acids
Location between 55857036 and 55865272
Coverage of 31.03 %
Instances:
SAYPAKWSPSTSQKD | SAYPAKWNPSISQED | SAYPAKWNPSTSQED | SAYPAKWNPSTSQEG | SAYPAKWNPSTSHED
SAYPAKWIPSTSKED | SAYPAKWNPSTSQED | SAYPAKWNPSTSQEG | SAYPAKWNPSTSHSP |
pattern: SAYPAKW[SIN]PS[TI]S[HQK][ESK][GPD]
The following known motifs were found:
VS[AI]Y was found 6 times in this sequence
MEAKVVSIAFVLLLLTFGKNAKATTANPVTRNEGDESPTPHEDDFGVSAYPAKWSPSTSQKDD
FGVSAYPAKWNPSISQEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPSTSQEGDFGVSAYP
AKWNPSTSHED
DFDASAYPAKWIPSTSKEDDFGVSAYPAKWNPSTSQEDDFGASAYPAKWNPST
SQEG
DFGVSAYPAKWNPSTSHSPIPQAQMRHQQIKVQTGMLFLKKSLHVGTVLPEGTMFTRVGM
PKSDSSAPLESKYFGIILSHFKIPHNSVKAKQVADTIRSCDKPSDKEEPHMCFSSRGAMQRFST
KVLGVLRVRQTVTRIYGHETPSSRYTVVRITPLSTKMVPCHPMDFPYEVFYCHRPKEVQSFRVQ
LKDKKNGMPLVTTTAMCHMNTSDWDKQYFEMLGGVRGEPICHYMPQNYIMFY

Similar gene clusters

NC_067250 - Cluster 47 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 47. Type = saccharide. Location: 76382815 - 77002543 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 48 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 48. Type = saccharide. Location: 77973231 - 78335041 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 49 - Saccharide-polyketide

Gene cluster description

NC_067250 - Gene Cluster 49. Type = saccharide-polyketide. Location: 91971948 - 92394019 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_067250 - Cluster 50 - Alkaloid

Gene cluster description

NC_067250 - Gene Cluster 50. Type = alkaloid. Location: 104728069 - 106173362 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 51 - Alkaloid

Gene cluster description

NC_067250 - Gene Cluster 51. Type = alkaloid. Location: 156468058 - 156716456 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 52 - Cyclopeptide

Gene cluster description

NC_067250 - Gene Cluster 52. Type = cyclopeptide. Location: 177490576 - 178856029 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC127315896
Repeat occurs 3 times in a sequence of 261 amino acids
Location between 177594645 and 177595492
Coverage of 9.2 %
Instances:
ENSDEENS | ENSAAENS | ENSDEEDT |
pattern: ENS[AD][EA]E[DN][ST]
MSTLASVQAENSDEENSAAENSDEEDTGARPSTLKEFITLVVTDEEGRRLTRTMRTTDKLQDL
IDYYYAMVPIVKAGNGSFVTGMRSTSTSSRCSTSVLQDFRGRRFTRTMRRTDKMQGLMDFYYDM
VPAVAIAHDDGGAGGIFLHRGECIAGDQTPADLQLKHFDMIYFVPRRRGGVVEEKPGSFVTLYV
TDGQNKVYLTMRRTDRLKVLLDVYRDKVPTGWWGGRFTRYGKHLNHQSTPADHDMKDGDELDFF
SRIIYR

Similar gene clusters

NC_067250 - Cluster 53 - Cyclopeptide

Gene cluster description

NC_067250 - Gene Cluster 53. Type = cyclopeptide. Location: 185876124 - 186455769 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_067250 - Cluster 54 - Terpene

Gene cluster description

NC_067250 - Gene Cluster 54. Type = terpene. Location: 186299622 - 186564179 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 55 - Polyketide

Gene cluster description

NC_067250 - Gene Cluster 55. Type = polyketide. Location: 189769159 - 189924259 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 56 - Saccharide-terpene

Gene cluster description

NC_067250 - Gene Cluster 56. Type = saccharide-terpene. Location: 191172347 - 192056424 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 57 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 57. Type = saccharide. Location: 222545203 - 222614479 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 58 - Transporter_associated

Gene cluster description

NC_067250 - Gene Cluster 58. Type = transporter_associated. Location: 223800593 - 224217096 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/transporter_associated: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[MatE/LTP_2/ABC2_membrane/ABC_tran]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 59 - Saccharide-alkaloid

Gene cluster description

NC_067250 - Gene Cluster 59. Type = saccharide-alkaloid. Location: 229319276 - 229512076 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 60 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 60. Type = saccharide. Location: 230140995 - 230612042 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 61 - Fatty_acid

Gene cluster description

NC_067250 - Gene Cluster 61. Type = fatty_acid. Location: 236250140 - 236354924 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 62 - Saccharide-terpene

Gene cluster description

NC_067250 - Gene Cluster 62. Type = saccharide-terpene. Location: 244447655 - 244659362 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 63 - Terpene-cyclopeptide-saccharide-fatty_acid

Gene cluster description

NC_067250 - Gene Cluster 63. Type = terpene-cyclopeptide-saccharide-fatty_acid. Location: 248109376 - 251398981 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC127316289
Repeat occurs 3 times in a sequence of 266 amino acids
Location between 249001515 and 249002502
Coverage of 6.77 %
Instances:
ARRGRS | ARRSSN | ARRSSS |
pattern: ARR[GS][SR][SN]
MAGLAAGSVYVRTPSVREKVSGRRFRENPPVVFPTTRTVLAQTPPSTPTTPSSLLSSTAPSTR
RYSLSAPSPPPDVARDIAALSSARRGRSIILSRRGTVILPCPPWCDKNGDFPPDLSRWWRHGGD
QVAVVAGGDPIAVVLLLEARRSSNRVMWLDPPEVVVKMPICSGYVVVALVAARRSSSGAGGGTV
ERKRRLSAEQVVALEQSFDEEKRKLEPERKIELPRRLGMAPRKVAVWFQNGRARWKANQLAGRD
VLPADNGRLRS
Repeat found in LOC127311600
Repeat occurs 5 times in a sequence of 499 amino acids
Location between 249366285 and 249368727
Coverage of 7.01 %
Instances:
VSYASEG | VSYGSKL | VSYGSEG | VSYGSDG | VSYGGQE

pattern: VSY[GA][GS][EQKD][EGL]
MDVLVLLISLILVAGRGSTSHANSETTMDYWQTLLPRTTMPPAILDLLNHHAGSSNVKSIKNS
KNVYAASQAGDDFKKIFVSYASEGNIKDLKNDMILEGKQDSKKVYAEDLKKILAARPRSEENLK
EISVSYGSKLEAGVKEIFVSYGSEGGQDSKKASVVDLEKILAARPRNEENLNGFFATYGSKLEE
DLKEISVSYGSDGEEDLKNISASYGIKGEEGVKEISVSYGGQEDEAGLNKISASEGEKAGGGHV
HVHTHGHRSRRIADVFFFHDMLRPGTVMTPTIPPTSSLPALLPRHVANSIPFSAKRFPDIVAMF
APASLAIAGEMRWTLDTCEHPRPLPGQKAGCATSIESLAELTASLLGTRDVLAFSADMPLEVAD
TSALRARHNVTAARRLSTSLEVVTCHDLTYPYAVFYCHTSNPTAAYMVTLESQDGAAPAMEALA
VCHLDTSQWSPENPFFKLHSVSPGEVAVCHFLSKLSTIWVRAGELGDLRAAE
Repeat found in LOC127311600
Repeat occurs 5 times in a sequence of 501 amino acids
Location between 249366285 and 249368727
Coverage of 6.99 %
Instances:
VSYASEG | VSYGSKL | VSYGSEG | VSYGSDG | VSYGGQE

pattern: VSY[GA][GS][EQKD][EGL]
MDVLVLLISLILVAGRGSTSHANSETTMDYWQTLLPRTTMPPAILDLLNHHAGSSNVKSIKNS
KNVYAASQAGDDFKKIFVSYASEGNIKDLKNDMILEGKQDSKKVYAEDLKKILAARPRSEENLK
EISVSYGSKLEAGVKEIFVSYGSEGGQDSKKASVVDLEKILAARPRNEENLNGFFATYGSKLEE
DLKEISVSYGSDGEEDLKNISASYGIKGEEGVKEISVSYGGQEDEAGLNKISASEGEKVTAGGG
HVHVHTHGHRSRRIADVFFFHDMLRPGTVMTPTIPPTSSLPALLPRHVANSIPFSAKRFPDIVA
MFAPASLAIAGEMRWTLDTCEHPRPLPGQKAGCATSIESLAELTASLLGTRDVLAFSADMPLEV
ADTSALRARHNVTAARRLSTSLEVVTCHDLTYPYAVFYCHTSNPTAAYMVTLESQDGAAPAMEA
LAVCHLDTSQWSPENPFFKLHSVSPGEVAVCHFLSKLSTIWVRAGELGDLRAAE

Similar gene clusters

NC_067250 - Cluster 64 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 64. Type = saccharide. Location: 251263830 - 251593892 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_067250 - Cluster 65 - Saccharide

Gene cluster description

NC_067250 - Gene Cluster 65. Type = saccharide. Location: 254016837 - 254432994 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026138763 - Cluster 66 - Cyclopeptide

Gene cluster description

NW_026138763 - Gene Cluster 66. Type = cyclopeptide. Location: 1 - 667601 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC127318784
Repeat occurs 3 times in a sequence of 388 amino acids
Location between 542740 and 544292
Coverage of 6.19 %
Instances:
TGPTFACA | TGPVFASA | TGPTRPAS |
pattern: TGP[TV][FR][AP][CSA][SA]
MAFSGGFMEKAKPYFAMIALQFGYAGMNVLTKVSLNGGMSHYVLVVYRHAFATVAIAPFALIL
ERKVRPKMTWSIFFQIFVLALLGPVIDQNFYYVGLKYTGPTFACAMSNILPAMTFVMAVIFRME
KIELKKVRCQAKIFGTVVTVAGAMLMTLYKGPLMHLAWTRHSQASGGGEAPAVVDPTGKEWFLG
SLFVIIATLAWASLFILQTHTIKQYTAQLSLTTLICFVGTIQAVVVTFVMERRPSVWTIGFDMN
LLAAAYAGIVTSSIAYYVQGLVIKKTGPVFASAFSPLMMIIVAVMGSFILSEKIYLGAVLGAVV
IVVGLYAVLWGKHKETQEKEATPLPVATNGHGATNTEMIGAAGARNDRDATGPTRPASNGRGGG
SASAV

Similar gene clusters

NW_026138795 - Cluster 67 - Putative

Gene cluster description

NW_026138795 - Gene Cluster 67. Type = putative. Location: 210545 - 464194 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026138842 - Cluster 68 - Phenolamide

Gene cluster description

NW_026138842 - Gene Cluster 68. Type = phenolamide. Location: 477513 - 773833 nt. Click on genes for more information.
Show pHMM detection rules used
plants/phenolamide: (minimum(2,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Pyridoxal_deC]) or minimum(2,[MatE,Orn_Arg_deC_N,YjeF_N,Putative_PNPOx,PNP_phzG_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,Orn_DAP_Arg_deC,Orn_Arg_deC_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026140057 - Cluster 69 - Putative

Gene cluster description

NW_026140057 - Gene Cluster 69. Type = putative. Location: 304968 - 813233 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NW_026140398 - Cluster 70 - Terpene

Gene cluster description

NW_026140398 - Gene Cluster 70. Type = terpene. Location: 2264369 - 2926993 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters