Identified secondary metabolite clusters

Cluster Type From To Size (kb) Core domains Product/substrate predicted by subgroup Most similar known cluster MIBiG BGC-ID
The following clusters are from record NC_037254.1:
Cluster 1Cyclopeptide26174653274795657.33BURP---
Cluster 2Cyclopeptide1133967711770629430.95BURP---
Cluster 3Fatty_acid-Saccharide1956606619693493127.43CER1-like_C, FA_hydroxylase, Glycos_transf_1, Transferase---
Cluster 4Polyketide2025156620527628276.06Abhydrolase_3, Chal_sti_synt_C, Chal_sti_synt_N, Lyase_aromatic, p450---
Cluster 5Polyketide2133525521508693173.44Chal_sti_synt_C, Chal_sti_synt_N, Lyase_aromatic, TPMT---
Cluster 6Saccharide2274985023086259336.41ABC2_membrane, ABC_tran, Aminotran_1_2, Epimerase, Glyco_hydro_1---
The following clusters are from record NC_037255.1:
Cluster 7Cyclopeptide1338837013864606476.24BURP---
The following clusters are from record NC_037256.1:
Cluster 8Alkaloid1214763112292173144.54ECH_2, His_biosynth, Pyridoxal_deC, Trp_syntA-benzoxazinone DIMBOA (17% of genes show similarity)BGC0000810.3_c1
Cluster 9Cyclopeptide13717291151906111473.32BURP---
The following clusters are from record NC_037261.1:
Cluster 10Saccharide83871491621977.50ADH_N, ADH_zinc_N, Glyco_transf_28, Methyltransf_11, UDPGT*saccharide-2--
Cluster 11Saccharide47610714941364180.29Aldo_ket_red, Amino_oxidase, Glycos_transf_1---
The following clusters are from record NC_037263.1:
Cluster 12Polyketide1706868177800071.13Acetyltransf_1, Chal_sti_synt_C, Chal_sti_synt_N, Epimerase---
The following clusters are from record NC_037264.1:
Cluster 13Cyclopeptide42602434977554717.31BURP---
The following clusters are from record NC_037265.1:
Cluster 14Cyclopeptide1121943112058320838.89BURP---
Cluster 15Saccharide150306901508580155.11ADH_N, ADH_zinc_N, Abhydrolase_3, Glyco_hydro_1---
The following clusters are from record NC_037266.1:
Cluster 16Terpene25299012639729109.83Aminotran_1_2, Aminotran_3, Prenyltrans, SQS_PSY---
The following clusters are from record NC_037268.1:
Cluster 17Cyclopeptide91816409775683594.04BURP---
Cluster 18Cyclopeptide1144772712317548869.82BURP---
The following clusters are from record NC_037269.1:
Cluster 19Cyclopeptide47612375210321449.08BURP---
The following clusters are from record NC_037270.1:
Cluster 20Fatty_acid-Saccharide1261840913000103381.692OG-FeII_Oxy, Acetyltransf_1, CER1-like_C, FA_hydroxylase, Glycos_transf_1---
Cluster 21Terpene1422030314401124180.82Abhydrolase_3, SQHop_cyclase_C, SQHop_cyclase_N, adh_short_C2---
The following clusters are from record NC_037271.1:
Cluster 22Fatty_acid109561121100039844.29Amino_oxidase, FA_hydroxylase---
The following clusters are from record NC_037273.1:
Cluster 23Polyketide49187065029736111.03Chal_sti_synt_C, Chal_sti_synt_N, Lyase_aromatic, adh_short_C2---
Cluster 24Fatty_acid-Saccharide104855801054726561.69FA_hydroxylase, Glycos_transf_2, adh_short---

NC_037254 - Cluster 1 - Cyclopeptide

Gene cluster description

NC_037254 - Gene Cluster 1. Type = cyclopeptide. Location: 2617465 - 3274795 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112278897
Repeat occurs 5 times in a sequence of 659 amino acids
Location between 3034798 and 3036987
Coverage of 7.59 %
Instances:
EPEADPEPEA | EPEAEAEPEP | EPEPEPEPVV | EPEPEPVVEP | EPEPVVEPVP

pattern: EPE[PA][EDV][PVA][EV][PV][EV][PVA]
MEIQDSVAAACHADRQRLLEVFRVAIAKPLPASQTAVATMVEAVKPQKQTVLVQDENQLLENG
LRTLLEELVGAAVANGKSVMQYGRAEGPEDDGNGLLTRLLDIVLYLCEKGNVEGGMIFQLLEDL
TEVSTIKDCKEVFGYIEGKQETLGKSELFSRGKLVMLRTCNQLLRRLSKTNDVVFCGRILMFLA
HFFPLSERSAVNIKGVFNTSNETKYEKEPPGDSPAIDFTFYKTFWSLQEYFSNPTTLTHPSKWQ
TFSSNLGIVLETFEMQPIGEEDESLSLDGDDDSTFNIKYLTSSNLMSLELKDPGFRRHVLVQCL
IIFDYMKTPGKAEKDGPREGVREELKVHEDRVKKLLRTTPPKGREFLASIEHILERDKNWVWWK
RDGCIPFERFVQEKKIPSDPPKRRPRWRLGNRELSRLWKWSDDNPNAITHAKRVKMPPLAEYFK
PLAEDMDPEAGIEEEYHHKNNKVYCWKGLRFAARQDLDALSRYSDMKIMGVVPPDLLTPEVRSS
LYKPTDRPKAKGVKKDEEYTPNLTQTEDASGSAPTSVSLMELDATTLARDPEEATAMDADGPAV
ADVEPEADPEPEAEAEPEPEPEPVVEPVPLTDLSDAKPETVQTPVEAEPSSSKVVSEPPPVGNQ
SPSPLKSEARSGKKRAHPDD

Similar gene clusters

NC_037254 - Cluster 2 - Cyclopeptide

Gene cluster description

NC_037254 - Gene Cluster 2. Type = cyclopeptide. Location: 11339677 - 11770629 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112276432
Repeat occurs 4 times in a sequence of 358 amino acids
Location between 11707848 and 11710755
Coverage of 8.94 %
Instances:
AAAASPPK | AAASPPKP | AAAGGAKK | AAAGGAKK |
pattern: AAA[GSA][GPS][PA][KP][KP]
MACTGHCLGAAMATVSRLPHFVPCVNAQKCHGSAGGHLVVSLQQVAWVPSTFGGKGIAGLGNV
LKFRAGESSTYGRMASQRMIVMASQEAGTSTLVVQEVEFIWKGPGSKVLLSGDFVNWESQVPLE
KSGSEGWFVVKQNIAPGTYKYKFIVDGEWQHSPDYPTVSDNSGGFNNEITVSEEKSTPKPTRKE
DNAGSSKGEVKGSTEEVAKVDTPKPAAAASPPKPKVAAAGGAKKAAAGGAKKAVKAVAKPLPET
MLEDVIPRLTAYFEKEQGVSAVDIQFDDNQLQGTFIKNEVQYNFWAYFPDGKLEGSRGFSLTSH
GCSPSTVEPFLIDESKITGESVVQQVIKRLFAQKLLATN
Repeat found in LOC112276432
Repeat occurs 4 times in a sequence of 267 amino acids
Location between 11707848 and 11709914
Coverage of 11.99 %
Instances:
AAAASPPK | AAASPPKP | AAAGGAKK | AAAGGAKK |
pattern: AAA[GSA][GPS][PA][KP][KP]
MGAVVVQEVEFIWKGPGSKVLLSGDFVNWESQVPLEKSGSEGWFVVKQNIAPGTYKYKFIVDG
EWQHSPDYPTVSDNSGGFNNEITVSEEKSTPKPTRKEDNAGSSKGEVKGSTEEVAKVDTPKPAA
AASPPK
PKVAAAGGAKKAAAGGAKKAVKAVAKPLPETMLEDVIPRLTAYFEKEQGVSAVDIQFD
DNQLQGTFIKNEVQYNFWAYFPDGKLEGSRGFSLTSHGCSPSTVEPFLIDESKITGESVVQQVI
KRLFAQKLLATN

Similar gene clusters

NC_037254 - Cluster 3 - Fatty_acid-saccharide

Gene cluster description

NC_037254 - Gene Cluster 3. Type = fatty_acid-saccharide. Location: 19566066 - 19693493 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

No significant ClusterBlast hits found.

NC_037254 - Cluster 4 - Polyketide

Gene cluster description

NC_037254 - Gene Cluster 4. Type = polyketide. Location: 20251566 - 20527628 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037254 - Cluster 5 - Polyketide

Gene cluster description

NC_037254 - Gene Cluster 5. Type = polyketide. Location: 21335255 - 21508693 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037254 - Cluster 6 - Saccharide

Gene cluster description

NC_037254 - Gene Cluster 6. Type = saccharide. Location: 22749850 - 23086259 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

No significant ClusterBlast hits found.

NC_037255 - Cluster 7 - Cyclopeptide

Gene cluster description

NC_037255 - Gene Cluster 7. Type = cyclopeptide. Location: 13388370 - 13864606 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112279812
Repeat occurs 3 times in a sequence of 290 amino acids
Location between 13610127 and 13611177
Coverage of 6.21 %
Instances:
EKSKDR | EKSGRS | EKSGRK |
pattern: EKS[GK][RD][KRS]
MGGHGGLNILPQKRWNVYNFDNREKVKRDEEKAAREEAIERQQARQREAELRLEKLREVAQAK
RNGSCEFRVADGEGTAEQPLLGNEQENEVAALAEAEGAKHFNLFEIAGQPEQGTAKAKGNNDWI
NDLKVGTRGGLDKQDIRELKKMEKSKDRAPEDEGYEFGCGLVGKSGKKPWYATKRFMVERTSDG
GVGSRISRSRSPEERPSKRSRRSEEKSGRSENRAKKEKREKSGRKSMEELRAERVRREQQERDR
AQKLLLANRKVNGDGVYGINRSGRPYYHQSFGNAR

Similar gene clusters

NC_037256 - Cluster 8 - Alkaloid

Gene cluster description

NC_037256 - Gene Cluster 8. Type = alkaloid. Location: 12147631 - 12292173 nt. Click on genes for more information.
Show pHMM detection rules used
plants/alkaloid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Bet_v_1/Cu_amine_oxid/Str_synth/BBE/Orn_DAP_Arg_deC/Pyridoxal_deC]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

Similar known gene clusters

NC_037256 - Cluster 9 - Cyclopeptide

Gene cluster description

NC_037256 - Gene Cluster 9. Type = cyclopeptide. Location: 13717291 - 15190611 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


The following known motifs were found in CDS LOC112281496
Location between 13759990 and 13777142
.L.Y..Y was found 2 times in this sequence
Sequence:
MVSYRRLLAAVEAALLTPNPAPHHRADLSHALHICVPDFQDFLKYPGPKAEDRAQVASREVRL
PNSAPTILDNQDAQIALKLSEDYNLNEIYCVGLLVSAHQEWNSLGREPVEILRLSAGLWFTERR
ALITSLQLLLRAVVLDDELDPDLVADIRSYIERLLQGGLRARLIHLIKELNREESAGLGGPGVE
PYVMDSRGAVVQRRNVIQKERLSLCHCLVLTCLIVRINAQEAKDLYDLLKDSASDESISQDAVK
LQITYTIMFALLDSLISDALGGSQEMGSVLALDAGFRKEFQQMLMDAGELNVTAEGFTGVIRFV
WAVYLMLTKGALDYSPTGSFSEDDTYSSLCLNRACEHDVFEFFTTRVLQTATFQNDDEELVFMY
NAYLHKLMTGFLAQPAGREKIKAMKDAAMVAVDTYVDMKDTMSDVDGKAQQQAIVLQAKPFISL
LTMIGEVYQREPELIMDNDDLWNFVRFAGEDHNTYHTLVAFLNMLTALAASEEGAKKIYQMLQN
KAIRTLGWQTLFNSLIVYDQRFRQCLQNAGAFLPPFQEGDARALEAYLKVLRRVMEKGNEMERS
QWFMDIEPLFKLLSYENVPPYLKGALRNAIATFVPVSPVMKNKVWSFLEQYDLPVVATPLLSDG
SAQQVSSQVYDMTFELNEVEARQEEYPSTLSYLKLLNVLIENESDGPDKGGRYIGIFRFVRDQV
FAPYAQRAYANPVEKWELVATALRHFELMLSTYQLTEDAVRNSSDHLLPPENSLPGMAAPGLPA
AITRLPVTELMKDLMSGKVIYRNIMSILMVGVNSVMEQRTSQLYGPALEEAISLCLQILFLALS
KDTLFSEAWRPVYQPIDNILSHDIRQIVTVLEYVRYDMSPLIQRCSVQIMKVLSARMPQLVSII
LEAGAASSLIEDYAACLETRAEEPQAPENPDEDIGSLILRLLLANLDQPAPNVTHLLLKFDVNQ
LVERTMLQPKRHFSCLRVILDVLDTLARPEVNAGLHELGFQLMYELCVDPITCGPVVELLRSEK
YEFFSKHLNTFVCEPLPKRSTNQQLRVSSLQQRAWLLKLVALELHLGDMDVVVHRDSCRRLLSR
LFLREPQSWETGIPSNLMPARLTLTTTDNSIHKTKVLELLEILQFQLPESPSDFPPELHGLKEE
LKVDDILGSPATVDQGGVYHISERGDRLIDLSAFRDLLWQEYKRLEGQYNFLVNGQKQSELREA
VQQLLRWAWKRNKNLEEQAAQLHMLVGWSQLVEITISRRFHFLGSRTHVLFEILDASISATTSQ
DCSLRMAFLLSQVALTTMAKLQEQSIISPGEGDSTDDVTYVDVLSSVRLSNSACHTILSKLIAS
ILRHESSESLRRRQYAILLSYFHYCQGMVNRDLPLSVMRTLLVGAGDEDMEVEKLDRDQAELAQ
VNFSLLKRNAAPLVDVVARDATNGSETGKAMAYYVLDALLAVDHHQVFLSQLQSRGLLHSCLAE
ISSNSYQAILLPSAESLRRLYTLESELALLLRVGFHNRKRGAQTLYAMGALRHLSSCRAIDAHL
TDDAKWEQVKVGVGMPNQHDRQHQLVSPVLRLVLCFTSLIDSTEVVDGDRNEVALEVLDFIKSH
HGLLDRILRDDNPNVHIADLDELQLATAILSKVWPVEESSEFGYTQAMFNLAYVYFSLDAESRN
RFIHHIQEAEKINESSGIARESLRKMELQVARVRCNLIAYLYALVTKHNLRLHIYKPDIGDGAT
MGPYNLGRQRQPTLKLVADLLQQTSLDLELALEEKALLLARLQDVNELSRHEVDDIIKAYGRQE
RSDPADSIRKRRYVAMVEMCSAAGSRECQVSSLIFLVEHALEILYVHLEKLTKPQNRQGRFELN
LETNWGRKEDVQLLAGKLLPVFDRLECLNEDRVGRSMKHLQRLVHSLKSRIMMLSSETY

Similar gene clusters

NC_037261 - Cluster 10 - Saccharide

Gene cluster description

NC_037261 - Gene Cluster 10. Type = saccharide. Location: 838714 - 916219 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037261 - Cluster 11 - Saccharide

Gene cluster description

NC_037261 - Gene Cluster 11. Type = saccharide. Location: 4761071 - 4941364 nt. Click on genes for more information.
Show pHMM detection rules used
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037263 - Cluster 12 - Polyketide

Gene cluster description

NC_037263 - Gene Cluster 12. Type = polyketide. Location: 1706868 - 1778000 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037264 - Cluster 13 - Cyclopeptide

Gene cluster description

NC_037264 - Gene Cluster 13. Type = cyclopeptide. Location: 4260243 - 4977554 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output

Similar gene clusters

NC_037265 - Cluster 14 - Cyclopeptide

Gene cluster description

NC_037265 - Gene Cluster 14. Type = cyclopeptide. Location: 11219431 - 12058320 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112290436
Repeat occurs 9 times in a sequence of 732 amino acids
Location between 11390519 and 11392718
Coverage of 8.61 %
Instances:
EDEEQDQ | EDELEQH | EDEHEKQ | EDEHEKQ | EDEHEKQ
EDEHEKQ | EDEHEKQ | EDEHEKQ | EDELEQH |
pattern: EDE[EHL][EQ][KDQ][HQ]
MYSLLSRKYSKQHHMRKPRCKFMHVCCSVTVLLILITVMCRPMPSAPSTPESIASMAVGESLQ
LNTQSIPAVEVAQETKTEADQSKLLENEMKEQVQGDSEATRSERREVEQEEGALRSGQEDEEQD
Q
HAEDKHEQRAENRHAQQAEDELEQHLEDVHGRHAEVKNKQRVEDEHEKQVEDEHEKQVEDEHE
KQ
VEDEHEKQMEDEHEKQMEDEHEKQVDDEHEQRVEDGQGQQAEDELEQHLEDVQDQHAEVKHQ
QRVDDEHEQQVKLDSRSAEHKQAGFPEEHQEGSKPKIKYSKELGAEHGHGIFQSKQSEQEQKVE
PEPQLKKHGREVTISCDGRRVYTYDLPPSMNIDILKNCSGKLVPWLNFCAHHQNYGFGIAVNTT
NNNFRKDWYGTDAYMLEVIFYERMRTYTCRTSNPGEADLFFIPFFSGLEALPYLYTDGKRRLQQ
GRELVEWLEANATQTWRRHGGHDHFLIAGRTAWDFCRPLTAVTWWGTSLFSNPEMENTTAMLLE
RRSWRGDEMAVPYPVGFHPSTSASLQSWIKLVRSSTRKYLFSFSGALRPQLVFSIREILSQQCT
QAGSACSRLDCGKIKCSHEPQPIYTSLLQAKFCLQPRGDTATRRSVIDSIVSGCIPVFFHKDTA
FTQYRWHLPNDYDNFSVFIDEEDIKNGKADVKKILEGYSAKQVEQMRERLIGIIPNVLYRHPKS
KDLSESMRDAFDLTIEGMAQKAIQFKSST
Repeat found in LOC112290116
Repeat occurs 3 times in a sequence of 529 amino acids
Location between 12008658 and 12012193
Coverage of 3.4 %
Instances:
KGGLSI | KGGIRM | KGGIKI |
pattern: KGG[IL][KRS][IM]
MAMFMLKGGLSIVAPSARLCAVAGSSSRFSTAVEVSATTTTAAKTIAVPTGHGTAVEQGDPTF
LESVDIYFDKAATIASATPDILAHIKAYNNILKVQFPLKCANGTVELIEGYRAQHSHHRLPVKG
GIRM
APNVDAEETMALAALMTFKCAVVDVPFGGAKGGIKIDPTKYSANEKESIIRRYTSELVRK
SFIGPSIDVPAPDYGTGPQEMAWIKDTYEHLKPNDINGAACVTGKPLEEGGIDGRQEATGLGVF
FCLRDFLDDEALVDKLGLTTGIKGKTFIVQGFGNVGRHTIDYIYDAGGLITAIAEVDGGLVAEC
ENGIDIPALKAYHKKMGTITSFPGAKTVKFAPGILELPCDVLIPAALETQIHSGNAGNVKAKIV
AEAANGPVTPLAETILEGKGVVILPDLLLNAGGVTVSYFEWLKNLNHIHFGRMSRRMEEKGQRM
LINALEAELGNGHKISHEVRSQVVKGNREVDFVWSGLEETMLLSWENVKRISQERNCNYRTAAY
LIAIERIATSYRVSGIFP
Repeat found in LOC112290116
Repeat occurs 3 times in a sequence of 529 amino acids
Location between 12008658 and 12012193
Coverage of 3.4 %
Instances:
KGGLSI | KGGIRM | KGGIKI |
pattern: KGG[IL][KRS][IM]
MAMFMLKGGLSIVAPSARLCAVAGSSSRFSTAVEVSATTTTAAKTIAVPTGHGTAVEQGDPTF
LESVDIYFDKAATIASATPDILAHIKAYNNILKVQFPLKCANGTVELIEGYRAQHSHHRLPVKG
GIRM
APNVDAEETMALAALMTFKCAVVDVPFGGAKGGIKIDPTKYSANEKESIIRRYTSELVRK
SFIGPSIDVPAPDYGTGPQEMAWIKDTYEHLKPNDINGAACVTGKPLEEGGIDGRQEATGLGVF
FCLRDFLDDEALVDKLGLTTGIKGKTFIVQGFGNVGRHTIDYIYDAGGLITAIAEVDGGLVAEC
ENGIDIPALKAYHKKMGTITSFPGAKTVKFAPGILELPCDVLIPAALETQIHSGNAGNVKAKIV
AEAANGPVTPLAETILEGKGVVILPDLLLNAGGVTVSYFEWLKNLNHIHFGRMSRRMEEKGQRM
LINALEAELGNGHKISHEVRSQVVKGNREVDFVWSGLEETMLLSWENVKRISQERNCNYRTAAY
LIAIERIATSYRVSGIFP
Repeat found in LOC112290116
Repeat occurs 3 times in a sequence of 529 amino acids
Location between 12008658 and 12012193
Coverage of 3.4 %
Instances:
KGGLSI | KGGIRM | KGGIKI |
pattern: KGG[IL][KRS][IM]
MAMFMLKGGLSIVAPSARLCAVAGSSSRFSTAVEVSATTTTAAKTIAVPTGHGTAVEQGDPTF
LESVDIYFDKAATIASATPDILAHIKAYNNILKVQFPLKCANGTVELIEGYRAQHSHHRLPVKG
GIRM
APNVDAEETMALAALMTFKCAVVDVPFGGAKGGIKIDPTKYSANEKESIIRRYTSELVRK
SFIGPSIDVPAPDYGTGPQEMAWIKDTYEHLKPNDINGAACVTGKPLEEGGIDGRQEATGLGVF
FCLRDFLDDEALVDKLGLTTGIKGKTFIVQGFGNVGRHTIDYIYDAGGLITAIAEVDGGLVAEC
ENGIDIPALKAYHKKMGTITSFPGAKTVKFAPGILELPCDVLIPAALETQIHSGNAGNVKAKIV
AEAANGPVTPLAETILEGKGVVILPDLLLNAGGVTVSYFEWLKNLNHIHFGRMSRRMEEKGQRM
LINALEAELGNGHKISHEVRSQVVKGNREVDFVWSGLEETMLLSWENVKRISQERNCNYRTAAY
LIAIERIATSYRVSGIFP
Repeat found in LOC112290116
Repeat occurs 3 times in a sequence of 529 amino acids
Location between 12008658 and 12012193
Coverage of 3.4 %
Instances:
KGGLSI | KGGIRM | KGGIKI |
pattern: KGG[IL][KRS][IM]
MAMFMLKGGLSIVAPSARLCAVAGSSSRFSTAVEVSATTTTAAKTIAVPTGHGTAVEQGDPTF
LESVDIYFDKAATIASATPDILAHIKAYNNILKVQFPLKCANGTVELIEGYRAQHSHHRLPVKG
GIRM
APNVDAEETMALAALMTFKCAVVDVPFGGAKGGIKIDPTKYSANEKESIIRRYTSELVRK
SFIGPSIDVPAPDYGTGPQEMAWIKDTYEHLKPNDINGAACVTGKPLEEGGIDGRQEATGLGVF
FCLRDFLDDEALVDKLGLTTGIKGKTFIVQGFGNVGRHTIDYIYDAGGLITAIAEVDGGLVAEC
ENGIDIPALKAYHKKMGTITSFPGAKTVKFAPGILELPCDVLIPAALETQIHSGNAGNVKAKIV
AEAANGPVTPLAETILEGKGVVILPDLLLNAGGVTVSYFEWLKNLNHIHFGRMSRRMEEKGQRM
LINALEAELGNGHKISHEVRSQVVKGNREVDFVWSGLEETMLLSWENVKRISQERNCNYRTAAY
LIAIERIATSYRVSGIFP
Repeat found in LOC112290116
Repeat occurs 3 times in a sequence of 516 amino acids
Location between 12008658 and 12012193
Coverage of 3.49 %
Instances:
KGGLSI | KGGIRM | KGGIKI |
pattern: KGG[IL][KRS][IM]
MAMFMLKGGLSIVAPSARLCAVAGSSSRFSTAVEVSATTTTAAKTIAVPTGHGTAVEQGDPTF
LESVDIYFDKAATIASATPDILAHIKCANGTVELIEGYRAQHSHHRLPVKGGIRMAPNVDAEET
MALAALMTFKCAVVDVPFGGAKGGIKIDPTKYSANEKESIIRRYTSELVRKSFIGPSIDVPAPD
YGTGPQEMAWIKDTYEHLKPNDINGAACVTGKPLEEGGIDGRQEATGLGVFFCLRDFLDDEALV
DKLGLTTGIKGKTFIVQGFGNVGRHTIDYIYDAGGLITAIAEVDGGLVAECENGIDIPALKAYH
KKMGTITSFPGAKTVKFAPGILELPCDVLIPAALETQIHSGNAGNVKAKIVAEAANGPVTPLAE
TILEGKGVVILPDLLLNAGGVTVSYFEWLKNLNHIHFGRMSRRMEEKGQRMLINALEAELGNGH
KISHEVRSQVVKGNREVDFVWSGLEETMLLSWENVKRISQERNCNYRTAAYLIAIERIATSYRV
SGIFP

Similar gene clusters

NC_037265 - Cluster 15 - Saccharide

Gene cluster description

NC_037265 - Gene Cluster 15. Type = saccharide. Location: 15030690 - 15085801 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037266 - Cluster 16 - Terpene

Gene cluster description

NC_037266 - Gene Cluster 16. Type = terpene. Location: 2529901 - 2639729 nt. Click on genes for more information.
Show pHMM detection rules used
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

No significant ClusterBlast hits found.

NC_037268 - Cluster 17 - Cyclopeptide

Gene cluster description

NC_037268 - Gene Cluster 17. Type = cyclopeptide. Location: 9181640 - 9775683 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112293622
Repeat occurs 4 times in a sequence of 257 amino acids
Location between 9186293 and 9187321
Coverage of 9.34 %
Instances:
GCETNV | GCEKDV | GCEKDV | GCEKDV |
pattern: GCE[KT][ND]V
MAISRQSRHAGLLVAATLAMIFIGSADAWCKEGTKDCDYNHKNGCETNVLYDVKNCGDCGNKC
ACGPYAVPKCNKGRCEFSCKAGWANCNNDWKDGCEKDVGKDVFNCGGCGKKCECGPNAIAKCNG
GRCQYSCKAGWANCNGDWKDGCEKDVSKDLYNCGACGKKCECGPYASAKCSGGKCQYTCKWGWA
NCNGDWKDGCEKDVGSDVNNCGGCGNKCPQPKYFGGEATCKNGKCRDQCKKGMKFDAHKKCCVR
RY
Repeat found in LOC112293623
Repeat occurs 4 times in a sequence of 257 amino acids
Location between 9190567 and 9191595
Coverage of 9.34 %
Instances:
GCETNV | GCEKDV | GCEKDV | GCEKDV |
pattern: GCE[KT][ND]V
MAISRQSRHAGLLVAATLAMIFIGSADAWCKEGTKDCDYNHKNGCETNVLYDVKNCGDCGNKC
ACGPYAVPKCNKGRCEFSCKAGWANCNNDWKDGCEKDVGKDVFNCGGCGKKCECGPNAIAKCNG
GRCQYSCKAGWANCNGDWKDGCEKDVSKDLYNCGACGKKCECGPYASAKCSGGKCQYTCKWGWA
NCNGDWKDGCEKDVGSDVNNCGGCGNKCPQPKYFGGEATCKNGKCRDQCKKGMKFDAHKKCCVR
RY
Repeat found in LOC112293824
Repeat occurs 4 times in a sequence of 257 amino acids
Location between 9232223 and 9233251
Coverage of 9.34 %
Instances:
GCETNV | GCEKDV | GCEKDV | GCEKDV |
pattern: GCE[KT][ND]V
MAISRQSRHAGLLVAATLAMIFIGSADAWCKEGTKDCDYNHKNGCETNVLYDVKNCGDCGNKC
ACGPYAVPKCNKGRCEFSCKAGWANCNNDWKDGCEKDVGKDVFNCGGCGKKCECGPNAIAKCNG
GRCQYSCKAGWANCNGDWKDGCEKDVSKDLYNCGACGKKCECGPYASAKCSGGKCQYTCKWGWA
NCNGDWKDGCEKDVGSDVNNCGGCGNKCPQPKYFGGEATCKNGKCRDQCKKGMKFDAHKKCCVR
RY

Similar gene clusters

No significant ClusterBlast hits found.

NC_037268 - Cluster 18 - Cyclopeptide

Gene cluster description

NC_037268 - Gene Cluster 18. Type = cyclopeptide. Location: 11447727 - 12317548 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 891 amino acids
Location between 11652778 and 11658088
Coverage of 18.52 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MSNPSFGSAGGQPEPGGGSIASTQPVMRPGYGMPIQQQPRPYAPPPAGFVPSQGPAPGQSAAP
VMPPGGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPPGMLLNDL
AVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSPKGFGFCE
FESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAANREEEAAP
GVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERAKTRPLPP
PPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSADVEASSAD
RGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERLYEEREKEWESR
EWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQREYEEDL
ADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDPSAHVVSM
FDSDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRRSAVPSVF
TQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFAKSLSLSS
AKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERERDRDRSSR
TESKTSE
TKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITEFLGEEET
TLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 897 amino acids
Location between 11652778 and 11658088
Coverage of 18.39 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MSNPSFGSAGGQPEPGGGSIASTQPVMRPGYGMPIQQQPRPYAPPPAGFVPSQGPAPGQSAAP
VMPPGVPVSGSAGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPP
GMLLNDLAVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSP
KGFGFCEFESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAAN
REEEAAPGVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERA
KTRPLPPPPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSAD
VEASSADRGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERLYEER
EKEWESREWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQ
REYEEDLADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDP
SAHVVSMFDSDSEEVAGPKEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRRS
AVPSVFTQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFAK
SLSLSSAKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERERD
RDRSSRTESKTSE
TKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITEF
LGEEETTLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGTN
KP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 898 amino acids
Location between 11652778 and 11658088
Coverage of 18.37 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MSNPSFGSAGGQPEPGGGSIASTQPVMRPGYGMPIQQQPRPYAPPPAGFVPSQGPAPGQSAAP
VMPPGVPVSGSAGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPP
GMLLNDLAVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSP
KGFGFCEFESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAAN
REEEAAPGVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERA
KTRPLPPPPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSAD
VEASSADRGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERLYEER
EKEWESREWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQ
REYEEDLADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDP
SAHVVSMFDSDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRR
SAVPSVFTQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFA
KSLSLSSAKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERER
DRDRSSRTESKTSE
TKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITE
FLGEEETTLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGT
NKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 898 amino acids
Location between 11652778 and 11658088
Coverage of 18.37 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MSNPSFGSAGGQPEPGGGSIASTQPVMRPGYGMPIQQQPRPYAPPPAGFVPSQGPAPGQSAAP
VMPPGVPVSGSAGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPP
GMLLNDLAVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSP
KGFGFCEFESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAAN
REEEAAPGVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERA
KTRPLPPPPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSAD
VEASSADRGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERLYEER
EKEWESREWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQ
REYEEDLADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDP
SAHVVSMFDSDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRR
SAVPSVFTQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFA
KSLSLSSAKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERER
DRDRSSRTESKTSE
TKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITE
FLGEEETTLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGT
NKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 898 amino acids
Location between 11652778 and 11658088
Coverage of 18.37 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MSNPSFGSAGGQPEPGGGSIASTQPVMRPGYGMPIQQQPRPYAPPPAGFVPSQGPAPGQSAAP
VMPPGVPVSGSAGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPP
GMLLNDLAVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSP
KGFGFCEFESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAAN
REEEAAPGVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERA
KTRPLPPPPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSAD
VEASSADRGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERLYEER
EKEWESREWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQ
REYEEDLADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDP
SAHVVSMFDSDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRR
SAVPSVFTQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFA
KSLSLSSAKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERER
DRDRSSRTESKTSE
TKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITE
FLGEEETTLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGT
NKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 838 amino acids
Location between 11652778 and 11657341
Coverage of 19.69 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MLSGVMYIAGMKPGSGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVL
KPPPGMLLNDLAVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPT
TGSPKGFGFCEFESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEE
EAANREEEAAPGVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAML
AERAKTRPLPPPPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQ
TSADVEASSADRGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENRAREIERL
YE
EREKEWESREWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKR
NRRQREYEEDLADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVI
LPDPSAHVVSMFDSDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIG
SGRRSAVPSVFTQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAA
AEFAKSLSLSSAKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDW
ERERDRDRSSRTESKTSETKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISK
KITEFLGEEETTLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGL
VTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 845 amino acids
Location between 11652778 and 11657341
Coverage of 19.53 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MLSGVMYIAGMKPGSVPVSGSAGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGI
RPMPPVLKPPPGMLLNDLAVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSW
KRAQDPTTGSPKGFGFCEFESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQ
ERIALEEEAANREEEAAPGVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAV
QKLNAMLAERAKTRPLPPPPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDD
TISEQKQTSADVEASSADRGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENR
AREIERLYE
EREKEWESREWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRS
AAYDEKRNRRQREYEEDLADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTN
HVEEPVILPDPSAHVVSMFDSDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRK
LGFGLIGSGRRSAVPSVFTQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGL
AHNLAAAAEFAKSLSLSSAKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRD
REK
DRDWERERDRDRSSRTESKTSETKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEK
MRPWISKKITEFLGEEETTLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEV
RRVETGLVTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 845 amino acids
Location between 11652778 and 11657341
Coverage of 19.53 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MLSGVMYIAGMKPGSVPVSGSAGAPRYSASSYGQMVRPPVFTPRLPMGPLPVPMLARPPGTGI
RPMPPVLKPPPGMLLNDLAVKGLAQVAAPPEKPHTTVYVGKISSTVEDEFLRAILQLCGGIKSW
KRAQDPTTGSPKGFGFCEFESAEGVLRALRLLNKYSLEGQELVLNVNQATREYLEHYVSKKKEQ
ERIALEEEAANREEEAAPGVEVVPAPKKAMNGVYSDKDECTSKEDSKKFGLVTQADKEVDDQAV
QKLNAMLAERAKTRPLPPPPPLPSTSETNVKSSSVADGPSLVKDGDSSADGVKSENVSARNDDD
TISEQKQTSADVEASSADRGRRSERVSERERERDLELDKEREVERYERERERKRLRRERDRENR
AREIERLYE
EREKEWESREWEKERQRLHDKEWEKEREKERRRLIKEQEEESDDEDDRHKHRYRS
AAYDEKRNRRQREYEEDLADRIREQEERASAELYKRHKPSPEPAVISNDENVADVDEKPIPSTN
HVEEPVILPDPSAHVVSMFDSDSEEVAGPIEEDAGNGVHPIGIEDNSAAAVNDLKLSSSPAPRK
LGFGLIGSGRRSAVPSVFTQEDEDPTPDANKLRTLVPIDYSAEEMHVVSSVSCAPPPPSSSGGL
AHNLAAAAEFAKSLSLSSAKVENDKDRSRRSRDKANDRDRSRKEKDRDRDRDRDRDRERERDRD
REK
DRDWERERDRDRSSRTESKTSETKNVVNAKQLIDAIPTTKEDLFNFPVNWNVYDTHGLHEK
MRPWISKKITEFLGEEETTLVDFIVSSTRKHVSAATMLELLEAILDDEAEMFVLKMWRMLIFEV
RRVETGLVTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 811 amino acids
Location between 11652778 and 11657124
Coverage of 20.35 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPPGMLLNDLAVKGLAQVAAPPEKPH
TTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSPKGFGFCEFESAEGVLRALRLLNK
YSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAANREEEAAPGVEVVPAPKKAMNGVY
SDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERAKTRPLPPPPPLPSTSETNVKSSS
VADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSADVEASSADRGRRSERVSERERERD
LELDKEREVERYERERERKRLRRERDRENRAREIERLYEEREKEWESREWEKERQRLHDKEWEK
EREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQREYEEDLADRIREQEERASAELY
KRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDPSAHVVSMFDSDSEEVAGPIEEDA
GNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRRSAVPSVFTQEDEDPTPDANKLRT
LVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFAKSLSLSSAKVENDKDRSRRSRDK
ANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERERDRDRSSRTESKTSETKNVVNAKQ
LIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITEFLGEEETTLVDFIVSSTRKHVSA
ATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 811 amino acids
Location between 11652778 and 11657124
Coverage of 20.35 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPPGMLLNDLAVKGLAQVAAPPEKPH
TTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSPKGFGFCEFESAEGVLRALRLLNK
YSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAANREEEAAPGVEVVPAPKKAMNGVY
SDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERAKTRPLPPPPPLPSTSETNVKSSS
VADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSADVEASSADRGRRSERVSERERERD
LELDKEREVERYERERERKRLRRERDRENRAREIERLYEEREKEWESREWEKERQRLHDKEWEK
EREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQREYEEDLADRIREQEERASAELY
KRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDPSAHVVSMFDSDSEEVAGPIEEDA
GNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRRSAVPSVFTQEDEDPTPDANKLRT
LVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFAKSLSLSSAKVENDKDRSRRSRDK
ANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERERDRDRSSRTESKTSETKNVVNAKQ
LIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITEFLGEEETTLVDFIVSSTRKHVSA
ATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 811 amino acids
Location between 11652778 and 11657124
Coverage of 20.35 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPPGMLLNDLAVKGLAQVAAPPEKPH
TTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSPKGFGFCEFESAEGVLRALRLLNK
YSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAANREEEAAPGVEVVPAPKKAMNGVY
SDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERAKTRPLPPPPPLPSTSETNVKSSS
VADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSADVEASSADRGRRSERVSERERERD
LELDKEREVERYERERERKRLRRERDRENRAREIERLYEEREKEWESREWEKERQRLHDKEWEK
EREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQREYEEDLADRIREQEERASAELY
KRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDPSAHVVSMFDSDSEEVAGPIEEDA
GNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRRSAVPSVFTQEDEDPTPDANKLRT
LVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFAKSLSLSSAKVENDKDRSRRSRDK
ANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERERDRDRSSRTESKTSETKNVVNAKQ
LIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITEFLGEEETTLVDFIVSSTRKHVSA
ATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGTNKP
Repeat found in LOC112293561
Repeat occurs 11 times in a sequence of 811 amino acids
Location between 11652778 and 11657124
Coverage of 20.35 %
Instances:
RDRENRAREIERLYE | RDRSRKEKDRDRDRD | RDRDRDRDRDRERER | RDRDRDRDRERERDR | RDRDRDRERERDRDR
RDRDRERERDRDREK | RDRERERDRDREKDR | RDRDREKDRDWERER | RDREKDRDWERERDR | RDRDRSSRTESKTSE
RDRSSRTESKTSETK |
pattern: RDR[EDS][RKNS][REKDS][REKATS][EKRD][REWDTS][RIEKD][REWDST][REKDS][REKTDL][REYDST][ERKD]
MVRPPVFTPRLPMGPLPVPMLARPPGTGIRPMPPVLKPPPGMLLNDLAVKGLAQVAAPPEKPH
TTVYVGKISSTVEDEFLRAILQLCGGIKSWKRAQDPTTGSPKGFGFCEFESAEGVLRALRLLNK
YSLEGQELVLNVNQATREYLEHYVSKKKEQERIALEEEAANREEEAAPGVEVVPAPKKAMNGVY
SDKDECTSKEDSKKFGLVTQADKEVDDQAVQKLNAMLAERAKTRPLPPPPPLPSTSETNVKSSS
VADGPSLVKDGDSSADGVKSENVSARNDDDTISEQKQTSADVEASSADRGRRSERVSERERERD
LELDKEREVERYERERERKRLRRERDRENRAREIERLYEEREKEWESREWEKERQRLHDKEWEK
EREKERRRLIKEQEEESDDEDDRHKHRYRSAAYDEKRNRRQREYEEDLADRIREQEERASAELY
KRHKPSPEPAVISNDENVADVDEKPIPSTNHVEEPVILPDPSAHVVSMFDSDSEEVAGPIEEDA
GNGVHPIGIEDNSAAAVNDLKLSSSPAPRKLGFGLIGSGRRSAVPSVFTQEDEDPTPDANKLRT
LVPIDYSAEEMHVVSSVSCAPPPPSSSGGLAHNLAAAAEFAKSLSLSSAKVENDKDRSRRSRDK
ANDRDRSRKEKDRDRDRDRDRDRERERDRDREKDRDWERERDRDRSSRTESKTSETKNVVNAKQ
LIDAIPTTKEDLFNFPVNWNVYDTHGLHEKMRPWISKKITEFLGEEETTLVDFIVSSTRKHVSA
ATMLELLEAILDDEAEMFVLKMWRMLIFEVRRVETGLVTGTNKP

Similar gene clusters

NC_037269 - Cluster 19 - Cyclopeptide

Gene cluster description

NC_037269 - Gene Cluster 19. Type = cyclopeptide. Location: 4761237 - 5210321 nt. Click on genes for more information.
Show pHMM detection rules used
plants/cyclopeptide: (BURP)

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Repeatfinder output


Repeat found in LOC112294411
Repeat occurs 3 times in a sequence of 122 amino acids
Location between 4995059 and 4995880
Coverage of 14.75 %
Instances:
LELQLQ | LELELR | LELRVE |
pattern: LEL[ERQ][VL][ERQ]
MPQQPVRILFWLFLSTVRDCCLFGECAPECNSSICYKGIAPDCYLELQLQTHLVLELELRVEA
ECNQILVWSVVGDEILSFPYSRFDSAFPQFRSRCSTYGKGLSSCGHGVELEGAKERNVR
Repeat found in LOC112294411
Repeat occurs 3 times in a sequence of 148 amino acids
Location between 4995115 and 4995880
Coverage of 12.16 %
Instances:
LELQLQ | LELELR | LELRVE |
pattern: LEL[ERQ][VL][ERQ]
MPQQPVRILFWLFLSTVRDCCLFGECAPECNSSICYKGIAPDCYLELQLQTHLVLELELRVEA
ECNQILVWSVVGDEILSFPYSRFDSAFPQFRSRCSTYGKGLSSCGHVHLKKQNSILFYFILSIS
LCCQAGAVASAESASQAEFLS

Similar gene clusters

NC_037270 - Cluster 20 - Fatty_acid-saccharide

Gene cluster description

NC_037270 - Gene Cluster 20. Type = fatty_acid-saccharide. Location: 12618409 - 13000103 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037270 - Cluster 21 - Terpene

Gene cluster description

NC_037270 - Gene Cluster 21. Type = terpene. Location: 14220303 - 14401124 nt. Click on genes for more information.
Show pHMM detection rules used
plants/plant: (minimum(4,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[]))
plants/terpene: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Terpene_synth/Terpene_synth_C/Prenyltrans/SQHop_cyclase_C/SQHop_cyclase_N/PRISE]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037271 - Cluster 22 - Fatty_acid

Gene cluster description

NC_037271 - Gene Cluster 22. Type = fatty_acid. Location: 10956112 - 11000398 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037273 - Cluster 23 - Polyketide

Gene cluster description

NC_037273 - Gene Cluster 23. Type = polyketide. Location: 4918706 - 5029736 nt. Click on genes for more information.
Show pHMM detection rules used
plants/polyketide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Chal_sti_synt_C/Chal_sti_synt_N]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Thr_dehydrat_C]) or minimum(3,[E1_dh,PALP,Thr_dehydrat_C,Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[AMP-binding,Chal_sti_synt_C,Chal_sti_synt_N]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters

NC_037273 - Cluster 24 - Fatty_acid-saccharide

Gene cluster description

NC_037273 - Gene Cluster 24. Type = fatty_acid-saccharide. Location: 10485580 - 10547265 nt. Click on genes for more information.
Show pHMM detection rules used
plants/fatty_acid: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[FA_desaturase/FA_desaturase_2/FA_hydroxylase/CER1-like_C]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,ECH_2]) or minimum(3,[Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Transferase,AMP-binding]))
plants/saccharide: (minimum(3,[NAD_binding_4, FAE1_CUT1_RppA, HAD_RAM2_N, Orn_DAP_Arg_deC,Pyridoxal_deC,BBE,FA_hydroxylase,CER1-like_C,ECH_2,Oxidored_FMN,3Beta_HSD,Glyco_hydro_1,ADH_N,ADH_N_2,Abhydrolase_3,Aldo_ket_red,cMT,nMT,oMT,adh_short,Chal_sti_synt_C,Chal_sti_synt_N,COesterase,UDPGT,Glyco_transf_28,Glycos_transf_1,Glycos_transf_2,Lycopene_cycl,NAD_binding_1,p450,SQHop_cyclase_C,SQHop_cyclase_N,Prenyltrans,Terpene_synth_C,Terpene_synth,Transferase,Aminotran_1_2,AMP-binding,DIOX_N,Dirigent,Bet_v_1,Cu_amine_oxid,Str_synth,Trp_syntA,His_biosynth,adh_short_C2,Peptidase_S10,Prenyltransf,Epimerase,2OG-FeII_Oxy,Aminotran_3,Methyltransf_2,Methyltransf_3,Methyltransf_7,PRISE,Cellulose_synt,Chalcone,ERG4_ERG24,FA_desaturase,FA_desaturase_2,Methyltransf_11,polyprenyl_synt,SE,SQS_PSY,TPMT,UbiA,Lipoxygenase,Lyase_aromatic,HMGL-like,Chalcone_3,Chalcone_2,Acetyltransf_1,UDPGT_2,GMC_oxred_N,GMC_oxred_C,Amino_oxidase,DAHP_synth_1,DAHP_synth_2],[Glycos_transf_1/Glycos_transf_2/Glycos_transf_28/UDPGT/UDPGT_2/Glyco_hydro_1/Cellulose_synt]))

Legend:

Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes

Similar gene clusters