Gene cluster description
CM038907 - Gene Cluster 1. Type = cyclopeptide. Location: 6938664 - 7612373 nt. Click on genes for more information.
Show pHMM detection rules usedplants/cyclopeptide: (BURP)
Legend:
Only available when smCOG analysis was run
biosynthetic genes
transport-related genes
regulatory genes
other genes
Repeatfinder output
Repeat found in CY35_01G043200
Repeat occurs 3 times in a sequence of 325 amino acids
Location between 7007193 and 7009510
Coverage of 7.38 %
Instances:
ATSTPGGD | ATSTPGGT | ATSTPGGD |
pattern: ATSTPGG[TD]
MATSTPGGDGASDTKRGMATSTPGGTSDTKRSIATSTPGGDEGASETKRSRLAALHRKHAAAV
AATTTPSPSKQPPPSSSSPASFRIFKTDPNDLNYAKLCPSVFNGPLVSQLQLRKDDSNVAVDKV
LRELLGSNAKMSGKIGGTIETRVQDRVLLLDNPATQGGAVDRAKNKAHRSRSKRSSKHLSLRQH
RHLGSFNLPIEYQKYELYLPMHEMWKEYARKLVHNCNDAMMQARLLTADLHGAMIAVVETKSTS
YMGTNGIMVRETENTFGVMTIKDKLRVVPKAGTVFTLQLDTLRVTLFGNNLFLRGLHPSKRQQI
KPTIEL
Repeat found in CY35_01G044300
Repeat occurs 29 times in a sequence of 1318 amino acids
Location between 7078729 and 7082686
Coverage of 30.8 %
Instances:
KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLLECIEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK
KRLVKLPKCMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK
KRLVKLPKCMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK
KRLVKLPECMEEMK | KRLVKLPKCMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK
KRLMKLLECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLDVGGCDLDCLP
KRLVKLPECMEEMK | KRLVKLPECMEEMK | KRLMKLPECMEEMK | KRLVKLPECIEEMT |
pattern: KRL[MVD][KV][LG][LPG][EKC][CD][MLI][ED][EC][ML][PKT]
MKSLTWLNVRGCDLDCLPQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTRLDVGGCDLDC
LPQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTSLDVSECDLDCLPQGMGRLEKLSDLNWS
NNKRLVKLLECIEEMKSLTRLDVSGCDLDCLPQGMGRLEELSNLNLSNNKRLVKLPECMEEMKS
LTSLDVGGCDLDCLPQGMGRLEELSNLNLSNNKRLVKLPECMEEMKSLKKLDVGGCDLDCLPQG
MGRLEKLSDLNLSNNKRLVKLPKCMEEMKSLTWLDVSGCDLDCLPQGMGRLEKLSDLNLSNNKR
LVKLPECMEEMKSLTRLDVGGCDLDCLPRGMGRLEKLYGLILKDNKRLVKLPECMEEMKSLKKL
DVGGCDLDCLLRGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTRLDVGGCDLDCLPRGMGRL
EKLYGLILKDNKRLVKLPECMEEMKSLKKLDVGGCDLDCLPQGMGRLEKLSDLNLSNNKRLVKL
PKCMEEMKSLTWFDVGGCDLDYLPQKMGRLEKLYGLILKDNKRLVKLPECMEEMKSLTWLDVSG
CDLDCLPQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTRLDVGGCDLDCLPRGMGRLEKLY
GLILKDNKRLVKLPECMEEMKSLKKLDVGGCDLDCLLRGMGRLEKLSDLNLSNNKRLVKLPECM
EEMKSLTRLDVGGCDLDCLPRGMGRLEKLYGLILKDNKRLVKLPECMEEMKSLKKLDVGGCDLD
CLPQGMGRLEKLSDLNLSNNKRLVKLPKCMEEMKSLTWFDVGGCDLDYLPQKMGRLEKLYGLIL
KDNKRLVKLPECMEEMKSLTWLDVSGCDLDCLPQGMGRLEKLSDLNLNNNKRLVKLPECMEEMK
SLTWLDVSGCDLDCLPQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLKTLDVGGCDLDCLPQ
GMGRLEKLYELILKDNKRLMKLLECMEEMKSLTRLDVGGCDLDCLPQGMGRLEELSNLNLSNNK
RLVKLPECMEEMKSLTWLDVSGCDLDCLPQGMGRLEKLSDLNLNNNKRLVKLPECMEEMKSLTW
LDVGGCDLDCLPQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLKRLDVGGCDLDCLPQGMGR
LEELSTLNLSNNKRLVKLPECMEEMKSLTWLDVGGCDLDCLLQGMGRLEKLSNLNLSNNKRLVK
LPECMEEMKSLTRLDVGGCDLDCLPQGMGQLEKLSDLNLSNNKRLMKLPECMEEMKSLTRLDVG
GCDLDCLPQGMGRLEKLYGLILKDNKRLVKLPECIEEMT
Repeat found in CY35_01G044400
Repeat occurs 5 times in a sequence of 984 amino acids
Location between 7083149 and 7090399
Coverage of 4.07 %
Instances:
LNLKPISQ | LNLQERLQ | LNLSNNKR | LNLSNNKR | LNLSNNKR
pattern: LNL[QKS][EPN][RNI][LSK][RQ]
MENSMMNKCTSTIKNIHQRILESEEVLNQNQCEHLLMEFKRTLDIIKENVSKLTTCNEIGELL
TYDLHQVVIKVNDMVEACCSKKWWVEAIFQLHNEDAFMDILQDLKLCIDAMSTIVVDNNIHHVD
GESLNLKPISQDKLEDDQNQLSKRPSFQAICERLKKLKIELISSHLHTLKPTFGDKGLRTYASL
SSEIVDINKEMYQSHVDISKEEGEISKLPKTMEASLVTQMDNLQQSIESHKKNILNMPKAMEGQ
ILQSIPRELVGIEKIVLNLQERLQVQPTIGIVGMGGIGKTTMAKALYDHIYHNFEAHCFMPNIK
ANKDNFQLLIDILKELGHDGKITNIVKGEEVLRHLFCTKKMLIILDDVRCQKQLDDILPIDLDF
TNGSRIIMTSRSWIDLRNNVKEEGKFDMPYLDNNNAMELFKKYVANNQSERKEEFGLITSQIVK
ACGGLPLSLKVLGSYLRKETDLKIWQQALKKLQQAHSLDGRQNDEQLWGILRISFDELAEEERY
MFLDIVCFFCTSNNHSNMMTKATALRIWDDEKCSPELTLSTLVNMSLVQIASNGLFVVHDQLRD
MGRMISKKEYNGSRWNVEAKELTPQFLKGLEHTQGLLIEGGEILNYDQNVMVNMLDLRFLKVTS
WDCKHVNILIDIVKHSPNLKWLHLEFDHEFNGQKSLCQISPLFDLLELRVLNIMVGTYWNMYSS
TPTIFKKTICNFSKLKKLQEFGILGIFNLEFDENFGELSALKVVNLHVTSWKKLPKTFERLKNL
EELYLQQNANLIELPQSLGQLTNLKTINVSNCDLDYVPEGLGQLKNLSYLNLSNNKRLVKLLEC
MEEMKSLTSLNVGGCDLDCLPQGMGQLEKLSNLNLSNNKRLVKLPECMEEMKSLTWLDVGGCDL
DCLLQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTRLDVGGCDLDCLPQGMRRLEKLSNLI
LKNNKRLMKLPENGAIGKVVRLEFE
Repeat found in CY35_01G_324
Repeat occurs 5 times in a sequence of 1005 amino acids
Location between 7083149 and 7090399
Coverage of 3.98 %
Instances:
LNLKPISQ | LNLQERLQ | LNLSNNKR | LNLSNNKR | LNLSNNKR
pattern: LNL[QKS][EPN][RNI][LSK][RQ]
MENSMMNKCTSTIKNIHQRILESEEVLNQNQCEHLLMEFKRTLDIIKENVSKLTTCNEIGELL
TYDLHQVVIKVNDMVEACCSKKWWVEAIFQLHNEDAFMDILQDLKLCIDAMSTIVVDNNIHHVD
GESLNLKPISQDKLEDDQNQLSKRPSFQAICERLKKLKIELISSHLHTLKPTFGDKGLRTYASL
SSEIVDINKEMYQSHVDISKEEGEISKLPKTMEASLVTQMDNLQQSIESHKKNILNMPKAMEIS
SITQMQNFHFVEPHKEDNQGQILQSIPRELVGIEKIVLNLQERLQVQPTIGIVGMGGIGKTTMA
KALYDHIYHNFEAHCFMPNIKANKDNFQLLIDILKELGHDGKITNIVKGEEVLRHLFCTKKMLI
ILDDVRCQKQLDDILPIDLDFTNGSRIIMTSRSWIDLRNNVKEEGKFDMPYLDNNNAMELFKKY
VANNQSERKEEFGLITSQIVKACGGLPLSLKVLGSYLRKETDLKIWQQALKKLQQAHSLDGRQN
DEQLWGILRISFDELAEEERYMFLDIVCFFCTSNNHSNMMTKATALRIWDDEKCSPELTLSTLV
NMSLVQIASNGLFVVHDQLRDMGRMISKKEYNGSRWNVEAKELTPQFLKGLEHTQGLLIEGGEI
LNYDQNVMVNMLDLRFLKVTSWDCKHVNILIDIVKHSPNLKWLHLEFDHEFNGQKSLCQISPLF
DLLELRVLNIMVGTYWNMYSSTPTIFKKTICNFSKLKKLQEFGILGIFNLEFDENFGELSALKV
VNLHVTSWKKLPKTFERLKNLEELYLQQNANLIELPQSLGQLTNLKTINVSNCDLDYVPEGLGQ
LKNLSYLNLSNNKRLVKLLECMEEMKSLTSLNVGGCDLDCLPQGMGQLEKLSNLNLSNNKRLVK
LPECMEEMKSLTWLDVGGCDLDCLLQGMGRLEKLSDLNLSNNKRLVKLPECMEEMKSLTRLDVG
GCDLDCLPQGMRRLEKLSNLILKNNKRLMKLPENGAIGKVVRLEFE
Repeat found in CY35_01G047500
Repeat occurs 5 times in a sequence of 512 amino acids
Location between 7563737 and 7567965
Coverage of 5.86 %
Instances:
VGSTMD | VGSGVK | VGSGLG | VGSGVG | VGSGIG
pattern: VGS[TG][MLVI][KDG]
MVFVGLVFGFVVGVGLMTGLHYCMLHRSRKRIQKIAAIRLLNSIQQDELRKLCGSSFPTWVSF
PTFEKVNWLNHNLAKVWPSVVMATEQLVKEALQPILEQYRPPGIQALKLDKFNIGTVPPKFDGI
RVQSLHKSQVIMDMEFRWGGDASIILGINPVIGPKLPVQLKNFSLFTTVRVIFQLTEEMPCISA
VVVALLSKPKPQIKYTLKVIGGSTGAIPGLSEMIDEMIESAVADQVQWPHRIVVPIGNAPPDVL
SNLGLKLQGKLTVQVLKATNLKNLEMVGKSDPYVRLYVRVLFKEKTRVIDNNLNPVWNEQFEFD
VEDQETQSLILDVKDEDNIGTDKKLGVTSIPLASLKPDVEEEITKNLAVSLDRDRVKDKGDRGS
ITIKVLYHPYTKEEQDAAMEAEKKKLEEKERLKNAGIVGSTMDAVGSGVKLVGTGVGMVGSGLG
AGASVVGSGVGIVGSGIGKAGKRLSRVVTRHASSNKLTSPATASPVSASPMHQQNGSFRASITQ
E
Repeat found in CY35_01G_349
Repeat occurs 5 times in a sequence of 512 amino acids
Location between 7563737 and 7567965
Coverage of 5.86 %
Instances:
VGSTMD | VGSGVK | VGSGLG | VGSGVG | VGSGIG
pattern: VGS[TG][MLVI][KDG]
MVFVGLVFGFVVGVGLMTGLHYCMLHRSRKRIQKIAAIRLLNSIQQDELRKLCGSSFPTWVSF
PTFEKVNWLNHNLAKVWPSVVMATEQLVKEALQPILEQYRPPGIQALKLDKFNIGTVPPKFDGI
RVQSLHKSQVIMDMEFRWGGDASIILGINPVIGPKLPVQLKNFSLFTTVRVIFQLTEEMPCISA
VVVALLSKPKPQIKYTLKVIGGSTGAIPGLSEMIDEMIESAVADQVQWPHRIVVPIGNAPPDVL
SNLGLKLQGKLTVQVLKATNLKNLEMVGKSDPYVRLYVRVLFKEKTRVIDNNLNPVWNEQFEFD
VEDQETQSLILDVKDEDNIGTDKKLGVTSIPLASLKPDVEEEITKNLAVSLDRDRVKDKGDRGS
ITIKVLYHPYTKEEQDAAMEAEKKKLEEKERLKNAGIVGSTMDAVGSGVKLVGTGVGMVGSGLG
AGASVVGSGVGIVGSGIGKAGKRLSRVVTRHASSNKLTSPATASPVSASPMHQQNGSFRASITQ
E



