Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGCTGTAGAGAAACGAGTGGAGATAAGAGGTCTCGGTCTCCTTCTCCGTCGTCGTTTGGACGGAGAACTTCGGAACTTCGGGTTGCAGAAAATCCACATTGTCATTTGCACTGGTTTTCCCGTTCTTCACGGGAAGGACCGATGACGAATGACCTTGCGGGTTCTTCTATCAGAAACCATGACAATGGAAGTCGTCTTTGTGAAAATAAAGACGAACATTTCTGTAAACTCTCTCAGTTTTGCGAGAATTTACAATGGGAATCGGCATCGAAAAAGTTTCGGTGGGAAAATTTGTTTGCCAATAATCCCGCCAATGCGAATTCGAAATCGAGTATAGGGTTGAAACATGGAAATATGTGTGATGGTCATAATCGAGGAATTAGGGTTTCTGGTTCACATTTGGGTACGTCGTCCAAGGAAATTTTAGTTGGTAATAATTTGCAGATATTCCATATGAACATTGGGGCCATTAAAGATAGTAACGTAAAGAACAATGGGGATACTTCCAGAAGCTTTGGAATCGATGACTATAGCCATTTGTCTTCATCTAGAAAGTTTGATGGGCCCTCTTACGAGACCAATGATGTTCATGTTCGGGACCGTCCGATCTTTGAATCAGCAGAAAATTCCTACAGAGGAAGACGAAACGAAACTTCTTCACATGGGATACAAGCGTCTCATCTACAGTCCAGTGCACCTGTTACTGAATCTAAGAGCATTTTGCAAGATGAATTTCATGATTTACTGGAGTATAAACGAGCTCGAAGGAACCATATTGAGCACTTTGACGATAGCAATCAGTATTTCTCAGTTCAGCCATGTAAGAGGAGTGACATTGATGCTGCTCTCAACAGTCCTTTCTCTCAGCAATTGGTTCGTATCCCGCAAGATGATTTCTATCAAGCTTCTACTCGGACCAGTGTTGTAATGGATCCAGTTGTTGAAGGATTCACTGAAAGCCATTTGGAAGAGACCACCCGACCAAGAGACCGTTATGATCTTTTCAAAGAACCATTCATCATTGAAGGTTCTTATATGGACACTGCCCCTTTTGCGATGGAACAGTATGGCAAAGTTTTGGGTTCAGGAACTGAAAGTTCGCTGAAGAGTGAAAGAGAAGCATATATAAGCAGCGAGAAATTACTCTTGCCTAAAGAAGATGGTTATAGGACAAATTATGGGAAATGGTCGAATGAGGATGGATTAAGTGGATCATTAGTATCAAAACATGATTTGAGCGACATGGAAGACAGTAGAAAGCTGAGATGGGAAGCCCCACATTCAACAAAGCCGAGGGTTGAAGGAACAAGATGTAGAATGCATGATCCTAGGTCTGGTTCATCTAGAAAATCAAATGTGTTTAGCAGAATCCAGTTTTTAAGCCATAGAGTTGAAAAGAGTGCTGTTAAAGATACTGACATCAATTTAATTGGTAGAGACAAGCGATGGAATGACGAGGATACTTCTATATCCTTGACATCCTCTAAACGGTCGTTGCCTTGGGTAATAAACCATGCCTCTCCGCGTTCAAAGCCTAAGCGTAGAGACCTAAAGAAGCGTTTGGGTTTCCCCTTAGGGGATCCCAGTTCAAACCCTTTAGTAAGAGAACGAGAAGGTAAAACAAACAAGCGTCTGAGGAAGACGAAAGTCAATCATAGGTGCCTTGATGTTCAAACAGGTGATTACTTGGAAGAGAAGGTGCAAAGTCCAACCAGTAGGCCACCACTTGAAGATTCAGAGGAGTTGAACCAGCTAATAAAGAGCGCCTTTCTCAAGTTTGTCAAAGTTCTGAATGAGAATCCAGCCAGACGAAAGAAGTTCAGAGAGCCGGGGTCTGGTATTATAAAGTGCATTGTCTGTGGCAGGTGTGTTTTTCCCCATTTTGCTGTGTTTACCTCCATCGAGCTTTTGATGCTCCTATCCTTATTAGAATTTAGAACTCTTATTTTAGCTTGATCGCCATTTAAGTTGAGTTTTGTAATGGGGAAAGATGCTTATTAAAAACAGTTCTATAAAATGAAACATTTTTAAGTACTTGTAAAGAAAACACTTAAATTTTTCTAGAATTGTTTAATGAAAAAGTGATTCATACAAACACTACTCTCAATGCTCATCTTAAAACTCATGTTTAATATCTTTCTCAAGGTCAAAACTCAAGCGGGTGACTGTCTCATTGGTTTTATGTGCTATAACATGTTTGTCTGATCCCCGTTAAATATTTCTTGTGAAATGGAAAATGCTTTTAATCCCTTGCTAGGTTTGACAGGATTTAAAAACAATATATTTAAGGGATGCTTGGCTCAAGTTAGGGGAGTAGGAGTTGGAAGGAGTGGAATCAATTTTATGCCTTATTAACTTTTTACACTGTGGGCTCCTGGAGTTCAGTATCCACTCTTTGCCTCAAACGCCCCCTGGTAGAAGTTTTTAGAAACATTTTCTATGATTATGGACTTCGATGATATAAGTTTGAACTGAGGAAGACTTTTGTTTTTGACTGCTGTTTTCATATTTCTCTCTTTTCATATGTTTAATATATTTATTTATTTTTGTATGGAAGCTTAAGCTTCATATTGAAATGAATTTGATGGGATTCTGTAGGGGGCTGTGGTTTAGAACTGTTGGAATCAGTCTCTCATTCCTTTCAAATGATCAGTGAAAGAGTTGTCTTTGACCTTTGTTTTGATTTTAAATTGAGAAATCATTGAAATGTAGAACATGGACGGAAGAGAACAGATCAGAAGAAAATTAAGCAAGCAATCAAAGAGAAAAAGAGTGTAATGCAAGCTAAAAAATTTCTTGCAATCAGCACTTTGGTTTGAGTTTCCCTCAACACTCCCTTAACTACATAATCTATTGCCTTGTATTGTTCTTTTCATCTCTTCTTAACTTTGCAATGAGTGAATAATTTGAGGTTCATATATTCTAGTACGTAAGACGTTTTCCATGGAATTCCACTTTGGCCTGGCAAAACCTCCCTATCTTGTTGCACATGTTTCAGGTTTCTGTATATGTATCGCTTGCTTATTTACTTGCATTTTTATTAGCAAGTCCAAGGAGTTTGCAGATGCACTAAGCTTATCACAACATGCCTCCCAGACGTTGGGAGGATTGAGGGCAGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGCAGCGCCAAACGGTCTATGGATTCGAAGGATATTGCCTCTTGTAGAAGTACTTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTGCAGTTGATAACCTGTCCGAACGGTTAGCCATAAGTTGTGAGGAGCTGGAGGTTGTCATTAGAGGTAAACCTCTTTGTCTTGCCTATTTTGCAGACTATATTATTTTATTTTGAAAATAGTTCCAAATACTCTTGTGAATTGCAAAATCTTAATATAGTACCTTTCTGTTAGATATTGCTATTAAATTTAATAGAATTCTAGGTATGCTACGTGTATCTACGTCACTAAAATTGACTCACATAAGGATATGATTTTCAAAATATTCAAAGTATATGCATGTAATTATTTTCTTTTCTTTGCTTTTCAGTTAATACAGTCCTGTTAGGCAAGTTTAACTTTGTCGACTGTCTAATCGATAAAATCATTATCAGTTTAGTCTTTTGTTTCACCAAAGTTTTGGTTTGTCCCAAAACCAATGAGCAGGAATGGGTTGTGGAGGGAAGATCGAAGTGGTACGTGGTAAACCTGGAAACCATAGTATTATGATAGCAACTTTTGGTGCAATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGGTTTGCAGATAAGAGTCATGGTAGGGACGAGTTCCATAAAATCAATTCGAGTCATCTCATCGACAGCCACAATGATCTGCATATAGCAACAGGAGCAAACACATTGGAGAGTGTACTGTATGGTTACTTAGGCCTCGCAGAGGACTTGGATAAACTTGACTTCGAGACCAAGAAGCGATCTGTGGTGAAAAGCAAGAAAGAAATCCAAGCCATTGTGAATGCGTCCCTTGACTGTTAG
mRNA sequence
ATGAGCTGTAGAGAAACGAGTGGAGATAAGAGGTCTCGGTCTCCTTCTCCGTCGTCGTTTGGACGGAGAACTTCGGAACTTCGGGTTGCAGAAAATCCACATTGTCATTTGCACTGGTTTTCCCGTTCTTCACGGGAAGGACCGATGACGAATGACCTTGCGGGTTCTTCTATCAGAAACCATGACAATGGAAGTCGTCTTTGTGAAAATAAAGACGAACATTTCTGTAAACTCTCTCAGTTTTGCGAGAATTTACAATGGGAATCGGCATCGAAAAAGTTTCGGTGGGAAAATTTGTTTGCCAATAATCCCGCCAATGCGAATTCGAAATCGAGTATAGGGTTGAAACATGGAAATATGTGTGATGGTCATAATCGAGGAATTAGGGTTTCTGGTTCACATTTGGGTACGTCGTCCAAGGAAATTTTAGTTGGTAATAATTTGCAGATATTCCATATGAACATTGGGGCCATTAAAGATAGTAACGTAAAGAACAATGGGGATACTTCCAGAAGCTTTGGAATCGATGACTATAGCCATTTGTCTTCATCTAGAAAGTTTGATGGGCCCTCTTACGAGACCAATGATGTTCATGTTCGGGACCGTCCGATCTTTGAATCAGCAGAAAATTCCTACAGAGGAAGACGAAACGAAACTTCTTCACATGGGATACAAGCGTCTCATCTACAGTCCAGTGCACCTGTTACTGAATCTAAGAGCATTTTGCAAGATGAATTTCATGATTTACTGGAGTATAAACGAGCTCGAAGGAACCATATTGAGCACTTTGACGATAGCAATCAGTATTTCTCAGTTCAGCCATGTAAGAGGAGTGACATTGATGCTGCTCTCAACAGTCCTTTCTCTCAGCAATTGGTTCGTATCCCGCAAGATGATTTCTATCAAGCTTCTACTCGGACCAGTGTTGTAATGGATCCAGTTGTTGAAGGATTCACTGAAAGCCATTTGGAAGAGACCACCCGACCAAGAGACCGTTATGATCTTTTCAAAGAACCATTCATCATTGAAGGTTCTTATATGGACACTGCCCCTTTTGCGATGGAACAGTATGGCAAAGTTTTGGGTTCAGGAACTGAAAGTTCGCTGAAGAGTGAAAGAGAAGCATATATAAGCAGCGAGAAATTACTCTTGCCTAAAGAAGATGGTTATAGGACAAATTATGGGAAATGGTCGAATGAGGATGGATTAAGTGGATCATTAGTATCAAAACATGATTTGAGCGACATGGAAGACAGTAGAAAGCTGAGATGGGAAGCCCCACATTCAACAAAGCCGAGGGTTGAAGGAACAAGATGTAGAATGCATGATCCTAGGTCTGGTTCATCTAGAAAATCAAATGTGTTTAGCAGAATCCAGTTTTTAAGCCATAGAGTTGAAAAGAGTGCTGTTAAAGATACTGACATCAATTTAATTGGTAGAGACAAGCGATGGAATGACGAGGATACTTCTATATCCTTGACATCCTCTAAACGGTCGTTGCCTTGGGTAATAAACCATGCCTCTCCGCGTTCAAAGCCTAAGCGTAGAGACCTAAAGAAGCGTTTGGGTTTCCCCTTAGGGGATCCCAGTTCAAACCCTTTAGTAAGAGAACGAGAAGGTAAAACAAACAAGCGTCTGAGGAAGACGAAAGTCAATCATAGGTGCCTTGATGTTCAAACAGGTGATTACTTGGAAGAGAAGGTGCAAAGTCCAACCAGTAGGCCACCACTTGAAGATTCAGAGGAGTTGAACCAGCTAATAAAGAGCGCCTTTCTCAAGTTTGTCAAAGTTCTGAATGAGAATCCAGCCAGACGAAAGAAGTTCAGAGAGCCGGGGTCTGGTATTATAAAGTGCATTGTCTGTGGCAGCAAGTCCAAGGAGTTTGCAGATGCACTAAGCTTATCACAACATGCCTCCCAGACGTTGGGAGGATTGAGGGCAGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGCAGCGCCAAACGGTCTATGGATTCGAAGGATATTGCCTCTTGTAGAAGTACTTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTGCAGTTGATAACCTGTCCGAACGGTTAGCCATAAGTTGTGAGGAGCTGGAGGTTGTCATTAGAGGAATGGGTTGTGGAGGGAAGATCGAAGTGGTACGTGGTAAACCTGGAAACCATAGTATTATGATAGCAACTTTTGGTGCAATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGGTTTGCAGATAAGAGTCATGGTAGGGACGAGTTCCATAAAATCAATTCGAGTCATCTCATCGACAGCCACAATGATCTGCATATAGCAACAGGAGCAAACACATTGGAGAGTGTACTGTATGGTTACTTAGGCCTCGCAGAGGACTTGGATAAACTTGACTTCGAGACCAAGAAGCGATCTGTGGTGAAAAGCAAGAAAGAAATCCAAGCCATTGTGAATGCGTCCCTTGACTGTTAG
Coding sequence (CDS)
ATGAGCTGTAGAGAAACGAGTGGAGATAAGAGGTCTCGGTCTCCTTCTCCGTCGTCGTTTGGACGGAGAACTTCGGAACTTCGGGTTGCAGAAAATCCACATTGTCATTTGCACTGGTTTTCCCGTTCTTCACGGGAAGGACCGATGACGAATGACCTTGCGGGTTCTTCTATCAGAAACCATGACAATGGAAGTCGTCTTTGTGAAAATAAAGACGAACATTTCTGTAAACTCTCTCAGTTTTGCGAGAATTTACAATGGGAATCGGCATCGAAAAAGTTTCGGTGGGAAAATTTGTTTGCCAATAATCCCGCCAATGCGAATTCGAAATCGAGTATAGGGTTGAAACATGGAAATATGTGTGATGGTCATAATCGAGGAATTAGGGTTTCTGGTTCACATTTGGGTACGTCGTCCAAGGAAATTTTAGTTGGTAATAATTTGCAGATATTCCATATGAACATTGGGGCCATTAAAGATAGTAACGTAAAGAACAATGGGGATACTTCCAGAAGCTTTGGAATCGATGACTATAGCCATTTGTCTTCATCTAGAAAGTTTGATGGGCCCTCTTACGAGACCAATGATGTTCATGTTCGGGACCGTCCGATCTTTGAATCAGCAGAAAATTCCTACAGAGGAAGACGAAACGAAACTTCTTCACATGGGATACAAGCGTCTCATCTACAGTCCAGTGCACCTGTTACTGAATCTAAGAGCATTTTGCAAGATGAATTTCATGATTTACTGGAGTATAAACGAGCTCGAAGGAACCATATTGAGCACTTTGACGATAGCAATCAGTATTTCTCAGTTCAGCCATGTAAGAGGAGTGACATTGATGCTGCTCTCAACAGTCCTTTCTCTCAGCAATTGGTTCGTATCCCGCAAGATGATTTCTATCAAGCTTCTACTCGGACCAGTGTTGTAATGGATCCAGTTGTTGAAGGATTCACTGAAAGCCATTTGGAAGAGACCACCCGACCAAGAGACCGTTATGATCTTTTCAAAGAACCATTCATCATTGAAGGTTCTTATATGGACACTGCCCCTTTTGCGATGGAACAGTATGGCAAAGTTTTGGGTTCAGGAACTGAAAGTTCGCTGAAGAGTGAAAGAGAAGCATATATAAGCAGCGAGAAATTACTCTTGCCTAAAGAAGATGGTTATAGGACAAATTATGGGAAATGGTCGAATGAGGATGGATTAAGTGGATCATTAGTATCAAAACATGATTTGAGCGACATGGAAGACAGTAGAAAGCTGAGATGGGAAGCCCCACATTCAACAAAGCCGAGGGTTGAAGGAACAAGATGTAGAATGCATGATCCTAGGTCTGGTTCATCTAGAAAATCAAATGTGTTTAGCAGAATCCAGTTTTTAAGCCATAGAGTTGAAAAGAGTGCTGTTAAAGATACTGACATCAATTTAATTGGTAGAGACAAGCGATGGAATGACGAGGATACTTCTATATCCTTGACATCCTCTAAACGGTCGTTGCCTTGGGTAATAAACCATGCCTCTCCGCGTTCAAAGCCTAAGCGTAGAGACCTAAAGAAGCGTTTGGGTTTCCCCTTAGGGGATCCCAGTTCAAACCCTTTAGTAAGAGAACGAGAAGGTAAAACAAACAAGCGTCTGAGGAAGACGAAAGTCAATCATAGGTGCCTTGATGTTCAAACAGGTGATTACTTGGAAGAGAAGGTGCAAAGTCCAACCAGTAGGCCACCACTTGAAGATTCAGAGGAGTTGAACCAGCTAATAAAGAGCGCCTTTCTCAAGTTTGTCAAAGTTCTGAATGAGAATCCAGCCAGACGAAAGAAGTTCAGAGAGCCGGGGTCTGGTATTATAAAGTGCATTGTCTGTGGCAGCAAGTCCAAGGAGTTTGCAGATGCACTAAGCTTATCACAACATGCCTCCCAGACGTTGGGAGGATTGAGGGCAGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGCAGCGCCAAACGGTCTATGGATTCGAAGGATATTGCCTCTTGTAGAAGTACTTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTGCAGTTGATAACCTGTCCGAACGGTTAGCCATAAGTTGTGAGGAGCTGGAGGTTGTCATTAGAGGAATGGGTTGTGGAGGGAAGATCGAAGTGGTACGTGGTAAACCTGGAAACCATAGTATTATGATAGCAACTTTTGGTGCAATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGGTTTGCAGATAAGAGTCATGGTAGGGACGAGTTCCATAAAATCAATTCGAGTCATCTCATCGACAGCCACAATGATCTGCATATAGCAACAGGAGCAAACACATTGGAGAGTGTACTGTATGGTTACTTAGGCCTCGCAGAGGACTTGGATAAACTTGACTTCGAGACCAAGAAGCGATCTGTGGTGAAAAGCAAGAAAGAAATCCAAGCCATTGTGAATGCGTCCCTTGACTGTTAG
Protein sequence
MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRNHDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLFANNPANANSKSSIGLKHGNMCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYSHLSSSRKFDGPSYETNDVHVRDRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESKSILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDDFYQASTRTSVVMDPVVEGFTESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQYGKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWSNEDGLSGSLVSKHDLSDMEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDINLIGRDKRWNDEDTSISLTSSKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC
Homology
BLAST of ClCG02G021960 vs. NCBI nr
Match:
XP_038900433.1 (uncharacterized protein LOC120087658 [Benincasa hispida])
HSP 1 Score: 1351.7 bits (3497), Expect = 0.0e+00
Identity = 708/847 (83.59%), Postives = 748/847 (88.31%), Query Frame = 0
Query: 1 MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRN 60
M+ RETS DKRS+ SPSSFGRRTSE RV ENPHCH WFSRSSRE P+TN LAGSSIR+
Sbjct: 1 MNYRETSCDKRSQ--SPSSFGRRTSEPRVEENPHCHSLWFSRSSREVPVTNGLAGSSIRD 60
Query: 61 HDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLFANNPANANSKSSIGLKHGNM 120
H NGSRL EN DEHF KLSQ CENLQ ES SKKFRWENLFANNPANANSKSS+GLKH N+
Sbjct: 61 HYNGSRLYENTDEHFRKLSQLCENLQRESPSKKFRWENLFANNPANANSKSSMGLKHENI 120
Query: 121 CDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYSH 180
CDG+NRGIRVSGSHLGTSS IL G+NL+ FHMNIG KDSNVKNNGD SRSFGIDD SH
Sbjct: 121 CDGYNRGIRVSGSHLGTSSNNILGGSNLRTFHMNIGETKDSNVKNNGDISRSFGIDDCSH 180
Query: 181 LSSSRKFDGPSYETNDVHVRDRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESKS 240
LSSSRKFDGP YET+DVHVRDRPIFESAENS+RGRRN SSHG+QAS+LQSSAPVTESK
Sbjct: 181 LSSSRKFDGPLYETSDVHVRDRPIFESAENSHRGRRNVASSHGLQASNLQSSAPVTESKG 240
Query: 241 ILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDDF 300
I QDEFHD LEYKRARRN+IE FDDSNQYFSVQP KRSDIDA LNS FSQQ+VRIPQDDF
Sbjct: 241 ISQDEFHDFLEYKRARRNNIEQFDDSNQYFSVQPGKRSDIDATLNSTFSQQMVRIPQDDF 300
Query: 301 YQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQYG 360
YQ STRTSVVMD VVEGF TESHLEETTRPRDRYD FKEPF+IEGSYM TAPF ME YG
Sbjct: 301 YQDSTRTSVVMDSVVEGFKDTESHLEETTRPRDRYDSFKEPFVIEGSYMGTAPFEMELYG 360
Query: 361 KVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWSNEDGLSGSLVSKH--DLSDM 420
+ LGSG ESS+K EREAYISSEKLLL +EDGYRT YGKW +EDG++GSLVSKH DLSDM
Sbjct: 361 EGLGSGAESSMKGEREAYISSEKLLLAEEDGYRTYYGKWLHEDGVNGSLVSKHKQDLSDM 420
Query: 421 EDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDIN 480
E SRKLRW+A +STK RVEGTRC MH+P S SSRK NVFSRIQFLSH E AVKDTDIN
Sbjct: 421 EGSRKLRWKATNSTKLRVEGTRCIMHEPGSCSSRKPNVFSRIQFLSHGDENIAVKDTDIN 480
Query: 481 LIGRDKRWNDEDTSISLTSSKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVR 540
L R K WN+EDTSI LTSSKR LPWVINHASP SK KRRDL+KRLGFPL DPSS+PLVR
Sbjct: 481 LNCRSKWWNEEDTSIYLTSSKRPLPWVINHASPHSKLKRRDLRKRLGFPLRDPSSSPLVR 540
Query: 541 EREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFVK 600
+R+ K NKRLRK VNH CLDVQT DY+EEKVQSPTSR LED EELNQLIKSAFLKFVK
Sbjct: 541 DRKRKKNKRLRKRNVNHSCLDVQTDDYMEEKVQSPTSR-LLEDQEELNQLIKSAFLKFVK 600
Query: 601 VLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKAL 660
VL+ENPARRKKF EPG GIIKCIVCGSKSKEFADALSLSQHASQTL G RAEHLGL KAL
Sbjct: 601 VLSENPARRKKFTEPGCGIIKCIVCGSKSKEFADALSLSQHASQTLEGSRAEHLGLQKAL 660
Query: 661 CWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISC 720
CWLMGWSSEAAP+G W+RRILPL EVLALKEDLIIWPPVLIIHNSSIA+D+ SER+AISC
Sbjct: 661 CWLMGWSSEAAPDGRWVRRILPLEEVLALKEDLIIWPPVLIIHNSSIAIDSPSERVAISC 720
Query: 721 EELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEF 780
EELEVVIRGMGCGGKI+VVRGKPGN SIMI TF AMFSGLQEAERLHK FADKSHGRDEF
Sbjct: 721 EELEVVIRGMGCGGKIKVVRGKPGNQSIMIVTFDAMFSGLQEAERLHKSFADKSHGRDEF 780
Query: 781 HKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAI 840
KI SSHLIDSH DLH ATGANTL++VLYGYLGL EDLDKLDFETKKRSVVKSKKEIQAI
Sbjct: 781 QKIYSSHLIDSHKDLHKATGANTLDNVLYGYLGLTEDLDKLDFETKKRSVVKSKKEIQAI 840
Query: 841 VNASLDC 844
VNASL C
Sbjct: 841 VNASLHC 844
BLAST of ClCG02G021960 vs. NCBI nr
Match:
XP_008458617.1 (PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo] >KAA0033382.1 uncharacterized protein E6C27_scaffold111G00320 [Cucumis melo var. makuwa])
HSP 1 Score: 1261.1 bits (3262), Expect = 0.0e+00
Identity = 669/848 (78.89%), Postives = 719/848 (84.79%), Query Frame = 0
Query: 1 MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRN 60
M+ RE + DKRS+ SPS FGRRTSE RV E PHC+ HWFSRSSRE PMTN+L GSSIR+
Sbjct: 1 MNSREMNRDKRSQ--SPSLFGRRTSEPRVEEYPHCYSHWFSRSSRERPMTNELPGSSIRD 60
Query: 61 HDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLFANNP-ANANSKSSIGLKHGN 120
H NGSRL +KDEHF KLSQFCENLQ ES +KKF+WENLF NN AN NSK+S+GLKH N
Sbjct: 61 HYNGSRLYFHKDEHFRKLSQFCENLQGESPAKKFQWENLFVNNNLANGNSKASMGLKHVN 120
Query: 121 MCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYS 180
DG NRGIRVSGSHLGTSSK IL G NL+ FHMNIGA KDSNVKNNGDTSRS GI+D +
Sbjct: 121 GSDGDNRGIRVSGSHLGTSSKSIL-GGNLRTFHMNIGATKDSNVKNNGDTSRSVGINDCN 180
Query: 181 HLSSSRKFDGPSYETNDVHVRDRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESK 240
HLSSSRK+DGP ++ N+VHVRDRPIFE ENS+RGRRNETSS GIQASHL SSAPV ESK
Sbjct: 181 HLSSSRKYDGPLHDINEVHVRDRPIFELVENSHRGRRNETSSRGIQASHLHSSAPVAESK 240
Query: 241 SILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD 300
I Q EFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKR+DIDA + PFSQ +VRIPQDD
Sbjct: 241 GISQGEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRTDIDAGPSRPFSQHMVRIPQDD 300
Query: 301 FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQY 360
FY+ STRTSVVMD VVEGF TESH EETTRPRD ++ F IEGS M TAPFAMEQY
Sbjct: 301 FYRDSTRTSVVMDSVVEGFQDTESHFEETTRPRD-HNAF-----IEGSCMSTAPFAMEQY 360
Query: 361 GKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWSNEDGLSGSLVSKH--DLSD 420
+VLGSGTESS EREAYISSEKLLL +EDGYRTN+GKW+ EDG++GS VSKH DL D
Sbjct: 361 VEVLGSGTESSQDGEREAYISSEKLLLVEEDGYRTNFGKWTLEDGVNGSSVSKHKQDLGD 420
Query: 421 MEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDI 480
MED RKL W+A HSTKPRVEG R +MHDP GS +K NVFSRIQFL+H VKDTD
Sbjct: 421 MEDRRKLTWKAQHSTKPRVEGARSKMHDPGPGSFKKPNVFSRIQFLNH----GDVKDTDF 480
Query: 481 NLIGRDKRWNDEDTSISLTSSKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLV 540
NL R+ DEDTS SSKR LPWV+NH SPRSK KRR+LKKRLG PLGDP+SN LV
Sbjct: 481 NLNCRNNWQVDEDTSF---SSKRQLPWVVNHVSPRSKLKRRNLKKRLGLPLGDPNSNSLV 540
Query: 541 REREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV 600
RERE K NKRLRKT V+H CLDVQTGDYLEEKVQSPTSRPPLED EELNQLIKSAFLKFV
Sbjct: 541 RERERKRNKRLRKTNVDHGCLDVQTGDYLEEKVQSPTSRPPLEDPEELNQLIKSAFLKFV 600
Query: 601 KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKA 660
KVL+ENPARRKK EPG GII CIVCGSKSKEF DALSLSQHAS+TL G RAEHLGLHKA
Sbjct: 601 KVLSENPARRKKLTEPGCGIITCIVCGSKSKEFVDALSLSQHASRTLEGSRAEHLGLHKA 660
Query: 661 LCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAIS 720
LCWLMGWSSE APNGLW+RRILPL EVLALKEDLIIWPPVLIIHNSSIA+D LS+ +AIS
Sbjct: 661 LCWLMGWSSETAPNGLWVRRILPLEEVLALKEDLIIWPPVLIIHNSSIAIDKLSDGVAIS 720
Query: 721 CEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDE 780
CEELE VIRGMGCGGKI+VVRG+PGN SIM+ TFGAMFSGLQEAERLHK FADKSHGRDE
Sbjct: 721 CEELEAVIRGMGCGGKIKVVRGEPGNQSIMVVTFGAMFSGLQEAERLHKSFADKSHGRDE 780
Query: 781 FHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQA 840
HKIN HLIDS+ DLH ATGANTLESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQA
Sbjct: 781 VHKINLRHLIDSNVDLHKATGANTLESVLYGYLGLAEDLVKLDFETKKRSVVKSKKEIQA 832
Query: 841 IVNASLDC 844
IVNASL C
Sbjct: 841 IVNASLQC 832
BLAST of ClCG02G021960 vs. NCBI nr
Match:
XP_011657058.1 (uncharacterized protein LOC105435801 [Cucumis sativus] >KGN46926.1 hypothetical protein Csa_020974 [Cucumis sativus])
HSP 1 Score: 351.3 bits (900), Expect = 2.3e-92
Identity = 177/221 (80.09%), Postives = 190/221 (85.97%), Query Frame = 0
Query: 623 SKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEV 682
SKSKEF DALSL QHAS+TL G RAEHLGLHKALCWLMGWSSE APNGLW+R ILP VEV
Sbjct: 34 SKSKEFVDALSLPQHASRTLEGSRAEHLGLHKALCWLMGWSSEIAPNGLWVRMILPPVEV 93
Query: 683 LALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNH 742
LALKEDLIIWP VLIIHNSSIA+D E +AISCE+LE +R MGCGGK +VVRGK N
Sbjct: 94 LALKEDLIIWPSVLIIHNSSIAIDKRYEGVAISCEKLEAAVRAMGCGGKFKVVRGKAVNQ 153
Query: 743 SIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLES 802
SIM+ TFGAMF GLQEAERLH FADKSHGRDEFHKIN L+DS+ D+H ATGANTLES
Sbjct: 154 SIMVVTFGAMFYGLQEAERLHNNFADKSHGRDEFHKINLRCLVDSNVDMHKATGANTLES 213
Query: 803 VLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC 844
V YGYLGL EDLDKLDFETKKRSVV+SKKEIQAIV+ASL C
Sbjct: 214 VRYGYLGLVEDLDKLDFETKKRSVVRSKKEIQAIVHASLQC 254
BLAST of ClCG02G021960 vs. NCBI nr
Match:
XP_022140332.1 (uncharacterized protein LOC111011032 [Momordica charantia])
HSP 1 Score: 303.9 bits (777), Expect = 4.3e-78
Identity = 153/184 (83.15%), Postives = 164/184 (89.13%), Query Frame = 0
Query: 660 MGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEEL 719
MGWSSE APNGLW++RILP VE ALKEDLIIWPPVLIIHNSSIA DN SE++ ISCEEL
Sbjct: 1 MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEEL 60
Query: 720 EVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKI 779
EVVIRGMG GGKI+VVRGKP N SIM+ TF AMFSGLQEAERLHK FADKSHGRDEFH+I
Sbjct: 61 EVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEI 120
Query: 780 NSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNA 839
NSSH IDSH DLH A GAN +ESVLYGYLGLAED +KLDFETKKRSVVKSKKEIQAIV+A
Sbjct: 121 NSSHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDA 180
Query: 840 SLDC 844
+L C
Sbjct: 181 TLQC 183
BLAST of ClCG02G021960 vs. NCBI nr
Match:
XP_017982234.1 (PREDICTED: uncharacterized protein LOC18590378 [Theobroma cacao])
HSP 1 Score: 303.5 bits (776), Expect = 5.6e-78
Identity = 174/331 (52.57%), Postives = 221/331 (66.77%), Query Frame = 0
Query: 515 RRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSR 574
R+ +K+RLG P + N + R K K L++ VN VQ D V+ +
Sbjct: 291 RKSIKQRLGPPCHVHNPNYMPRVERHKMRKLLQE-NVNDFPEGVQARDVDLRHVKRGRTE 350
Query: 575 PPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPG-SGIIKCIVCGSKSKEFADALS 634
PP EDSEE Q I AF+KFVK+LNENPA+R+K+RE G +G +KC VCGSKS+EF + LS
Sbjct: 351 PP-EDSEEFEQQIHGAFVKFVKILNENPAQRRKYREKGEAGTLKCCVCGSKSEEFVNTLS 410
Query: 635 LSQHA-SQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIW 694
L HA + + GLRA HLGLHK+LC+LMGW+S AA NGLW ++ LP VE LA+KEDL+IW
Sbjct: 411 LVTHAFTSRMVGLRANHLGLHKSLCFLMGWNSVAASNGLWRQKTLPDVEALAMKEDLVIW 470
Query: 695 PPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMGCG-GKIEVVRGKPGNHSIMIATFGA 754
PP++I+HNSSIA N R+ +S EE+E +R MG G G +V RGKP N SIM F
Sbjct: 471 PPIVILHNSSIATTNSDNRIIVSIEEIEAFLRDMGFGWGISKVCRGKPANQSIMTVIFHG 530
Query: 755 MFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLA 814
FSGL+EAERLHK +A+ HGR EF +IN S L + ++ VLYGYLG+A
Sbjct: 531 TFSGLKEAERLHKLYAENKHGRAEFQQINCSSGETKKAPL------DKVKDVLYGYLGIA 590
Query: 815 EDLDKLDFETKKRSVVKSKKEIQAIVNASLD 843
DLDKLDFETK R++VKSKKEI A +A L+
Sbjct: 591 GDLDKLDFETKSRALVKSKKEIYATADALLN 613
BLAST of ClCG02G021960 vs. ExPASy TrEMBL
Match:
A0A5A7SQC0 (XS domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold111G00320 PE=4 SV=1)
HSP 1 Score: 1261.1 bits (3262), Expect = 0.0e+00
Identity = 669/848 (78.89%), Postives = 719/848 (84.79%), Query Frame = 0
Query: 1 MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRN 60
M+ RE + DKRS+ SPS FGRRTSE RV E PHC+ HWFSRSSRE PMTN+L GSSIR+
Sbjct: 1 MNSREMNRDKRSQ--SPSLFGRRTSEPRVEEYPHCYSHWFSRSSRERPMTNELPGSSIRD 60
Query: 61 HDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLFANNP-ANANSKSSIGLKHGN 120
H NGSRL +KDEHF KLSQFCENLQ ES +KKF+WENLF NN AN NSK+S+GLKH N
Sbjct: 61 HYNGSRLYFHKDEHFRKLSQFCENLQGESPAKKFQWENLFVNNNLANGNSKASMGLKHVN 120
Query: 121 MCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYS 180
DG NRGIRVSGSHLGTSSK IL G NL+ FHMNIGA KDSNVKNNGDTSRS GI+D +
Sbjct: 121 GSDGDNRGIRVSGSHLGTSSKSIL-GGNLRTFHMNIGATKDSNVKNNGDTSRSVGINDCN 180
Query: 181 HLSSSRKFDGPSYETNDVHVRDRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESK 240
HLSSSRK+DGP ++ N+VHVRDRPIFE ENS+RGRRNETSS GIQASHL SSAPV ESK
Sbjct: 181 HLSSSRKYDGPLHDINEVHVRDRPIFELVENSHRGRRNETSSRGIQASHLHSSAPVAESK 240
Query: 241 SILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD 300
I Q EFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKR+DIDA + PFSQ +VRIPQDD
Sbjct: 241 GISQGEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRTDIDAGPSRPFSQHMVRIPQDD 300
Query: 301 FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQY 360
FY+ STRTSVVMD VVEGF TESH EETTRPRD ++ F IEGS M TAPFAMEQY
Sbjct: 301 FYRDSTRTSVVMDSVVEGFQDTESHFEETTRPRD-HNAF-----IEGSCMSTAPFAMEQY 360
Query: 361 GKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWSNEDGLSGSLVSKH--DLSD 420
+VLGSGTESS EREAYISSEKLLL +EDGYRTN+GKW+ EDG++GS VSKH DL D
Sbjct: 361 VEVLGSGTESSQDGEREAYISSEKLLLVEEDGYRTNFGKWTLEDGVNGSSVSKHKQDLGD 420
Query: 421 MEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDI 480
MED RKL W+A HSTKPRVEG R +MHDP GS +K NVFSRIQFL+H VKDTD
Sbjct: 421 MEDRRKLTWKAQHSTKPRVEGARSKMHDPGPGSFKKPNVFSRIQFLNH----GDVKDTDF 480
Query: 481 NLIGRDKRWNDEDTSISLTSSKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLV 540
NL R+ DEDTS SSKR LPWV+NH SPRSK KRR+LKKRLG PLGDP+SN LV
Sbjct: 481 NLNCRNNWQVDEDTSF---SSKRQLPWVVNHVSPRSKLKRRNLKKRLGLPLGDPNSNSLV 540
Query: 541 REREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV 600
RERE K NKRLRKT V+H CLDVQTGDYLEEKVQSPTSRPPLED EELNQLIKSAFLKFV
Sbjct: 541 RERERKRNKRLRKTNVDHGCLDVQTGDYLEEKVQSPTSRPPLEDPEELNQLIKSAFLKFV 600
Query: 601 KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKA 660
KVL+ENPARRKK EPG GII CIVCGSKSKEF DALSLSQHAS+TL G RAEHLGLHKA
Sbjct: 601 KVLSENPARRKKLTEPGCGIITCIVCGSKSKEFVDALSLSQHASRTLEGSRAEHLGLHKA 660
Query: 661 LCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAIS 720
LCWLMGWSSE APNGLW+RRILPL EVLALKEDLIIWPPVLIIHNSSIA+D LS+ +AIS
Sbjct: 661 LCWLMGWSSETAPNGLWVRRILPLEEVLALKEDLIIWPPVLIIHNSSIAIDKLSDGVAIS 720
Query: 721 CEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDE 780
CEELE VIRGMGCGGKI+VVRG+PGN SIM+ TFGAMFSGLQEAERLHK FADKSHGRDE
Sbjct: 721 CEELEAVIRGMGCGGKIKVVRGEPGNQSIMVVTFGAMFSGLQEAERLHKSFADKSHGRDE 780
Query: 781 FHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQA 840
HKIN HLIDS+ DLH ATGANTLESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQA
Sbjct: 781 VHKINLRHLIDSNVDLHKATGANTLESVLYGYLGLAEDLVKLDFETKKRSVVKSKKEIQA 832
Query: 841 IVNASLDC 844
IVNASL C
Sbjct: 841 IVNASLQC 832
BLAST of ClCG02G021960 vs. ExPASy TrEMBL
Match:
A0A1S3C894 (uncharacterized protein LOC103497964 OS=Cucumis melo OX=3656 GN=LOC103497964 PE=4 SV=1)
HSP 1 Score: 1261.1 bits (3262), Expect = 0.0e+00
Identity = 669/848 (78.89%), Postives = 719/848 (84.79%), Query Frame = 0
Query: 1 MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRN 60
M+ RE + DKRS+ SPS FGRRTSE RV E PHC+ HWFSRSSRE PMTN+L GSSIR+
Sbjct: 1 MNSREMNRDKRSQ--SPSLFGRRTSEPRVEEYPHCYSHWFSRSSRERPMTNELPGSSIRD 60
Query: 61 HDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLFANNP-ANANSKSSIGLKHGN 120
H NGSRL +KDEHF KLSQFCENLQ ES +KKF+WENLF NN AN NSK+S+GLKH N
Sbjct: 61 HYNGSRLYFHKDEHFRKLSQFCENLQGESPAKKFQWENLFVNNNLANGNSKASMGLKHVN 120
Query: 121 MCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYS 180
DG NRGIRVSGSHLGTSSK IL G NL+ FHMNIGA KDSNVKNNGDTSRS GI+D +
Sbjct: 121 GSDGDNRGIRVSGSHLGTSSKSIL-GGNLRTFHMNIGATKDSNVKNNGDTSRSVGINDCN 180
Query: 181 HLSSSRKFDGPSYETNDVHVRDRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESK 240
HLSSSRK+DGP ++ N+VHVRDRPIFE ENS+RGRRNETSS GIQASHL SSAPV ESK
Sbjct: 181 HLSSSRKYDGPLHDINEVHVRDRPIFELVENSHRGRRNETSSRGIQASHLHSSAPVAESK 240
Query: 241 SILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD 300
I Q EFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKR+DIDA + PFSQ +VRIPQDD
Sbjct: 241 GISQGEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRTDIDAGPSRPFSQHMVRIPQDD 300
Query: 301 FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQY 360
FY+ STRTSVVMD VVEGF TESH EETTRPRD ++ F IEGS M TAPFAMEQY
Sbjct: 301 FYRDSTRTSVVMDSVVEGFQDTESHFEETTRPRD-HNAF-----IEGSCMSTAPFAMEQY 360
Query: 361 GKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWSNEDGLSGSLVSKH--DLSD 420
+VLGSGTESS EREAYISSEKLLL +EDGYRTN+GKW+ EDG++GS VSKH DL D
Sbjct: 361 VEVLGSGTESSQDGEREAYISSEKLLLVEEDGYRTNFGKWTLEDGVNGSSVSKHKQDLGD 420
Query: 421 MEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDI 480
MED RKL W+A HSTKPRVEG R +MHDP GS +K NVFSRIQFL+H VKDTD
Sbjct: 421 MEDRRKLTWKAQHSTKPRVEGARSKMHDPGPGSFKKPNVFSRIQFLNH----GDVKDTDF 480
Query: 481 NLIGRDKRWNDEDTSISLTSSKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLV 540
NL R+ DEDTS SSKR LPWV+NH SPRSK KRR+LKKRLG PLGDP+SN LV
Sbjct: 481 NLNCRNNWQVDEDTSF---SSKRQLPWVVNHVSPRSKLKRRNLKKRLGLPLGDPNSNSLV 540
Query: 541 REREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV 600
RERE K NKRLRKT V+H CLDVQTGDYLEEKVQSPTSRPPLED EELNQLIKSAFLKFV
Sbjct: 541 RERERKRNKRLRKTNVDHGCLDVQTGDYLEEKVQSPTSRPPLEDPEELNQLIKSAFLKFV 600
Query: 601 KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKA 660
KVL+ENPARRKK EPG GII CIVCGSKSKEF DALSLSQHAS+TL G RAEHLGLHKA
Sbjct: 601 KVLSENPARRKKLTEPGCGIITCIVCGSKSKEFVDALSLSQHASRTLEGSRAEHLGLHKA 660
Query: 661 LCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAIS 720
LCWLMGWSSE APNGLW+RRILPL EVLALKEDLIIWPPVLIIHNSSIA+D LS+ +AIS
Sbjct: 661 LCWLMGWSSETAPNGLWVRRILPLEEVLALKEDLIIWPPVLIIHNSSIAIDKLSDGVAIS 720
Query: 721 CEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDE 780
CEELE VIRGMGCGGKI+VVRG+PGN SIM+ TFGAMFSGLQEAERLHK FADKSHGRDE
Sbjct: 721 CEELEAVIRGMGCGGKIKVVRGEPGNQSIMVVTFGAMFSGLQEAERLHKSFADKSHGRDE 780
Query: 781 FHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQA 840
HKIN HLIDS+ DLH ATGANTLESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQA
Sbjct: 781 VHKINLRHLIDSNVDLHKATGANTLESVLYGYLGLAEDLVKLDFETKKRSVVKSKKEIQA 832
Query: 841 IVNASLDC 844
IVNASL C
Sbjct: 841 IVNASLQC 832
BLAST of ClCG02G021960 vs. ExPASy TrEMBL
Match:
A0A0A0KGN5 (XS domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G151630 PE=4 SV=1)
HSP 1 Score: 351.3 bits (900), Expect = 1.1e-92
Identity = 177/221 (80.09%), Postives = 190/221 (85.97%), Query Frame = 0
Query: 623 SKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEV 682
SKSKEF DALSL QHAS+TL G RAEHLGLHKALCWLMGWSSE APNGLW+R ILP VEV
Sbjct: 34 SKSKEFVDALSLPQHASRTLEGSRAEHLGLHKALCWLMGWSSEIAPNGLWVRMILPPVEV 93
Query: 683 LALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNH 742
LALKEDLIIWP VLIIHNSSIA+D E +AISCE+LE +R MGCGGK +VVRGK N
Sbjct: 94 LALKEDLIIWPSVLIIHNSSIAIDKRYEGVAISCEKLEAAVRAMGCGGKFKVVRGKAVNQ 153
Query: 743 SIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLES 802
SIM+ TFGAMF GLQEAERLH FADKSHGRDEFHKIN L+DS+ D+H ATGANTLES
Sbjct: 154 SIMVVTFGAMFYGLQEAERLHNNFADKSHGRDEFHKINLRCLVDSNVDMHKATGANTLES 213
Query: 803 VLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC 844
V YGYLGL EDLDKLDFETKKRSVV+SKKEIQAIV+ASL C
Sbjct: 214 VRYGYLGLVEDLDKLDFETKKRSVVRSKKEIQAIVHASLQC 254
BLAST of ClCG02G021960 vs. ExPASy TrEMBL
Match:
A0A6J1CGJ5 (uncharacterized protein LOC111011032 OS=Momordica charantia OX=3673 GN=LOC111011032 PE=4 SV=1)
HSP 1 Score: 303.9 bits (777), Expect = 2.1e-78
Identity = 153/184 (83.15%), Postives = 164/184 (89.13%), Query Frame = 0
Query: 660 MGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEEL 719
MGWSSE APNGLW++RILP VE ALKEDLIIWPPVLIIHNSSIA DN SE++ ISCEEL
Sbjct: 1 MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEEL 60
Query: 720 EVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKI 779
EVVIRGMG GGKI+VVRGKP N SIM+ TF AMFSGLQEAERLHK FADKSHGRDEFH+I
Sbjct: 61 EVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEI 120
Query: 780 NSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNA 839
NSSH IDSH DLH A GAN +ESVLYGYLGLAED +KLDFETKKRSVVKSKKEIQAIV+A
Sbjct: 121 NSSHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDA 180
Query: 840 SLDC 844
+L C
Sbjct: 181 TLQC 183
BLAST of ClCG02G021960 vs. ExPASy TrEMBL
Match:
A0A6J0ZXA5 (uncharacterized protein LOC110412979 OS=Herrania umbratica OX=108875 GN=LOC110412979 PE=4 SV=1)
HSP 1 Score: 302.0 bits (772), Expect = 7.9e-78
Identity = 174/331 (52.57%), Postives = 220/331 (66.47%), Query Frame = 0
Query: 515 RRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSR 574
R+ +K+RLG P + N + R + K K L K VN VQ D V+ +
Sbjct: 718 RKRIKQRLGPPCHVHNPNYMPRTQRHKMRK-LLKENVNDFHEGVQARDVDLRHVKRGRTE 777
Query: 575 PPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPG-SGIIKCIVCGSKSKEFADALS 634
PP EDS+E Q I+ AF+++VK+LNENPA+R+K+ E G +G +KC VCGSKS+EF + LS
Sbjct: 778 PP-EDSKEFEQQIRGAFVQYVKILNENPAQRRKYTEKGEAGTLKCCVCGSKSEEFVNTLS 837
Query: 635 LSQHA-SQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIW 694
L HA + + GLR HLGLHKALC+LMGW+S AA NGLW ++ LP VE LA+KEDL+IW
Sbjct: 838 LVTHAFTSRMVGLRVNHLGLHKALCFLMGWNSVAASNGLWRQKTLPDVEALAMKEDLVIW 897
Query: 695 PPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMGCG-GKIEVVRGKPGNHSIMIATFGA 754
PPV+I+HNSSIA N R+ +S EE+E +R MG G G +V RGKP N SIM F
Sbjct: 898 PPVVILHNSSIATTNSDHRIIVSIEEIEAFLRDMGFGRGISKVCRGKPANQSIMTVIFHG 957
Query: 755 MFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLA 814
FSGL+EAERLHK +A+ HGR EF +IN S L + +E VLYGYLG+A
Sbjct: 958 TFSGLKEAERLHKLYAENKHGRAEFQQINCSTGETKKVPL------DKVEDVLYGYLGIA 1017
Query: 815 EDLDKLDFETKKRSVVKSKKEIQAIVNASLD 843
DLDKLDFETK R++VKSKKEI A +A LD
Sbjct: 1018 GDLDKLDFETKSRALVKSKKEIYATADALLD 1040
BLAST of ClCG02G021960 vs. TAIR 10
Match:
AT3G22430.1 (CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380); BEST Arabidopsis thaliana protein match is: XS domain-containing protein / XS zinc finger domain-containing protein-related (TAIR:AT5G23570.1); Has 565 Blast hits to 510 proteins in 121 species: Archae - 2; Bacteria - 90; Metazoa - 191; Fungi - 32; Plants - 51; Viruses - 4; Other Eukaryotes - 195 (source: NCBI BLink). )
HSP 1 Score: 134.4 bits (337), Expect = 4.2e-31
Identity = 87/254 (34.25%), Postives = 131/254 (51.57%), Query Frame = 0
Query: 587 IKSAFLKFVKVLNENPARRKKFREPG-SGIIKCIVCGSKSKEFADALSLSQHA-SQTLGG 646
+K +FL FVK + E+P +K + E G G ++C+VCG SK+ D SL H
Sbjct: 253 LKKSFLGFVKRVFEDPMEKKNYLENGRKGRLQCLVCGRSSKDVQDTHSLVMHTYCSDDSS 312
Query: 647 LRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIA 706
R HLGLHKALC LMGW+ AP+ + LP E + LIIWPP +I+ N+S
Sbjct: 313 SRVHHLGLHKALCVLMGWNFSKAPDNSKAYQNLPADEAAINQAQLIIWPPHVIVQNTSTG 372
Query: 707 VDNLSERLAISCEELEVVIRGMG-CGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLH 766
+ ++ IR +G GGK + + G+ G+ I + F SGL++A R+
Sbjct: 373 KGKEGRMEGFGNKTMDNRIRELGLTGGKSKSLYGREGHLGITLFKFAGDDSGLRDAMRMA 432
Query: 767 KRFADKSHGRDEFHKIN--SSHLIDSHNDLHIATGANTLES--VLYGYLGLAEDLDKLDF 826
+ F + GR + ++ + D N + T E + YGYL DLDK+D
Sbjct: 433 EYFEKINRGRKSWGRVQPLTPSKDDEKNPGLVEVDGRTGEKKRIFYGYLATVTDLDKVDV 492
Query: 827 ETKKRSVVKSKKEI 834
ETKK++ ++S +E+
Sbjct: 493 ETKKKTTIESLREL 506
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038900433.1 | 0.0e+00 | 83.59 | uncharacterized protein LOC120087658 [Benincasa hispida] | [more] |
XP_008458617.1 | 0.0e+00 | 78.89 | PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo] >KAA0033382.1 unc... | [more] |
XP_011657058.1 | 2.3e-92 | 80.09 | uncharacterized protein LOC105435801 [Cucumis sativus] >KGN46926.1 hypothetical ... | [more] |
XP_022140332.1 | 4.3e-78 | 83.15 | uncharacterized protein LOC111011032 [Momordica charantia] | [more] |
XP_017982234.1 | 5.6e-78 | 52.57 | PREDICTED: uncharacterized protein LOC18590378 [Theobroma cacao] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5A7SQC0 | 0.0e+00 | 78.89 | XS domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca... | [more] |
A0A1S3C894 | 0.0e+00 | 78.89 | uncharacterized protein LOC103497964 OS=Cucumis melo OX=3656 GN=LOC103497964 PE=... | [more] |
A0A0A0KGN5 | 1.1e-92 | 80.09 | XS domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G151630 PE=4 SV=... | [more] |
A0A6J1CGJ5 | 2.1e-78 | 83.15 | uncharacterized protein LOC111011032 OS=Momordica charantia OX=3673 GN=LOC111011... | [more] |
A0A6J0ZXA5 | 7.9e-78 | 52.57 | uncharacterized protein LOC110412979 OS=Herrania umbratica OX=108875 GN=LOC11041... | [more] |
Match Name | E-value | Identity | Description | |
AT3G22430.1 | 4.2e-31 | 34.25 | CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380); ... | [more] |