Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGCTGTAGAGAAACGAGTGGAGATAAGAGGTCTCGGTCTCCTTCTCCGTCGTCGTTTGGACGGAGAACTTCGGAACTTCGGGTTGCAGAAAATCCACATTGTCATTTGCACTGGTTTTCCCGTTCTTCACGGGAAGGACCGACGACGAATGACCTTGCGGGTTCTTCTATCAGAAACCATGACAATGGAAGTCGTCTTTGTGAAAATAAAGACGAACATTTCCGCAAACTCTCTCAGTTTTGCGAGAATTTACAATGGGAATCGGCGTCGAAAAAGTTTCGGTGGGAAAATTTGTTTGCCAATAATCCCGCCAATGCGAATTCGAAATCGAGTATAGGGTTGAAACATGGAAATATGTGTGATGGTCGTCATAATCGAGGAATTAGGGTTTCTGGTTCACATTTGGGTACGTCGTCCAAGGAAATTTTAGTTGGTAATAATTTGCATATGAACATTGGGGCAATTAAAGATAGTAACGTAAAGAACAATGGGGATACTTCCAGAAGCTTTGGAATCGATGACTATAGCCATTTGTCTTCATCTAGAAAGTTTGATGGGCCCTCTTACGAGACCAATGATGTTCATGTTCGGGATCGTCTGATCTTTGAATCAGCAGAAAATTCCTACAGAGGAAGACGAAACGAAACTTCTTCACATGGGATACAAGCGTCTCATCTACAGTCCAGTGCACCTGTTACTGAATCTAAGAGCATTTTGCAAGATGAATTTCATGATTTACTGGAGTATAAACGAGCTCGAAGGAACCATATTGAGCACTTTGACGATAGCAATCAGTATTTCTCAGTTCAGCCATGTAAGAGGAGTGACATTGATGCTGCTCTCAACAGTCCTTTCTCTCAGCAATTGGTTCGTATCCCGCAAGATGATTTCTATCAAGCTTCTACTCGGACCAGTGTTGTAATGGATCCTGTTGTTGAAGGATTCACTGAAAGCCATTTGGAAGAGACCACCCGACCAAGAGACCGTTATGATCTTTTCAAAGAACCATTCATCATTGAAGGTTCTTATATGGACACTGCCCCTTTTGCGATGGAACAGTATGGCAAAGTTTTGGGTTCAGGAACTGAAAGTTCGCTGAAGAGTGAAAGAGAAGCATATATAAGCAGCGAGAAATTACTCTTGCCTAAAGAAGATGGTTATAGGACAAATTATGGGAAATGGTCGAATGAGGATGGATTAAGTGGATCATTAGTATCAAAACATGATTTGAGCGACATGGAAGACAGTAGAAAGCTGAGATGGGAAGCCCCACATTCAACAAAGCCGAGGGTTGAAGGAACAAGATGTAGAATGCATGATCCTAGGTCTGGTTCATCTAGAAAATCAAATGTGTTTAGCAGAATCCAGTTTTTAAGCCATAGAGTTGAAAAGAGTGCTGTTAAAGATACTGACATCAATTTAATTGGTAGAGACAAGCGATGGAATGACGAGGATACTTCTATATCCTTGACATCCTCTAAACGGTCGTTGCCTTGGGTAATAAACCATGCCTCTCCGCGTTCAAAGCCTAAGCGTAGAGACCTAAAGAAGCGTTTGGGTTTCCCCTTAGGGGATCCCAGTTCAAACCCTTTAGTAAGAGAACGAGAAGGTAAAACAAACAAGCGTCTGAGGAAGACGAAAGTCAATCATAGGTGCCTTGATGTTCAAACAGGTGATTACTTGGAAGAGAAGGTGCAAAGTCCAACCAGTAGGCCACCACTTGAAGATTCAGAGGAGTTGAACCAGCTAATAAAGAGCGCCTTTCTCAAGTTTGTCAAAGTTCTGAATGAGAATCCAGCCAGACGAAAGAAGTTCAGAGAGCCGGGGTCTGGTATTATAAAGTGCATTGTCTGTGGCAGGTGTGTTTTTCCCCATTTTGCTGTGTTTACCTCCATTGAGCTTTTGATGCTCCTATCCTTATTAGAATTTAGAACTCTTATTTTAGCTTGATCGCCATTTAAGTTGAGTTTTGTAATGGGGAAAGATGCTTATTAAAAACAGTTCTATAAAATGAAACATTTTTAAGTACTTGTATAGAAAACACTTAAATTTTTCTAGAATTGTTTAATGAAAAAGTGATTCATACAAACACTACTCTCAATGCTCATCTTAAAACTCATGTTTAATATCTTTCTCAAGGTCAAAACTCAAGCGGGGGACTGTCTCATTGGTTTTATGTGCTATAACATGTTTGTCTGATCCCCGTTAAATATTTCTTGTGAAATGGAAAATGCTTTTAATCCCTTGCTAGGTTTGACAGGATTTAAAAACAATATATTTAAGGGATGCTTCGCTCAAGTTAGGGGAGTAGGAGTTGGAAGGAGTGGAATCAATTTTATGCCTTGTTAACTTTTTACATTGTGGGCTCCTGGAGTTCAGTATCCACTCTTTGCCCCAAACGCCCCCTGGTAGAAGTTTTTAGAAACATTTTCTATGATTATGGACTTCGATGATATAAGTTTGAACTGAGGAAGACTTTTGTTTTTGACTGCTGTTTTCATATTTCTCTCTTTTCATATGTTTAATATATTTATTTATTTTTGTATGGAAGCTTAAGCTTCATATTGAAATGAATTTGATGGGATTCTGTAGGGGGCTGTGGTTTAGAACTGTTGGAATCAGTCTCTCATTCCTTTCAAATGATCAGTGAAAGAGTTGTCTTTCACCTTTGTTTTGATTTTAAATTGAGAAATCATTGAAATATAGAACATGGACGGAAGAGAATAGATCAGAAGAAAATTAAGCAAGCAATCAAAGAGAAAAAGAGTGTAATGCAAGCTAAAAAATTTCTTGCAATCAGCACTTTGGTTTGAGTTTACCTCAACACTCCCGTAACTACATAATCTATTGCCTTGTATTGTTCTTTTCATCTCTTCTTAACTTTGCAATGAGTGAATAATTTGAGGTTCGTATATTCTAGTATGTAAGACGTTTTCCATGGAATTCCACTTTAGCCTGGCAAAACCTCCCTATCTTGTTGCACATGTTTCAGGTTTCTGTATATGTATCGCTTGCTTATTTACTTGCATTTTTATTAGCAAGTCCAAGGAGTTTGCAGATGCACTAAGCTTATCACAACATGCCTCCCAGACGTTGGGAGGATTGAGGGCAGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGCAGCGCCAAACGGTCTATGGATTCGAAGGATATTACCTCTTGTAGAAGTACTTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTGCAATTGATAACCTGTCCGAACGGGTAGCCATAAGTTGTGAGGAGCTGGAGGTTGTCATTAGAGGTAAACCTTTTCTTTGCTGTAACTATGAATTGCTGGTCTTTCCATTTCTAATATTCAAGTTGCATGTTGATGGTATATTATTCAAGTCATCTTTTCAAGTTAACCAAGATGTGATATAAAGACATTCTCATTTCTCAATATATTTTTAAAATAGTGCTCCCTTGGTTGTTTTTATGTTATCCTACCTGGAATCAATTATATGTATTGTTTTTCCTTTGAGATAATTACAAATCACACCTCTTTGTCTTGCCTATTTTGCAGACTATATTATTTTATTTTGAAAATAGTTCCAAATACTCTTGTGAATTGCAAAATCTTAATATAGTACCTTTCTGTTAGATATTGCTATTAAATTTAACAGAATTCTAGGTATGCTACGTGTATCTACGTCACTAAAATTGACTCACATAAGGATATGATTTTCAAAATATTCAAAGTATATGCATGTAATTATTTTCTTTTCTTTTGCTTTTCAGTTAATACAGTCCTGTTAGGCAAGTCTTCACTTACTCTTACAAGTTATGCACGTTTAACTTTGTCGACTGTCTGAAATAATATGCTAATCGATAAAATCATTATCAGTTTAGTCTTTTGTTTCACCAAAGTTTTGGTTTGTCCCAAAACCAATGAGCAGGAATGGGTTGTGGAGGGAAGATCGAAGTGGTACGTGGTAAACCTGGAAACCATAGTATTATGGTAGCAACTTTTGGTGCAATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAATGTTTGCAGATAAGAGTCATGGTAGGCACGAGTTCCATAAAATCAATTCGAGTCATCTCATCGACAGCCACAATGATCTGCATATAGCAACAGGAGCAAACACATTGGAGAGTGTACTGTATGGTTACTTAGGCCTCGCAGAGGACTTGGATAAACTTGACTTCGAGACCAAGAAGCGATCTGTGGTGAAAAGCAAGAAAGAAATCCAAGCCATTGTGAATGCGTCCCTTGACTGTTAG
mRNA sequence
ATGAGCTGTAGAGAAACGAGTGGAGATAAGAGGTCTCGGTCTCCTTCTCCGTCGTCGTTTGGACGGAGAACTTCGGAACTTCGGGTTGCAGAAAATCCACATTGTCATTTGCACTGGTTTTCCCGTTCTTCACGGGAAGGACCGACGACGAATGACCTTGCGGGTTCTTCTATCAGAAACCATGACAATGGAAGTCGTCTTTGTGAAAATAAAGACGAACATTTCCGCAAACTCTCTCAGTTTTGCGAGAATTTACAATGGGAATCGGCGTCGAAAAAGTTTCGGTGGGAAAATTTGTTTGCCAATAATCCCGCCAATGCGAATTCGAAATCGAGTATAGGGTTGAAACATGGAAATATGTGTGATGGTCGTCATAATCGAGGAATTAGGGTTTCTGGTTCACATTTGGGTACGTCGTCCAAGGAAATTTTAGTTGGTAATAATTTGCATATGAACATTGGGGCAATTAAAGATAGTAACGTAAAGAACAATGGGGATACTTCCAGAAGCTTTGGAATCGATGACTATAGCCATTTGTCTTCATCTAGAAAGTTTGATGGGCCCTCTTACGAGACCAATGATGTTCATGTTCGGGATCGTCTGATCTTTGAATCAGCAGAAAATTCCTACAGAGGAAGACGAAACGAAACTTCTTCACATGGGATACAAGCGTCTCATCTACAGTCCAGTGCACCTGTTACTGAATCTAAGAGCATTTTGCAAGATGAATTTCATGATTTACTGGAGTATAAACGAGCTCGAAGGAACCATATTGAGCACTTTGACGATAGCAATCAGTATTTCTCAGTTCAGCCATGTAAGAGGAGTGACATTGATGCTGCTCTCAACAGTCCTTTCTCTCAGCAATTGGTTCGTATCCCGCAAGATGATTTCTATCAAGCTTCTACTCGGACCAGTGTTGTAATGGATCCTGTTGTTGAAGGATTCACTGAAAGCCATTTGGAAGAGACCACCCGACCAAGAGACCGTTATGATCTTTTCAAAGAACCATTCATCATTGAAGGTTCTTATATGGACACTGCCCCTTTTGCGATGGAACAGTATGGCAAAGTTTTGGGTTCAGGAACTGAAAGTTCGCTGAAGAGTGAAAGAGAAGCATATATAAGCAGCGAGAAATTACTCTTGCCTAAAGAAGATGGTTATAGGACAAATTATGGGAAATGGTCGAATGAGGATGGATTAAGTGGATCATTAGTATCAAAACATGATTTGAGCGACATGGAAGACAGTAGAAAGCTGAGATGGGAAGCCCCACATTCAACAAAGCCGAGGGTTGAAGGAACAAGATGTAGAATGCATGATCCTAGGTCTGGTTCATCTAGAAAATCAAATGTGTTTAGCAGAATCCAGTTTTTAAGCCATAGAGTTGAAAAGAGTGCTGTTAAAGATACTGACATCAATTTAATTGGTAGAGACAAGCGATGGAATGACGAGGATACTTCTATATCCTTGACATCCTCTAAACGGTCGTTGCCTTGGGTAATAAACCATGCCTCTCCGCGTTCAAAGCCTAAGCGTAGAGACCTAAAGAAGCGTTTGGGTTTCCCCTTAGGGGATCCCAGTTCAAACCCTTTAGTAAGAGAACGAGAAGGTAAAACAAACAAGCGTCTGAGGAAGACGAAAGTCAATCATAGGTGCCTTGATGTTCAAACAGGTGATTACTTGGAAGAGAAGGTGCAAAGTCCAACCAGTAGGCCACCACTTGAAGATTCAGAGGAGTTGAACCAGCTAATAAAGAGCGCCTTTCTCAAGTTTGTCAAAGTTCTGAATGAGAATCCAGCCAGACGAAAGAAGTTCAGAGAGCCGGGGTCTGGTATTATAAAGTGCATTGTCTGTGGCAGCAAGTCCAAGGAGTTTGCAGATGCACTAAGCTTATCACAACATGCCTCCCAGACGTTGGGAGGATTGAGGGCAGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGCAGCGCCAAACGGTCTATGGATTCGAAGGATATTACCTCTTGTAGAAGTACTTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTGCAATTGATAACCTGTCCGAACGGGTAGCCATAAGTTGTGAGGAGCTGGAGGTTGTCATTAGAGGAATGGGTTGTGGAGGGAAGATCGAAGTGGTACGTGGTAAACCTGGAAACCATAGTATTATGGTAGCAACTTTTGGTGCAATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAATGTTTGCAGATAAGAGTCATGGTAGGCACGAGTTCCATAAAATCAATTCGAGTCATCTCATCGACAGCCACAATGATCTGCATATAGCAACAGGAGCAAACACATTGGAGAGTGTACTGTATGGTTACTTAGGCCTCGCAGAGGACTTGGATAAACTTGACTTCGAGACCAAGAAGCGATCTGTGGTGAAAAGCAAGAAAGAAATCCAAGCCATTGTGAATGCGTCCCTTGACTGTTAG
Coding sequence (CDS)
ATGAGCTGTAGAGAAACGAGTGGAGATAAGAGGTCTCGGTCTCCTTCTCCGTCGTCGTTTGGACGGAGAACTTCGGAACTTCGGGTTGCAGAAAATCCACATTGTCATTTGCACTGGTTTTCCCGTTCTTCACGGGAAGGACCGACGACGAATGACCTTGCGGGTTCTTCTATCAGAAACCATGACAATGGAAGTCGTCTTTGTGAAAATAAAGACGAACATTTCCGCAAACTCTCTCAGTTTTGCGAGAATTTACAATGGGAATCGGCGTCGAAAAAGTTTCGGTGGGAAAATTTGTTTGCCAATAATCCCGCCAATGCGAATTCGAAATCGAGTATAGGGTTGAAACATGGAAATATGTGTGATGGTCGTCATAATCGAGGAATTAGGGTTTCTGGTTCACATTTGGGTACGTCGTCCAAGGAAATTTTAGTTGGTAATAATTTGCATATGAACATTGGGGCAATTAAAGATAGTAACGTAAAGAACAATGGGGATACTTCCAGAAGCTTTGGAATCGATGACTATAGCCATTTGTCTTCATCTAGAAAGTTTGATGGGCCCTCTTACGAGACCAATGATGTTCATGTTCGGGATCGTCTGATCTTTGAATCAGCAGAAAATTCCTACAGAGGAAGACGAAACGAAACTTCTTCACATGGGATACAAGCGTCTCATCTACAGTCCAGTGCACCTGTTACTGAATCTAAGAGCATTTTGCAAGATGAATTTCATGATTTACTGGAGTATAAACGAGCTCGAAGGAACCATATTGAGCACTTTGACGATAGCAATCAGTATTTCTCAGTTCAGCCATGTAAGAGGAGTGACATTGATGCTGCTCTCAACAGTCCTTTCTCTCAGCAATTGGTTCGTATCCCGCAAGATGATTTCTATCAAGCTTCTACTCGGACCAGTGTTGTAATGGATCCTGTTGTTGAAGGATTCACTGAAAGCCATTTGGAAGAGACCACCCGACCAAGAGACCGTTATGATCTTTTCAAAGAACCATTCATCATTGAAGGTTCTTATATGGACACTGCCCCTTTTGCGATGGAACAGTATGGCAAAGTTTTGGGTTCAGGAACTGAAAGTTCGCTGAAGAGTGAAAGAGAAGCATATATAAGCAGCGAGAAATTACTCTTGCCTAAAGAAGATGGTTATAGGACAAATTATGGGAAATGGTCGAATGAGGATGGATTAAGTGGATCATTAGTATCAAAACATGATTTGAGCGACATGGAAGACAGTAGAAAGCTGAGATGGGAAGCCCCACATTCAACAAAGCCGAGGGTTGAAGGAACAAGATGTAGAATGCATGATCCTAGGTCTGGTTCATCTAGAAAATCAAATGTGTTTAGCAGAATCCAGTTTTTAAGCCATAGAGTTGAAAAGAGTGCTGTTAAAGATACTGACATCAATTTAATTGGTAGAGACAAGCGATGGAATGACGAGGATACTTCTATATCCTTGACATCCTCTAAACGGTCGTTGCCTTGGGTAATAAACCATGCCTCTCCGCGTTCAAAGCCTAAGCGTAGAGACCTAAAGAAGCGTTTGGGTTTCCCCTTAGGGGATCCCAGTTCAAACCCTTTAGTAAGAGAACGAGAAGGTAAAACAAACAAGCGTCTGAGGAAGACGAAAGTCAATCATAGGTGCCTTGATGTTCAAACAGGTGATTACTTGGAAGAGAAGGTGCAAAGTCCAACCAGTAGGCCACCACTTGAAGATTCAGAGGAGTTGAACCAGCTAATAAAGAGCGCCTTTCTCAAGTTTGTCAAAGTTCTGAATGAGAATCCAGCCAGACGAAAGAAGTTCAGAGAGCCGGGGTCTGGTATTATAAAGTGCATTGTCTGTGGCAGCAAGTCCAAGGAGTTTGCAGATGCACTAAGCTTATCACAACATGCCTCCCAGACGTTGGGAGGATTGAGGGCAGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGCAGCGCCAAACGGTCTATGGATTCGAAGGATATTACCTCTTGTAGAAGTACTTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTGCAATTGATAACCTGTCCGAACGGGTAGCCATAAGTTGTGAGGAGCTGGAGGTTGTCATTAGAGGAATGGGTTGTGGAGGGAAGATCGAAGTGGTACGTGGTAAACCTGGAAACCATAGTATTATGGTAGCAACTTTTGGTGCAATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAATGTTTGCAGATAAGAGTCATGGTAGGCACGAGTTCCATAAAATCAATTCGAGTCATCTCATCGACAGCCACAATGATCTGCATATAGCAACAGGAGCAAACACATTGGAGAGTGTACTGTATGGTTACTTAGGCCTCGCAGAGGACTTGGATAAACTTGACTTCGAGACCAAGAAGCGATCTGTGGTGAAAAGCAAGAAAGAAATCCAAGCCATTGTGAATGCGTCCCTTGACTGTTAG
Protein sequence
MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPTTNDLAGSSIRNHDNGSRLCENKDEHFRKLSQFCENLQWESASKKFRWENLFANNPANANSKSSIGLKHGNMCDGRHNRGIRVSGSHLGTSSKEILVGNNLHMNIGAIKDSNVKNNGDTSRSFGIDDYSHLSSSRKFDGPSYETNDVHVRDRLIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESKSILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDDFYQASTRTSVVMDPVVEGFTESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQYGKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWSNEDGLSGSLVSKHDLSDMEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDINLIGRDKRWNDEDTSISLTSSKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMVATFGAMFSGLQEAERLHKMFADKSHGRHEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC
Homology
BLAST of CaUC02G047300 vs. NCBI nr
Match:
XP_038900433.1 (uncharacterized protein LOC120087658 [Benincasa hispida])
HSP 1 Score: 1333.9 bits (3451), Expect = 0.0e+00
Identity = 707/848 (83.37%), Postives = 744/848 (87.74%), Query Frame = 0
Query: 1 MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPTTNDLAGSSIRN 60
M+ RETS DKRS+ SPSSFGRRTSE RV ENPHCH WFSRSSRE P TN LAGSSIR+
Sbjct: 1 MNYRETSCDKRSQ--SPSSFGRRTSEPRVEENPHCHSLWFSRSSREVPVTNGLAGSSIRD 60
Query: 61 HDNGSRLCENKDEHFRKLSQFCENLQWESASKKFRWENLFANNPANANSKSSIGLKHGNM 120
H NGSRL EN DEHFRKLSQ CENLQ ES SKKFRWENLFANNPANANSKSS+GLKH N+
Sbjct: 61 HYNGSRLYENTDEHFRKLSQLCENLQRESPSKKFRWENLFANNPANANSKSSMGLKHENI 120
Query: 121 CDGRHNRGIRVSGSHLGTSSKEILVGNNL---HMNIGAIKDSNVKNNGDTSRSFGIDDYS 180
CDG +NRGIRVSGSHLGTSS IL G+NL HMNIG KDSNVKNNGD SRSFGIDD S
Sbjct: 121 CDG-YNRGIRVSGSHLGTSSNNILGGSNLRTFHMNIGETKDSNVKNNGDISRSFGIDDCS 180
Query: 181 HLSSSRKFDGPSYETNDVHVRDRLIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESK 240
HLSSSRKFDGP YET+DVHVRDR IFESAENS+RGRRN SSHG+QAS+LQSSAPVTESK
Sbjct: 181 HLSSSRKFDGPLYETSDVHVRDRPIFESAENSHRGRRNVASSHGLQASNLQSSAPVTESK 240
Query: 241 SILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD 300
I QDEFHD LEYKRARRN+IE FDDSNQYFSVQP KRSDIDA LNS FSQQ+VRIPQDD
Sbjct: 241 GISQDEFHDFLEYKRARRNNIEQFDDSNQYFSVQPGKRSDIDATLNSTFSQQMVRIPQDD 300
Query: 301 FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQY 360
FYQ STRTSVVMD VVEGF TESHLEETTRPRDRYD FKEPF+IEGSYM TAPF ME Y
Sbjct: 301 FYQDSTRTSVVMDSVVEGFKDTESHLEETTRPRDRYDSFKEPFVIEGSYMGTAPFEMELY 360
Query: 361 GKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWSNEDGLSGSLVSKH--DLSD 420
G+ LGSG ESS+K EREAYISSEKLLL +EDGYRT YGKW +EDG++GSLVSKH DLSD
Sbjct: 361 GEGLGSGAESSMKGEREAYISSEKLLLAEEDGYRTYYGKWLHEDGVNGSLVSKHKQDLSD 420
Query: 421 MEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDI 480
ME SRKLRW+A +STK RVEGTRC MH+P S SSRK NVFSRIQFLSH E AVKDTDI
Sbjct: 421 MEGSRKLRWKATNSTKLRVEGTRCIMHEPGSCSSRKPNVFSRIQFLSHGDENIAVKDTDI 480
Query: 481 NLIGRDKRWNDEDTSISLTSSKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLV 540
NL R K WN+EDTSI LTSSKR LPWVINHASP SK KRRDL+KRLGFPL DPSS+PLV
Sbjct: 481 NLNCRSKWWNEEDTSIYLTSSKRPLPWVINHASPHSKLKRRDLRKRLGFPLRDPSSSPLV 540
Query: 541 REREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV 600
R+R+ K NKRLRK VNH CLDVQT DY+EEKVQSPTSR LED EELNQLIKSAFLKFV
Sbjct: 541 RDRKRKKNKRLRKRNVNHSCLDVQTDDYMEEKVQSPTSR-LLEDQEELNQLIKSAFLKFV 600
Query: 601 KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKA 660
KVL+ENPARRKKF EPG GIIKCIVCGSKSKEFADALSLSQHASQTL G RAEHLGL KA
Sbjct: 601 KVLSENPARRKKFTEPGCGIIKCIVCGSKSKEFADALSLSQHASQTLEGSRAEHLGLQKA 660
Query: 661 LCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAIS 720
LCWLMGWSSEAAP+G W+RRILPL EVLALKEDLIIWPPVLIIHNSSIAID+ SERVAIS
Sbjct: 661 LCWLMGWSSEAAPDGRWVRRILPLEEVLALKEDLIIWPPVLIIHNSSIAIDSPSERVAIS 720
Query: 721 CEELEVVIRGMGCGGKIEVVRGKPGNHSIMVATFGAMFSGLQEAERLHKMFADKSHGRHE 780
CEELEVVIRGMGCGGKI+VVRGKPGN SIM+ TF AMFSGLQEAERLHK FADKSHGR E
Sbjct: 721 CEELEVVIRGMGCGGKIKVVRGKPGNQSIMIVTFDAMFSGLQEAERLHKSFADKSHGRDE 780
Query: 781 FHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQA 840
F KI SSHLIDSH DLH ATGANTL++VLYGYLGL EDLDKLDFETKKRSVVKSKKEIQA
Sbjct: 781 FQKIYSSHLIDSHKDLHKATGANTLDNVLYGYLGLTEDLDKLDFETKKRSVVKSKKEIQA 840
Query: 841 IVNASLDC 842
IVNASL C
Sbjct: 841 IVNASLHC 844
BLAST of CaUC02G047300 vs. NCBI nr
Match:
XP_008458617.1 (PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo] >KAA0033382.1 uncharacterized protein E6C27_scaffold111G00320 [Cucumis melo var. makuwa])
HSP 1 Score: 1244.6 bits (3219), Expect = 0.0e+00
Identity = 668/848 (78.77%), Postives = 714/848 (84.20%), Query Frame = 0
Query: 1 MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPTTNDLAGSSIRN 60
M+ RE + DKRS+ SPS FGRRTSE RV E PHC+ HWFSRSSRE P TN+L GSSIR+
Sbjct: 1 MNSREMNRDKRSQ--SPSLFGRRTSEPRVEEYPHCYSHWFSRSSRERPMTNELPGSSIRD 60
Query: 61 HDNGSRLCENKDEHFRKLSQFCENLQWESASKKFRWENLFANNP-ANANSKSSIGLKHGN 120
H NGSRL +KDEHFRKLSQFCENLQ ES +KKF+WENLF NN AN NSK+S+GLKH N
Sbjct: 61 HYNGSRLYFHKDEHFRKLSQFCENLQGESPAKKFQWENLFVNNNLANGNSKASMGLKHVN 120
Query: 121 MCDGRHNRGIRVSGSHLGTSSKEILVGN--NLHMNIGAIKDSNVKNNGDTSRSFGIDDYS 180
DG NRGIRVSGSHLGTSSK IL GN HMNIGA KDSNVKNNGDTSRS GI+D +
Sbjct: 121 GSDG-DNRGIRVSGSHLGTSSKSILGGNLRTFHMNIGATKDSNVKNNGDTSRSVGINDCN 180
Query: 181 HLSSSRKFDGPSYETNDVHVRDRLIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESK 240
HLSSSRK+DGP ++ N+VHVRDR IFE ENS+RGRRNETSS GIQASHL SSAPV ESK
Sbjct: 181 HLSSSRKYDGPLHDINEVHVRDRPIFELVENSHRGRRNETSSRGIQASHLHSSAPVAESK 240
Query: 241 SILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD 300
I Q EFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKR+DIDA + PFSQ +VRIPQDD
Sbjct: 241 GISQGEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRTDIDAGPSRPFSQHMVRIPQDD 300
Query: 301 FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQY 360
FY+ STRTSVVMD VVEGF TESH EETTRPRD ++ F IEGS M TAPFAMEQY
Sbjct: 301 FYRDSTRTSVVMDSVVEGFQDTESHFEETTRPRD-HNAF-----IEGSCMSTAPFAMEQY 360
Query: 361 GKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWSNEDGLSGSLVSKH--DLSD 420
+VLGSGTESS EREAYISSEKLLL +EDGYRTN+GKW+ EDG++GS VSKH DL D
Sbjct: 361 VEVLGSGTESSQDGEREAYISSEKLLLVEEDGYRTNFGKWTLEDGVNGSSVSKHKQDLGD 420
Query: 421 MEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDI 480
MED RKL W+A HSTKPRVEG R +MHDP GS +K NVFSRIQFL+H VKDTD
Sbjct: 421 MEDRRKLTWKAQHSTKPRVEGARSKMHDPGPGSFKKPNVFSRIQFLNH----GDVKDTDF 480
Query: 481 NLIGRDKRWNDEDTSISLTSSKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLV 540
NL R+ DEDTS SSKR LPWV+NH SPRSK KRR+LKKRLG PLGDP+SN LV
Sbjct: 481 NLNCRNNWQVDEDTSF---SSKRQLPWVVNHVSPRSKLKRRNLKKRLGLPLGDPNSNSLV 540
Query: 541 REREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV 600
RERE K NKRLRKT V+H CLDVQTGDYLEEKVQSPTSRPPLED EELNQLIKSAFLKFV
Sbjct: 541 RERERKRNKRLRKTNVDHGCLDVQTGDYLEEKVQSPTSRPPLEDPEELNQLIKSAFLKFV 600
Query: 601 KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKA 660
KVL+ENPARRKK EPG GII CIVCGSKSKEF DALSLSQHAS+TL G RAEHLGLHKA
Sbjct: 601 KVLSENPARRKKLTEPGCGIITCIVCGSKSKEFVDALSLSQHASRTLEGSRAEHLGLHKA 660
Query: 661 LCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAIS 720
LCWLMGWSSE APNGLW+RRILPL EVLALKEDLIIWPPVLIIHNSSIAID LS+ VAIS
Sbjct: 661 LCWLMGWSSETAPNGLWVRRILPLEEVLALKEDLIIWPPVLIIHNSSIAIDKLSDGVAIS 720
Query: 721 CEELEVVIRGMGCGGKIEVVRGKPGNHSIMVATFGAMFSGLQEAERLHKMFADKSHGRHE 780
CEELE VIRGMGCGGKI+VVRG+PGN SIMV TFGAMFSGLQEAERLHK FADKSHGR E
Sbjct: 721 CEELEAVIRGMGCGGKIKVVRGEPGNQSIMVVTFGAMFSGLQEAERLHKSFADKSHGRDE 780
Query: 781 FHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQA 840
HKIN HLIDS+ DLH ATGANTLESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQA
Sbjct: 781 VHKINLRHLIDSNVDLHKATGANTLESVLYGYLGLAEDLVKLDFETKKRSVVKSKKEIQA 832
Query: 841 IVNASLDC 842
IVNASL C
Sbjct: 841 IVNASLQC 832
BLAST of CaUC02G047300 vs. NCBI nr
Match:
XP_011657058.1 (uncharacterized protein LOC105435801 [Cucumis sativus] >KGN46926.1 hypothetical protein Csa_020974 [Cucumis sativus])
HSP 1 Score: 348.6 bits (893), Expect = 1.5e-91
Identity = 179/221 (81.00%), Postives = 189/221 (85.52%), Query Frame = 0
Query: 621 SKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEV 680
SKSKEF DALSL QHAS+TL G RAEHLGLHKALCWLMGWSSE APNGLW+R ILP VEV
Sbjct: 34 SKSKEFVDALSLPQHASRTLEGSRAEHLGLHKALCWLMGWSSEIAPNGLWVRMILPPVEV 93
Query: 681 LALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMGCGGKIEVVRGKPGNH 740
LALKEDLIIWP VLIIHNSSIAID E VAISCE+LE +R MGCGGK +VVRGK N
Sbjct: 94 LALKEDLIIWPSVLIIHNSSIAIDKRYEGVAISCEKLEAAVRAMGCGGKFKVVRGKAVNQ 153
Query: 741 SIMVATFGAMFSGLQEAERLHKMFADKSHGRHEFHKINSSHLIDSHNDLHIATGANTLES 800
SIMV TFGAMF GLQEAERLH FADKSHGR EFHKIN L+DS+ D+H ATGANTLES
Sbjct: 154 SIMVVTFGAMFYGLQEAERLHNNFADKSHGRDEFHKINLRCLVDSNVDMHKATGANTLES 213
Query: 801 VLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC 842
V YGYLGL EDLDKLDFETKKRSVV+SKKEIQAIV+ASL C
Sbjct: 214 VRYGYLGLVEDLDKLDFETKKRSVVRSKKEIQAIVHASLQC 254
BLAST of CaUC02G047300 vs. NCBI nr
Match:
XP_017982234.1 (PREDICTED: uncharacterized protein LOC18590378 [Theobroma cacao])
HSP 1 Score: 304.7 bits (779), Expect = 2.5e-78
Identity = 174/331 (52.57%), Postives = 222/331 (67.07%), Query Frame = 0
Query: 513 RRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSR 572
R+ +K+RLG P + N + R K K L++ VN VQ D V+ +
Sbjct: 291 RKSIKQRLGPPCHVHNPNYMPRVERHKMRKLLQE-NVNDFPEGVQARDVDLRHVKRGRTE 350
Query: 573 PPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPG-SGIIKCIVCGSKSKEFADALS 632
PP EDSEE Q I AF+KFVK+LNENPA+R+K+RE G +G +KC VCGSKS+EF + LS
Sbjct: 351 PP-EDSEEFEQQIHGAFVKFVKILNENPAQRRKYREKGEAGTLKCCVCGSKSEEFVNTLS 410
Query: 633 LSQHA-SQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIW 692
L HA + + GLRA HLGLHK+LC+LMGW+S AA NGLW ++ LP VE LA+KEDL+IW
Sbjct: 411 LVTHAFTSRMVGLRANHLGLHKSLCFLMGWNSVAASNGLWRQKTLPDVEALAMKEDLVIW 470
Query: 693 PPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMGCG-GKIEVVRGKPGNHSIMVATFGA 752
PP++I+HNSSIA N R+ +S EE+E +R MG G G +V RGKP N SIM F
Sbjct: 471 PPIVILHNSSIATTNSDNRIIVSIEEIEAFLRDMGFGWGISKVCRGKPANQSIMTVIFHG 530
Query: 753 MFSGLQEAERLHKMFADKSHGRHEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLA 812
FSGL+EAERLHK++A+ HGR EF +IN S L + ++ VLYGYLG+A
Sbjct: 531 TFSGLKEAERLHKLYAENKHGRAEFQQINCSSGETKKAPL------DKVKDVLYGYLGIA 590
Query: 813 EDLDKLDFETKKRSVVKSKKEIQAIVNASLD 841
DLDKLDFETK R++VKSKKEI A +A L+
Sbjct: 591 GDLDKLDFETKSRALVKSKKEIYATADALLN 613
BLAST of CaUC02G047300 vs. NCBI nr
Match:
XP_021279328.1 (uncharacterized protein LOC110412979 [Herrania umbratica])
HSP 1 Score: 302.8 bits (774), Expect = 9.5e-78
Identity = 174/331 (52.57%), Postives = 221/331 (66.77%), Query Frame = 0
Query: 513 RRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSR 572
R+ +K+RLG P + N + R + K K L K VN VQ D V+ +
Sbjct: 718 RKRIKQRLGPPCHVHNPNYMPRTQRHKMRK-LLKENVNDFHEGVQARDVDLRHVKRGRTE 777
Query: 573 PPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPG-SGIIKCIVCGSKSKEFADALS 632
PP EDS+E Q I+ AF+++VK+LNENPA+R+K+ E G +G +KC VCGSKS+EF + LS
Sbjct: 778 PP-EDSKEFEQQIRGAFVQYVKILNENPAQRRKYTEKGEAGTLKCCVCGSKSEEFVNTLS 837
Query: 633 LSQHA-SQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIW 692
L HA + + GLR HLGLHKALC+LMGW+S AA NGLW ++ LP VE LA+KEDL+IW
Sbjct: 838 LVTHAFTSRMVGLRVNHLGLHKALCFLMGWNSVAASNGLWRQKTLPDVEALAMKEDLVIW 897
Query: 693 PPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMGCG-GKIEVVRGKPGNHSIMVATFGA 752
PPV+I+HNSSIA N R+ +S EE+E +R MG G G +V RGKP N SIM F
Sbjct: 898 PPVVILHNSSIATTNSDHRIIVSIEEIEAFLRDMGFGRGISKVCRGKPANQSIMTVIFHG 957
Query: 753 MFSGLQEAERLHKMFADKSHGRHEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLA 812
FSGL+EAERLHK++A+ HGR EF +IN S L + +E VLYGYLG+A
Sbjct: 958 TFSGLKEAERLHKLYAENKHGRAEFQQINCSTGETKKVPL------DKVEDVLYGYLGIA 1017
Query: 813 EDLDKLDFETKKRSVVKSKKEIQAIVNASLD 841
DLDKLDFETK R++VKSKKEI A +A LD
Sbjct: 1018 GDLDKLDFETKSRALVKSKKEIYATADALLD 1040
BLAST of CaUC02G047300 vs. ExPASy TrEMBL
Match:
A0A5A7SQC0 (XS domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold111G00320 PE=4 SV=1)
HSP 1 Score: 1244.6 bits (3219), Expect = 0.0e+00
Identity = 668/848 (78.77%), Postives = 714/848 (84.20%), Query Frame = 0
Query: 1 MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPTTNDLAGSSIRN 60
M+ RE + DKRS+ SPS FGRRTSE RV E PHC+ HWFSRSSRE P TN+L GSSIR+
Sbjct: 1 MNSREMNRDKRSQ--SPSLFGRRTSEPRVEEYPHCYSHWFSRSSRERPMTNELPGSSIRD 60
Query: 61 HDNGSRLCENKDEHFRKLSQFCENLQWESASKKFRWENLFANNP-ANANSKSSIGLKHGN 120
H NGSRL +KDEHFRKLSQFCENLQ ES +KKF+WENLF NN AN NSK+S+GLKH N
Sbjct: 61 HYNGSRLYFHKDEHFRKLSQFCENLQGESPAKKFQWENLFVNNNLANGNSKASMGLKHVN 120
Query: 121 MCDGRHNRGIRVSGSHLGTSSKEILVGN--NLHMNIGAIKDSNVKNNGDTSRSFGIDDYS 180
DG NRGIRVSGSHLGTSSK IL GN HMNIGA KDSNVKNNGDTSRS GI+D +
Sbjct: 121 GSDG-DNRGIRVSGSHLGTSSKSILGGNLRTFHMNIGATKDSNVKNNGDTSRSVGINDCN 180
Query: 181 HLSSSRKFDGPSYETNDVHVRDRLIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESK 240
HLSSSRK+DGP ++ N+VHVRDR IFE ENS+RGRRNETSS GIQASHL SSAPV ESK
Sbjct: 181 HLSSSRKYDGPLHDINEVHVRDRPIFELVENSHRGRRNETSSRGIQASHLHSSAPVAESK 240
Query: 241 SILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD 300
I Q EFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKR+DIDA + PFSQ +VRIPQDD
Sbjct: 241 GISQGEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRTDIDAGPSRPFSQHMVRIPQDD 300
Query: 301 FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQY 360
FY+ STRTSVVMD VVEGF TESH EETTRPRD ++ F IEGS M TAPFAMEQY
Sbjct: 301 FYRDSTRTSVVMDSVVEGFQDTESHFEETTRPRD-HNAF-----IEGSCMSTAPFAMEQY 360
Query: 361 GKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWSNEDGLSGSLVSKH--DLSD 420
+VLGSGTESS EREAYISSEKLLL +EDGYRTN+GKW+ EDG++GS VSKH DL D
Sbjct: 361 VEVLGSGTESSQDGEREAYISSEKLLLVEEDGYRTNFGKWTLEDGVNGSSVSKHKQDLGD 420
Query: 421 MEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDI 480
MED RKL W+A HSTKPRVEG R +MHDP GS +K NVFSRIQFL+H VKDTD
Sbjct: 421 MEDRRKLTWKAQHSTKPRVEGARSKMHDPGPGSFKKPNVFSRIQFLNH----GDVKDTDF 480
Query: 481 NLIGRDKRWNDEDTSISLTSSKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLV 540
NL R+ DEDTS SSKR LPWV+NH SPRSK KRR+LKKRLG PLGDP+SN LV
Sbjct: 481 NLNCRNNWQVDEDTSF---SSKRQLPWVVNHVSPRSKLKRRNLKKRLGLPLGDPNSNSLV 540
Query: 541 REREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV 600
RERE K NKRLRKT V+H CLDVQTGDYLEEKVQSPTSRPPLED EELNQLIKSAFLKFV
Sbjct: 541 RERERKRNKRLRKTNVDHGCLDVQTGDYLEEKVQSPTSRPPLEDPEELNQLIKSAFLKFV 600
Query: 601 KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKA 660
KVL+ENPARRKK EPG GII CIVCGSKSKEF DALSLSQHAS+TL G RAEHLGLHKA
Sbjct: 601 KVLSENPARRKKLTEPGCGIITCIVCGSKSKEFVDALSLSQHASRTLEGSRAEHLGLHKA 660
Query: 661 LCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAIS 720
LCWLMGWSSE APNGLW+RRILPL EVLALKEDLIIWPPVLIIHNSSIAID LS+ VAIS
Sbjct: 661 LCWLMGWSSETAPNGLWVRRILPLEEVLALKEDLIIWPPVLIIHNSSIAIDKLSDGVAIS 720
Query: 721 CEELEVVIRGMGCGGKIEVVRGKPGNHSIMVATFGAMFSGLQEAERLHKMFADKSHGRHE 780
CEELE VIRGMGCGGKI+VVRG+PGN SIMV TFGAMFSGLQEAERLHK FADKSHGR E
Sbjct: 721 CEELEAVIRGMGCGGKIKVVRGEPGNQSIMVVTFGAMFSGLQEAERLHKSFADKSHGRDE 780
Query: 781 FHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQA 840
HKIN HLIDS+ DLH ATGANTLESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQA
Sbjct: 781 VHKINLRHLIDSNVDLHKATGANTLESVLYGYLGLAEDLVKLDFETKKRSVVKSKKEIQA 832
Query: 841 IVNASLDC 842
IVNASL C
Sbjct: 841 IVNASLQC 832
BLAST of CaUC02G047300 vs. ExPASy TrEMBL
Match:
A0A1S3C894 (uncharacterized protein LOC103497964 OS=Cucumis melo OX=3656 GN=LOC103497964 PE=4 SV=1)
HSP 1 Score: 1244.6 bits (3219), Expect = 0.0e+00
Identity = 668/848 (78.77%), Postives = 714/848 (84.20%), Query Frame = 0
Query: 1 MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPTTNDLAGSSIRN 60
M+ RE + DKRS+ SPS FGRRTSE RV E PHC+ HWFSRSSRE P TN+L GSSIR+
Sbjct: 1 MNSREMNRDKRSQ--SPSLFGRRTSEPRVEEYPHCYSHWFSRSSRERPMTNELPGSSIRD 60
Query: 61 HDNGSRLCENKDEHFRKLSQFCENLQWESASKKFRWENLFANNP-ANANSKSSIGLKHGN 120
H NGSRL +KDEHFRKLSQFCENLQ ES +KKF+WENLF NN AN NSK+S+GLKH N
Sbjct: 61 HYNGSRLYFHKDEHFRKLSQFCENLQGESPAKKFQWENLFVNNNLANGNSKASMGLKHVN 120
Query: 121 MCDGRHNRGIRVSGSHLGTSSKEILVGN--NLHMNIGAIKDSNVKNNGDTSRSFGIDDYS 180
DG NRGIRVSGSHLGTSSK IL GN HMNIGA KDSNVKNNGDTSRS GI+D +
Sbjct: 121 GSDG-DNRGIRVSGSHLGTSSKSILGGNLRTFHMNIGATKDSNVKNNGDTSRSVGINDCN 180
Query: 181 HLSSSRKFDGPSYETNDVHVRDRLIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESK 240
HLSSSRK+DGP ++ N+VHVRDR IFE ENS+RGRRNETSS GIQASHL SSAPV ESK
Sbjct: 181 HLSSSRKYDGPLHDINEVHVRDRPIFELVENSHRGRRNETSSRGIQASHLHSSAPVAESK 240
Query: 241 SILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD 300
I Q EFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKR+DIDA + PFSQ +VRIPQDD
Sbjct: 241 GISQGEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRTDIDAGPSRPFSQHMVRIPQDD 300
Query: 301 FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQY 360
FY+ STRTSVVMD VVEGF TESH EETTRPRD ++ F IEGS M TAPFAMEQY
Sbjct: 301 FYRDSTRTSVVMDSVVEGFQDTESHFEETTRPRD-HNAF-----IEGSCMSTAPFAMEQY 360
Query: 361 GKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWSNEDGLSGSLVSKH--DLSD 420
+VLGSGTESS EREAYISSEKLLL +EDGYRTN+GKW+ EDG++GS VSKH DL D
Sbjct: 361 VEVLGSGTESSQDGEREAYISSEKLLLVEEDGYRTNFGKWTLEDGVNGSSVSKHKQDLGD 420
Query: 421 MEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDI 480
MED RKL W+A HSTKPRVEG R +MHDP GS +K NVFSRIQFL+H VKDTD
Sbjct: 421 MEDRRKLTWKAQHSTKPRVEGARSKMHDPGPGSFKKPNVFSRIQFLNH----GDVKDTDF 480
Query: 481 NLIGRDKRWNDEDTSISLTSSKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLV 540
NL R+ DEDTS SSKR LPWV+NH SPRSK KRR+LKKRLG PLGDP+SN LV
Sbjct: 481 NLNCRNNWQVDEDTSF---SSKRQLPWVVNHVSPRSKLKRRNLKKRLGLPLGDPNSNSLV 540
Query: 541 REREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV 600
RERE K NKRLRKT V+H CLDVQTGDYLEEKVQSPTSRPPLED EELNQLIKSAFLKFV
Sbjct: 541 RERERKRNKRLRKTNVDHGCLDVQTGDYLEEKVQSPTSRPPLEDPEELNQLIKSAFLKFV 600
Query: 601 KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKA 660
KVL+ENPARRKK EPG GII CIVCGSKSKEF DALSLSQHAS+TL G RAEHLGLHKA
Sbjct: 601 KVLSENPARRKKLTEPGCGIITCIVCGSKSKEFVDALSLSQHASRTLEGSRAEHLGLHKA 660
Query: 661 LCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAIS 720
LCWLMGWSSE APNGLW+RRILPL EVLALKEDLIIWPPVLIIHNSSIAID LS+ VAIS
Sbjct: 661 LCWLMGWSSETAPNGLWVRRILPLEEVLALKEDLIIWPPVLIIHNSSIAIDKLSDGVAIS 720
Query: 721 CEELEVVIRGMGCGGKIEVVRGKPGNHSIMVATFGAMFSGLQEAERLHKMFADKSHGRHE 780
CEELE VIRGMGCGGKI+VVRG+PGN SIMV TFGAMFSGLQEAERLHK FADKSHGR E
Sbjct: 721 CEELEAVIRGMGCGGKIKVVRGEPGNQSIMVVTFGAMFSGLQEAERLHKSFADKSHGRDE 780
Query: 781 FHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQA 840
HKIN HLIDS+ DLH ATGANTLESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQA
Sbjct: 781 VHKINLRHLIDSNVDLHKATGANTLESVLYGYLGLAEDLVKLDFETKKRSVVKSKKEIQA 832
Query: 841 IVNASLDC 842
IVNASL C
Sbjct: 841 IVNASLQC 832
BLAST of CaUC02G047300 vs. ExPASy TrEMBL
Match:
A0A0A0KGN5 (XS domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G151630 PE=4 SV=1)
HSP 1 Score: 348.6 bits (893), Expect = 7.3e-92
Identity = 179/221 (81.00%), Postives = 189/221 (85.52%), Query Frame = 0
Query: 621 SKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEV 680
SKSKEF DALSL QHAS+TL G RAEHLGLHKALCWLMGWSSE APNGLW+R ILP VEV
Sbjct: 34 SKSKEFVDALSLPQHASRTLEGSRAEHLGLHKALCWLMGWSSEIAPNGLWVRMILPPVEV 93
Query: 681 LALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMGCGGKIEVVRGKPGNH 740
LALKEDLIIWP VLIIHNSSIAID E VAISCE+LE +R MGCGGK +VVRGK N
Sbjct: 94 LALKEDLIIWPSVLIIHNSSIAIDKRYEGVAISCEKLEAAVRAMGCGGKFKVVRGKAVNQ 153
Query: 741 SIMVATFGAMFSGLQEAERLHKMFADKSHGRHEFHKINSSHLIDSHNDLHIATGANTLES 800
SIMV TFGAMF GLQEAERLH FADKSHGR EFHKIN L+DS+ D+H ATGANTLES
Sbjct: 154 SIMVVTFGAMFYGLQEAERLHNNFADKSHGRDEFHKINLRCLVDSNVDMHKATGANTLES 213
Query: 801 VLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC 842
V YGYLGL EDLDKLDFETKKRSVV+SKKEIQAIV+ASL C
Sbjct: 214 VRYGYLGLVEDLDKLDFETKKRSVVRSKKEIQAIVHASLQC 254
BLAST of CaUC02G047300 vs. ExPASy TrEMBL
Match:
A0A6J0ZXA5 (uncharacterized protein LOC110412979 OS=Herrania umbratica OX=108875 GN=LOC110412979 PE=4 SV=1)
HSP 1 Score: 302.8 bits (774), Expect = 4.6e-78
Identity = 174/331 (52.57%), Postives = 221/331 (66.77%), Query Frame = 0
Query: 513 RRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSR 572
R+ +K+RLG P + N + R + K K L K VN VQ D V+ +
Sbjct: 718 RKRIKQRLGPPCHVHNPNYMPRTQRHKMRK-LLKENVNDFHEGVQARDVDLRHVKRGRTE 777
Query: 573 PPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPG-SGIIKCIVCGSKSKEFADALS 632
PP EDS+E Q I+ AF+++VK+LNENPA+R+K+ E G +G +KC VCGSKS+EF + LS
Sbjct: 778 PP-EDSKEFEQQIRGAFVQYVKILNENPAQRRKYTEKGEAGTLKCCVCGSKSEEFVNTLS 837
Query: 633 LSQHA-SQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIW 692
L HA + + GLR HLGLHKALC+LMGW+S AA NGLW ++ LP VE LA+KEDL+IW
Sbjct: 838 LVTHAFTSRMVGLRVNHLGLHKALCFLMGWNSVAASNGLWRQKTLPDVEALAMKEDLVIW 897
Query: 693 PPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMGCG-GKIEVVRGKPGNHSIMVATFGA 752
PPV+I+HNSSIA N R+ +S EE+E +R MG G G +V RGKP N SIM F
Sbjct: 898 PPVVILHNSSIATTNSDHRIIVSIEEIEAFLRDMGFGRGISKVCRGKPANQSIMTVIFHG 957
Query: 753 MFSGLQEAERLHKMFADKSHGRHEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLA 812
FSGL+EAERLHK++A+ HGR EF +IN S L + +E VLYGYLG+A
Sbjct: 958 TFSGLKEAERLHKLYAENKHGRAEFQQINCSTGETKKVPL------DKVEDVLYGYLGIA 1017
Query: 813 EDLDKLDFETKKRSVVKSKKEIQAIVNASLD 841
DLDKLDFETK R++VKSKKEI A +A LD
Sbjct: 1018 GDLDKLDFETKSRALVKSKKEIYATADALLD 1040
BLAST of CaUC02G047300 vs. ExPASy TrEMBL
Match:
A0A6J1CGJ5 (uncharacterized protein LOC111011032 OS=Momordica charantia OX=3673 GN=LOC111011032 PE=4 SV=1)
HSP 1 Score: 300.8 bits (769), Expect = 1.8e-77
Identity = 154/184 (83.70%), Postives = 163/184 (88.59%), Query Frame = 0
Query: 658 MGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEEL 717
MGWSSE APNGLW++RILP VE ALKEDLIIWPPVLIIHNSSIA DN SE+V ISCEEL
Sbjct: 1 MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEEL 60
Query: 718 EVVIRGMGCGGKIEVVRGKPGNHSIMVATFGAMFSGLQEAERLHKMFADKSHGRHEFHKI 777
EVVIRGMG GGKI+VVRGKP N SIMV TF AMFSGLQEAERLHK FADKSHGR EFH+I
Sbjct: 61 EVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEI 120
Query: 778 NSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNA 837
NSSH IDSH DLH A GAN +ESVLYGYLGLAED +KLDFETKKRSVVKSKKEIQAIV+A
Sbjct: 121 NSSHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDA 180
Query: 838 SLDC 842
+L C
Sbjct: 181 TLQC 183
BLAST of CaUC02G047300 vs. TAIR 10
Match:
AT3G22430.1 (CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380); BEST Arabidopsis thaliana protein match is: XS domain-containing protein / XS zinc finger domain-containing protein-related (TAIR:AT5G23570.1); Has 565 Blast hits to 510 proteins in 121 species: Archae - 2; Bacteria - 90; Metazoa - 191; Fungi - 32; Plants - 51; Viruses - 4; Other Eukaryotes - 195 (source: NCBI BLink). )
HSP 1 Score: 133.3 bits (334), Expect = 9.3e-31
Identity = 87/254 (34.25%), Postives = 131/254 (51.57%), Query Frame = 0
Query: 585 IKSAFLKFVKVLNENPARRKKFREPG-SGIIKCIVCGSKSKEFADALSLSQHA-SQTLGG 644
+K +FL FVK + E+P +K + E G G ++C+VCG SK+ D SL H
Sbjct: 253 LKKSFLGFVKRVFEDPMEKKNYLENGRKGRLQCLVCGRSSKDVQDTHSLVMHTYCSDDSS 312
Query: 645 LRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIA 704
R HLGLHKALC LMGW+ AP+ + LP E + LIIWPP +I+ N+S
Sbjct: 313 SRVHHLGLHKALCVLMGWNFSKAPDNSKAYQNLPADEAAINQAQLIIWPPHVIVQNTSTG 372
Query: 705 IDNLSERVAISCEELEVVIRGMG-CGGKIEVVRGKPGNHSIMVATFGAMFSGLQEAERLH 764
+ ++ IR +G GGK + + G+ G+ I + F SGL++A R+
Sbjct: 373 KGKEGRMEGFGNKTMDNRIRELGLTGGKSKSLYGREGHLGITLFKFAGDDSGLRDAMRMA 432
Query: 765 KMFADKSHGRHEFHKIN--SSHLIDSHNDLHIATGANTLES--VLYGYLGLAEDLDKLDF 824
+ F + GR + ++ + D N + T E + YGYL DLDK+D
Sbjct: 433 EYFEKINRGRKSWGRVQPLTPSKDDEKNPGLVEVDGRTGEKKRIFYGYLATVTDLDKVDV 492
Query: 825 ETKKRSVVKSKKEI 832
ETKK++ ++S +E+
Sbjct: 493 ETKKKTTIESLREL 506
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038900433.1 | 0.0e+00 | 83.37 | uncharacterized protein LOC120087658 [Benincasa hispida] | [more] |
XP_008458617.1 | 0.0e+00 | 78.77 | PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo] >KAA0033382.1 unc... | [more] |
XP_011657058.1 | 1.5e-91 | 81.00 | uncharacterized protein LOC105435801 [Cucumis sativus] >KGN46926.1 hypothetical ... | [more] |
XP_017982234.1 | 2.5e-78 | 52.57 | PREDICTED: uncharacterized protein LOC18590378 [Theobroma cacao] | [more] |
XP_021279328.1 | 9.5e-78 | 52.57 | uncharacterized protein LOC110412979 [Herrania umbratica] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5A7SQC0 | 0.0e+00 | 78.77 | XS domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca... | [more] |
A0A1S3C894 | 0.0e+00 | 78.77 | uncharacterized protein LOC103497964 OS=Cucumis melo OX=3656 GN=LOC103497964 PE=... | [more] |
A0A0A0KGN5 | 7.3e-92 | 81.00 | XS domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G151630 PE=4 SV=... | [more] |
A0A6J0ZXA5 | 4.6e-78 | 52.57 | uncharacterized protein LOC110412979 OS=Herrania umbratica OX=108875 GN=LOC11041... | [more] |
A0A6J1CGJ5 | 1.8e-77 | 83.70 | uncharacterized protein LOC111011032 OS=Momordica charantia OX=3673 GN=LOC111011... | [more] |
Match Name | E-value | Identity | Description | |
AT3G22430.1 | 9.3e-31 | 34.25 | CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380); ... | [more] |