Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
GTTTGATTTAAATTTTGCTATATCTTATAATATTTTAAACATAATTGGTATATTTAGAAGGTGGTTTTAAAAAACTTAGAGATAGAAACGTATTAAAGTTGGTAATATAGGGTATAAATGATCTGTTTGGGAAGAGGGCAACCGTTTTTGGAATCCATTATACAAATTCATGAGTCTCTCTCTCATTCTCTTGAAATGCATTGTTATAACAGCAGAGTCCAATGATCTCCTCCACTGAATAGGAAACACACATGATTGCACCCAACAAACCAATCCTTTCATATTATTCTTATTTCTCTCCTTCTTCATCTCTGCTTCAATCATATAAAACTCTCTGTTTTTTCCCCACACATAGCCATTAGCCATTAGCCATTAGCCATTAGCCATTTCTCTCAAATGGCTACTTCCTCATGTACAGCCACTTCCCTCCATGGCTTCTACCACTTCCTGTCCCACCAGCTCGACCAGCTCGATGACGCCTTCGTCTCCTCCGATTTCATGTCCCTTCACTTTCTTCAGAAAGTACTTTCTCTTCTTACAACTCTGCACTCTCACCTCATCCAGCTGGGTCAGCGCCTTCACCTCCCCGTCGGCGGTAAGTGGCTCGACGAGTATATGGATGAAAGCTCCCGCCTTTGGGAAGCTTGTCAGGTCCTTAAATCTGGAATCTCAAGGATGGAGGTTTTCCATGGTGAAGCTTCTGCCATAGCTTCTTCTTTGCAGGATCCTCACTTTCTTCGTTTCAATCCTCGAGCGTCTCGACGGGTATGTATGTATGTATGTATGTATGTATGTATGTATGTATGTATGTTTTTATTTGAATTTGAATTTGATTTGGGTTCTGTTTGCCTTGAAAGTTCTGTTCACTTTCTTGTGTTTTGCTCTGTTTTGAGTTTTTTTTTTTTTTTTGGAGGGGGGTTTTGTAGGTTCTTCGTGCAATTACTGATTTCGAGAGGAATGTTTTTGGATTGGAAGAAGAAAATAGAAGCTTGATGAATACAAGGATTCAGCCATTATCACTGCTTTGTTTCAATGGTAGCGGCGCGTCGACCGGAATGAGATCGACATCAAAGTTGAATGCTTTTAATGGATTCAGAGGTGTTCTTCATGCAGTGAAGAGCATTAGCTCATTGCTTTTAATGATTCTTCTGTGTGCTCTTGTGTATTGTTGGCCAGAATCAAGTTTCCATGGAAGTAATGGGAATGAGAATGAAGATGATCACCATCAAAGAACCATGTTCAGCTCAAGCTTTGTAGCTTCAATGGAGAGATTGAGACAGAGAGTGGCAAATGAAATAGAGAGAGTAGAAGGGCAGCCAGTGGGAGTTTTGCTGTTTGAATTCAGAGAAGCAAAGGCAGCCATGGAAGGCCTGAAAATGGAGCTTGAAAAGGGTTTGGAAGAAGATGAAGAAGTTGAAATTGAAGAGAAAGTTGAGAGATTAAATGGTTGGTTTGGGTCATTAAGAATTGGAGTGGATGCCATTATTGGACAGCTTGATGATTTCTTTGATGAGATTGTTGAGGGAAGAAAGAAACTTTTGGATATGTGCACTCATAACAGATAGATAATATATTTTTTTTTTTTGTAAAAAAAAAAAAGCATCTGGGTATGTTGCAAGATGCAAACTTTCAAGAAGAATGTAGTCAAAAACAAATTTTGGAAGATGAGAGAAATTCTAGCAGAATTGTTTGTAATTTATTATTATTTTTTTTCCTTTTTGAAATCACATTGTTTATTGTAGGATGTGATTATCTTTGCAGGCTATGAGAGGTTAAAAGAAGATAATTAATAAGGGGGAGAGAGAGAGGAATTGATTCTTCAATTGCTTTGAAGAAAAGCAGCCAACAAACCA
mRNA sequence
GTTTGATTTAAATTTTGCTATATCTTATAATATTTTAAACATAATTGGTATATTTAGAAGGTGGTTTTAAAAAACTTAGAGATAGAAACGTATTAAAGTTGGTAATATAGGGTATAAATGATCTGTTTGGGAAGAGGGCAACCGTTTTTGGAATCCATTATACAAATTCATGAGTCTCTCTCTCATTCTCTTGAAATGCATTGTTATAACAGCAGAGTCCAATGATCTCCTCCACTGAATAGGAAACACACATGATTGCACCCAACAAACCAATCCTTTCATATTATTCTTATTTCTCTCCTTCTTCATCTCTGCTTCAATCATATAAAACTCTCTGTTTTTTCCCCACACATAGCCATTAGCCATTAGCCATTAGCCATTAGCCATTTCTCTCAAATGGCTACTTCCTCATGTACAGCCACTTCCCTCCATGGCTTCTACCACTTCCTGTCCCACCAGCTCGACCAGCTCGATGACGCCTTCGTCTCCTCCGATTTCATGTCCCTTCACTTTCTTCAGAAAGTACTTTCTCTTCTTACAACTCTGCACTCTCACCTCATCCAGCTGGGTCAGCGCCTTCACCTCCCCGTCGGCGGTAAGTGGCTCGACGAGTATATGGATGAAAGCTCCCGCCTTTGGGAAGCTTGTCAGGTCCTTAAATCTGGAATCTCAAGGATGGAGGTTTTCCATGGTGAAGCTTCTGCCATAGCTTCTTCTTTGCAGGATCCTCACTTTCTTCGTTTCAATCCTCGAGCGTCTCGACGGGTTCTTCGTGCAATTACTGATTTCGAGAGGAATGTTTTTGGATTGGAAGAAGAAAATAGAAGCTTGATGAATACAAGGATTCAGCCATTATCACTGCTTTGTTTCAATGGTAGCGGCGCGTCGACCGGAATGAGATCGACATCAAAGTTGAATGCTTTTAATGGATTCAGAGGTGTTCTTCATGCAGTGAAGAGCATTAGCTCATTGCTTTTAATGATTCTTCTGTGTGCTCTTGTGTATTGTTGGCCAGAATCAAGTTTCCATGGAAGTAATGGGAATGAGAATGAAGATGATCACCATCAAAGAACCATGTTCAGCTCAAGCTTTGTAGCTTCAATGGAGAGATTGAGACAGAGAGTGGCAAATGAAATAGAGAGAGTAGAAGGGCAGCCAGTGGGAGTTTTGCTGTTTGAATTCAGAGAAGCAAAGGCAGCCATGGAAGGCCTGAAAATGGAGCTTGAAAAGGGTTTGGAAGAAGATGAAGAAGTTGAAATTGAAGAGAAAGTTGAGAGATTAAATGGTTGGTTTGGGTCATTAAGAATTGGAGTGGATGCCATTATTGGACAGCTTGATGATTTCTTTGATGAGATTGTTGAGGGAAGAAAGAAACTTTTGGATATGTGCACTCATAACAGATAGATAATATATTTTTTTTTTTTGTAAAAAAAAAAAAGCATCTGGGTATGTTGCAAGATGCAAACTTTCAAGAAGAATGTAGTCAAAAACAAATTTTGGAAGATGAGAGAAATTCTAGCAGAATTGTTTGTAATTTATTATTATTTTTTTTCCTTTTTGAAATCACATTGTTTATTGTAGGATGTGATTATCTTTGCAGGCTATGAGAGGTTAAAAGAAGATAATTAATAAGGGGGAGAGAGAGAGGAATTGATTCTTCAATTGCTTTGAAGAAAAGCAGCCAACAAACCA
Coding sequence (CDS)
ATGGCTACTTCCTCATGTACAGCCACTTCCCTCCATGGCTTCTACCACTTCCTGTCCCACCAGCTCGACCAGCTCGATGACGCCTTCGTCTCCTCCGATTTCATGTCCCTTCACTTTCTTCAGAAAGTACTTTCTCTTCTTACAACTCTGCACTCTCACCTCATCCAGCTGGGTCAGCGCCTTCACCTCCCCGTCGGCGGTAAGTGGCTCGACGAGTATATGGATGAAAGCTCCCGCCTTTGGGAAGCTTGTCAGGTCCTTAAATCTGGAATCTCAAGGATGGAGGTTTTCCATGGTGAAGCTTCTGCCATAGCTTCTTCTTTGCAGGATCCTCACTTTCTTCGTTTCAATCCTCGAGCGTCTCGACGGGTTCTTCGTGCAATTACTGATTTCGAGAGGAATGTTTTTGGATTGGAAGAAGAAAATAGAAGCTTGATGAATACAAGGATTCAGCCATTATCACTGCTTTGTTTCAATGGTAGCGGCGCGTCGACCGGAATGAGATCGACATCAAAGTTGAATGCTTTTAATGGATTCAGAGGTGTTCTTCATGCAGTGAAGAGCATTAGCTCATTGCTTTTAATGATTCTTCTGTGTGCTCTTGTGTATTGTTGGCCAGAATCAAGTTTCCATGGAAGTAATGGGAATGAGAATGAAGATGATCACCATCAAAGAACCATGTTCAGCTCAAGCTTTGTAGCTTCAATGGAGAGATTGAGACAGAGAGTGGCAAATGAAATAGAGAGAGTAGAAGGGCAGCCAGTGGGAGTTTTGCTGTTTGAATTCAGAGAAGCAAAGGCAGCCATGGAAGGCCTGAAAATGGAGCTTGAAAAGGGTTTGGAAGAAGATGAAGAAGTTGAAATTGAAGAGAAAGTTGAGAGATTAAATGGTTGGTTTGGGTCATTAAGAATTGGAGTGGATGCCATTATTGGACAGCTTGATGATTTCTTTGATGAGATTGTTGAGGGAAGAAAGAAACTTTTGGATATGTGCACTCATAACAGATAG
Protein sequence
MATSSCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQRLHLPVGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPRASRRVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFNGSGASTGMRSTSKLNAFNGFRGVLHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMERLRQRVANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGLEEDEEVEIEEKVERLNGWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR
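The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the codon-table construction are illustrative and not part of the database's own tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# Assumption: the feature uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGCTACTTCCTCATGTACAGCCACTTCC"))  # MATSSCTATS
# Last four codons of the CDS, ending in the TAG stop codon.
print(translate("CATAACAGATAG"))  # HNR
```

Passing the full coding sequence to `translate` should reproduce the protein sequence listed above, which begins with MATSSCTATS and ends with HNR.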
Homology
BLAST of ClCG05G009550 vs. NCBI nr
Match:
XP_038878659.1 (uncharacterized protein LOC120070841 [Benincasa hispida])
HSP 1 Score: 609.0 bits (1569), Expect = 2.5e-170
Identity = 312/338 (92.31%), Positives = 324/338 (95.86%), Query Frame = 0
Query: 1 MATSSCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQR 60
MATSS TATSL+GFY FLSH+LD L AFVSSDFMSLHFLQKVLSLL TLHSHLIQLGQR
Sbjct: 1 MATSSSTATSLNGFYQFLSHELDDLHHAFVSSDFMSLHFLQKVLSLLRTLHSHLIQLGQR 60
Query: 61 LHLPVGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPRA 120
LHLPVGGKWLDEYMDESSRLWEACQVLKS ISRME+FH EASAIASSLQDPHF+RFNPRA
Sbjct: 61 LHLPVGGKWLDEYMDESSRLWEACQVLKSAISRMELFHVEASAIASSLQDPHFIRFNPRA 120
Query: 121 SRRVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFNGS--GASTGMRSTSKLNAFNG 180
SRRVLRAITDFERNV+GLEEENRSLMNTRIQPLSLLCFNGS GASTGM STSKLNAFNG
Sbjct: 121 SRRVLRAITDFERNVYGLEEENRSLMNTRIQPLSLLCFNGSSDGASTGMGSTSKLNAFNG 180
Query: 181 FRGVLHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDD-HHQRTMFSSSFVASME 240
FRGVLHAVKSISSLLLMILLC+LVYCWPESSFHGSNG ENE+D HHQRTMFSSSFVASME
Sbjct: 181 FRGVLHAVKSISSLLLMILLCSLVYCWPESSFHGSNGTENENDQHHQRTMFSSSFVASME 240
Query: 241 RLRQRVANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGLEEDEEVEIEEKVERLNG 300
RL+QRVANEI+RV+GQPVG+LLFEFREAKAAMEGLK+ELEKGLEEDEEVE EEKVERLNG
Sbjct: 241 RLKQRVANEIDRVDGQPVGILLFEFREAKAAMEGLKVELEKGLEEDEEVETEEKVERLNG 300
Query: 301 WFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 336
WFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR
Sbjct: 301 WFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 338
BLAST of ClCG05G009550 vs. NCBI nr
Match:
XP_008456067.1 (PREDICTED: uncharacterized protein LOC103496110 [Cucumis melo])
HSP 1 Score: 596.3 bits (1536), Expect = 1.7e-166
Identity = 308/344 (89.53%), Positives = 319/344 (92.73%), Query Frame = 0
Query: 2 ATSSCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQRL 61
++SSCTATSLHGFYHFLSH+LD LD AF+SSDFMSLHFLQKVLSLL TLHS LIQLGQRL
Sbjct: 51 SSSSCTATSLHGFYHFLSHELDDLDHAFLSSDFMSLHFLQKVLSLLRTLHSQLIQLGQRL 110
Query: 62 HLPVGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPRAS 121
HLPVGGKWLDEYMDESSRLWEA QVLKSGISRMEVFH EASAIASSLQDPHFLRFNPRAS
Sbjct: 111 HLPVGGKWLDEYMDESSRLWEASQVLKSGISRMEVFHVEASAIASSLQDPHFLRFNPRAS 170
Query: 122 RRVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFN-GSGASTGMRSTSKLNAFNGFR 181
RRVLRAITDFERNVFGLEEENRSLMNTRI PLSLLCFN GS STGM STSKLNAFNGFR
Sbjct: 171 RRVLRAITDFERNVFGLEEENRSLMNTRIPPLSLLCFNGGSSMSTGMGSTSKLNAFNGFR 230
Query: 182 GVLHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMERLR 241
GVLHAVK+ISSLLLMILLC LVYCWPESSFHGSNG ENE+D HQRTMFSSSFVASMERL+
Sbjct: 231 GVLHAVKNISSLLLMILLCGLVYCWPESSFHGSNGIENEEDQHQRTMFSSSFVASMERLK 290
Query: 242 QRVANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGLEEDEE---------VEIEEK 301
QRVANEIERV+ QPVG+LLFEFREAKAAMEGLK+ELEKGLEEDEE VEIEEK
Sbjct: 291 QRVANEIERVDVQPVGILLFEFREAKAAMEGLKVELEKGLEEDEEEEEEEEEEKVEIEEK 350
Query: 302 VERLNGWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 336
+ERLN WFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR
Sbjct: 351 IERLNSWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 394
BLAST of ClCG05G009550 vs. NCBI nr
Match:
TYK22554.1 (uncharacterized protein E5676_scaffold584G00090 [Cucumis melo var. makuwa])
HSP 1 Score: 595.1 bits (1533), Expect = 3.7e-166
Identity = 307/343 (89.50%), Positives = 319/343 (93.00%), Query Frame = 0
Query: 2 ATSSCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQRL 61
++SSCTATSLHGFYHFLSH+LD LD AF+SSDFMSLHFLQKVLSLL TLHS LIQLGQRL
Sbjct: 9 SSSSCTATSLHGFYHFLSHELDDLDHAFLSSDFMSLHFLQKVLSLLRTLHSQLIQLGQRL 68
Query: 62 HLPVGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPRAS 121
HLPVGGKWLDEYMDESSRLWEA QVLKSGISRMEVFH EASAIASSLQDPHFLRFNPRAS
Sbjct: 69 HLPVGGKWLDEYMDESSRLWEASQVLKSGISRMEVFHVEASAIASSLQDPHFLRFNPRAS 128
Query: 122 RRVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFN-GSGASTGMRSTSKLNAFNGFR 181
RRVLRAITDFERNVFGLEEENRSLMNTRI PLSLLCFN GS STGM STSKLNAFNGFR
Sbjct: 129 RRVLRAITDFERNVFGLEEENRSLMNTRIPPLSLLCFNGGSSMSTGMGSTSKLNAFNGFR 188
Query: 182 GVLHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMERLR 241
GVLHAVK+ISSLLLMILLC LVYCWPESSFHGSNG ENE+D HQRTMFSSSFVASMERL+
Sbjct: 189 GVLHAVKNISSLLLMILLCGLVYCWPESSFHGSNGIENEEDQHQRTMFSSSFVASMERLK 248
Query: 242 QRVANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGLEEDEE--------VEIEEKV 301
QRVANEIERV+ QPVG+LLFEFREAKAAMEGLK+ELEKGLEE+EE VEIEEK+
Sbjct: 249 QRVANEIERVDVQPVGILLFEFREAKAAMEGLKVELEKGLEEEEEEEEEEEEKVEIEEKI 308
Query: 302 ERLNGWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 336
ERLN WFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR
Sbjct: 309 ERLNSWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 351
BLAST of ClCG05G009550 vs. NCBI nr
Match:
XP_004146285.1 (uncharacterized protein LOC101211523 [Cucumis sativus] >KGN57591.1 hypothetical protein Csa_009796 [Cucumis sativus])
HSP 1 Score: 594.0 bits (1530), Expect = 8.2e-166
Identity = 308/339 (90.86%), Positives = 319/339 (94.10%), Query Frame = 0
Query: 1 MAT-SSCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQ 60
MAT SSCTATSLHGFYHFLSH+LD LD AFVSSDFMSLHFLQKVLSLL TLHS LIQLGQ
Sbjct: 1 MATSSSCTATSLHGFYHFLSHELDDLDHAFVSSDFMSLHFLQKVLSLLRTLHSQLIQLGQ 60
Query: 61 RLHLPVGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPR 120
RLHLPVGGKWLDEYMDESSRLWEA QVLKSGISRMEVFH EASAIASSLQDPHFLRFNPR
Sbjct: 61 RLHLPVGGKWLDEYMDESSRLWEASQVLKSGISRMEVFHVEASAIASSLQDPHFLRFNPR 120
Query: 121 ASRRVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFNGSGA-STGMRSTSKLNAFNG 180
ASRRVLRAITDFERNVFGLEEENRSLMNTRI PLSLLCFNGS + S+GM STSKLNAFNG
Sbjct: 121 ASRRVLRAITDFERNVFGLEEENRSLMNTRIPPLSLLCFNGSSSVSSGMGSTSKLNAFNG 180
Query: 181 FRGVLHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMER 240
FRGVLHAVK+ISSLLLMILLC LVYCWPES FHGSNG NE+D HQRTMFSSSF+ASMER
Sbjct: 181 FRGVLHAVKNISSLLLMILLCGLVYCWPESIFHGSNGIGNEEDQHQRTMFSSSFIASMER 240
Query: 241 LRQRVANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGLEED--EEVEIEEKVERLN 300
L+QRVANEIERV+ QPVG+LLFEFREAKAAMEGLK+ELEKGLEED EEVEIEEK+ERLN
Sbjct: 241 LKQRVANEIERVDVQPVGILLFEFREAKAAMEGLKVELEKGLEEDDEEEVEIEEKIERLN 300
Query: 301 GWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 336
WFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR
Sbjct: 301 SWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 339
BLAST of ClCG05G009550 vs. NCBI nr
Match:
KAG6600364.1 (hypothetical protein SDJN03_05597, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 554.7 bits (1428), Expect = 5.5e-154
Identity = 282/338 (83.43%), Positives = 302/338 (89.35%), Query Frame = 0
Query: 1 MATSSCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQR 60
M TSSCTATSLHGFYHFLSHQLD LD AF+SSDFMSLHFL KVLSLL LHSHLIQLG R
Sbjct: 1 MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60
Query: 61 LHLPVGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPRA 120
LHLPVGGKWLDEYMD+SSRLW+ACQVLKSGISR+E++H EAS IASSLQDPH LRFN RA
Sbjct: 61 LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASVIASSLQDPHLLRFNHRA 120
Query: 121 SRRVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFNGSGASTGMRSTSKLNAFNGFR 180
S+RVLRAI D ERN F LEEENR LMNTRIQPLSLLCFN + A TGM STSK NAFNGFR
Sbjct: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180
Query: 181 GVLHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMERLR 240
GVLHAVKSISSLLLMILLC+LVYCWPESSFH ++G ENEDDHHQRTMFSSSFVASM RLR
Sbjct: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240
Query: 241 QRVANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGLEEDEEV---EIEEKVERLNG 300
QRVANEI+RVEGQPVG+LLFEFREAKAAM+ LK ELEK LEE+EE EIEEK E+L
Sbjct: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEEDEIEEKAEKLKS 300
Query: 301 WFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 336
W G+LR GVDAI+G+LDDFFDEIVEGRKKLLD+CTHNR
Sbjct: 301 WNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTHNR 338
BLAST of ClCG05G009550 vs. ExPASy TrEMBL
Match:
A0A1S3C325 (uncharacterized protein LOC103496110 OS=Cucumis melo OX=3656 GN=LOC103496110 PE=4 SV=1)
HSP 1 Score: 596.3 bits (1536), Expect = 8.0e-167
Identity = 308/344 (89.53%), Positives = 319/344 (92.73%), Query Frame = 0
Query: 2 ATSSCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQRL 61
++SSCTATSLHGFYHFLSH+LD LD AF+SSDFMSLHFLQKVLSLL TLHS LIQLGQRL
Sbjct: 51 SSSSCTATSLHGFYHFLSHELDDLDHAFLSSDFMSLHFLQKVLSLLRTLHSQLIQLGQRL 110
Query: 62 HLPVGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPRAS 121
HLPVGGKWLDEYMDESSRLWEA QVLKSGISRMEVFH EASAIASSLQDPHFLRFNPRAS
Sbjct: 111 HLPVGGKWLDEYMDESSRLWEASQVLKSGISRMEVFHVEASAIASSLQDPHFLRFNPRAS 170
Query: 122 RRVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFN-GSGASTGMRSTSKLNAFNGFR 181
RRVLRAITDFERNVFGLEEENRSLMNTRI PLSLLCFN GS STGM STSKLNAFNGFR
Sbjct: 171 RRVLRAITDFERNVFGLEEENRSLMNTRIPPLSLLCFNGGSSMSTGMGSTSKLNAFNGFR 230
Query: 182 GVLHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMERLR 241
GVLHAVK+ISSLLLMILLC LVYCWPESSFHGSNG ENE+D HQRTMFSSSFVASMERL+
Sbjct: 231 GVLHAVKNISSLLLMILLCGLVYCWPESSFHGSNGIENEEDQHQRTMFSSSFVASMERLK 290
Query: 242 QRVANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGLEEDEE---------VEIEEK 301
QRVANEIERV+ QPVG+LLFEFREAKAAMEGLK+ELEKGLEEDEE VEIEEK
Sbjct: 291 QRVANEIERVDVQPVGILLFEFREAKAAMEGLKVELEKGLEEDEEEEEEEEEEKVEIEEK 350
Query: 302 VERLNGWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 336
+ERLN WFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR
Sbjct: 351 IERLNSWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 394
BLAST of ClCG05G009550 vs. ExPASy TrEMBL
Match:
A0A5D3DG23 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold584G00090 PE=4 SV=1)
HSP 1 Score: 595.1 bits (1533), Expect = 1.8e-166
Identity = 307/343 (89.50%), Positives = 319/343 (93.00%), Query Frame = 0
Query: 2 ATSSCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQRL 61
++SSCTATSLHGFYHFLSH+LD LD AF+SSDFMSLHFLQKVLSLL TLHS LIQLGQRL
Sbjct: 9 SSSSCTATSLHGFYHFLSHELDDLDHAFLSSDFMSLHFLQKVLSLLRTLHSQLIQLGQRL 68
Query: 62 HLPVGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPRAS 121
HLPVGGKWLDEYMDESSRLWEA QVLKSGISRMEVFH EASAIASSLQDPHFLRFNPRAS
Sbjct: 69 HLPVGGKWLDEYMDESSRLWEASQVLKSGISRMEVFHVEASAIASSLQDPHFLRFNPRAS 128
Query: 122 RRVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFN-GSGASTGMRSTSKLNAFNGFR 181
RRVLRAITDFERNVFGLEEENRSLMNTRI PLSLLCFN GS STGM STSKLNAFNGFR
Sbjct: 129 RRVLRAITDFERNVFGLEEENRSLMNTRIPPLSLLCFNGGSSMSTGMGSTSKLNAFNGFR 188
Query: 182 GVLHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMERLR 241
GVLHAVK+ISSLLLMILLC LVYCWPESSFHGSNG ENE+D HQRTMFSSSFVASMERL+
Sbjct: 189 GVLHAVKNISSLLLMILLCGLVYCWPESSFHGSNGIENEEDQHQRTMFSSSFVASMERLK 248
Query: 242 QRVANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGLEEDEE--------VEIEEKV 301
QRVANEIERV+ QPVG+LLFEFREAKAAMEGLK+ELEKGLEE+EE VEIEEK+
Sbjct: 249 QRVANEIERVDVQPVGILLFEFREAKAAMEGLKVELEKGLEEEEEEEEEEEEKVEIEEKI 308
Query: 302 ERLNGWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 336
ERLN WFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR
Sbjct: 309 ERLNSWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 351
BLAST of ClCG05G009550 vs. ExPASy TrEMBL
Match:
A0A0A0LC30 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G221740 PE=4 SV=1)
HSP 1 Score: 594.0 bits (1530), Expect = 4.0e-166
Identity = 308/339 (90.86%), Positives = 319/339 (94.10%), Query Frame = 0
Query: 1 MAT-SSCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQ 60
MAT SSCTATSLHGFYHFLSH+LD LD AFVSSDFMSLHFLQKVLSLL TLHS LIQLGQ
Sbjct: 1 MATSSSCTATSLHGFYHFLSHELDDLDHAFVSSDFMSLHFLQKVLSLLRTLHSQLIQLGQ 60
Query: 61 RLHLPVGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPR 120
RLHLPVGGKWLDEYMDESSRLWEA QVLKSGISRMEVFH EASAIASSLQDPHFLRFNPR
Sbjct: 61 RLHLPVGGKWLDEYMDESSRLWEASQVLKSGISRMEVFHVEASAIASSLQDPHFLRFNPR 120
Query: 121 ASRRVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFNGSGA-STGMRSTSKLNAFNG 180
ASRRVLRAITDFERNVFGLEEENRSLMNTRI PLSLLCFNGS + S+GM STSKLNAFNG
Sbjct: 121 ASRRVLRAITDFERNVFGLEEENRSLMNTRIPPLSLLCFNGSSSVSSGMGSTSKLNAFNG 180
Query: 181 FRGVLHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMER 240
FRGVLHAVK+ISSLLLMILLC LVYCWPES FHGSNG NE+D HQRTMFSSSF+ASMER
Sbjct: 181 FRGVLHAVKNISSLLLMILLCGLVYCWPESIFHGSNGIGNEEDQHQRTMFSSSFIASMER 240
Query: 241 LRQRVANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGLEED--EEVEIEEKVERLN 300
L+QRVANEIERV+ QPVG+LLFEFREAKAAMEGLK+ELEKGLEED EEVEIEEK+ERLN
Sbjct: 241 LKQRVANEIERVDVQPVGILLFEFREAKAAMEGLKVELEKGLEEDDEEEVEIEEKIERLN 300
Query: 301 GWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 336
WFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR
Sbjct: 301 SWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 339
BLAST of ClCG05G009550 vs. ExPASy TrEMBL
Match:
A0A6J1FMG3 (uncharacterized protein LOC111447146 OS=Cucurbita moschata OX=3662 GN=LOC111447146 PE=4 SV=1)
HSP 1 Score: 554.3 bits (1427), Expect = 3.5e-154
Identity = 282/337 (83.68%), Positives = 303/337 (89.91%), Query Frame = 0
Query: 1 MATSSCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQR 60
M TSSCTATSLHGFYHFLSHQLD LD AF+SSDFMSLHFL KVLSLL LHSHLIQLG R
Sbjct: 1 MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60
Query: 61 LHLPVGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPRA 120
LHLPVGGKWLDEYMD+SSRLW+ACQVLKSGISR+E++H EASAIASSLQDPH LRFN RA
Sbjct: 61 LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA 120
Query: 121 SRRVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFNGSGASTGMRSTSKLNAFNGFR 180
S+RVLRAI D ERN F LEEENR LMNTRIQPLSLLCFN + A TGM STSK NAFNGFR
Sbjct: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180
Query: 181 GVLHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMERLR 240
GVLHAVKSISSLLLMILLC+LVYCWPESSFH ++G ENEDDHHQRTMFSSSFVASM RLR
Sbjct: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240
Query: 241 QRVANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGLEEDEEV--EIEEKVERLNGW 300
QRVANEI+RVEGQPVG+LLFEFREAKAAM+ LK ELEK LEE+EE EIEEK E+L W
Sbjct: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEDEIEEKAEKLKSW 300
Query: 301 FGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 336
G+LR GVDAI+G+LDDFFDEIVEGRKKLLD+CT+NR
Sbjct: 301 NGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 337
BLAST of ClCG05G009550 vs. ExPASy TrEMBL
Match:
A0A6J1J2L7 (uncharacterized protein LOC111480769 OS=Cucurbita maxima OX=3661 GN=LOC111480769 PE=4 SV=1)
HSP 1 Score: 552.4 bits (1422), Expect = 1.3e-153
Identity = 282/349 (80.80%), Positives = 305/349 (87.39%), Query Frame = 0
Query: 1 MATSSCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQR 60
M TSSCTATSLHGFYHFLSHQLD LD AF+SSDFMSLHFL KVLSLL LH+HLIQLG R
Sbjct: 1 MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHTHLIQLGHR 60
Query: 61 LHLPVGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPRA 120
LHLPVGGKWLDEYMD+SSRLW+ACQVLKSGISR+E++H EASAI+SSLQDPH LRFN RA
Sbjct: 61 LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAISSSLQDPHLLRFNHRA 120
Query: 121 SRRVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFNGSGASTGMRSTSKLNAFNGFR 180
S+RVLRAI D ERN F LEEENR LMNTRIQPLSLLCFN + A TGM STSK NAFNGFR
Sbjct: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180
Query: 181 GVLHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMERLR 240
GVLHAVKSISSLLLMILLC+LVYCWPESSFH ++G ENEDDHHQRT FSSSFVASM RLR
Sbjct: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTTFSSSFVASMARLR 240
Query: 241 QRVANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGL--------------EEDEEV 300
QRVANEI+RVEGQPVG+LLFEFREAKAAM+ LK ELEKGL EE+EEV
Sbjct: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKGLEEEEEEEEEEEEEEEEEEEV 300
Query: 301 EIEEKVERLNGWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 336
EIEEK+E+L W G+LR GVDAI+G+LDDFFDEIVEGRKKLLD+CTHNR
Sbjct: 301 EIEEKLEKLKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTHNR 349
BLAST of ClCG05G009550 vs. TAIR 10
Match:
AT1G22030.1 (CONTAINS InterPro DOMAIN/s: Protein BYPASS related (InterPro:IPR008511); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G77855.1); Has 99 Blast hits to 99 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 3; Fungi - 0; Plants - 96; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 291.2 bits (744), Expect = 1.1e-78
Identity = 164/342 (47.95%), Positives = 228/342 (66.67%), Query Frame = 0
Query: 5 SCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQRLHLP 64
SC A S++GFY FL+ ++ L+ ++S++FMS+HFLQ+ L LL T HSHL L Q+L LP
Sbjct: 4 SC-ANSVNGFYSFLNRSMEDLERVYLSNNFMSVHFLQRALCLLRTSHSHLTLLVQKLQLP 63
Query: 65 VGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLRFNPRASRRV 124
VG KWLDEYMDESS+LWEAC V+KS +S +E F +IAS+L R +P+ SR+V
Sbjct: 64 VGDKWLDEYMDESSKLWEACLVIKSAVSSVENFSSAGISIASTLD----RRLSPQLSRQV 123
Query: 125 LRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFNGSGASTGMRSTSKL-NAFNGFRGVL 184
+RAI+ R G+EEENR+LM R+Q ++ ++T M S++KL N F+GFRGVL
Sbjct: 124 IRAISGCRREAIGIEEENRALMENRVQRFPF--WSEQTSATAMESSTKLQNGFSGFRGVL 183
Query: 185 HAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMERLRQRV 244
+A +++SSLLLM+L+ LVYC+P G + + Q F +M RL+QRV
Sbjct: 184 YATRNMSSLLLMVLMNGLVYCFP-----GDAATQTQTQITQTQSQVGGFAGAMGRLQQRV 243
Query: 245 ANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEKGL------------EEDEEVEIEEK 304
A E+ R+ G G+L+ E+R +KAA+E LK ELE+ EE++E E+ E+
Sbjct: 244 AAEVGRM-GIRKGILMHEYRRSKAALEELKAELERRFCGGGGGGGEREEEEEDERELRER 303
Query: 305 VERLNGWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTH 334
VE L G+FG+LR G ++I+ Q+DDFFDEIVEGRKKLLD C+H
Sbjct: 304 VENLKGYFGNLRNGTESIVAQIDDFFDEIVEGRKKLLDFCSH 332
BLAST of ClCG05G009550 vs. TAIR 10
Match:
AT1G77855.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22030.1); Has 120 Blast hits to 120 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 120; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 263.1 bits (671), Expect = 3.1e-70
Identity = 151/337 (44.81%), Positives = 218/337 (64.69%), Query Frame = 0
Query: 5 SCTATSLHGFYHFLSHQLDQLDDAFVSSDFMSLHFLQKVLSLLTTLHSHLIQLGQRLHLP 64
SC + S++GFY FL+ ++ L+ ++S++FMSL FLQ+V+ LL T HSHL L Q+L+LP
Sbjct: 4 SC-SNSVNGFYSFLNRSMEDLERVYISNNFMSLQFLQRVICLLRTSHSHLTLLVQKLNLP 63
Query: 65 VGGKWLDEYMDESSRLWEACQVLKSGISRMEVFHGEASAIASSLQDPHFLR--FNPRASR 124
VG KWLD+YMDE+S+LW+ C V+KS IS +E F A +I S+L + R +P+ SR
Sbjct: 64 VGDKWLDDYMDETSKLWDVCHVIKSAISTIESFCSSAISITSTLDGHYHHRRLLSPQISR 123
Query: 125 RVLRAITDFERNVFGLEEENRSLMNTRIQPLSLLCFNGSGASTGMRSTSKLNAFNGFRGV 184
+V+RAI+ R G+EEENR+LM RIQ ++ +TGM S+ N F+GFRGV
Sbjct: 124 QVIRAISGCRREAVGIEEENRALMENRIQRFPF--WSEQVTTTGMESSKIQNGFSGFRGV 183
Query: 185 LHAVKSISSLLLMILLCALVYCWPESSFHGSNGNENEDDHHQRTMFSSSFVASMERLRQR 244
++ +K+I+SLLL+IL+ LVY P ++ +M RL+QR
Sbjct: 184 MNTMKNINSLLLVILMQGLVYYIPGD--------------------TTVPTGTMMRLKQR 243
Query: 245 VANEIERVEGQPVGVLLFEFREAKAAMEGLKMELEK------GLEEDEEVEIEEKVERLN 304
VA E+ER+ G G++++E+R +K AME LK+ELE+ G EE E + E++E L
Sbjct: 244 VAAEMERI-GVRKGMMMYEYRRSKTAMEELKVELERRCCGGGGEEEAVEKGLRERIENLK 303
Query: 305 GWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTH 334
G GSLR G ++I+ Q+DDFFD+IV+GRK LLD C+H
Sbjct: 304 GSVGSLRNGTESIVAQIDDFFDDIVDGRKMLLDYCSH 316
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038878659.1 | 2.5e-170 | 92.31 | uncharacterized protein LOC120070841 [Benincasa hispida]
XP_008456067.1 | 1.7e-166 | 89.53 | PREDICTED: uncharacterized protein LOC103496110 [Cucumis melo]
TYK22554.1 | 3.7e-166 | 89.50 | uncharacterized protein E5676_scaffold584G00090 [Cucumis melo var. makuwa]
XP_004146285.1 | 8.2e-166 | 90.86 | uncharacterized protein LOC101211523 [Cucumis sativus] >KGN57591.1 hypothetical ...
KAG6600364.1 | 5.5e-154 | 83.43 | hypothetical protein SDJN03_05597, partial [Cucurbita argyrosperma subsp. sorori...
Match Name | E-value | Identity (%) | Description
A0A1S3C325 | 8.0e-167 | 89.53 | uncharacterized protein LOC103496110 OS=Cucumis melo OX=3656 GN=LOC103496110 PE=...
A0A5D3DG23 | 1.8e-166 | 89.50 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A0A0LC30 | 4.0e-166 | 90.86 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G221740 PE=4 SV=1
A0A6J1FMG3 | 3.5e-154 | 83.68 | uncharacterized protein LOC111447146 OS=Cucurbita moschata OX=3662 GN=LOC1114471...
A0A6J1J2L7 | 1.3e-153 | 80.80 | uncharacterized protein LOC111480769 OS=Cucurbita maxima OX=3661 GN=LOC111480769...
Match Name | E-value | Identity (%) | Description
AT1G22030.1 | 1.1e-78 | 47.95 | CONTAINS InterPro DOMAIN/s: Protein BYPASS related (InterPro:IPR008511); BEST Ar...
AT1G77855.1 | 3.1e-70 | 44.81 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...