Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, polypeptide, CDS, three_prime_UTR
TAAAATCCCTAATGGCATAGTTTTGTTTTTTTTACTCCGACGTTTGTTTAATAACGTTTTCGGTTTGTTTCCGTTTTCTATTTAACGGAAACGTTATCAGAAAAACAATTCCTATTTTTCTCTATTTCTCCTCCGCCGCTGCTCCTTCATCCACCCTGGTTTCTTCCTTCTCCCTCACAGCCGGTGACGACCAACTACCGCCGCACCAACCCCACTCCCTTCTCGGTCGTCGGAGAACCCCTCTATCAATTTTCTTTCTCCTCAACCTGCCATTTGTCTATTTCCCTACTTTTCTCGTTTTCGCCCGTCTCCGATTCTCACTCTTTCTTCTTCGCGCCGTCCGCCTCGTCAAGGAATCACGTTTTGAATTCATAAGCACTTTCAACTGAATTTCTATTATTAACATGGCGAGGAAGAAGAGTAATACGAATAGAGGATCATCCAACCCCGGATTCGGTCCTCAAAATTCCATCACTCTAAGACAGGAAGCTACTGGGAAAATCAAACCCAAAGTATCTAACAATGCCAAAGTTTATTTGAATCATTTGGAAAATTTAGCGACTTGGGCTAGTGGGCAACCGTCTTTACCTTCATTGGCTGCTTTCTTTGGGCAACGTCTTGCTGCTGCAGCCGAGTCTTTGGCGGTTTCTCCTGACCCTTCTCTATTTCTCTGCGCGAGGTTCGTATGTATTCTGCTCTTTACATTTATTTAGCTTTGCATGCCTAGATTTGTTGTTTTTGTAGCGTTTATAACAGGGCAAGTTCATAAATATTGGTTGTATTTGAGGATGGGAGGTTTTTGTTGAGCTTTTTTGATTAGGTCTAAGTGAATGATTTATAGCAGTATTGTGAAATATTTAGGGGGGCACTGTTATACACCGTAATCTAATTAGTTCTAAGTTATTGCAATGGTGGAATTCAACAGTTGAACAATCGGTTGAAGAATTGAAAGTGTTCAATGCTATTTTGAGACTACTGAAAAACATATCAATTGGTGCTGGAGATGAAAGAAGCTGGAATTTGTGTTGCTCTGGTAATCTCAGTGAGATCTTTGGCTCTGCCTGATTGATTTCAGGAGAGCTATTGAAGGAGAGATTTCAAAGATAGTGACTAGAGGAACATGCCTCTGAAGATAAAAACTTAGAGTGATTAGCATGTTTAAATGGTTATTTGTTTGTTTGTTTGTTGAGAACGTTATAATAAACATCTCCTATGGTGGCCTTGAATCTCTATGGGTGTTTTTTTTTGCTTGGAAGAGTATGAAGGAATTGACCACCACGTCTAGCATGTCTCATTTGGCGATCGATTTTGGAACTCATTAGTCATGATTATAAAGTTTTGGATACTTTGCCATTTTTTGTTGCGTTGAAGGCTTGCTATCCCTTCTGCTATGTGGTAATCTCTCGAGGAAAAAGCCAGTAAGCTATGGGGTATGGCCAATTCAATGAATTTTTTTTTTTGGAGCGTGTGAGTTGGGAAGAAAAAGAAAGCCTTTAACAGCCGGGCTAAAAACTTGTGGGTGAAGTGCATGAGTTAGTCAAGCCTCATTGTTCTTTATTTCACTAAATCGTGTGTTTAAGCACTATATTTCCCTCGTTGTAATACTAATTAGGGTGCCTTTTGTTTTGTTTTTTTTTTTTTTTATGACAGTTTTCTTCTTTCTCCTTTGACTTCCAGGGAACTCATAATCAAATAATGTTACATTTGCCTAAAAAAATGTTTACATCGGTCACATTACTTTTTTTTTAAAAATAGAAATATCACATTACTCATAGGACGAGGCTGATCATTACAGTATTGGTTGAAGATCCGTTATTCTATTTTCTTATTGTTAAATTTTTACAATTACTGGTGCTTTTTATTCAATACATGGTTGGAGTGGACACCTGGTTCGAACCATATAGCTGTTCCATGTTAAGGACTTTTATCATGTCACATTTGAATGTTGTCTTGAAATCTAATCTTTTTTGGCTATAAAACTAACCAATCTTACCCGTTAT
ATTACATTCAGGTGTGAAACAATTCTCCAACCTGGTTCTAACTGCCACATACGAATAGAGAAGAATACTGCCAAGAAACGTCGAAGACACAAAAAAGGCAGTAATTTGACACAGAATGTTGTGGCGTATTATTGCCACTACTGCTCATGTAGGAACATAAAGAGAGGGACTCCCAAAGGCCATATGAAAGTGCTTTATGGCACAGAGTGTGTAAGCAAGGTAAAATCTGTGGTTGTCAAGGATGGTAAAGAATGTGAAAACAAGATTTTTACCATGGATACTCCTAAAATTCCTCCTCTTACAACTGTAGACTGTCTAACTATCGACACTCCTGCAATTCCTTCTTTGTCAACAACTAGAGACGATCTAACCATTGATACTAGTGCAATTTCTCCAACTGGGGACATTTCTGTTGTTGATGGCCCTGCTATTTCGTCTCCTAGAACAACTCCTGCAATTTCTTCCACCTTGAGCGTAACAAGTATATCAAGATCGCAAGTTAGGGACATTCCTACTTTAGATGCTCCTGCAACTCCTCTCACCTTGACTGGAATGACTCTGTTGGATTCGAAGAGGAGAAAGAGGAAGAAACCGTCATCCAAGAATCAAACTGAACCTGAGAGTTGTTCTGGTCCAACATCACATGGGGAAACAAGCGAAGGCACATCCAAAAGGAAGCGTAATAGAAAATCATGGACAAGTTTGAAGGAAATTGCTCAAAGGGAGGAAGAGAGAGGTAAACAAAACGTGGCTGGATTGGCAATTCCATTCTCCCTGTTAGAGACCTAATAACCAAATTGTTTTGTAAGCGATTCTCCCAATAAAGAATTAATTTTATTTTATATCTTGCAGCATTGTCGATTTAAATTTGCAATGTTTGTTGAGAGTAGATCTTCAAATTGATGTAAAATTCTCAGGGAATCATACTAAATTGACAAGACTTGGAG
mRNA sequence
TAAAATCCCTAATGGCATAGTTTTGTTTTTTTTACTCCGACGTTTGTTTAATAACGTTTTCGGTTTGTTTCCGTTTTCTATTTAACGGAAACGTTATCAGAAAAACAATTCCTATTTTTCTCTATTTCTCCTCCGCCGCTGCTCCTTCATCCACCCTGGTTTCTTCCTTCTCCCTCACAGCCGGTGACGACCAACTACCGCCGCACCAACCCCACTCCCTTCTCGGTCGTCGGAGAACCCCTCTATCAATTTTCTTTCTCCTCAACCTGCCATTTGTCTATTTCCCTACTTTTCTCGTTTTCGCCCGTCTCCGATTCTCACTCTTTCTTCTTCGCGCCGTCCGCCTCGTCAAGGAATCACGTTTTGAATTCATAAGCACTTTCAACTGAATTTCTATTATTAACATGGCGAGGAAGAAGAGTAATACGAATAGAGGATCATCCAACCCCGGATTCGGTCCTCAAAATTCCATCACTCTAAGACAGGAAGCTACTGGGAAAATCAAACCCAAAGTATCTAACAATGCCAAAGTTTATTTGAATCATTTGGAAAATTTAGCGACTTGGGCTAGTGGGCAACCGTCTTTACCTTCATTGGCTGCTTTCTTTGGGCAACGTCTTGCTGCTGCAGCCGAGTCTTTGGCGGTTTCTCCTGACCCTTCTCTATTTCTCTGCGCGAGGTGTGAAACAATTCTCCAACCTGGTTCTAACTGCCACATACGAATAGAGAAGAATACTGCCAAGAAACGTCGAAGACACAAAAAAGGCAGTAATTTGACACAGAATGTTGTGGCGTATTATTGCCACTACTGCTCATGTAGGAACATAAAGAGAGGGACTCCCAAAGGCCATATGAAAGTGCTTTATGGCACAGAGTGTGTAAGCAAGGTAAAATCTGTGGTTGTCAAGGATGGTAAAGAATGTGAAAACAAGATTTTTACCATGGATACTCCTAAAATTCCTCCTCTTACAACTGTAGACTGTCTAACTATCGACACTCCTGCAATTCCTTCTTTGTCAACAACTAGAGACGATCTAACCATTGATACTAGTGCAATTTCTCCAACTGGGGACATTTCTGTTGTTGATGGCCCTGCTATTTCGTCTCCTAGAACAACTCCTGCAATTTCTTCCACCTTGAGCGTAACAAGTATATCAAGATCGCAAGTTAGGGACATTCCTACTTTAGATGCTCCTGCAACTCCTCTCACCTTGACTGGAATGACTCTGTTGGATTCGAAGAGGAGAAAGAGGAAGAAACCGTCATCCAAGAATCAAACTGAACCTGAGAGTTGTTCTGGTCCAACATCACATGGGGAAACAAGCGAAGGCACATCCAAAAGGAAGCGTAATAGAAAATCATGGACAAGTTTGAAGGAAATTGCTCAAAGGGAGGAAGAGAGAGGTAAACAAAACGTGGCTGGATTGGCAATTCCATTCTCCCTGTTAGAGACCTAATAACCAAATTGTTTTGTAAGCGATTCTCCCAATAAAGAATTAATTTTATTTTATATCTTGCAGCATTGTCGATTTAAATTTGCAATGTTTGTTGAGAGTAGATCTTCAAATTGATGTAAAATTCTCAGGGAATCATACTAAATTGACAAGACTTGGAG
Coding sequence (CDS)
ATGGCGAGGAAGAAGAGTAATACGAATAGAGGATCATCCAACCCCGGATTCGGTCCTCAAAATTCCATCACTCTAAGACAGGAAGCTACTGGGAAAATCAAACCCAAAGTATCTAACAATGCCAAAGTTTATTTGAATCATTTGGAAAATTTAGCGACTTGGGCTAGTGGGCAACCGTCTTTACCTTCATTGGCTGCTTTCTTTGGGCAACGTCTTGCTGCTGCAGCCGAGTCTTTGGCGGTTTCTCCTGACCCTTCTCTATTTCTCTGCGCGAGGTGTGAAACAATTCTCCAACCTGGTTCTAACTGCCACATACGAATAGAGAAGAATACTGCCAAGAAACGTCGAAGACACAAAAAAGGCAGTAATTTGACACAGAATGTTGTGGCGTATTATTGCCACTACTGCTCATGTAGGAACATAAAGAGAGGGACTCCCAAAGGCCATATGAAAGTGCTTTATGGCACAGAGTGTGTAAGCAAGGTAAAATCTGTGGTTGTCAAGGATGGTAAAGAATGTGAAAACAAGATTTTTACCATGGATACTCCTAAAATTCCTCCTCTTACAACTGTAGACTGTCTAACTATCGACACTCCTGCAATTCCTTCTTTGTCAACAACTAGAGACGATCTAACCATTGATACTAGTGCAATTTCTCCAACTGGGGACATTTCTGTTGTTGATGGCCCTGCTATTTCGTCTCCTAGAACAACTCCTGCAATTTCTTCCACCTTGAGCGTAACAAGTATATCAAGATCGCAAGTTAGGGACATTCCTACTTTAGATGCTCCTGCAACTCCTCTCACCTTGACTGGAATGACTCTGTTGGATTCGAAGAGGAGAAAGAGGAAGAAACCGTCATCCAAGAATCAAACTGAACCTGAGAGTTGTTCTGGTCCAACATCACATGGGGAAACAAGCGAAGGCACATCCAAAAGGAAGCGTAATAGAAAATCATGGACAAGTTTGAAGGAAATTGCTCAAAGGGAGGAAGAGAGAGGTAAACAAAACGTGGCTGGATTGGCAATTCCATTCTCCCTGTTAGAGACCTAA
Protein sequence
MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPSLPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHIRIEKNTAKKRRRHKKGSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTMDTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPAISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGPTSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSLLET*
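The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code. The function name `translate` and the two short test fragments (the first 33 and last 9 bases of the CDS listed above) are chosen here for illustration only and are not part of the database record.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons with ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# Fragments copied from the start and end of the CDS above.
cds_start = "ATGGCGAGGAAGAAGAGTAATACGAATAGAGGA"
cds_end = "GAGACCTAA"

print(translate(cds_start))  # MARKKSNTNRG
print(translate(cds_end))    # ET*
```

Passing the full CDS string to `translate` should, under the same assumption, give the protein sequence with a trailing `*` for the stop codon.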
Homology
BLAST of CSPI04G05800 vs. ExPASy TrEMBL
Match:
A0A0A0KUP1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G048010 PE=4 SV=1)
HSP 1 Score: 673.3 bits (1736), Expect = 5.4e-190
Identity = 349/350 (99.71%), Positives = 350/350 (100.00%), Query Frame = 0
Query: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS
Sbjct: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
Query: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHIRIEKNTAKKRRRHKK 120
LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNC+IRIEKNTAKKRRRHKK
Sbjct: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCNIRIEKNTAKKRRRHKK 120
Query: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM
Sbjct: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
Query: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPA 240
DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPA
Sbjct: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPA 240
Query: 241 ISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGP 300
ISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGP
Sbjct: 241 ISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGP 300
Query: 301 TSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSLLET 351
TSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSLLET
Sbjct: 301 TSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSLLET 350
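The Identity and Positives figures in the HSP header are match counts divided by the alignment length, expressed as a percentage with two decimals. A small sketch of that arithmetic for the hit above; the helper name `percent` is hypothetical:

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format a match count as a percentage with two decimal places."""
    return f"{100 * matches / alignment_length:.2f}"


# Counts taken from the HSP header of the first TrEMBL hit.
print(percent(349, 350))  # identity: 99.71
print(percent(350, 350))  # positives: 100.00
```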
BLAST of CSPI04G05800 vs. ExPASy TrEMBL
Match:
A0A5D3CYJ3 (Rpr2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004600 PE=4 SV=1)
HSP 1 Score: 610.9 bits (1574), Expect = 3.3e-171
Identity = 315/347 (90.78%), Positives = 327/347 (94.24%), Query Frame = 0
Query: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
MARKK NT RGSSNP GPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS
Sbjct: 1 MARKKGNTKRGSSNPTSGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
Query: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHIRIEKNTAKKRRRHKK 120
LPSLAAFFGQRLAAAAESLAV+PDPSLFLCARCET+LQPGSNC+IRIEKN AKKRRRHKK
Sbjct: 61 LPSLAAFFGQRLAAAAESLAVAPDPSLFLCARCETVLQPGSNCYIRIEKNNAKKRRRHKK 120
Query: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
SN+TQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKI T+
Sbjct: 121 ASNVTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKILTV 180
Query: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPA 240
D P PPLTTVDCLTIDTPAIPSLSTTRDD+ +DT+A+ PT DISV DGPAISSPRTTPA
Sbjct: 181 DAPTTPPLTTVDCLTIDTPAIPSLSTTRDDVAVDTTAVPPTEDISVDDGPAISSPRTTPA 240
Query: 241 ISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGP 300
I ST SVTS+SRSQVRDIPTLDAPATPLTLT MTLLDSKRRKRKKPSSKN+TEPESCS P
Sbjct: 241 IPSTSSVTSMSRSQVRDIPTLDAPATPLTLTAMTLLDSKRRKRKKPSSKNRTEPESCSAP 300
Query: 301 TSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSL 348
TSHGE SE TSKRKRNRKSWTSLKEIAQREEE+GKQNVAGLAIPFSL
Sbjct: 301 TSHGEKSEDTSKRKRNRKSWTSLKEIAQREEEKGKQNVAGLAIPFSL 347
BLAST of CSPI04G05800 vs. ExPASy TrEMBL
Match:
A0A1S3BU13 (uncharacterized protein LOC103493157 OS=Cucumis melo OX=3656 GN=LOC103493157 PE=4 SV=1)
HSP 1 Score: 610.9 bits (1574), Expect = 3.3e-171
Identity = 315/347 (90.78%), Positives = 327/347 (94.24%), Query Frame = 0
Query: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
MARKK NT RGSSNP GPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS
Sbjct: 1 MARKKGNTKRGSSNPTSGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
Query: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHIRIEKNTAKKRRRHKK 120
LPSLAAFFGQRLAAAAESLAV+PDPSLFLCARCET+LQPGSNC+IRIEKN AKKRRRHKK
Sbjct: 61 LPSLAAFFGQRLAAAAESLAVAPDPSLFLCARCETVLQPGSNCYIRIEKNNAKKRRRHKK 120
Query: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
SN+TQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKI T+
Sbjct: 121 ASNVTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKILTV 180
Query: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPA 240
D P PPLTTVDCLTIDTPAIPSLSTTRDD+ +DT+A+ PT DISV DGPAISSPRTTPA
Sbjct: 181 DAPTTPPLTTVDCLTIDTPAIPSLSTTRDDVAVDTTAVPPTEDISVDDGPAISSPRTTPA 240
Query: 241 ISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGP 300
I ST SVTS+SRSQVRDIPTLDAPATPLTLT MTLLDSKRRKRKKPSSKN+TEPESCS P
Sbjct: 241 IPSTSSVTSMSRSQVRDIPTLDAPATPLTLTAMTLLDSKRRKRKKPSSKNRTEPESCSAP 300
Query: 301 TSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSL 348
TSHGE SE TSKRKRNRKSWTSLKEIAQREEE+GKQNVAGLAIPFSL
Sbjct: 301 TSHGEKSEDTSKRKRNRKSWTSLKEIAQREEEKGKQNVAGLAIPFSL 347
BLAST of CSPI04G05800 vs. ExPASy TrEMBL
Match:
A0A5A7TNC0 (Rpr2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G001240 PE=4 SV=1)
HSP 1 Score: 609.8 bits (1571), Expect = 7.4e-171
Identity = 315/347 (90.78%), Positives = 326/347 (93.95%), Query Frame = 0
Query: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
MARKK NT RGSSNP GPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS
Sbjct: 1 MARKKGNTKRGSSNPTSGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
Query: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHIRIEKNTAKKRRRHKK 120
LPSLAAFFGQRLAAAAESLAV+PDPSLFLCARCET+LQPGSNC+IRIEKN AKKR RHKK
Sbjct: 61 LPSLAAFFGQRLAAAAESLAVAPDPSLFLCARCETVLQPGSNCYIRIEKNNAKKRGRHKK 120
Query: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
SN+TQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKI TM
Sbjct: 121 ASNVTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKILTM 180
Query: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPA 240
D P PPLTTVDCLTIDTPAIPSLSTTRDD+ +DT+A+ PT DISV DGPAISSPRTTPA
Sbjct: 181 DAPTTPPLTTVDCLTIDTPAIPSLSTTRDDVAVDTTAVPPTEDISVDDGPAISSPRTTPA 240
Query: 241 ISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGP 300
I ST SVTS+SRSQVRDIPTLDAPATPLTLT MTLLDSKRRKRKKPSSKN+TEPESCS P
Sbjct: 241 IPSTSSVTSMSRSQVRDIPTLDAPATPLTLTAMTLLDSKRRKRKKPSSKNRTEPESCSAP 300
Query: 301 TSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSL 348
TSHGE SE TSKRKRNRKSWTSLKEIAQREEE+GKQNVAGLAIPFSL
Sbjct: 301 TSHGEKSEDTSKRKRNRKSWTSLKEIAQREEEKGKQNVAGLAIPFSL 347
BLAST of CSPI04G05800 vs. ExPASy TrEMBL
Match:
A0A6J1J5I6 (uncharacterized protein LOC111482797 OS=Cucurbita maxima OX=3661 GN=LOC111482797 PE=4 SV=1)
HSP 1 Score: 409.1 bits (1050), Expect = 1.9e-110
Identity = 233/377 (61.80%), Positives = 263/377 (69.76%), Query Frame = 0
Query: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
MA++K NT +G+SNP GPQ+SIT+RQE TGK KPKVSNN K YLNHLENLATWASG+ S
Sbjct: 1 MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKAS 60
Query: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHIRIEKNTAKKRRRHKK 120
+PSLAAFFGQRLA AAESLAV+PD SLF C RCETILQPGSNC IRIEKN AK+RRR KK
Sbjct: 61 IPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNAKRRRRQKK 120
Query: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
SN QN VAYYCH+CSCRNIKRGTPKGHMKVLY +VK V VKDGKECE
Sbjct: 121 CSNSRQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGKECETSAVER 180
Query: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPR---- 240
T + LTID P IP D SAI PTGDI+ +D PAI
Sbjct: 181 PT---------EILTIDAPKIP-----------DASAIPPTGDITALDNPAIQLQTKGIL 240
Query: 241 --TTPAISSTLSVTSISRSQVRD-----------------------IPTLDAPATPLTLT 300
+PA STLSVT++ +SQ R+ +PT+DAPATP T T
Sbjct: 241 NINSPATPSTLSVTTLLKSQKREMTTLSEKHIGHDIRTDEEKKTGAVPTVDAPATPSTST 300
Query: 301 GMTLLDSKRRKRKKPSSKNQTEPESCSGPTSHGETSEGTSKRKRNRKSWTSLKEIAQREE 348
G+TLLDSK+RKR KPSSKNQTEP SCS PT+ G+ SEGTSKR R RKSWTSLKE+A+ E
Sbjct: 301 GVTLLDSKKRKRNKPSSKNQTEPRSCSAPTADGDRSEGTSKRNRKRKSWTSLKEVARTNE 357
BLAST of CSPI04G05800 vs. NCBI nr
Match:
XP_004146565.1 (uncharacterized protein LOC101220608 [Cucumis sativus] >KGN53340.1 hypothetical protein Csa_014704 [Cucumis sativus])
HSP 1 Score: 673.3 bits (1736), Expect = 1.1e-189
Identity = 349/350 (99.71%), Positives = 350/350 (100.00%), Query Frame = 0
Query: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS
Sbjct: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
Query: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHIRIEKNTAKKRRRHKK 120
LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNC+IRIEKNTAKKRRRHKK
Sbjct: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCNIRIEKNTAKKRRRHKK 120
Query: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM
Sbjct: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
Query: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPA 240
DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPA
Sbjct: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPA 240
Query: 241 ISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGP 300
ISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGP
Sbjct: 241 ISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGP 300
Query: 301 TSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSLLET 351
TSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSLLET
Sbjct: 301 TSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSLLET 350
BLAST of CSPI04G05800 vs. NCBI nr
Match:
XP_008452026.1 (PREDICTED: uncharacterized protein LOC103493157 [Cucumis melo] >TYK16632.1 Rpr2 domain-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 610.9 bits (1574), Expect = 6.8e-171
Identity = 315/347 (90.78%), Positives = 327/347 (94.24%), Query Frame = 0
Query: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
MARKK NT RGSSNP GPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS
Sbjct: 1 MARKKGNTKRGSSNPTSGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
Query: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHIRIEKNTAKKRRRHKK 120
LPSLAAFFGQRLAAAAESLAV+PDPSLFLCARCET+LQPGSNC+IRIEKN AKKRRRHKK
Sbjct: 61 LPSLAAFFGQRLAAAAESLAVAPDPSLFLCARCETVLQPGSNCYIRIEKNNAKKRRRHKK 120
Query: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
SN+TQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKI T+
Sbjct: 121 ASNVTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKILTV 180
Query: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPA 240
D P PPLTTVDCLTIDTPAIPSLSTTRDD+ +DT+A+ PT DISV DGPAISSPRTTPA
Sbjct: 181 DAPTTPPLTTVDCLTIDTPAIPSLSTTRDDVAVDTTAVPPTEDISVDDGPAISSPRTTPA 240
Query: 241 ISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGP 300
I ST SVTS+SRSQVRDIPTLDAPATPLTLT MTLLDSKRRKRKKPSSKN+TEPESCS P
Sbjct: 241 IPSTSSVTSMSRSQVRDIPTLDAPATPLTLTAMTLLDSKRRKRKKPSSKNRTEPESCSAP 300
Query: 301 TSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSL 348
TSHGE SE TSKRKRNRKSWTSLKEIAQREEE+GKQNVAGLAIPFSL
Sbjct: 301 TSHGEKSEDTSKRKRNRKSWTSLKEIAQREEEKGKQNVAGLAIPFSL 347
BLAST of CSPI04G05800 vs. NCBI nr
Match:
KAA0044832.1 (Rpr2 domain-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 609.8 bits (1571), Expect = 1.5e-170
Identity = 315/347 (90.78%), Positives = 326/347 (93.95%), Query Frame = 0
Query: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
MARKK NT RGSSNP GPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS
Sbjct: 1 MARKKGNTKRGSSNPTSGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
Query: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHIRIEKNTAKKRRRHKK 120
LPSLAAFFGQRLAAAAESLAV+PDPSLFLCARCET+LQPGSNC+IRIEKN AKKR RHKK
Sbjct: 61 LPSLAAFFGQRLAAAAESLAVAPDPSLFLCARCETVLQPGSNCYIRIEKNNAKKRGRHKK 120
Query: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
SN+TQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKI TM
Sbjct: 121 ASNVTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKILTM 180
Query: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRTTPA 240
D P PPLTTVDCLTIDTPAIPSLSTTRDD+ +DT+A+ PT DISV DGPAISSPRTTPA
Sbjct: 181 DAPTTPPLTTVDCLTIDTPAIPSLSTTRDDVAVDTTAVPPTEDISVDDGPAISSPRTTPA 240
Query: 241 ISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKRRKRKKPSSKNQTEPESCSGP 300
I ST SVTS+SRSQVRDIPTLDAPATPLTLT MTLLDSKRRKRKKPSSKN+TEPESCS P
Sbjct: 241 IPSTSSVTSMSRSQVRDIPTLDAPATPLTLTAMTLLDSKRRKRKKPSSKNRTEPESCSAP 300
Query: 301 TSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAGLAIPFSL 348
TSHGE SE TSKRKRNRKSWTSLKEIAQREEE+GKQNVAGLAIPFSL
Sbjct: 301 TSHGEKSEDTSKRKRNRKSWTSLKEIAQREEEKGKQNVAGLAIPFSL 347
BLAST of CSPI04G05800 vs. NCBI nr
Match:
XP_038906436.1 (uncharacterized protein LOC120092350 isoform X1 [Benincasa hispida])
HSP 1 Score: 490.7 bits (1262), Expect = 1.0e-134
Identity = 276/372 (74.19%), Positives = 297/372 (79.84%), Query Frame = 0
Query: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
MA+KK N +GSSNP GPQ+SITLRQE TGKIKPKVSNN KVYLNHLENLATWA GQPS
Sbjct: 1 MAKKKGNAKKGSSNPTSGPQDSITLRQEITGKIKPKVSNNVKVYLNHLENLATWACGQPS 60
Query: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHIRIEKNTAKKRRRHKK 120
+PSLA FFGQRLAAAAESLAV+PD SLFLC RCETILQPGSNC IRIEKN AK+RR+H K
Sbjct: 61 IPSLATFFGQRLAAAAESLAVAPDASLFLCQRCETILQPGSNCSIRIEKNNAKRRRKHNK 120
Query: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
SNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTE VSK+KSV V+DGKECEN
Sbjct: 121 CSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTEFVSKLKSVGVEDGKECEN----- 180
Query: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPRT--- 240
KI PL T + LTIDTPAIP STT +D IDT AI PTGDISVVDGP SSPRT
Sbjct: 181 ---KISPLPTGNRLTIDTPAIPP-STTGEDQNIDTRAIPPTGDISVVDGPVFSSPRTKDI 240
Query: 241 ----TPAISSTLSVTSISRSQ------------------VRDIPTLDAPATPLTLTGMTL 300
PA STLSV ++SRSQ V DIPT+DAPATP T+TG+TL
Sbjct: 241 LNINAPATPSTLSVQTLSRSQKMKLLSNKQTGPASVEERVGDIPTVDAPATPPTMTGITL 300
Query: 301 LDSKRRKRKKPSSKNQTEPESCSGPTSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGK 348
LDSKRRKRKKPSSKNQTEPES S PT++G+ + G SKRKRNRKSWTSLKEIAQR+EERGK
Sbjct: 301 LDSKRRKRKKPSSKNQTEPES-SAPTTYGDKTVGMSKRKRNRKSWTSLKEIAQRDEERGK 360
BLAST of CSPI04G05800 vs. NCBI nr
Match:
XP_022984531.1 (uncharacterized protein LOC111482797 [Cucurbita maxima])
HSP 1 Score: 409.1 bits (1050), Expect = 3.9e-110
Identity = 233/377 (61.80%), Positives = 263/377 (69.76%), Query Frame = 0
Query: 1 MARKKSNTNRGSSNPGFGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60
MA++K NT +G+SNP GPQ+SIT+RQE TGK KPKVSNN K YLNHLENLATWASG+ S
Sbjct: 1 MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKAS 60
Query: 61 LPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHIRIEKNTAKKRRRHKK 120
+PSLAAFFGQRLA AAESLAV+PD SLF C RCETILQPGSNC IRIEKN AK+RRR KK
Sbjct: 61 IPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNAKRRRRQKK 120
Query: 121 GSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKIFTM 180
SN QN VAYYCH+CSCRNIKRGTPKGHMKVLY +VK V VKDGKECE
Sbjct: 121 CSNSRQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGKECETSAVER 180
Query: 181 DTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISPTGDISVVDGPAISSPR---- 240
T + LTID P IP D SAI PTGDI+ +D PAI
Sbjct: 181 PT---------EILTIDAPKIP-----------DASAIPPTGDITALDNPAIQLQTKGIL 240
Query: 241 --TTPAISSTLSVTSISRSQVRD-----------------------IPTLDAPATPLTLT 300
+PA STLSVT++ +SQ R+ +PT+DAPATP T T
Sbjct: 241 NINSPATPSTLSVTTLLKSQKREMTTLSEKHIGHDIRTDEEKKTGAVPTVDAPATPSTST 300
Query: 301 GMTLLDSKRRKRKKPSSKNQTEPESCSGPTSHGETSEGTSKRKRNRKSWTSLKEIAQREE 348
G+TLLDSK+RKR KPSSKNQTEP SCS PT+ G+ SEGTSKR R RKSWTSLKE+A+ E
Sbjct: 301 GVTLLDSKKRKRNKPSSKNQTEPRSCSAPTADGDRSEGTSKRNRKRKSWTSLKEVARTNE 357
BLAST of CSPI04G05800 vs. TAIR 10
Match:
AT5G41270.1 (CONTAINS InterPro DOMAIN/s: RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 122.5 bits (306), Expect = 6.9e-28
Identity = 108/308 (35.06%), Positives = 145/308 (47.08%), Query Frame = 0
Query: 47 HLENLATWAS-GQPSLPSLAAFFGQRLAAAAESLAVSPDPSLFLCARCETILQPGSNCHI 106
HL+NLA W+S G +PSLA+ G+RLAA ES ++ DP L C RCETIL+PG NC++
Sbjct: 28 HLKNLALWSSTGDTPIPSLASLLGRRLAADTESTGITTDPDLVSCQRCETILKPGFNCNV 87
Query: 107 RIEK---NTAKKRRRHKKGSNL--TQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVS 166
RIEK N KKR R KK +N+ QN V Y+C++CS RN+KRGT KG MK LY
Sbjct: 88 RIEKVSANVKKKRNRCKKSNNICFPQNNVVYHCNFCSHRNLKRGTAKGQMKELY------ 147
Query: 167 KVKSVVVKDGKECENKIFTMDTPKIPPLTTVDCLTIDTPAIPSLSTTRDDLTIDTSAISP 226
+ K PKI + ++T+
Sbjct: 148 -----------PFKPKTARSSRPKI----------------------KKEMTMPQE---- 207
Query: 227 TGDISVVDGPAISSPRTTPAISSTLSVTSISRSQVRDIPTLDAPATPLTLTGMTLLDSKR 286
+ +SSP + + QV + D P P+ LT L+ R
Sbjct: 208 ------IQSNMLSSPERS------------VKDQVEEKSVGDTP-KPMMLT----LERDR 258
Query: 287 RKRKKPSSKNQTEPESCSGPTSHGETSEGTSKRKRNRKSWTSLKEIAQREEERGKQNVAG 346
R R KP SK +EP+S E + G S +++ + WTS+KEIA E K + AG
Sbjct: 268 RIR-KPKSKKPSEPQSVP------EKTVGGSNKRKRKSPWTSMKEIA----ETNKSSKAG 258
Query: 347 -LAIPFSL 348
IPF L
Sbjct: 328 NFKIPFLL 258
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0KUP1 | 5.4e-190 | 99.71 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G048010 PE=4 SV=1
A0A5D3CYJ3 | 3.3e-171 | 90.78 | Rpr2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A1S3BU13 | 3.3e-171 | 90.78 | uncharacterized protein LOC103493157 OS=Cucumis melo OX=3656 GN=LOC103493157 PE=...
A0A5A7TNC0 | 7.4e-171 | 90.78 | Rpr2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_s...
A0A6J1J5I6 | 1.9e-110 | 61.80 | uncharacterized protein LOC111482797 OS=Cucurbita maxima OX=3661 GN=LOC111482797...
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_004146565.1 | 1.1e-189 | 99.71 | uncharacterized protein LOC101220608 [Cucumis sativus] >KGN53340.1 hypothetical ...
XP_008452026.1 | 6.8e-171 | 90.78 | PREDICTED: uncharacterized protein LOC103493157 [Cucumis melo] >TYK16632.1 Rpr2 ...
KAA0044832.1 | 1.5e-170 | 90.78 | Rpr2 domain-containing protein [Cucumis melo var. makuwa]
XP_038906436.1 | 1.0e-134 | 74.19 | uncharacterized protein LOC120092350 isoform X1 [Benincasa hispida]
XP_022984531.1 | 3.9e-110 | 61.80 | uncharacterized protein LOC111482797 [Cucurbita maxima]
TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G41270.1 | 6.9e-28 | 35.06 | CONTAINS InterPro DOMAIN/s: RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175); Ha...
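The pipe-delimited summary rows above can be read programmatically. The following is a rough sketch, assuming the first four fields of each row are match name, E-value, percent identity, and description. The `Hit` type, the `parse_row` helper, and the 70% identity cutoff are illustrative choices, not part of the database output.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity
    description: str


def parse_row(row: str) -> Hit:
    """Parse one pipe-delimited summary row; extra trailing fields are ignored."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return Hit(name, float(evalue), float(identity), description)


# Three rows copied from the summary table above.
rows = [
    "A0A0A0KUP1 | 5.4e-190 | 99.71 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G048010 PE=4 SV=1",
    "XP_038906436.1 | 1.0e-134 | 74.19 | uncharacterized protein LOC120092350 isoform X1 [Benincasa hispida]",
    "AT5G41270.1 | 6.9e-28 | 35.06 | CONTAINS InterPro DOMAIN/s: RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175); Ha...",
]

hits = [parse_row(row) for row in rows]

# Arbitrary example cutoff: keep hits with at least 70% identity.
strong = [hit.name for hit in hits if hit.identity >= 70.0]
print(strong)  # ['A0A0A0KUP1', 'XP_038906436.1']
```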