Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTTGGGCTCTGCATCACCACTCTCTCTCTCGCGCGCGCCAGGTTTCTGTGTCCAGAGAGGATATGGTCTCTAGCTCCCTCTCACTTCTCTCTCCCACCTCATTTCCTTCCATTTCCAAAACTGATTCACCATCTTCTTCTATCCCCAACAAATTTGGGATTGGGCCTTTCTCAGAATCATCCAAACGAAGGTGTACATTCCCAAAACGTGTTAGGTTGTTCCGGTGCCAAATTCTTGGCTCATCTTCATCTTCCAATCAATCACGCGATGATGCCTCGGCGGAGCTGTTTTTGCAGAACAATTCAATTGCGGATTTCATGAGGTTCAAACGGGATGGAAGCAGTGCGGAGCTTCAGACTGCTACAGTTAGTTACAGAAAGAAGTTCCCCTGGTCTATCCTGCAGCCATTTGTTCAGGTTCGAGCTCGAATTCTAATTGCTTCAATTGGGTATTTGTGTTTCTTACTTGGGGTTTCTTTAATTTGTGGCTTGTGGTCTTCAATTGTTTCCAGGTTGATTTGGTATCAACGATCCATATCGCAGACAAAGAGTATGTTTACCTACCCAGTTTATGGGGTCCTGTCTCGTTTTATCATTTCGACCATCTTCTATGTTTTCTCGGTTTTCTGCTCGCAAATTGTTGGTTGTAACCTATGGACTAGGAGATATGATCATGCAATTTGATTATAATTTGGAAGGGTAGTGCCTTAGTAATTAGGCATTAGTGCTGTCATTGAGAGAAGAGCTGGAATTGTGACCAAGTATATGTTTATATGGCAACAGGCGATCATCAACAAACTAAGAAACTGTTACATATGGAAAAAGAGACGAAGAACTGCATTTAAGAAAAAAATTGTTCTATATCTGAAATGACAAACAAAAAGGGACAAAATTGTCATGGGAAGATATTGCAGGCACTTTTCTATTTTGAATCGAAGAAGAAAAATTTCCTGTGTTGTAACTGTGTTTTGGTGTTCTGGGCATTGAATGAACTCTCTTTTCACTAGTCTTTCTATCTTCCTTGTTATGTGGGTAGTTTGCGTTTGAAAATAACTGATGCAGTGCTTGAAATGGGTGTTTTTATAAGGTTTCGATAATAGATTGCTAACTTTGATTTTTCTTATCAGTTGACTTACTACAACATGACTTTTTGATTGACCAAGTATGTTGCCATCATTTTTGCATAGCTACTTCGAGGCCCTTCAGAAAGAACTAGACTCCTACGATTGTATACTTTATGAGATGGTAGCTAGTAGGGAGAGCTTAGAAAGCAGAAGAAATCCAGACGCCACAAAGAAATTAAAAAGTTCGCGATCACGAGGATTTAATATTCTGGGTTGCATTCAACGACAGATGGCTCGTGTTCTTACGCTTGATTTCCAATTAGATTGTCTTGATTACCAGGCTGCAAATTGGTTCCATGCAGATCTAGACTACGAAACCTTCAGAATACTTCAGGTCATTTCCTTCTACCTTCCAGCACAATAGATTGAGTGAGAAACATCTTTTCTGAGGATGTCAGAGTGCATGCGATTGGACATGCCCATTTTCATTCTTTGAATAATTGAACCATAGCATTTGATAAGTCACAAGAGTCTATTGTTTACCACTTGGCCTTAGTTCCTCATTTTTACAATCTCACATCATTCTCAGTAGTTTTTGACATCTAAAGGTTGGCGTTGTATAATGATATATTTTAACGCTGGATTTGTTGCTGTTACGTTACAGACTGAAAAAGGTGAAAACTTCTTTACATTTGCAAGAGACATGACTATACGATCCACCAAAGCTATGGTTCAGCCTACTGCAGTACCGGACGATCTTGAACCTTGGAAGTCAAAACTTCTATGGGCGTCTCGTGTGCTTCCCATGCCACTCGTTGGACTTCTTATCATTGGAAGTGTTTGTGCAGACGTGGGAAGTCAAGCATCAGAATTTCCAGAATTTGAAGCATTGTCGAGGCTCGATTTGGGTGCTGCAATGAAGGTCTTTCTGGCAAAGCGACTAACATCTGAGTAAGTATATTTTGAGTTCTTCCGGATGTACATTCCCTTGTTGGTTTGGAAGTGATAACTAATAATCCATTCCAACTGGACTTCATTTCTATTTCTTAATGGTGAGTCTGGAGGTTATTCTGAATCTAAATTAGTAGATGCTTTTACCTTCCCTCTCCATTCCCCCAGGGAAAGAGTGAAGTCTTAGTTCATCTGTTTTTGTATGTGATATAATTCATTGCTGGTGAAGGTTCACACAAGTAACTGCTGAGGTGGAGGAGAGTTCTGTAATAATTGGCGAAAGGAACAAAGCTGCAACAGAGGCGCTTAGAGATGCCATCGACAAGGGCCACAACAAAATCGCCATACTATACGGTGGCGGTCACATGCCCGACTTGGGGAGGCGATTGCGAGAGGAGTTCGACCTCATTCCTTGTCGTGTAAAGTGGATAACAGCATGGTCTATTACAAATCGAAAACTATCCAGCAGTTCTCTCCCATTTCTGAAGGCTCTAGCCGATGCCTCGGGTTGGCCGTTGAACCGTTACCAGACCTTGGCGCTGCTAATCTTCTCCTCAGTCCTTGCAGTGGATCTCTGGTTTTGGGAACTCTTTTTCGGCACAGCGGCAAATTGGATCTCCGAAGTCGCTTTAGAAGTCTATCAGTATATTGATAATGTACAGCTGATG
mRNA sequence
TTTTGGGCTCTGCATCACCACTCTCTCTCTCGCGCGCGCCAGGTTTCTGTGTCCAGAGAGGATATGGTCTCTAGCTCCCTCTCACTTCTCTCTCCCACCTCATTTCCTTCCATTTCCAAAACTGATTCACCATCTTCTTCTATCCCCAACAAATTTGGGATTGGGCCTTTCTCAGAATCATCCAAACGAAGGTGTACATTCCCAAAACGTGTTAGGTTGTTCCGGTGCCAAATTCTTGGCTCATCTTCATCTTCCAATCAATCACGCGATGATGCCTCGGCGGAGCTGTTTTTGCAGAACAATTCAATTGCGGATTTCATGAGGTTCAAACGGGATGGAAGCAGTGCGGAGCTTCAGACTGCTACAGTTAGTTACAGAAAGAAGTTCCCCTGGTCTATCCTGCAGCCATTTGTTCAGGTTGATTTGGTATCAACGATCCATATCGCAGACAAAGACTACTTCGAGGCCCTTCAGAAAGAACTAGACTCCTACGATTGTATACTTTATGAGATGGTAGCTAGTAGGGAGAGCTTAGAAAGCAGAAGAAATCCAGACGCCACAAAGAAATTAAAAAGTTCGCGATCACGAGGATTTAATATTCTGGGTTGCATTCAACGACAGATGGCTCGTGTTCTTACGCTTGATTTCCAATTAGATTGTCTTGATTACCAGGCTGCAAATTGGTTCCATGCAGATCTAGACTACGAAACCTTCAGAATACTTCAGACTGAAAAAGGTGAAAACTTCTTTACATTTGCAAGAGACATGACTATACGATCCACCAAAGCTATGGTTCAGCCTACTGCAGTACCGGACGATCTTGAACCTTGGAAGTCAAAACTTCTATGGGCGTCTCGTGTGCTTCCCATGCCACTCGTTGGACTTCTTATCATTGGAAGTGTTTGTGCAGACGTGGGAAGTCAAGCATCAGAATTTCCAGAATTTGAAGCATTGTCGAGGCTCGATTTGGGTGCTGCAATGAAGGTCTTTCTGGCAAAGCGACTAACATCTGAGTTCACACAAGTAACTGCTGAGGTGGAGGAGAGTTCTGTAATAATTGGCGAAAGGAACAAAGCTGCAACAGAGGCGCTTAGAGATGCCATCGACAAGGGCCACAACAAAATCGCCATACTATACGGTGGCGGTCACATGCCCGACTTGGGGAGGCGATTGCGAGAGGAGTTCGACCTCATTCCTTGTCGTGTAAAGTGGATAACAGCATGGTCTATTACAAATCGAAAACTATCCAGCAGTTCTCTCCCATTTCTGAAGGCTCTAGCCGATGCCTCGGGTTGGCCGTTGAACCGTTACCAGACCTTGGCGCTGCTAATCTTCTCCTCAGTCCTTGCAGTGGATCTCTGGTTTTGGGAACTCTTTTTCGGCACAGCGGCAAATTGGATCTCCGAAGTCGCTTTAGAAGTCTATCAGTATATTGATAATGTACAGCTGATG
Coding sequence (CDS)
TTTTGGGCTCTGCATCACCACTCTCTCTCTCGCGCGCGCCAGGTTTCTGTGTCCAGAGAGGATATGGTCTCTAGCTCCCTCTCACTTCTCTCTCCCACCTCATTTCCTTCCATTTCCAAAACTGATTCACCATCTTCTTCTATCCCCAACAAATTTGGGATTGGGCCTTTCTCAGAATCATCCAAACGAAGGTGTACATTCCCAAAACGTGTTAGGTTGTTCCGGTGCCAAATTCTTGGCTCATCTTCATCTTCCAATCAATCACGCGATGATGCCTCGGCGGAGCTGTTTTTGCAGAACAATTCAATTGCGGATTTCATGAGGTTCAAACGGGATGGAAGCAGTGCGGAGCTTCAGACTGCTACAGTTAGTTACAGAAAGAAGTTCCCCTGGTCTATCCTGCAGCCATTTGTTCAGGTTGATTTGGTATCAACGATCCATATCGCAGACAAAGACTACTTCGAGGCCCTTCAGAAAGAACTAGACTCCTACGATTGTATACTTTATGAGATGGTAGCTAGTAGGGAGAGCTTAGAAAGCAGAAGAAATCCAGACGCCACAAAGAAATTAAAAAGTTCGCGATCACGAGGATTTAATATTCTGGGTTGCATTCAACGACAGATGGCTCGTGTTCTTACGCTTGATTTCCAATTAGATTGTCTTGATTACCAGGCTGCAAATTGGTTCCATGCAGATCTAGACTACGAAACCTTCAGAATACTTCAGACTGAAAAAGGTGAAAACTTCTTTACATTTGCAAGAGACATGACTATACGATCCACCAAAGCTATGGTTCAGCCTACTGCAGTACCGGACGATCTTGAACCTTGGAAGTCAAAACTTCTATGGGCGTCTCGTGTGCTTCCCATGCCACTCGTTGGACTTCTTATCATTGGAAGTGTTTGTGCAGACGTGGGAAGTCAAGCATCAGAATTTCCAGAATTTGAAGCATTGTCGAGGCTCGATTTGGGTGCTGCAATGAAGGTCTTTCTGGCAAAGCGACTAACATCTGAGTTCACACAAGTAACTGCTGAGGTGGAGGAGAGTTCTGTAATAATTGGCGAAAGGAACAAAGCTGCAACAGAGGCGCTTAGAGATGCCATCGACAAGGGCCACAACAAAATCGCCATACTATACGGTGGCGGTCACATGCCCGACTTGGGGAGGCGATTGCGAGAGGAGTTCGACCTCATTCCTTGTCGTGTAAAGTGGATAACAGCATGGTCTATTACAAATCGAAAACTATCCAGCAGTTCTCTCCCATTTCTGAAGGCTCTAGCCGATGCCTCGGGTTGGCCGTTGAACCGTTACCAGACCTTGGCGCTGCTAATCTTCTCCTCAGTCCTTGCAGTGGATCTCTGGTTTTGGGAACTCTTTTTCGGCACAGCGGCAAATTGGATCTCCGAAGTCGCTTTAGAAGTCTATCAGTATATTGATAATGTACAGCTGATG
Protein sequence
FWALHHHSLSRARQVSVSREDMVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGSSSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVDLVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNILGCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRSTKAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRLDLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGGGHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLALLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM
Homology
BLAST of MS003820 vs. NCBI nr
Match:
XP_022145081.1 (uncharacterized protein LOC111014591 [Momordica charantia])
HSP 1 Score: 909.1 bits (2348), Expect = 1.7e-260
Identity = 463/463 (100.00%), Postives = 463/463 (100.00%), Query Frame = 0
Query: 22 MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS 81
MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS
Sbjct: 1 MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS 60
Query: 82 SSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVD 141
SSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVD
Sbjct: 61 SSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVD 120
Query: 142 LVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNIL 201
LVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNIL
Sbjct: 121 LVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNIL 180
Query: 202 GCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRST 261
GCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRST
Sbjct: 181 GCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRST 240
Query: 262 KAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRL 321
KAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRL
Sbjct: 241 KAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRL 300
Query: 322 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGG 381
DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGG
Sbjct: 301 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGG 360
Query: 382 GHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLA 441
GHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLA
Sbjct: 361 GHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLA 420
Query: 442 LLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 485
LLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM
Sbjct: 421 LLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 463
BLAST of MS003820 vs. NCBI nr
Match:
XP_038879904.1 (uncharacterized protein LOC120071619 [Benincasa hispida] >XP_038879913.1 uncharacterized protein LOC120071628 [Benincasa hispida])
HSP 1 Score: 817.0 bits (2109), Expect = 8.6e-233
Identity = 422/468 (90.17%), Postives = 440/468 (94.02%), Query Frame = 0
Query: 22 MVSSSLSLLSPTSFPSISKTD-SPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILG 81
MV++SLS+L PTS SISKTD S SSSIP KF GPFS+SS R FPKR RLFRCQ+ G
Sbjct: 1 MVANSLSILLPTSIRSISKTDSSSSSSIPTKF--GPFSDSSNPRFRFPKRFRLFRCQVPG 60
Query: 82 SSSS----SNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQP 141
SSSS SNQ R+DAS +LF QNNSIADFMRFKRDG+SAELQTA VSY+KKFPWSILQP
Sbjct: 61 SSSSSSSASNQLREDASPDLFFQNNSIADFMRFKRDGTSAELQTAIVSYKKKFPWSILQP 120
Query: 142 FVQVDLVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSR 201
FVQVDLVSTIHIADKDYFEALQKEL+SYDCILYEMVASRESLE+RRNP ATKKLKSSRSR
Sbjct: 121 FVQVDLVSTIHIADKDYFEALQKELESYDCILYEMVASRESLENRRNPVATKKLKSSRSR 180
Query: 202 GFNILGCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDM 261
G NILGCIQRQMARVLTLDFQLDCLDYQA+NW+HADLDYETFRILQTEKGENFFTFARDM
Sbjct: 181 GLNILGCIQRQMARVLTLDFQLDCLDYQASNWYHADLDYETFRILQTEKGENFFTFARDM 240
Query: 262 TIRSTKAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFE 321
TIRSTKAMVQPTAVP+DLEPWKSKLLWASRVLPMPLVGLLIIGSVCAD GSQASEFPEFE
Sbjct: 241 TIRSTKAMVQPTAVPEDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADGGSQASEFPEFE 300
Query: 322 ALSRLDLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIA 381
ALSRLDLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDA+DKGHN+IA
Sbjct: 301 ALSRLDLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDALDKGHNRIA 360
Query: 382 ILYGGGHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNR 441
ILYGGGHMPDLGRRLREEFDLIPCRVKWITAWSIT RKL+SSSLPFLKALAD SGWPLNR
Sbjct: 361 ILYGGGHMPDLGRRLREEFDLIPCRVKWITAWSITKRKLASSSLPFLKALADVSGWPLNR 420
Query: 442 YQTLALLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 485
YQTLALLIFSSVLAVDLWFWELFFGTAANWISE ALEVY+YIDNVQLM
Sbjct: 421 YQTLALLIFSSVLAVDLWFWELFFGTAANWISEFALEVYRYIDNVQLM 466
BLAST of MS003820 vs. NCBI nr
Match:
XP_023531025.1 (uncharacterized protein LOC111793404 [Cucurbita pepo subsp. pepo] >XP_023531026.1 uncharacterized protein LOC111793404 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 810.4 bits (2092), Expect = 8.1e-231
Identity = 416/463 (89.85%), Postives = 435/463 (93.95%), Query Frame = 0
Query: 22 MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS 81
MVS+SL LLSPTSFPSISKTDSPSSSI F G FS+SSKR FPKRVRLFRCQILGS
Sbjct: 1 MVSTSLLLLSPTSFPSISKTDSPSSSISTNF--GSFSQSSKRSFRFPKRVRLFRCQILGS 60
Query: 82 SSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVD 141
SS SNQ RDD +AELFLQNNSIADFMRFKRDGSS ELQTA V+YRKKFPWSILQPF+QVD
Sbjct: 61 SSPSNQLRDDGTAELFLQNNSIADFMRFKRDGSSTELQTAVVTYRKKFPWSILQPFLQVD 120
Query: 142 LVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNIL 201
LVSTIHIADKDYFEALQKEL+SYDCILYEMVASRESLE+RRNP A KKL+SSRSRGFN+L
Sbjct: 121 LVSTIHIADKDYFEALQKELESYDCILYEMVASRESLENRRNPTAMKKLQSSRSRGFNLL 180
Query: 202 GCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRST 261
GCIQRQMARVLTLDFQLDCLDYQA+NW+HADLDYETF ILQTEKGE+FFTFARDMTIRST
Sbjct: 181 GCIQRQMARVLTLDFQLDCLDYQASNWYHADLDYETFTILQTEKGESFFTFARDMTIRST 240
Query: 262 KAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRL 321
KA+VQPTA DLEPWKSKLL ASRVLPMPLVG+LIIGSVCAD GSQASEFPEFEALS L
Sbjct: 241 KALVQPTAGSKDLEPWKSKLLRASRVLPMPLVGMLIIGSVCADGGSQASEFPEFEALSSL 300
Query: 322 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGG 381
DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDA+DKGHNKIAILYGG
Sbjct: 301 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAMDKGHNKIAILYGG 360
Query: 382 GHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLA 441
GHMPDLGRRLRE+FDLIP RVKWITAWSI RK+SSSSLPFLK+LAD SGWPLNRYQTLA
Sbjct: 361 GHMPDLGRRLREDFDLIPSRVKWITAWSIAKRKVSSSSLPFLKSLADVSGWPLNRYQTLA 420
Query: 442 LLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 485
LLIFSSVLAVDLWFWELFFGTAANWISEVAL+VYQYIDNVQLM
Sbjct: 421 LLIFSSVLAVDLWFWELFFGTAANWISEVALDVYQYIDNVQLM 461
BLAST of MS003820 vs. NCBI nr
Match:
KAG6587960.1 (hypothetical protein SDJN03_16525, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 809.7 bits (2090), Expect = 1.4e-230
Identity = 415/463 (89.63%), Postives = 434/463 (93.74%), Query Frame = 0
Query: 22 MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS 81
MVS+SL LLSPTSFPSISKTDSPSSSI F G FS+SSKR FPKR RLFRCQILGS
Sbjct: 1 MVSTSLLLLSPTSFPSISKTDSPSSSISTNF--GSFSQSSKRNFRFPKRFRLFRCQILGS 60
Query: 82 SSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVD 141
SS SNQ RDD +AELFLQNNSIADFMRFKRDGSS ELQTA VSYRKKFPWSILQPF+QVD
Sbjct: 61 SSPSNQLRDDGTAELFLQNNSIADFMRFKRDGSSTELQTAVVSYRKKFPWSILQPFLQVD 120
Query: 142 LVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNIL 201
LVSTIHIADKDYFEALQKEL+SYDCILYEMVASRESLE+RRNP A KKL+SSRSRGFN+L
Sbjct: 121 LVSTIHIADKDYFEALQKELESYDCILYEMVASRESLENRRNPTAMKKLQSSRSRGFNLL 180
Query: 202 GCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRST 261
GCIQRQMARVLTLDFQLDCLDYQA+NW+HADLDYETF ILQTEKGE+FFTFARDMTIRST
Sbjct: 181 GCIQRQMARVLTLDFQLDCLDYQASNWYHADLDYETFTILQTEKGESFFTFARDMTIRST 240
Query: 262 KAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRL 321
KA+VQPTA DLEPWKSKLL ASRVLPMPL+G+LIIGSVCAD GSQASEFPEFEALS L
Sbjct: 241 KALVQPTAGSKDLEPWKSKLLRASRVLPMPLIGMLIIGSVCADGGSQASEFPEFEALSSL 300
Query: 322 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGG 381
DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDA+DKGHNKIAILYGG
Sbjct: 301 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAMDKGHNKIAILYGG 360
Query: 382 GHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLA 441
GHMPDLGRRLRE+FDLIP RVKWITAWSI RK+SSSSLPFLK+LAD SGWPLNRYQTLA
Sbjct: 361 GHMPDLGRRLREDFDLIPSRVKWITAWSIAKRKVSSSSLPFLKSLADVSGWPLNRYQTLA 420
Query: 442 LLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 485
LLIFSSVLAVDLWFWELFFGTAANWISEVAL+VYQYIDNVQLM
Sbjct: 421 LLIFSSVLAVDLWFWELFFGTAANWISEVALDVYQYIDNVQLM 461
BLAST of MS003820 vs. NCBI nr
Match:
KAG7021849.1 (hypothetical protein SDJN02_15577 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 809.7 bits (2090), Expect = 1.4e-230
Identity = 416/463 (89.85%), Postives = 434/463 (93.74%), Query Frame = 0
Query: 22 MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS 81
MVS+SL LLSPTSFPSISKTDSPSSSI F G FS+SSKR FPKR RLFRCQILGS
Sbjct: 1 MVSTSLLLLSPTSFPSISKTDSPSSSISTNF--GSFSQSSKRSFRFPKRFRLFRCQILGS 60
Query: 82 SSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVD 141
SS SNQ RDD +AELFLQNNSIADFMRFKRDGSS ELQTA VSYRKKFPWSILQPF+QVD
Sbjct: 61 SSPSNQLRDDGTAELFLQNNSIADFMRFKRDGSSTELQTAVVSYRKKFPWSILQPFLQVD 120
Query: 142 LVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNIL 201
LVSTIHIADKDYFEALQKEL+SYDCILYEMVASRESLE+RRNP A KKL+SSRSRGFN+L
Sbjct: 121 LVSTIHIADKDYFEALQKELESYDCILYEMVASRESLENRRNPTAMKKLQSSRSRGFNLL 180
Query: 202 GCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRST 261
GCIQRQMARVLTLDFQLDCLDYQA+NW+HADLDYETF ILQTEKGE+FFTFARDMTIRST
Sbjct: 181 GCIQRQMARVLTLDFQLDCLDYQASNWYHADLDYETFTILQTEKGESFFTFARDMTIRST 240
Query: 262 KAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRL 321
KA+VQPTA DLEPWKSKLL ASRVLPMPLVG+LIIGSVCAD GSQASEFPEFEALS L
Sbjct: 241 KALVQPTAGSKDLEPWKSKLLRASRVLPMPLVGMLIIGSVCADGGSQASEFPEFEALSSL 300
Query: 322 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGG 381
DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDA+DKGHNKIAILYGG
Sbjct: 301 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAMDKGHNKIAILYGG 360
Query: 382 GHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLA 441
GHMPDLGRRLRE+FDLIP RVKWITAWSI RK+SSSSLPFLK+LAD SGWPLNRYQTLA
Sbjct: 361 GHMPDLGRRLREDFDLIPSRVKWITAWSIAKRKVSSSSLPFLKSLADVSGWPLNRYQTLA 420
Query: 442 LLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 485
LLIFSSVLAVDLWFWELFFGTAANWISEVAL+VYQYIDNVQLM
Sbjct: 421 LLIFSSVLAVDLWFWELFFGTAANWISEVALDVYQYIDNVQLM 461
BLAST of MS003820 vs. ExPASy TrEMBL
Match:
A0A6J1CVA7 (uncharacterized protein LOC111014591 OS=Momordica charantia OX=3673 GN=LOC111014591 PE=4 SV=1)
HSP 1 Score: 909.1 bits (2348), Expect = 8.1e-261
Identity = 463/463 (100.00%), Postives = 463/463 (100.00%), Query Frame = 0
Query: 22 MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS 81
MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS
Sbjct: 1 MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS 60
Query: 82 SSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVD 141
SSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVD
Sbjct: 61 SSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVD 120
Query: 142 LVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNIL 201
LVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNIL
Sbjct: 121 LVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNIL 180
Query: 202 GCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRST 261
GCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRST
Sbjct: 181 GCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRST 240
Query: 262 KAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRL 321
KAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRL
Sbjct: 241 KAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRL 300
Query: 322 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGG 381
DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGG
Sbjct: 301 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGG 360
Query: 382 GHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLA 441
GHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLA
Sbjct: 361 GHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLA 420
Query: 442 LLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 485
LLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM
Sbjct: 421 LLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 463
BLAST of MS003820 vs. ExPASy TrEMBL
Match:
A0A6J1L277 (uncharacterized protein LOC111499173 OS=Cucurbita maxima OX=3661 GN=LOC111499173 PE=4 SV=1)
HSP 1 Score: 807.4 bits (2084), Expect = 3.3e-230
Identity = 414/463 (89.42%), Postives = 433/463 (93.52%), Query Frame = 0
Query: 22 MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS 81
MVS+SL LLSP+SFPSISKTDSPSSSI F G FS+SSKR FPKR RLFRCQILG
Sbjct: 1 MVSTSLLLLSPSSFPSISKTDSPSSSISTNF--GSFSQSSKRSFRFPKRFRLFRCQILGP 60
Query: 82 SSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVD 141
SS SNQ RDD +AELFLQNNSIADFMRFKRDGSS ELQTA VSYRKKFPWSILQPF+QVD
Sbjct: 61 SSPSNQFRDDGTAELFLQNNSIADFMRFKRDGSSTELQTAVVSYRKKFPWSILQPFLQVD 120
Query: 142 LVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNIL 201
LVSTIHIADKDYFEALQKEL+SYDCILYEMVASRESLE+RRNP A KKL+SSRSRGFNIL
Sbjct: 121 LVSTIHIADKDYFEALQKELESYDCILYEMVASRESLENRRNPTAMKKLQSSRSRGFNIL 180
Query: 202 GCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRST 261
GCIQRQMARVLTLDFQLDCLDYQA+NW+HADLDYETF ILQTEKGENFFTFARDMTIRST
Sbjct: 181 GCIQRQMARVLTLDFQLDCLDYQASNWYHADLDYETFTILQTEKGENFFTFARDMTIRST 240
Query: 262 KAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRL 321
KA+VQPTA DLEPWKSKLL ASRVLPMPLVG+LIIGSVCAD GSQASEFPEFEALS L
Sbjct: 241 KALVQPTAGSKDLEPWKSKLLRASRVLPMPLVGMLIIGSVCADGGSQASEFPEFEALSSL 300
Query: 322 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGG 381
DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDA+DKGHNKIAILYGG
Sbjct: 301 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAMDKGHNKIAILYGG 360
Query: 382 GHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLA 441
GHMPDLGRRLRE+FDLIP RVKWITAWSI RK+SSSSLPFLK+LAD SGWPLNRYQTLA
Sbjct: 361 GHMPDLGRRLREDFDLIPSRVKWITAWSIAKRKVSSSSLPFLKSLADVSGWPLNRYQTLA 420
Query: 442 LLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 485
LLIFSSVLAVDLWFWELFFGTAANWIS+VAL+VYQY+DNVQLM
Sbjct: 421 LLIFSSVLAVDLWFWELFFGTAANWISQVALDVYQYVDNVQLM 461
BLAST of MS003820 vs. ExPASy TrEMBL
Match:
A0A6J1EYY7 (uncharacterized protein LOC111440833 OS=Cucurbita moschata OX=3662 GN=LOC111440833 PE=4 SV=1)
HSP 1 Score: 806.6 bits (2082), Expect = 5.7e-230
Identity = 414/463 (89.42%), Postives = 433/463 (93.52%), Query Frame = 0
Query: 22 MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS 81
MVS+SL LLSPTSFPSISKTDSPSSSI F G FS+SSKR FPKR RLFRCQILG+
Sbjct: 1 MVSTSLLLLSPTSFPSISKTDSPSSSISTNF--GSFSQSSKRSFRFPKRFRLFRCQILGA 60
Query: 82 SSSSNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRKKFPWSILQPFVQVD 141
SS SNQ RDD +AELFLQNNSIADFMRFKRDGS ELQTA VSYRKKFPWSILQPF+QVD
Sbjct: 61 SSPSNQLRDDGTAELFLQNNSIADFMRFKRDGSCTELQTAVVSYRKKFPWSILQPFLQVD 120
Query: 142 LVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSRGFNIL 201
LVSTIHIADKDYFEALQKEL+SYDCILYEMVASRESLE+RRNP A KKL+SSRSRGFN+L
Sbjct: 121 LVSTIHIADKDYFEALQKELESYDCILYEMVASRESLENRRNPTAMKKLQSSRSRGFNLL 180
Query: 202 GCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDMTIRST 261
GCIQRQMARVLTLDFQLDCLDYQA+NW+HADLDYETF ILQTEKGE+FFTFARDMTIRST
Sbjct: 181 GCIQRQMARVLTLDFQLDCLDYQASNWYHADLDYETFTILQTEKGESFFTFARDMTIRST 240
Query: 262 KAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFEALSRL 321
KA+VQPTA DLEPWKSKLL ASRVLPMPLVG+LIIGSVCAD GSQASEFPEFEALS L
Sbjct: 241 KALVQPTAGSKDLEPWKSKLLRASRVLPMPLVGMLIIGSVCADGGSQASEFPEFEALSSL 300
Query: 322 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIAILYGG 381
DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDA+DKGHNKIAILYGG
Sbjct: 301 DLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAMDKGHNKIAILYGG 360
Query: 382 GHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALADASGWPLNRYQTLA 441
GHMPDLGRRLRE+FDLIP RVKWITAWSI RK+SSSSLPFLK+LAD SGWPLNRYQTLA
Sbjct: 361 GHMPDLGRRLREDFDLIPSRVKWITAWSIAKRKVSSSSLPFLKSLADVSGWPLNRYQTLA 420
Query: 442 LLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 485
LLIFSSVLAVDLWFWELFFGTAANWISEVAL+VYQYIDNVQLM
Sbjct: 421 LLIFSSVLAVDLWFWELFFGTAANWISEVALDVYQYIDNVQLM 461
BLAST of MS003820 vs. ExPASy TrEMBL
Match:
A0A0A0LTP1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G248120 PE=4 SV=1)
HSP 1 Score: 805.4 bits (2079), Expect = 1.3e-229
Identity = 424/492 (86.18%), Postives = 440/492 (89.43%), Query Frame = 0
Query: 1 FWALHHHSLSRARQVSVSREDMVSSSLSLLSPTSFPSISKTDSP-------SSSIPNKFG 60
FW L HS + +SREDMVS+SLSLL P SFPSI K DSP SSSIP KF
Sbjct: 60 FWVL--HSQFSSSPCFLSREDMVSNSLSLLLPISFPSIFKPDSPSSSSSSSSSSIPTKF- 119
Query: 61 IGPFSESSKRRCTFPKRVRLFRCQILGSSSS-SNQSRDDASAELFLQNNSIADFMRFKRD 120
PF S R FPK RLFRCQI SSSS SNQ RDDAS + F QNNSIADFMRFKRD
Sbjct: 120 --PFFSDSSR---FPKSFRLFRCQIPASSSSASNQLRDDASPDPFFQNNSIADFMRFKRD 179
Query: 121 GSSAELQTATVSYRKKFPWSILQPFVQVDLVSTIHIADKDYFEALQKELDSYDCILYEMV 180
G SAELQTA VSY+KKFPWSILQPFVQVDLVSTIHIADK+YF+ALQKEL+SYD ILYEMV
Sbjct: 180 GPSAELQTAIVSYKKKFPWSILQPFVQVDLVSTIHIADKEYFKALQKELESYDSILYEMV 239
Query: 181 ASRESLESRRNPDATKKLKSSRSRGFNILGCIQRQMARVLTLDFQLDCLDYQAANWFHAD 240
AS+ESLE+R+NP A KKLKSSRSRG NILGCIQRQMARVLTLDFQLDCLDYQA+NW+HAD
Sbjct: 240 ASKESLENRKNPAAMKKLKSSRSRGLNILGCIQRQMARVLTLDFQLDCLDYQASNWYHAD 299
Query: 241 LDYETFRILQTEKGENFFTFARDMTIRSTKAMVQPTAVPDDLEPWKSKLLWASRVLPMPL 300
LDYETFRILQTEKGENFFTFARDMTIRSTKAMVQPT VP+DLEPWKSKLLWASRVLPMPL
Sbjct: 300 LDYETFRILQTEKGENFFTFARDMTIRSTKAMVQPTTVPEDLEPWKSKLLWASRVLPMPL 359
Query: 301 VGLLIIGSVCADVGSQASEFPEFEALSRLDLGAAMKVFLAKRLTSEFTQVTAEVEESSVI 360
VGLLIIGSVCAD GSQASEFPEFEALSRLDLGAAMKVFLAKRLTSEFTQVTAEVEESSVI
Sbjct: 360 VGLLIIGSVCADGGSQASEFPEFEALSRLDLGAAMKVFLAKRLTSEFTQVTAEVEESSVI 419
Query: 361 IGERNKAATEALRDAIDKGHNKIAILYGGGHMPDLGRRLREEFDLIPCRVKWITAWSITN 420
IGERNKAATEALRDA+DKGHN+IAILYGGGHMPDLGRRLREEFDLIPCRVKWITAWSIT
Sbjct: 420 IGERNKAATEALRDALDKGHNRIAILYGGGHMPDLGRRLREEFDLIPCRVKWITAWSITK 479
Query: 421 RKLSSSSLPFLKALADASGWPLNRYQTLALLIFSSVLAVDLWFWELFFGTAANWISEVAL 480
RKL SSSLPFLKALAD SGWPLNRYQTLALLIFSSVLAVDLWFWELFFGTAANWISEVAL
Sbjct: 480 RKLGSSSLPFLKALADVSGWPLNRYQTLALLIFSSVLAVDLWFWELFFGTAANWISEVAL 539
Query: 481 EVYQYIDNVQLM 485
EVYQYIDNVQLM
Sbjct: 540 EVYQYIDNVQLM 543
BLAST of MS003820 vs. ExPASy TrEMBL
Match:
A0A1S3CJ62 (uncharacterized protein LOC103501560 OS=Cucumis melo OX=3656 GN=LOC103501560 PE=4 SV=1)
HSP 1 Score: 796.2 bits (2055), Expect = 7.6e-227
Identity = 419/477 (87.84%), Postives = 431/477 (90.36%), Query Frame = 0
Query: 22 MVSSSLSLLSPTSFPSISKTDSP-----SSSIPNKFGIGPFSESSKRRCTFPKRVRLFRC 81
MVS+SLSLL P SFPSISK DSP SSSIP KF PF S R FPKR RLFRC
Sbjct: 1 MVSNSLSLLLPISFPSISKPDSPSSSSSSSSIPTKF---PFFTDSSR---FPKRFRLFRC 60
Query: 82 QILGSSSS---------SNQSRDDASAELFLQNNSIADFMRFKRDGSSAELQTATVSYRK 141
QI SSSS SNQ RDDAS + F QNNSIADFMRFKRDG SAELQTA VSY+K
Sbjct: 61 QIPASSSSSSSSSSSSLSNQLRDDASPDSFFQNNSIADFMRFKRDGPSAELQTAIVSYKK 120
Query: 142 KFPWSILQPFVQVDLVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDAT 201
KFPWSILQPFVQVDLVSTIHIADK+YFEALQKEL+SYD +LYEMVASRESLE+RRNP A
Sbjct: 121 KFPWSILQPFVQVDLVSTIHIADKEYFEALQKELESYDSVLYEMVASRESLENRRNPVAM 180
Query: 202 KKLKSSRSRGFNILGCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGE 261
KKLKSSRSRG NILGCIQRQMARVLTLDFQLDCLDYQA+NW HADLDYETFRILQTEKGE
Sbjct: 181 KKLKSSRSRGLNILGCIQRQMARVLTLDFQLDCLDYQASNWCHADLDYETFRILQTEKGE 240
Query: 262 NFFTFARDMTIRSTKAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGS 321
NFFTFARDMTIRSTKAMVQPTAVP+DLEPWKSKLLWASRVLPMPLVGLLIIGSVCAD GS
Sbjct: 241 NFFTFARDMTIRSTKAMVQPTAVPEDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADGGS 300
Query: 322 QASEFPEFEALSRLDLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDA 381
QASEFPEFEALSRLDL AAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDA
Sbjct: 301 QASEFPEFEALSRLDLSAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDA 360
Query: 382 IDKGHNKIAILYGGGHMPDLGRRLREEFDLIPCRVKWITAWSITNRKLSSSSLPFLKALA 441
+DKGHN+IAILYGGGHMPDLGRRLREEFDLIPCRVKWITAWSIT RKL+SSSLPFLKALA
Sbjct: 361 LDKGHNRIAILYGGGHMPDLGRRLREEFDLIPCRVKWITAWSITKRKLASSSLPFLKALA 420
Query: 442 DASGWPLNRYQTLALLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 485
D SGWPLNRYQTLALLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM
Sbjct: 421 DVSGWPLNRYQTLALLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 471
BLAST of MS003820 vs. TAIR 10
Match:
AT5G19540.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 580.9 bits (1496), Expect = 9.7e-166
Identity = 302/469 (64.39%), Postives = 364/469 (77.61%), Query Frame = 0
Query: 22 MVSSSLSLLSPTSFPSISKTDSPSSSIPNKFGIGPFSESSKRRCTFPKRVRLFRCQILGS 81
M SS ++L P S TD SSS +S S R + + RC +
Sbjct: 1 MAVSSFAVLPPFGSSYSSTTDRSSSS-------SFYSSSKLLRISQYRFQSNSRCPNVFV 60
Query: 82 SSSSNQSRDDAS-AELFLQNNSIADFMRFKR----DGSSAELQTATVSYRKKFPWSILQP 141
S S NQ DDAS A LFL++NSIAD+MRFKR ++ELQTA VSY+K+FPW +L P
Sbjct: 61 SCSLNQPSDDASDAALFLESNSIADYMRFKRRPDPGNGTSELQTAIVSYKKRFPWILLNP 120
Query: 142 FVQVDLVSTIHIADKDYFEALQKELDSYDCILYEMVASRESLESRRNPDATKKLKSSRSR 201
F+QVDLVSTIHIADK+YF LQKEL+ YD ILYEMVAS+E+LE+RRNP A+K+LKSSRSR
Sbjct: 121 FLQVDLVSTIHIADKEYFTTLQKELEPYDSILYEMVASKETLENRRNPIASKRLKSSRSR 180
Query: 202 GFNILGCIQRQMARVLTLDFQLDCLDYQAANWFHADLDYETFRILQTEKGENFFTFARDM 261
GF+ILG IQRQMARVLTLDFQLDCLDY NW+HADLD+ETF++LQ EKGE+FF+FARDM
Sbjct: 181 GFSILGFIQRQMARVLTLDFQLDCLDYDTENWYHADLDFETFQLLQKEKGESFFSFARDM 240
Query: 262 TIRSTKAMVQPTAVPDDLEPWKSKLLWASRVLPMPLVGLLIIGSVCADVGSQASEFPEFE 321
TIRSTKAM+QP V + + W+SKLLW SRV PMPLVGL +IG+ CAD G Q S++PE E
Sbjct: 241 TIRSTKAMIQPALVTEGRDTWRSKLLWVSRVFPMPLVGLFLIGAFCADFGDQTSDYPELE 300
Query: 322 ALSRLDLGAAMKVFLAKRLTSEFTQVTAEVEESSVIIGERNKAATEALRDAIDKGHNKIA 381
ALSRLD GAAMKVFLAKRLTSE T T+++EE SVIIGERN+AATEALR AI++GH +I
Sbjct: 301 ALSRLDFGAAMKVFLAKRLTSELTLETSDIEEKSVIIGERNRAATEALRRAIEQGHKRIG 360
Query: 382 ILYGGGHMPDLGRRLREEFDLIPCRVKWITAWSITN-RKLSSSSLPFLKALADASGWPLN 441
ILYGGGHMPDLGRRLREEFDL+P V+W+TAWSI N L +SS P L+ +A+A WPLN
Sbjct: 361 ILYGGGHMPDLGRRLREEFDLVPSEVRWVTAWSIRNPTDLETSSYPILRKMAEALRWPLN 420
Query: 442 RYQTLALLIFSSVLAVDLWFWELFFGTAANWISEVALEVYQYIDNVQLM 485
RYQTLALLIFSSVLA+DL FWELFFG+ +W +++ E+YQ+IDN +++
Sbjct: 421 RYQTLALLIFSSVLALDLCFWELFFGSTIDWATQIGAELYQFIDNTKIV 462
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022145081.1 | 1.7e-260 | 100.00 | uncharacterized protein LOC111014591 [Momordica charantia] | [more] |
XP_038879904.1 | 8.6e-233 | 90.17 | uncharacterized protein LOC120071619 [Benincasa hispida] >XP_038879913.1 unchara... | [more] |
XP_023531025.1 | 8.1e-231 | 89.85 | uncharacterized protein LOC111793404 [Cucurbita pepo subsp. pepo] >XP_023531026.... | [more] |
KAG6587960.1 | 1.4e-230 | 89.63 | hypothetical protein SDJN03_16525, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KAG7021849.1 | 1.4e-230 | 89.85 | hypothetical protein SDJN02_15577 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CVA7 | 8.1e-261 | 100.00 | uncharacterized protein LOC111014591 OS=Momordica charantia OX=3673 GN=LOC111014... | [more] |
A0A6J1L277 | 3.3e-230 | 89.42 | uncharacterized protein LOC111499173 OS=Cucurbita maxima OX=3661 GN=LOC111499173... | [more] |
A0A6J1EYY7 | 5.7e-230 | 89.42 | uncharacterized protein LOC111440833 OS=Cucurbita moschata OX=3662 GN=LOC1114408... | [more] |
A0A0A0LTP1 | 1.3e-229 | 86.18 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G248120 PE=4 SV=1 | [more] |
A0A1S3CJ62 | 7.6e-227 | 87.84 | uncharacterized protein LOC103501560 OS=Cucumis melo OX=3656 GN=LOC103501560 PE=... | [more] |
Match Name | E-value | Identity | Description | |
AT5G19540.1 | 9.7e-166 | 64.39 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |