Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAAATCTTCCTCGGTTTGGCCGCACATGGCAACGTTTCTCTTCCCTCCCCCGCCCCGCCACTGCCTCACAACCGGAACAACTTCAGCCGGCACCAGCGCCGGCGCCGATGAATGGGCGTGAAATTTCACCCAACTCTCCGGTCACCGCCCAAGTTCTCCAAACTTCTCCAATTAGAGAAAGATCTTCTCGACTTCCATCTCCCACGAAGAAATTTGCCTCGCCGCCTTCTTCTCCAAAATATAGACCCGCCGCCGCCGCTTCGCCTCCCAAGCCACTGTCTCCAACTCCGTCGTACAACCGTTACGACGGCGAACGACGGTCTAGTGCCACCACATCTCCCAAAGCCATTAAACCCTCCTATACCTCTCCGCCGCTGTCGCCTGCCAAGCACAAATACCCAACCGCCGCCACCGCCGCACCGCTCTCTCCTCTAACTCTGCCTCGGTCGGAGGCGAAACATGAACCTGGGCCCACGATTCGATCCAGAAGCCCACCGGAGGTTAGTTAACTTTAAATATTACGATATATATTAATTTCTTTTTACGTAAATACTACTTTTATTGGTATAATTTAATTATTTTGATTATTGTGCTATCAAAATGTCAATTTTTTTTGGTCTACTTTCCATTTTTATTCGTTTTAGTTATCATAATTTTTAAAAGTCCCTTTTAATTTTTCATATTTTTTTTATAAAGGGACCATTTTAATTTTTTTTTGCTATGACTGAATAAATTACCTCCCAAATCAAAAGTTTAAACTCATGATTTTGATAAATTTAATTATATATATGGTCGAAAATTTTATTCGAGTTCTTTTCTGTCACAATTTTAGCGTCATATTTTATTTACTAATTTTAAAAAAATGTAATCATAAATTTGTATTAAAATATTTTTATTCCATAAGATATGTGTTGAAATGTTGCAACAAAGAACAAAAGATGTTCAACTTTAAAAAGTAAACTAAAACGGACCTATCAAGATTTATAAGATATATGAACTATGATACTCCTTGACCAAATGATTTTGAGATGACACTCGACGATATTAGTATTAGAGTTGAACAAGTTTAAACGACCATCCAAGCAAAAAAATGAGATTCAACCAACATTAATGTATCCATAATTTACGTGTGTATGAGTTTAACAAGATGACTTTCTCTTGTTTTGTTCCAATCAAATTATAGTTTATCATTGAATAATTCAGGTTAGTTCTGGTATATATTTAATCTTGTATACATATTCTTAATCAGGTCGAGCAGAAATCGATACATTATCCGAAGGTCGAGAAGCCGACGAAACCCGATCACCGGCCGTCAGAGTACAACTCCGGCAAGCCCCAATACAAGCAGCAGCAGCAACAGAGCGACGTGATAACCATCAAAGGCGAGAACGTAGGCGCCATCATGCACATAACTCAATCATCTGACGGCACAGAAATGGTGAAGAAAAAGCCAAGCACAGAGAGTGGAAACGATGATGAGAAAGGCAACAAATCAAGTTCTTTGCCGGCGAAATCATTCATGAACAGCAATTTTCAAGGGGTCAATAATTCCGTTCTGTACAACTCCTCCTTCAGCCACCGTGATCCAGGCCTGCACCTGTCTTTCTCCAAGAAGCCCGCCCATGGCCATGCCCAATCTGTTCTTGGTTCGACCTACTAG
mRNA sequence
ATGGCAAATCTTCCTCGGTTTGGCCGCACATGGCAACGTTTCTCTTCCCTCCCCCGCCCCGCCACTGCCTCACAACCGGAACAACTTCAGCCGGCACCAGCGCCGGCGCCGATGAATGGGCGTGAAATTTCACCCAACTCTCCGGTCACCGCCCAAGTTCTCCAAACTTCTCCAATTAGAGAAAGATCTTCTCGACTTCCATCTCCCACGAAGAAATTTGCCTCGCCGCCTTCTTCTCCAAAATATAGACCCGCCGCCGCCGCTTCGCCTCCCAAGCCACTGTCTCCAACTCCGTCGTACAACCGTTACGACGGCGAACGACGGTCTAGTGCCACCACATCTCCCAAAGCCATTAAACCCTCCTATACCTCTCCGCCGCTGTCGCCTGCCAAGCACAAATACCCAACCGCCGCCACCGCCGCACCGCTCTCTCCTCTAACTCTGCCTCGGTCGGAGGCGAAACATGAACCTGGGCCCACGATTCGATCCAGAAGCCCACCGGAGGTCGAGCAGAAATCGATACATTATCCGAAGGTCGAGAAGCCGACGAAACCCGATCACCGGCCGTCAGAGTACAACTCCGGCAAGCCCCAATACAAGCAGCAGCAGCAACAGAGCGACGTGATAACCATCAAAGGCGAGAACGTAGGCGCCATCATGCACATAACTCAATCATCTGACGGCACAGAAATGGTGAAGAAAAAGCCAAGCACAGAGAGTGGAAACGATGATGAGAAAGGCAACAAATCAAGTTCTTTGCCGGCGAAATCATTCATGAACAGCAATTTTCAAGGGGTCAATAATTCCGTTCTGTACAACTCCTCCTTCAGCCACCGTGATCCAGGCCTGCACCTGTCTTTCTCCAAGAAGCCCGCCCATGGCCATGCCCAATCTGTTCTTGGTTCGACCTACTAG
Coding sequence (CDS)
ATGGCAAATCTTCCTCGGTTTGGCCGCACATGGCAACGTTTCTCTTCCCTCCCCCGCCCCGCCACTGCCTCACAACCGGAACAACTTCAGCCGGCACCAGCGCCGGCGCCGATGAATGGGCGTGAAATTTCACCCAACTCTCCGGTCACCGCCCAAGTTCTCCAAACTTCTCCAATTAGAGAAAGATCTTCTCGACTTCCATCTCCCACGAAGAAATTTGCCTCGCCGCCTTCTTCTCCAAAATATAGACCCGCCGCCGCCGCTTCGCCTCCCAAGCCACTGTCTCCAACTCCGTCGTACAACCGTTACGACGGCGAACGACGGTCTAGTGCCACCACATCTCCCAAAGCCATTAAACCCTCCTATACCTCTCCGCCGCTGTCGCCTGCCAAGCACAAATACCCAACCGCCGCCACCGCCGCACCGCTCTCTCCTCTAACTCTGCCTCGGTCGGAGGCGAAACATGAACCTGGGCCCACGATTCGATCCAGAAGCCCACCGGAGGTCGAGCAGAAATCGATACATTATCCGAAGGTCGAGAAGCCGACGAAACCCGATCACCGGCCGTCAGAGTACAACTCCGGCAAGCCCCAATACAAGCAGCAGCAGCAACAGAGCGACGTGATAACCATCAAAGGCGAGAACGTAGGCGCCATCATGCACATAACTCAATCATCTGACGGCACAGAAATGGTGAAGAAAAAGCCAAGCACAGAGAGTGGAAACGATGATGAGAAAGGCAACAAATCAAGTTCTTTGCCGGCGAAATCATTCATGAACAGCAATTTTCAAGGGGTCAATAATTCCGTTCTGTACAACTCCTCCTTCAGCCACCGTGATCCAGGCCTGCACCTGTCTTTCTCCAAGAAGCCCGCCCATGGCCATGCCCAATCTGTTCTTGGTTCGACCTACTAG
Protein sequence
MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIRERSSRLPSPTKKFASPPSSPKYRPAAAASPPKPLSPTPSYNRYDGERRSSATTSPKAIKPSYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKVEKPTKPDHRPSEYNSGKPQYKQQQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKPSTESGNDDEKGNKSSSLPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAHGHAQSVLGSTY
Homology
BLAST of Moc05g04890 vs. NCBI nr
Match:
XP_022147442.1 (proline-rich receptor-like protein kinase PERK2 [Momordica charantia])
HSP 1 Score: 581.6 bits (1498), Expect = 3.8e-162
Identity = 304/304 (100.00%), Postives = 304/304 (100.00%), Query Frame = 0
Query: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR
Sbjct: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
Query: 61 ERSSRLPSPTKKFASPPSSPKYRPAAAASPPKPLSPTPSYNRYDGERRSSATTSPKAIKP 120
ERSSRLPSPTKKFASPPSSPKYRPAAAASPPKPLSPTPSYNRYDGERRSSATTSPKAIKP
Sbjct: 61 ERSSRLPSPTKKFASPPSSPKYRPAAAASPPKPLSPTPSYNRYDGERRSSATTSPKAIKP 120
Query: 121 SYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKVE 180
SYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKVE
Sbjct: 121 SYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKVE 180
Query: 181 KPTKPDHRPSEYNSGKPQYKQQQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKPSTES 240
KPTKPDHRPSEYNSGKPQYKQQQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKPSTES
Sbjct: 181 KPTKPDHRPSEYNSGKPQYKQQQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKPSTES 240
Query: 241 GNDDEKGNKSSSLPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAHGHAQSVL 300
GNDDEKGNKSSSLPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAHGHAQSVL
Sbjct: 241 GNDDEKGNKSSSLPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAHGHAQSVL 300
Query: 301 GSTY 305
GSTY
Sbjct: 301 GSTY 304
BLAST of Moc05g04890 vs. NCBI nr
Match:
XP_022945677.1 (DNA-directed RNA polymerase II subunit 1-like [Cucurbita moschata])
HSP 1 Score: 355.1 bits (910), Expect = 5.8e-94
Identity = 202/299 (67.56%), Postives = 231/299 (77.26%), Query Frame = 0
Query: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
MANLPRFGRTW RFSSLPRP TA + + P PA + E+ P++P TA VLQTSPI+
Sbjct: 1 MANLPRFGRTWNRFSSLPRPGTAPRLD----VPPPAATSEPEVYPSAPRTANVLQTSPIK 60
Query: 61 ERSSRLPSPTKKFASPPSSPKYRPAA-AASPPKPLSPTPSYNRYDGERRSSATTSPKAIK 120
ERS R+ SP +K+ SPP+SPKYR AA + SP KPLSP P+YNRYDGERRSSA SPK K
Sbjct: 61 ERSPRITSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSALASPKTFK 120
Query: 121 PSYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKV 180
+YTSPP SPAKHKY T+ APLSPL LP E +HEP P +R RSPPEV+QKS+ Y K
Sbjct: 121 TTYTSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPVVRPRSPPEVQQKSVLYQKT 180
Query: 181 --EKPTKPDHRPSEYNSGKPQYKQ-QQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKP 240
EKP K DHR SEY+SGKPQ KQ QQQSDVI IKGENVGA+MHITQSSD TE KKKP
Sbjct: 181 TSEKPAKTDHRASEYSSGKPQQKQTHQQQSDVINIKGENVGAVMHITQSSDATETHKKKP 240
Query: 241 STESGNDDEK-GNKSSS-LPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAH 294
+ +++EK NKSSS +P KSFMNSNFQGVNNS+LYNSS SHRDPGLHL+F+KKP H
Sbjct: 241 TVSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTH 295
BLAST of Moc05g04890 vs. NCBI nr
Match:
KAG6597058.1 (hypothetical protein SDJN03_10238, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 354.8 bits (909), Expect = 7.6e-94
Identity = 202/299 (67.56%), Postives = 231/299 (77.26%), Query Frame = 0
Query: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
MANLPRFGRTW RFSSLPRP TA + + P PA + E+ P++P TA VLQTSPI+
Sbjct: 1 MANLPRFGRTWNRFSSLPRPGTAPRLD----VPPPAATSEPEVYPSAPRTANVLQTSPIK 60
Query: 61 ERSSRLPSPTKKFASPPSSPKYRPAA-AASPPKPLSPTPSYNRYDGERRSSATTSPKAIK 120
ERS R+ SP +K+ SPP+SPKYR AA + SP KPLSP P+YNRYDGERRSSA SPK K
Sbjct: 61 ERSPRITSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSALASPKTFK 120
Query: 121 PSYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKV 180
+YTSPP SPAKHKY T+ APLSPL LP E +HEP P +R RSPPEV+QKS+ Y K
Sbjct: 121 TTYTSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPMVRPRSPPEVQQKSVLYQKT 180
Query: 181 --EKPTKPDHRPSEYNSGKPQYKQ-QQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKP 240
EKP K DHR SEY+SGKPQ KQ QQQSDVI IKGENVGA+MHITQSSD TE KKKP
Sbjct: 181 TSEKPAKTDHRASEYSSGKPQQKQTHQQQSDVINIKGENVGAVMHITQSSDATETHKKKP 240
Query: 241 STESGNDDEK-GNKSSS-LPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAH 294
+ +++EK NKSSS +P KSFMNSNFQGVNNS+LYNSS SHRDPGLHL+F+KKP H
Sbjct: 241 TVSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTH 295
BLAST of Moc05g04890 vs. NCBI nr
Match:
XP_023538835.1 (proline-rich extensin-like protein EPR1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 354.0 bits (907), Expect = 1.3e-93
Identity = 203/299 (67.89%), Postives = 230/299 (76.92%), Query Frame = 0
Query: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
MANLPRFGRTW RFSSLPRP TA + + PA A P E+ P P TA VLQTSPI+
Sbjct: 1 MANLPRFGRTWNRFSSLPRPGTAPRLDVPPPAAASEP----EVYPPPPRTANVLQTSPIK 60
Query: 61 ERSSRLPSPTKKFASPPSSPKYRPAA-AASPPKPLSPTPSYNRYDGERRSSATTSPKAIK 120
ERS R+PSP +K+ SPP+SPKYR AA + SP KPLSP P+YNRYDGERR+SA SPK K
Sbjct: 61 ERSPRIPSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRTSALASPKTFK 120
Query: 121 PSYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKV 180
+YTSPP SPAKHKY T+ APLSPL LP E +HEP P +R RSPPEV+QKS+ Y K
Sbjct: 121 TTYTSPPRSPAKHKYSTSTLQAPLSPLALPSLEKRHEPEPMVRPRSPPEVQQKSVLYQKT 180
Query: 181 --EKPTKPDHRPSEYNSGKPQYKQ-QQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKP 240
EKP K DHR SEY+SGKPQ KQ QQQSDVI IKGENVGA+MHITQSSD TE KKKP
Sbjct: 181 TSEKPAKTDHRASEYSSGKPQQKQTHQQQSDVINIKGENVGAVMHITQSSDATETHKKKP 240
Query: 241 STESGNDDEK-GNKSSS-LPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAH 294
+ +++EK NKSSS +P KSFMNSNFQGVNNS+LYNSS SHRDPGLHL+F+KKP H
Sbjct: 241 TVSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTH 295
BLAST of Moc05g04890 vs. NCBI nr
Match:
XP_022974814.1 (DNA-directed RNA polymerase II subunit 1-like isoform X2 [Cucurbita maxima] >XP_022974815.1 DNA-directed RNA polymerase II subunit 1-like isoform X3 [Cucurbita maxima])
HSP 1 Score: 353.2 bits (905), Expect = 2.2e-93
Identity = 202/299 (67.56%), Postives = 231/299 (77.26%), Query Frame = 0
Query: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
MANLPRFGRTW RFSSLPRPATA + + P PA + E+ ++P TA VLQTSPI+
Sbjct: 1 MANLPRFGRTWNRFSSLPRPATAPRFD----VPPPAATSEPEVFSSAPPTANVLQTSPIK 60
Query: 61 ERSSRLPSPTKKFASPPSSPKYRPAA-AASPPKPLSPTPSYNRYDGERRSSATTSPKAIK 120
ERS R+ SP +K+ SPP+SPKYR AA + SP KPLSP P+YNRYDGERRSSA SPK K
Sbjct: 61 ERSPRITSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSAQASPKTFK 120
Query: 121 PSYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKV 180
+YTSPP SPAKHKY T+ APLSPL LP E +HEP P +R RSPPEV+QKS+ Y K
Sbjct: 121 TTYTSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPMVRPRSPPEVQQKSVLYQKT 180
Query: 181 --EKPTKPDHRPSEYNSGKPQYKQ-QQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKP 240
EKP K DHR SEY+SGKPQ KQ QQQSDVI IKGENVGA+MHITQSSD TE KKKP
Sbjct: 181 TSEKPAKTDHRASEYSSGKPQQKQTHQQQSDVINIKGENVGAVMHITQSSDATETHKKKP 240
Query: 241 STESGNDDEK-GNKSSS-LPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAH 294
+ +++EK NKSSS +P KSFMNSNFQGVNNS+LYNSS SHRDPGLHL+F+KKP H
Sbjct: 241 TVSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTH 295
BLAST of Moc05g04890 vs. ExPASy TrEMBL
Match:
A0A6J1D2D2 (proline-rich receptor-like protein kinase PERK2 OS=Momordica charantia OX=3673 GN=LOC111016367 PE=4 SV=1)
HSP 1 Score: 581.6 bits (1498), Expect = 1.9e-162
Identity = 304/304 (100.00%), Postives = 304/304 (100.00%), Query Frame = 0
Query: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR
Sbjct: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
Query: 61 ERSSRLPSPTKKFASPPSSPKYRPAAAASPPKPLSPTPSYNRYDGERRSSATTSPKAIKP 120
ERSSRLPSPTKKFASPPSSPKYRPAAAASPPKPLSPTPSYNRYDGERRSSATTSPKAIKP
Sbjct: 61 ERSSRLPSPTKKFASPPSSPKYRPAAAASPPKPLSPTPSYNRYDGERRSSATTSPKAIKP 120
Query: 121 SYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKVE 180
SYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKVE
Sbjct: 121 SYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKVE 180
Query: 181 KPTKPDHRPSEYNSGKPQYKQQQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKPSTES 240
KPTKPDHRPSEYNSGKPQYKQQQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKPSTES
Sbjct: 181 KPTKPDHRPSEYNSGKPQYKQQQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKPSTES 240
Query: 241 GNDDEKGNKSSSLPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAHGHAQSVL 300
GNDDEKGNKSSSLPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAHGHAQSVL
Sbjct: 241 GNDDEKGNKSSSLPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAHGHAQSVL 300
Query: 301 GSTY 305
GSTY
Sbjct: 301 GSTY 304
BLAST of Moc05g04890 vs. ExPASy TrEMBL
Match:
A0A6J1G1L2 (DNA-directed RNA polymerase II subunit 1-like OS=Cucurbita moschata OX=3662 GN=LOC111449836 PE=4 SV=1)
HSP 1 Score: 355.1 bits (910), Expect = 2.8e-94
Identity = 202/299 (67.56%), Postives = 231/299 (77.26%), Query Frame = 0
Query: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
MANLPRFGRTW RFSSLPRP TA + + P PA + E+ P++P TA VLQTSPI+
Sbjct: 1 MANLPRFGRTWNRFSSLPRPGTAPRLD----VPPPAATSEPEVYPSAPRTANVLQTSPIK 60
Query: 61 ERSSRLPSPTKKFASPPSSPKYRPAA-AASPPKPLSPTPSYNRYDGERRSSATTSPKAIK 120
ERS R+ SP +K+ SPP+SPKYR AA + SP KPLSP P+YNRYDGERRSSA SPK K
Sbjct: 61 ERSPRITSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSALASPKTFK 120
Query: 121 PSYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKV 180
+YTSPP SPAKHKY T+ APLSPL LP E +HEP P +R RSPPEV+QKS+ Y K
Sbjct: 121 TTYTSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPVVRPRSPPEVQQKSVLYQKT 180
Query: 181 --EKPTKPDHRPSEYNSGKPQYKQ-QQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKP 240
EKP K DHR SEY+SGKPQ KQ QQQSDVI IKGENVGA+MHITQSSD TE KKKP
Sbjct: 181 TSEKPAKTDHRASEYSSGKPQQKQTHQQQSDVINIKGENVGAVMHITQSSDATETHKKKP 240
Query: 241 STESGNDDEK-GNKSSS-LPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAH 294
+ +++EK NKSSS +P KSFMNSNFQGVNNS+LYNSS SHRDPGLHL+F+KKP H
Sbjct: 241 TVSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTH 295
BLAST of Moc05g04890 vs. ExPASy TrEMBL
Match:
A0A6J1IEX0 (DNA-directed RNA polymerase II subunit 1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111473598 PE=4 SV=1)
HSP 1 Score: 353.2 bits (905), Expect = 1.1e-93
Identity = 202/299 (67.56%), Postives = 231/299 (77.26%), Query Frame = 0
Query: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
MANLPRFGRTW RFSSLPRPATA + + P PA + E+ ++P TA VLQTSPI+
Sbjct: 1 MANLPRFGRTWNRFSSLPRPATAPRFD----VPPPAATSEPEVFSSAPPTANVLQTSPIK 60
Query: 61 ERSSRLPSPTKKFASPPSSPKYRPAA-AASPPKPLSPTPSYNRYDGERRSSATTSPKAIK 120
ERS R+ SP +K+ SPP+SPKYR AA + SP KPLSP P+YNRYDGERRSSA SPK K
Sbjct: 61 ERSPRITSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSAQASPKTFK 120
Query: 121 PSYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKV 180
+YTSPP SPAKHKY T+ APLSPL LP E +HEP P +R RSPPEV+QKS+ Y K
Sbjct: 121 TTYTSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPMVRPRSPPEVQQKSVLYQKT 180
Query: 181 --EKPTKPDHRPSEYNSGKPQYKQ-QQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKP 240
EKP K DHR SEY+SGKPQ KQ QQQSDVI IKGENVGA+MHITQSSD TE KKKP
Sbjct: 181 TSEKPAKTDHRASEYSSGKPQQKQTHQQQSDVINIKGENVGAVMHITQSSDATETHKKKP 240
Query: 241 STESGNDDEK-GNKSSS-LPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAH 294
+ +++EK NKSSS +P KSFMNSNFQGVNNS+LYNSS SHRDPGLHL+F+KKP H
Sbjct: 241 TVSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTH 295
BLAST of Moc05g04890 vs. ExPASy TrEMBL
Match:
A0A6J1IHF4 (DNA-directed RNA polymerase II subunit 1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111473598 PE=4 SV=1)
HSP 1 Score: 352.8 bits (904), Expect = 1.4e-93
Identity = 202/299 (67.56%), Postives = 231/299 (77.26%), Query Frame = 0
Query: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
MANLPRFGRTW RFSSLPRPATA + + P PA + E+ ++P TA VLQTSPI+
Sbjct: 1 MANLPRFGRTWNRFSSLPRPATAPRFD----VPPPAATSEPEVFSSAPPTANVLQTSPIK 60
Query: 61 ERSSRLPSPTKKFASPPSSPKYRPAA-AASPPKPLSPTPSYNRYDGERRSSATTSPKAIK 120
ERS R+ SP +K+ SPP+SPKYR AA + SP KPLSP P+YNRYDGERRSSA SPK K
Sbjct: 61 ERSPRITSPVRKYHSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSAQASPKTFK 120
Query: 121 PSYTSPPLSPAKHKYPTAATAAPLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKV 180
+YTSPP SPAKHKY T+ APLSPL LP E +HEP P +R RSPPEV+QKS+ Y K
Sbjct: 121 TTYTSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPMVRPRSPPEVQQKSVLYQKT 180
Query: 181 --EKPTKPDHRPSEYNSGKPQYKQ-QQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKP 240
EKP K DHR SEY+SGKPQ KQ QQQSDVI IKGENVGA+MHITQSSD TE KKKP
Sbjct: 181 TSEKPAKTDHRASEYSSGKPQQKQTHQQQSDVINIKGENVGAVMHITQSSDATETHKKKP 240
Query: 241 STESGNDDEK-GNKSSS-LPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFSKKPAH 294
+ +++EK NKSSS +P KSFMNSNFQGVNNS+LYNSS SHRDPGLHL+F+KKP H
Sbjct: 241 TVSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTH 295
BLAST of Moc05g04890 vs. ExPASy TrEMBL
Match:
A0A0A0L5T3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G119290 PE=4 SV=1)
HSP 1 Score: 316.2 bits (809), Expect = 1.5e-82
Identity = 189/304 (62.17%), Postives = 225/304 (74.01%), Query Frame = 0
Query: 1 MANLPRFGRTWQRFSSLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIR 60
M+NLPRFGR W RFSSLPRP TA++PE A P E+ P++ T QTSPI+
Sbjct: 1 MSNLPRFGRQWNRFSSLPRPGTATRPEPQPFTAATEP----EVFPSAVPTTNTFQTSPIK 60
Query: 61 ERSSRLPSPTKKFASPPSSPKYRPAAAASPPKPLSPTPSYNRYDGERRSSATTSPKAIKP 120
+R+ RL SP KKF+SPPSSPKY A SP KPLSP P +NRY+GERR+SATTSPK KP
Sbjct: 61 QRTPRLSSPVKKFSSPPSSPKYSGAGTVSPRKPLSPPPVHNRYEGERRTSATTSPKTFKP 120
Query: 121 SYTSPPLSPAKHKYPTAATA-APLSPLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHY--P 180
++ SPP SP+K ++ T TA APLSPL LPRS+ + EP ++R RSPPE+EQK I Y
Sbjct: 121 THISPPPSPSKPRHSTVPTAVAPLSPLALPRSQVRREPEHSLRPRSPPEIEQKKILYQTT 180
Query: 181 KVEKPTKPDH--RPSEYNSGKPQYKQQQQ--QSDVITIKGENVGAIMHITQSSDGTEMVK 240
EKPTK DH + EY + KPQ KQQ Q QSDVI IKGENVGA+MHITQSSDG+E++K
Sbjct: 181 TTEKPTKTDHYRQNDEYGASKPQQKQQHQQLQSDVINIKGENVGAVMHITQSSDGSEVIK 240
Query: 241 KKPST-ESGNDDEKGNKS-SSLPAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSF-SKK 295
KKP+ +S ++EK NKS S+ P KSFMNSNFQGVNNS+LYNSS SHRDPGLHL+F SKK
Sbjct: 241 KKPTVGQSKENEEKTNKSNSNYPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFGSKK 300
BLAST of Moc05g04890 vs. TAIR 10
Match:
AT2G46630.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 110095 Blast hits to 59224 proteins in 2216 species: Archae - 177; Bacteria - 15429; Metazoa - 38345; Fungi - 18843; Plants - 13341; Viruses - 3084; Other Eukaryotes - 20876 (source: NCBI BLink). )
HSP 1 Score: 68.2 bits (165), Expect = 1.3e-11
Identity = 97/338 (28.70%), Postives = 144/338 (42.60%), Query Frame = 0
Query: 16 SLPRPATASQPEQLQPAPAPAPMNGREISPNSPVTAQVLQTSPIRERSSRLPSPTKKFAS 75
S PR P + Q P+P ++ P +P + TSP +ERS SP + S
Sbjct: 47 SPPRQRQPRSPPRQQDPPSPP---RQQQQPLTPPRQKAPPTSPPQERSP-YHSPPSRHMS 106
Query: 76 PPSSPKYRPAAAASPPKPLS-----PTPSY----------NRYDGERRSSATTSPKAIKP 135
PP+ PK AA PP P S P+P N SS +T+ +++K
Sbjct: 107 PPTPPK---AATPPPPPPRSSYTSPPSPKEVQEALPPRKPNSPPSPAHSSRSTTSESVKT 166
Query: 136 SYTSPPLSPAKHKYPTAATAAPLS-PLTLPRSEAKHEPGPTIRSRSPPEVEQKSIHYPKV 195
SP S K P+ +P S P +L SE + + + + + + H
Sbjct: 167 --RSPSESENHRKAPSPRVLSPYSLPASLLHSERETTQKNILTAEKTSQTHETNHHNQNH 226
Query: 196 EKPTKPDHRPSEYNS--------GKPQYKQQQQQSD----------VITIKGENVGAIMH 255
+H ++ +S G K +Q S VITI GEN GA+M
Sbjct: 227 NHDYNQNHNYNQNHSYNQNQNHQGNNPKKMHRQPSSSDSENIMSTRVITIAGENKGAVME 286
Query: 256 ITQSSDGTEM------------------VKKKPSTESGNDDEKGNK----------SSSL 292
I +S G + + + S+ S +D+ +G K +S+L
Sbjct: 287 ILRSPQGNKTGGSGTHSSRVSHGTGEKGRRLQSSSSSSSDEGEGKKKTTKNVPNKGNSNL 346
BLAST of Moc05g04890 vs. TAIR 10
Match:
AT1G63310.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G20362.1); Has 78 Blast hits to 77 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 78; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 46.6 bits (109), Expect = 4.1e-05
Identity = 30/95 (31.58%), Postives = 48/95 (50.53%), Query Frame = 0
Query: 194 SGKPQYKQQQQQSDVITIKGENVGAIMHITQSSDGTEMVKKKPSTESGNDDEKGNKSSSL 253
+G QY +++ VIT+ G N+GA M K + G+ D+ G+
Sbjct: 38 AGPSQYDEEEDGIRVITLSGSNLGATM------------KTELDNNHGDRDQNGDHELDF 97
Query: 254 PAKSFMNSNFQGVNNSVLYNSSFSHRDPGLHLSFS 289
+++NSNFQ VNNS++ + + DPG+HL S
Sbjct: 98 -LSTYVNSNFQAVNNSIMIGAKYETHDPGVHLDIS 119
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022147442.1 | 3.8e-162 | 100.00 | proline-rich receptor-like protein kinase PERK2 [Momordica charantia] | [more] |
XP_022945677.1 | 5.8e-94 | 67.56 | DNA-directed RNA polymerase II subunit 1-like [Cucurbita moschata] | [more] |
KAG6597058.1 | 7.6e-94 | 67.56 | hypothetical protein SDJN03_10238, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_023538835.1 | 1.3e-93 | 67.89 | proline-rich extensin-like protein EPR1 [Cucurbita pepo subsp. pepo] | [more] |
XP_022974814.1 | 2.2e-93 | 67.56 | DNA-directed RNA polymerase II subunit 1-like isoform X2 [Cucurbita maxima] >XP_... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1D2D2 | 1.9e-162 | 100.00 | proline-rich receptor-like protein kinase PERK2 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1G1L2 | 2.8e-94 | 67.56 | DNA-directed RNA polymerase II subunit 1-like OS=Cucurbita moschata OX=3662 GN=L... | [more] |
A0A6J1IEX0 | 1.1e-93 | 67.56 | DNA-directed RNA polymerase II subunit 1-like isoform X2 OS=Cucurbita maxima OX=... | [more] |
A0A6J1IHF4 | 1.4e-93 | 67.56 | DNA-directed RNA polymerase II subunit 1-like isoform X1 OS=Cucurbita maxima OX=... | [more] |
A0A0A0L5T3 | 1.5e-82 | 62.17 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G119290 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT2G46630.1 | 1.3e-11 | 28.70 | unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP... | [more] |
AT1G63310.1 | 4.1e-05 | 31.58 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |