Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCGAAGAAGAACAGGGGAAAGAAGAAAATAGGAGCATCTAACTCCACACCCGGTCGTCCTCAAGATTCGATCACCCTCAGACAGGAAAAGACTGGAAAAATCCAGCCCAAACCCTACAACAATGTCAAAATTTATTTGAGCCACTTGGAAAACCTAGCGACTTGGGCCAGTGGGCAGGCCTCTATACCTTCATTGGCTGCTTTCTTCGGGCGGCGCTTTGCCGCTGCTGCGGATTCTTCAGGTGTCGTTCCCGACGCCTCTCTATTTCTTTGCCAGAGGTCCGTTCGTCTTCTGCCCTTCACTTATTCTTGTGATTTTAAATGCGTTTTTCTAGTTTAAGCTCACTTGCCAGTTTTAATGTTTGTAGAGGGGTAGGTTCATGATGGTTGATTGTATTTGAGGGTAGAAGTTCGTATTTAGCTTTTCTGATTAGTGCTAAGTGACATTAATCTGATGTTAGAAACATTTAAATATGAATAATGGGCGCTTCTTATTGGGCATTATGATGTTTTGATGCAAAATATTGAGAATGAAGGGATAAGGTAATGTACTTGGAAAATTCTATCTGCTAGAGTTTATGCAAGTGTTAAGTCTTTCTGTGCTACCGGCGAGAGCTTAAGTGTAGAATTAGTATGCAGGAGATACCATCTAGCATTCATTTGACAGAAAAAAAATCATGCGAGGGCGTGGAACACCATATATTTTCTCTACTTGAGAGAGTATTAGGTTTTAGATTTTCATAATTTTTTTCCTGTATTTTGTAGCAGGGTTGGGGTGTAAGCTTGTGTGTTGTCCTTGTCTTATTTGGGCATGCCCTTGGGAGATAATCAGATAACCAAAGAGGCTAAAGTTCTGGAAACCTGTGAAGGGAAAAGTCTTACAAAGCTTGATAAATGGAGGAACTTATCCAACAGTGGAAGATTGACTTTGTGAGACGCAGTCCTAGTAAAATTACCTTCCTACTTTATGTGCATCTTTTTTATGCTGAGGAAAATTGTAGCTTTAATGGAGAGGAAAATGAGGAACTTTCTTTGGTAAGGTTCTTTAGAGGGAAAGCGTAGCTTAGCCATCTAGTTCATTGGAAGGGGGTATCCTTGGAAAATGGAGGGTTGGAATTGGGAACTTGAGGCAAAGGAATAAAGCCCAAATGGCAAAGTTGGTTGGGCGTTTCTTTAGAAGGAAAGTCGGTTTGGTGCATGGTATGGTAATTTTTTTTAAAAAATAGTGCATGGTATCTATGAGTAAAAGGAGAGAGATGCAAAGGAGGAAAAGGAGTGGAGGCACGGTAGAAGTCCTCGGTGGAACATTACTAAGTTGAAGGGTATTGATTCTCACTTTCCGGTCAAAGTTGGAGATGGAAAGAAGACTAAATTTTGGGAGGATGTGTGGGTGGATAATCAATCCTTAAGTATGAATTTTCATTTTTCCTAATTTAATGCAATGGTGGAATCCGAGTTTTGAACAATCAATAAACTTTGGATGAGCAATAAAAGAGTTGGAATATAATGCGAGGAGGAATTTGTCGGATGAAGAATTTGGAGAGTTCAAGTTCAATGCTATTTTTGAGGCTGCTAGAAAACAGACAAGTGGGTGCAGGAGTGGATGAAAGAAGCTGGAATTTGTGTAGCTCTGGCGATCTCAGTCAAATGACTCAAATCTTTGGTTTTTTAGCCTGATTGATTTTGGGAGAGCTGTAGAAAGAGAGCTTCCAACCATGGTGTGGAAAAGAATGTGTGTCAAAAAGAAAAACGTTTGAGTGAATAGCCTGCCTTAGTCGACTCAACACAGTAGAGAGGTTACGACGAAAATCGCCTACGATGGTCTTGAATTGGTGCTTGTTTTCCTTGGAAGAGTATGAAGGAATCGACCACCTTTTCTTCCATTCCTCTTTTGGCAAGAGATGATGGAAGTTGGAACTCATTACTACAAAATTTCGGTGACTTTTGGGCTTTTCATTGATCTTTGAAGGTCGACATATTCCTTCTACTATGTGGTTA
CTCTCTTAAGGACAAAGGTAAAACTAGGAAACTATGGTCAATGCAATGAAATCTGTTCTTTGGAGCTTGTGGTTGGGAAGAGACAATCAAGTGCTTAACGGCAAGGGCAAGGCAAAATACTGGATTGAAGTGGGTCTAACTAACAACTATATTCCCCTTGTAAAATGCCTTTTCATGTTAAGTATTGCTCAGGTCATCAGACTTTGCGTTTGCCTGCTTTGATTTTTTTCTTCCCACCTTTCTTCGAAAAAAATCTCAAAATTTGTACATGCCTTATGTGATAATGAAAATGACATTAGTTTAGATGCCTGTTTTGAGGACTTCTATCATATCCTGTTTAAATTTTATATTGGCATCTTGCTTGGATCTTCATTGGCTTCAAAACTAGCTGCTTACTTGCCATCATATTGGTTTCAGGTGTGAAACAATTCTCCAACCTGGCTCTAACTGCTCCATACGAATTGAAAAGAATAAAGCAAAGAGACGTCGGAAGCACAACAAATGCAGTAACTTGACACAGAACAATTTGGTGTATTATTGCCACTACTGCTCATGTAGGAACATAAAGAGAGGGACTCCCAAAGGCCATATGAAAGTGCGTTATGCACAAAAATCCAAAGCAGTTGAAGAATCAGAGCCTATAGAATCTAAAAGCAAGGTGAAAGTTTTGAATGTCAGGCGTGGTAAAGAATGTGAGGCAACTGCAGTGCAGATGACAACAGAGATCCTTACTATTGATGCTCCTATGATTCCTTCTCCAACAACTAGAGAGATTGGTACTATTGATGCTCCCATAGCTCCTCCTACAACTGGGGATACTCTTGTCGTTGGTGCTTCTGCAATTCTGCCTCCTAGAATGGAGGACATTCTTATCATCAATGCTCCTGCAACTCCTCCCACTGTGAGCGGAACGACTCTGTCAAAATCGCAGAAGAGGAAAAAGAGGAAATTAGCAGCTAAGAATCAAACTGGACCCGAAAATAGCTGTGCTCCAACTGACTCGGAGAAGAAAACTGGGGACATTCCTACTGTTGATGCTCCCGCAACTCCTCCGGCCATGATCGGAATGACTCTGCTGGAATCAAAGAAGAGGAAAAGGAAGAAACCGTCATCTAAGAATCAAACTGAACCTGAAAGTAGCTGTGCTCCAACAGCAGAAGGAGATAAAACTGAAGGCACATCCAAAAGAAAGCGGAAAAGAAAATCATGGACAAGTTTGAAGGAAATCGCCCAGACAAATGAACAGAGTGGTAAACAGAACGTGACTGATTTTCCAATTCCATTTTCCTTACAAGGCACT
mRNA sequence
ATGGCGAAGAAGAACAGGGGAAAGAAGAAAATAGGAGCATCTAACTCCACACCCGGTCGTCCTCAAGATTCGATCACCCTCAGACAGGAAAAGACTGGAAAAATCCAGCCCAAACCCTACAACAATGTCAAAATTTATTTGAGCCACTTGGAAAACCTAGCGACTTGGGCCAGTGGGCAGGCCTCTATACCTTCATTGGCTGCTTTCTTCGGGCGGCGCTTTGCCGCTGCTGCGGATTCTTCAGGTGTCGTTCCCGACGCCTCTCTATTTCTTTGCCAGAGGTGTGAAACAATTCTCCAACCTGGCTCTAACTGCTCCATACGAATTGAAAAGAATAAAGCAAAGAGACGTCGGAAGCACAACAAATGCAGTAACTTGACACAGAACAATTTGGTGTATTATTGCCACTACTGCTCATGTAGGAACATAAAGAGAGGGACTCCCAAAGGCCATATGAAAGTGCGTTATGCACAAAAATCCAAAGCAGTTGAAGAATCAGAGCCTATAGAATCTAAAAGCAAGGTGAAAGTTTTGAATGTCAGGCGTGGTAAAGAATGTGAGGCAACTGCAGTGCAGATGACAACAGAGATCCTTACTATTGATGCTCCTATGATTCCTTCTCCAACAACTAGAGAGATTGGTACTATTGATGCTCCCATAGCTCCTCCTACAACTGGGGATACTCTTGTCGTTGGTGCTTCTGCAATTCTGCCTCCTAGAATGGAGGACATTCTTATCATCAATGCTCCTGCAACTCCTCCCACTGTGAGCGGAACGACTCTGTCAAAATCGCAGAAGAGGAAAAAGAGGAAATTAGCAGCTAAGAATCAAACTGGACCCGAAAATAGCTGTGCTCCAACTGACTCGGAGAAGAAAACTGGGGACATTCCTACTGTTGATGCTCCCGCAACTCCTCCGGCCATGATCGGAATGACTCTGCTGGAATCAAAGAAGAGGAAAAGGAAGAAACCGTCATCTAAGAATCAAACTGAACCTGAAAGTAGCTGTGCTCCAACAGCAGAAGGAGATAAAACTGAAGGCACATCCAAAAGAAAGCGGAAAAGAAAATCATGGACAAGTTTGAAGGAAATCGCCCAGACAAATGAACAGAGTGGTAAACAGAACGTGACTGATTTTCCAATTCCATTTTCCTTACAAGGCACT
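The gene sequence above differs from the mRNA sequence only by the intron it carries, so the intron can be located by comparing the two strings. The following is a minimal Python sketch, assuming exactly one intron; the function name `locate_single_intron` and the short toy sequences are hypothetical and are not taken from this record.

```python
def locate_single_intron(gene, mrna):
    """Return (start, end) such that gene[:start] + gene[end:] == mrna.

    Assumes the gene differs from the mRNA by one contiguous insertion.
    """
    if len(gene) < len(mrna):
        raise ValueError("gene must be at least as long as the mRNA")
    # Extend the shared prefix as far as it goes.
    start = 0
    while start < len(mrna) and gene[start] == mrna[start]:
        start += 1
    # The rest of the mRNA must then match the tail of the gene.
    tail_len = len(mrna) - start
    end = len(gene) - tail_len
    if gene[end:] != mrna[start:]:
        raise ValueError("sequences do not differ by a single insertion")
    return start, end


# Toy sequences (hypothetical, for illustration only).
exon1, intron, exon2 = "ATGGCCAGAG", "GTCCGTTTCAG", "GTGTGAAACAA"
gene = exon1 + intron + exon2
mrna = exon1 + exon2

start, end = locate_single_intron(gene, mrna)
print(start, end, gene[start:end])
```

When the intron and the downstream exon begin with the same bases, as in the toy data, the reported interval is shifted by those bases relative to the annotated splice site. The removed segment has the correct length and yields the same mRNA, but the coordinates should not be read as exact splice junctions.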
Coding sequence (CDS)
ATGGCGAAGAAGAACAGGGGAAAGAAGAAAATAGGAGCATCTAACTCCACACCCGGTCGTCCTCAAGATTCGATCACCCTCAGACAGGAAAAGACTGGAAAAATCCAGCCCAAACCCTACAACAATGTCAAAATTTATTTGAGCCACTTGGAAAACCTAGCGACTTGGGCCAGTGGGCAGGCCTCTATACCTTCATTGGCTGCTTTCTTCGGGCGGCGCTTTGCCGCTGCTGCGGATTCTTCAGGTGTCGTTCCCGACGCCTCTCTATTTCTTTGCCAGAGGTGTGAAACAATTCTCCAACCTGGCTCTAACTGCTCCATACGAATTGAAAAGAATAAAGCAAAGAGACGTCGGAAGCACAACAAATGCAGTAACTTGACACAGAACAATTTGGTGTATTATTGCCACTACTGCTCATGTAGGAACATAAAGAGAGGGACTCCCAAAGGCCATATGAAAGTGCGTTATGCACAAAAATCCAAAGCAGTTGAAGAATCAGAGCCTATAGAATCTAAAAGCAAGGTGAAAGTTTTGAATGTCAGGCGTGGTAAAGAATGTGAGGCAACTGCAGTGCAGATGACAACAGAGATCCTTACTATTGATGCTCCTATGATTCCTTCTCCAACAACTAGAGAGATTGGTACTATTGATGCTCCCATAGCTCCTCCTACAACTGGGGATACTCTTGTCGTTGGTGCTTCTGCAATTCTGCCTCCTAGAATGGAGGACATTCTTATCATCAATGCTCCTGCAACTCCTCCCACTGTGAGCGGAACGACTCTGTCAAAATCGCAGAAGAGGAAAAAGAGGAAATTAGCAGCTAAGAATCAAACTGGACCCGAAAATAGCTGTGCTCCAACTGACTCGGAGAAGAAAACTGGGGACATTCCTACTGTTGATGCTCCCGCAACTCCTCCGGCCATGATCGGAATGACTCTGCTGGAATCAAAGAAGAGGAAAAGGAAGAAACCGTCATCTAAGAATCAAACTGAACCTGAAAGTAGCTGTGCTCCAACAGCAGAAGGAGATAAAACTGAAGGCACATCCAAAAGAAAGCGGAAAAGAAAATCATGGACAAGTTTGAAGGAAATCGCCCAGACAAATGAACAGAGTGGTAAACAGAACGTGACTGATTTTCCAATTCCATTTTCCTTACAAGGCACT
Protein sequence
MAKKNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKHNKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNVRRGKECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPRMEDILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVDAPATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWTSLKEIAQTNEQSGKQNVTDFPIPFSLQGT
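The protein sequence appears to correspond to a translation of the CDS with the standard genetic code. The sketch below illustrates that relationship on the first 30 nucleotides of the CDS listed above; `translate` is a hypothetical helper written for this example, not a function of any particular library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First ten codons of the CDS shown above.
cds_start = "ATGGCGAAGAAGAACAGGGGAAAGAAGAAA"
print(translate(cds_start))  # MAKKNRGKKK
```

The output matches the first ten residues of the protein sequence. Passing the full CDS to the same function would allow the whole protein to be checked in the same way.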
Homology
BLAST of MS012771 vs. NCBI nr
Match:
XP_022136585.1 (uncharacterized protein LOC111008256 isoform X1 [Momordica charantia])
HSP 1 Score: 745.7 bits (1924), Expect = 2.0e-211
Identity = 388/388 (100.00%), Positives = 388/388 (100.00%), Query Frame = 0
Query: 1 MAKKNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQ 60
MAKKNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQ
Sbjct: 1 MAKKNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQ 60
Query: 61 ASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKH 120
ASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKH
Sbjct: 61 ASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKH 120
Query: 121 NKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNV 180
NKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNV
Sbjct: 121 NKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNV 180
Query: 181 RRGKECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPR 240
RRGKECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPR
Sbjct: 181 RRGKECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPR 240
Query: 241 MEDILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVD 300
MEDILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVD
Sbjct: 241 MEDILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVD 300
Query: 301 APATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWTS 360
APATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWTS
Sbjct: 301 APATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWTS 360
Query: 361 LKEIAQTNEQSGKQNVTDFPIPFSLQGT 389
LKEIAQTNEQSGKQNVTDFPIPFSLQGT
Sbjct: 361 LKEIAQTNEQSGKQNVTDFPIPFSLQGT 388
BLAST of MS012771 vs. NCBI nr
Match:
XP_022136586.1 (uncharacterized protein LOC111008256 isoform X2 [Momordica charantia])
HSP 1 Score: 564.3 bits (1453), Expect = 8.1e-157
Identity = 294/294 (100.00%), Positives = 294/294 (100.00%), Query Frame = 0
Query: 95 CETILQPGSNCSIRIEKNKAKRRRKHNKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKV 154
CETILQPGSNCSIRIEKNKAKRRRKHNKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKV
Sbjct: 41 CETILQPGSNCSIRIEKNKAKRRRKHNKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKV 100
Query: 155 RYAQKSKAVEESEPIESKSKVKVLNVRRGKECEATAVQMTTEILTIDAPMIPSPTTREIG 214
RYAQKSKAVEESEPIESKSKVKVLNVRRGKECEATAVQMTTEILTIDAPMIPSPTTREIG
Sbjct: 101 RYAQKSKAVEESEPIESKSKVKVLNVRRGKECEATAVQMTTEILTIDAPMIPSPTTREIG 160
Query: 215 TIDAPIAPPTTGDTLVVGASAILPPRMEDILIINAPATPPTVSGTTLSKSQKRKKRKLAA 274
TIDAPIAPPTTGDTLVVGASAILPPRMEDILIINAPATPPTVSGTTLSKSQKRKKRKLAA
Sbjct: 161 TIDAPIAPPTTGDTLVVGASAILPPRMEDILIINAPATPPTVSGTTLSKSQKRKKRKLAA 220
Query: 275 KNQTGPENSCAPTDSEKKTGDIPTVDAPATPPAMIGMTLLESKKRKRKKPSSKNQTEPES 334
KNQTGPENSCAPTDSEKKTGDIPTVDAPATPPAMIGMTLLESKKRKRKKPSSKNQTEPES
Sbjct: 221 KNQTGPENSCAPTDSEKKTGDIPTVDAPATPPAMIGMTLLESKKRKRKKPSSKNQTEPES 280
Query: 335 SCAPTAEGDKTEGTSKRKRKRKSWTSLKEIAQTNEQSGKQNVTDFPIPFSLQGT 389
SCAPTAEGDKTEGTSKRKRKRKSWTSLKEIAQTNEQSGKQNVTDFPIPFSLQGT
Sbjct: 281 SCAPTAEGDKTEGTSKRKRKRKSWTSLKEIAQTNEQSGKQNVTDFPIPFSLQGT 334
BLAST of MS012771 vs. NCBI nr
Match:
XP_038906436.1 (uncharacterized protein LOC120092350 isoform X1 [Benincasa hispida])
HSP 1 Score: 433.3 bits (1113), Expect = 2.2e-117
Identity = 257/386 (66.58%), Positives = 279/386 (72.28%), Query Frame = 0
Query: 1 MAKKNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQ 60
MAKK +G K G+SN T G PQDSITLRQE TGKI+PK NNVK+YL+HLENLATWA GQ
Sbjct: 1 MAKK-KGNAKKGSSNPTSG-PQDSITLRQEITGKIKPKVSNNVKVYLNHLENLATWACGQ 60
Query: 61 ASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKH 120
SIPSLA FFG+R AAAA+S V PDASLFLCQRCETILQPGSNCSIRIEKN AKRRRKH
Sbjct: 61 PSIPSLATFFGQRLAAAAESLAVAPDASLFLCQRCETILQPGSNCSIRIEKNNAKRRRKH 120
Query: 121 NKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNV 180
NKCSNLTQN + YYCHYCSCRNIKRGTPKGHMKV Y E SK+K + V
Sbjct: 121 NKCSNLTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYG-----------TEFVSKLKSVGV 180
Query: 181 RRGKECEATAVQMTT-EILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPP 240
GKECE + T LTID P IP TT E ID PP TGD VV P
Sbjct: 181 EDGKECENKISPLPTGNRLTIDTPAIPPSTTGEDQNIDTRAIPP-TGDISVVDGPVFSSP 240
Query: 241 RMEDILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTV 300
R +DIL INAPATP T+S TLS+SQ K KL + QTG P E++ GDIPTV
Sbjct: 241 RTKDILNINAPATPSTLSVQTLSRSQ---KMKLLSNKQTG------PASVEERVGDIPTV 300
Query: 301 DAPATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWT 360
DAPATPP M G+TLL+SK+RKRKKPSSKNQTEPESS APT GDKT G SKRKR RKSWT
Sbjct: 301 DAPATPPTMTGITLLDSKRRKRKKPSSKNQTEPESS-APTTYGDKTVGMSKRKRNRKSWT 360
Query: 361 SLKEIAQTNEQSGKQNVTDFPIPFSL 386
SLKEIAQ +E+ GKQNV IPFSL
Sbjct: 361 SLKEIAQRDEERGKQNVAVLAIPFSL 362
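The percentages on each HSP summary line are the matching counts divided by the alignment length. The following Python sketch parses one such line and recomputes both values. The function name `parse_hsp_summary` is hypothetical, and the pattern matches the second label loosely so that spelling variants are accepted.

```python
import re

# Matches e.g. "Identity = 257/386 (66.58%), Positives = 279/386 (72.28%)".
HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_computed": round(100 * int(ident) / int(ident_len), 2),
        "positives_reported": float(pos_pct),
        "positives_computed": round(100 * int(pos) / int(pos_len), 2),
    }


summary = parse_hsp_summary(
    "Identity = 257/386 (66.58%), Positives = 279/386 (72.28%), Query Frame = 0"
)
print(summary)
```

For the Benincasa hispida hit above, the recomputed values agree with the reported 66.58% and 72.28%.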
BLAST of MS012771 vs. NCBI nr
Match:
KAG6577189.1 (hypothetical protein SDJN03_24763, partial [Cucurbita argyrosperma subsp. sororia] >KAG7015188.1 hypothetical protein SDJN02_22821, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 427.2 bits (1097), Expect = 1.5e-115
Identity = 246/383 (64.23%), Positives = 277/383 (72.32%), Query Frame = 0
Query: 4 KNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQASI 63
K +G K GASN T G PQDSIT+RQE TGK +PK NNVK YL+HLENLATWASG+ASI
Sbjct: 3 KRKGNTKKGASNPTSG-PQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKASI 62
Query: 64 PSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKHNKC 123
PSLAAFFG+R A AA+S V PDASLF CQRCETILQPGSNCSIRIEKN AKRRR+ KC
Sbjct: 63 PSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNAKRRRRQKKC 122
Query: 124 SNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNVRRG 183
N TQNN+ YYCH+CSCRNIKRGTPKGHMKV Y A E +VK ++V+ G
Sbjct: 123 CNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLY---DAAFER--------RVKPVDVKDG 182
Query: 184 KECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPRMED 243
KECE +AV+ TEILTIDAP IP DA PP TGD + AI P+ E
Sbjct: 183 KECETSAVERPTEILTIDAPKIP----------DASAIPPPTGDITALDNPAIQLPKTEG 242
Query: 244 ILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVDAPA 303
IL IN+PATP VS TTLSK QK K ++ G E TD EKKTG +PTVD PA
Sbjct: 243 ILNINSPATPSAVSITTLSKPQKWKMTTTLSEKHIGHETR---TDREKKTGAVPTVDTPA 302
Query: 304 TPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWTSLKE 363
TP G+TLL+SKKRKR KPSSKNQTEP S APTA+GD++EGTSKR RKRKSWTSLKE
Sbjct: 303 TPSTSTGVTLLDSKKRKRNKPSSKNQTEPGSCSAPTADGDRSEGTSKRNRKRKSWTSLKE 360
Query: 364 IAQTNEQSGKQ-NVTDFPIPFSL 386
+A+TNEQSGKQ N+ + IPFSL
Sbjct: 363 VARTNEQSGKQKNMAELAIPFSL 360
BLAST of MS012771 vs. NCBI nr
Match:
XP_022931505.1 (uncharacterized protein LOC111437660 [Cucurbita moschata])
HSP 1 Score: 424.1 bits (1089), Expect = 1.3e-114
Identity = 246/383 (64.23%), Positives = 275/383 (71.80%), Query Frame = 0
Query: 4 KNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQASI 63
K +G K GASN T G PQDSIT+RQE TGK +PK NNVK YL+HLENLATWASG+ASI
Sbjct: 3 KRKGNTKKGASNPTSG-PQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKASI 62
Query: 64 PSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKHNKC 123
PSLAAFFG+R A AA+S V PDASLF CQRCETILQPGSNCSIRIEKN AKRRR+ KC
Sbjct: 63 PSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNAKRRRRQKKC 122
Query: 124 SNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNVRRG 183
N TQNN+ YYCH+CSCRNIKRGTPKGHMKV Y A E +VK L+V+ G
Sbjct: 123 CNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLY---DAAFER--------RVKPLDVKDG 182
Query: 184 KECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPRMED 243
KECE +AV+ TEILTIDAP IP DA PP TGD + AI P E
Sbjct: 183 KECETSAVERPTEILTIDAPKIP----------DASAIPPPTGDITALDNPAIQLPETEG 242
Query: 244 ILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVDAPA 303
IL IN+PA P TVS TTLSK QK K ++ G E TD EKKTG +PTVD PA
Sbjct: 243 ILNINSPAAPSTVSITTLSKPQKWKMTTTLSEKHIGHETR---TDKEKKTGAVPTVDTPA 302
Query: 304 TPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWTSLKE 363
TP G+TLL+SKKRKR KPSSKNQTE S APTA+GD++EGTSKR RKRKSWTSLKE
Sbjct: 303 TPSTSTGVTLLDSKKRKRNKPSSKNQTELGSCSAPTADGDRSEGTSKRNRKRKSWTSLKE 360
Query: 364 IAQTNEQSGKQ-NVTDFPIPFSL 386
+A+TNEQSGKQ N+ + IPFSL
Sbjct: 363 VARTNEQSGKQKNMAELAIPFSL 360
BLAST of MS012771 vs. ExPASy TrEMBL
Match:
A0A6J1C4C1 (uncharacterized protein LOC111008256 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111008256 PE=4 SV=1)
HSP 1 Score: 745.7 bits (1924), Expect = 9.5e-212
Identity = 388/388 (100.00%), Positives = 388/388 (100.00%), Query Frame = 0
Query: 1 MAKKNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQ 60
MAKKNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQ
Sbjct: 1 MAKKNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQ 60
Query: 61 ASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKH 120
ASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKH
Sbjct: 61 ASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKH 120
Query: 121 NKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNV 180
NKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNV
Sbjct: 121 NKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNV 180
Query: 181 RRGKECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPR 240
RRGKECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPR
Sbjct: 181 RRGKECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPR 240
Query: 241 MEDILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVD 300
MEDILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVD
Sbjct: 241 MEDILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVD 300
Query: 301 APATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWTS 360
APATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWTS
Sbjct: 301 APATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWTS 360
Query: 361 LKEIAQTNEQSGKQNVTDFPIPFSLQGT 389
LKEIAQTNEQSGKQNVTDFPIPFSLQGT
Sbjct: 361 LKEIAQTNEQSGKQNVTDFPIPFSLQGT 388
BLAST of MS012771 vs. ExPASy TrEMBL
Match:
A0A6J1C3W9 (uncharacterized protein LOC111008256 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111008256 PE=4 SV=1)
HSP 1 Score: 564.3 bits (1453), Expect = 3.9e-157
Identity = 294/294 (100.00%), Positives = 294/294 (100.00%), Query Frame = 0
Query: 95 CETILQPGSNCSIRIEKNKAKRRRKHNKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKV 154
CETILQPGSNCSIRIEKNKAKRRRKHNKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKV
Sbjct: 41 CETILQPGSNCSIRIEKNKAKRRRKHNKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKV 100
Query: 155 RYAQKSKAVEESEPIESKSKVKVLNVRRGKECEATAVQMTTEILTIDAPMIPSPTTREIG 214
RYAQKSKAVEESEPIESKSKVKVLNVRRGKECEATAVQMTTEILTIDAPMIPSPTTREIG
Sbjct: 101 RYAQKSKAVEESEPIESKSKVKVLNVRRGKECEATAVQMTTEILTIDAPMIPSPTTREIG 160
Query: 215 TIDAPIAPPTTGDTLVVGASAILPPRMEDILIINAPATPPTVSGTTLSKSQKRKKRKLAA 274
TIDAPIAPPTTGDTLVVGASAILPPRMEDILIINAPATPPTVSGTTLSKSQKRKKRKLAA
Sbjct: 161 TIDAPIAPPTTGDTLVVGASAILPPRMEDILIINAPATPPTVSGTTLSKSQKRKKRKLAA 220
Query: 275 KNQTGPENSCAPTDSEKKTGDIPTVDAPATPPAMIGMTLLESKKRKRKKPSSKNQTEPES 334
KNQTGPENSCAPTDSEKKTGDIPTVDAPATPPAMIGMTLLESKKRKRKKPSSKNQTEPES
Sbjct: 221 KNQTGPENSCAPTDSEKKTGDIPTVDAPATPPAMIGMTLLESKKRKRKKPSSKNQTEPES 280
Query: 335 SCAPTAEGDKTEGTSKRKRKRKSWTSLKEIAQTNEQSGKQNVTDFPIPFSLQGT 389
SCAPTAEGDKTEGTSKRKRKRKSWTSLKEIAQTNEQSGKQNVTDFPIPFSLQGT
Sbjct: 281 SCAPTAEGDKTEGTSKRKRKRKSWTSLKEIAQTNEQSGKQNVTDFPIPFSLQGT 334
BLAST of MS012771 vs. ExPASy TrEMBL
Match:
A0A6J1EZL5 (uncharacterized protein LOC111437660 OS=Cucurbita moschata OX=3662 GN=LOC111437660 PE=4 SV=1)
HSP 1 Score: 424.1 bits (1089), Expect = 6.3e-115
Identity = 246/383 (64.23%), Positives = 275/383 (71.80%), Query Frame = 0
Query: 4 KNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQASI 63
K +G K GASN T G PQDSIT+RQE TGK +PK NNVK YL+HLENLATWASG+ASI
Sbjct: 3 KRKGNTKKGASNPTSG-PQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKASI 62
Query: 64 PSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKHNKC 123
PSLAAFFG+R A AA+S V PDASLF CQRCETILQPGSNCSIRIEKN AKRRR+ KC
Sbjct: 63 PSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNAKRRRRQKKC 122
Query: 124 SNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNVRRG 183
N TQNN+ YYCH+CSCRNIKRGTPKGHMKV Y A E +VK L+V+ G
Sbjct: 123 CNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLY---DAAFER--------RVKPLDVKDG 182
Query: 184 KECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPRMED 243
KECE +AV+ TEILTIDAP IP DA PP TGD + AI P E
Sbjct: 183 KECETSAVERPTEILTIDAPKIP----------DASAIPPPTGDITALDNPAIQLPETEG 242
Query: 244 ILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVDAPA 303
IL IN+PA P TVS TTLSK QK K ++ G E TD EKKTG +PTVD PA
Sbjct: 243 ILNINSPAAPSTVSITTLSKPQKWKMTTTLSEKHIGHETR---TDKEKKTGAVPTVDTPA 302
Query: 304 TPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWTSLKE 363
TP G+TLL+SKKRKR KPSSKNQTE S APTA+GD++EGTSKR RKRKSWTSLKE
Sbjct: 303 TPSTSTGVTLLDSKKRKRNKPSSKNQTELGSCSAPTADGDRSEGTSKRNRKRKSWTSLKE 360
Query: 364 IAQTNEQSGKQ-NVTDFPIPFSL 386
+A+TNEQSGKQ N+ + IPFSL
Sbjct: 363 VARTNEQSGKQKNMAELAIPFSL 360
BLAST of MS012771 vs. ExPASy TrEMBL
Match:
A0A6J1J5I6 (uncharacterized protein LOC111482797 OS=Cucurbita maxima OX=3661 GN=LOC111482797 PE=4 SV=1)
HSP 1 Score: 416.4 bits (1069), Expect = 1.3e-112
Identity = 246/383 (64.23%), Positives = 281/383 (73.37%), Query Frame = 0
Query: 4 KNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQASI 63
K +G K GASN T G PQDSIT+RQE TGK +PK NNVK YL+HLENLATWASG+ASI
Sbjct: 3 KRKGNTKKGASNPTSG-PQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKASI 62
Query: 64 PSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKHNKC 123
PSLAAFFG+R A AA+S V PDASLF CQRCETILQPGSNCSIRIEKN AKRRR+ KC
Sbjct: 63 PSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNAKRRRRQKKC 122
Query: 124 SNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNVRRG 183
SN QNN+ YYCH+CSCRNIKRGTPKGHMKV Y A E +VK ++V+ G
Sbjct: 123 SNSRQNNVAYYCHHCSCRNIKRGTPKGHMKVLY---DAAFER--------RVKPVDVKDG 182
Query: 184 KECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPRMED 243
KECE +AV+ TEILTIDAP IP DA PP TGD + AI + +
Sbjct: 183 KECETSAVERPTEILTIDAPKIP----------DASAIPP-TGDITALDNPAI-QLQTKG 242
Query: 244 ILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVDAPA 303
IL IN+PATP T+S TTL KSQKR+ L+ K+ G + TD EKKTG +PTVDAPA
Sbjct: 243 ILNINSPATPSTLSVTTLLKSQKREMTTLSEKH-IGHD---IRTDEEKKTGAVPTVDAPA 302
Query: 304 TPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKRKSWTSLKE 363
TP G+TLL+SKKRKR KPSSKNQTEP S APTA+GD++EGTSKR RKRKSWTSLKE
Sbjct: 303 TPSTSTGVTLLDSKKRKRNKPSSKNQTEPRSCSAPTADGDRSEGTSKRNRKRKSWTSLKE 357
Query: 364 IAQTNEQSGKQ-NVTDFPIPFSL 386
+A+TNEQSGKQ N+ + IPFSL
Sbjct: 363 VARTNEQSGKQKNMAELAIPFSL 357
BLAST of MS012771 vs. ExPASy TrEMBL
Match:
A0A5D3CYJ3 (Rpr2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004600 PE=4 SV=1)
HSP 1 Score: 374.0 bits (959), Expect = 7.5e-100
Identity = 223/392 (56.89%), Positives = 261/392 (66.58%), Query Frame = 0
Query: 1 MAKKNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQ 60
MA+K +G K G+SN T G PQ+SITLRQE TGKI+PK NN K+YL+HLENLATWASGQ
Sbjct: 1 MARK-KGNTKRGSSNPTSG-PQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQ 60
Query: 61 ASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKH 120
S+PSLAAFFG+R AAAA+S V PD SLFLC RCET+LQPGSNC IRIEKN AK+RR+H
Sbjct: 61 PSLPSLAAFFGQRLAAAAESLAVAPDPSLFLCARCETVLQPGSNCYIRIEKNNAKKRRRH 120
Query: 121 NKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNV 180
K SN+TQN + YYCHYCSCRNIKRGTPKGHMKV Y E SKVK + V
Sbjct: 121 KKASNVTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYG-----------TECVSKVKSVVV 180
Query: 181 RRGKECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAP--PTTGDTLVVGASAILP 240
+ GKECE +ILT+DAP P TT + TID P P TT D + V +A+
Sbjct: 181 KDGKECE-------NKILTVDAPTTPPLTTVDCLTIDTPAIPSLSTTRDDVAVDTTAV-- 240
Query: 241 PRMEDILI-----INAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKT 300
P EDI + I++P T P + P S + S +
Sbjct: 241 PPTEDISVDDGPAISSPRTTPAI-----------------------PSTSSVTSMSRSQV 300
Query: 301 GDIPTVDAPATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKR 360
DIPT+DAPATP + MTLL+SK+RKRKKPSSKN+TEPES APT+ G+K+E TSKRKR
Sbjct: 301 RDIPTLDAPATPLTLTAMTLLDSKRRKRKKPSSKNRTEPESCSAPTSHGEKSEDTSKRKR 347
Query: 361 KRKSWTSLKEIAQTNEQSGKQNVTDFPIPFSL 386
RKSWTSLKEIAQ E+ GKQNV IPFSL
Sbjct: 361 NRKSWTSLKEIAQREEEKGKQNVAGLAIPFSL 347
BLAST of MS012771 vs. TAIR 10
Match:
AT5G41270.1 (CONTAINS InterPro DOMAIN/s: RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 128.3 bits (321), Expect = 1.4e-29
Identity = 110/343 (32.07%), Positives = 144/343 (41.98%), Query Frame = 0
Query: 49 HLENLATWAS-GQASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSI 108
HL+NLA W+S G IPSLA+ GRR AA +S+G+ D L CQRCETIL+PG NC++
Sbjct: 28 HLKNLALWSSTGDTPIPSLASLLGRRLAADTESTGITTDPDLVSCQRCETILKPGFNCNV 87
Query: 109 RIEKNKAKRRRKHNKCSN-----LTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKA 168
RIEK A ++K N+C QNN+VY+C++CS RN+KRGT KG MK Y K K
Sbjct: 88 RIEKVSANVKKKRNRCKKSNNICFPQNNVVYHCNFCSHRNLKRGTAKGQMKELYPFKPKT 147
Query: 169 VEESEPIESKSKVKVLNVRRGKECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAP 228
S P K+K +MT
Sbjct: 148 ARSSRP-----KIK--------------KEMT---------------------------- 207
Query: 229 PTTGDTLVVGASAILPPRMEDILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPEN 288
P + LS PE
Sbjct: 208 -----------------------------MPQEIQSNMLS----------------SPER 258
Query: 289 SCAPTDSEKKTGDIPTVDAPATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEG 348
S EK GD P M L + R+ +KP SK +EP+S
Sbjct: 268 SVKDQVEEKSVGDTPK-----------PMMLTLERDRRIRKPKSKKPSEPQS------VP 258
Query: 349 DKTEGTSKRKRKRKSWTSLKEIAQTNEQSGKQNVTDFPIPFSL 386
+KT G S +++++ WTS+KEIA+TN+ S N F IPF L
Sbjct: 328 EKTVGGSNKRKRKSPWTSMKEIAETNKSSKAGN---FKIPFLL 258
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022136585.1 | 2.0e-211 | 100.00 | uncharacterized protein LOC111008256 isoform X1 [Momordica charantia]
XP_022136586.1 | 8.1e-157 | 100.00 | uncharacterized protein LOC111008256 isoform X2 [Momordica charantia]
XP_038906436.1 | 2.2e-117 | 66.58 | uncharacterized protein LOC120092350 isoform X1 [Benincasa hispida]
KAG6577189.1 | 1.5e-115 | 64.23 | hypothetical protein SDJN03_24763, partial [Cucurbita argyrosperma subsp. sorori...
XP_022931505.1 | 1.3e-114 | 64.23 | uncharacterized protein LOC111437660 [Cucurbita moschata]
Match Name | E-value | Identity (%) | Description
A0A6J1C4C1 | 9.5e-212 | 100.00 | uncharacterized protein LOC111008256 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1C3W9 | 3.9e-157 | 100.00 | uncharacterized protein LOC111008256 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1EZL5 | 6.3e-115 | 64.23 | uncharacterized protein LOC111437660 OS=Cucurbita moschata OX=3662 GN=LOC1114376...
A0A6J1J5I6 | 1.3e-112 | 64.23 | uncharacterized protein LOC111482797 OS=Cucurbita maxima OX=3661 GN=LOC111482797...
A0A5D3CYJ3 | 7.5e-100 | 56.89 | Rpr2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
Match Name | E-value | Identity (%) | Description
AT5G41270.1 | 1.4e-29 | 32.07 | CONTAINS InterPro DOMAIN/s: RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175); Ha...
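The pipe-delimited summary rows can be read programmatically, for example to keep only hits below an E-value cutoff. The sketch below is one way to do this in Python. The helper `parse_row` and the cutoff of 1e-100 are assumptions made for illustration, and the three rows are copied from the tables above with their truncated descriptions left as they appear.

```python
def parse_row(line):
    """Split one summary row into name, E-value, identity and description."""
    fields = [field.strip() for field in line.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),
        "identity": float(identity),
        "description": description,
    }


rows = [
    "XP_022136585.1 | 2.0e-211 | 100.00 | uncharacterized protein LOC111008256 isoform X1 [Momordica charantia]",
    "XP_038906436.1 | 2.2e-117 | 66.58 | uncharacterized protein LOC120092350 isoform X1 [Benincasa hispida]",
    "AT5G41270.1 | 1.4e-29 | 32.07 | CONTAINS InterPro DOMAIN/s: RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175); Ha...",
]

hits = [parse_row(row) for row in rows]
# Illustrative cutoff only; choose a threshold suited to the analysis.
strong = [hit["name"] for hit in hits if hit["evalue"] < 1e-100]
print(strong)
```

Only the first four fields are used, so any trailing columns in a row are ignored.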