MS012374 (gene) Bitter gourd (TR) v1

Overview
Name: MS012374
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: histone acetyltransferase KAT6B-like
Location: scaffold63: 275064 .. 278591 (+)
RNA-Seq Expression: MS012374
Synteny: MS012374
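The location field above can be split into scaffold, coordinates and strand with a few lines of Python. This is a minimal sketch: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state.

```python
import re

def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into its four parts."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("scaffold63: 275064 .. 278591 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, strand, span)
```

If the inclusive-coordinate assumption holds, `span` should equal the length of the gene sequence (with intron) listed under Sequences.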
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGTGCCTTCTAACAAGTCGTCTTCTCCGTCGATGGTAGCCGGAAGAACGAGCCCTAATTCTCGAAATTCCGAAGTCGGCAACGCCGTCCGCCGGAGCTTCTCCGGCAACCCGTTCACGAGGCCGTCGATCGTCGCCAATCCGAGAAGCCTCAATCCCGTCACTCCGGCTAATAGTCCCTCGGGTTTGTTCCTCTTCTCTTCCTATTTAATTGCTTTATTGAGATTTTCGCATGGTTTAAGTCTGTTTGGTTCCCGAGAAAATGGCTGAGAAAATTTTAGTAGCGTTTGAACGGCTACCAAGTCAAAAATCTATACTCATTGGCATAATGTCTCTCTTCTTCGCTACGATTTTGATCGATCTTTTGATTCCTCCACCGACTGTTTCTTCCGTTCTTTGAACGATCTAGACAGCTTTAAATCTGTTTGGTTGCCAAGAAAGTGGTTGAGAAAATTTAAACATTTTCGTTATGATTTTGATGGTTATTTCGATTCTTCCACCGATTTCTTCCCCGTTGTTTTCCTTTAAATCGTTTCTGCGATCTGCAATTGATTTTCTTATTATGTACTGATTTTTCAGATTATCCGCGAAGGAATTCTGTAAGCAGAGAAATTCTATTTACTTCTCGTGATAACGAGGAGAAAGAAAACGGGAAAGATCAGAACCCGAAACCCATCCGAGTTCGTTCACCGACGGTCGGGAAATCGTCGAAGCACTTCATGTCTCCGACGATCTCCGCCGCCTCCAAGATTGCTGTTTCCCCGAAGAAAAAGATTTTGGGTGATCGCAACGAGTCAGTCCGGTCGTCTCTTTCATTTTCCGGCACGAAAAGCTCTTCACTCAACTCGGTGAATCCAAACCCAGAGGCAGCAGCAACAGCAGTTGAATCTGATACAAATCCTGAAATCGTTCCGATTTCAAATTCCTCCATTGCAGCGCCAACTCCCAAATCATCAAAAACCGTGAGATTCGCTGGTTTTGAGGTCATTTCTGGTTCGTATGATGATGCAGAATCCACATACAGGTACGATACGAACACAGAGGTGGTGACAGTGGCAGTTGAAACCGATTTAAAACCAGAAATCGCTCCGATTTCTAGTTCAAAAACTGTGAGATTTTCTGATGTTGAGGTAATCTCTGATTTGAAGAACAATTTTGAGTCTCCGGCTAAGAATATTTTTACAGAAGAATTGGATTGTGTCAATCTCGATCCTAGTTTTAAAATCAGTCCTGTTTCTTCTCCAATGGTAGCACCTCTCGATGCAGATCCATTAATGCCTCCTTATGATCCCAAAACTAATTATTTATCACCAAGGCCACAGTTCCTCCATTACAGGCCCAACCGAAGAATTGAGCGAGGCAACAGACTTGAGGAGTTCTTTTCCTCTGTCAATGCTTCCGAGTCCGAATTCGCCGAGGAAACTGAGTCTGAGAATTCGCTGAAGGAATCTGATGAATCTTCTTCCAATGAATCAGAGCAGGGAGAGGAAGAAGTGGAAGAAAAAGATGAAGAAAGGATTCATGTTTCTGAACAGAAGGAAACTATTGAAGTTAAGAAGTCAAGTGGGATGTTCAAGATAAGTTCTCTGCTTTTTATCCTTTCAATCACTTTCTTTTCGATATGTGCTGTGGTTCGTGATCCAAATATCTCAGAAAGATCAAGCTTATTAATGGTGGAGCATCCATCTGAAATTTACGAGTTTGCGAAGATGAATTTCAATGTGTTGGTTGGAAAACTTGAGGTTTGGCATGCGGATTCCATATCTTCTATTTTTCATGTGGTTTCCAACTTCAGAGGAGGAGCGCCATCATTGATTTATCTTAACCAAACCGAGTTCCTCTACAAGGATGTGGATGGGCAATCTCTCGTAACTCATCAGACTTTGTGGGAAAAAGAAATTTTTTTGAATGTAATCGAAGAAGGCGCCACGAAGGAAAGAAGAATTGAATATGCTGAGACGGAAGATCATGGCGTGGAGGAGGAAGAAGAATCATTG
CAAGAGATTGAAGCCATTAAGGAGATAGAAGCTACGAATGAAGAAGAAGAAGAACAATTGTTGCAGGAGATTGAAAGCAAAACTAGTTATCCAGAAGTTGGTGAAGAGAATGACGAGATTTCTGCAAAATCAGCTTCTGAAAACATCGATGAAGAAGATGCCCAAGAGAAAGAAACCGAAGAGAATTATGCAGTATCATCAGCTGATTTTGAAATTCTTGATCAAATTGAACCATCTGCTTCTAATAAAATCGACGAAGACGACGTCCACGAGAACGAAATTGAAGAGAATTATGAAGGATCATCAGCTGATTTTGAAATTCTTGATCAAATTGAGCTATCTGCTTCTGATGAAATCGACGAAGACGACGTCCACGAGAACGAAAGGAAAGAGAATCATGAAGCATCATCGGCTGATTTTGAAACCCTAGATCAAGTCGGACCAGAAGCTGAAACAGGGGAAACAGAGGAAGAAAACAACGATTCGCTACTGCAACAACAAAGCAACACAGCTCCAGTTTTTTCCCCTTTTGCTGAACCTCAATCTGATTTTCAGTCTTCAAATGGCAGCAACATCAAGATTTTACAGGGAATCTCAGGAGATTTTACACATAACATGATGTTCGCGTTGCTGCTATCTCTAATTATACCTGCAGGATTTATTTATGCAAAAAAATCAGGCTCAAAACCAACCATTGCGGCGGTGGAAGAGCAAAAGCAATCATTGATGAAGGAGGAGGAGAAGACGAACCACAGTCCGGAGGAAGAAGAAGAAGCGGCAGATGATGAGCATGATGATGATATGACTGGAGAATCCTGCTCTTCTGAAACGAGCAGTTTCCAGTACAGCAGCATGAGAGGAGCAGCCGGAGCAGTGAAAGAACCGAGTGAAGCTCATAGCCATAGCCATGGGAGGAAGAAGAGGAAGAATTCAAGAAGAGAATCTCTGGCTTCTTCCTCGGATGAAATTTCCATTTCTGCTTCTCCTTCTCCATCTTATGGGAGTTTCACTACCTACGAGAAAATCAAGCATCATGTGAGTATCCCAAAACTGAAATTCTTTTGGGTTTGATTTTTCTTTTTTTAATATTTTTTTCCCTATGTGTTTAGAAAAAATCTATTCTAAAAAATTATAAGTTTGTTTTGCAAGAGAAGTAACAATTGTAGAAAACTCCCCAATGTTTTGTTATTTCCATGTATTTTTCCTTAATCATATATTATTGATTTAAATTGTGTTTTTAAATATATTATCTGCTTTTATTATGGTCTAATAAAAATAATTTTGTGGAGTTTTTTGTAAAAAAAAAAGGTTTTTCTAAATCAGTTTAAATAAAAATTAGGGTCATAGTTGTGAAATATGACGAGTCCTTAAATTAAAAAATACATAAAAAAAAAGTTTAAAAATTGTAATGTTTTTATCTCTCGAGTTTAACATTTTTGTTTTCTTTTATTTACAGGGAAATGGAGAAGAAGAGATTATGACTCCAGTTAGACGCTCTAATAGAATTAGAAAGCAACATAATAATAGT

mRNA sequence

ATGGCGGTGCCTTCTAACAAGTCGTCTTCTCCGTCGATGGTAGCCGGAAGAACGAGCCCTAATTCTCGAAATTCCGAAGTCGGCAACGCCGTCCGCCGGAGCTTCTCCGGCAACCCGTTCACGAGGCCGTCGATCGTCGCCAATCCGAGAAGCCTCAATCCCGTCACTCCGGCTAATAGTCCCTCGGATTATCCGCGAAGGAATTCTGTAAGCAGAGAAATTCTATTTACTTCTCGTGATAACGAGGAGAAAGAAAACGGGAAAGATCAGAACCCGAAACCCATCCGAGTTCGTTCACCGACGGTCGGGAAATCGTCGAAGCACTTCATGTCTCCGACGATCTCCGCCGCCTCCAAGATTGCTGTTTCCCCGAAGAAAAAGATTTTGGGTGATCGCAACGAGTCAGTCCGGTCGTCTCTTTCATTTTCCGGCACGAAAAGCTCTTCACTCAACTCGGTGAATCCAAACCCAGAGGCAGCAGCAACAGCAGTTGAATCTGATACAAATCCTGAAATCGTTCCGATTTCAAATTCCTCCATTGCAGCGCCAACTCCCAAATCATCAAAAACCGTGAGATTCGCTGGTTTTGAGGTCATTTCTGGTTCGTATGATGATGCAGAATCCACATACAGGTACGATACGAACACAGAGGTGGTGACAGTGGCAGTTGAAACCGATTTAAAACCAGAAATCGCTCCGATTTCTAGTTCAAAAACTGTGAGATTTTCTGATGTTGAGGTAATCTCTGATTTGAAGAACAATTTTGAGTCTCCGGCTAAGAATATTTTTACAGAAGAATTGGATTGTGTCAATCTCGATCCTAGTTTTAAAATCAGTCCTGTTTCTTCTCCAATGGTAGCACCTCTCGATGCAGATCCATTAATGCCTCCTTATGATCCCAAAACTAATTATTTATCACCAAGGCCACAGTTCCTCCATTACAGGCCCAACCGAAGAATTGAGCGAGGCAACAGACTTGAGGAGTTCTTTTCCTCTGTCAATGCTTCCGAGTCCGAATTCGCCGAGGAAACTGAGTCTGAGAATTCGCTGAAGGAATCTGATGAATCTTCTTCCAATGAATCAGAGCAGGGAGAGGAAGAAGTGGAAGAAAAAGATGAAGAAAGGATTCATGTTTCTGAACAGAAGGAAACTATTGAAGTTAAGAAGTCAAGTGGGATGTTCAAGATAAGTTCTCTGCTTTTTATCCTTTCAATCACTTTCTTTTCGATATGTGCTGTGGTTCGTGATCCAAATATCTCAGAAAGATCAAGCTTATTAATGGTGGAGCATCCATCTGAAATTTACGAGTTTGCGAAGATGAATTTCAATGTGTTGGTTGGAAAACTTGAGGTTTGGCATGCGGATTCCATATCTTCTATTTTTCATGTGGTTTCCAACTTCAGAGGAGGAGCGCCATCATTGATTTATCTTAACCAAACCGAGTTCCTCTACAAGGATGTGGATGGGCAATCTCTCGTAACTCATCAGACTTTGTGGGAAAAAGAAATTTTTTTGAATGTAATCGAAGAAGGCGCCACGAAGGAAAGAAGAATTGAATATGCTGAGACGGAAGATCATGGCGTGGAGGAGGAAGAAGAATCATTGCAAGAGATTGAAGCCATTAAGGAGATAGAAGCTACGAATGAAGAAGAAGAAGAACAATTGTTGCAGGAGATTGAAAGCAAAACTAGTTATCCAGAAGTTGGTGAAGAGAATGACGAGATTTCTGCAAAATCAGCTTCTGAAAACATCGATGAAGAAGATGCCCAAGAGAAAGAAACCGAAGAGAATTATGCAGTATCATCAGCTGATTTTGAAATTCTTGATCAAATTGAACCATCTGCTTCTAATAAAATCGACGAAGACGACGTCCACGAGAACGAAATTGAAGAGAATTATGAAGGATCATCAGCTGATTTTGAAATTCTTGATCAAATTGAGCTATCTGCTTCTGATGAAATCGACGAAGACGACGTCCACGAGAACGAAAGGAAAGAGAA
TCATGAAGCATCATCGGCTGATTTTGAAACCCTAGATCAAGTCGGACCAGAAGCTGAAACAGGGGAAACAGAGGAAGAAAACAACGATTCGCTACTGCAACAACAAAGCAACACAGCTCCAGTTTTTTCCCCTTTTGCTGAACCTCAATCTGATTTTCAGTCTTCAAATGGCAGCAACATCAAGATTTTACAGGGAATCTCAGGAGATTTTACACATAACATGATGTTCGCGTTGCTGCTATCTCTAATTATACCTGCAGGATTTATTTATGCAAAAAAATCAGGCTCAAAACCAACCATTGCGGCGGTGGAAGAGCAAAAGCAATCATTGATGAAGGAGGAGGAGAAGACGAACCACAGTCCGGAGGAAGAAGAAGAAGCGGCAGATGATGAGCATGATGATGATATGACTGGAGAATCCTGCTCTTCTGAAACGAGCAGTTTCCAGTACAGCAGCATGAGAGGAGCAGCCGGAGCAGTGAAAGAACCGAGTGAAGCTCATAGCCATAGCCATGGGAGGAAGAAGAGGAAGAATTCAAGAAGAGAATCTCTGGCTTCTTCCTCGGATGAAATTTCCATTTCTGCTTCTCCTTCTCCATCTTATGGGAGTTTCACTACCTACGAGAAAATCAAGCATCATGGAAATGGAGAAGAAGAGATTATGACTCCAGTTAGACGCTCTAATAGAATTAGAAAGCAACATAATAATAGT

Coding sequence (CDS)

ATGGCGGTGCCTTCTAACAAGTCGTCTTCTCCGTCGATGGTAGCCGGAAGAACGAGCCCTAATTCTCGAAATTCCGAAGTCGGCAACGCCGTCCGCCGGAGCTTCTCCGGCAACCCGTTCACGAGGCCGTCGATCGTCGCCAATCCGAGAAGCCTCAATCCCGTCACTCCGGCTAATAGTCCCTCGGATTATCCGCGAAGGAATTCTGTAAGCAGAGAAATTCTATTTACTTCTCGTGATAACGAGGAGAAAGAAAACGGGAAAGATCAGAACCCGAAACCCATCCGAGTTCGTTCACCGACGGTCGGGAAATCGTCGAAGCACTTCATGTCTCCGACGATCTCCGCCGCCTCCAAGATTGCTGTTTCCCCGAAGAAAAAGATTTTGGGTGATCGCAACGAGTCAGTCCGGTCGTCTCTTTCATTTTCCGGCACGAAAAGCTCTTCACTCAACTCGGTGAATCCAAACCCAGAGGCAGCAGCAACAGCAGTTGAATCTGATACAAATCCTGAAATCGTTCCGATTTCAAATTCCTCCATTGCAGCGCCAACTCCCAAATCATCAAAAACCGTGAGATTCGCTGGTTTTGAGGTCATTTCTGGTTCGTATGATGATGCAGAATCCACATACAGGTACGATACGAACACAGAGGTGGTGACAGTGGCAGTTGAAACCGATTTAAAACCAGAAATCGCTCCGATTTCTAGTTCAAAAACTGTGAGATTTTCTGATGTTGAGGTAATCTCTGATTTGAAGAACAATTTTGAGTCTCCGGCTAAGAATATTTTTACAGAAGAATTGGATTGTGTCAATCTCGATCCTAGTTTTAAAATCAGTCCTGTTTCTTCTCCAATGGTAGCACCTCTCGATGCAGATCCATTAATGCCTCCTTATGATCCCAAAACTAATTATTTATCACCAAGGCCACAGTTCCTCCATTACAGGCCCAACCGAAGAATTGAGCGAGGCAACAGACTTGAGGAGTTCTTTTCCTCTGTCAATGCTTCCGAGTCCGAATTCGCCGAGGAAACTGAGTCTGAGAATTCGCTGAAGGAATCTGATGAATCTTCTTCCAATGAATCAGAGCAGGGAGAGGAAGAAGTGGAAGAAAAAGATGAAGAAAGGATTCATGTTTCTGAACAGAAGGAAACTATTGAAGTTAAGAAGTCAAGTGGGATGTTCAAGATAAGTTCTCTGCTTTTTATCCTTTCAATCACTTTCTTTTCGATATGTGCTGTGGTTCGTGATCCAAATATCTCAGAAAGATCAAGCTTATTAATGGTGGAGCATCCATCTGAAATTTACGAGTTTGCGAAGATGAATTTCAATGTGTTGGTTGGAAAACTTGAGGTTTGGCATGCGGATTCCATATCTTCTATTTTTCATGTGGTTTCCAACTTCAGAGGAGGAGCGCCATCATTGATTTATCTTAACCAAACCGAGTTCCTCTACAAGGATGTGGATGGGCAATCTCTCGTAACTCATCAGACTTTGTGGGAAAAAGAAATTTTTTTGAATGTAATCGAAGAAGGCGCCACGAAGGAAAGAAGAATTGAATATGCTGAGACGGAAGATCATGGCGTGGAGGAGGAAGAAGAATCATTGCAAGAGATTGAAGCCATTAAGGAGATAGAAGCTACGAATGAAGAAGAAGAAGAACAATTGTTGCAGGAGATTGAAAGCAAAACTAGTTATCCAGAAGTTGGTGAAGAGAATGACGAGATTTCTGCAAAATCAGCTTCTGAAAACATCGATGAAGAAGATGCCCAAGAGAAAGAAACCGAAGAGAATTATGCAGTATCATCAGCTGATTTTGAAATTCTTGATCAAATTGAACCATCTGCTTCTAATAAAATCGACGAAGACGACGTCCACGAGAACGAAATTGAAGAGAATTATGAAGGATCATCAGCTGATTTTGAAATTCTTGATCAAATTGAGCTATCTGCTTCTGATGAAATCGACGAAGACGACGTCCACGAGAACGAAAGGAAAGAGAA
TCATGAAGCATCATCGGCTGATTTTGAAACCCTAGATCAAGTCGGACCAGAAGCTGAAACAGGGGAAACAGAGGAAGAAAACAACGATTCGCTACTGCAACAACAAAGCAACACAGCTCCAGTTTTTTCCCCTTTTGCTGAACCTCAATCTGATTTTCAGTCTTCAAATGGCAGCAACATCAAGATTTTACAGGGAATCTCAGGAGATTTTACACATAACATGATGTTCGCGTTGCTGCTATCTCTAATTATACCTGCAGGATTTATTTATGCAAAAAAATCAGGCTCAAAACCAACCATTGCGGCGGTGGAAGAGCAAAAGCAATCATTGATGAAGGAGGAGGAGAAGACGAACCACAGTCCGGAGGAAGAAGAAGAAGCGGCAGATGATGAGCATGATGATGATATGACTGGAGAATCCTGCTCTTCTGAAACGAGCAGTTTCCAGTACAGCAGCATGAGAGGAGCAGCCGGAGCAGTGAAAGAACCGAGTGAAGCTCATAGCCATAGCCATGGGAGGAAGAAGAGGAAGAATTCAAGAAGAGAATCTCTGGCTTCTTCCTCGGATGAAATTTCCATTTCTGCTTCTCCTTCTCCATCTTATGGGAGTTTCACTACCTACGAGAAAATCAAGCATCATGGAAATGGAGAAGAAGAGATTATGACTCCAGTTAGACGCTCTAATAGAATTAGAAAGCAACATAATAATAGT

Protein sequence

MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANSPSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAASKIAVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSIAAPTPKSSKTVRFAGFEVISGSYDDAESTYRYDTNTEVVTVAVETDLKPEIAPISSSKTVRFSDVEVISDLKNNFESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERGNRLEEFFSSVNASESEFAEETESENSLKESDESSSNESEQGEEEVEEKDEERIHVSEQKETIEVKKSSGMFKISSLLFILSITFFSICAVVRDPNISERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSNFRGGAPSLIYLNQTEFLYKDVDGQSLVTHQTLWEKEIFLNVIEEGATKERRIEYAETEDHGVEEEEESLQEIEAIKEIEATNEEEEEQLLQEIESKTSYPEVGEENDEISAKSASENIDEEDAQEKETEENYAVSSADFEILDQIEPSASNKIDEDDVHENEIEENYEGSSADFEILDQIELSASDEIDEDDVHENERKENHEASSADFETLDQVGPEAETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQSSNGSNIKILQGISGDFTHNMMFALLLSLIIPAGFIYAKKSGSKPTIAAVEEQKQSLMKEEEKTNHSPEEEEEAADDEHDDDMTGESCSSETSSFQYSSMRGAAGAVKEPSEAHSHSHGRKKRKNSRRESLASSSDEISISASPSPSYGSFTTYEKIKHHGNGEEEIMTPVRRSNRIRKQHNNS
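The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code and uses only the first 21 and last 12 bases of the CDS shown above; the helper name `translate` is hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence whose length is a multiple of 3."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGGCGGTGCCTTCTAACAAG"  # first 21 bases of the CDS above
cds_end = "CATAATAATAGT"             # last 12 bases of the CDS above

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first seven and last four residues of the protein sequence listed above. Note that the CDS as displayed ends on a serine codon (AGT), so the sketch does not need to strip a stop codon.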
Homology
BLAST of MS012374 vs. NCBI nr
Match: XP_022149260.1 (uncharacterized protein LOC111017725 [Momordica charantia])

HSP 1 Score: 1634.0 bits (4230), Expect = 0.0e+00
Identity = 899/904 (99.45%), Positives = 900/904 (99.56%), Query Frame = 0

Query: 1   MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
           MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS
Sbjct: 1   MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60

Query: 61  PSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAASKI 120
           PSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSP+VGKSSKHFMSPTISAASKI
Sbjct: 61  PSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPSVGKSSKHFMSPTISAASKI 120

Query: 121 AVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSI 180
           AVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSI
Sbjct: 121 AVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSI 180

Query: 181 AAPTPKSSKTVRFAGFEVISGSYDDAESTYRYDTNTEVVTVAVETDLKPEIAPISSSKTV 240
           AAPTPKSSKTVRFAGFEVISGSYDDAESTYRYDTNTEVVTVAVETDLKPEIAPISSSKTV
Sbjct: 181 AAPTPKSSKTVRFAGFEVISGSYDDAESTYRYDTNTEVVTVAVETDLKPEIAPISSSKTV 240

Query: 241 RFSDVEVISDLKNNFESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADPLMPPYDP 300
           RFSDVEVISDLKNNFES  KNIFTEELDCVNLDPSFKISPVSSPMVAPLDADPLMPPYDP
Sbjct: 241 RFSDVEVISDLKNNFESADKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADPLMPPYDP 300

Query: 301 KTNYLSPRPQFLHYRPNRRIERGNRLEEFFSSVNASESEFAEETESENSLKESDESSSNE 360
           KTNYLSPRPQFLHYRPNRRIERGNRLEEFFSSVNASESEFAEETESENSLKESDESSSNE
Sbjct: 301 KTNYLSPRPQFLHYRPNRRIERGNRLEEFFSSVNASESEFAEETESENSLKESDESSSNE 360

Query: 361 SEQGEEEVEEKDEERIHVSEQKETIEVKKSSGMFKISSLLFILSITFFSICAVVRDPNIS 420
           SEQGEEEVEEKDEERIHVSEQKE IEVKKSSGMFKISSLLFILSITFFSICAVVRDPNIS
Sbjct: 361 SEQGEEEVEEKDEERIHVSEQKEXIEVKKSSGMFKISSLLFILSITFFSICAVVRDPNIS 420

Query: 421 ERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSNFRGGAPSLIYLNQT 480
           ERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSNFRGGAPSLIYLNQT
Sbjct: 421 ERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSNFRGGAPSLIYLNQT 480

Query: 481 EFLYKDVDGQSLVTHQTLWEKEIFLNVIEEGATKERRIEYAETEDHGVEEEEESLQEIEA 540
           EFLYKDVDGQSLVTHQTLWEKE FLNVIEEGATKERRIEYAETEDHGVEEEEESLQEIEA
Sbjct: 481 EFLYKDVDGQSLVTHQTLWEKENFLNVIEEGATKERRIEYAETEDHGVEEEEESLQEIEA 540

Query: 541 IKEIEATNEEEEEQLLQEIESKTSYPEVGEENDEISAKSASENIDEEDAQEKETEENYAV 600
           IKEIEATNEEEEEQLLQEIESKTSYPEVGEENDEISAKSASENIDEEDAQEKETEENYAV
Sbjct: 541 IKEIEATNEEEEEQLLQEIESKTSYPEVGEENDEISAKSASENIDEEDAQEKETEENYAV 600

Query: 601 SSADFEILDQIEPSASNKIDEDDVHENEIEENYEGSSADFEILDQIELSASDEIDEDDVH 660
           SSADFEILDQIEPSASNKIDEDDVHENEIEENYEGSSADFEILDQIELSASDEIDEDDVH
Sbjct: 601 SSADFEILDQIEPSASNKIDEDDVHENEIEENYEGSSADFEILDQIELSASDEIDEDDVH 660

Query: 661 ENERKENHEASSADFETLDQVGPEAETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQ 720
           ENERKENHEASSADFETLDQVGPEAETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQ
Sbjct: 661 ENERKENHEASSADFETLDQVGPEAETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQ 720

Query: 721 SSNGSNIKILQGISGDFTHNMMFALLLSLIIPAGFIYAKKSGSKPTIAAVEEQKQSLMKE 780
           SSNGSNIKILQGISGDFTHNMMFALLLSLIIPAGFIYAKKSGSKPTIAAVEEQKQSLMKE
Sbjct: 721 SSNGSNIKILQGISGDFTHNMMFALLLSLIIPAGFIYAKKSGSKPTIAAVEEQKQSLMKE 780

Query: 781 EEKTNHSPEEEEEAADDEHDDDMTGESCSSETSSFQYSSMRGAAGAVKEPSEAHSHSHGR 840
           EEKTNHSPEEEEEAADDEHDDDMTGESCSSETSSFQYSSMRGAAGAVKEPSEAHSHSHGR
Sbjct: 781 EEKTNHSPEEEEEAADDEHDDDMTGESCSSETSSFQYSSMRGAAGAVKEPSEAHSHSHGR 840

Query: 841 KKRKNSRRESLASSSDEISISASPSPSYGSFTTYEKIKHHGNGEEEIMTPVRRSNRIRKQ 900
           KKRKNSRRESLASSSDEISISASPSPSYGSFTTYEKIKHHGNGEEEIMTPVRRSNRIRKQ
Sbjct: 841 KKRKNSRRESLASSSDEISISASPSPSYGSFTTYEKIKHHGNGEEEIMTPVRRSNRIRKQ 900

Query: 901 HNNS 905
           HNNS
Sbjct: 901 HNNS 904
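The percentages in the HSP summary line are simply the identity and positive counts divided by the alignment length. The following sketch recomputes them from the summary line of the hit above; the function name `parse_hsp_stats` is hypothetical, and the regular expression is deliberately tolerant of spelling variants of the "Positives" label.

```python
import re

def parse_hsp_stats(line):
    """Extract (identities, positives, alignment_length) from an HSP summary line."""
    ident = re.search(r"Identity = (\d+)/(\d+)", line)
    # 'Pos\w*' accepts 'Positives' as well as misspelt variants of the label.
    pos = re.search(r"Pos\w* = (\d+)/(\d+)", line)
    if ident is None or pos is None:
        raise ValueError("not an HSP summary line")
    return int(ident.group(1)), int(pos.group(1)), int(ident.group(2))

line = "Identity = 899/904 (99.45%), Positives = 900/904 (99.56%), Query Frame = 0"
identities, positives, length = parse_hsp_stats(line)

pct_identity = round(100 * identities / length, 2)
pct_positives = round(100 * positives / length, 2)
print(pct_identity, pct_positives)
```

For this hit the recomputed values agree with the percentages printed in the summary line.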

BLAST of MS012374 vs. NCBI nr
Match: XP_038903440.1 (uncharacterized protein LOC120090026 [Benincasa hispida])

HSP 1 Score: 759.2 bits (1959), Expect = 4.0e-215
Identity = 539/962 (56.03%), Positives = 659/962 (68.50%), Query Frame = 0

Query: 1   MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
           MA+PSN+SSSP+M++GRTSPNSRNSE+ N VRRSFSGNPF++PSIVANPR LNP+TPANS
Sbjct: 1   MALPSNRSSSPAMLSGRTSPNSRNSEISNPVRRSFSGNPFSKPSIVANPRGLNPITPANS 60

Query: 61  PSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAASKI 120
           PSDYPRRNSVSRE  FTSR+ +EKEN KDQ+PKP+RVRSP VGKSSKHFMSPTISAASKI
Sbjct: 61  PSDYPRRNSVSRENSFTSRNIQEKENEKDQSPKPVRVRSPMVGKSSKHFMSPTISAASKI 120

Query: 121 AVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSI 180
           A SPKKKILGD+NE VRSS SFSG KSSSLNSVN + +++ T +ESDTNP+I P+S+S  
Sbjct: 121 AASPKKKILGDQNEPVRSSNSFSGMKSSSLNSVNQSSQSSKT-LESDTNPQIPPVSSS-- 180

Query: 181 AAPTPKSSKTVRFAGFEVISGSYDDAESTYRYDTNTEVV-TVAVETDLKPEIAPISSSKT 240
                KS+KTVRF GFEVIS S+DD+E+TYRYD N EVV T+AVE D+K E+AP+S S +
Sbjct: 181 -----KSTKTVRFGGFEVISDSHDDSETTYRYDLNPEVVATMAVEADMKSEMAPVSKSAS 240

Query: 241 V------RFSDVEVISDLKNNFES-PAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDAD 300
                    SD EVIS    + +S PA++   E++DCVNLDPSFKISP+SSPM+APLD D
Sbjct: 241 AVAPLESSNSDFEVISISNKDLDSPPARSNLIEDVDCVNLDPSFKISPISSPMIAPLDDD 300

Query: 301 PLMPPYDPKTNYLSPRPQFLHYRPNRRIER----GNRLEEFFSSVNASESEFAEETESEN 360
           P +PPYDPKTNYLSPRPQFLHYRPNRRI R    G   E+ FS  N S+SE  EET+SE+
Sbjct: 301 PSIPPYDPKTNYLSPRPQFLHYRPNRRINRYEPEGRLEEKLFSFANVSQSESMEETDSED 360

Query: 361 SLKESDESSSNESEQGEEEVEEKDEERIHVSEQKETIEVKKS-----SGMFKISSLLFIL 420
           S KESDE+SSNESE    E EE++EE I+VSEQ  T E+K+S     S +FK SSLL IL
Sbjct: 361 SPKESDEASSNESEM---EEEEQEEEEINVSEQSPT-EMKQSSKLHFSSIFKTSSLLLIL 420

Query: 421 SITFFSICAV-VRDPNISERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFH 480
               FSIC V V DPNI +R S L +E  SEI+ FAK NFNVLVGKLEVWH  SIS I  
Sbjct: 421 FTACFSICVVNVHDPNIFQRPSSLTMEDESEIFGFAKTNFNVLVGKLEVWHVKSISFISD 480

Query: 481 VVSNFRGGAPSLIYLNQTEFL--YKDVDGQSLV-THQTLWEKEIFLNVIEEGATKERRIE 540
           VV NFRGG P + Y NQTEF   Y +++ Q LV +HQT+WE+E  LNVIE  A K+R I+
Sbjct: 481 VVFNFRGGLPLIHYENQTEFFNEYFNMNEQCLVLSHQTVWEEENNLNVIE--AMKDREID 540

Query: 541 YAE---TEDHGVEEEEESLQEIEAIKEIEATNEE----EEEQLLQEIESKTSYPEVGEEN 600
             E    ++   +EEE+  +E+     IE    E    EEE+L QEIE+     E  +E 
Sbjct: 541 IFEEPIEKECQNKEEEQEAEELPREIGIETDERESEIVEEEELFQEIEAMKVREEQEQEQ 600

Query: 601 DEISAKSAS--------ENIDEEDAQEKETEENYAVSSADFEILDQIEPSASNKIDEDDV 660
           +++  +  +        EN++ E   E+E E+   VS  + E       + +N+ + D+ 
Sbjct: 601 EDVLQEIEAIKMREIFVENVERESQNEEELED---VSFQETE-------ANANEEENDEA 660

Query: 661 HENEIEENYEGSSADFEILDQIELSASDEIDEDDVHENERKENHEASS-ADFETLDQV-- 720
            +  ++E  E S          E SASD++ E++  + + +EN + SS +D +  DQ+  
Sbjct: 661 FQESLQETIEES----------ENSASDKLTEEEYVQEKPEENFKFSSLSDLKFHDQIEQ 720

Query: 721 GPEAETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQSSNGSN----IKILQGISGDF 780
              A TGETEEE N      Q    PV  P AE QSDF+  NG      I+   GIS DF
Sbjct: 721 AAAAATGETEEEKNTEF---QYQLPPVSPPAAEHQSDFEEKNGGKIIDLIRTKNGISQDF 780

Query: 781 THN---MMFALLLSLIIPAGFIYAKKSGSKPT-----IAAVEEQKQSLMKEEEKTNHSPE 840
           T N   ++ A+LL  +I  G IYA++SGSKP+     IA  EE+KQ L+K EEK N S  
Sbjct: 781 TQNTAIIISAILLGTLI-IGLIYARQSGSKPSSSMAAIAEEEEEKQPLVK-EEKMNQSLV 840

Query: 841 EEEEAADD---EHDDDMTGESCSSETSSFQYSSMRGA-AGAVKEPSEAHSHSHGRKK-RK 900
           EEEE  ++   E +DDM GE CSSETSSFQYSSMR     A K  SE  SHSHGRKK RK
Sbjct: 841 EEEEVVEEEGHEEEDDMGGEFCSSETSSFQYSSMREEDTKAGKRSSEVQSHSHGRKKMRK 900

Query: 901 NSRRESLASSS-DEISISASPSPSYGSFTTYEKIK-HHGNGEEEIMTPVRRSNRIRKQHN 905
           NSRRES+ASSS DE S+S S SPSYGSFTTYEKI   HG G++EI+TPVRRS RIRKQHN
Sbjct: 901 NSRRESMASSSLDEYSVSTSASPSYGSFTTYEKIPIKHGKGDDEIVTPVRRSTRIRKQHN 923
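The identity count for a gapped alignment can be reproduced row by row by comparing aligned columns. The sketch below takes the Query and Sbjct strings of the 121..180 row of the alignment above; the function name `row_stats` is hypothetical. Positives are not recomputed because that would require the substitution matrix used by BLAST, which this page does not name.

```python
def row_stats(query, sbjct):
    """Count identical columns and gap characters in one alignment row."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    gaps = query.count("-") + sbjct.count("-")
    return identical, gaps, len(query)

# Row 121..180 of the Benincasa hispida alignment, copied from above.
query = "AVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSI"
sbjct = "AASPKKKILGDQNEPVRSSNSFSGMKSSSLNSVNQSSQSSKT-LESDTNPQIPPVSSS--"

identical, gaps, columns = row_stats(query, sbjct)
print(identical, gaps, columns)
```

Summing `identical` over every row of an HSP should give the numerator of the Identity field, and summing `columns` gives its denominator, since gapped columns count towards the alignment length.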

BLAST of MS012374 vs. NCBI nr
Match: XP_022984665.1 (uncharacterized protein LOC111482876 isoform X5 [Cucurbita maxima])

HSP 1 Score: 717.2 bits (1850), Expect = 1.7e-202
Identity = 543/1045 (51.96%), Positives = 657/1045 (62.87%), Query Frame = 0

Query: 1    MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
            MA+PSN+SSSPSMV GRTSP SRNSE+ N V RSFS NPF++PSI  + +SLNP+TPAN+
Sbjct: 1    MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60

Query: 61   PS--DY-PRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAA 120
            PS  DY P+RNSVSREILFTSRDNE+KENGKDQ+PK  RVRSPTVGKS K+FMS TISAA
Sbjct: 61   PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120

Query: 121  SKIAVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPE------ 180
            SKIAVSPKKKILGDRNE VRSSLSFSG KSSSLNSVNP PE A+ A ESDTNP       
Sbjct: 121  SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPE-ASMAFESDTNPPMPLISN 180

Query: 181  ----------------------------------------------IVPISNSSIAAPTP 240
                                                          IVPI+ S+IAA + 
Sbjct: 181  PKSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASS 240

Query: 241  KSSKTVRFAGFEVISGSYDDAESTYR--YDTNTEVVTVAVETDLKPEIAPIS-------- 300
            KSSKTV F GFEVIS SYDD+ESTYR  +D N E VTVAVE D +PEI PIS        
Sbjct: 241  KSSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVT 300

Query: 301  --SSKTVRFSDVEVISDLKNNFESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADP 360
              +SK +RFSD+E +S+  N  ES   + FTEE+DCVNLDPSF ISPVSSPM+AP+DADP
Sbjct: 301  PEASKIMRFSDLEAVSN--NALESSVNSNFTEEVDCVNLDPSFNISPVSSPMIAPMDADP 360

Query: 361  LMPPYDPKTNYLSPRPQFLHYRPNRRIER-GNRLEEFFSSVNASESEFAEETESENSLKE 420
            ++ PYDPKTNYLSPRPQFLHY PNRRI R   R EE FS+        +EET+ E+  KE
Sbjct: 361  IITPYDPKTNYLSPRPQFLHYNPNRRINRPDGRFEELFST--------SEETDCEDPQKE 420

Query: 421  SDESSSNESEQGEEEVEEKDEERIHVSEQKETIEVKKS-----SGMFKISSLLFILSITF 480
            SDE SSNES+  EEE     EE + VSEQ  T EVKKS     S +FKISSLL IL    
Sbjct: 421  SDEVSSNESQMKEEE----KEEEVDVSEQGPT-EVKKSSKPLLSRIFKISSLLLILFTAC 480

Query: 481  FSICAV-VRDPNISERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSN 540
             SIC V V DP I ERS+LL +   SEI+  AK NFNVLVGKLE+WHA+SIS I  VV N
Sbjct: 481  LSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVFN 540

Query: 541  FRGGAPSLIYLNQTEFLYKDV--DGQSLV-THQTLWEKE-IFLNVIEEGATKERRIEYAE 600
            FRGG P LI+LNQTEF Y DV  D Q LV +HQ +WE+E   +N +E  A K+R  +  E
Sbjct: 541  FRGG-PPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAME--AMKDREGQNKE 600

Query: 601  TEDHGVEEEEESLQEIEAIKEI-------EATNEEEEEQLLQEIESKTSYPEVGE-ENDE 660
                G E+EE++ +E   +KEI       E+ NEE EEQ  QEIE++T+  E  E ENDE
Sbjct: 601  ----GQEQEEDAQEEAIKVKEIGIQTVERESQNEEVEEQSFQEIEARTNDSENSEKENDE 660

Query: 661  ISAKSASENID---------EEDAQEKETEENYAVSSADFEILDQIEPSASNKIDEDDVH 720
             S +S  E I+         E   Q++E ++  A+   +  I      S + +++E+   
Sbjct: 661  ASEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQ 720

Query: 721  ENEIEENYEGSSADFEILDQIELSASDEIDEDDVHENERKENHEASSADFETLDQVGPEA 780
            + E + N +    + E  +  E S  + ++E+ V E   +    +SS+DF+   Q+   A
Sbjct: 721  KTEAKAN-DQKDREEENDEASEESLLEIVEEESVQEKTVENFKASSSSDFKLHGQIEQAA 780

Query: 781  ETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQSSNGSN----IKILQGISGDFTHN- 840
             TGET+EE N      Q  + PV SP +E QSD +  NG      I+   GIS DFT N 
Sbjct: 781  ATGETQEETNTEF---QYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATGISRDFTQNT 840

Query: 841  --MMFALLLS--LIIPAGFIYAKKSGSK---PTIAAVEEQKQSLMKEEEKTNHS--PEEE 900
              ++ A+LL   LIIPAG IYA+KSGS+    T A  EEQ++  + +++KTN S   EEE
Sbjct: 841  AAIISAILLGLFLIIPAGLIYARKSGSRRTTSTAAIAEEQQEEPLLKDKKTNQSLVEEEE 900

Query: 901  EEAADDEHDDDMTGESCSSETSS-FQYSSMR---------------------------GA 905
            EE A D+ DDDM GE CSSETSS FQYSS+R                             
Sbjct: 901  EEDALDDDDDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRRENSRRESI 960

BLAST of MS012374 vs. NCBI nr
Match: XP_022984664.1 (uncharacterized protein LOC111482876 isoform X4 [Cucurbita maxima])

HSP 1 Score: 715.7 bits (1846), Expect = 5.1e-202
Identity = 542/1045 (51.87%), Positives = 657/1045 (62.87%), Query Frame = 0

Query: 1    MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
            MA+PSN+SSSPSMV GRTSP SRNSE+ N V RSFS NPF++PSI  + +SLNP+TPAN+
Sbjct: 1    MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60

Query: 61   PS--DY-PRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAA 120
            PS  DY P+RNSVSREILFTSRDNE+KENGKDQ+PK  RVRSPTVGKS K+FMS TISAA
Sbjct: 61   PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120

Query: 121  SKIAVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPE------ 180
            SKIAVSPKKKILGDRNE VRSSLSFSG KSSSLNSVNP PE A+ A ESDTNP       
Sbjct: 121  SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPE-ASMAFESDTNPPMPLISN 180

Query: 181  ----------------------------------------------IVPISNSSIAAPTP 240
                                                          IVPI+ S+IAA + 
Sbjct: 181  PKSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASS 240

Query: 241  KSSKTVRFAGFEVISGSYDDAESTYR--YDTNTEVVTVAVETDLKPEIAPIS-------- 300
            KSSKTV F GFEVIS SYDD+ESTYR  +D N E VTVAVE D +PEI PIS        
Sbjct: 241  KSSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVT 300

Query: 301  --SSKTVRFSDVEVISDLKNNFESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADP 360
              +SK +RFSD+E +S+  N  ES   + FTEE+DCVNLDPSF ISPVSSPM+AP+DADP
Sbjct: 301  PEASKIMRFSDLEAVSN--NALESSVNSNFTEEVDCVNLDPSFNISPVSSPMIAPMDADP 360

Query: 361  LMPPYDPKTNYLSPRPQFLHYRPNRRIER-GNRLEEFFSSVNASESEFAEETESENSLKE 420
            ++ PYDPKTNYLSPRPQFLHY PNRRI R   R EE FS+        +EET+ E+  KE
Sbjct: 361  IITPYDPKTNYLSPRPQFLHYNPNRRINRPDGRFEELFST--------SEETDCEDPQKE 420

Query: 421  SDESSSNESEQGEEEVEEKDEERIHVSEQKETIEVKKS-----SGMFKISSLLFILSITF 480
            SDE SSNES+  EEE     EE + VSEQ  T EVKKS     S +FKISSLL IL    
Sbjct: 421  SDEVSSNESQMKEEE----KEEEVDVSEQGPT-EVKKSSKPLLSRIFKISSLLLILFTAC 480

Query: 481  FSICAV-VRDPNISERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSN 540
             SIC V V DP I ERS+LL +   SEI+  AK NFNVLVGKLE+WHA+SIS I  VV N
Sbjct: 481  LSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVFN 540

Query: 541  FRGGAPSLIYLNQTEFLYKDV--DGQSLV-THQTLWEKE-IFLNVIEEGATKERRIEYAE 600
            FRGG P LI+LNQTEF Y DV  D Q LV +HQ +WE+E   +N +E  A K+R  +  E
Sbjct: 541  FRGG-PPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAME--AMKDREGQNKE 600

Query: 601  TEDHGVEEEEESLQEIEAIKEI-------EATNEEEEEQLLQEIESKTSYPEVGE-ENDE 660
                G E+EE++ +E   +KEI       E+ NEE EEQ  QEIE++T+  E  E ENDE
Sbjct: 601  ----GQEQEEDAQEEAIKVKEIGIQTVERESQNEEVEEQSFQEIEARTNDSENSEKENDE 660

Query: 661  ISAKSASENID---------EEDAQEKETEENYAVSSADFEILDQIEPSASNKIDEDDVH 720
             S +S  E I+         E   Q++E ++  A+   +  I      S + +++E+   
Sbjct: 661  ASEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQ 720

Query: 721  ENEIEENYEGSSADFEILDQIELSASDEIDEDDVHENERKENHEASSADFETLDQVGPEA 780
            + E + N +    + E  +  E S  + ++E+ V E   +    +SS+DF+  D++   A
Sbjct: 721  KTEAKAN-DQKDREEENDEASEESLLEIVEEESVQEKTVENFKASSSSDFKLHDEIEQAA 780

Query: 781  ETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQSSNGSN----IKILQGISGDFTHN- 840
             T ET+EE N      Q  + PV SP +E QSD +  NG      I+   GIS DFT N 
Sbjct: 781  ATEETQEETNTEF---QYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATGISRDFTQNT 840

Query: 841  --MMFALLLS--LIIPAGFIYAKKSGSK---PTIAAVEEQKQSLMKEEEKTNHS--PEEE 900
              ++ A+LL   LIIPAG IYA+KSGS+    T A  EEQ++  + +++KTN S   EEE
Sbjct: 841  AAIISAILLGLFLIIPAGLIYARKSGSRRTTSTAAIAEEQQEEPLLKDKKTNQSLVEEEE 900

Query: 901  EEAADDEHDDDMTGESCSSETSS-FQYSSMR---------------------------GA 905
            EE A D+ DDDM GE CSSETSS FQYSS+R                             
Sbjct: 901  EEDALDDDDDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRRENSRRESI 960

BLAST of MS012374 vs. NCBI nr
Match: XP_022984662.1 (uncharacterized protein LOC111482876 isoform X2 [Cucurbita maxima])

HSP 1 Score: 708.0 bits (1826), Expect = 1.1e-199
Identity = 546/1082 (50.46%), Positives = 661/1082 (61.09%), Query Frame = 0

Query: 1    MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
            MA+PSN+SSSPSMV GRTSP SRNSE+ N V RSFS NPF++PSI  + +SLNP+TPAN+
Sbjct: 1    MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60

Query: 61   PSDY-PRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAASK 120
            PSDY P+RNSVSREILFTSRDNE+KENGKDQ+PK  RVRSPTVGKS K+FMS TISAASK
Sbjct: 61   PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120

Query: 121  IAVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPE-------- 180
            IAVSPKKKILGDRNE VRSSLSFSG KSSSLNSVNP PE A+ A ESDTNP         
Sbjct: 121  IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPE-ASMAFESDTNPPMPLISNPK 180

Query: 181  --------------------------------------------IVPISNSSIAAPTPKS 240
                                                        IVPI+ S+IAA + KS
Sbjct: 181  STKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSKS 240

Query: 241  SKTVRFAGFEVISGSYDDAESTYR--YDTNTEVVTVAVETDLKPEIAPIS---------- 300
            SKTV F GFEVIS SYDD+ESTYR  +D N E VTVAVE D +PEI PIS          
Sbjct: 241  SKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTPE 300

Query: 301  SSKTVRFSDVEVISDLKNNFESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADPLM 360
            +SK +RFSD+E +S+  N  ES   + FTEE+DCVNLDPSF ISPVSSPM+AP+DADP++
Sbjct: 301  ASKIMRFSDLEAVSN--NALESSVNSNFTEEVDCVNLDPSFNISPVSSPMIAPMDADPII 360

Query: 361  PPYDPKTNYLSPRPQFLHYRPNRRIER-GNRLEEFFSSVNASESEFAEETESENSLKESD 420
             PYDPKTNYLSPRPQFLHY PNRRI R   R EE FS+        +EET+ E+  KESD
Sbjct: 361  TPYDPKTNYLSPRPQFLHYNPNRRINRPDGRFEELFST--------SEETDCEDPQKESD 420

Query: 421  ESSSNESEQGEEEVEEKDEERIHVSEQKETIEVKKS-----SGMFKISSLLFILSITFFS 480
            E SSNES+  EEE     EE + VSEQ  T EVKKS     S +FKISSLL IL     S
Sbjct: 421  EVSSNESQMKEEE----KEEEVDVSEQGPT-EVKKSSKPLLSRIFKISSLLLILFTACLS 480

Query: 481  ICAV-VRDPNISERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSNFR 540
            IC V V DP I ERS+LL +   SEI+  AK NFNVLVGKLE+WHA+SIS I  VV NFR
Sbjct: 481  ICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVFNFR 540

Query: 541  GGAPSLIYLNQTEFLYKDV--DGQSLV-THQTLWEKE-IFLNVIEEGATKERRIEYAETE 600
            GG P LI+LNQTEF Y DV  D Q LV +HQ +WE+E   +N +E  A K+R  +  E  
Sbjct: 541  GG-PPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAME--AMKDREGQNKE-- 600

Query: 601  DHGVEEEEESLQEIEAIKEI-------EATNEEEEEQLLQEIESKTSYPEVGE-ENDEIS 660
              G E+EE++ +E   +KEI       E+ NEE EEQ  QEIE++T+  E  E ENDE S
Sbjct: 601  --GQEQEEDAQEEAIKVKEIGIQTVERESQNEEVEEQSFQEIEARTNDSENSEKENDEAS 660

Query: 661  AKSASENID---------EEDAQEKETEENYAVSSADFEILDQIEPSASNK--------- 720
             +S  E I+         E   Q++E ++  A+   +  I + +E  + N+         
Sbjct: 661  EESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGI-ETVERESQNEEVEEEPFQK 720

Query: 721  --------------------------IDEDDVHENEIEENYEGSSADFEILDQIELSASD 780
                                      ++E+ V E  +E     SS+DF++  QIE +A+ 
Sbjct: 721  TEAKANDQKDREEENDEASEESLLEIVEEESVQEKTVENFKASSSSDFKLHGQIEQAAAT 780

Query: 781  EIDEDDVHE----NERKENHEASSADFETLDQVGPEAETGETEEENNDSLLQQQSNTAPV 840
                 ++ +     E     E ++A  ET  ++   A T ET+EE N      Q  + PV
Sbjct: 781  GETHYEIEQAAATGETHYEIEQAAATGETHYEIEQAAATEETQEETNTEF---QYQSPPV 840

Query: 841  FSPFAEPQSDFQSSNGSN----IKILQGISGDFTHN---MMFALLLS--LIIPAGFIYAK 900
             SP +E QSD +  NG      I+   GIS DFT N   ++ A+LL   LIIPAG IYA+
Sbjct: 841  SSPPSEHQSDVEEENGGKIVDLIRTATGISRDFTQNTAAIISAILLGLFLIIPAGLIYAR 900

Query: 901  KSGSK---PTIAAVEEQKQSLMKEEEKTNHS--PEEEEEAADDEHDDDMTGESCSSETSS 905
            KSGS+    T A  EEQ++  + +++KTN S   EEEEE A D+ DDDM GE CSSETSS
Sbjct: 901  KSGSRRTTSTAAIAEEQQEEPLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSETSS 960

BLAST of MS012374 vs. ExPASy TrEMBL
Match: A0A6J1D590 (uncharacterized protein LOC111017725 OS=Momordica charantia OX=3673 GN=LOC111017725 PE=4 SV=1)

HSP 1 Score: 1634.0 bits (4230), Expect = 0.0e+00
Identity = 899/904 (99.45%), Positives = 900/904 (99.56%), Query Frame = 0

Query: 1   MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
           MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS
Sbjct: 1   MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60

Query: 61  PSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAASKI 120
           PSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSP+VGKSSKHFMSPTISAASKI
Sbjct: 61  PSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPSVGKSSKHFMSPTISAASKI 120

Query: 121 AVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSI 180
           AVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSI
Sbjct: 121 AVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSI 180

Query: 181 AAPTPKSSKTVRFAGFEVISGSYDDAESTYRYDTNTEVVTVAVETDLKPEIAPISSSKTV 240
           AAPTPKSSKTVRFAGFEVISGSYDDAESTYRYDTNTEVVTVAVETDLKPEIAPISSSKTV
Sbjct: 181 AAPTPKSSKTVRFAGFEVISGSYDDAESTYRYDTNTEVVTVAVETDLKPEIAPISSSKTV 240

Query: 241 RFSDVEVISDLKNNFESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADPLMPPYDP 300
           RFSDVEVISDLKNNFES  KNIFTEELDCVNLDPSFKISPVSSPMVAPLDADPLMPPYDP
Sbjct: 241 RFSDVEVISDLKNNFESADKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADPLMPPYDP 300

Query: 301 KTNYLSPRPQFLHYRPNRRIERGNRLEEFFSSVNASESEFAEETESENSLKESDESSSNE 360
           KTNYLSPRPQFLHYRPNRRIERGNRLEEFFSSVNASESEFAEETESENSLKESDESSSNE
Sbjct: 301 KTNYLSPRPQFLHYRPNRRIERGNRLEEFFSSVNASESEFAEETESENSLKESDESSSNE 360

Query: 361 SEQGEEEVEEKDEERIHVSEQKETIEVKKSSGMFKISSLLFILSITFFSICAVVRDPNIS 420
           SEQGEEEVEEKDEERIHVSEQKE IEVKKSSGMFKISSLLFILSITFFSICAVVRDPNIS
Sbjct: 361 SEQGEEEVEEKDEERIHVSEQKEXIEVKKSSGMFKISSLLFILSITFFSICAVVRDPNIS 420

Query: 421 ERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSNFRGGAPSLIYLNQT 480
           ERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSNFRGGAPSLIYLNQT
Sbjct: 421 ERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSNFRGGAPSLIYLNQT 480

Query: 481 EFLYKDVDGQSLVTHQTLWEKEIFLNVIEEGATKERRIEYAETEDHGVEEEEESLQEIEA 540
           EFLYKDVDGQSLVTHQTLWEKE FLNVIEEGATKERRIEYAETEDHGVEEEEESLQEIEA
Sbjct: 481 EFLYKDVDGQSLVTHQTLWEKENFLNVIEEGATKERRIEYAETEDHGVEEEEESLQEIEA 540

Query: 541 IKEIEATNEEEEEQLLQEIESKTSYPEVGEENDEISAKSASENIDEEDAQEKETEENYAV 600
           IKEIEATNEEEEEQLLQEIESKTSYPEVGEENDEISAKSASENIDEEDAQEKETEENYAV
Sbjct: 541 IKEIEATNEEEEEQLLQEIESKTSYPEVGEENDEISAKSASENIDEEDAQEKETEENYAV 600

Query: 601 SSADFEILDQIEPSASNKIDEDDVHENEIEENYEGSSADFEILDQIELSASDEIDEDDVH 660
           SSADFEILDQIEPSASNKIDEDDVHENEIEENYEGSSADFEILDQIELSASDEIDEDDVH
Sbjct: 601 SSADFEILDQIEPSASNKIDEDDVHENEIEENYEGSSADFEILDQIELSASDEIDEDDVH 660

Query: 661 ENERKENHEASSADFETLDQVGPEAETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQ 720
           ENERKENHEASSADFETLDQVGPEAETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQ
Sbjct: 661 ENERKENHEASSADFETLDQVGPEAETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQ 720

Query: 721 SSNGSNIKILQGISGDFTHNMMFALLLSLIIPAGFIYAKKSGSKPTIAAVEEQKQSLMKE 780
           SSNGSNIKILQGISGDFTHNMMFALLLSLIIPAGFIYAKKSGSKPTIAAVEEQKQSLMKE
Sbjct: 721 SSNGSNIKILQGISGDFTHNMMFALLLSLIIPAGFIYAKKSGSKPTIAAVEEQKQSLMKE 780

Query: 781 EEKTNHSPEEEEEAADDEHDDDMTGESCSSETSSFQYSSMRGAAGAVKEPSEAHSHSHGR 840
           EEKTNHSPEEEEEAADDEHDDDMTGESCSSETSSFQYSSMRGAAGAVKEPSEAHSHSHGR
Sbjct: 781 EEKTNHSPEEEEEAADDEHDDDMTGESCSSETSSFQYSSMRGAAGAVKEPSEAHSHSHGR 840

Query: 841 KKRKNSRRESLASSSDEISISASPSPSYGSFTTYEKIKHHGNGEEEIMTPVRRSNRIRKQ 900
           KKRKNSRRESLASSSDEISISASPSPSYGSFTTYEKIKHHGNGEEEIMTPVRRSNRIRKQ
Sbjct: 841 KKRKNSRRESLASSSDEISISASPSPSYGSFTTYEKIKHHGNGEEEIMTPVRRSNRIRKQ 900

Query: 901 HNNS 905
           HNNS
Sbjct: 901 HNNS 904
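
The middle line of each alignment block repeats a residue where Query and Sbjct are identical, shows `+` for a conservative substitution, and is blank otherwise. As a minimal sketch (the function name `count_identities` is hypothetical and not part of any BLAST tool), the identities in one block can be recounted directly from the two sequence rows. The rows used here are copied from the 61..120 block above.

```python
def count_identities(query: str, sbjct: str) -> tuple[int, int]:
    """Return (identical positions, aligned length) for one alignment row pair."""
    if len(query) != len(sbjct):
        raise ValueError("query and subject rows must have the same length")
    # A gap aligned to a gap is never counted as an identity.
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return identical, len(query)


# Rows copied from the Query/Sbjct 61..120 block of the alignment above.
query_row = "PSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAASKI"
sbjct_row = "PSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPSVGKSSKHFMSPTISAASKI"

identical, aligned = count_identities(query_row, sbjct_row)

# Report mismatches in query coordinates; the row starts at residue 61.
row_start = 61
mismatches = [
    (row_start + i, q, s)
    for i, (q, s) in enumerate(zip(query_row, sbjct_row))
    if q != s
]
print(identical, aligned, mismatches)
```

Counting "Positives" the same way would additionally need the substitution matrix used for the search, which this page does not state.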

BLAST of MS012374 vs. ExPASy TrEMBL
Match: A0A6J1J980 (uncharacterized protein LOC111482876 isoform X5 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)

HSP 1 Score: 717.2 bits (1850), Expect = 8.4e-203
Identity = 543/1045 (51.96%), Positives = 657/1045 (62.87%), Query Frame = 0

Query: 1    MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
            MA+PSN+SSSPSMV GRTSP SRNSE+ N V RSFS NPF++PSI  + +SLNP+TPAN+
Sbjct: 1    MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60

Query: 61   PS--DY-PRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAA 120
            PS  DY P+RNSVSREILFTSRDNE+KENGKDQ+PK  RVRSPTVGKS K+FMS TISAA
Sbjct: 61   PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120

Query: 121  SKIAVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPE------ 180
            SKIAVSPKKKILGDRNE VRSSLSFSG KSSSLNSVNP PE A+ A ESDTNP       
Sbjct: 121  SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPE-ASMAFESDTNPPMPLISN 180

Query: 181  ----------------------------------------------IVPISNSSIAAPTP 240
                                                          IVPI+ S+IAA + 
Sbjct: 181  PKSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASS 240

Query: 241  KSSKTVRFAGFEVISGSYDDAESTYR--YDTNTEVVTVAVETDLKPEIAPIS-------- 300
            KSSKTV F GFEVIS SYDD+ESTYR  +D N E VTVAVE D +PEI PIS        
Sbjct: 241  KSSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVT 300

Query: 301  --SSKTVRFSDVEVISDLKNNFESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADP 360
              +SK +RFSD+E +S+  N  ES   + FTEE+DCVNLDPSF ISPVSSPM+AP+DADP
Sbjct: 301  PEASKIMRFSDLEAVSN--NALESSVNSNFTEEVDCVNLDPSFNISPVSSPMIAPMDADP 360

Query: 361  LMPPYDPKTNYLSPRPQFLHYRPNRRIER-GNRLEEFFSSVNASESEFAEETESENSLKE 420
            ++ PYDPKTNYLSPRPQFLHY PNRRI R   R EE FS+        +EET+ E+  KE
Sbjct: 361  IITPYDPKTNYLSPRPQFLHYNPNRRINRPDGRFEELFST--------SEETDCEDPQKE 420

Query: 421  SDESSSNESEQGEEEVEEKDEERIHVSEQKETIEVKKS-----SGMFKISSLLFILSITF 480
            SDE SSNES+  EEE     EE + VSEQ  T EVKKS     S +FKISSLL IL    
Sbjct: 421  SDEVSSNESQMKEEE----KEEEVDVSEQGPT-EVKKSSKPLLSRIFKISSLLLILFTAC 480

Query: 481  FSICAV-VRDPNISERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSN 540
             SIC V V DP I ERS+LL +   SEI+  AK NFNVLVGKLE+WHA+SIS I  VV N
Sbjct: 481  LSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVFN 540

Query: 541  FRGGAPSLIYLNQTEFLYKDV--DGQSLV-THQTLWEKE-IFLNVIEEGATKERRIEYAE 600
            FRGG P LI+LNQTEF Y DV  D Q LV +HQ +WE+E   +N +E  A K+R  +  E
Sbjct: 541  FRGG-PPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAME--AMKDREGQNKE 600

Query: 601  TEDHGVEEEEESLQEIEAIKEI-------EATNEEEEEQLLQEIESKTSYPEVGE-ENDE 660
                G E+EE++ +E   +KEI       E+ NEE EEQ  QEIE++T+  E  E ENDE
Sbjct: 601  ----GQEQEEDAQEEAIKVKEIGIQTVERESQNEEVEEQSFQEIEARTNDSENSEKENDE 660

Query: 661  ISAKSASENID---------EEDAQEKETEENYAVSSADFEILDQIEPSASNKIDEDDVH 720
             S +S  E I+         E   Q++E ++  A+   +  I      S + +++E+   
Sbjct: 661  ASEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQ 720

Query: 721  ENEIEENYEGSSADFEILDQIELSASDEIDEDDVHENERKENHEASSADFETLDQVGPEA 780
            + E + N +    + E  +  E S  + ++E+ V E   +    +SS+DF+   Q+   A
Sbjct: 721  KTEAKAN-DQKDREEENDEASEESLLEIVEEESVQEKTVENFKASSSSDFKLHGQIEQAA 780

Query: 781  ETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQSSNGSN----IKILQGISGDFTHN- 840
             TGET+EE N      Q  + PV SP +E QSD +  NG      I+   GIS DFT N 
Sbjct: 781  ATGETQEETNTEF---QYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATGISRDFTQNT 840

Query: 841  --MMFALLLS--LIIPAGFIYAKKSGSK---PTIAAVEEQKQSLMKEEEKTNHS--PEEE 900
              ++ A+LL   LIIPAG IYA+KSGS+    T A  EEQ++  + +++KTN S   EEE
Sbjct: 841  AAIISAILLGLFLIIPAGLIYARKSGSRRTTSTAAIAEEQQEEPLLKDKKTNQSLVEEEE 900

Query: 901  EEAADDEHDDDMTGESCSSETSS-FQYSSMR---------------------------GA 905
            EE A D+ DDDM GE CSSETSS FQYSS+R                             
Sbjct: 901  EEDALDDDDDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRRENSRRESI 960
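
The percentages on each HSP summary line are the match counts divided by the alignment length (including gap columns). The sketch below parses such a line and recomputes both values. The regular expression and the function name `check_hsp_stats` are assumptions made for illustration; the pattern also tolerates the "Postives" spelling that appears in some exports.

```python
import re

# Matches e.g. "Identity = 543/1045 (51.96%), Positives = 657/1045 (62.87%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos[i]?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line: str) -> dict:
    """Recompute identity and positives percentages from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    identity = round(100 * int(id_n) / int(id_len), 2)
    positives = round(100 * int(pos_n) / int(pos_len), 2)
    return {
        "identity": identity,
        "positives": positives,
        # True when the recomputed values agree with the reported ones.
        "consistent": abs(identity - float(id_pct)) < 0.005
        and abs(positives - float(pos_pct)) < 0.005,
    }


stats = check_hsp_stats(
    "Identity = 543/1045 (51.96%), Positives = 657/1045 (62.87%), Query Frame = 0"
)
print(stats)
```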

BLAST of MS012374 vs. ExPASy TrEMBL
Match: A0A6J1JB72 (uncharacterized protein LOC111482876 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)

HSP 1 Score: 715.7 bits (1846), Expect = 2.5e-202
Identity = 542/1045 (51.87%), Positives = 657/1045 (62.87%), Query Frame = 0

Query: 1    MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
            MA+PSN+SSSPSMV GRTSP SRNSE+ N V RSFS NPF++PSI  + +SLNP+TPAN+
Sbjct: 1    MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60

Query: 61   PS--DY-PRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAA 120
            PS  DY P+RNSVSREILFTSRDNE+KENGKDQ+PK  RVRSPTVGKS K+FMS TISAA
Sbjct: 61   PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120

Query: 121  SKIAVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPE------ 180
            SKIAVSPKKKILGDRNE VRSSLSFSG KSSSLNSVNP PE A+ A ESDTNP       
Sbjct: 121  SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPE-ASMAFESDTNPPMPLISN 180

Query: 181  ----------------------------------------------IVPISNSSIAAPTP 240
                                                          IVPI+ S+IAA + 
Sbjct: 181  PKSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASS 240

Query: 241  KSSKTVRFAGFEVISGSYDDAESTYR--YDTNTEVVTVAVETDLKPEIAPIS-------- 300
            KSSKTV F GFEVIS SYDD+ESTYR  +D N E VTVAVE D +PEI PIS        
Sbjct: 241  KSSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVT 300

Query: 301  --SSKTVRFSDVEVISDLKNNFESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADP 360
              +SK +RFSD+E +S+  N  ES   + FTEE+DCVNLDPSF ISPVSSPM+AP+DADP
Sbjct: 301  PEASKIMRFSDLEAVSN--NALESSVNSNFTEEVDCVNLDPSFNISPVSSPMIAPMDADP 360

Query: 361  LMPPYDPKTNYLSPRPQFLHYRPNRRIER-GNRLEEFFSSVNASESEFAEETESENSLKE 420
            ++ PYDPKTNYLSPRPQFLHY PNRRI R   R EE FS+        +EET+ E+  KE
Sbjct: 361  IITPYDPKTNYLSPRPQFLHYNPNRRINRPDGRFEELFST--------SEETDCEDPQKE 420

Query: 421  SDESSSNESEQGEEEVEEKDEERIHVSEQKETIEVKKS-----SGMFKISSLLFILSITF 480
            SDE SSNES+  EEE     EE + VSEQ  T EVKKS     S +FKISSLL IL    
Sbjct: 421  SDEVSSNESQMKEEE----KEEEVDVSEQGPT-EVKKSSKPLLSRIFKISSLLLILFTAC 480

Query: 481  FSICAV-VRDPNISERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSN 540
             SIC V V DP I ERS+LL +   SEI+  AK NFNVLVGKLE+WHA+SIS I  VV N
Sbjct: 481  LSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVFN 540

Query: 541  FRGGAPSLIYLNQTEFLYKDV--DGQSLV-THQTLWEKE-IFLNVIEEGATKERRIEYAE 600
            FRGG P LI+LNQTEF Y DV  D Q LV +HQ +WE+E   +N +E  A K+R  +  E
Sbjct: 541  FRGG-PPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAME--AMKDREGQNKE 600

Query: 601  TEDHGVEEEEESLQEIEAIKEI-------EATNEEEEEQLLQEIESKTSYPEVGE-ENDE 660
                G E+EE++ +E   +KEI       E+ NEE EEQ  QEIE++T+  E  E ENDE
Sbjct: 601  ----GQEQEEDAQEEAIKVKEIGIQTVERESQNEEVEEQSFQEIEARTNDSENSEKENDE 660

Query: 661  ISAKSASENID---------EEDAQEKETEENYAVSSADFEILDQIEPSASNKIDEDDVH 720
             S +S  E I+         E   Q++E ++  A+   +  I      S + +++E+   
Sbjct: 661  ASEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQ 720

Query: 721  ENEIEENYEGSSADFEILDQIELSASDEIDEDDVHENERKENHEASSADFETLDQVGPEA 780
            + E + N +    + E  +  E S  + ++E+ V E   +    +SS+DF+  D++   A
Sbjct: 721  KTEAKAN-DQKDREEENDEASEESLLEIVEEESVQEKTVENFKASSSSDFKLHDEIEQAA 780

Query: 781  ETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQSSNGSN----IKILQGISGDFTHN- 840
             T ET+EE N      Q  + PV SP +E QSD +  NG      I+   GIS DFT N 
Sbjct: 781  ATEETQEETNTEF---QYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATGISRDFTQNT 840

Query: 841  --MMFALLLS--LIIPAGFIYAKKSGSK---PTIAAVEEQKQSLMKEEEKTNHS--PEEE 900
              ++ A+LL   LIIPAG IYA+KSGS+    T A  EEQ++  + +++KTN S   EEE
Sbjct: 841  AAIISAILLGLFLIIPAGLIYARKSGSRRTTSTAAIAEEQQEEPLLKDKKTNQSLVEEEE 900

Query: 901  EEAADDEHDDDMTGESCSSETSS-FQYSSMR---------------------------GA 905
            EE A D+ DDDM GE CSSETSS FQYSS+R                             
Sbjct: 901  EEDALDDDDDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRRENSRRESI 960

BLAST of MS012374 vs. ExPASy TrEMBL
Match: A0A6J1J2S7 (uncharacterized protein LOC111482876 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)

HSP 1 Score: 708.0 bits (1826), Expect = 5.1e-200
Identity = 546/1082 (50.46%), Positives = 661/1082 (61.09%), Query Frame = 0

Query: 1    MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
            MA+PSN+SSSPSMV GRTSP SRNSE+ N V RSFS NPF++PSI  + +SLNP+TPAN+
Sbjct: 1    MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60

Query: 61   PSDY-PRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAASK 120
            PSDY P+RNSVSREILFTSRDNE+KENGKDQ+PK  RVRSPTVGKS K+FMS TISAASK
Sbjct: 61   PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120

Query: 121  IAVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPE-------- 180
            IAVSPKKKILGDRNE VRSSLSFSG KSSSLNSVNP PE A+ A ESDTNP         
Sbjct: 121  IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPE-ASMAFESDTNPPMPLISNPK 180

Query: 181  --------------------------------------------IVPISNSSIAAPTPKS 240
                                                        IVPI+ S+IAA + KS
Sbjct: 181  STKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSKS 240

Query: 241  SKTVRFAGFEVISGSYDDAESTYR--YDTNTEVVTVAVETDLKPEIAPIS---------- 300
            SKTV F GFEVIS SYDD+ESTYR  +D N E VTVAVE D +PEI PIS          
Sbjct: 241  SKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTPE 300

Query: 301  SSKTVRFSDVEVISDLKNNFESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADPLM 360
            +SK +RFSD+E +S+  N  ES   + FTEE+DCVNLDPSF ISPVSSPM+AP+DADP++
Sbjct: 301  ASKIMRFSDLEAVSN--NALESSVNSNFTEEVDCVNLDPSFNISPVSSPMIAPMDADPII 360

Query: 361  PPYDPKTNYLSPRPQFLHYRPNRRIER-GNRLEEFFSSVNASESEFAEETESENSLKESD 420
             PYDPKTNYLSPRPQFLHY PNRRI R   R EE FS+        +EET+ E+  KESD
Sbjct: 361  TPYDPKTNYLSPRPQFLHYNPNRRINRPDGRFEELFST--------SEETDCEDPQKESD 420

Query: 421  ESSSNESEQGEEEVEEKDEERIHVSEQKETIEVKKS-----SGMFKISSLLFILSITFFS 480
            E SSNES+  EEE     EE + VSEQ  T EVKKS     S +FKISSLL IL     S
Sbjct: 421  EVSSNESQMKEEE----KEEEVDVSEQGPT-EVKKSSKPLLSRIFKISSLLLILFTACLS 480

Query: 481  ICAV-VRDPNISERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSNFR 540
            IC V V DP I ERS+LL +   SEI+  AK NFNVLVGKLE+WHA+SIS I  VV NFR
Sbjct: 481  ICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVFNFR 540

Query: 541  GGAPSLIYLNQTEFLYKDV--DGQSLV-THQTLWEKE-IFLNVIEEGATKERRIEYAETE 600
            GG P LI+LNQTEF Y DV  D Q LV +HQ +WE+E   +N +E  A K+R  +  E  
Sbjct: 541  GG-PPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAME--AMKDREGQNKE-- 600

Query: 601  DHGVEEEEESLQEIEAIKEI-------EATNEEEEEQLLQEIESKTSYPEVGE-ENDEIS 660
              G E+EE++ +E   +KEI       E+ NEE EEQ  QEIE++T+  E  E ENDE S
Sbjct: 601  --GQEQEEDAQEEAIKVKEIGIQTVERESQNEEVEEQSFQEIEARTNDSENSEKENDEAS 660

Query: 661  AKSASENID---------EEDAQEKETEENYAVSSADFEILDQIEPSASNK--------- 720
             +S  E I+         E   Q++E ++  A+   +  I + +E  + N+         
Sbjct: 661  EESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGI-ETVERESQNEEVEEEPFQK 720

Query: 721  --------------------------IDEDDVHENEIEENYEGSSADFEILDQIELSASD 780
                                      ++E+ V E  +E     SS+DF++  QIE +A+ 
Sbjct: 721  TEAKANDQKDREEENDEASEESLLEIVEEESVQEKTVENFKASSSSDFKLHGQIEQAAAT 780

Query: 781  EIDEDDVHE----NERKENHEASSADFETLDQVGPEAETGETEEENNDSLLQQQSNTAPV 840
                 ++ +     E     E ++A  ET  ++   A T ET+EE N      Q  + PV
Sbjct: 781  GETHYEIEQAAATGETHYEIEQAAATGETHYEIEQAAATEETQEETNTEF---QYQSPPV 840

Query: 841  FSPFAEPQSDFQSSNGSN----IKILQGISGDFTHN---MMFALLLS--LIIPAGFIYAK 900
             SP +E QSD +  NG      I+   GIS DFT N   ++ A+LL   LIIPAG IYA+
Sbjct: 841  SSPPSEHQSDVEEENGGKIVDLIRTATGISRDFTQNTAAIISAILLGLFLIIPAGLIYAR 900

Query: 901  KSGSK---PTIAAVEEQKQSLMKEEEKTNHS--PEEEEEAADDEHDDDMTGESCSSETSS 905
            KSGS+    T A  EEQ++  + +++KTN S   EEEEE A D+ DDDM GE CSSETSS
Sbjct: 901  KSGSRRTTSTAAIAEEQQEEPLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSETSS 960

BLAST of MS012374 vs. ExPASy TrEMBL
Match: A0A6J1JB65 (uncharacterized protein LOC111482876 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)

HSP 1 Score: 705.3 bits (1819), Expect = 3.3e-199
Identity = 542/1068 (50.75%), Positives = 658/1068 (61.61%), Query Frame = 0

Query: 1    MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
            MA+PSN+SSSPSMV GRTSP SRNSE+ N V RSFS NPF++PSI  + +SLNP+TPAN+
Sbjct: 1    MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60

Query: 61   PS--DY-PRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAA 120
            PS  DY P+RNSVSREILFTSRDNE+KENGKDQ+PK  RVRSPTVGKS K+FMS TISAA
Sbjct: 61   PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120

Query: 121  SKIAVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPE------ 180
            SKIAVSPKKKILGDRNE VRSSLSFSG KSSSLNSVNP PE A+ A ESDTNP       
Sbjct: 121  SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPE-ASMAFESDTNPPMPLISN 180

Query: 181  ----------------------------------------------IVPISNSSIAAPTP 240
                                                          IVPI+ S+IAA + 
Sbjct: 181  PKSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASS 240

Query: 241  KSSKTVRFAGFEVISGSYDDAESTYR--YDTNTEVVTVAVETDLKPEIAPIS-------- 300
            KSSKTV F GFEVIS SYDD+ESTYR  +D N E VTVAVE D +PEI PIS        
Sbjct: 241  KSSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVT 300

Query: 301  --SSKTVRFSDVEVISDLKNNFESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADP 360
              +SK +RFSD+E +S+  N  ES   + FTEE+DCVNLDPSF ISPVSSPM+AP+DADP
Sbjct: 301  PEASKIMRFSDLEAVSN--NALESSVNSNFTEEVDCVNLDPSFNISPVSSPMIAPMDADP 360

Query: 361  LMPPYDPKTNYLSPRPQFLHYRPNRRIER-GNRLEEFFSSVNASESEFAEETESENSLKE 420
            ++ PYDPKTNYLSPRPQFLHY PNRRI R   R EE FS+        +EET+ E+  KE
Sbjct: 361  IITPYDPKTNYLSPRPQFLHYNPNRRINRPDGRFEELFST--------SEETDCEDPQKE 420

Query: 421  SDESSSNESEQGEEEVEEKDEERIHVSEQKETIEVKKS-----SGMFKISSLLFILSITF 480
            SDE SSNES+  EEE     EE + VSEQ  T EVKKS     S +FKISSLL IL    
Sbjct: 421  SDEVSSNESQMKEEE----KEEEVDVSEQGPT-EVKKSSKPLLSRIFKISSLLLILFTAC 480

Query: 481  FSICAV-VRDPNISERSSLLMVEHPSEIYEFAKMNFNVLVGKLEVWHADSISSIFHVVSN 540
             SIC V V DP I ERS+LL +   SEI+  AK NFNVLVGKLE+WHA+SIS I  VV N
Sbjct: 481  LSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVFN 540

Query: 541  FRGGAPSLIYLNQTEFLYKDV--DGQSLV-THQTLWEKE-IFLNVIEEGATKERRIEYAE 600
            FRGG P LI+LNQTEF Y DV  D Q LV +HQ +WE+E   +N +E  A K+R  +  E
Sbjct: 541  FRGG-PPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAME--AMKDREGQNKE 600

Query: 601  TEDHGVEEEEESLQEIEAIKEI-------EATNEEEEEQLLQEIESKTSYPEVGE-ENDE 660
                G E+EE++ +E   +KEI       E+ NEE EEQ  QEIE++T+  E  E ENDE
Sbjct: 601  ----GQEQEEDAQEEAIKVKEIGIQTVERESQNEEVEEQSFQEIEARTNDSENSEKENDE 660

Query: 661  ISAKSASENID---------EEDAQEKETEENYAVSSADFEILDQIEPSASNKIDEDDVH 720
             S +S  E I+         E   Q++E ++  A+   +  I      S + +++E+   
Sbjct: 661  ASEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEEPFQ 720

Query: 721  ENEIEENYEGSSADFEILDQIELSASDEIDEDDVHENERKENHEASSADFETLDQVGPEA 780
            + E + N +    + E  +  E S  + ++E+ V E   +    +SS+DF+   Q+   A
Sbjct: 721  KTEAKAN-DQKDREEENDEASEESLLEIVEEESVQEKTVENFKASSSSDFKLHGQIEQAA 780

Query: 781  ETGE-----------------------TEEENNDSLLQQQSNTAPVFSPFAEPQSDFQSS 840
             TGE                       TEE   ++  + Q  + PV SP +E QSD +  
Sbjct: 781  ATGETHYEIEQAAATGETHYEIEQAAATEETQEETNTEFQYQSPPVSSPPSEHQSDVEEE 840

Query: 841  NGSN----IKILQGISGDFTHN---MMFALLLS--LIIPAGFIYAKKSGSK---PTIAAV 900
            NG      I+   GIS DFT N   ++ A+LL   LIIPAG IYA+KSGS+    T A  
Sbjct: 841  NGGKIVDLIRTATGISRDFTQNTAAIISAILLGLFLIIPAGLIYARKSGSRRTTSTAAIA 900

Query: 901  EEQKQSLMKEEEKTNHS--PEEEEEAADDEHDDDMTGESCSSETSS-FQYSSMR------ 905
            EEQ++  + +++KTN S   EEEEE A D+ DDDM GE CSSETSS FQYSS+R      
Sbjct: 901  EEQQEEPLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSETSSFFQYSSVREGETEA 960

BLAST of MS012374 vs. TAIR 10
Match: AT1G16630.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G16270.1); Has 10587 Blast hits to 5736 proteins in 617 species: Archae - 88; Bacteria - 963; Metazoa - 3686; Fungi - 820; Plants - 541; Viruses - 438; Other Eukaryotes - 4051 (source: NCBI BLink). )

HSP 1 Score: 139.0 bits (349), Expect = 1.8e-32
Identity = 268/964 (27.80%), Positives = 407/964 (42.22%), Query Frame = 0

Query: 8   SSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANSPSDYPRR 67
           SSSPSM   R +P  RNSE G+ +RRSF GNPF+                    +D  RR
Sbjct: 20  SSSPSM-PSRPNPKQRNSETGDLMRRSFRGNPFS--------------------ADPSRR 79

Query: 68  NSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAASKIAVSPKKK 127
           NS+ RE      +  +KEN  D++     V+ PT  K SKHFMSPTISA SKI  SP+KK
Sbjct: 80  NSIGRE-CSNRVEIGDKENQNDKDQIANVVKGPT--KGSKHFMSPTISAVSKINPSPRKK 139

Query: 128 ILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSIAAPTPKS 187
           IL D+NE  RS                            D +   V + +S         
Sbjct: 140 ILSDKNEVSRS---------------------------FDKSHHQVQVKSSV-------- 199

Query: 188 SKTVRFAGFEVISGSYDDAESTYRYDTNTEVVTVAVETDLKPEIAPISSSKTVRFSDVEV 247
                         S+ D  S    D + + + +     L+ E      S  +  SD + 
Sbjct: 200 --------------SFSDVISIIGEDKDVDQICIDETKQLREE-----ESHDITVSDFDE 259

Query: 248 ISDLKNNFESPAKNIFTEELDCVNLDPSFKISPV------SSPMVAPLDADPLMPPYDPK 307
           I + K+N  S                 SFKISP+      + P+    + DP++ PYDPK
Sbjct: 260 ILERKSNDNS-----------------SFKISPLPPYVPCTFPVFESHEVDPVVAPYDPK 319

Query: 308 TNYLSPRPQFLHYRPNRRIERGN----RLEE-FFSSVNASESEFAEETESENSLKESDES 367
            NYLSPRPQFLHY+PN +IE  +    +LEE F S  ++S+++ + E E E   ++ +E 
Sbjct: 320 KNYLSPRPQFLHYKPNPKIEHRSDECKQLEELFISESSSSDTDLSAEREEEG--QQEEEV 379

Query: 368 SSNESEQGEEEVEEKDEERIHVS------------------------------EQKETIE 427
           +S E     EE E+  EER+  +                              E++ET +
Sbjct: 380 ASQEGVVAVEEQEDDGEERLEAAEEILDVDGEERLEAVESDDEEEEVVVGESIEEEETHQ 439

Query: 428 VKKSSGMFKISSLLFILSITFFSICAVVRDPNISER----SSLLMVEHPSEIYEFAKMNF 487
           + K S   K S LL  +     +   +V     S++    S         EI   A  NF
Sbjct: 440 ISKQSRFSKTSMLLGWILALGVAYLLLVSSTTFSQQTITDSPFYQFNISPEIIMSASENF 499

Query: 488 NVLVGKLEVWHADSISSIFHVVSNFRGGAPSLIY-LNQTEFLYKDVDGQSLVTHQTLWEK 547
             L  KL +W   S   +  +VS+ R    S+ +  +    L +D      V   T  E 
Sbjct: 500 EQLGAKLRMWAESSFVYLDKLVSSLREEEGSVPFQFHNLTVLLEDKRLSDAVFQSTSVEI 559

Query: 548 EIFLNVIEEGATKERRIEYAETEDHGVEEEEESLQEIEAIKEIEATNEEEEEQLLQEIES 607
            +   +++   + E  IE         EEE E+  EI     +EA  EE++ ++ QE E 
Sbjct: 560 IVDGFIVD---SLEVDIEEVNVGHQEPEEESENSGEI----SLEAVYEEDDNEVEQENEE 619

Query: 608 KTSYPEVGEEND---EISAKSASENIDEEDAQEKETEENYAVSSADFEILDQIEPSASNK 667
                E+ +E D   EI   + +E    E   E  +EE +     D              
Sbjct: 620 GKVNLEIVDECDEQAEIKIATDTEVNGGERYSESLSEEGHGGQETDV------------- 679

Query: 668 IDEDDVHENEIEENYEGSSADFEILDQIELSASDEIDEDDVHENERKENHEASSADFETL 727
           ++  + +E   + N E + +D ++LD ++ +A        +  N++++   A+    +  
Sbjct: 680 VEGQEEYEENDQNNMEEAESDAQLLDDVQSAA--------ISSNQQEQTGVANVETVQEE 739

Query: 728 DQVGPEAETGETEEENNDSLLQQQSNTAPVFSPFAEPQSDFQSSNGSNIKILQGISGDFT 787
           + VG  A    +  E    +    +      S F E  +D  S +     IL  +SG   
Sbjct: 740 EGVGEIAGGSLSVSEEATDVEHDGNEVEEEESGFGEVVNDAGSED-----IL--LSGQKK 799

Query: 788 HNMMFALLLSLI--IPAGFIYAKKSGSKPTIAAVEEQKQSLMKEEEKTNHSP-------- 847
             ++F+ ++ ++  + AGF+ AKK  +KP +   E+ + + +   +   H P        
Sbjct: 800 VLVLFSTMMVILAAVAAGFLLAKKK-TKPVMLQHEDGEPTAISATKVVEHVPVENLIRER 843

Query: 848 -------EEEEEAADDEHDDDMTGESCSSETSSFQYSSMRGAAGAVKEPSEAHSH-SHGR 903
                  EEEEE  DD   +     S      SF +S  +       +  +   H S G 
Sbjct: 860 LSSLNFKEEEEEVGDDRKRE----VSSFPSEMSFSFSKNKPLHSCSNKKDDLKEHQSGGG 843

BLAST of MS012374 vs. TAIR 10
Match: AT2G16270.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G16630.1); Has 1844 Blast hits to 1256 proteins in 271 species: Archae - 6; Bacteria - 283; Metazoa - 434; Fungi - 153; Plants - 91; Viruses - 52; Other Eukaryotes - 825 (source: NCBI BLink). )

HSP 1 Score: 127.9 bits (320), Expect = 4.2e-29
Identity = 277/936 (29.59%), Positives = 386/936 (41.24%), Query Frame = 0

Query: 1   MAVPSNKSSSPS-MVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPAN 60
           MA P+NK+ S S  +  R +P  RNSE G+ +RRSF GNPF   S V            N
Sbjct: 1   MASPTNKNPSFSPPIPNRPNPKPRNSEAGDPLRRSFGGNPFPANSKV------------N 60

Query: 61  SPSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPTVGKSSKHFMSPTISAASK 120
            PSD  RRNS   +              K+   KP+++      K SK+FMSPTISA SK
Sbjct: 61  IPSDLTRRNSFGGD--------------KENETKPVQL----TPKGSKNFMSPTISAVSK 120

Query: 121 IAVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNP-NPEAAATAVESDTNPEIVPISNS 180
           I  SP+K++L D+NE  R   SFS  K   L   N  N   A + V              
Sbjct: 121 INASPRKRVLSDKNEMSR---SFSDVKGLILEDDNKRNHHRAKSCV-------------- 180

Query: 181 SIAAPTPKSSKTVRFAGFEVISGSYDDAESTYRYDTNTEVVTVAVETDLKPEIAPISSSK 240
                                  S+ D   T   D   + V               S   
Sbjct: 181 -----------------------SFSDVLHTICIDDEKKFVE--------------SHDM 240

Query: 241 TVRFSDVEVISDLKNNFESPAKNIFTEELDCVNLDPSFKIS-----PVSSPMVAPLDADP 300
           TV   D + + + K    S               DP F+IS     P +SP  A  + D 
Sbjct: 241 TVTDFDEKEVYENKGITYS---------------DPRFRISPRPSVPYTSPEFAACEVDT 300

Query: 301 LMPPYDPKTNYLSPRPQFLHYRPNRRIERG----NRLEEFFSSVNASESEFAEETESENS 360
           L+PPYDPK N+LSPRPQFLHY+PN RIE+      +LEE F S ++S+       ESE  
Sbjct: 301 LLPPYDPKKNFLSPRPQFLHYKPNPRIEKRFDECKQLEELFISESSSDDTELSVEESEEQ 360

Query: 361 LKESDES--SSNESEQGEEEVEEKDEERIHVSEQKETIEVKKSSGMFKISSLLFILSITF 420
            K+  E      E+E  E+   E DEE +  S ++ T +V K SG  K   L + L++  
Sbjct: 361 EKDGAEEVVVEEETEDVEQSEAESDEEMVCESVEETTSQVPKQSGSRKFKFLGWFLALAL 420

Query: 421 -FSICAVVRDPNISERSSLLMVEHPSEIYEFAKM-NFNVLVGKLEVWHADSISSIFHVVS 480
            + + +    P +  +SS      P EI EFAK  N + L  KL      S+  +  ++S
Sbjct: 421 GYLLVSATFSPLM--KSSFNEFHIPKEITEFAKANNLDQLSDKLWTLTESSLVYMDKLIS 480

Query: 481 NFRGGAPSLIYLNQTEFLYKDVDGQSLVTHQTLWEKEIFLNVIEEGATKERRIE--YAET 540
               G      L      Y   D  S V   T    EI    ++E +  E  +E      
Sbjct: 481 RLGRGNEEYSQLQFHNLTYTLED--STVFKPTC--VEIIQEPLQENSRSENSLEDGSVNE 540

Query: 541 EDHGVEEEEE------SLQEIEAIKEIEATNEEEEEQLLQEIESKTSYPEVGEENDEISA 600
           E+ G EE  E       L E++   +IE+ + E   + L E   + +  E+ E     S 
Sbjct: 541 EESGAEENSEVVCQFDELAEVKPSTDIESNDGERNLKALFEDGLELNIEELRE-----SE 600

Query: 601 KSASENIDEEDAQEKETEENYAVSSADFEILDQIEPSASNKIDEDDVHENEIEENYEGSS 660
            S  E ++ E   E+   E   ++  D E            I+     E+EI     GS 
Sbjct: 601 MSPEEKLETEKKLEETESEAIYINQPDVEFA---------AINVHQHIESEILVAESGSE 660

Query: 661 ADF-EILDQIELSASDEIDEDDVHENERKENHEASSADFETLDQVGPEAETGETEEENND 720
             F EI D + L              E    ++ +  D E+    G E   GE   E +D
Sbjct: 661 ESFGEIGDLLHL--------------EVGSYNDLAKGDAES----GSEEGFGEIAAETSD 720

Query: 721 SLLQQQSNTAPVFSPFAEPQSDFQSSNGSNIKILQGISGDFTHNMMFALLLSLIIPAGFI 780
            L               + +S  ++ N S   ++          ++ + +L L+  A F+
Sbjct: 721 DL-------------HLKVRSSNKAYNDSTKLMI----------VLSSTVLVLLAVASFV 750

Query: 781 YAKKSGSKPTIAAVEEQKQSLMKEEEKTNHSPEEE-------EEAADDEHDDDMTGESCS 840
           +AKK+     +AA +   +S M  E   +H PEE            ++E DD M      
Sbjct: 781 FAKKT---KLVAATKPAPESNM--ELNLSHVPEENLVKEKLFSLNFEEEVDDKM------ 750

Query: 841 SETSSFQYSSMRGAAGAVKEPSEAHSHSHGRKKRKNS------RRESLASSSDEISISAS 899
             ++SFQ  S        KEP      S G KK  N+      RRES+ASS+ E SI   
Sbjct: 841 --SNSFQKKS-----SCHKEP-----QSKGGKKNNNNSSSSKLRRESMASSASEYSIG-- 750

The following BLAST results are available for this feature:
Match Name        E-value     Identity   Description
XP_022149260.1    0.0e+00     99.45      uncharacterized protein LOC111017725 [Momordica charantia]
XP_038903440.1    4.0e-215    56.03      uncharacterized protein LOC120090026 [Benincasa hispida]
XP_022984665.1    1.7e-202    51.96      uncharacterized protein LOC111482876 isoform X5 [Cucurbita maxima]
XP_022984664.1    5.1e-202    51.87      uncharacterized protein LOC111482876 isoform X4 [Cucurbita maxima]
XP_022984662.1    1.1e-199    50.46      uncharacterized protein LOC111482876 isoform X2 [Cucurbita maxima]

Match Name        E-value     Identity   Description
A0A6J1D590        0.0e+00     99.45      uncharacterized protein LOC111017725 OS=Momordica charantia OX=3673 GN=LOC111017...
A0A6J1J980        8.4e-203    51.96      uncharacterized protein LOC111482876 isoform X5 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1JB72        2.5e-202    51.87      uncharacterized protein LOC111482876 isoform X4 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1J2S7        5.1e-200    50.46      uncharacterized protein LOC111482876 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1JB65        3.3e-199    50.75      uncharacterized protein LOC111482876 isoform X3 OS=Cucurbita maxima OX=3661 GN=L...

Match Name        E-value     Identity   Description
AT1G16630.1       1.8e-32     27.80      unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc...
AT2G16270.1       4.2e-29     29.59      unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source        Source Term     Source Description     Alignment
None       No IPR available   COILS         Coil            Coil                   coord: 534..561
None       No IPR available   COILS         Coil            Coil                   coord: 577..597
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 134..167
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 344..358
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 774..790
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 125..167
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 694..722
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 1..75
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 336..379
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 564..585
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 852..872
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 773..904
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 76..92
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 560..722
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 658..673
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 877..893
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction    coord: 1..111
None       No IPR available   PANTHER       PTHR34775       TRANSMEMBRANE PROTEIN  coord: 1..902
None       No IPR available   PANTHER       PTHR34775:SF4   TRANSMEMBRANE PROTEIN  coord: 1..902
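
Many of the mobidb-lite coordinate ranges listed above overlap or nest, so summing their lengths would overcount the predicted disordered residues. A minimal sketch, assuming inclusive 1-based coordinates and using a hypothetical helper `merge_intervals`, collapses them into non-overlapping regions first:

```python
import re

# mobidb-lite "coord: start..end" values copied from the InterPro table above.
coords = """
134..167 344..358 774..790 125..167 694..722 1..75 336..379 564..585
852..872 773..904 76..92 560..722 658..673 877..893 1..111
"""


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current region when the next interval touches it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


intervals = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", coords)]
regions = merge_intervals(intervals)
total_disordered = sum(end - start + 1 for start, end in regions)

print(regions)           # non-overlapping disordered regions
print(total_disordered)  # residues covered by at least one prediction
```

With the coordinates above this yields five regions (1..111, 125..167, 336..379, 560..722, 773..904) covering 493 residues.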

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
MS012374.1     MS012374.1    mRNA