Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGCTGCCGTCTAACACGTCGTCTTCTCCGTCGATGATCGCCGGAAGAACAAGTCCCAATTCTAGAAATTCGGAAATCGGCAACCCCTTCCGCCGGAGCTTCTCCGGTAACCTGTTTACGAAGCCCTCGATCGTCGCCAATCCGAGAAGCTTCAACCCCGTCACTCCGGCTAACAGTCCCTCGGGTTTGTTCGACTTTTCTCCTTGCTTAGTTTCTTTCTTGAAATTTTCGCGTGGTTCAAATCTGTTTGGTTGCCGAGAAAGTGGCAGAGAAAATTTAATAGCGTTTGAGCGGCTACCAAGACAAAAAATTTATACTCCATCTTCGCTATGATTTCGATAGTTATTTCGATTCTTCCACCGATTGTTTTGCGTTCTTCTAATCTGAAATTGCTTCCATGGCAAATATTTCTGCAATCTGCAATTGTTTCTTATTTGTTTCTTTGAATGATTTTCAGATTATCCACGAAGGAATTCTGTAAGCAGGGAAAGTATATTTACTTCTCGCGATAACGAGGAGAAAGAAAATGGGAAAGATCAGACTCCGAAACCCGTCCGAGTCCGTTCGCCGACAGTCGGAAAATCGACGAAGCACTTCATGTCTCCGACCATCTCCGCTGCCTCCAAGATCGCCGTCTCTCCGAAGAAGAAGATTCTGGGTGATAAGAACGAGCCAGTTCGGTCGTCTCTTTCATTTTCGGGCATGAAAAGCTCTTCAATCAACTCGGTGAATCCAAACCCAGAGGCAGCAACGGCAGTTCAAACCGATACGAACCCAGAAATCGTTTCGATTTCAAATTCCACCGTTGCAGCGCCAACTCCCAAATCATCAAAAACTGTGAGATTTGCTGATTTTGAGGTAATTTCTGGTTCGAATGATGATTCAGAGCCTCAGGTCGAGTCGGTTCTTACTGAAGAACCCAGAAATGGATACGTTACGAACCCAGAGGTCGCAATGGTAGTTGAAACAGATACGAAGCCAGAAATCGCTTCGATTTCAAATTCCGCCATTGCTGCAGCACCTCCCAAAGCTTCAAAAACTGTGAGATTTGCTGATTTCGAGGTAATCTCTGTCTCGAACAACAGGTTAGAGTCTCCGGCTAAGAATATTTTTACTGAAGAATTGGATTGTGTTAATCTCGACCCAAGTTTTAAGATCAGTCCTGTTTCTTCTCCAATGGTAGCACCTCTCGATGCAGATCCATTAATGCCTCCTTATGATCCCAAAACTAATTATCTATCGCCAAGGCCCCAGTTCCTCCATTACAGGCCAAACCGAAGAATCGAGCGATATCTCAGTAAGGAACCAGACGGCAGCAGATTTGAGGATTTCTTTTCCTCTGTCAATGCTTCCGAGTCCGAACTCAGTGAGGAGACCGGTTCTGAGTATTCACAAAAGGAATCCGATGAAGCTTCTTCCAATGAATCGCAGAAGGAAGAAGAAGAAGAAGAAGTGGAAGAAGAGGAGGAGGAGGAAAAAGAGCTTCATGTTTCTGAACAGTGCCCCATTGAAGTTAAAAAGTCAACTAAGTCGCAGTTTTCCCGGATATTCAAGATCAGTTCTCTGCTTTTGATCCTGGCGGTTGCTTGCTTTTCGATTTGTGTTGTTGTTGTCCATGATCCAAATATCTTAGACAGACCAAGCTTGTTAACAATGGAGGATCCATCTGAAATCAACAAGTTTGCAAAAATGAATTTTAATGGATTGGTTGAAAAACTTGAGCTTTGGCATGCTAATGCCATCTCTTTCGTTTCTGACGTGGTTTCCAACCTCAGAGGATCGCCGCCATTGATTTATCTTAACCAGTCTGACTTCTTCTACAAGGATGCCAGTGTGGATGGGCAGTGTCTTGTATCTCATCAGACCTTGTGGGAAGAAGAAAACAATTTGAATAATGTAATGGAAGCGATGAAGGAGAGAGAAATTGGCATTGAGCCTATTGAAAGAGAAGATCAGATTGAAGAAGAAGAAGACGAATCGTTGCAAGAGATCGAAGCCATGAAGGAGAGAGAAACTGACATCGAGCCTGTTGAGAGAGAAGCTCAGAATGAAGAAGAAGAAGAAGAAGAAGAGTCGTTGCAAGAGATTGAAGCCAAAGCCAATGATCCAAAAGACAGTGAAGAGAATGATGAGGCTTCTGCTTCAGCTTCTGAAGAATCATCGTTGCAAGAGATTGAAACCAATGATCCAGAAGACACTGAAGAGGACGATGAGACTTCTGCAAATTCAGCTTCAGCTTTTGAAGAATCATCGTTGCAAGAGATTGAAACCAGTGATCCAGAAGACACAGAAGAGGACGATGAGACTTCTGCAAATTCAGCTTCTGACAAACTCGATGAAGAAGACGCCCAAGAGAAAGAAACTGGAGAGAATTATAAAGAAGTATCATCATCAGATGAGTTTGAAACTCTTGATCAAAGTGAATCAGAAGCCGCGACAGAGAAAATCAAGGAAGATATGAACGATTCGCTGCAACAGAGCAACACAGAATCTCAATATCAGTCACCTCCTGCTTCTTCTCCTTCTGCTGAATTTCAATCTGATGTTGAAGAAAATGGCGGCAACATCGAGGTCACCGGAGCAGCAACGGAACATGGAATCTCAGGCGATTTTACACAGAATATGGCGGTTACGTCTGCGTTGCTGCTAGGTTTATCTCTTGTTATACCTGCAGGACTGGTTTATGCAAGAAGAGCGGGCTCAAAACGCTCAAAACCACCCATGGCGGCGGTGAAAGAGCAGCAGCCATTGTTGAAGGAGATGAAGACGAACCAGACGAGCCAGAGTCCAGTGGAAGAAGAAGAAGCGGCGGTGGACGATATGGCTGGAGAATCTTGCTCTTCTGAAATGAGCAGTTTCCAATACAGCAGCATGAGAGGAGAAGCAGAAGCAGCGAAGGGATCGAGTGAAGCTCATAACCGCCATAACCATGGGAGCAGGAAGACGAGGAAGAATTCAAGAAGAGAATCTCTGGCTTCTTCTTCTTTGGATGAAGATTCAGTGTCTGCTTCTCCATCTTATGGGAGTTTCACCACTTATGAGAGAATCCCAATCAAGCATGTGAGTATCCCAAAACTGAAAAATTCTTTGGTTTGA
mRNA sequence
ATGGCGCTGCCGTCTAACACGTCGTCTTCTCCGTCGATGATCGCCGGAAGAACAAGTCCCAATTCTAGAAATTCGGAAATCGGCAACCCCTTCCGCCGGAGCTTCTCCGGTAACCTGTTTACGAAGCCCTCGATCGTCGCCAATCCGAGAAGCTTCAACCCCGTCACTCCGGCTAACAGTCCCTCGGATTATCCACGAAGGAATTCTGTAAGCAGGGAAAGTATATTTACTTCTCGCGATAACGAGGAGAAAGAAAATGGGAAAGATCAGACTCCGAAACCCGTCCGAGTCCGTTCGCCGACAGTCGGAAAATCGACGAAGCACTTCATGTCTCCGACCATCTCCGCTGCCTCCAAGATCGCCGTCTCTCCGAAGAAGAAGATTCTGGGTGATAAGAACGAGCCAGTTCGGTCGTCTCTTTCATTTTCGGGCATGAAAAGCTCTTCAATCAACTCGGTGAATCCAAACCCAGAGGCAGCAACGGCAGTTCAAACCGATACGAACCCAGAAATCGTTTCGATTTCAAATTCCACCGTTGCAGCGCCAACTCCCAAATCATCAAAAACTGTGAGATTTGCTGATTTTGAGGTAATTTCTGGTTCGAATGATGATTCAGAGCCTCAGGTCGAGTCGGTTCTTACTGAAGAACCCAGAAATGGATACGTTACGAACCCAGAGGTCGCAATGGTAGTTGAAACAGATACGAAGCCAGAAATCGCTTCGATTTCAAATTCCGCCATTGCTGCAGCACCTCCCAAAGCTTCAAAAACTGTGAGATTTGCTGATTTCGAGGTAATCTCTGTCTCGAACAACAGGTTAGAGTCTCCGGCTAAGAATATTTTTACTGAAGAATTGGATTGTGTTAATCTCGACCCAAGTTTTAAGATCAGTCCTGTTTCTTCTCCAATGGTAGCACCTCTCGATGCAGATCCATTAATGCCTCCTTATGATCCCAAAACTAATTATCTATCGCCAAGGCCCCAGTTCCTCCATTACAGGCCAAACCGAAGAATCGAGCGATATCTCAGTAAGGAACCAGACGGCAGCAGATTTGAGGATTTCTTTTCCTCTGTCAATGCTTCCGAGTCCGAACTCAGTGAGGAGACCGGTTCTGAGTATTCACAAAAGGAATCCGATGAAGCTTCTTCCAATGAATCGCAGAAGGAAGAAGAAGAAGAAGAAGTGGAAGAAGAGGAGGAGGAGGAAAAAGAGCTTCATGTTTCTGAACAGTGCCCCATTGAAGTTAAAAAGTCAACTAAGTCGCAGTTTTCCCGGATATTCAAGATCAGTTCTCTGCTTTTGATCCTGGCGGTTGCTTGCTTTTCGATTTGTGTTGTTGTTGTCCATGATCCAAATATCTTAGACAGACCAAGCTTGTTAACAATGGAGGATCCATCTGAAATCAACAAGTTTGCAAAAATGAATTTTAATGGATTGGTTGAAAAACTTGAGCTTTGGCATGCTAATGCCATCTCTTTCGTTTCTGACGTGGTTTCCAACCTCAGAGGATCGCCGCCATTGATTTATCTTAACCAGTCTGACTTCTTCTACAAGGATGCCAGTGTGGATGGGCAGTGTCTTGTATCTCATCAGACCTTGTGGGAAGAAGAAAACAATTTGAATAATGTAATGGAAGCGATGAAGGAGAGAGAAATTGGCATTGAGCCTATTGAAAGAGAAGATCAGATTGAAGAAGAAGAAGACGAATCGTTGCAAGAGATCGAAGCCATGAAGGAGAGAGAAACTGACATCGAGCCTGTTGAGAGAGAAGCTCAGAATGAAGAAGAAGAAGAAGAAGAAGAGTCGTTGCAAGAGATTGAAGCCAAAGCCAATGATCCAAAAGACAGTGAAGAGAATGATGAGGCTTCTGCTTCAGCTTCTGAAGAATCATCGTTGCAAGAGATTGAAACCAATGATCCAGAAGACACTGAAGAGGACGATGAGACTTCTGCAAATTCAGCTTCAGCTTTTGAAGAATCATCGTTGCAAGAGATTGAAACCAGTGATCCAGAAGACACAGAAGAGGACGATGAGACTTCTGCAAATTCAGCTTCTGACAAACTCGATGAAGAAGACGCCCAAGAGAAAGAAACTGGAGAGAATTATAAAGAAGTATCATCATCAGATGAGTTTGAAACTCTTGATCAAAGTGAATCAGAAGCCGCGACAGAGAAAATCAAGGAAGATATGAACGATTCGCTGCAACAGAGCAACACAGAATCTCAATATCAGTCACCTCCTGCTTCTTCTCCTTCTGCTGAATTTCAATCTGATGTTGAAGAAAATGGCGGCAACATCGAGGTCACCGGAGCAGCAACGGAACATGGAATCTCAGGCGATTTTACACAGAATATGGCGGTTACGTCTGCGTTGCTGCTAGGTTTATCTCTTGTTATACCTGCAGGACTGGTTTATGCAAGAAGAGCGGGCTCAAAACGCTCAAAACCACCCATGGCGGCGGTGAAAGAGCAGCAGCCATTGTTGAAGGAGATGAAGACGAACCAGACGAGCCAGAGTCCAGTGGAAGAAGAAGAAGCGGCGGTGGACGATATGGCTGGAGAATCTTGCTCTTCTGAAATGAGCAGTTTCCAATACAGCAGCATGAGAGGAGAAGCAGAAGCAGCGAAGGGATCGAGTGAAGCTCATAACCGCCATAACCATGGGAGCAGGAAGACGAGGAAGAATTCAAGAAGAGAATCTCTGGCTTCTTCTTCTTTGGATGAAGATTCAGTGTCTGCTTCTCCATCTTATGGGAGTTTCACCACTTATGAGAGAATCCCAATCAAGCATGTGAGTATCCCAAAACTGAAAAATTCTTTGGTTTGA
Coding sequence (CDS)
ATGGCGCTGCCGTCTAACACGTCGTCTTCTCCGTCGATGATCGCCGGAAGAACAAGTCCCAATTCTAGAAATTCGGAAATCGGCAACCCCTTCCGCCGGAGCTTCTCCGGTAACCTGTTTACGAAGCCCTCGATCGTCGCCAATCCGAGAAGCTTCAACCCCGTCACTCCGGCTAACAGTCCCTCGGATTATCCACGAAGGAATTCTGTAAGCAGGGAAAGTATATTTACTTCTCGCGATAACGAGGAGAAAGAAAATGGGAAAGATCAGACTCCGAAACCCGTCCGAGTCCGTTCGCCGACAGTCGGAAAATCGACGAAGCACTTCATGTCTCCGACCATCTCCGCTGCCTCCAAGATCGCCGTCTCTCCGAAGAAGAAGATTCTGGGTGATAAGAACGAGCCAGTTCGGTCGTCTCTTTCATTTTCGGGCATGAAAAGCTCTTCAATCAACTCGGTGAATCCAAACCCAGAGGCAGCAACGGCAGTTCAAACCGATACGAACCCAGAAATCGTTTCGATTTCAAATTCCACCGTTGCAGCGCCAACTCCCAAATCATCAAAAACTGTGAGATTTGCTGATTTTGAGGTAATTTCTGGTTCGAATGATGATTCAGAGCCTCAGGTCGAGTCGGTTCTTACTGAAGAACCCAGAAATGGATACGTTACGAACCCAGAGGTCGCAATGGTAGTTGAAACAGATACGAAGCCAGAAATCGCTTCGATTTCAAATTCCGCCATTGCTGCAGCACCTCCCAAAGCTTCAAAAACTGTGAGATTTGCTGATTTCGAGGTAATCTCTGTCTCGAACAACAGGTTAGAGTCTCCGGCTAAGAATATTTTTACTGAAGAATTGGATTGTGTTAATCTCGACCCAAGTTTTAAGATCAGTCCTGTTTCTTCTCCAATGGTAGCACCTCTCGATGCAGATCCATTAATGCCTCCTTATGATCCCAAAACTAATTATCTATCGCCAAGGCCCCAGTTCCTCCATTACAGGCCAAACCGAAGAATCGAGCGATATCTCAGTAAGGAACCAGACGGCAGCAGATTTGAGGATTTCTTTTCCTCTGTCAATGCTTCCGAGTCCGAACTCAGTGAGGAGACCGGTTCTGAGTATTCACAAAAGGAATCCGATGAAGCTTCTTCCAATGAATCGCAGAAGGAAGAAGAAGAAGAAGAAGTGGAAGAAGAGGAGGAGGAGGAAAAAGAGCTTCATGTTTCTGAACAGTGCCCCATTGAAGTTAAAAAGTCAACTAAGTCGCAGTTTTCCCGGATATTCAAGATCAGTTCTCTGCTTTTGATCCTGGCGGTTGCTTGCTTTTCGATTTGTGTTGTTGTTGTCCATGATCCAAATATCTTAGACAGACCAAGCTTGTTAACAATGGAGGATCCATCTGAAATCAACAAGTTTGCAAAAATGAATTTTAATGGATTGGTTGAAAAACTTGAGCTTTGGCATGCTAATGCCATCTCTTTCGTTTCTGACGTGGTTTCCAACCTCAGAGGATCGCCGCCATTGATTTATCTTAACCAGTCTGACTTCTTCTACAAGGATGCCAGTGTGGATGGGCAGTGTCTTGTATCTCATCAGACCTTGTGGGAAGAAGAAAACAATTTGAATAATGTAATGGAAGCGATGAAGGAGAGAGAAATTGGCATTGAGCCTATTGAAAGAGAAGATCAGATTGAAGAAGAAGAAGACGAATCGTTGCAAGAGATCGAAGCCATGAAGGAGAGAGAAACTGACATCGAGCCTGTTGAGAGAGAAGCTCAGAATGAAGAAGAAGAAGAAGAAGAAGAGTCGTTGCAAGAGATTGAAGCCAAAGCCAATGATCCAAAAGACAGTGAAGAGAATGATGAGGCTTCTGCTTCAGCTTCTGAAGAATCATCGTTGCAAGAGATTGAAACCAATGATCCAGAAGACACTGAAGAGGACGATGAGACTTCTGCAAATTCAGCTTCAGCTTTTGAAGAATCATCGTTGCAAGAGATTGAAACCAGTGATCCAGAAGACACAGAAGAGGACGATGAGACTTCTGCAAATTCAGCTTCTGACAAACTCGATGAAGAAGACGCCCAAGAGAAAGAAACTGGAGAGAATTATAAAGAAGTATCATCATCAGATGAGTTTGAAACTCTTGATCAAAGTGAATCAGAAGCCGCGACAGAGAAAATCAAGGAAGATATGAACGATTCGCTGCAACAGAGCAACACAGAATCTCAATATCAGTCACCTCCTGCTTCTTCTCCTTCTGCTGAATTTCAATCTGATGTTGAAGAAAATGGCGGCAACATCGAGGTCACCGGAGCAGCAACGGAACATGGAATCTCAGGCGATTTTACACAGAATATGGCGGTTACGTCTGCGTTGCTGCTAGGTTTATCTCTTGTTATACCTGCAGGACTGGTTTATGCAAGAAGAGCGGGCTCAAAACGCTCAAAACCACCCATGGCGGCGGTGAAAGAGCAGCAGCCATTGTTGAAGGAGATGAAGACGAACCAGACGAGCCAGAGTCCAGTGGAAGAAGAAGAAGCGGCGGTGGACGATATGGCTGGAGAATCTTGCTCTTCTGAAATGAGCAGTTTCCAATACAGCAGCATGAGAGGAGAAGCAGAAGCAGCGAAGGGATCGAGTGAAGCTCATAACCGCCATAACCATGGGAGCAGGAAGACGAGGAAGAATTCAAGAAGAGAATCTCTGGCTTCTTCTTCTTTGGATGAAGATTCAGTGTCTGCTTCTCCATCTTATGGGAGTTTCACCACTTATGAGAGAATCCCAATCAAGCATGTGAGTATCCCAAAACTGAAAAATTCTTTGGTTTGA
Protein sequence
MALPSNTSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANSPSDYPRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAASKIAVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPEAATAVQTDTNPEIVSISNSTVAAPTPKSSKTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPEVAMVVETDTKPEIASISNSAIAAAPPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDFFSSVNASESELSEETGSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQCPIEVKKSTKSQFSRIFKISSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAKMNFNGLVEKLELWHANAISFVSDVVSNLRGSPPLIYLNQSDFFYKDASVDGQCLVSHQTLWEEENNLNNVMEAMKEREIGIEPIEREDQIEEEEDESLQEIEAMKERETDIEPVEREAQNEEEEEEEESLQEIEAKANDPKDSEENDEASASASEESSLQEIETNDPEDTEEDDETSANSASAFEESSLQEIETSDPEDTEEDDETSANSASDKLDEEDAQEKETGENYKEVSSSDEFETLDQSESEAATEKIKEDMNDSLQQSNTESQYQSPPASSPSAEFQSDVEENGGNIEVTGAATEHGISGDFTQNMAVTSALLLGLSLVIPAGLVYARRAGSKRSKPPMAAVKEQQPLLKEMKTNQTSQSPVEEEEAAVDDMAGESCSSEMSSFQYSSMRGEAEAAKGSSEAHNRHNHGSRKTRKNSRRESLASSSLDEDSVSASPSYGSFTTYERIPIKHVSIPKLKNSLV
Homology
BLAST of Sgr029912.1 vs. NCBI nr
Match:
XP_022149260.1 (uncharacterized protein LOC111017725 [Momordica charantia])
HSP 1 Score: 851.7 bits (2199), Expect = 6.2e-243
Identity = 607/958 (63.36%), Postives = 699/958 (72.96%), Query Frame = 0
Query: 1 MALPSNTSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANS 60
MA+PSN SSSPSM+AGRTSPNSRNSE+GN RRSFSGN FT+PSIVANPRS NPVTPANS
Sbjct: 1 MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
Query: 61 PSDYPRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAASKI 120
PSDYPRRNSVSRE +FTSRDNEEKENGKDQ PKP+RVRSP+VGKS+KHFMSPTISAASKI
Sbjct: 61 PSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPSVGKSSKHFMSPTISAASKI 120
Query: 121 AVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPE-AATAVQTDTNPEIVSISNSTV 180
AVSPKKKILGD+NE VRSSLSFSG KSSS+NSVNPNPE AATAV++DTNPEIV ISNS++
Sbjct: 121 AVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSI 180
Query: 181 AAPTPKSSKTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPEVAMV-VETDTKPE 240
AAPTPKSSKTVRFA FEVISGS DD+E Y TN EV V VETD KPE
Sbjct: 181 AAPTPKSSKTVRFAGFEVISGSYDDAESTYR----------YDTNTEVVTVAVETDLKPE 240
Query: 241 IASISNSAIAAAPPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKISP 300
IA IS +SKTVRF+D EVIS N ES KNIFTEELDCVNLDPSFKISP
Sbjct: 241 IAPIS----------SSKTVRFSDVEVISDLKNNFESADKNIFTEELDCVNLDPSFKISP 300
Query: 301 VSSPMVAPLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDFFSSV 360
VSSPMVAPLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIER G+R E+FFSSV
Sbjct: 301 VSSPMVAPLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIER-------GNRLEEFFSSV 360
Query: 361 NASESELSEETGSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQC-PIEVKK 420
NASESE +EET SE S KESDE+SSNES E+ EEEVEE++EE +HVSEQ IEVKK
Sbjct: 361 NASESEFAEETESENSLKESDESSSNES--EQGEEEVEEKDEE--RIHVSEQKEXIEVKK 420
Query: 421 STKSQFSRIFKISSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAKMNFN 480
S S +FKISSLL IL++ FSIC VV DPNI +R SLL +E PSEI +FAKMNFN
Sbjct: 421 S-----SGMFKISSLLFILSITFFSIC-AVVRDPNISERSSLLMVEHPSEIYEFAKMNFN 480
Query: 481 GLVEKLELWHANAISFVSDVVSNLR-GSPPLIYLNQSDFFYKDASVDGQCLVSHQTLWEE 540
LV KLE+WHA++IS + VVSN R G+P LIYLNQ++F YKD VDGQ LV+HQTLWE+
Sbjct: 481 VLVGKLEVWHADSISSIFHVVSNFRGGAPSLIYLNQTEFLYKD--VDGQSLVTHQTLWEK 540
Query: 541 ENNLNNVME-AMKEREIGIEPIEREDQIEEEEDESLQEIEAMKERETDIEPVEREAQNEE 600
EN LN + E A KER IE E ED EEE+ESLQEIEA+KE E
Sbjct: 541 ENFLNVIEEGATKERR--IEYAETEDHGVEEEEESLQEIEAIKEIEA-----------TN 600
Query: 601 EEEEEESLQEIEAKANDPKDSEENDEASASASEESSLQEIETNDPEDTEEDDETSANSA- 660
EEEEE+ LQEIE+K + P+ EENDE SA ++ E+ I+ D ++ E ++ + +SA
Sbjct: 601 EEEEEQLLQEIESKTSYPEVGEENDEISAKSASEN----IDEEDAQEKETEENYAVSSAD 660
Query: 661 ----SAFEESSLQEIETSDPEDTEEDDE---TSAN---------SASDKLDEEDAQEKET 720
E S+ +I+ D + E ++ +SA+ SASD++DE+D E E
Sbjct: 661 FEILDQIEPSASNKIDEDDVHENEIEENYEGSSADFEILDQIELSASDEIDEDDVHENER 720
Query: 721 GENYKEVSSSDEFETLDQSESEAATEKIKEDMNDSL--QQSNTESQYQSPPASSPSAEFQ 780
EN++ +SS +FETLDQ EA T + +E+ NDSL QQSNT P SP AE Q
Sbjct: 721 KENHE--ASSADFETLDQVGPEAETGETEEENNDSLLQQQSNT------APVFSPFAEPQ 780
Query: 781 SDVE-ENGGNIEVTGAATEHGISGDFTQNMAVTSALLLGLSLVIPAGLVYARRAGSKRSK 840
SD + NG NI++ GISGDFT NM ALL LSL+IPAG +YA+++G SK
Sbjct: 781 SDFQSSNGSNIKIL-----QGISGDFTHNMMF--ALL--LSLIIPAGFIYAKKSG---SK 840
Query: 841 PPMAAVKEQ-QPLLKEMKTNQTSQSPVEEEEAA----VDDMAGESCSSEMSSFQYSSMRG 900
P +AAV+EQ Q L+KE +T+ SP EEEEAA DDM GESCSSE SSFQYSSMRG
Sbjct: 841 PTIAAVEEQKQSLMKE--EEKTNHSPEEEEEAADDEHDDDMTGESCSSETSSFQYSSMRG 877
Query: 901 EAEAAKGSSEAHNRHNHGSRKTRKNSRRESLASSSLDEDSVSA--SPSYGSFTTYERI 927
A A K SEAH+ H+HG RK RKNSRRESLASSS DE S+SA SPSYGSFTTYE+I
Sbjct: 901 AAGAVKEPSEAHS-HSHG-RKKRKNSRRESLASSS-DEISISASPSPSYGSFTTYEKI 877
BLAST of Sgr029912.1 vs. NCBI nr
Match:
XP_022984664.1 (uncharacterized protein LOC111482876 isoform X4 [Cucurbita maxima])
HSP 1 Score: 794.3 bits (2050), Expect = 1.2e-225
Identity = 583/1050 (55.52%), Postives = 687/1050 (65.43%), Query Frame = 0
Query: 1 MALPSNTSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANS 60
MALPSN SSSPSM+ GRTSP SRNSEI NP RSFS N F+KPSI + +S NP+TPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DY-PRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DY P+RNSVSRE +FTSRDNE+KENGKDQ+PK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPEAATAVQTDTNPE------- 180
SKIAVSPKKKILGD+NEPVRSSLSFSGMKSSS+NSVNP PEA+ A ++DTNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 ---------------------------------------------IVSISNSTVAAPTPK 240
IV I+ S +AA + K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSKTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPE-VAMVVETDTKPEIASISN 300
SSKTV F FEVIS S DDSE R+G+ NPE V + VE D +PEI IS+
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTY--------RHGHDPNPEAVTVAVEADAEPEIGPISD 300
Query: 301 SAIAAAPPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKISPVSSPMV 360
S IAA P+ASK +RF+D E +VSNN LES + FTEE+DCVNLDPSF ISPVSSPM+
Sbjct: 301 SDIAAVTPEASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPMI 360
Query: 361 APLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDFFSSVNASESE 420
AP+DADP++ PYDPKTNYLSPRPQFLHY PNRRI R PDG RFE+ FS+
Sbjct: 361 APMDADPIITPYDPKTNYLSPRPQFLHYNPNRRINR-----PDG-RFEELFST------- 420
Query: 421 LSEETGSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQCPIEVKKSTKSQFS 480
SEET E QKESDE SSNESQ +EEE+E+E+ VSEQ P EVKKS+K S
Sbjct: 421 -SEETDCEDPQKESDEVSSNESQM--------KEEEKEEEVDVSEQGPTEVKKSSKPLLS 480
Query: 481 RIFKISSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAKMNFNGLVEKLE 540
RIFKISSLLLIL AC SICVV VHDP I +R +LLTM D SEI AK NFN LV KLE
Sbjct: 481 RIFKISSLLLILFTACLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLE 540
Query: 541 LWHANAISFVSDVVSNLRGSPPLIYLNQSDFFYKDASVDGQCLV-SHQTLWEEENNLNNV 600
+WHAN+ISF+SDVV N RG PPLI+LNQ++FFY D + D QCLV SHQ +WEEENNL N
Sbjct: 541 IWHANSISFISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNA 600
Query: 601 MEAMKEREIGIEPIEREDQIEEEEDESLQEIEAMKERETDIEPVEREAQNEEEEEEEESL 660
MEAMK+R E +++ +E++E QE EA+K +E I+ VERE+QN EE EE+S
Sbjct: 601 MEAMKDR-------EGQNKEGQEQEEDAQE-EAIKVKEIGIQTVERESQN--EEVEEQSF 660
Query: 661 QEIEAKANDPKDSE-ENDEASASASEESSLQEI---------------ETNDPEDTEEDD 720
QEIEA+ ND ++SE ENDEAS E SLQEI + + +DTE
Sbjct: 661 QEIEARTNDSENSEKENDEAS-----EESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMK 720
Query: 721 ETSANSASAFEESSLQEIE----------TSDPEDTEED-DETSANSASDKLDEEDAQEK 780
E + ES +E+E +D +D EE+ DE S S + ++EE QEK
Sbjct: 721 EREIGIETVERESQNEEVEEEPFQKTEAKANDQKDREEENDEASEESLLEIVEEESVQEK 780
Query: 781 ETGENYKEVSSSDEFETLDQSESEAATEKIKEDMNDSLQQSNTESQYQSPPASSPSAEFQ 840
T EN+K SSSD F+ D+ E AATE+ +E ++NTE QYQSPP SSP +E Q
Sbjct: 781 -TVENFKASSSSD-FKLHDEIEQAAATEETQE-------ETNTEFQYQSPPVSSPPSEHQ 840
Query: 841 SDVEENGGNIEVTGAATEHGISGDFTQN-MAVTSALLLGLSLVIPAGLVYARRAGSKRSK 900
SDVEE G V T GIS DFTQN A+ SA+LLGL L+IPAGL+YAR++GS+R+
Sbjct: 841 SDVEEENGGKIVDLIRTATGISRDFTQNTAAIISAILLGLFLIIPAGLIYARKSGSRRTT 900
Query: 901 PPMAAVKEQQ--PLLKEMKTNQTSQSPVEEEEAA---VDDMAGESCSSEMSS-FQYSSMR 931
A +EQQ PLLK+ KTNQ+ EEE+A DDMAGE CSSE SS FQYSS+R
Sbjct: 901 STAAIAEEQQEEPLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSETSSFFQYSSVR 960
BLAST of Sgr029912.1 vs. NCBI nr
Match:
XP_038903440.1 (uncharacterized protein LOC120090026 [Benincasa hispida])
HSP 1 Score: 794.3 bits (2050), Expect = 1.2e-225
Identity = 562/960 (58.54%), Postives = 671/960 (69.90%), Query Frame = 0
Query: 1 MALPSNTSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANS 60
MALPSN SSSP+M++GRTSPNSRNSEI NP RRSFSGN F+KPSIVANPR NP+TPANS
Sbjct: 1 MALPSNRSSSPAMLSGRTSPNSRNSEISNPVRRSFSGNPFSKPSIVANPRGLNPITPANS 60
Query: 61 PSDYPRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAASKI 120
PSDYPRRNSVSRE+ FTSR+ +EKEN KDQ+PKPVRVRSP VGKS+KHFMSPTISAASKI
Sbjct: 61 PSDYPRRNSVSRENSFTSRNIQEKENEKDQSPKPVRVRSPMVGKSSKHFMSPTISAASKI 120
Query: 121 AVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPEAATAVQTDTNPEIVSISNSTVA 180
A SPKKKILGD+NEPVRSS SFSGMKSSS+NSVN + +++ +++DTNP+I +S+S
Sbjct: 121 AASPKKKILGDQNEPVRSSNSFSGMKSSSLNSVNQSSQSSKTLESDTNPQIPPVSSS--- 180
Query: 181 APTPKSSKTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPEV--AMVVETDTKPE 240
KS+KTVRF FEVIS S+DDSE Y NPEV M VE D K E
Sbjct: 181 ----KSTKTVRFGGFEVISDSHDDSETTYR----------YDLNPEVVATMAVEADMKSE 240
Query: 241 IASISNSAIAAAPPKASKTVRFADFEVISVSNNRLES-PAKNIFTEELDCVNLDPSFKIS 300
+A +S SA A AP ++S +DFEVIS+SN L+S PA++ E++DCVNLDPSFKIS
Sbjct: 241 MAPVSKSASAVAPLESSN----SDFEVISISNKDLDSPPARSNLIEDVDCVNLDPSFKIS 300
Query: 301 PVSSPMVAPLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDFFSS 360
P+SSPM+APLD DP +PPYDPKTNYLSPRPQFLHYRPNRRI RY EP+G E FS
Sbjct: 301 PISSPMIAPLDDDPSIPPYDPKTNYLSPRPQFLHYRPNRRINRY---EPEGRLEEKLFSF 360
Query: 361 VNASESELSEETGSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQCPIEVKK 420
N S+SE EET SE S KESDEASSNES E+EEEE+EE+E++VSEQ P E+K+
Sbjct: 361 ANVSQSESMEETDSEDSPKESDEASSNES-------EMEEEEQEEEEINVSEQSPTEMKQ 420
Query: 421 STKSQFSRIFKISSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAKMNFN 480
S+K FS IFK SSLLLIL ACFSICVV VHDPNI RPS LTMED SEI FAK NFN
Sbjct: 421 SSKLHFSSIFKTSSLLLILFTACFSICVVNVHDPNIFQRPSSLTMEDESEIFGFAKTNFN 480
Query: 481 GLVEKLELWHANAISFVSDVVSNLRGSPPLI-YLNQSDFFYKDASVDGQCLV-SHQTLWE 540
LV KLE+WH +ISF+SDVV N RG PLI Y NQ++FF + +++ QCLV SHQT+WE
Sbjct: 481 VLVGKLEVWHVKSISFISDVVFNFRGGLPLIHYENQTEFFNEYFNMNEQCLVLSHQTVWE 540
Query: 541 EENNLNNVMEAMKEREIGI--EPIEREDQIEEEEDESLQEIEAMKERETDIEPVEREAQN 600
EENNL NV+EAMK+REI I EPIE+E Q +EEE QE E + RE IE ERE+
Sbjct: 541 EENNL-NVIEAMKDREIDIFEEPIEKECQNKEEE----QEAEEL-PREIGIETDERES-- 600
Query: 601 EEEEEEEESLQEIEA-KANDPKDSEEND---EASASASEESSLQEIETNDPEDTEEDD-- 660
E EEEE QEIEA K + ++ E+ D E A E ++ +E + E +D
Sbjct: 601 -EIVEEEELFQEIEAMKVREEQEQEQEDVLQEIEAIKMREIFVENVERESQNEEELEDVS 660
Query: 661 --ETSANSASAFEESSLQEIETSDPEDTEEDDETSANSASDKLDEEDAQEKETGENYKEV 720
ET AN+ + + Q E +E E S NSASDKL EE+ +++ EN+K
Sbjct: 661 FQETEANANEEENDEAFQ-------ESLQETIEESENSASDKLTEEEYVQEKPEENFK-F 720
Query: 721 SSSDEFETLDQSESEAATEKIKEDMNDSLQQSNTESQYQSPPASSPSAEFQSDVEENGGN 780
SS + + DQ E AA ++ ++ NTE QYQ PP S P+AE QSD EE G
Sbjct: 721 SSLSDLKFHDQIEQAAAAA-----TGETEEEKNTEFQYQLPPVSPPAAEHQSDFEEKNGG 780
Query: 781 IEVTGAATEHGISGDFTQNMA-VTSALLLGLSLVIPAGLVYARRAGSKRSKPPMAAV--- 840
+ T++GIS DFTQN A + SA+LLG ++ GL+YAR++GSK S MAA+
Sbjct: 781 KIIDLIRTKNGISQDFTQNTAIIISAILLGTLII---GLIYARQSGSKPSS-SMAAIAEE 840
Query: 841 -KEQQPLLKEMKTNQTSQSPVEEEEAA-------VDDMAGESCSSEMSSFQYSSMRGE-A 900
+E+QPL+KE K N QS VEEEE DDM GE CSSE SSFQYSSMR E
Sbjct: 841 EEEKQPLVKEEKMN---QSLVEEEEVVEEEGHEEEDDMGGEFCSSETSSFQYSSMREEDT 899
Query: 901 EAAKGSSEAHNRHNHGSRKTRKNSRRESLASSSLDEDSV--SASPSYGSFTTYERIPIKH 931
+A K SSE + H+HG +K RKNSRRES+ASSSLDE SV SASPSYGSFTTYE+IPIKH
Sbjct: 901 KAGKRSSEVQS-HSHGRKKMRKNSRRESMASSSLDEYSVSTSASPSYGSFTTYEKIPIKH 899
BLAST of Sgr029912.1 vs. NCBI nr
Match:
XP_022984665.1 (uncharacterized protein LOC111482876 isoform X5 [Cucurbita maxima])
HSP 1 Score: 790.0 bits (2039), Expect = 2.2e-224
Identity = 582/1050 (55.43%), Postives = 685/1050 (65.24%), Query Frame = 0
Query: 1 MALPSNTSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANS 60
MALPSN SSSPSM+ GRTSP SRNSEI NP RSFS N F+KPSI + +S NP+TPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DY-PRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DY P+RNSVSRE +FTSRDNE+KENGKDQ+PK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPEAATAVQTDTNPE------- 180
SKIAVSPKKKILGD+NEPVRSSLSFSGMKSSS+NSVNP PEA+ A ++DTNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 ---------------------------------------------IVSISNSTVAAPTPK 240
IV I+ S +AA + K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSKTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPE-VAMVVETDTKPEIASISN 300
SSKTV F FEVIS S DDSE R+G+ NPE V + VE D +PEI IS+
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTY--------RHGHDPNPEAVTVAVEADAEPEIGPISD 300
Query: 301 SAIAAAPPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKISPVSSPMV 360
S IAA P+ASK +RF+D E +VSNN LES + FTEE+DCVNLDPSF ISPVSSPM+
Sbjct: 301 SDIAAVTPEASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPMI 360
Query: 361 APLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDFFSSVNASESE 420
AP+DADP++ PYDPKTNYLSPRPQFLHY PNRRI R PDG RFE+ FS+
Sbjct: 361 APMDADPIITPYDPKTNYLSPRPQFLHYNPNRRINR-----PDG-RFEELFST------- 420
Query: 421 LSEETGSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQCPIEVKKSTKSQFS 480
SEET E QKESDE SSNESQ +EEE+E+E+ VSEQ P EVKKS+K S
Sbjct: 421 -SEETDCEDPQKESDEVSSNESQM--------KEEEKEEEVDVSEQGPTEVKKSSKPLLS 480
Query: 481 RIFKISSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAKMNFNGLVEKLE 540
RIFKISSLLLIL AC SICVV VHDP I +R +LLTM D SEI AK NFN LV KLE
Sbjct: 481 RIFKISSLLLILFTACLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLE 540
Query: 541 LWHANAISFVSDVVSNLRGSPPLIYLNQSDFFYKDASVDGQCLV-SHQTLWEEENNLNNV 600
+WHAN+ISF+SDVV N RG PPLI+LNQ++FFY D + D QCLV SHQ +WEEENNL N
Sbjct: 541 IWHANSISFISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNA 600
Query: 601 MEAMKEREIGIEPIEREDQIEEEEDESLQEIEAMKERETDIEPVEREAQNEEEEEEEESL 660
MEAMK+R E +++ +E++E QE EA+K +E I+ VERE+QN EE EE+S
Sbjct: 601 MEAMKDR-------EGQNKEGQEQEEDAQE-EAIKVKEIGIQTVERESQN--EEVEEQSF 660
Query: 661 QEIEAKANDPKDSE-ENDEASASASEESSLQEI---------------ETNDPEDTEEDD 720
QEIEA+ ND ++SE ENDEAS E SLQEI + + +DTE
Sbjct: 661 QEIEARTNDSENSEKENDEAS-----EESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMK 720
Query: 721 ETSANSASAFEESSLQEIE----------TSDPEDTEED-DETSANSASDKLDEEDAQEK 780
E + ES +E+E +D +D EE+ DE S S + ++EE QEK
Sbjct: 721 EREIGIETVERESQNEEVEEEPFQKTEAKANDQKDREEENDEASEESLLEIVEEESVQEK 780
Query: 781 ETGENYKEVSSSDEFETLDQSESEAATEKIKEDMNDSLQQSNTESQYQSPPASSPSAEFQ 840
T EN+K SSSD F+ Q E AAT + +E ++NTE QYQSPP SSP +E Q
Sbjct: 781 -TVENFKASSSSD-FKLHGQIEQAAATGETQE-------ETNTEFQYQSPPVSSPPSEHQ 840
Query: 841 SDVEENGGNIEVTGAATEHGISGDFTQN-MAVTSALLLGLSLVIPAGLVYARRAGSKRSK 900
SDVEE G V T GIS DFTQN A+ SA+LLGL L+IPAGL+YAR++GS+R+
Sbjct: 841 SDVEEENGGKIVDLIRTATGISRDFTQNTAAIISAILLGLFLIIPAGLIYARKSGSRRTT 900
Query: 901 PPMAAVKEQQ--PLLKEMKTNQTSQSPVEEEEAA---VDDMAGESCSSEMSS-FQYSSMR 931
A +EQQ PLLK+ KTNQ+ EEE+A DDMAGE CSSE SS FQYSS+R
Sbjct: 901 STAAIAEEQQEEPLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSETSSFFQYSSVR 960
BLAST of Sgr029912.1 vs. NCBI nr
Match:
XP_023551790.1 (uncharacterized protein LOC111809647 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 783.1 bits (2021), Expect = 2.7e-222
Identity = 580/1043 (55.61%), Postives = 684/1043 (65.58%), Query Frame = 0
Query: 1 MALPSNTSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANS 60
MALPSN SSSPSM+ GRT P SRNSEIGNP RSFS N F+KPSI + +S NP+TPANS
Sbjct: 1 MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANS 60
Query: 61 PSDY-PRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAASK 120
PSDY P+RNSVSRE +FTSRDNE+KENGKDQ+PK RVRSPTVGKS K+FMS TISAASK
Sbjct: 61 PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
Query: 121 IAVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPEAATAVQTDTNPEIVSISNSTV 180
IAVSPKKKILGD+NEPVRSSLSFSGMKSSS+NSVNP PEA+ A ++DTNP + IS
Sbjct: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLIS---- 180
Query: 181 AAPTPKSSKTVRFADFEVISGSNDDSE--------PQVESVLTEEP-------------- 240
PKS+KTVRF EVISGS +DSE P++ + +T+
Sbjct: 181 ---YPKSTKTVRFGGVEVISGSYEDSESTYRYDLNPELVAAVTDSKSGIVPITKSAIAEG 240
Query: 241 ---------------------------RNGYVTNPE-VAMVVETDTKPEIASISNSAIAA 300
R+G+ NPE V + VE + +PEI IS+S IAA
Sbjct: 241 SSKSSKTVTFGGFEVISDFCDDSESTYRHGHHPNPEAVTVAVEANAEPEIGPISDSDIAA 300
Query: 301 APPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKISPVSSPMVAPLDA 360
P+ASK +RF+DFE +VSNN LES + TEE+D VNLDPSF ISPVSSPM+A
Sbjct: 301 VTPEASKIMRFSDFE--AVSNNALESSVNSNLTEEVDSVNLDPSFNISPVSSPMIA---- 360
Query: 361 DPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDFFSSVNASESELSEET 420
P++ PYDPKTNYLSPRPQFLHY PNRRI R PDG RFE+ FSS SEET
Sbjct: 361 -PIITPYDPKTNYLSPRPQFLHYNPNRRINR-----PDG-RFEELFSS--------SEET 420
Query: 421 GSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQCPIEVKKSTKSQFSRIFKI 480
E QKESDE SSNESQ ++EEE+EE+E++VS Q P EVKKS+K SRIFKI
Sbjct: 421 DCEDPQKESDEVSSNESQ-------MKEEEKEEEEVNVSIQGPTEVKKSSKPLLSRIFKI 480
Query: 481 SSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAKMNFNGLVEKLELWHAN 540
SSLLLIL AC SICVV VHDP I +R +LLTM D SEI + AK NFN LV KLE+WHAN
Sbjct: 481 SSLLLILFTACLSICVVNVHDPTIFERSTLLTMGDQSEIFESAKTNFNVLVGKLEIWHAN 540
Query: 541 AISFVSDVVSNLRGSPPLIYLNQSDFFYKDASVDGQCLV-SHQTLWEEENNLNNVMEAMK 600
+ISF+SDVV N RG PPLI+LNQ++FFY D + D QCLV SHQ +WEEENNL N MEAMK
Sbjct: 541 SISFISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLINAMEAMK 600
Query: 601 EREI---GIEPIEREDQIEE--EEDESLQEIEAMKERETDIEPVEREAQNEEEEEEEESL 660
ER I EPI+RE Q +E E++E QE+EA+K RE IE ERE+ N EE EEES
Sbjct: 601 ERVIVDTFEEPIDREGQNKEGQEQEEDAQEVEAIKVREIGIETFERESHN--EEVEEESF 660
Query: 661 QEIEAKANDPKD-SEENDEASASASEESSLQEI------ETNDPEDTEEDDETSANSASA 720
QEIEA+ ND D EEN+EAS E SLQEI E + E E+ +E A
Sbjct: 661 QEIEARTNDSADIEEENNEAS-----EESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMK 720
Query: 721 FEESSLQEIETS-----------DPEDTEEDDETSANSASDKLDEEDAQEKETGENYKEV 780
E ++ +E + EE+DE S S + ++EE QEK T EN+K
Sbjct: 721 VREIGIETVERESQNEEVEEEPFQKDSEEENDEASEESLLEIVEEEYVQEK-TVENFKAS 780
Query: 781 SSSDEFETLDQSESEAATEKIKEDMNDSLQQSNTESQYQSPPASSPSAEFQSDVEENGGN 840
SSSD F+ D+ E AAT + +E ++NTE QYQSPP SSP +E QSDVEE G
Sbjct: 781 SSSD-FKLHDEIEKAAATGETQE-------ETNTEFQYQSPPVSSPPSEHQSDVEEENGG 840
Query: 841 IEVTGAATEHGISGDFTQN-MAVTSALLLGLSLVIPAGLVYARRAGSKRSKPPMAAVKEQ 900
V T GIS DFTQN A+ SA+LLGL L+IPAGL+YAR++ S+R+ A +EQ
Sbjct: 841 KIVDLIRTATGISRDFTQNTAAIISAILLGLFLIIPAGLIYARKSSSRRTTSTAAIAEEQ 900
Query: 901 Q--PLLKEMKTNQTSQSPVEEEEAAV-----DDMAGESCSSEMSS-FQYSSMR-GEAEAA 931
Q PLLK+ KTNQ+ EEEE A+ DDMAGE CSSE SS FQYSS+R GE EAA
Sbjct: 901 QEEPLLKDKKTNQSLVE--EEEEEALDDDDEDDMAGEFCSSETSSFFQYSSVREGETEAA 960
BLAST of Sgr029912.1 vs. ExPASy TrEMBL
Match:
A0A6J1D590 (uncharacterized protein LOC111017725 OS=Momordica charantia OX=3673 GN=LOC111017725 PE=4 SV=1)
HSP 1 Score: 851.7 bits (2199), Expect = 3.0e-243
Identity = 607/958 (63.36%), Postives = 699/958 (72.96%), Query Frame = 0
Query: 1 MALPSNTSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANS 60
MA+PSN SSSPSM+AGRTSPNSRNSE+GN RRSFSGN FT+PSIVANPRS NPVTPANS
Sbjct: 1 MAVPSNKSSSPSMVAGRTSPNSRNSEVGNAVRRSFSGNPFTRPSIVANPRSLNPVTPANS 60
Query: 61 PSDYPRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAASKI 120
PSDYPRRNSVSRE +FTSRDNEEKENGKDQ PKP+RVRSP+VGKS+KHFMSPTISAASKI
Sbjct: 61 PSDYPRRNSVSREILFTSRDNEEKENGKDQNPKPIRVRSPSVGKSSKHFMSPTISAASKI 120
Query: 121 AVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPE-AATAVQTDTNPEIVSISNSTV 180
AVSPKKKILGD+NE VRSSLSFSG KSSS+NSVNPNPE AATAV++DTNPEIV ISNS++
Sbjct: 121 AVSPKKKILGDRNESVRSSLSFSGTKSSSLNSVNPNPEAAATAVESDTNPEIVPISNSSI 180
Query: 181 AAPTPKSSKTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPEVAMV-VETDTKPE 240
AAPTPKSSKTVRFA FEVISGS DD+E Y TN EV V VETD KPE
Sbjct: 181 AAPTPKSSKTVRFAGFEVISGSYDDAESTYR----------YDTNTEVVTVAVETDLKPE 240
Query: 241 IASISNSAIAAAPPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKISP 300
IA IS +SKTVRF+D EVIS N ES KNIFTEELDCVNLDPSFKISP
Sbjct: 241 IAPIS----------SSKTVRFSDVEVISDLKNNFESADKNIFTEELDCVNLDPSFKISP 300
Query: 301 VSSPMVAPLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDFFSSV 360
VSSPMVAPLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIER G+R E+FFSSV
Sbjct: 301 VSSPMVAPLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIER-------GNRLEEFFSSV 360
Query: 361 NASESELSEETGSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQC-PIEVKK 420
NASESE +EET SE S KESDE+SSNES E+ EEEVEE++EE +HVSEQ IEVKK
Sbjct: 361 NASESEFAEETESENSLKESDESSSNES--EQGEEEVEEKDEE--RIHVSEQKEXIEVKK 420
Query: 421 STKSQFSRIFKISSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAKMNFN 480
S S +FKISSLL IL++ FSIC VV DPNI +R SLL +E PSEI +FAKMNFN
Sbjct: 421 S-----SGMFKISSLLFILSITFFSIC-AVVRDPNISERSSLLMVEHPSEIYEFAKMNFN 480
Query: 481 GLVEKLELWHANAISFVSDVVSNLR-GSPPLIYLNQSDFFYKDASVDGQCLVSHQTLWEE 540
LV KLE+WHA++IS + VVSN R G+P LIYLNQ++F YKD VDGQ LV+HQTLWE+
Sbjct: 481 VLVGKLEVWHADSISSIFHVVSNFRGGAPSLIYLNQTEFLYKD--VDGQSLVTHQTLWEK 540
Query: 541 ENNLNNVME-AMKEREIGIEPIEREDQIEEEEDESLQEIEAMKERETDIEPVEREAQNEE 600
EN LN + E A KER IE E ED EEE+ESLQEIEA+KE E
Sbjct: 541 ENFLNVIEEGATKERR--IEYAETEDHGVEEEEESLQEIEAIKEIEA-----------TN 600
Query: 601 EEEEEESLQEIEAKANDPKDSEENDEASASASEESSLQEIETNDPEDTEEDDETSANSA- 660
EEEEE+ LQEIE+K + P+ EENDE SA ++ E+ I+ D ++ E ++ + +SA
Sbjct: 601 EEEEEQLLQEIESKTSYPEVGEENDEISAKSASEN----IDEEDAQEKETEENYAVSSAD 660
Query: 661 ----SAFEESSLQEIETSDPEDTEEDDE---TSAN---------SASDKLDEEDAQEKET 720
E S+ +I+ D + E ++ +SA+ SASD++DE+D E E
Sbjct: 661 FEILDQIEPSASNKIDEDDVHENEIEENYEGSSADFEILDQIELSASDEIDEDDVHENER 720
Query: 721 GENYKEVSSSDEFETLDQSESEAATEKIKEDMNDSL--QQSNTESQYQSPPASSPSAEFQ 780
EN++ +SS +FETLDQ EA T + +E+ NDSL QQSNT P SP AE Q
Sbjct: 721 KENHE--ASSADFETLDQVGPEAETGETEEENNDSLLQQQSNT------APVFSPFAEPQ 780
Query: 781 SDVE-ENGGNIEVTGAATEHGISGDFTQNMAVTSALLLGLSLVIPAGLVYARRAGSKRSK 840
SD + NG NI++ GISGDFT NM ALL LSL+IPAG +YA+++G SK
Sbjct: 781 SDFQSSNGSNIKIL-----QGISGDFTHNMMF--ALL--LSLIIPAGFIYAKKSG---SK 840
Query: 841 PPMAAVKEQ-QPLLKEMKTNQTSQSPVEEEEAA----VDDMAGESCSSEMSSFQYSSMRG 900
P +AAV+EQ Q L+KE +T+ SP EEEEAA DDM GESCSSE SSFQYSSMRG
Sbjct: 841 PTIAAVEEQKQSLMKE--EEKTNHSPEEEEEAADDEHDDDMTGESCSSETSSFQYSSMRG 877
Query: 901 EAEAAKGSSEAHNRHNHGSRKTRKNSRRESLASSSLDEDSVSA--SPSYGSFTTYERI 927
A A K SEAH+ H+HG RK RKNSRRESLASSS DE S+SA SPSYGSFTTYE+I
Sbjct: 901 AAGAVKEPSEAHS-HSHG-RKKRKNSRRESLASSS-DEISISASPSPSYGSFTTYEKI 877
BLAST of Sgr029912.1 vs. ExPASy TrEMBL
Match:
A0A6J1JB72 (uncharacterized protein LOC111482876 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)
HSP 1 Score: 794.3 bits (2050), Expect = 5.6e-226
Identity = 583/1050 (55.52%), Postives = 687/1050 (65.43%), Query Frame = 0
Query: 1 MALPSNTSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANS 60
MALPSN SSSPSM+ GRTSP SRNSEI NP RSFS N F+KPSI + +S NP+TPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DY-PRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DY P+RNSVSRE +FTSRDNE+KENGKDQ+PK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPEAATAVQTDTNPE------- 180
SKIAVSPKKKILGD+NEPVRSSLSFSGMKSSS+NSVNP PEA+ A ++DTNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 ---------------------------------------------IVSISNSTVAAPTPK 240
IV I+ S +AA + K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSKTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPE-VAMVVETDTKPEIASISN 300
SSKTV F FEVIS S DDSE R+G+ NPE V + VE D +PEI IS+
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTY--------RHGHDPNPEAVTVAVEADAEPEIGPISD 300
Query: 301 SAIAAAPPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKISPVSSPMV 360
S IAA P+ASK +RF+D E +VSNN LES + FTEE+DCVNLDPSF ISPVSSPM+
Sbjct: 301 SDIAAVTPEASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPMI 360
Query: 361 APLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDFFSSVNASESE 420
AP+DADP++ PYDPKTNYLSPRPQFLHY PNRRI R PDG RFE+ FS+
Sbjct: 361 APMDADPIITPYDPKTNYLSPRPQFLHYNPNRRINR-----PDG-RFEELFST------- 420
Query: 421 LSEETGSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQCPIEVKKSTKSQFS 480
SEET E QKESDE SSNESQ +EEE+E+E+ VSEQ P EVKKS+K S
Sbjct: 421 -SEETDCEDPQKESDEVSSNESQM--------KEEEKEEEVDVSEQGPTEVKKSSKPLLS 480
Query: 481 RIFKISSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAKMNFNGLVEKLE 540
RIFKISSLLLIL AC SICVV VHDP I +R +LLTM D SEI AK NFN LV KLE
Sbjct: 481 RIFKISSLLLILFTACLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLE 540
Query: 541 LWHANAISFVSDVVSNLRGSPPLIYLNQSDFFYKDASVDGQCLV-SHQTLWEEENNLNNV 600
+WHAN+ISF+SDVV N RG PPLI+LNQ++FFY D + D QCLV SHQ +WEEENNL N
Sbjct: 541 IWHANSISFISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNA 600
Query: 601 MEAMKEREIGIEPIEREDQIEEEEDESLQEIEAMKERETDIEPVEREAQNEEEEEEEESL 660
MEAMK+R E +++ +E++E QE EA+K +E I+ VERE+QN EE EE+S
Sbjct: 601 MEAMKDR-------EGQNKEGQEQEEDAQE-EAIKVKEIGIQTVERESQN--EEVEEQSF 660
Query: 661 QEIEAKANDPKDSE-ENDEASASASEESSLQEI---------------ETNDPEDTEEDD 720
QEIEA+ ND ++SE ENDEAS E SLQEI + + +DTE
Sbjct: 661 QEIEARTNDSENSEKENDEAS-----EESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMK 720
Query: 721 ETSANSASAFEESSLQEIE----------TSDPEDTEED-DETSANSASDKLDEEDAQEK 780
E + ES +E+E +D +D EE+ DE S S + ++EE QEK
Sbjct: 721 EREIGIETVERESQNEEVEEEPFQKTEAKANDQKDREEENDEASEESLLEIVEEESVQEK 780
Query: 781 ETGENYKEVSSSDEFETLDQSESEAATEKIKEDMNDSLQQSNTESQYQSPPASSPSAEFQ 840
T EN+K SSSD F+ D+ E AATE+ +E ++NTE QYQSPP SSP +E Q
Sbjct: 781 -TVENFKASSSSD-FKLHDEIEQAAATEETQE-------ETNTEFQYQSPPVSSPPSEHQ 840
Query: 841 SDVEENGGNIEVTGAATEHGISGDFTQN-MAVTSALLLGLSLVIPAGLVYARRAGSKRSK 900
SDVEE G V T GIS DFTQN A+ SA+LLGL L+IPAGL+YAR++GS+R+
Sbjct: 841 SDVEEENGGKIVDLIRTATGISRDFTQNTAAIISAILLGLFLIIPAGLIYARKSGSRRTT 900
Query: 901 PPMAAVKEQQ--PLLKEMKTNQTSQSPVEEEEAA---VDDMAGESCSSEMSS-FQYSSMR 931
A +EQQ PLLK+ KTNQ+ EEE+A DDMAGE CSSE SS FQYSS+R
Sbjct: 901 STAAIAEEQQEEPLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSETSSFFQYSSVR 960
BLAST of Sgr029912.1 vs. ExPASy TrEMBL
Match:
A0A6J1J980 (uncharacterized protein LOC111482876 isoform X5 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)
HSP 1 Score: 790.0 bits (2039), Expect = 1.1e-224
Identity = 582/1050 (55.43%), Postives = 685/1050 (65.24%), Query Frame = 0
Query: 1 MALPSNTSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANS 60
MALPSN SSSPSM+ GRTSP SRNSEI NP RSFS N F+KPSI + +S NP+TPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DY-PRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DY P+RNSVSRE +FTSRDNE+KENGKDQ+PK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPEAATAVQTDTNPE------- 180
SKIAVSPKKKILGD+NEPVRSSLSFSGMKSSS+NSVNP PEA+ A ++DTNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 ---------------------------------------------IVSISNSTVAAPTPK 240
IV I+ S +AA + K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSKTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPE-VAMVVETDTKPEIASISN 300
SSKTV F FEVIS S DDSE R+G+ NPE V + VE D +PEI IS+
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTY--------RHGHDPNPEAVTVAVEADAEPEIGPISD 300
Query: 301 SAIAAAPPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKISPVSSPMV 360
S IAA P+ASK +RF+D E +VSNN LES + FTEE+DCVNLDPSF ISPVSSPM+
Sbjct: 301 SDIAAVTPEASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPMI 360
Query: 361 APLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDFFSSVNASESE 420
AP+DADP++ PYDPKTNYLSPRPQFLHY PNRRI R PDG RFE+ FS+
Sbjct: 361 APMDADPIITPYDPKTNYLSPRPQFLHYNPNRRINR-----PDG-RFEELFST------- 420
Query: 421 LSEETGSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQCPIEVKKSTKSQFS 480
SEET E QKESDE SSNESQ +EEE+E+E+ VSEQ P EVKKS+K S
Sbjct: 421 -SEETDCEDPQKESDEVSSNESQM--------KEEEKEEEVDVSEQGPTEVKKSSKPLLS 480
Query: 481 RIFKISSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAKMNFNGLVEKLE 540
RIFKISSLLLIL AC SICVV VHDP I +R +LLTM D SEI AK NFN LV KLE
Sbjct: 481 RIFKISSLLLILFTACLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLE 540
Query: 541 LWHANAISFVSDVVSNLRGSPPLIYLNQSDFFYKDASVDGQCLV-SHQTLWEEENNLNNV 600
+WHAN+ISF+SDVV N RG PPLI+LNQ++FFY D + D QCLV SHQ +WEEENNL N
Sbjct: 541 IWHANSISFISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNA 600
Query: 601 MEAMKEREIGIEPIEREDQIEEEEDESLQEIEAMKERETDIEPVEREAQNEEEEEEEESL 660
MEAMK+R E +++ +E++E QE EA+K +E I+ VERE+QN EE EE+S
Sbjct: 601 MEAMKDR-------EGQNKEGQEQEEDAQE-EAIKVKEIGIQTVERESQN--EEVEEQSF 660
Query: 661 QEIEAKANDPKDSE-ENDEASASASEESSLQEI---------------ETNDPEDTEEDD 720
QEIEA+ ND ++SE ENDEAS E SLQEI + + +DTE
Sbjct: 661 QEIEARTNDSENSEKENDEAS-----EESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMK 720
Query: 721 ETSANSASAFEESSLQEIE----------TSDPEDTEED-DETSANSASDKLDEEDAQEK 780
E + ES +E+E +D +D EE+ DE S S + ++EE QEK
Sbjct: 721 EREIGIETVERESQNEEVEEEPFQKTEAKANDQKDREEENDEASEESLLEIVEEESVQEK 780
Query: 781 ETGENYKEVSSSDEFETLDQSESEAATEKIKEDMNDSLQQSNTESQYQSPPASSPSAEFQ 840
T EN+K SSSD F+ Q E AAT + +E ++NTE QYQSPP SSP +E Q
Sbjct: 781 -TVENFKASSSSD-FKLHGQIEQAAATGETQE-------ETNTEFQYQSPPVSSPPSEHQ 840
Query: 841 SDVEENGGNIEVTGAATEHGISGDFTQN-MAVTSALLLGLSLVIPAGLVYARRAGSKRSK 900
SDVEE G V T GIS DFTQN A+ SA+LLGL L+IPAGL+YAR++GS+R+
Sbjct: 841 SDVEEENGGKIVDLIRTATGISRDFTQNTAAIISAILLGLFLIIPAGLIYARKSGSRRTT 900
Query: 901 PPMAAVKEQQ--PLLKEMKTNQTSQSPVEEEEAA---VDDMAGESCSSEMSS-FQYSSMR 931
A +EQQ PLLK+ KTNQ+ EEE+A DDMAGE CSSE SS FQYSS+R
Sbjct: 901 STAAIAEEQQEEPLLKDKKTNQSLVEEEEEEDALDDDDDDMAGEFCSSETSSFFQYSSVR 960
BLAST of Sgr029912.1 vs. ExPASy TrEMBL
Match:
A0A6J1J2S7 (uncharacterized protein LOC111482876 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)
HSP 1 Score: 783.1 bits (2021), Expect = 1.3e-222
Identity = 581/1080 (53.80%), Postives = 688/1080 (63.70%), Query Frame = 0
Query: 1 MALPSNTSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANS 60
MALPSN SSSPSM+ GRTSP SRNSEI NP RSFS N F+KPSI + +S NP+TPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PSDY-PRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAASK 120
PSDY P+RNSVSRE +FTSRDNE+KENGKDQ+PK RVRSPTVGKS K+FMS TISAASK
Sbjct: 61 PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
Query: 121 IAVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPEAATAVQTDTNPE--------- 180
IAVSPKKKILGD+NEPVRSSLSFSGMKSSS+NSVNP PEA+ A ++DTNP
Sbjct: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNPKS 180
Query: 181 -------------------------------------------IVSISNSTVAAPTPKSS 240
IV I+ S +AA + KSS
Sbjct: 181 TKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSKSS 240
Query: 241 KTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPE-VAMVVETDTKPEIASISNSA 300
KTV F FEVIS S DDSE R+G+ NPE V + VE D +PEI IS+S
Sbjct: 241 KTVTFGGFEVISDSYDDSESTY--------RHGHDPNPEAVTVAVEADAEPEIGPISDSD 300
Query: 301 IAAAPPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKISPVSSPMVAP 360
IAA P+ASK +RF+D E +VSNN LES + FTEE+DCVNLDPSF ISPVSSPM+AP
Sbjct: 301 IAAVTPEASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPMIAP 360
Query: 361 LDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDFFSSVNASESELS 420
+DADP++ PYDPKTNYLSPRPQFLHY PNRRI R PDG RFE+ FS+ S
Sbjct: 361 MDADPIITPYDPKTNYLSPRPQFLHYNPNRRINR-----PDG-RFEELFST--------S 420
Query: 421 EETGSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQCPIEVKKSTKSQFSRI 480
EET E QKESDE SSNESQ +EEE+E+E+ VSEQ P EVKKS+K SRI
Sbjct: 421 EETDCEDPQKESDEVSSNESQM--------KEEEKEEEVDVSEQGPTEVKKSSKPLLSRI 480
Query: 481 FKISSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAKMNFNGLVEKLELW 540
FKISSLLLIL AC SICVV VHDP I +R +LLTM D SEI AK NFN LV KLE+W
Sbjct: 481 FKISSLLLILFTACLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLEIW 540
Query: 541 HANAISFVSDVVSNLRGSPPLIYLNQSDFFYKDASVDGQCLV-SHQTLWEEENNLNNVME 600
HAN+ISF+SDVV N RG PPLI+LNQ++FFY D + D QCLV SHQ +WEEENNL N ME
Sbjct: 541 HANSISFISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAME 600
Query: 601 AMKEREIGIEPIEREDQIEEEEDESLQEIEAMKERETDIEPVEREAQNEEEEEEEESLQE 660
AMK+R E +++ +E++E QE EA+K +E I+ VERE+QN EE EE+S QE
Sbjct: 601 AMKDR-------EGQNKEGQEQEEDAQE-EAIKVKEIGIQTVERESQN--EEVEEQSFQE 660
Query: 661 IEAKANDPKDSE-ENDEASASASEESSLQEI---------------ETNDPEDTEEDDET 720
IEA+ ND ++SE ENDEAS E SLQEI + + +DTE E
Sbjct: 661 IEARTNDSENSEKENDEAS-----EESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKER 720
Query: 721 SANSASAFEESSLQEIE----------TSDPEDTEED-DETSANSASDKLDEEDAQEKET 780
+ ES +E+E +D +D EE+ DE S S + ++EE QEK T
Sbjct: 721 EIGIETVERESQNEEVEEEPFQKTEAKANDQKDREEENDEASEESLLEIVEEESVQEK-T 780
Query: 781 GENYKEVSSSDEFETLDQSESEAATEKIKEDM---------------------------- 840
EN+K SSSD F+ Q E AAT + ++
Sbjct: 781 VENFKASSSSD-FKLHGQIEQAAATGETHYEIEQAAATGETHYEIEQAAATGETHYEIEQ 840
Query: 841 ----NDSLQQSNTESQYQSPPASSPSAEFQSDVEENGGNIEVTGAATEHGISGDFTQN-M 900
++ +++NTE QYQSPP SSP +E QSDVEE G V T GIS DFTQN
Sbjct: 841 AAATEETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATGISRDFTQNTA 900
Query: 901 AVTSALLLGLSLVIPAGLVYARRAGSKRSKPPMAAVKEQQ--PLLKEMKTNQTSQSPVEE 931
A+ SA+LLGL L+IPAGL+YAR++GS+R+ A +EQQ PLLK+ KTNQ+ EE
Sbjct: 901 AIISAILLGLFLIIPAGLIYARKSGSRRTTSTAAIAEEQQEEPLLKDKKTNQSLVEEEEE 960
BLAST of Sgr029912.1 vs. ExPASy TrEMBL
Match:
A0A6J1JB65 (uncharacterized protein LOC111482876 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)
HSP 1 Score: 783.1 bits (2021), Expect = 1.3e-222
Identity = 581/1069 (54.35%), Postives = 688/1069 (64.36%), Query Frame = 0
Query: 1 MALPSNTSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANS 60
MALPSN SSSPSM+ GRTSP SRNSEI NP RSFS N F+KPSI + +S NP+TPAN+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DY-PRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAA 120
PS DY P+RNSVSRE +FTSRDNE+KENGKDQ+PK RVRSPTVGKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKIAVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPEAATAVQTDTNPE------- 180
SKIAVSPKKKILGD+NEPVRSSLSFSGMKSSS+NSVNP PEA+ A ++DTNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 ---------------------------------------------IVSISNSTVAAPTPK 240
IV I+ S +AA + K
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 SSKTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPE-VAMVVETDTKPEIASISN 300
SSKTV F FEVIS S DDSE R+G+ NPE V + VE D +PEI IS+
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTY--------RHGHDPNPEAVTVAVEADAEPEIGPISD 300
Query: 301 SAIAAAPPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKISPVSSPMV 360
S IAA P+ASK +RF+D E +VSNN LES + FTEE+DCVNLDPSF ISPVSSPM+
Sbjct: 301 SDIAAVTPEASKIMRFSDLE--AVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPMI 360
Query: 361 APLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDFFSSVNASESE 420
AP+DADP++ PYDPKTNYLSPRPQFLHY PNRRI R PDG RFE+ FS+
Sbjct: 361 APMDADPIITPYDPKTNYLSPRPQFLHYNPNRRINR-----PDG-RFEELFST------- 420
Query: 421 LSEETGSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQCPIEVKKSTKSQFS 480
SEET E QKESDE SSNESQ +EEE+E+E+ VSEQ P EVKKS+K S
Sbjct: 421 -SEETDCEDPQKESDEVSSNESQM--------KEEEKEEEVDVSEQGPTEVKKSSKPLLS 480
Query: 481 RIFKISSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAKMNFNGLVEKLE 540
RIFKISSLLLIL AC SICVV VHDP I +R +LLTM D SEI AK NFN LV KLE
Sbjct: 481 RIFKISSLLLILFTACLSICVVNVHDPTIFERSTLLTMGDQSEIFASAKTNFNVLVGKLE 540
Query: 541 LWHANAISFVSDVVSNLRGSPPLIYLNQSDFFYKDASVDGQCLV-SHQTLWEEENNLNNV 600
+WHAN+ISF+SDVV N RG PPLI+LNQ++FFY D + D QCLV SHQ +WEEENNL N
Sbjct: 541 IWHANSISFISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNA 600
Query: 601 MEAMKEREIGIEPIEREDQIEEEEDESLQEIEAMKERETDIEPVEREAQNEEEEEEEESL 660
MEAMK+R E +++ +E++E QE EA+K +E I+ VERE+QN EE EE+S
Sbjct: 601 MEAMKDR-------EGQNKEGQEQEEDAQE-EAIKVKEIGIQTVERESQN--EEVEEQSF 660
Query: 661 QEIEAKANDPKDSE-ENDEASASASEESSLQEI---------------ETNDPEDTEEDD 720
QEIEA+ ND ++SE ENDEAS E SLQEI + + +DTE
Sbjct: 661 QEIEARTNDSENSEKENDEAS-----EESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMK 720
Query: 721 ETSANSASAFEESSLQEIE----------TSDPEDTEED-DETSANSASDKLDEEDAQEK 780
E + ES +E+E +D +D EE+ DE S S + ++EE QEK
Sbjct: 721 EREIGIETVERESQNEEVEEEPFQKTEAKANDQKDREEENDEASEESLLEIVEEESVQEK 780
Query: 781 ETGENYKEVSSSDEFETLDQSESEAATEKIKEDM-------------------NDSLQQS 840
T EN+K SSSD F+ Q E AAT + ++ ++ +++
Sbjct: 781 -TVENFKASSSSD-FKLHGQIEQAAATGETHYEIEQAAATGETHYEIEQAAATEETQEET 840
Query: 841 NTESQYQSPPASSPSAEFQSDVEENGGNIEVTGAATEHGISGDFTQN-MAVTSALLLGLS 900
NTE QYQSPP SSP +E QSDVEE G V T GIS DFTQN A+ SA+LLGL
Sbjct: 841 NTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATGISRDFTQNTAAIISAILLGLF 900
Query: 901 LVIPAGLVYARRAGSKRSKPPMAAVKEQQ--PLLKEMKTNQTSQSPVEEEEAA---VDDM 931
L+IPAGL+YAR++GS+R+ A +EQQ PLLK+ KTNQ+ EEE+A DDM
Sbjct: 901 LIIPAGLIYARKSGSRRTTSTAAIAEEQQEEPLLKDKKTNQSLVEEEEEEDALDDDDDDM 960
BLAST of Sgr029912.1 vs. TAIR 10
Match:
AT1G16630.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G16270.1); Has 10587 Blast hits to 5736 proteins in 617 species: Archae - 88; Bacteria - 963; Metazoa - 3686; Fungi - 820; Plants - 541; Viruses - 438; Other Eukaryotes - 4051 (source: NCBI BLink). )
HSP 1 Score: 75.1 bits (183), Expect = 3.4e-13
Identity = 259/973 (26.62%), Postives = 392/973 (40.29%), Query Frame = 0
Query: 7 TSSSPSMIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPANSPSDYPR 66
+SSSPSM R +P RNSE G+ RRSF GN F+ +D R
Sbjct: 19 SSSSPSM-PSRPNPKQRNSETGDLMRRSFRGNPFS--------------------ADPSR 78
Query: 67 RNSVSRE-SIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAASKIAVSPK 126
RNS+ RE S ++E +N KDQ V+ PT K +KHFMSPTISA SKI SP+
Sbjct: 79 RNSIGRECSNRVEIGDKENQNDKDQIANV--VKGPT--KGSKHFMSPTISAVSKINPSPR 138
Query: 127 KKILGDKNEPVRSSLSFSGMKSSSINSVNPNPEAATAVQTDTNPEIVSISNSTVAAPTPK 186
KKIL DKNE RS D + V + +S
Sbjct: 139 KKILSDKNEVSRS--------------------------FDKSHHQVQVKSS-------- 198
Query: 187 SSKTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPEVAMVVETDTKPEIASISNS 246
V F+D I G + D V+ + +E +
Sbjct: 199 ----VSFSDVISIIGEDKD----VDQICIDETKQ-------------------------- 258
Query: 247 AIAAAPPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKISPV------ 306
+ S + +DF+ I LE + N + SFKISP+
Sbjct: 259 ----LREEESHDITVSDFDEI------LERKS-----------NDNSSFKISPLPPYVPC 318
Query: 307 SSPMVAPLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFED-FFSSV 366
+ P+ + DP++ PYDPK NYLSPRPQFLHY+PN +IE + + + E+ F S
Sbjct: 319 TFPVFESHEVDPVVAPYDPKKNYLSPRPQFLHYKPNPKIEH---RSDECKQLEELFISES 378
Query: 367 NASESELSEETGSEYSQKESDEASSN-----ESQKEEEEEEVE----------------- 426
++S+++LS E E Q+E + AS E Q+++ EE +E
Sbjct: 379 SSSDTDLSAER-EEEGQQEEEVASQEGVVAVEEQEDDGEERLEAAEEILDVDGEERLEAV 438
Query: 427 EEEEEEKELHVSEQCPIEV--KKSTKSQFSRIFKISSLLLILAVACFSICVVVVHDPNIL 486
E ++EE+E+ V E E + S +S+FS+ + +L L VA + +
Sbjct: 439 ESDDEEEEVVVGESIEEEETHQISKQSRFSKTSMLLGWILALGVAYLLLVSSTTFSQQTI 498
Query: 487 DRPSLLTMEDPSEINKFAKMNFNGLVEKLELWHANAISFVSDVVSNLR---GSPPLIYLN 546
EI A NF L KL +W ++ ++ +VS+LR GS P + N
Sbjct: 499 TDSPFYQFNISPEIIMSASENFEQLGAKLRMWAESSFVYLDKLVSSLREEEGSVPFQFHN 558
Query: 547 QSDFFYKDASVDGQCLVSHQTLWEEENNLNNVMEA--MKEREIGIEPIEREDQIEEEEDE 606
+ +D + + ++ +++ + E+ IE + Q EEE E
Sbjct: 559 LT-VLLEDKRLSDAVF--------QSTSVEIIVDGFIVDSLEVDIEEVNVGHQEPEEESE 618
Query: 607 SLQEIEAMKERETDIEPVEREAQNEEEEEEEESLQEIEAKANDPKDSEENDEASASASEE 666
+ EI +E V E NE E+E EE + N E E+
Sbjct: 619 NSGEI--------SLEAVYEEDDNEVEQENEE--------------GKVNLEIVDECDEQ 678
Query: 667 SSLQEIETNDPEDTEEDDETSANSASAFEESSLQEIETSDPEDTEEDDETSANSA-SDKL 726
+ ++ + E E+ + +E+ + E E+ EE+D+ + A SD
Sbjct: 679 AEIKIATDTEVNGGERYSESLSEEGHGGQETDVVE----GQEEYEENDQNNMEEAESDAQ 738
Query: 727 DEEDAQEKETGENYKEVSSSDEFETLDQSESEAATEKIKEDMNDSLQQSNTESQYQSPPA 786
+D Q N +E + ET+ + E + E SL S A
Sbjct: 739 LLDDVQSAAISSNQQEQTGVANVETVQEEEG------VGEIAGGSLSVSE--------EA 798
Query: 787 SSPSAEFQSDVEENGGNIEVTGAATEHGISGDFTQNMAVTSALLLGLSLVIPAGLVYARR 846
+ + EE G EV A I + + V + ++ + + AG + A+
Sbjct: 799 TDVEHDGNEVEEEESGFGEVVNDAGSEDILLSGQKKVLVLFSTMMVILAAVAAGFLLAK- 816
Query: 847 AGSKRSKPPM-------------AAVKEQQPLLKEMKTNQTSQSPVEEEEAAVDDMAGE- 906
K++KP M V E P+ ++ +S + EEEE DD E
Sbjct: 859 ---KKTKPVMLQHEDGEPTAISATKVVEHVPVENLIRERLSSLNFKEEEEEVGDDRKREV 816
Query: 907 -SCSSEMSSFQYSSMRGEAEAAKGSSEAHNRHNHGSRKTRKNSRRESLASSSLDEDSVSA 927
S SEM SF +S + + + H G + N ES+ASS+ E S+
Sbjct: 919 SSFPSEM-SFSFSKNK-PLHSCSNKKDDLKEHQSGGGGKKSNDSGESMASSA-SEYSI-G 816
BLAST of Sgr029912.1 vs. TAIR 10
Match:
AT2G16270.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G16630.1); Has 1844 Blast hits to 1256 proteins in 271 species: Archae - 6; Bacteria - 283; Metazoa - 434; Fungi - 153; Plants - 91; Viruses - 52; Other Eukaryotes - 825 (source: NCBI BLink). )
HSP 1 Score: 67.0 bits (162), Expect = 9.2e-11
Identity = 256/947 (27.03%), Postives = 378/947 (39.92%), Query Frame = 0
Query: 1 MALPSNTSSSPS-MIAGRTSPNSRNSEIGNPFRRSFSGNLFTKPSIVANPRSFNPVTPAN 60
MA P+N + S S I R +P RNSE G+P RRSF GN F S V N
Sbjct: 1 MASPTNKNPSFSPPIPNRPNPKPRNSEAGDPLRRSFGGNPFPANSKV------------N 60
Query: 61 SPSDYPRRNSVSRESIFTSRDNEEKENGKDQTPKPVRVRSPTVGKSTKHFMSPTISAASK 120
PSD RRNS + K+ KPV++ K +K+FMSPTISA SK
Sbjct: 61 IPSDLTRRNSFGGD--------------KENETKPVQL----TPKGSKNFMSPTISAVSK 120
Query: 121 IAVSPKKKILGDKNEPVRSSLSFSGMKSSSINSVNPNPEAATAVQTDTNPEIVSISNSTV 180
I SP+K++L DKNE RS G+ N N +
Sbjct: 121 INASPRKRVLSDKNEMSRSFSDVKGLILEDDNKRNHH----------------------- 180
Query: 181 AAPTPKSSKTVRFADFEVISGSNDDSEPQVESVLTEEPRNGYVTNPEVAMVVETDTKPEI 240
++ V F+D + + DD + VE
Sbjct: 181 -----RAKSCVSFSDV-LHTICIDDEKKFVE----------------------------- 240
Query: 241 ASISNSAIAAAPPKASKTVRFADFEVISVSNNRLESPAKNIFTEELDCVNLDPSFKIS-- 300
S + DF+ V N+ DP F+IS
Sbjct: 241 ---------------SHDMTVTDFDEKEVYENK-------------GITYSDPRFRISPR 300
Query: 301 ---PVSSPMVAPLDADPLMPPYDPKTNYLSPRPQFLHYRPNRRIERYLSKEPDGSRFEDF 360
P +SP A + D L+PPYDPK N+LSPRPQFLHY+PN RIE+ + + + E+
Sbjct: 301 PSVPYTSPEFAACEVDTLLPPYDPKKNFLSPRPQFLHYKPNPRIEK---RFDECKQLEEL 360
Query: 361 FSSVNAS-ESELSEETGSEYSQKESDEASSNESQKEEEEEEVEEEEEEEKELHVSEQCPI 420
F S ++S ++ELS E +E ++ + E EEE E+VE+ E E E V E
Sbjct: 361 FISESSSDDTELSVE-----ESEEQEKDGAEEVVVEEETEDVEQSEAESDEEMVCESVEE 420
Query: 421 EVKKSTKSQFSRIFKISSLLLILAVACFSICVVVVHDPNILDRPSLLTMEDPSEINKFAK 480
+ K SR FK L LA+ + V P L + S P EI +FAK
Sbjct: 421 TTSQVPKQSGSRKFKFLGWFLALALG--YLLVSATFSP--LMKSSFNEFHIPKEITEFAK 480
Query: 481 M-NFNGLVEKLELWHANAISFVSDVVSNL-RGSPPLIYLNQSDFFY--KDASVDGQCLVS 540
N + L +KL +++ ++ ++S L RG+ L + Y +D++V V
Sbjct: 481 ANNLDQLSDKLWTLTESSLVYMDKLISRLGRGNEEYSQLQFHNLTYTLEDSTVFKPTCV- 540
Query: 541 HQTLWEEENNLNNVMEAMKEREIGIEPIEREDQIEEE-EDESLQEIEAMKERETDIEPVE 600
EI EP++ + E ED S+ E E+ E
Sbjct: 541 ---------------------EIIQEPLQENSRSENSLEDGSVNEEESGAE--------- 600
Query: 601 REAQNEEEEEEEESLQEIEAKANDPKDSEEND-EASASASEESSLQEIETNDPEDTEEDD 660
+N E + + L E++ D E ND E + A E L E+ + ++E
Sbjct: 601 ---ENSEVVCQFDELAEVKPST----DIESNDGERNLKALFEDGL-ELNIEELRESEMSP 660
Query: 661 ETSANSASAFEESSLQEIETSDPEDTEEDDETSANSASDKLDEEDAQEKETGE----NYK 720
E + EE+ + I + P+ + S+ L E E+ GE +
Sbjct: 661 EEKLETEKKLEETESEAIYINQPDVEFAAINVHQHIESEILVAESGSEESFGEIGDLLHL 720
Query: 721 EVSSSDEFETLD-QSESEAATEKIKEDMNDSLQQSNTESQYQSPPASSPSAEFQSDVEEN 780
EV S ++ D +S SE +I + +D L
Sbjct: 721 EVGSYNDLAKGDAESGSEEGFGEIAAETSDDL---------------------------- 731
Query: 781 GGNIEVTGAATEHGISGDFTQNMAVTSALLLGLSLVIPAGLVYARRAGSKRSKPPMAAVK 840
+++V + + D T+ M V S+ +L L V A V+A++ +AA K
Sbjct: 781 --HLKVRSSNKAY---NDSTKLMIVLSSTVLVLLAV--ASFVFAKKT------KLVAATK 731
Query: 841 EQQPLLKEMKTNQTSQSPVEEEEAAVDDMAGESCSSEMSSFQYSSMRGEAEAAKGSSEAH 900
E+ + + + +E+ + E +SFQ S + +KG +
Sbjct: 841 PAPESNMELNLSHVPEENLVKEKLFSLNFEEEVDDKMSNSFQKKSSCHKEPQSKGGKK-- 731
Query: 901 NRHNHGSRKTRKNSRRESLASSSLDEDSVSASPSYGSFTTYERIPIK 930
N +N S K RRES+ASS+ E S+ S SYGSFTTYE+IPIK
Sbjct: 901 NNNNSSSSKL----RRESMASSA-SEYSI-GSFSYGSFTTYEKIPIK 731
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022149260.1 | 6.2e-243 | 63.36 | uncharacterized protein LOC111017725 [Momordica charantia] | [more] |
XP_022984664.1 | 1.2e-225 | 55.52 | uncharacterized protein LOC111482876 isoform X4 [Cucurbita maxima] | [more] |
XP_038903440.1 | 1.2e-225 | 58.54 | uncharacterized protein LOC120090026 [Benincasa hispida] | [more] |
XP_022984665.1 | 2.2e-224 | 55.43 | uncharacterized protein LOC111482876 isoform X5 [Cucurbita maxima] | [more] |
XP_023551790.1 | 2.7e-222 | 55.61 | uncharacterized protein LOC111809647 isoform X2 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1D590 | 3.0e-243 | 63.36 | uncharacterized protein LOC111017725 OS=Momordica charantia OX=3673 GN=LOC111017... | [more] |
A0A6J1JB72 | 5.6e-226 | 55.52 | uncharacterized protein LOC111482876 isoform X4 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1J980 | 1.1e-224 | 55.43 | uncharacterized protein LOC111482876 isoform X5 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1J2S7 | 1.3e-222 | 53.80 | uncharacterized protein LOC111482876 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1JB65 | 1.3e-222 | 54.35 | uncharacterized protein LOC111482876 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
Match Name | E-value | Identity | Description | |
AT1G16630.1 | 3.4e-13 | 26.62 | unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc... | [more] |
AT2G16270.1 | 9.2e-11 | 27.03 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |